I am having rampant failures with the newest NVIDIA drivers

It seems that a lot of GPU problems revolve around specific versions of drivers. Though NVidia has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

Turbo_T
Posts: 26
Joined: Mon Mar 11, 2013 1:46 am

Post by Turbo_T »

My EVGA 660 Ti is shutting down from rapid-fire failures on project 8018. I loaded the latest drivers, 332.21, but they don't seem to help at all. The system runs fine 24/7, but lately the GPU (stock settings, no personal OC) keeps getting the error "GPU 1 failed to complete a project 8018 WU (unstable_machine)" until 10 WUs fail and it shuts down for 4 hours. The failures occur within 2 minutes of starting the new WU, and they are all on the same project, 8018. Help! I am a dedicated folder and have 2 GPUs folding in this particular machine. The EVGA GTX 480 FTW has no problems with the WU, but the 660 Ti is having fits.

Thanks,

Turbo_T
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Post by PantherX »

Welcome to the F@H Forum Turbo_T,

Is your GPU operating within normal temperatures? Is there dust build-up in your system? If your GPU has a factory overclock, you could try running it at the Nvidia stock frequencies.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Kjetil
Posts: 178
Joined: Sat Apr 14, 2012 5:56 pm
Location: Stavanger Norway

Post by Kjetil »

Use driver 327.23 on 6xx cards. I have a 660 Ti, and its PPD on p8900 is 68K.

Post by PantherX »

Welcome to the F@H Forum Kjetil,

That observation applies only to FahCore_17 WUs. Project 8018, however, uses FahCore_15, and I haven't read any reports on whether its performance is the same or lower with the newer drivers. Moreover, the latest WHQL drivers only lower the performance of FahCore_17; they don't cause the WU to error out.

Post by Turbo_T »

My GPU temperatures stay in the mid-60s, so that's not a problem. I do clean the dust out regularly. The card is an SC model, so I may try reducing the clocks to see if that helps. This only happens with project 8018; I have no errors with any other WUs.
bollix47
Posts: 2941
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Post by bollix47 »

If you're going to try reducing the clocks, I suggest you first reduce just the memory clock to stock and try again. I've had some success doing this while leaving the core clock at its o/c. Memory speed makes very little difference to folding, but it can cause problems with some overclocked setups.

Post by Turbo_T »

OK, I'll try that first. Thanks.

Post by Turbo_T »

I have dropped my memory clocks by a measly 50 MHz, and the card has completed 37% of the WU with no failures, so we may have got it. Thanks. I'll confirm after a couple of WUs process correctly. I am curious about performance, though.

My system config is: ASUS P6X58D motherboard with an Intel 980X CPU; 3x4GB of RAM in the primary channel, B slots vacant; an EVGA GTX 480 FTW (liquid cooled) in the primary PCIe slot and the EVGA 650 Ti in the second PCIe slot; one Kingston 300 series 126GB SSD boot drive; 1x 1TB Western Digital Black edition HDD; 1x 2TB storage drive; and a Swiftech 3x120 radiator with high-airflow fans on a custom water loop (that I made) cooling the CPU and the GTX 480. The 650 Ti is air cooled.

I have the CPU 24/7 stable at 4.0 GHz with no PCI overclock, and memory running at a hair over the stock 1600 setting (1642, I think). My CPU temps never exceed 65 and average 57 while folding on 10 of the 12 cores. I run both GPUs as well; over the last 48 hours the 480's max temp was 68C and the 660's was 69. My temps are never really a problem, and I certainly never get into the range where you would see any throttling.

But here's the catch: my CPU runs in SMP mode at 25-29K PPD, and the two cards end up running identical WUs at about 15K PPD. That seems way low to me considering the numbers Kjetil posted above. I had a short period back in October when both cards seemed to be getting some higher-point WUs and I was pulling 29-30K PPD with either of them too, but that didn't last long, and I have not been able to figure out why it stopped. There have been no significant configuration changes in the last year. The system was built in October of 2010 and has been in continuous use since then. I can perform all my daily functions on the last 2 CPU cores I don't have folding, and I only shut down the primary GPU when I am playing a particularly graphics-intensive game on the weekends. Do you have any idea what I should expect to get with this hardware?
n_w95482
Posts: 66
Joined: Tue May 01, 2012 12:46 am
Hardware configuration: CPU: Ryzen 7 5800X3D

GPU: Radeon RX 6700 XT, Radeon RX 6900 XT
Location: California

Post by n_w95482 »

If the cards are working on core 15 WUs, the PPD will be lower (around what you mentioned). With core 17, it'll be higher.

As Kjetil mentioned, switch to the 327.23 driver. Anything newer than that will cause your 650 to run much slower than normal when working on core 17 WUs. Using FAHBench as an example, I had a performance loss of 55% with explicit single precision, which correlated with the huge loss in PPD that I experienced.

Right now, demand for core 17 WUs is greater than the supply of said WUs, so you'll see the cards bounce back and forth between core 15 and 17, depending on what's available at the time.
Folding since December 2003. In memory of my mother, who lost her battle with cancer.


Post by Turbo_T »

All I see is Core 15 on my system; I don't even have a core 17 loaded. Why wouldn't the system pick up a core 17 WU when one is available? Is there a setting that activates that option? Thanks for all the help and information.
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Post by P5-133XL »

No, there is no setting that guarantees Core_17. You get whatever the work servers give you. The priorities for individual projects are set by PG. The best you can do is fiddle with the client-type, but that doesn't really get you what you want. All it does is change the set of projects available to you to a more or less risky set (more or less tested, and thereby more or less likely to include a bad WU).

That being said, Core_15 is relatively old, and its projects are well tested and mostly in general release. Core_17, however, is relatively new and thereby less tested and more risky. So if you want to try to chase Core_17, you can change client-type to something else (beta, advanced, or unset for general release).
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Post by 7im »

P5-133XL wrote: No, there is no setting that guarantees Core_17. You get whatever the work servers give you. The priorities for individual projects are set by PG. The best you can do is fiddle with the client-type, but that doesn't really get you what you want. All it does is change the set of projects available to you to a more or less risky set (more or less tested, and thereby more or less likely to include a bad WU).

That being said, Core_15 is relatively old, and its projects are well tested and mostly in general release. Core_17, however, is relatively new and thereby less tested and more risky. So if you want to try to chase Core_17, you can change client-type to something else (beta, advanced, or unset for general release).

But unless you are a member of the beta team, there is no support for using the beta setting.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.

Post by P5-133XL »

So true

Post by Turbo_T »

I have dialed everything back to OEM settings and continue to have regular failures on my slot 00 primary GPU. It is an EVGA GTX 480 Hydro Copper GPU that has been folding for 3 years with few, if any, problems. I have just increased the log file setting to 4, but the log I am pasting in is from the last output while still on setting 3. This may need to move to the GPU forum, as I am no longer certain the drivers are the problem. I am using the suggested 327.23 drivers and the GPU fails soon after starting, but the system does not BSOD; it just fails in the F@H control. I have had no system instability during games, or in previous folding when I used the V2tracker console to run my folding. That console doesn't support core 17, however, so I changed recently. The following is the log from yesterday.

Code:

*********************** Log Started 2014-02-15T06:10:39Z ***********************
06:10:40:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
06:10:41:WU02:FS00:News: Welcome to Folding@Home
06:10:41:WU02:FS00:Assigned to work server 171.64.65.69
06:10:41:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:GF100 [GeForce GTX 480] from 171.64.65.69
06:10:41:WU02:FS00:Connecting to 171.64.65.69:8080
06:10:41:WU02:FS00:Downloading 4.17MiB
06:10:44:WU02:FS00:Download complete
06:10:44:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8900 run:649 clone:4 gen:83 core:0x17 unit:0x00000089028c126651a6b710bf45e11f
06:10:44:WU02:FS00:Starting
06:10:44:WU02:FS00:Running FahCore: "E:\Stanford FAH\FAHClient/FAHCoreWrapper.exe" "E:/Stanford FAH/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe" -dir 02 -suffix 01 -version 703 -lifeline 6268 -checkpoint 30 -gpu 0 -gpu-vendor nvidia
06:10:44:WU02:FS00:Started FahCore on PID 6380
06:10:44:WU02:FS00:Core PID:6448
06:10:44:WU02:FS00:FahCore 0x17 started
06:10:45:WU02:FS00:0x17:*********************** Log Started 2014-02-15T06:10:44Z ***********************
06:10:45:WU02:FS00:0x17:Project: 8900 (Run 649, Clone 4, Gen 83)
06:10:45:WU02:FS00:0x17:Unit: 0x00000089028c126651a6b710bf45e11f
06:10:45:WU02:FS00:0x17:CPU: 0x00000000000000000000000000000000
06:10:45:WU02:FS00:0x17:Machine: 0
06:10:45:WU02:FS00:0x17:Reading tar file state.xml
06:10:45:WU02:FS00:0x17:Reading tar file system.xml
06:10:45:WU02:FS00:0x17:Reading tar file integrator.xml
06:10:45:WU02:FS00:0x17:Reading tar file core.xml
06:10:45:WU02:FS00:0x17:Digital signatures verified
06:10:45:WU02:FS00:0x17:Folding@home GPU core17
06:10:45:WU02:FS00:0x17:Version 0.0.52
06:14:04:WU02:FS00:0x17:Completed 0 out of 2500000 steps (0%)
06:14:04:WU02:FS00:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
06:34:39:WU02:FS00:0x17:Completed 25000 out of 2500000 steps (1%)
06:46:32:WU02:FS00:0x17:ERROR:exception: First periodic box vector must be parallel to x.
06:46:32:WU02:FS00:0x17:Saving result file logfile_01.txt
06:46:32:WU02:FS00:0x17:Saving result file log.txt
06:46:32:WU02:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
06:46:33:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
06:46:33:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:8900 run:649 clone:4 gen:83 core:0x17 unit:0x00000089028c126651a6b710bf45e11f
06:46:33:WU02:FS00:Uploading 2.47KiB to 171.64.65.69
06:46:33:WU02:FS00:Connecting to 171.64.65.69:8080
06:46:33:WU03:FS00:Connecting to assign-GPU.stanford.edu:80
06:46:33:WU02:FS00:Upload complete
06:46:33:WU02:FS00:Server responded WORK_ACK (400)
06:46:34:WU02:FS00:Cleaning up
06:46:34:WU03:FS00:News: Welcome to Folding@Home
06:46:34:WU03:FS00:Assigned to work server 171.64.65.69
06:46:34:WU03:FS00:Requesting new work unit for slot 00: READY gpu:0:GF100 [GeForce GTX 480] from 171.64.65.69
06:46:34:WU03:FS00:Connecting to 171.64.65.69:8080
06:46:35:WU03:FS00:Downloading 4.17MiB
06:46:37:WU03:FS00:Download complete
06:46:38:WU03:FS00:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:8900 run:627 clone:1 gen:286 core:0x17 unit:0x00000174028c126651a6b21720aa8f18
06:46:38:WU03:FS00:Starting
06:46:38:WU03:FS00:Running FahCore: "E:\Stanford FAH\FAHClient/FAHCoreWrapper.exe" "E:/Stanford FAH/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe" -dir 03 -suffix 01 -version 703 -lifeline 6268 -checkpoint 30 -gpu 0 -gpu-vendor nvidia
06:46:38:WU03:FS00:Started FahCore on PID 6792
06:46:38:WU03:FS00:Core PID:6756
06:46:38:WU03:FS00:FahCore 0x17 started
06:46:38:WU03:FS00:0x17:*********************** Log Started 2014-02-15T06:46:38Z ***********************
06:46:38:WU03:FS00:0x17:Project: 8900 (Run 627, Clone 1, Gen 286)
06:46:38:WU03:FS00:0x17:Unit: 0x00000174028c126651a6b21720aa8f18
06:46:38:WU03:FS00:0x17:CPU: 0x00000000000000000000000000000000
06:46:38:WU03:FS00:0x17:Machine: 0
06:46:38:WU03:FS00:0x17:Reading tar file state.xml
06:46:39:WU03:FS00:0x17:Reading tar file system.xml
06:46:39:WU03:FS00:0x17:Reading tar file integrator.xml
06:46:39:WU03:FS00:0x17:Reading tar file core.xml
06:46:39:WU03:FS00:0x17:Digital signatures verified
06:46:39:WU03:FS00:0x17:Folding@home GPU core17
06:46:39:WU03:FS00:0x17:Version 0.0.52
06:49:44:WU03:FS00:0x17:Completed 0 out of 2500000 steps (0%)
06:49:44:WU03:FS00:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
07:06:40:WU03:FS00:0x17:ERROR:exception: The periodic box size has decreased to less than twice the nonbonded cutoff.
07:06:40:WU03:FS00:0x17:Saving result file logfile_01.txt
07:06:40:WU03:FS00:0x17:Saving result file log.txt
07:06:40:WU03:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
07:06:41:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
07:06:41:WU03:FS00:Sending unit results: id:03 state:SEND error:FAULTY project:8900 run:627 clone:1 gen:286 core:0x17 unit:0x00000174028c126651a6b21720aa8f18
07:06:41:WU03:FS00:Uploading 2.48KiB to 171.64.65.69
07:06:41:WU03:FS00:Connecting to 171.64.65.69:8080
07:06:41:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
07:06:41:WU03:FS00:Upload complete
07:06:41:WU03:FS00:Server responded WORK_ACK (400)
07:06:41:WU03:FS00:Cleaning up
07:06:42:WU02:FS00:News: Welcome to Folding@Home
07:06:42:WU02:FS00:Assigned to work server 171.64.65.69
07:06:42:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:GF100 [GeForce GTX 480] from 171.64.65.69
07:06:42:WU02:FS00:Connecting to 171.64.65.69:8080
07:06:43:WU02:FS00:Downloading 4.17MiB
07:06:45:WU02:FS00:Download complete
07:06:46:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8900 run:446 clone:3 gen:67 core:0x17 unit:0x0000006e028c126651a689e60b6e5447
07:06:46:WU02:FS00:Starting
07:06:46:WU02:FS00:Running FahCore: "E:\Stanford FAH\FAHClient/FAHCoreWrapper.exe" "E:/Stanford FAH/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe" -dir 02 -suffix 01 -version 703 -lifeline 6268 -checkpoint 30 -gpu 0 -gpu-vendor nvidia
07:06:46:WU02:FS00:Started FahCore on PID 5880
07:06:46:WU02:FS00:Core PID:5532
07:06:46:WU02:FS00:FahCore 0x17 started
07:06:46:WU02:FS00:0x17:*********************** Log Started 2014-02-15T07:06:46Z ***********************
07:06:46:WU02:FS00:0x17:Project: 8900 (Run 446, Clone 3, Gen 67)
07:06:46:WU02:FS00:0x17:Unit: 0x0000006e028c126651a689e60b6e5447
07:06:46:WU02:FS00:0x17:CPU: 0x00000000000000000000000000000000
07:06:46:WU02:FS00:0x17:Machine: 0
07:06:46:WU02:FS00:0x17:Reading tar file state.xml
07:06:47:WU02:FS00:0x17:Reading tar file system.xml
07:06:48:WU02:FS00:0x17:Reading tar file integrator.xml
07:06:48:WU02:FS00:0x17:Reading tar file core.xml
07:06:48:WU02:FS00:0x17:Digital signatures verified
07:06:48:WU02:FS00:0x17:Folding@home GPU core17
07:06:48:WU02:FS00:0x17:Version 0.0.52
07:09:54:WU02:FS00:0x17:Completed 0 out of 2500000 steps (0%)
07:09:54:WU02:FS00:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
07:30:22:WU02:FS00:0x17:Completed 25000 out of 2500000 steps (1%)
07:36:44:WU02:FS00:0x17:ERROR:exception: The periodic box size has decreased to less than twice the nonbonded cutoff.
07:36:44:WU02:FS00:0x17:Saving result file logfile_01.txt
07:36:44:WU02:FS00:0x17:Saving result file log.txt
07:36:44:WU02:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
07:36:45:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
07:36:45:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:8900 run:446 clone:3 gen:67 core:0x17 unit:0x0000006e028c126651a689e60b6e5447
07:36:45:WU02:FS00:Uploading 2.50KiB to 171.64.65.69
07:36:45:WU02:FS00:Connecting to 171.64.65.69:8080
07:36:45:WU03:FS00:Connecting to assign-GPU.stanford.edu:80
07:36:45:WU02:FS00:Upload complete
07:36:45:WU02:FS00:Server responded WORK_ACK (400)
07:36:45:WU02:FS00:Cleaning up
07:36:46:WU03:FS00:News: Welcome to Folding@Home
07:36:46:WU03:FS00:Assigned to work server 171.64.65.69
07:36:46:WU03:FS00:Requesting new work unit for slot 00: READY gpu:0:GF100 [GeForce GTX 480] from 171.64.65.69
07:36:46:WU03:FS00:Connecting to 171.64.65.69:8080
07:36:46:WU03:FS00:Downloading 4.18MiB
07:36:49:WU03:FS00:Download complete
07:36:49:WU03:FS00:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:8900 run:294 clone:3 gen:43 core:0x17 unit:0x00000040028c126651a6680f910665fa
07:36:49:WU03:FS00:Starting
07:36:49:WU03:FS00:Running FahCore: "E:\Stanford FAH\FAHClient/FAHCoreWrapper.exe" "E:/Stanford FAH/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe" -dir 03 -suffix 01 -version 703 -lifeline 6268 -checkpoint 30 -gpu 0 -gpu-vendor nvidia
07:36:49:WU03:FS00:Started FahCore on PID 5456
07:36:49:WU03:FS00:Core PID:5504
07:36:49:WU03:FS00:FahCore 0x17 started
07:36:50:WU03:FS00:0x17:*********************** Log Started 2014-02-15T07:36:49Z ***********************
07:36:50:WU03:FS00:0x17:Project: 8900 (Run 294, Clone 3, Gen 43)
07:36:50:WU03:FS00:0x17:Unit: 0x00000040028c126651a6680f910665fa
07:36:50:WU03:FS00:0x17:CPU: 0x00000000000000000000000000000000
07:36:50:WU03:FS00:0x17:Machine: 0
07:36:50:WU03:FS00:0x17:Reading tar file state.xml
07:36:51:WU03:FS00:0x17:Reading tar file system.xml
07:36:51:WU03:FS00:0x17:Reading tar file integrator.xml
07:36:51:WU03:FS00:0x17:Reading tar file core.xml
07:36:51:WU03:FS00:0x17:Digital signatures verified
07:36:51:WU03:FS00:0x17:Folding@home GPU core17
07:36:51:WU03:FS00:0x17:Version 0.0.52
07:39:51:WU03:FS00:0x17:Completed 0 out of 2500000 steps (0%)
07:39:51:WU03:FS00:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
08:00:23:WU03:FS00:0x17:Completed 25000 out of 2500000 steps (1%)
08:01:33:WU03:FS00:0x17:ERROR:exception: The periodic box size has decreased to less than twice the nonbonded cutoff.
08:01:33:WU03:FS00:0x17:Saving result file logfile_01.txt
08:01:33:WU03:FS00:0x17:Saving result file log.txt
08:01:33:WU03:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
08:01:34:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
08:01:34:WU03:FS00:Sending unit results: id:03 state:SEND error:FAULTY project:8900 run:294 clone:3 gen:43 core:0x17 unit:0x00000040028c126651a6680f910665fa
08:01:34:WU03:FS00:Uploading 2.50KiB to 171.64.65.69
08:01:34:WU03:FS00:Connecting to 171.64.65.69:8080
08:01:34:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
08:01:34:WU03:FS00:Upload complete
08:01:34:WU03:FS00:Server responded WORK_ACK (400)
08:01:34:WU03:FS00:Cleaning up
08:01:35:WU02:FS00:News: Welcome to Folding@Home
08:01:35:WU02:FS00:Assigned to work server 171.64.65.69
08:01:35:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:GF100 [GeForce GTX 480] from 171.64.65.69
08:01:35:WU02:FS00:Connecting to 171.64.65.69:8080
08:01:35:WU02:FS00:Downloading 4.18MiB
08:01:38:WU02:FS00:Download complete
08:01:38:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8900 run:302 clone:0 gen:296 core:0x17 unit:0x0000017e028c126651a669c3592a8a93
08:01:38:WU02:FS00:Starting
08:01:38:WU02:FS00:Running FahCore: "E:\Stanford FAH\FAHClient/FAHCoreWrapper.exe" "E:/Stanford FAH/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe" -dir 02 -suffix 01 -version 703 -lifeline 6268 -checkpoint 30 -gpu 0 -gpu-vendor nvidia
08:01:38:WU02:FS00:Started FahCore on PID 4656
08:01:38:WU02:FS00:Core PID:5552
08:01:38:WU02:FS00:FahCore 0x17 started
08:01:39:WU02:FS00:0x17:*********************** Log Started 2014-02-15T08:01:38Z ***********************
08:01:39:WU02:FS00:0x17:Project: 8900 (Run 302, Clone 0, Gen 296)
08:01:39:WU02:FS00:0x17:Unit: 0x0000017e028c126651a669c3592a8a93
08:01:39:WU02:FS00:0x17:CPU: 0x00000000000000000000000000000000
08:01:39:WU02:FS00:0x17:Machine: 0
08:01:39:WU02:FS00:0x17:Reading tar file state.xml
08:01:39:WU02:FS00:0x17:Reading tar file system.xml
08:01:40:WU02:FS00:0x17:Reading tar file integrator.xml
08:01:40:WU02:FS00:0x17:Reading tar file core.xml
08:01:40:WU02:FS00:0x17:Digital signatures verified
08:01:40:WU02:FS00:0x17:Folding@home GPU core17
08:01:40:WU02:FS00:0x17:Version 0.0.52
08:04:45:WU02:FS00:0x17:Completed 0 out of 2500000 steps (0%)
08:04:45:WU02:FS00:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
08:25:12:WU02:FS00:0x17:Completed 25000 out of 2500000 steps (1%)
08:46:40:WU02:FS00:0x17:Completed 50000 out of 2500000 steps (2%)
08:46:41:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
08:46:44:WU02:FS00:0x17:ERROR:exception: First periodic box vector must be parallel to x.
08:46:44:WU02:FS00:0x17:Saving result file logfile_01.txt
08:46:44:WU02:FS00:0x17:Saving result file log.txt
08:46:44:WU02:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
08:46:47:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
08:46:48:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:8900 run:302 clone:0 gen:296 core:0x17 unit:0x0000017e028c126651a669c3592a8a93
08:46:48:WU02:FS00:Uploading 2.54KiB to 171.64.65.69
08:46:48:WU02:FS00:Connecting to 171.64.65.69:8080
08:46:48:WU02:FS00:Upload complete
08:46:48:WU02:FS00:Server responded WORK_ACK (400)
08:46:48:WU02:FS00:Cleaning up
******************************* Date: 2014-02-15 *******************************
14:23:06:FS00:Paused
14:24:27:FS00:Unpaused
14:24:27:FS00:Finishing
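
For anyone wading through a log like this, a short script can tally the failures instead of reading them line by line. The following is a minimal Python sketch, not part of the FAH client: `summarize_log`, `RESULT_RE`, and `EXCEPTION_RE` are names made up here, and the patterns assume the V7 line layout seen in the log above.

```python
import re
from collections import Counter

# Matches the client's "Sending unit results" lines, e.g.
# ...:Sending unit results: id:02 state:SEND error:FAULTY project:8900 run:649 ...
RESULT_RE = re.compile(
    r"Sending unit results: .*?error:(?P<error>\w+) project:(?P<project>\d+) "
    r"run:(?P<run>\d+) clone:(?P<clone>\d+) gen:(?P<gen>\d+) "
    r"core:(?P<core>0x[0-9a-fA-F]+)"
)
# Matches exceptions reported by the core, e.g.
# ...:0x17:ERROR:exception: First periodic box vector must be parallel to x.
EXCEPTION_RE = re.compile(r"ERROR:exception: (?P<message>.+)$")


def summarize_log(lines):
    """Count returned results by (project, core, error) and exceptions by message."""
    results = Counter()
    exceptions = Counter()
    for line in lines:
        line = line.rstrip()
        match = RESULT_RE.search(line)
        if match:
            key = (int(match["project"]), match["core"], match["error"])
            results[key] += 1
            continue
        match = EXCEPTION_RE.search(line)
        if match:
            exceptions[match["message"]] += 1
    return results, exceptions


# Four lines copied from the log above, used as a small sample.
SAMPLE = [
    "06:46:32:WU02:FS00:0x17:ERROR:exception: First periodic box vector must be parallel to x.",
    "06:46:33:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:8900 run:649 clone:4 gen:83 core:0x17 unit:0x00000089028c126651a6b710bf45e11f",
    "07:06:40:WU03:FS00:0x17:ERROR:exception: The periodic box size has decreased to less than twice the nonbonded cutoff.",
    "07:06:41:WU03:FS00:Sending unit results: id:03 state:SEND error:FAULTY project:8900 run:627 clone:1 gen:286 core:0x17 unit:0x00000174028c126651a6b21720aa8f18",
]

results, exceptions = summarize_log(SAMPLE)
for (project, core, error), count in results.items():
    print(f"project {project} core {core}: {count} x {error}")
for message, count in exceptions.items():
    print(f"{count} x {message}")
```

To run it on a real log, pass an open file instead of `SAMPLE`. Fed the full log above, it should count five FAULTY results, all for project 8900 on core 0x17, which makes it easier to see that every failure here is a Core_17 WU on the GTX 480.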

Post by PantherX »

I haven't come across these errors before:
06:46:32:WU02:FS00:0x17:ERROR:exception: First periodic box vector must be parallel to x.
07:06:40:WU03:FS00:0x17:ERROR:exception: The periodic box size has decreased to less than twice the nonbonded cutoff.

I assume that it could successfully fold FahCore_15 WUs and is now throwing errors only on FahCore_17 WUs?
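
Both messages read like sanity checks on the simulation's periodic box, and as far as I know the wording matches exception text from OpenMM, the library Core_17 is built on. The Python below is only a rough illustration of what such checks look like. It is not the core's code; `check_periodic_box` and the example numbers are hypothetical.

```python
import math


def check_periodic_box(a, b, c, cutoff):
    """Return the problems found for box vectors a, b, c and a nonbonded cutoff.

    Hypothetical helper for illustration only. All lengths share one unit.
    """
    problems = []
    # The first box vector is expected to lie along x, i.e. (ax, 0, 0).
    # A NaN component also fails this test, because NaN != 0.0 is True.
    if a[1] != 0.0 or a[2] != 0.0:
        problems.append("First periodic box vector must be parallel to x.")
    # Each box edge must be at least twice the cutoff, so that a particle
    # cannot be within the cutoff of two periodic images of the same neighbour.
    # Written as "not >=" so that a NaN edge counts as a failure too.
    edges = (a[0], b[1], c[2])
    if any(not (edge >= 2.0 * cutoff) for edge in edges):
        problems.append(
            "The periodic box size has decreased to less than twice the nonbonded cutoff."
        )
    return problems


# A healthy box, a box that has shrunk too far, and a box full of NaNs.
print(check_periodic_box((4.0, 0.0, 0.0), (0.0, 4.0, 0.0), (0.0, 0.0, 4.0), 1.0))
print(check_periodic_box((1.5, 0.0, 0.0), (0.0, 4.0, 0.0), (0.0, 0.0, 4.0), 1.0))
nan = math.nan
print(check_periodic_box((nan, nan, nan), (nan, nan, nan), (nan, nan, nan), 1.0))
```

In this sketch a box whose values have turned into NaN trips both checks, so either message could in principle come from a simulation that has become numerically unstable rather than from a genuinely misshapen box. Whether that is what happened on this GTX 480 is not something the log alone can confirm.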