GPU slot failed, PRCG(5767,4,125,552) and some others

Moderators: Site Moderators, FAHC Science Team

Post Reply
iceman1992
Posts: 527
Joined: Fri Mar 23, 2012 5:16 pm

GPU slot failed, PRCG(5767,4,125,552) and some others

Post by iceman1992 »

Okay first of all apologies if I posted in the wrong forum. I started the GPU slot then left the computer, when I came back the Folding Slots status shows "Failed". It seems to have downloaded and failed several times. (5767,4,125,552), (5770,4,103,400), (5771,6,228,432), (5769,14,44,1079), (5768,6,223,885). Here's a copy of the log

Code: Select all

******************************** Date: 24/04/12 ********************************
11:22:17:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
11:22:18:WU01:FS00:News: Welcome to Folding@Home
11:22:18:WU01:FS00:Assigned to work server 171.67.108.11
11:22:18:WU01:FS00:Requesting new work unit for slot 00: READY gpu:0:"GT200 [GeForce GTX 260]" from 171.67.108.11
11:22:18:WU01:FS00:Connecting to 171.67.108.11:8080
11:22:20:WU01:FS00:Downloading 46.17KiB
11:22:20:WU01:FS00:Download complete
11:22:20:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:5767 run:4 clone:125 gen:552 core:0x11 unit:0x40a448b34f968cde0228007d00041687
11:22:20:WU01:FS00:Starting
11:22:20:WU01:FS00:Running FahCore: "J:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" J:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_11.fah/FahCore_11.exe -dir 01 -suffix 01 -version 701 -lifeline 488 -checkpoint 15 -gpu 0
11:22:20:WU01:FS00:Started FahCore on PID 3012
11:22:21:WU01:FS00:Core PID:7936
11:22:21:WU01:FS00:FahCore 0x11 started
11:22:22:WU01:FS00:0x11:
11:22:22:WU01:FS00:0x11:*------------------------------*
11:22:22:WU01:FS00:0x11:Folding@Home GPU Core
11:22:22:WU01:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
11:22:22:WU01:FS00:0x11:
11:22:22:WU01:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
11:22:22:WU01:FS00:0x11:Build host: amoeba
11:22:22:WU01:FS00:0x11:Board Type: Nvidia
11:22:22:WU01:FS00:0x11:Core      : 
11:22:22:WU01:FS00:0x11:Preparing to commence simulation
11:22:22:WU01:FS00:0x11:- Looking at optimizations...
11:22:22:WU01:FS00:0x11:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
11:22:22:WU01:FS00:0x11:- Created dyn
11:22:22:WU01:FS00:0x11:- Files status OK
11:22:22:WU01:FS00:0x11:- Expanded 46762 -> 252912 (decompressed 540.8 percent)
11:22:22:WU01:FS00:0x11:Called DecompressByteArray: compressed_data_size=46762 data_size=252912, decompressed_data_size=252912 diff=0
11:22:22:WU01:FS00:0x11:- Digital signature verified
11:22:22:WU01:FS00:0x11:
11:22:22:WU01:FS00:0x11:Project: 5767 (Run 4, Clone 125, Gen 552)
11:22:22:WU01:FS00:0x11:
11:22:22:WU01:FS00:0x11:Assembly optimizations on if available.
11:22:22:WU01:FS00:0x11:Entering M.D.
11:22:28:WU01:FS00:0x11:Tpr hash 01/wudata_01.tpr:  1141236586 2951761996 4162596706 2572557757 868389311
11:22:28:WU01:FS00:0x11:
11:22:28:WU01:FS00:0x11:Calling fah_main args: 14 usage=100
11:22:28:WU01:FS00:0x11:
11:22:28:WU01:FS00:0x11:mdrun_gpu returned 
11:22:28:WU01:FS00:0x11:Going to send back what have done -- stepsTotalG=0
11:22:28:WU01:FS00:0x11:Work fraction=0.0000 steps=0.
11:22:32:WU01:FS00:0x11:logfile size=0 infoLength=0 edr=0 trr=25
11:22:32:WU01:FS00:0x11:+ Opened results file
11:22:32:WU01:FS00:0x11:- Writing 635 bytes of core data to disk...
11:22:32:WU01:FS00:0x11:Done: 123 -> 120 (compressed to 97.5 percent)
11:22:32:WU01:FS00:0x11:  ... Done.
11:22:32:WU01:FS00:0x11:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
11:22:32:WU01:FS00:0x11:
11:22:32:WU01:FS00:0x11:Folding@home Core Shutdown: UNSTABLE_MACHINE
11:22:33:WU01:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
11:22:33:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:5767 run:4 clone:125 gen:552 core:0x11 unit:0x40a448b34f968cde0228007d00041687
11:22:33:WU01:FS00:Uploading 632B to 171.67.108.11
11:22:33:WU01:FS00:Connecting to 171.67.108.11:8080
11:22:33:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
11:22:33:WU01:FS00:Upload complete
11:22:33:WU01:FS00:Server responded WORK_ACK (400)
11:22:33:WU01:FS00:Cleaning up
11:22:34:WU02:FS00:News: Welcome to Folding@Home
11:22:34:WU02:FS00:Assigned to work server 171.67.108.11
11:22:34:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:"GT200 [GeForce GTX 260]" from 171.67.108.11
11:22:34:WU02:FS00:Connecting to 171.67.108.11:8080
11:22:35:WU02:FS00:Downloading 44.78KiB
11:22:35:WU02:FS00:Download complete
11:22:35:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:OK project:5770 run:4 clone:103 gen:400 core:0x11 unit:0x1507835c4f968ced019000670004168a
11:22:35:WU02:FS00:Starting
11:22:35:WU02:FS00:Running FahCore: "J:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" J:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_11.fah/FahCore_11.exe -dir 02 -suffix 01 -version 701 -lifeline 488 -checkpoint 15 -gpu 0
11:22:35:WU02:FS00:Started FahCore on PID 7904
11:22:35:WU02:FS00:Core PID:2128
11:22:35:WU02:FS00:FahCore 0x11 started
11:22:36:WU02:FS00:Downloading project 5770 description
11:22:36:WU02:FS00:Connecting to fah-web.stanford.edu:80
11:22:36:WU02:FS00:0x11:
11:22:36:WU02:FS00:0x11:*------------------------------*
11:22:36:WU02:FS00:0x11:Folding@Home GPU Core
11:22:36:WU02:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
11:22:36:WU02:FS00:0x11:
11:22:36:WU02:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
11:22:36:WU02:FS00:0x11:Build host: amoeba
11:22:36:WU02:FS00:0x11:Board Type: Nvidia
11:22:36:WU02:FS00:0x11:Core      : 
11:22:36:WU02:FS00:0x11:Preparing to commence simulation
11:22:36:WU02:FS00:0x11:- Looking at optimizations...
11:22:36:WU02:FS00:0x11:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
11:22:36:WU02:FS00:0x11:- Created dyn
11:22:36:WU02:FS00:0x11:- Files status OK
11:22:36:WU02:FS00:0x11:- Expanded 45340 -> 251112 (decompressed 553.8 percent)
11:22:36:WU02:FS00:0x11:Called DecompressByteArray: compressed_data_size=45340 data_size=251112, decompressed_data_size=251112 diff=0
11:22:36:WU02:FS00:0x11:- Digital signature verified
11:22:36:WU02:FS00:0x11:
11:22:36:WU02:FS00:0x11:Project: 5770 (Run 4, Clone 103, Gen 400)
11:22:36:WU02:FS00:0x11:
11:22:36:WU02:FS00:0x11:Assembly optimizations on if available.
11:22:36:WU02:FS00:0x11:Entering M.D.
11:22:36:WU02:FS00:Project 5770 description downloaded successfully
11:22:42:WU02:FS00:0x11:Tpr hash 02/wudata_01.tpr:  1772729197 390797860 1820863660 4038395437 3089067917
11:22:42:WU02:FS00:0x11:
11:22:42:WU02:FS00:0x11:Calling fah_main args: 14 usage=100
11:22:42:WU02:FS00:0x11:
11:22:42:WU02:FS00:0x11:mdrun_gpu returned 
11:22:42:WU02:FS00:0x11:Going to send back what have done -- stepsTotalG=0
11:22:42:WU02:FS00:0x11:Work fraction=0.0000 steps=0.
11:22:46:WU02:FS00:0x11:logfile size=0 infoLength=0 edr=0 trr=25
11:22:46:WU02:FS00:0x11:+ Opened results file
11:22:46:WU02:FS00:0x11:- Writing 635 bytes of core data to disk...
11:22:46:WU02:FS00:0x11:Done: 123 -> 123 (compressed to 100.0 percent)
11:22:46:WU02:FS00:0x11:  ... Done.
11:22:46:WU02:FS00:0x11:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
11:22:46:WU02:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
11:22:46:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:5770 run:4 clone:103 gen:400 core:0x11 unit:0x1507835c4f968ced019000670004168a
11:22:46:WU02:FS00:Uploading 635B to 171.67.108.11
11:22:46:WU02:FS00:Connecting to 171.67.108.11:8080
11:22:46:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
11:22:47:WU01:FS00:News: Welcome to Folding@Home
11:22:47:WU01:FS00:Assigned to work server 171.67.108.11
11:22:47:WU01:FS00:Requesting new work unit for slot 00: READY gpu:0:"GT200 [GeForce GTX 260]" from 171.67.108.11
11:22:47:WU01:FS00:Connecting to 171.67.108.11:8080
11:22:47:WU02:FS00:Upload complete
11:22:47:WU02:FS00:Server responded WORK_ACK (400)
11:22:47:WU02:FS00:Cleaning up
11:22:48:WU01:FS00:Downloading 44.83KiB
11:22:48:WU01:FS00:Download complete
11:22:48:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:5771 run:6 clone:228 gen:432 core:0x11 unit:0x534615684f968cfa01b000e40006168b
11:22:48:WU01:FS00:Starting
11:22:48:WU01:FS00:Running FahCore: "J:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" J:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_11.fah/FahCore_11.exe -dir 01 -suffix 01 -version 701 -lifeline 488 -checkpoint 15 -gpu 0
11:22:48:WU01:FS00:Started FahCore on PID 3760
11:22:48:WU01:FS00:Core PID:4768
11:22:48:WU01:FS00:FahCore 0x11 started
11:22:49:WU01:FS00:Downloading project 5771 description
11:22:49:WU01:FS00:Connecting to fah-web.stanford.edu:80
11:22:49:WU01:FS00:0x11:
11:22:49:WU01:FS00:0x11:*------------------------------*
11:22:49:WU01:FS00:0x11:Folding@Home GPU Core
11:22:49:WU01:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
11:22:49:WU01:FS00:0x11:
11:22:49:WU01:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
11:22:49:WU01:FS00:0x11:Build host: amoeba
11:22:49:WU01:FS00:0x11:Board Type: Nvidia
11:22:49:WU01:FS00:0x11:Core      : 
11:22:49:WU01:FS00:0x11:Preparing to commence simulation
11:22:49:WU01:FS00:0x11:- Looking at optimizations...
11:22:49:WU01:FS00:0x11:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
11:22:49:WU01:FS00:0x11:- Created dyn
11:22:49:WU01:FS00:0x11:- Files status OK
11:22:49:WU01:FS00:0x11:- Expanded 45394 -> 251112 (decompressed 553.1 percent)
11:22:49:WU01:FS00:0x11:Called DecompressByteArray: compressed_data_size=45394 data_size=251112, decompressed_data_size=251112 diff=0
11:22:49:WU01:FS00:0x11:- Digital signature verified
11:22:49:WU01:FS00:0x11:
11:22:49:WU01:FS00:0x11:Project: 5771 (Run 6, Clone 228, Gen 432)
11:22:49:WU01:FS00:0x11:
11:22:49:WU01:FS00:0x11:Assembly optimizations on if available.
11:22:49:WU01:FS00:0x11:Entering M.D.
11:22:49:WU01:FS00:Project 5771 description downloaded successfully
11:22:55:WU01:FS00:0x11:Tpr hash 01/wudata_01.tpr:  2736385263 3425066878 3469081229 2169510608 3451924373
11:22:55:WU01:FS00:0x11:
11:22:55:WU01:FS00:0x11:Calling fah_main args: 14 usage=100
11:22:55:WU01:FS00:0x11:
11:22:55:WU01:FS00:0x11:mdrun_gpu returned 
11:22:55:WU01:FS00:0x11:Going to send back what have done -- stepsTotalG=0
11:22:55:WU01:FS00:0x11:Work fraction=0.0000 steps=0.
11:22:59:WU01:FS00:0x11:logfile size=0 infoLength=0 edr=0 trr=25
11:22:59:WU01:FS00:0x11:+ Opened results file
11:22:59:WU01:FS00:0x11:- Writing 635 bytes of core data to disk...
11:22:59:WU01:FS00:0x11:Done: 123 -> 120 (compressed to 97.5 percent)
11:22:59:WU01:FS00:0x11:  ... Done.
11:22:59:WU01:FS00:0x11:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
11:22:59:WU01:FS00:0x11:
11:22:59:WU01:FS00:0x11:Folding@home Core Shutdown: UNSTABLE_MACHINE
11:22:59:WU01:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
11:22:59:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:5771 run:6 clone:228 gen:432 core:0x11 unit:0x534615684f968cfa01b000e40006168b
11:22:59:WU01:FS00:Uploading 632B to 171.67.108.11
11:22:59:WU01:FS00:Connecting to 171.67.108.11:8080
11:22:59:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
11:23:00:WU01:FS00:Upload complete
11:23:00:WU01:FS00:Server responded WORK_ACK (400)
11:23:00:WU01:FS00:Cleaning up
11:23:00:WU02:FS00:News: Welcome to Folding@Home
11:23:00:WU02:FS00:Assigned to work server 171.67.108.11
11:23:00:WU02:FS00:Requesting new work unit for slot 00: READY gpu:0:"GT200 [GeForce GTX 260]" from 171.67.108.11
11:23:00:WU02:FS00:Connecting to 171.67.108.11:8080
11:23:02:WU02:FS00:Downloading 44.90KiB
11:23:02:WU02:FS00:Download complete
11:23:02:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:OK project:5769 run:14 clone:44 gen:1079 core:0x11 unit:0x08758f644f968d080437002c000e1689
11:23:02:WU02:FS00:Starting
11:23:02:WU02:FS00:Running FahCore: "J:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" J:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_11.fah/FahCore_11.exe -dir 02 -suffix 01 -version 701 -lifeline 488 -checkpoint 15 -gpu 0
11:23:02:WU02:FS00:Started FahCore on PID 6952
11:23:02:WU02:FS00:Core PID:7008
11:23:02:WU02:FS00:FahCore 0x11 started
11:23:03:WU02:FS00:0x11:
11:23:03:WU02:FS00:0x11:*------------------------------*
11:23:03:WU02:FS00:0x11:Folding@Home GPU Core
11:23:03:WU02:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
11:23:03:WU02:FS00:0x11:
11:23:03:WU02:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
11:23:03:WU02:FS00:0x11:Build host: amoeba
11:23:03:WU02:FS00:0x11:Board Type: Nvidia
11:23:03:WU02:FS00:0x11:Core      : 
11:23:03:WU02:FS00:0x11:Preparing to commence simulation
11:23:03:WU02:FS00:0x11:- Looking at optimizations...
11:23:03:WU02:FS00:0x11:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
11:23:03:WU02:FS00:0x11:- Created dyn
11:23:03:WU02:FS00:0x11:- Files status OK
11:23:03:WU02:FS00:0x11:- Expanded 45466 -> 251112 (decompressed 552.3 percent)
11:23:03:WU02:FS00:0x11:Called DecompressByteArray: compressed_data_size=45466 data_size=251112, decompressed_data_size=251112 diff=0
11:23:03:WU02:FS00:0x11:- Digital signature verified
11:23:03:WU02:FS00:0x11:
11:23:03:WU02:FS00:0x11:Project: 5769 (Run 14, Clone 44, Gen 1079)
11:23:03:WU02:FS00:0x11:
11:23:03:WU02:FS00:0x11:Assembly optimizations on if available.
11:23:03:WU02:FS00:0x11:Entering M.D.
11:23:08:WU02:FS00:0x11:Tpr hash 02/wudata_01.tpr:  566093127 780121182 2845536910 501223541 1743460211
11:23:08:WU02:FS00:0x11:
11:23:08:WU02:FS00:0x11:Calling fah_main args: 14 usage=100
11:23:08:WU02:FS00:0x11:
11:23:08:WU02:FS00:0x11:mdrun_gpu returned 
11:23:08:WU02:FS00:0x11:Going to send back what have done -- stepsTotalG=0
11:23:08:WU02:FS00:0x11:Work fraction=0.0000 steps=0.
11:23:12:WU02:FS00:0x11:logfile size=0 infoLength=0 edr=0 trr=25
11:23:12:WU02:FS00:0x11:+ Opened results file
11:23:12:WU02:FS00:0x11:- Writing 635 bytes of core data to disk...
11:23:12:WU02:FS00:0x11:Done: 123 -> 123 (compressed to 100.0 percent)
11:23:12:WU02:FS00:0x11:  ... Done.
11:23:12:WU02:FS00:0x11:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
11:23:12:WU02:FS00:0x11:
11:23:12:WU02:FS00:0x11:Folding@home Core Shutdown: UNSTABLE_MACHINE
11:23:13:WU02:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
11:23:13:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:5769 run:14 clone:44 gen:1079 core:0x11 unit:0x08758f644f968d080437002c000e1689
11:23:13:WU02:FS00:Uploading 635B to 171.67.108.11
11:23:13:WU02:FS00:Connecting to 171.67.108.11:8080
11:23:13:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
11:23:13:WU02:FS00:Upload complete
11:23:13:WU02:FS00:Server responded WORK_ACK (400)
11:23:13:WU02:FS00:Cleaning up
11:23:14:WU01:FS00:News: Welcome to Folding@Home
11:23:14:WU01:FS00:Assigned to work server 171.67.108.11
11:23:14:WU01:FS00:Requesting new work unit for slot 00: READY gpu:0:"GT200 [GeForce GTX 260]" from 171.67.108.11
11:23:14:WU01:FS00:Connecting to 171.67.108.11:8080
11:23:15:WU01:FS00:Downloading 46.12KiB
11:23:15:WU01:FS00:Download complete
11:23:15:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:5768 run:6 clone:223 gen:885 core:0x11 unit:0x6b1ddd654f968d15037500df00061688
11:23:15:WU01:FS00:Starting
11:23:15:WU01:FS00:Running FahCore: "J:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" J:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_11.fah/FahCore_11.exe -dir 01 -suffix 01 -version 701 -lifeline 488 -checkpoint 15 -gpu 0
11:23:15:WU01:FS00:Started FahCore on PID 3324
11:23:15:WU01:FS00:Core PID:4244
11:23:15:WU01:FS00:FahCore 0x11 started
11:23:16:WU01:FS00:0x11:
11:23:16:WU01:FS00:0x11:*------------------------------*
11:23:16:WU01:FS00:0x11:Folding@Home GPU Core
11:23:16:WU01:FS00:0x11:Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
11:23:16:WU01:FS00:0x11:
11:23:16:WU01:FS00:0x11:Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86 
11:23:16:WU01:FS00:0x11:Build host: amoeba
11:23:16:WU01:FS00:0x11:Board Type: Nvidia
11:23:16:WU01:FS00:0x11:Core      : 
11:23:16:WU01:FS00:0x11:Preparing to commence simulation
11:23:16:WU01:FS00:0x11:- Looking at optimizations...
11:23:16:WU01:FS00:0x11:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
11:23:16:WU01:FS00:0x11:- Created dyn
11:23:16:WU01:FS00:0x11:- Files status OK
11:23:16:WU01:FS00:0x11:- Expanded 46714 -> 252912 (decompressed 541.4 percent)
11:23:16:WU01:FS00:0x11:Called DecompressByteArray: compressed_data_size=46714 data_size=252912, decompressed_data_size=252912 diff=0
11:23:16:WU01:FS00:0x11:- Digital signature verified
11:23:16:WU01:FS00:0x11:
11:23:16:WU01:FS00:0x11:Project: 5768 (Run 6, Clone 223, Gen 885)
11:23:16:WU01:FS00:0x11:
11:23:16:WU01:FS00:0x11:Assembly optimizations on if available.
11:23:16:WU01:FS00:0x11:Entering M.D.
11:23:22:WU01:FS00:0x11:Tpr hash 01/wudata_01.tpr:  3121343094 3632663828 133009389 2312126529 4115683823
11:23:22:WU01:FS00:0x11:
11:23:22:WU01:FS00:0x11:Calling fah_main args: 14 usage=100
11:23:22:WU01:FS00:0x11:
11:23:22:WU01:FS00:0x11:mdrun_gpu returned 
11:23:22:WU01:FS00:0x11:Going to send back what have done -- stepsTotalG=0
11:23:22:WU01:FS00:0x11:Work fraction=0.0000 steps=0.
11:23:26:WU01:FS00:0x11:logfile size=0 infoLength=0 edr=0 trr=25
11:23:26:WU01:FS00:0x11:+ Opened results file
11:23:26:WU01:FS00:0x11:- Writing 635 bytes of core data to disk...
11:23:26:WU01:FS00:0x11:Done: 123 -> 120 (compressed to 97.5 percent)
11:23:26:WU01:FS00:0x11:  ... Done.
11:23:26:WU01:FS00:0x11:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
11:23:26:WU01:FS00:0x11:
11:23:26:WU01:FS00:0x11:Folding@home Core Shutdown: UNSTABLE_MACHINE
11:23:26:WU01:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
11:23:26:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:5768 run:6 clone:223 gen:885 core:0x11 unit:0x6b1ddd654f968d15037500df00061688
11:23:26:WU01:FS00:Uploading 632B to 171.67.108.11
11:23:26:WU01:FS00:Connecting to 171.67.108.11:8080
11:23:27:WU01:FS00:Upload complete
11:23:27:WU01:FS00:Server responded WORK_ACK (400)
11:23:27:WU01:FS00:Cleaning up
I was also running the SMP slot. I have run both at the same time, many times before with no problems at all. In fact this is the first time I get an error from the v7 client. Since it failed several times, it seems unlikely to be a bad WU, but I have absolutely no idea what's wrong. My GTX260 is not overclocked.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: GPU slot failed, PRCG(5767,4,125,552) and some others

Post by bruce »

UNSTABLE_MACHINE (0x7a) and sometimes UNKNOWN_ENUM (-1 = 0xffffffff) are most likely indicative of a hardware failure in the GPU. Overclocking (including so-called factory overclocking) is one possibility, but so is overheating and power supply limitations.

Is your GPU some place that's not getting enough air circulation? Has dust accumulated in the heatsink? Can you increase the GPU fan speed? Does the GPU + SMP add enough extra heat inside the case for another fan to be useful?

Are you sure your PS can supply enough amps on the power circuit(s) it's attached to?

. . . and then there's always the possiblity that the hardware is showing signs of it's age. I started having those types of errors and finally the whole system crashed. I still need to fix it. I already did a RMA once on the GPU when it failed memory diagnostics and it worked for a while. Now it looks like the PS has died. It's tme for some one-on-one time with my hardware.
iceman1992
Posts: 527
Joined: Fri Mar 23, 2012 5:16 pm

Re: GPU slot failed, PRCG(5767,4,125,552) and some others

Post by iceman1992 »

bruce wrote:UNSTABLE_MACHINE (0x7a) and sometimes UNKNOWN_ENUM (-1 = 0xffffffff) are most likely indicative of a hardware failure in the GPU. Overclocking (including so-called factory overclocking) is one possibility, but so is overheating and power supply limitations.
It is not overclocked at all. Not factory overclocked, not overclocked by me.
bruce wrote:Is your GPU some place that's not getting enough air circulation? Has dust accumulated in the heatsink? Can you increase the GPU fan speed? Does the GPU + SMP add enough extra heat inside the case for another fan to be useful?
I run MSI Afterburner and the fan speed is set to increase with GPU temperature, so no overheating issues here, max temps while folding is only around 69-75C.
bruce wrote:Are you sure your PS can supply enough amps on the power circuit(s) it's attached to?
This I'm not so sure, but as I said, I've had both slots running together many times with no problems at all.
bruce wrote:. . . and then there's always the possiblity that the hardware is showing signs of it's age. I started having those types of errors and finally the whole system crashed. I still need to fix it. I already did a RMA once on the GPU when it failed memory diagnostics and it worked for a while. Now it looks like the PS has died. It's tme for some one-on-one time with my hardware.
Well if my hardware is going bad, not good lol. although I'll have a reason to upgrade if it fails :P
To update, I've paused the GPU slot and restarted it, and it has since successfully completed 2 WUs with no errors. So this confuses me even more. Was that just a one time hiccup or what?
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: GPU slot failed, PRCG(5767,4,125,552) and some others

Post by bruce »

The difference between a one-time hiccup and a problem that you'll run into again is up to you to judge. Two completed WUs and one with an error isn't exactly enough to establish a trend.

All I can really tell you is that FAH stresses your hardware to the max, and sometimes that turns up hardware-based problems that you have not seen before. If you decide to ignore that possiblity, just let it run and judge for yourself if it's something you need to fix or not.

The FahClient does check the hardware for possible computational errors. Sometimes those errors show up as a -1 and sometimes as a 7a error. There's a future bug fix that will transform some or all of the -1 (unknown) errors into 7a (computational error) but until that code is distributed, we have to assume they're probably the same type of error.
iceman1992
Posts: 527
Joined: Fri Mar 23, 2012 5:16 pm

Re: GPU slot failed, PRCG(5767,4,125,552) and some others

Post by iceman1992 »

bruce wrote:All I can really tell you is that FAH stresses your hardware to the max, and sometimes that turns up hardware-based problems that you have not seen before. If you decide to ignore that possiblity, just let it run and judge for yourself if it's something you need to fix or not.
Well I'm not sure it stresses my GPU to the very max. Because when I use it for gaming, it goes up to around 78-83 C, much hotter than when it folds.
Post Reply