
GPU_MEMTEST_ERROR

Posted: Sat Dec 08, 2012 10:08 pm
by VijayPande
For those getting the GPU_MEMTEST_ERROR, we have a question that would help us debug this: which work server (WS) are you being assigned to?
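For anyone unsure where to look: the work server appears in the client log on lines containing "Assigned to work server". Below is a minimal Python sketch for pulling those addresses out of a saved log. The function name `assigned_work_servers` and the sample text are illustrative assumptions, not part of the FAH client, and the pattern is based only on the log lines quoted in this thread.

```python
import re
from collections import Counter

# Pattern based on client log lines of the form
# "21:25:37:WU02:FS01:Assigned to work server 171.64.65.105".
ASSIGNED_RE = re.compile(
    r"^\d{2}:\d{2}:\d{2}:WU\d+:(FS\d+):"
    r"Assigned to work server (\d{1,3}(?:\.\d{1,3}){3})$"
)

def assigned_work_servers(log_text):
    """Count work server assignments per (slot, address) in a log."""
    counts = Counter()
    for line in log_text.splitlines():
        match = ASSIGNED_RE.match(line.strip())
        if match:
            slot, address = match.groups()
            counts[(slot, address)] += 1
    return counts

# Hypothetical excerpt in the same format as the logs posted here.
sample = """\
21:25:37:WU02:FS01:News: Welcome to Folding@Home
21:25:37:WU02:FS01:Assigned to work server 171.64.65.105
21:25:37:WU02:FS01:Connecting to 171.64.65.105:8080
"""

for (slot, address), n in assigned_work_servers(sample).items():
    print(f"{slot} -> {address} ({n} assignment(s))")
```

To run it on a real log, read the file into a string and pass it to the function; the log location varies by installation, so no path is assumed here.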

Re: GPU_MEMTEST_ERROR

Posted: Sat Dec 08, 2012 10:15 pm
by Sliced
171.67.108.143
If I'm reading it correctly.

Re: GPU_MEMTEST_ERROR

Posted: Sat Dec 08, 2012 10:16 pm
by Mactin
Code:
21:25:37:WU02:FS01:News: Welcome to Folding@Home
21:25:37:WU02:FS01:Assigned to work server 171.64.65.105
21:25:37:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:"G92 [GeForce GTS 250]" from 171.64.65.105
21:25:37:WU02:FS01:Connecting to 171.64.65.105:8080
21:25:38:WU02:FS01:Downloading 122.69KiB
21:25:39:WU02:FS01:Download complete
21:25:39:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:7625 run:233 clone:0 gen:80 core:0x15 unit:0x0000007e664f2dd14fe61678a4c4fb02
21:25:39:WU02:FS01:Starting
21:25:39:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/folding/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 4480 -checkpoint 15 -gpu 1
21:25:39:WU02:FS01:Started FahCore on PID 3428
21:25:39:WU02:FS01:Core PID:4204
21:25:39:WU02:FS01:FahCore 0x15 started
21:25:39:WU02:FS01:Downloading project 7625 description
21:25:39:WU02:FS01:Connecting to fah-web.stanford.edu:80
21:25:39:WU02:FS01:Project 7625 description downloaded successfully
21:25:39:WU02:FS01:0x15:
21:25:39:WU02:FS01:0x15:*------------------------------*
21:25:39:WU02:FS01:0x15:Folding@Home GPU Core
21:25:39:WU02:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
21:25:39:WU02:FS01:0x15:Build host             AmoebaRemote
21:25:39:WU02:FS01:0x15:Board Type             NVIDIA/CUDA
21:25:39:WU02:FS01:0x15:Core                   15
21:25:39:WU02:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
21:25:39:WU02:FS01:0x15:
21:25:39:WU02:FS01:0x15:Window's signal control handler registered.
21:25:39:WU02:FS01:0x15:Preparing to commence simulation
21:25:39:WU02:FS01:0x15:- Looking at optimizations...
21:25:39:WU02:FS01:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
21:25:39:WU02:FS01:0x15:- Created dyn
21:25:39:WU02:FS01:0x15:- Files status OK
21:25:39:WU02:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
21:25:39:WU02:FS01:0x15:- Expanded 125126 -> 502918 (decompressed 401.9 percent)
21:25:39:WU02:FS01:0x15:Called DecompressByteArray: compressed_data_size=125126 data_size=502918, decompressed_data_size=502918 diff=0
21:25:39:WU02:FS01:0x15:- Digital signature verified
21:25:39:WU02:FS01:0x15:
21:25:39:WU02:FS01:0x15:Project: 7625 (Run 233, Clone 0, Gen 80)
21:25:39:WU02:FS01:0x15:
21:25:39:WU02:FS01:0x15:Assembly optimizations on if available.
21:25:39:WU02:FS01:0x15:Entering M.D.
21:25:41:WU02:FS01:0x15:Tpr hash 02/wudata_01.tpr:  4294052510 2327148517 1321333049 3722041980 3894124969
21:25:41:WU02:FS01:0x15:GPU device id=1
21:25:41:WU02:FS01:0x15:Working on Protein
21:25:41:WU02:FS01:0x15:Client config unavailable.
21:25:41:WU02:FS01:0x15:Starting GUI Server
21:26:45:WU02:FS01:0x15:Finished fah_main status=59
21:26:45:WU02:FS01:0x15:mdrun_gpu returned 59
21:26:45:WU02:FS01:0x15:GPU memtest failure
21:26:45:WU02:FS01:0x15:
21:26:45:WU02:FS01:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
21:26:45:WARNING:WU02:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)

Re: GPU_MEMTEST_ERROR

Posted: Sat Dec 08, 2012 10:18 pm
by VijayPande
P.S. Given all the reports, it seems the memtest issue is something on our side (nothing wrong with the GPUs). We're looking into it right now.

Re: GPU_MEMTEST_ERROR

Posted: Sat Dec 08, 2012 10:59 pm
by JacobKlein
I have two GeForce GTS 240 cards on Windows 8 Pro with Media Center x64.
Both were previously stuck in the CORE_OUTDATED loop, but restarting the client has since downloaded and installed the new core.

The error I now get, on both cards, is GPU_MEMTEST_ERROR.
Both cards appear to be looping: each task hits the error repeatedly, is failed for too many errors, and is replaced by a new task that fails the same way.
These are stock Dell cards and are not overclocked; I sincerely believe the error is spurious.
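The loop described above can be checked against the log itself. The Python sketch below tallies `FahCore returned: GPU_MEMTEST_ERROR` warnings per work unit and slot, and flags the units the client then gave up on. The helper name `summarize_memtest_failures` and the sample lines are illustrative assumptions, and the patterns are based only on the log format quoted in this thread.

```python
import re
from collections import defaultdict

# Patterns based on client log lines such as
# "22:48:33:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)"
# and "22:55:01:WARNING:WU02:FS00:Too many errors, failing".
RETURNED_RE = re.compile(
    r"WARNING:(WU\d+):(FS\d+):FahCore returned: GPU_MEMTEST_ERROR\b"
)
FAILING_RE = re.compile(r"WARNING:(WU\d+):(FS\d+):Too many errors, failing")

def summarize_memtest_failures(log_text):
    """Return {(wu, slot): {"memtest_errors": int, "failed": bool}}.

    Work unit ids such as WU02 can recur in a log, so this is only
    meaningful over a short excerpt.
    """
    summary = defaultdict(lambda: {"memtest_errors": 0, "failed": False})
    for line in log_text.splitlines():
        returned = RETURNED_RE.search(line)
        if returned:
            summary[returned.groups()]["memtest_errors"] += 1
            continue
        failing = FAILING_RE.search(line)
        if failing:
            summary[failing.groups()]["failed"] = True
    return dict(summary)

# Hypothetical excerpt in the same format as the logs posted here.
sample = """\
22:48:33:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:49:40:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:55:01:WARNING:WU02:FS00:Too many errors, failing
22:48:33:WARNING:WU01:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
"""

for (wu, slot), info in summarize_memtest_failures(sample).items():
    print(wu, slot, info["memtest_errors"], "failed" if info["failed"] else "retrying")
```

Keying on the (work unit, slot) pair keeps the two cards separate, since both slots write to the same client log.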

The work server information and relevant logs for each slot are below.
If you need anything else that can help you to solve this issue, please let me know in this thread.
Regards,
Jacob Klein

Slot 0: NVIDIA GTS 240; Work Server: 171.64.65.105; Collection Server: 171.65.103.160
Code:
*********************** Log Started 2012-12-08T22:47:15Z ***********************
22:47:25:WU02:FS00:Starting
22:47:25:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 1
22:47:25:WU02:FS00:Started FahCore on PID 4624
22:47:25:WU02:FS00:Core PID:5464
22:47:25:WU02:FS00:FahCore 0x15 started
22:47:26:WU02:FS00:0x15:
22:47:26:WU02:FS00:0x15:*------------------------------*
22:47:26:WU02:FS00:0x15:Folding@Home GPU Core
22:47:26:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:47:26:WU02:FS00:0x15:Build host             AmoebaRemote
22:47:26:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
22:47:26:WU02:FS00:0x15:Core                   15
22:47:26:WU02:FS00:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
22:47:26:WU02:FS00:0x15:
22:47:26:WU02:FS00:0x15:Window's signal control handler registered.
22:47:26:WU02:FS00:0x15:Preparing to commence simulation
22:47:26:WU02:FS00:0x15:- Looking at optimizations...
22:47:26:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:47:26:WU02:FS00:0x15:- Created dyn
22:47:26:WU02:FS00:0x15:- Files status OK
22:47:26:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:47:26:WU02:FS00:0x15:- Expanded 126221 -> 507182 (decompressed 401.8 percent)
22:47:26:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=126221 data_size=507182, decompressed_data_size=507182 diff=0
22:47:26:WU02:FS00:0x15:- Digital signature verified
22:47:26:WU02:FS00:0x15:
22:47:26:WU02:FS00:0x15:Project: 7624 (Run 698, Clone 1, Gen 1)
22:47:26:WU02:FS00:0x15:
22:47:26:WU02:FS00:0x15:Assembly optimizations on if available.
22:47:26:WU02:FS00:0x15:Entering M.D.
22:47:28:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  4068508733 2064919972 2088961993 2530427452 806200720
22:47:28:WU02:FS00:0x15:GPU device id=1
22:47:28:WU02:FS00:0x15:Working on Protein
22:47:28:WU02:FS00:0x15:Client config unavailable.
22:47:28:WU02:FS00:0x15:Starting GUI Server
22:48:32:WU02:FS00:0x15:Finished fah_main status=59
22:48:32:WU02:FS00:0x15:mdrun_gpu returned 59
22:48:32:WU02:FS00:0x15:GPU memtest failure
22:48:32:WU02:FS00:0x15:
22:48:32:WU02:FS00:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
22:48:33:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:48:33:WU02:FS00:Starting
22:48:33:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 1
22:48:33:WU02:FS00:Started FahCore on PID 6192
22:48:33:WU02:FS00:Core PID:6148
22:48:33:WU02:FS00:FahCore 0x15 started
22:48:33:WU02:FS00:0x15:
22:48:33:WU02:FS00:0x15:*------------------------------*
22:48:33:WU02:FS00:0x15:Folding@Home GPU Core
22:48:33:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:48:33:WU02:FS00:0x15:Build host             AmoebaRemote
22:48:33:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
22:48:33:WU02:FS00:0x15:Core                   15
22:48:33:WU02:FS00:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
22:48:33:WU02:FS00:0x15:
22:48:33:WU02:FS00:0x15:Window's signal control handler registered.
22:48:33:WU02:FS00:0x15:Preparing to commence simulation
22:48:33:WU02:FS00:0x15:- Looking at optimizations...
22:48:33:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:48:33:WU02:FS00:0x15:- Created dyn
22:48:33:WU02:FS00:0x15:- Files status OK
22:48:33:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:48:33:WU02:FS00:0x15:- Expanded 126221 -> 507182 (decompressed 401.8 percent)
22:48:33:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=126221 data_size=507182, decompressed_data_size=507182 diff=0
22:48:33:WU02:FS00:0x15:- Digital signature verified
22:48:33:WU02:FS00:0x15:
22:48:33:WU02:FS00:0x15:Project: 7624 (Run 698, Clone 1, Gen 1)
22:48:33:WU02:FS00:0x15:
22:48:33:WU02:FS00:0x15:Assembly optimizations on if available.
22:48:33:WU02:FS00:0x15:Entering M.D.
22:48:35:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  4068508733 2064919972 2088961993 2530427452 806200720
22:48:35:WU02:FS00:0x15:GPU device id=1
22:48:35:WU02:FS00:0x15:Working on Protein
22:48:35:WU02:FS00:0x15:Client config unavailable.
22:48:35:WU02:FS00:0x15:Starting GUI Server
22:49:39:WU02:FS00:0x15:Finished fah_main status=59
22:49:39:WU02:FS00:0x15:mdrun_gpu returned 59
22:49:39:WU02:FS00:0x15:GPU memtest failure
22:49:39:WU02:FS00:0x15:
22:49:39:WU02:FS00:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
22:49:40:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:49:40:WU02:FS00:Starting
22:49:40:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 1
22:49:40:WU02:FS00:Started FahCore on PID 7980
22:49:40:WU02:FS00:Core PID:3828
22:49:40:WU02:FS00:FahCore 0x15 started
22:49:40:WU02:FS00:0x15:
22:49:40:WU02:FS00:0x15:*------------------------------*
22:49:40:WU02:FS00:0x15:Folding@Home GPU Core
22:49:40:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:49:40:WU02:FS00:0x15:Build host             AmoebaRemote
22:49:40:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
22:49:40:WU02:FS00:0x15:Core                   15
22:49:40:WU02:FS00:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
22:49:40:WU02:FS00:0x15:
22:49:40:WU02:FS00:0x15:Window's signal control handler registered.
22:49:40:WU02:FS00:0x15:Preparing to commence simulation
22:49:40:WU02:FS00:0x15:- Looking at optimizations...
22:49:40:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:49:40:WU02:FS00:0x15:- Created dyn
22:49:40:WU02:FS00:0x15:- Files status OK
22:49:40:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:49:40:WU02:FS00:0x15:- Expanded 126221 -> 507182 (decompressed 401.8 percent)
22:49:40:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=126221 data_size=507182, decompressed_data_size=507182 diff=0
22:49:40:WU02:FS00:0x15:- Digital signature verified
22:49:40:WU02:FS00:0x15:
22:49:40:WU02:FS00:0x15:Project: 7624 (Run 698, Clone 1, Gen 1)
22:49:40:WU02:FS00:0x15:
22:49:40:WU02:FS00:0x15:Assembly optimizations on if available.
22:49:40:WU02:FS00:0x15:Entering M.D.
22:49:42:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  4068508733 2064919972 2088961993 2530427452 806200720
22:49:42:WU02:FS00:0x15:GPU device id=1
22:49:42:WU02:FS00:0x15:Working on Protein
22:49:42:WU02:FS00:0x15:Client config unavailable.
22:49:42:WU02:FS00:0x15:Starting GUI Server
22:50:47:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:51:17:WU02:FS00:Starting
22:51:17:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 1
22:51:17:WU02:FS00:Started FahCore on PID 6272
22:51:17:WU02:FS00:Core PID:3964
22:51:17:WU02:FS00:FahCore 0x15 started
22:51:17:WU02:FS00:0x15:
22:51:17:WU02:FS00:0x15:*------------------------------*
22:51:17:WU02:FS00:0x15:Folding@Home GPU Core
22:51:17:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:51:17:WU02:FS00:0x15:Build host             AmoebaRemote
22:51:17:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
22:51:17:WU02:FS00:0x15:Core                   15
22:51:17:WU02:FS00:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
22:51:17:WU02:FS00:0x15:
22:51:17:WU02:FS00:0x15:Window's signal control handler registered.
22:51:17:WU02:FS00:0x15:Preparing to commence simulation
22:51:17:WU02:FS00:0x15:- Looking at optimizations...
22:51:17:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:51:17:WU02:FS00:0x15:- Created dyn
22:51:17:WU02:FS00:0x15:- Files status OK
22:51:17:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:51:17:WU02:FS00:0x15:- Expanded 126221 -> 507182 (decompressed 401.8 percent)
22:51:17:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=126221 data_size=507182, decompressed_data_size=507182 diff=0
22:51:17:WU02:FS00:0x15:- Digital signature verified
22:51:17:WU02:FS00:0x15:
22:51:17:WU02:FS00:0x15:Project: 7624 (Run 698, Clone 1, Gen 1)
22:51:17:WU02:FS00:0x15:
22:51:17:WU02:FS00:0x15:Assembly optimizations on if available.
22:51:17:WU02:FS00:0x15:Entering M.D.
22:51:19:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  4068508733 2064919972 2088961993 2530427452 806200720
22:51:19:WU02:FS00:0x15:GPU device id=1
22:51:19:WU02:FS00:0x15:Working on Protein
22:51:19:WU02:FS00:0x15:Client config unavailable.
22:51:19:WU02:FS00:0x15:Starting GUI Server
22:52:21:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:53:54:WU02:FS00:Starting
22:53:54:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 1
22:53:54:WU02:FS00:Started FahCore on PID 5812
22:53:54:WU02:FS00:Core PID:5356
22:53:54:WU02:FS00:FahCore 0x15 started
22:53:54:WU02:FS00:0x15:
22:53:54:WU02:FS00:0x15:*------------------------------*
22:53:54:WU02:FS00:0x15:Folding@Home GPU Core
22:53:54:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:53:54:WU02:FS00:0x15:Build host             AmoebaRemote
22:53:54:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
22:53:54:WU02:FS00:0x15:Core                   15
22:53:54:WU02:FS00:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
22:53:54:WU02:FS00:0x15:
22:53:54:WU02:FS00:0x15:Window's signal control handler registered.
22:53:54:WU02:FS00:0x15:Preparing to commence simulation
22:53:54:WU02:FS00:0x15:- Looking at optimizations...
22:53:54:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:53:54:WU02:FS00:0x15:- Created dyn
22:53:54:WU02:FS00:0x15:- Files status OK
22:53:54:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:53:54:WU02:FS00:0x15:- Expanded 126221 -> 507182 (decompressed 401.8 percent)
22:53:54:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=126221 data_size=507182, decompressed_data_size=507182 diff=0
22:53:54:WU02:FS00:0x15:- Digital signature verified
22:53:54:WU02:FS00:0x15:
22:53:54:WU02:FS00:0x15:Project: 7624 (Run 698, Clone 1, Gen 1)
22:53:54:WU02:FS00:0x15:
22:53:54:WU02:FS00:0x15:Assembly optimizations on if available.
22:53:54:WU02:FS00:0x15:Entering M.D.
22:53:56:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  4068508733 2064919972 2088961993 2530427452 806200720
22:53:56:WU02:FS00:0x15:GPU device id=1
22:53:56:WU02:FS00:0x15:Working on Protein
22:53:56:WU02:FS00:0x15:Client config unavailable.
22:53:56:WU02:FS00:0x15:Starting GUI Server
22:55:01:WU02:FS00:0x15:Finished fah_main status=59
22:55:01:WU02:FS00:0x15:mdrun_gpu returned 59
22:55:01:WU02:FS00:0x15:GPU memtest failure
22:55:01:WU02:FS00:0x15:
22:55:01:WU02:FS00:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
22:55:01:WARNING:WU02:FS00:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:55:01:WARNING:WU02:FS00:Too many errors, failing
22:55:01:WU02:FS00:Sending unit results: id:02 state:SEND error:FAILED project:7624 run:698 clone:1 gen:1 core:0x15 unit:0x00000001664f2dd14fe6148ce7c3b5c9
22:55:01:WU02:FS00:Connecting to 171.64.65.105:8080
22:55:01:WU00:FS00:Connecting to assign-GPU.stanford.edu:80
22:55:02:WU02:FS00:Server responded WORK_QUIT (404)
22:55:02:WARNING:WU02:FS00:Server did not like results, dumping
22:55:02:WU02:FS00:Cleaning up
22:55:02:WU00:FS00:News: Welcome to Folding@Home
22:55:02:WU00:FS00:Assigned to work server 171.64.65.105
22:55:02:WU00:FS00:Requesting new work unit for slot 00: READY gpu:0:"G92 [GeForce GTS 240]" from 171.64.65.105
22:55:02:WU00:FS00:Connecting to 171.64.65.105:8080
22:55:02:WU00:FS00:Downloading 122.61KiB
22:55:03:WU00:FS00:Download complete
22:55:03:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:7623 run:270 clone:0 gen:151 core:0x15 unit:0x000000c1664f2dd14fe4f9fa8a4bfdf6
22:55:03:WU00:FS00:Starting
22:55:03:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 00 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 1
22:55:03:WU00:FS00:Started FahCore on PID 6552
22:55:03:WU00:FS00:Core PID:2524
22:55:03:WU00:FS00:FahCore 0x15 started
22:55:04:WU00:FS00:0x15:
22:55:04:WU00:FS00:0x15:*------------------------------*
22:55:04:WU00:FS00:0x15:Folding@Home GPU Core
22:55:04:WU00:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:55:04:WU00:FS00:0x15:Build host             AmoebaRemote
22:55:04:WU00:FS00:0x15:Board Type             NVIDIA/CUDA
22:55:04:WU00:FS00:0x15:Core                   15
22:55:04:WU00:FS00:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=1
22:55:04:WU00:FS00:0x15:
22:55:04:WU00:FS00:0x15:Window's signal control handler registered.
22:55:04:WU00:FS00:0x15:Preparing to commence simulation
22:55:04:WU00:FS00:0x15:- Looking at optimizations...
22:55:04:WU00:FS00:0x15:DeleteFrameFiles: successfully deleted file=00/wudata_01.ckp
22:55:04:WU00:FS00:0x15:- Created dyn
22:55:04:WU00:FS00:0x15:- Files status OK
22:55:04:WU00:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:55:04:WU00:FS00:0x15:- Expanded 125037 -> 501826 (decompressed 401.3 percent)
22:55:04:WU00:FS00:0x15:Called DecompressByteArray: compressed_data_size=125037 data_size=501826, decompressed_data_size=501826 diff=0
22:55:04:WU00:FS00:0x15:- Digital signature verified
22:55:04:WU00:FS00:0x15:
22:55:04:WU00:FS00:0x15:Project: 7623 (Run 270, Clone 0, Gen 151)
22:55:04:WU00:FS00:0x15:
22:55:04:WU00:FS00:0x15:Assembly optimizations on if available.
22:55:04:WU00:FS00:0x15:Entering M.D.
22:55:05:WU00:FS00:0x15:Tpr hash 00/wudata_01.tpr:  1376015233 1021321501 2713841013 335081520 619206503
22:55:05:WU00:FS00:0x15:GPU device id=1
22:55:05:WU00:FS00:0x15:Working on Protein
22:55:05:WU00:FS00:0x15:Client config unavailable.
22:55:06:WU00:FS00:0x15:Starting GUI Server


Slot 1: NVIDIA GTS 240; Work Server: 171.64.65.105; Collection Server: 171.65.103.160
Code:
*********************** Log Started 2012-12-08T22:47:15Z ***********************
22:47:15:WU01:FS01:Downloading core from http://www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah
22:47:16:WU01:FS01:Connecting to www.stanford.edu:80
22:47:16:WU01:FS01:FahCore 15: Downloading 1.88MiB
22:47:22:WU01:FS01:FahCore 15: 53.32%
22:47:25:WU01:FS01:FahCore 15: Download complete
22:47:25:WU01:FS01:Valid core signature
22:47:25:WU01:FS01:Unpacked 7.71MiB to cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe
22:47:25:WU01:FS01:Starting
22:47:25:WU01:FS01:Removing old file './work/01/logfile_01-20121207-230021.txt'
22:47:25:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 2
22:47:25:WU01:FS01:Started FahCore on PID 7876
22:47:25:WU01:FS01:Core PID:6620
22:47:25:WU01:FS01:FahCore 0x15 started
22:47:26:WU01:FS01:0x15:
22:47:26:WU01:FS01:0x15:*------------------------------*
22:47:26:WU01:FS01:0x15:Folding@Home GPU Core
22:47:26:WU01:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:47:26:WU01:FS01:0x15:Build host             AmoebaRemote
22:47:26:WU01:FS01:0x15:Board Type             NVIDIA/CUDA
22:47:26:WU01:FS01:0x15:Core                   15
22:47:26:WU01:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=2
22:47:26:WU01:FS01:0x15:
22:47:26:WU01:FS01:0x15:Window's signal control handler registered.
22:47:26:WU01:FS01:0x15:Preparing to commence simulation
22:47:26:WU01:FS01:0x15:- Looking at optimizations...
22:47:26:WU01:FS01:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
22:47:26:WU01:FS01:0x15:- Created dyn
22:47:26:WU01:FS01:0x15:- Files status OK
22:47:26:WU01:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:47:26:WU01:FS01:0x15:- Expanded 124634 -> 501826 (decompressed 402.6 percent)
22:47:26:WU01:FS01:0x15:Called DecompressByteArray: compressed_data_size=124634 data_size=501826, decompressed_data_size=501826 diff=0
22:47:26:WU01:FS01:0x15:- Digital signature verified
22:47:26:WU01:FS01:0x15:
22:47:26:WU01:FS01:0x15:Project: 7623 (Run 502, Clone 2, Gen 1)
22:47:26:WU01:FS01:0x15:
22:47:26:WU01:FS01:0x15:Assembly optimizations on if available.
22:47:26:WU01:FS01:0x15:Entering M.D.
22:47:28:WU01:FS01:0x15:Tpr hash 01/wudata_01.tpr:  65967345 409364089 2322475341 2362913948 2715724567
22:47:28:WU01:FS01:0x15:GPU device id=2
22:47:28:WU01:FS01:0x15:Working on Protein
22:47:28:WU01:FS01:0x15:Client config unavailable.
22:47:28:WU01:FS01:0x15:Starting GUI Server
22:48:33:WARNING:WU01:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:48:33:WU01:FS01:Starting
22:48:33:WU01:FS01:Removing old file './work/01/logfile_01-20121207-230159.txt'
22:48:33:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 2
22:48:33:WU01:FS01:Started FahCore on PID 4820
22:48:33:WU01:FS01:Core PID:4148
22:48:33:WU01:FS01:FahCore 0x15 started
22:48:33:WU01:FS01:0x15:
22:48:33:WU01:FS01:0x15:*------------------------------*
22:48:33:WU01:FS01:0x15:Folding@Home GPU Core
22:48:33:WU01:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:48:33:WU01:FS01:0x15:Build host             AmoebaRemote
22:48:33:WU01:FS01:0x15:Board Type             NVIDIA/CUDA
22:48:33:WU01:FS01:0x15:Core                   15
22:48:33:WU01:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=2
22:48:33:WU01:FS01:0x15:
22:48:33:WU01:FS01:0x15:Window's signal control handler registered.
22:48:33:WU01:FS01:0x15:Preparing to commence simulation
22:48:33:WU01:FS01:0x15:- Looking at optimizations...
22:48:33:WU01:FS01:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
22:48:33:WU01:FS01:0x15:- Created dyn
22:48:33:WU01:FS01:0x15:- Files status OK
22:48:33:WU01:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:48:33:WU01:FS01:0x15:- Expanded 124634 -> 501826 (decompressed 402.6 percent)
22:48:33:WU01:FS01:0x15:Called DecompressByteArray: compressed_data_size=124634 data_size=501826, decompressed_data_size=501826 diff=0
22:48:33:WU01:FS01:0x15:- Digital signature verified
22:48:33:WU01:FS01:0x15:
22:48:33:WU01:FS01:0x15:Project: 7623 (Run 502, Clone 2, Gen 1)
22:48:33:WU01:FS01:0x15:
22:48:33:WU01:FS01:0x15:Assembly optimizations on if available.
22:48:33:WU01:FS01:0x15:Entering M.D.
22:48:35:WU01:FS01:0x15:Tpr hash 01/wudata_01.tpr:  65967345 409364089 2322475341 2362913948 2715724567
22:48:35:WU01:FS01:0x15:GPU device id=2
22:48:35:WU01:FS01:0x15:Working on Protein
22:48:35:WU01:FS01:0x15:Client config unavailable.
22:48:35:WU01:FS01:0x15:Starting GUI Server
22:49:40:WU01:FS01:0x15:Finished fah_main status=59
22:49:40:WU01:FS01:0x15:mdrun_gpu returned 59
22:49:40:WU01:FS01:0x15:GPU memtest failure
22:49:40:WU01:FS01:0x15:
22:49:40:WU01:FS01:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
22:49:40:WARNING:WU01:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:49:40:WU01:FS01:Starting
22:49:40:WU01:FS01:Removing old file './work/01/logfile_01-20121207-230436.txt'
22:49:40:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 2
22:49:40:WU01:FS01:Started FahCore on PID 5108
22:49:40:WU01:FS01:Core PID:6204
22:49:40:WU01:FS01:FahCore 0x15 started
22:49:41:WU01:FS01:0x15:
22:49:41:WU01:FS01:0x15:*------------------------------*
22:49:41:WU01:FS01:0x15:Folding@Home GPU Core
22:49:41:WU01:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:49:41:WU01:FS01:0x15:Build host             AmoebaRemote
22:49:41:WU01:FS01:0x15:Board Type             NVIDIA/CUDA
22:49:41:WU01:FS01:0x15:Core                   15
22:49:41:WU01:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=2
22:49:41:WU01:FS01:0x15:
22:49:41:WU01:FS01:0x15:Window's signal control handler registered.
22:49:41:WU01:FS01:0x15:Preparing to commence simulation
22:49:41:WU01:FS01:0x15:- Looking at optimizations...
22:49:41:WU01:FS01:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
22:49:41:WU01:FS01:0x15:- Created dyn
22:49:41:WU01:FS01:0x15:- Files status OK
22:49:41:WU01:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:49:41:WU01:FS01:0x15:- Expanded 124634 -> 501826 (decompressed 402.6 percent)
22:49:41:WU01:FS01:0x15:Called DecompressByteArray: compressed_data_size=124634 data_size=501826, decompressed_data_size=501826 diff=0
22:49:41:WU01:FS01:0x15:- Digital signature verified
22:49:41:WU01:FS01:0x15:
22:49:41:WU01:FS01:0x15:Project: 7623 (Run 502, Clone 2, Gen 1)
22:49:41:WU01:FS01:0x15:
22:49:41:WU01:FS01:0x15:Assembly optimizations on if available.
22:49:41:WU01:FS01:0x15:Entering M.D.
22:49:43:WU01:FS01:0x15:Tpr hash 01/wudata_01.tpr:  65967345 409364089 2322475341 2362913948 2715724567
22:49:43:WU01:FS01:0x15:GPU device id=2
22:49:43:WU01:FS01:0x15:Working on Protein
22:49:43:WU01:FS01:0x15:Client config unavailable.
22:49:43:WU01:FS01:0x15:Starting GUI Server
22:50:47:WU01:FS01:0x15:Finished fah_main status=59
22:50:47:WU01:FS01:0x15:mdrun_gpu returned 59
22:50:47:WU01:FS01:0x15:GPU memtest failure
22:50:47:WU01:FS01:0x15:
22:50:47:WU01:FS01:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
22:50:47:WARNING:WU01:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:51:18:WU01:FS01:Starting
22:51:18:WU01:FS01:Removing old file './work/01/logfile_01-20121207-230850.txt'
22:51:18:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 2
22:51:18:WU01:FS01:Started FahCore on PID 2376
22:51:18:WU01:FS01:Core PID:6808
22:51:18:WU01:FS01:FahCore 0x15 started
22:51:18:WU01:FS01:0x15:
22:51:18:WU01:FS01:0x15:*------------------------------*
22:51:18:WU01:FS01:0x15:Folding@Home GPU Core
22:51:18:WU01:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:51:18:WU01:FS01:0x15:Build host             AmoebaRemote
22:51:18:WU01:FS01:0x15:Board Type             NVIDIA/CUDA
22:51:18:WU01:FS01:0x15:Core                   15
22:51:18:WU01:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=2
22:51:18:WU01:FS01:0x15:
22:51:18:WU01:FS01:0x15:Window's signal control handler registered.
22:51:18:WU01:FS01:0x15:Preparing to commence simulation
22:51:18:WU01:FS01:0x15:- Looking at optimizations...
22:51:18:WU01:FS01:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
22:51:18:WU01:FS01:0x15:- Created dyn
22:51:18:WU01:FS01:0x15:- Files status OK
22:51:18:WU01:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:51:18:WU01:FS01:0x15:- Expanded 124634 -> 501826 (decompressed 402.6 percent)
22:51:18:WU01:FS01:0x15:Called DecompressByteArray: compressed_data_size=124634 data_size=501826, decompressed_data_size=501826 diff=0
22:51:18:WU01:FS01:0x15:- Digital signature verified
22:51:18:WU01:FS01:0x15:
22:51:18:WU01:FS01:0x15:Project: 7623 (Run 502, Clone 2, Gen 1)
22:51:18:WU01:FS01:0x15:
22:51:18:WU01:FS01:0x15:Assembly optimizations on if available.
22:51:18:WU01:FS01:0x15:Entering M.D.
22:51:20:WU01:FS01:0x15:Tpr hash 01/wudata_01.tpr:  65967345 409364089 2322475341 2362913948 2715724567
22:51:20:WU01:FS01:0x15:GPU device id=2
22:51:20:WU01:FS01:0x15:Working on Protein
22:51:20:WU01:FS01:0x15:Client config unavailable.
22:51:20:WU01:FS01:0x15:Starting GUI Server
22:52:22:WARNING:WU01:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:53:55:WU01:FS01:Starting
22:53:55:WU01:FS01:Removing old file './work/01/logfile_01-20121207-231541.txt'
22:53:55:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 2
22:53:55:WU01:FS01:Started FahCore on PID 2772
22:53:55:WU01:FS01:Core PID:940
22:53:55:WU01:FS01:FahCore 0x15 started
22:53:55:WU01:FS01:0x15:
22:53:55:WU01:FS01:0x15:*------------------------------*
22:53:55:WU01:FS01:0x15:Folding@Home GPU Core
22:53:55:WU01:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:53:55:WU01:FS01:0x15:Build host             AmoebaRemote
22:53:55:WU01:FS01:0x15:Board Type             NVIDIA/CUDA
22:53:55:WU01:FS01:0x15:Core                   15
22:53:55:WU01:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=2
22:53:55:WU01:FS01:0x15:
22:53:55:WU01:FS01:0x15:Window's signal control handler registered.
22:53:55:WU01:FS01:0x15:Preparing to commence simulation
22:53:55:WU01:FS01:0x15:- Looking at optimizations...
22:53:55:WU01:FS01:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
22:53:55:WU01:FS01:0x15:- Created dyn
22:53:55:WU01:FS01:0x15:- Files status OK
22:53:55:WU01:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:53:55:WU01:FS01:0x15:- Expanded 124634 -> 501826 (decompressed 402.6 percent)
22:53:55:WU01:FS01:0x15:Called DecompressByteArray: compressed_data_size=124634 data_size=501826, decompressed_data_size=501826 diff=0
22:53:55:WU01:FS01:0x15:- Digital signature verified
22:53:55:WU01:FS01:0x15:
22:53:55:WU01:FS01:0x15:Project: 7623 (Run 502, Clone 2, Gen 1)
22:53:55:WU01:FS01:0x15:
22:53:55:WU01:FS01:0x15:Assembly optimizations on if available.
22:53:55:WU01:FS01:0x15:Entering M.D.
22:53:57:WU01:FS01:0x15:Tpr hash 01/wudata_01.tpr:  65967345 409364089 2322475341 2362913948 2715724567
22:53:57:WU01:FS01:0x15:GPU device id=2
22:53:57:WU01:FS01:0x15:Working on Protein
22:53:57:WU01:FS01:0x15:Client config unavailable.
22:53:57:WU01:FS01:0x15:Starting GUI Server
22:55:02:WU01:FS01:0x15:Finished fah_main status=59
22:55:02:WU01:FS01:0x15:mdrun_gpu returned 59
22:55:02:WU01:FS01:0x15:GPU memtest failure
22:55:02:WU01:FS01:0x15:
22:55:02:WU01:FS01:0x15:Folding@home Core Shutdown: GPU_MEMTEST_ERROR
22:55:02:WARNING:WU01:FS01:FahCore returned: GPU_MEMTEST_ERROR (124 = 0x7c)
22:55:02:WARNING:WU01:FS01:Too many errors, failing
22:55:02:WU01:FS01:Sending unit results: id:01 state:SEND error:FAILED project:7623 run:502 clone:2 gen:1 core:0x15 unit:0x00000002664f2dd14fe4fb448fd0a6ad
22:55:03:WU01:FS01:Connecting to 171.64.65.105:8080
22:55:03:WU02:FS01:Connecting to assign-GPU.stanford.edu:80
22:55:03:WU02:FS01:News: Welcome to Folding@Home
22:55:03:WU02:FS01:Assigned to work server 171.64.65.105
22:55:03:WU02:FS01:Requesting new work unit for slot 01: READY gpu:2:"G92 [GeForce GTS 240]" from 171.64.65.105
22:55:03:WU02:FS01:Connecting to 171.64.65.105:8080
22:55:06:WU01:FS01:Server responded WORK_QUIT (404)
22:55:06:WARNING:WU01:FS01:Server did not like results, dumping
22:55:06:WU01:FS01:Cleaning up
22:55:06:WU02:FS01:Downloading 124.18KiB
22:55:07:WU02:FS01:Download complete
22:55:07:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:7626 run:306 clone:2 gen:1 core:0x15 unit:0x00000001664f2dd14fe61b826a50828f
22:55:07:WU02:FS01:Starting
22:55:07:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/G80/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 702 -lifeline 5052 -checkpoint 15 -gpu 2
22:55:07:WU02:FS01:Started FahCore on PID 2412
22:55:07:WU02:FS01:Core PID:6220
22:55:07:WU02:FS01:FahCore 0x15 started
22:55:07:WU02:FS01:Downloading project 7626 description
22:55:07:WU02:FS01:Connecting to fah-web.stanford.edu:80
22:55:08:WU02:FS01:0x15:
22:55:08:WU02:FS01:0x15:*------------------------------*
22:55:08:WU02:FS01:0x15:Folding@Home GPU Core
22:55:08:WU02:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:55:08:WU02:FS01:0x15:Build host             AmoebaRemote
22:55:08:WU02:FS01:0x15:Board Type             NVIDIA/CUDA
22:55:08:WU02:FS01:0x15:Core                   15
22:55:08:WU02:FS01:0x15:GPU device info vendor=0 device=0 name=NA match=0 deviceId=2
22:55:08:WU02:FS01:0x15:
22:55:08:WU02:FS01:0x15:Window's signal control handler registered.
22:55:08:WU02:FS01:0x15:Preparing to commence simulation
22:55:08:WU02:FS01:0x15:- Looking at optimizations...
22:55:08:WU02:FS01:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:55:08:WU02:FS01:0x15:- Created dyn
22:55:08:WU02:FS01:0x15:- Files status OK
22:55:08:WU02:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:55:08:WU02:FS01:0x15:- Expanded 126647 -> 507182 (decompressed 400.4 percent)
22:55:08:WU02:FS01:0x15:Called DecompressByteArray: compressed_data_size=126647 data_size=507182, decompressed_data_size=507182 diff=0
22:55:08:WU02:FS01:0x15:- Digital signature verified
22:55:08:WU02:FS01:0x15:
22:55:08:WU02:FS01:0x15:Project: 7626 (Run 306, Clone 2, Gen 1)
22:55:08:WU02:FS01:0x15:
22:55:08:WU02:FS01:0x15:Assembly optimizations on if available.
22:55:08:WU02:FS01:0x15:Entering M.D.
22:55:08:WU02:FS01:Project 7626 description downloaded successfully
22:55:09:WU02:FS01:0x15:Tpr hash 02/wudata_01.tpr:  2483958772 2627897025 2702481938 2514206710 1200227223
22:55:09:WU02:FS01:0x15:GPU device id=2
22:55:09:WU02:FS01:0x15:Working on Protein
22:55:09:WU02:FS01:0x15:Client config unavailable.
22:55:10:WU02:FS01:0x15:Starting GUI Server

Re: GPU_MEMTEST_ERROR

PostPosted: Sat Dec 08, 2012 11:13 pm
by smackiethefrog
171.67.108.141 -> GPU_MEMTEST_ERROR on 9800 gtx+
171.67.108.36 -> stalls at "WU00:FS00:0x15:Starting GUI Server" with no further logging and no GPU load on 8800 gts

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 12:12 am
by farmpuma
171.64.65.105 for my 9600GSO

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 1:13 am
by VijayPande
Here's another update. Joe has been working on this since around noon and sees the issues. We now think the memtest issue is due to an AS issue sending G80 clients to get work from non-G80 compatible servers. Your posts here were helpful to confirm this.

He's still working on the issue, hopefully with some fix in a day or so. This should also fix the ATI assignment issue.
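
For anyone unsure which work server their failing units came from, here is a minimal sketch that pairs each "Assigned to work server" line with a later "FahCore returned" line for the same WU slot. The two patterns are modelled only on the log lines quoted in this thread, and the function name servers_by_error is made up for illustration; other client versions may log differently.

```python
import re
from collections import Counter

# Both patterns are assumptions based on the log excerpts posted in this thread.
ASSIGNED = re.compile(r":(WU\d+):FS\d+:Assigned to work server (\S+)")
RETURNED = re.compile(r":(WU\d+):FS\d+:FahCore returned: (\w+)")


def servers_by_error(log_lines):
    """Count (work server, core result) pairs found in FAHClient log lines."""
    server_for_wu = {}  # WU slot id -> most recently assigned work server
    counts = Counter()
    for line in log_lines:
        match = ASSIGNED.search(line)
        if match:
            server_for_wu[match.group(1)] = match.group(2)
            continue
        match = RETURNED.search(line)
        if match:
            # "unknown" covers results whose assignment line is not in the input.
            server = server_for_wu.get(match.group(1), "unknown")
            counts[(server, match.group(2))] += 1
    return counts
```

Passing the lines of your own client log to servers_by_error gives a tally such as how many GPU_MEMTEST_ERROR results came from each work server address, which is the information asked for at the top of this thread.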

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 1:18 am
by mmonnin
VijayPande wrote: Here's another update. Joe has been working on this since around noon and sees the issues. We now think the memtest issue is due to an AS issue sending G80 clients to get work from non-G80 compatible servers. Your posts here were helpful to confirm this.

He's still working on the issue, hopefully with some fix in a day or so. This should also fix the ATI assignment issue.

Good news!

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 4:00 am
by Mactin
Mr Pande,
Thank you.
Both my GTS250 and HD5870 are now folding normally (Core_11 and Core_16 WUs respectively).
I look forward to the resolution of the GPU_MEMTEST_ERROR issue.

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 5:26 am
by Sahkolihaa
171.67.108.36 for my 9800GT.

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 6:07 am
by VijayPande
We've put in a new patch on the AS. We think this should fix the problem, but time will tell.

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 6:21 am
by JacobKlein
Forgive me for sounding ignorant, but what is an "AS"?
Also, after restarting my client, the GPU_MEMTEST_ERROR message still persists; the issue is not yet resolved.

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 7:00 am
by codysluder
AS = Assignment Server.

When your client needs new work, it contacts the Assignment Server, which starts a multi-step process that should give you work that your computer can process. Since you're getting the wrong kind of assignment, there must be something wrong with the AS.
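
As a toy illustration of that idea (not the real Assignment Server logic, which is not described in this thread), the sketch below models an assignment step that checks GPU compatibility and a faulty one that skips the check. The WorkServer class, the server labels and the GPU class names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WorkServer:
    name: str               # hypothetical label, not a real server address
    gpu_classes: frozenset  # GPU classes this server has work for


def assign(client_gpu_class, servers):
    """Return the first server that has work for the client's GPU class."""
    for server in servers:
        if client_gpu_class in server.gpu_classes:
            return server
    return None  # no compatible work available


def assign_without_check(client_gpu_class, servers):
    """Faulty variant: hands out the first server and ignores compatibility."""
    return servers[0] if servers else None


servers = [
    WorkServer("ws-a", frozenset({"other"})),
    WorkServer("ws-b", frozenset({"G80"})),
]
print(assign("G80", servers).name)                # compatible server
print(assign_without_check("G80", servers).name)  # wrong kind of assignment
```

In this model the faulty variant sends a G80 client to a server with no G80 work, which is the kind of mismatch suspected above.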

Re: GPU_MEMTEST_ERROR

PostPosted: Sun Dec 09, 2012 8:35 am
by farmpuma
Sifting through the posts in this thread and the CORE_OUTDATED thread, it seems apparent to me that the problem lies in FahCore_15. It was the last and only thing that changed. If it were an issue with the WUs, then all the G80+ video cards would probably be failing. However, even some 8800GT cards are happily crunching WUs which are failing on my G92 9600GSO.

I suspect the GPU_MEMTEST is an arbitrary check and subsequent rejection of any card with less than one GB of memory. Were it an actual check of the ratio of memory to GPU cores, the one GB 9800GTX would fail worse than my 9600GSO with 384MB for 96 cores. And it is my understanding (I could be wrong) that the video card memory is only used as a buffer to shuffle data to and from the processing cores rather than to hold the entire work unit data set. Again, I could be wrong, but it would seem to be the logical way to handle the data.