A few problems (faulty WUs/crash/hard-reset)

Moderators: Site Moderators, PandeGroup

A few problems (faulty WUs/crash/hard-reset)

Postby IsraeliRD » Sat Feb 01, 2014 1:10 pm

Heya,

Earlier today I decided to upgrade to F@H V7.4.2 beta just to try it out while keeping an eye on it. During the day everything seemed fine, so during the evening I went out. When I came back, I couldn't even get to the log in screen, and the computer just froze. The restart button didn't work, so I did a hard-reset. Ends up that couple of hours before I came back, the computer locked itself up due to F@H.
Checking the logs, I found that I had two faulty WUs back to back, followed by FahCore_a3 crashing.

After the hard-reset I went ahead and downgraded to V7.3.6. The logs showed that FahCore_a3 was on 40560 out of 500000 steps. As of right now it's happily folding. FahCore_17 continued from its last checkpoint (84%).
For your convenience I removed 7 hours from the log because everything was fine during that time.

Thanks!

Code: Select all
*********************** Log Started 2014-02-01T02:34:32Z ***********************
02:34:32:************************* Folding@home Client *************************
02:34:32:      Website: http://folding.stanford.edu/
02:34:32:    Copyright: (c) 2009-2014 Stanford University
02:34:32:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
02:34:32:         Args: --open-web-control
02:34:32:       Config: C:/Users/Matan/AppData/Roaming/FAHClient/config.xml
02:34:32:******************************** Build ********************************
02:34:32:      Version: 7.4.2
02:34:32:         Date: Jan 24 2014
02:34:32:         Time: 13:51:17
02:34:32:      SVN Rev: 4112
02:34:32:       Branch: fah/trunk/client
02:34:32:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
02:34:32:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
02:34:32:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
02:34:32:     Platform: win32 XP
02:34:32:         Bits: 32
02:34:32:         Mode: Release
02:34:32:******************************* System ********************************
02:34:32:          CPU: Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz
02:34:32:       CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
02:34:32:         CPUs: 8
02:34:32:       Memory: 15.94GiB
02:34:32:  Free Memory: 12.22GiB
02:34:32:      Threads: WINDOWS_THREADS
02:34:32:   OS Version: 6.1
02:34:32:  Has Battery: false
02:34:32:   On Battery: false
02:34:32:   UTC Offset: 11
02:34:32:          PID: 8444
02:34:32:          CWD: C:/Users/Matan/AppData/Roaming/FAHClient
02:34:32:           OS: Windows 7 Professional
02:34:32:      OS Arch: AMD64
02:34:32:         GPUs: 1
02:34:32:        GPU 0: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
02:34:32:         CUDA: 3.0
02:34:32:  CUDA Driver: 5050
02:34:32:Win32 Service: false
02:34:32:***********************************************************************
02:34:32:<config>
02:34:32:  <!-- Folding Core -->
02:34:32:  <checkpoint v='3'/>
02:34:32:
02:34:32:  <!-- Network -->
02:34:32:  <proxy v=':8080'/>
02:34:32:
02:34:32:  <!-- Slot Control -->
02:34:32:  <power v='full'/>
02:34:32:
02:34:32:  <!-- User Information -->
02:34:32:  <passkey v='********************************'/>
02:34:32:  <team v='110285'/>
02:34:32:  <user v='IsraeliRD'/>
02:34:32:
02:34:32:  <!-- Folding Slots -->
02:34:32:  <slot id='0' type='GPU'/>
02:34:32:  <slot id='1' type='CPU'/>
02:34:32:</config>
02:34:32:Trying to access database...
02:34:32:Successfully acquired database lock
02:34:32:Enabled folding slot 00: READY gpu:0:GK104 [GeForce GTX 660 Ti]
02:34:32:Enabled folding slot 01: READY cpu:7
02:34:32:WU00:FS00:Connecting to assign-GPU.stanford.edu:80
02:34:32:WU01:FS01:Connecting to assign3.stanford.edu:8080
02:34:33:WU01:FS01:Assigned to work server 171.64.65.124
02:34:33:WU01:FS01:Requesting new work unit for slot 01: READY cpu:7 from 171.64.65.124
02:34:33:WU00:FS00:Assigned to work server 171.64.65.69
02:34:33:WU01:FS01:Connecting to 171.64.65.124:8080
02:34:33:WU00:FS00:Requesting new work unit for slot 00: READY gpu:0:GK104 [GeForce GTX 660 Ti] from 171.64.65.69
02:34:33:WU00:FS00:Connecting to 171.64.65.69:8080
02:34:41:WU01:FS01:Downloading 862.20KiB
02:34:41:WU00:FS00:Downloading 4.17MiB
02:34:43:WU01:FS01:Download complete
02:34:43:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9005 run:2 clone:29 gen:81 core:0xa4 unit:0x00000056664f2de452b80040bc3b8d33
02:34:43:WU01:FS01:Starting
02:34:43:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 8444 -checkpoint 3 -np 7
02:34:43:WU01:FS01:Started FahCore on PID 7392
02:34:43:WU01:FS01:Core PID:6576
02:34:43:WU01:FS01:FahCore 0xa4 started
02:34:44:WU01:FS01:0xa4:
02:34:44:WU01:FS01:0xa4:*------------------------------*
02:34:44:WU01:FS01:0xa4:Folding@Home Gromacs GB Core
02:34:44:WU01:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
02:34:44:WU01:FS01:0xa4:
02:34:44:WU01:FS01:0xa4:Preparing to commence simulation
02:34:44:WU01:FS01:0xa4:- Looking at optimizations...
02:34:44:WU01:FS01:0xa4:- Created dyn
02:34:44:WU01:FS01:0xa4:- Files status OK
02:34:44:WU01:FS01:0xa4:- Expanded 882385 -> 1469104 (decompressed 166.4 percent)
02:34:44:WU01:FS01:0xa4:Called DecompressByteArray: compressed_data_size=882385 data_size=1469104, decompressed_data_size=1469104 diff=0
02:34:44:WU01:FS01:0xa4:- Digital signature verified
02:34:44:WU01:FS01:0xa4:
02:34:44:WU01:FS01:0xa4:Project: 9005 (Run 2, Clone 29, Gen 81)
02:34:44:WU01:FS01:0xa4:
02:34:44:WU01:FS01:0xa4:Assembly optimizations on if available.
02:34:44:WU01:FS01:0xa4:Entering M.D.
02:34:50:WU01:FS01:0xa4:Mapping NT from 7 to 7
02:34:50:WU01:FS01:0xa4:Completed 0 out of 250000 steps  (0%)
02:34:51:WU00:FS00:Download 47.91%
02:34:57:WU00:FS00:Download 64.37%
02:35:07:WU00:FS00:Download 86.83%
02:35:07:WU00:FS00:Download complete
02:35:07:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:8900 run:678 clone:2 gen:92 core:0x17 unit:0x00000083028c126651a6bd78010744de
02:35:07:WU00:FS00:Starting
02:35:07:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 8444 -checkpoint 3 -gpu 0 -gpu-vendor nvidia
02:35:07:WU00:FS00:Started FahCore on PID 1984
02:35:07:WU00:FS00:Core PID:7212
02:35:07:WU00:FS00:FahCore 0x17 started
02:35:07:WU00:FS00:0x17:*********************** Log Started 2014-02-01T02:35:07Z ***********************
02:35:07:WU00:FS00:0x17:Project: 8900 (Run 678, Clone 2, Gen 92)
02:35:07:WU00:FS00:0x17:Unit: 0x00000083028c126651a6bd78010744de
02:35:07:WU00:FS00:0x17:CPU: 0x00000000000000000000000000000000
02:35:07:WU00:FS00:0x17:Machine: 0
02:35:07:WU00:FS00:0x17:Reading tar file state.xml
02:35:08:WU00:FS00:0x17:Reading tar file system.xml
02:35:08:WU00:FS00:0x17:Reading tar file integrator.xml
02:35:08:WU00:FS00:0x17:Reading tar file core.xml
02:35:08:WU00:FS00:0x17:Digital signatures verified
02:35:08:WU00:FS00:0x17:Folding@home GPU core17
02:35:08:WU00:FS00:0x17:Version 0.0.52
02:36:16:WU01:FS01:0xa4:Completed 2500 out of 250000 steps  (1%)
02:37:35:WU00:FS00:0x17:Completed 0 out of 2500000 steps (0%)
02:37:35:WU00:FS00:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
02:37:42:WU01:FS01:0xa4:Completed 5000 out of 250000 steps  (2%)
02:39:07:WU01:FS01:0xa4:Completed 7500 out of 250000 steps  (3%)
02:40:31:WU01:FS01:0xa4:Completed 10000 out of 250000 steps  (4%)
02:41:55:WU01:FS01:0xa4:Completed 12500 out of 250000 steps  (5%)

...

09:34:04:WU01:FS01:0xa4:Completed 240000 out of 250000 steps  (96%)
09:35:28:WU01:FS01:0xa4:Completed 242500 out of 250000 steps  (97%)
09:36:52:WU01:FS01:0xa4:Completed 245000 out of 250000 steps  (98%)
09:37:45:WU00:FS00:0x17:Completed 1925000 out of 2500000 steps (77%)
09:38:16:WU01:FS01:0xa4:Completed 247500 out of 250000 steps  (99%)
09:38:17:WU02:FS01:Connecting to assign3.stanford.edu:8080
09:38:18:WU02:FS01:Assigned to work server 128.143.199.97
09:38:18:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:7 from 128.143.199.97
09:38:18:WU02:FS01:Connecting to 128.143.199.97:8080
09:38:25:WU02:FS01:Downloading 1.19MiB
09:38:26:WU02:FS01:Download complete
09:38:26:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:7501 run:0 clone:367 gen:476 core:0xa3 unit:0x00000273fbcb017d4de79ee2ecea8914
09:39:40:WU01:FS01:0xa4:Completed 250000 out of 250000 steps  (100%)
09:39:40:WU01:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
09:39:50:WU01:FS01:0xa4:
09:39:50:WU01:FS01:0xa4:Finished Work Unit:
09:39:50:WU01:FS01:0xa4:- Reading up to 811800 from "01/wudata_01.trr": Read 811800
09:39:50:WU01:FS01:0xa4:trr file hash check passed.
09:39:50:WU01:FS01:0xa4:- Reading up to 746392 from "01/wudata_01.xtc": Read 746392
09:39:50:WU01:FS01:0xa4:xtc file hash check passed.
09:39:50:WU01:FS01:0xa4:edr file hash check passed.
09:39:50:WU01:FS01:0xa4:logfile size: 26168
09:39:50:WU01:FS01:0xa4:Leaving Run
09:39:51:WU01:FS01:0xa4:- Writing 1586848 bytes of core data to disk...
09:39:52:WU01:FS01:0xa4:Done: 1586336 -> 1539030 (compressed to 97.0 percent)
09:39:52:WU01:FS01:0xa4:  ... Done.
09:39:52:WU01:FS01:0xa4:- Shutting down core
09:39:52:WU01:FS01:0xa4:
09:39:52:WU01:FS01:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
09:39:52:WU01:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
09:39:52:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:9007 run:51 clone:18 gen:29 core:0xa4 unit:0x00000023664f2de452ba2c06df12d4c4
09:39:52:WU01:FS01:Uploading 1.47MiB to 171.64.65.124
09:39:52:WU01:FS01:Connecting to 171.64.65.124:8080
09:39:52:WU02:FS01:Starting
09:39:52:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 02 -suffix 01 -version 704 -lifeline 8444 -checkpoint 3 -np 7
09:39:52:WU02:FS01:Started FahCore on PID 2120
09:39:52:WU02:FS01:Core PID:4648
09:39:52:WU02:FS01:FahCore 0xa3 started
09:39:53:WU02:FS01:0xa3:
09:39:53:WU02:FS01:0xa3:*------------------------------*
09:39:53:WU02:FS01:0xa3:Folding@Home Gromacs SMP Core
09:39:53:WU02:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
09:39:53:WU02:FS01:0xa3:
09:39:53:WU02:FS01:0xa3:Preparing to commence simulation
09:39:53:WU02:FS01:0xa3:- Looking at optimizations...
09:39:53:WU02:FS01:0xa3:- Created dyn
09:39:53:WU02:FS01:0xa3:- Files status OK
09:39:53:WU02:FS01:0xa3:- Expanded 1248198 -> 2077020 (decompressed 166.4 percent)
09:39:53:WU02:FS01:0xa3:Called DecompressByteArray: compressed_data_size=1248198 data_size=2077020, decompressed_data_size=2077020 diff=0
09:39:53:WU02:FS01:0xa3:- Digital signature verified
09:39:53:WU02:FS01:0xa3:
09:39:53:WU02:FS01:0xa3:Project: 7501 (Run 0, Clone 367, Gen 476)
09:39:53:WU02:FS01:0xa3:
09:39:53:WU02:FS01:0xa3:Assembly optimizations on if available.
09:39:53:WU02:FS01:0xa3:Entering M.D.
09:39:58:WU02:FS01:0xa3:Mapping NT from 7 to 7
09:39:58:WU02:FS01:0xa3:mdrun returned 255
09:39:58:WU02:FS01:0xa3:Going to send back what have done -- stepsTotalG=500000
09:39:58:WU02:FS01:0xa3:Work fraction=0.0000 steps=500000.
09:39:59:WU01:FS01:Upload 55.34%
09:40:02:WU02:FS01:0xa3:logfile size=0 infoLength=0 edr=0 trr=25
09:40:02:WU02:FS01:0xa3:logfile size: 0 info=0 bed=0 hdr=25
09:40:02:WU02:FS01:0xa3:- Writing 640 bytes of core data to disk...
09:40:02:WU02:FS01:0xa3:Done: 128 -> 144 (compressed to 112.5 percent)
09:40:02:WU02:FS01:0xa3:  ... Done.
09:40:02:WU02:FS01:0xa3:
09:40:02:WU02:FS01:0xa3:Folding@home Core Shutdown: EARLY_UNIT_END
09:40:03:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
09:40:03:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:7501 run:0 clone:367 gen:476 core:0xa3 unit:0x00000273fbcb017d4de79ee2ecea8914
09:40:03:WU02:FS01:Uploading 656B to 128.143.199.97
09:40:03:WU02:FS01:Connecting to 128.143.199.97:8080
09:40:03:WU03:FS01:Connecting to assign3.stanford.edu:8080
09:40:03:WU02:FS01:Upload complete
09:40:03:WU02:FS01:Server responded WORK_ACK (400)
09:40:03:WU02:FS01:Cleaning up
09:40:04:WU03:FS01:Assigned to work server 128.143.199.97
09:40:04:WU03:FS01:Requesting new work unit for slot 01: READY cpu:7 from 128.143.199.97
09:40:04:WU03:FS01:Connecting to 128.143.199.97:8080
09:40:05:WU01:FS01:Upload complete
09:40:05:WU01:FS01:Server responded WORK_ACK (400)
09:40:05:WU01:FS01:Final credit estimate, 1636.00 points
09:40:05:WU01:FS01:Cleaning up
09:40:14:WU03:FS01:Downloading 1.20MiB
09:40:14:WU03:FS01:Download complete
09:40:14:WU03:FS01:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:7501 run:0 clone:387 gen:405 core:0xa3 unit:0x00000206fbcb017d4de79ef35202c322
09:40:14:WU03:FS01:Starting
09:40:14:WU03:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 03 -suffix 01 -version 704 -lifeline 8444 -checkpoint 3 -np 7
09:40:14:WU03:FS01:Started FahCore on PID 8568
09:40:14:WU03:FS01:Core PID:1880
09:40:14:WU03:FS01:FahCore 0xa3 started
09:40:15:WU03:FS01:0xa3:
09:40:15:WU03:FS01:0xa3:*------------------------------*
09:40:15:WU03:FS01:0xa3:Folding@Home Gromacs SMP Core
09:40:15:WU03:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
09:40:15:WU03:FS01:0xa3:
09:40:15:WU03:FS01:0xa3:Preparing to commence simulation
09:40:15:WU03:FS01:0xa3:- Looking at optimizations...
09:40:15:WU03:FS01:0xa3:- Created dyn
09:40:15:WU03:FS01:0xa3:- Files status OK
09:40:15:WU03:FS01:0xa3:- Expanded 1254787 -> 2077020 (decompressed 165.5 percent)
09:40:15:WU03:FS01:0xa3:Called DecompressByteArray: compressed_data_size=1254787 data_size=2077020, decompressed_data_size=2077020 diff=0
09:40:15:WU03:FS01:0xa3:- Digital signature verified
09:40:15:WU03:FS01:0xa3:
09:40:15:WU03:FS01:0xa3:Project: 7501 (Run 0, Clone 387, Gen 405)
09:40:15:WU03:FS01:0xa3:
09:40:15:WU03:FS01:0xa3:Assembly optimizations on if available.
09:40:15:WU03:FS01:0xa3:Entering M.D.
09:40:21:WU03:FS01:0xa3:Mapping NT from 7 to 7
09:40:21:WU03:FS01:0xa3:mdrun returned 255
09:40:21:WU03:FS01:0xa3:Going to send back what have done -- stepsTotalG=500000
09:40:21:WU03:FS01:0xa3:Work fraction=0.0000 steps=500000.
09:40:25:WU03:FS01:0xa3:logfile size=0 infoLength=0 edr=0 trr=25
09:40:25:WU03:FS01:0xa3:logfile size: 0 info=0 bed=0 hdr=25
09:40:25:WU03:FS01:0xa3:- Writing 640 bytes of core data to disk...
09:40:25:WU03:FS01:0xa3:Done: 128 -> 144 (compressed to 112.5 percent)
09:40:25:WU03:FS01:0xa3:  ... Done.
09:40:25:WU03:FS01:0xa3:
09:40:25:WU03:FS01:0xa3:Folding@home Core Shutdown: EARLY_UNIT_END
09:40:25:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
09:40:25:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY project:7501 run:0 clone:387 gen:405 core:0xa3 unit:0x00000206fbcb017d4de79ef35202c322
09:40:25:WU03:FS01:Uploading 656B to 128.143.199.97
09:40:25:WU03:FS01:Connecting to 128.143.199.97:8080
09:40:25:WU01:FS01:Connecting to assign3.stanford.edu:8080
09:40:26:WU03:FS01:Upload complete
09:40:26:WU03:FS01:Server responded WORK_ACK (400)
09:40:26:WU03:FS01:Cleaning up
09:40:29:WU01:FS01:Assigned to work server 128.143.199.97
09:40:29:WU01:FS01:Requesting new work unit for slot 01: READY cpu:7 from 128.143.199.97
09:40:29:WU01:FS01:Connecting to 128.143.199.97:8080
09:40:39:WU01:FS01:Downloading 1.84MiB
09:40:48:WU01:FS01:Download 37.44%
09:40:49:WU01:FS01:Download complete
09:40:49:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7514 run:0 clone:100 gen:397 core:0xa3 unit:0x000001edfbcb017d4ff73d9e18140b52
09:40:49:WU01:FS01:Starting
09:40:49:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 01 -suffix 01 -version 704 -lifeline 8444 -checkpoint 3 -np 7
09:40:49:WU01:FS01:Started FahCore on PID 7024
09:40:49:WU01:FS01:Core PID:8376
09:40:49:WU01:FS01:FahCore 0xa3 started
09:40:50:WU01:FS01:0xa3:
09:40:50:WU01:FS01:0xa3:*------------------------------*
09:40:50:WU01:FS01:0xa3:Folding@Home Gromacs SMP Core
09:40:50:WU01:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
09:40:50:WU01:FS01:0xa3:
09:40:50:WU01:FS01:0xa3:Preparing to commence simulation
09:40:50:WU01:FS01:0xa3:- Looking at optimizations...
09:40:50:WU01:FS01:0xa3:- Created dyn
09:40:50:WU01:FS01:0xa3:- Files status OK
09:40:50:WU01:FS01:0xa3:- Expanded 1924719 -> 2865272 (decompressed 148.8 percent)
09:40:50:WU01:FS01:0xa3:Called DecompressByteArray: compressed_data_size=1924719 data_size=2865272, decompressed_data_size=2865272 diff=0
09:40:50:WU01:FS01:0xa3:- Digital signature verified
09:40:50:WU01:FS01:0xa3:
09:40:50:WU01:FS01:0xa3:Project: 7514 (Run 0, Clone 100, Gen 397)
09:40:50:WU01:FS01:0xa3:
09:40:50:WU01:FS01:0xa3:Assembly optimizations on if available.
09:40:50:WU01:FS01:0xa3:Entering M.D.
09:40:55:WU01:FS01:0xa3:Mapping NT from 7 to 7
09:40:56:WU01:FS01:0xa3:Completed 0 out of 500000 steps  (0%)
09:43:04:WU00:FS00:0x17:Completed 1950000 out of 2500000 steps (78%)
09:44:58:WU01:FS01:0xa3:Completed 5000 out of 500000 steps  (1%)
09:48:39:WU00:FS00:0x17:Completed 1975000 out of 2500000 steps (79%)
09:49:00:WU01:FS01:0xa3:Completed 10000 out of 500000 steps  (2%)
09:53:02:WU01:FS01:0xa3:Completed 15000 out of 500000 steps  (3%)
09:53:58:WU00:FS00:0x17:Completed 2000000 out of 2500000 steps (80%)
09:57:04:WU01:FS01:0xa3:Completed 20000 out of 500000 steps  (4%)
09:59:33:WU00:FS00:0x17:Completed 2025000 out of 2500000 steps (81%)
10:01:20:WU01:FS01:0xa3:Completed 25000 out of 500000 steps  (5%)
10:04:53:WU00:FS00:0x17:Completed 2050000 out of 2500000 steps (82%)
10:05:22:WU01:FS01:0xa3:Completed 30000 out of 500000 steps  (6%)
10:09:24:WU01:FS01:0xa3:Completed 35000 out of 500000 steps  (7%)
10:10:29:WU00:FS00:0x17:Completed 2075000 out of 2500000 steps (83%)
10:12:55:WARNING:WU01:FS01:FahCore returned an unknown error code which probably indicates that it crashed
10:12:55:WARNING:WU01:FS01:FahCore returned: UNKNOWN_ENUM (-1073741783 = 0xc0000029)
10:12:56:WU01:FS01:Starting
10:12:56:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 01 -suffix 01 -version 704 -lifeline 8444 -checkpoint 3 -np 7
10:12:56:WU01:FS01:Started FahCore on PID 1208
10:12:56:WU01:FS01:Core PID:8860
10:12:56:WU01:FS01:FahCore 0xa3 started
10:12:56:WU01:FS01:0xa3:
10:12:56:WU01:FS01:0xa3:*------------------------------*
10:12:56:WU01:FS01:0xa3:Folding@Home Gromacs SMP Core
10:12:56:WU01:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
10:12:56:WU01:FS01:0xa3:
10:12:56:WU01:FS01:0xa3:Preparing to commence simulation
10:12:56:WU01:FS01:0xa3:- Ensuring status. Please wait.
10:13:05:WU01:FS01:0xa3:- Looking at optimizations...
10:13:05:WU01:FS01:0xa3:- Working with standard loops on this execution.
10:13:05:WU01:FS01:0xa3:- Previous termination of core was improper.
10:13:05:WU01:FS01:0xa3:- Files status OK
10:13:05:WU01:FS01:0xa3:- Expanded 1924719 -> 2865272 (decompressed 148.8 percent)
10:13:05:WU01:FS01:0xa3:Called DecompressByteArray: compressed_data_size=1924719 data_size=2865272, decompressed_data_size=2865272 diff=0
10:13:05:WU01:FS01:0xa3:- Digital signature verified
10:13:05:WU01:FS01:0xa3:
10:13:05:WU01:FS01:0xa3:Project: 7514 (Run 0, Clone 100, Gen 397)
10:13:05:WU01:FS01:0xa3:
10:13:05:WU01:FS01:0xa3:Entering M.D.
10:13:11:WU01:FS01:0xa3:Using Gromacs checkpoints
10:13:11:WU01:FS01:0xa3:Mapping NT from 7 to 7
10:13:12:WU01:FS01:0xa3:Resuming from checkpoint
10:13:12:WU01:FS01:0xa3:Verified 01/wudata_01.log
10:13:12:WU01:FS01:0xa3:Verified 01/wudata_01.trr
10:13:12:WU01:FS01:0xa3:Verified 01/wudata_01.xtc
10:13:12:WU01:FS01:0xa3:Verified 01/wudata_01.edr
10:13:12:WU01:FS01:0xa3:Completed 36910 out of 500000 steps  (7%)
10:15:42:WU01:FS01:0xa3:Completed 40000 out of 500000 steps  (8%)
10:15:49:WU00:FS00:0x17:Completed 2100000 out of 2500000 steps (84%)
IsraeliRD
 
Posts: 13
Joined: Wed Nov 13, 2013 7:38 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby bollix47 » Sat Feb 01, 2014 3:47 pm

AFAIK some a3 projects do have a problem with cpu:7 and higher prime numbers. Since your gpu requires a core for itself I would change the cpu slot to use 6 cores instead of it's current setting which is probably -1.
Image
bollix47
 
Posts: 3398
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: A few problems (faulty WUs/crash/hard-reset)

Postby PantherX » Sun Feb 02, 2014 5:01 am

IsraeliRD wrote:...When I came back, I couldn't even get to the log in screen, and the computer just froze. The restart button didn't work, so I did a hard-reset. Ends up that couple of hours before I came back, the computer locked itself up due to F@H...

Please note that is could be possible that your system overheated or that you have an unstable overclock on your CPU or GPU. F@H is known to stress the system more than an average application and the system should be able to handle it if it is properly maintained. Can you verify if your overclocks are stable and that the system didn't overheat?
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Chrome Folding App (Beta) Ӂ Troubleshooting "Bad WUs" Ӂ Troubleshooting Server Connectivity Issues
User avatar
PantherX
Site Moderator
 
Posts: 6614
Joined: Wed Dec 23, 2009 9:33 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby IsraeliRD » Tue Feb 04, 2014 5:36 am

I set the CPU to 6, although with 7.3.6 it was on 8 cores and the GPU never complained.
Regarding the CPU/GPU: neither of them is overclocked and both are running in normal temperatures.

However a few hours ago while I was cleaning my room I got a 0x101 BSOD error which is unfortunate.
Hours before the BSOD I decided to see whether 7.4.2. would be happy with the CPU set from 6->8. The GPU seemed to have suffered by 2 mins for TPF, so I lowered it back to 6. F@H was happy and completed the WU successfully. The next one, on the other hand...

Code: Select all
22:28:23:WU00:FS01:Connecting to assign3.stanford.edu:8080
22:28:24:WU00:FS01:Assigned to work server 171.64.65.124
22:28:24:WU00:FS01:Requesting new work unit for slot 01: RUNNING cpu:6 from 171.64.65.124
22:28:24:WU00:FS01:Connecting to 171.64.65.124:8080
22:28:32:WU00:FS01:Downloading 862.14KiB
22:28:33:WU00:FS01:Download complete
22:28:33:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9005 run:91 clone:28 gen:49 core:0xa4 unit:0x00000037664f2de452b80510765c2d88
22:32:16:WU01:FS00:0x17:Completed 2375000 out of 2500000 steps (95%)
22:37:36:WU01:FS00:0x17:Completed 2400000 out of 2500000 steps (96%)
22:38:41:WU02:FS01:0xa3:Completed 500000 out of 500000 steps  (100%)
22:38:42:WU02:FS01:0xa3:DynamicWrapper: Finished Work Unit: sleep=10000
22:38:52:WU02:FS01:0xa3:
22:38:52:WU02:FS01:0xa3:Finished Work Unit:
22:38:52:WU02:FS01:0xa3:- Reading up to 8057232 from "02/wudata_01.trr": Read 8057232
22:38:52:WU02:FS01:0xa3:trr file hash check passed.
22:38:52:WU02:FS01:0xa3:edr file hash check passed.
22:38:52:WU02:FS01:0xa3:logfile size: 59956
22:38:52:WU02:FS01:0xa3:Leaving Run
22:38:55:WU02:FS01:0xa3:- Writing 8154020 bytes of core data to disk...
22:38:56:WU02:FS01:0xa3:Done: 8153508 -> 7529799 (compressed to 92.3 percent)
22:38:56:WU02:FS01:0xa3:  ... Done.
22:38:57:WU02:FS01:0xa3:- Shutting down core
22:38:57:WU02:FS01:0xa3:
22:38:57:WU02:FS01:0xa3:Folding@home Core Shutdown: FINISHED_UNIT
22:38:57:WU02:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
22:38:57:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:8573 run:1 clone:5 gen:219 core:0xa3 unit:0x000008620a3b1e5952288595c5db88be
22:38:57:WU02:FS01:Uploading 7.18MiB to 128.143.231.202
22:38:57:WU02:FS01:Connecting to 128.143.231.202:8080
22:38:57:WU00:FS01:Starting
22:38:57:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 5016 -checkpoint 15 -np 6
22:38:57:WU00:FS01:Started FahCore on PID 8024
22:38:57:WU00:FS01:Core PID:5124
22:38:57:WU00:FS01:FahCore 0xa4 started
22:38:57:WU00:FS01:0xa4:
22:38:57:WU00:FS01:0xa4:*------------------------------*
22:38:57:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
22:38:57:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
22:38:57:WU00:FS01:0xa4:
22:38:57:WU00:FS01:0xa4:Preparing to commence simulation
22:38:57:WU00:FS01:0xa4:- Looking at optimizations...
22:38:57:WU00:FS01:0xa4:- Created dyn
22:38:57:WU00:FS01:0xa4:- Files status OK
22:38:57:WU00:FS01:0xa4:- Expanded 882324 -> 1469104 (decompressed 166.5 percent)
22:38:57:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=882324 data_size=1469104, decompressed_data_size=1469104 diff=0
22:38:57:WU00:FS01:0xa4:- Digital signature verified
22:38:57:WU00:FS01:0xa4:
22:38:57:WU00:FS01:0xa4:Project: 9005 (Run 91, Clone 28, Gen 49)
22:38:57:WU00:FS01:0xa4:
22:38:57:WU00:FS01:0xa4:Assembly optimizations on if available.
22:38:57:WU00:FS01:0xa4:Entering M.D.
22:39:03:WU00:FS01:0xa4:Mapping NT from 6 to 6
22:39:03:WU00:FS01:0xa4:Completed 0 out of 250000 steps  (0%)
22:39:05:WU02:FS01:Upload 10.44%
22:39:12:WU02:FS01:Upload 21.76%
22:39:20:WU02:FS01:Upload 34.81%
22:39:27:WU02:FS01:Upload 47.87%
22:39:35:WU02:FS01:Upload 60.05%
22:39:43:WU02:FS01:Upload 73.98%
22:39:50:WU02:FS01:Upload 86.16%
22:39:57:WU02:FS01:Upload 97.47%
22:40:00:WU02:FS01:Upload complete
22:40:00:WU02:FS01:Server responded WORK_ACK (400)
22:40:00:WU02:FS01:Final credit estimate, 12419.00 points
22:40:00:WU02:FS01:Cleaning up
22:40:28:WU00:FS01:0xa4:Completed 2500 out of 250000 steps  (1%)
22:41:53:WU00:FS01:0xa4:Completed 5000 out of 250000 steps  (2%)
22:43:11:WU01:FS00:0x17:Completed 2425000 out of 2500000 steps (97%)
22:43:18:WU00:FS01:0xa4:Completed 7500 out of 250000 steps  (3%)
22:44:42:WU00:FS01:0xa4:Completed 10000 out of 250000 steps  (4%)
22:46:07:WU00:FS01:0xa4:Completed 12500 out of 250000 steps  (5%)
22:47:32:WU00:FS01:0xa4:Completed 15000 out of 250000 steps  (6%)
22:48:30:WU01:FS00:0x17:Completed 2450000 out of 2500000 steps (98%)
22:48:57:WU00:FS01:0xa4:Completed 17500 out of 250000 steps  (7%)
22:50:21:WU00:FS01:0xa4:Completed 20000 out of 250000 steps  (8%)
22:51:46:WU00:FS01:0xa4:Completed 22500 out of 250000 steps  (9%)
22:53:11:WU00:FS01:0xa4:Completed 25000 out of 250000 steps  (10%)
22:54:07:WU01:FS00:0x17:Completed 2475000 out of 2500000 steps (99%)
22:54:36:WU00:FS01:0xa4:Completed 27500 out of 250000 steps  (11%)
22:56:01:WU00:FS01:0xa4:Completed 30000 out of 250000 steps  (12%)
22:57:26:WU00:FS01:0xa4:Completed 32500 out of 250000 steps  (13%)
22:58:50:WU00:FS01:0xa4:Completed 35000 out of 250000 steps  (14%)
22:59:29:WU01:FS00:0x17:Completed 2500000 out of 2500000 steps (100%)
22:59:30:WU02:FS00:Connecting to assign-GPU.stanford.edu:80
22:59:31:WU02:FS00:Assigned to work server 171.64.65.105
22:59:31:WU02:FS00:Requesting new work unit for slot 00: RUNNING gpu:0:GK104 [GeForce GTX 660 Ti] from 171.64.65.105
22:59:31:WU02:FS00:Connecting to 171.64.65.105:8080
22:59:34:WU02:FS00:Downloading 78.68KiB
22:59:34:WU02:FS00:Download complete
22:59:34:WU02:FS00:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:7660 run:458 clone:0 gen:362 core:0x15 unit:0x00000215664f2dd150f83f66a3bb1c3c
22:59:44:WU01:FS00:0x17:Saving result file logfile_01.txt
22:59:44:WU01:FS00:0x17:Saving result file checkpointState.xml
22:59:46:WU01:FS00:0x17:Saving result file checkpt.crc
22:59:46:WU01:FS00:0x17:Saving result file log.txt
22:59:46:WU01:FS00:0x17:Saving result file positions.xtc
22:59:49:WU01:FS00:0x17:Folding@home Core Shutdown: FINISHED_UNIT
22:59:49:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
22:59:49:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:8900 run:139 clone:13 gen:23 core:0x17 unit:0x00000021028c126652b4c38fc8e33e13
22:59:49:WU01:FS00:Uploading 12.96MiB to 171.64.65.69
22:59:49:WU01:FS00:Connecting to 171.64.65.69:8080
22:59:49:WU02:FS00:Starting
22:59:49:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 704 -lifeline 5016 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
22:59:49:WU02:FS00:Started FahCore on PID 4224
22:59:50:WU02:FS00:Core PID:1432
22:59:50:WU02:FS00:FahCore 0x15 started
22:59:50:WU02:FS00:0x15:
22:59:50:WU02:FS00:0x15:*------------------------------*
22:59:50:WU02:FS00:0x15:Folding@Home GPU Core
22:59:50:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
22:59:50:WU02:FS00:0x15:Build host             AmoebaRemote
22:59:50:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
22:59:50:WU02:FS00:0x15:Core                   15
22:59:50:WU02:FS00:0x15:
22:59:50:WU02:FS00:0x15:Window's signal control handler registered.
22:59:50:WU02:FS00:0x15:Preparing to commence simulation
22:59:50:WU02:FS00:0x15:- Looking at optimizations...
22:59:50:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
22:59:50:WU02:FS00:0x15:- Created dyn
22:59:50:WU02:FS00:0x15:- Files status OK
22:59:50:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
22:59:50:WU02:FS00:0x15:- Expanded 80058 -> 307810 (decompressed 384.4 percent)
22:59:50:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=80058 data_size=307810, decompressed_data_size=307810 diff=0
22:59:50:WU02:FS00:0x15:- Digital signature verified
22:59:50:WU02:FS00:0x15:
22:59:50:WU02:FS00:0x15:Project: 7660 (Run 458, Clone 0, Gen 362)
22:59:50:WU02:FS00:0x15:
22:59:50:WU02:FS00:0x15:Assembly optimizations on if available.
22:59:50:WU02:FS00:0x15:Entering M.D.
22:59:52:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  1084040353 2098037389 688454802 3775256636 3968620388
22:59:52:WU02:FS00:0x15:GPU device id=0
22:59:52:WU02:FS00:0x15:Working on Protein
22:59:52:WU02:FS00:0x15:Client config unavailable.
22:59:52:WU02:FS00:0x15:Starting GUI Server
22:59:55:WU01:FS00:Upload 4.82%
23:00:02:WU01:FS00:Upload 11.57%
23:00:09:WU01:FS00:Upload 17.84%
23:00:15:WU00:FS01:0xa4:Completed 37500 out of 250000 steps  (15%)
23:00:16:WU01:FS00:Upload 23.63%
23:00:23:WU01:FS00:Upload 30.87%
23:00:30:WU01:FS00:Upload 37.62%
23:00:37:WU01:FS00:Upload 43.89%
23:00:44:WU01:FS00:Upload 50.64%
23:00:50:WU01:FS00:Upload 55.46%
23:00:57:WU01:FS00:Upload 61.73%
23:01:01:WU02:FS00:0x15:Setting checkpoint frequency: 400000
23:01:01:WU02:FS00:0x15:Completed         3 out of 40000000 steps (0%).
23:01:04:WU01:FS00:Upload 68.00%
23:01:11:WU01:FS00:Upload 74.75%
23:01:17:WU01:FS00:Upload 80.54%
23:01:24:WU01:FS00:Upload 87.78%
23:01:31:WU01:FS00:Upload 93.56%
23:01:39:WU00:FS01:0xa4:Completed 40000 out of 250000 steps  (16%)
23:01:46:WU01:FS00:Upload complete
23:01:46:WU01:FS00:Server responded WORK_ACK (400)
23:01:46:WU01:FS00:Final credit estimate, 24916.00 points
23:01:46:WU01:FS00:Cleaning up
23:03:03:WU00:FS01:0xa4:Completed 42500 out of 250000 steps  (17%)
23:03:40:WU02:FS00:0x15:Completed    400000 out of 40000000 steps (1%).
23:04:27:WU00:FS01:0xa4:Completed 45000 out of 250000 steps  (18%)
23:05:50:WU00:FS01:0xa4:Completed 47500 out of 250000 steps  (19%)
23:06:19:WU02:FS00:0x15:Completed    800000 out of 40000000 steps (2%).
23:07:14:WU00:FS01:0xa4:Completed 50000 out of 250000 steps  (20%)
23:08:38:WU00:FS01:0xa4:Completed 52500 out of 250000 steps  (21%)
23:08:58:WU02:FS00:0x15:Completed   1200000 out of 40000000 steps (3%).
23:10:02:WU00:FS01:0xa4:Completed 55000 out of 250000 steps  (22%)
23:11:26:WU00:FS01:0xa4:Completed 57500 out of 250000 steps  (23%)
23:11:37:WU02:FS00:0x15:Completed   1600000 out of 40000000 steps (4%).
23:12:50:WU00:FS01:0xa4:Completed 60000 out of 250000 steps  (24%)
23:14:14:WU00:FS01:0xa4:Completed 62500 out of 250000 steps  (25%)
23:14:16:WU02:FS00:0x15:Completed   2000000 out of 40000000 steps (5%).
23:15:38:WU00:FS01:0xa4:Completed 65000 out of 250000 steps  (26%)
23:16:55:WU02:FS00:0x15:Completed   2400000 out of 40000000 steps (6%).
23:17:02:WU00:FS01:0xa4:Completed 67500 out of 250000 steps  (27%)
23:18:26:WU00:FS01:0xa4:Completed 70000 out of 250000 steps  (28%)
23:19:34:WU02:FS00:0x15:Completed   2800000 out of 40000000 steps (7%).
23:19:50:WU00:FS01:0xa4:Completed 72500 out of 250000 steps  (29%)
23:20:59:Removing old file 'configs/config-20131130-101709.xml'
23:20:59:Saving configuration to config.xml
23:20:59:<config>
23:20:59:  <!-- Network -->
23:20:59:  <proxy v=':8080'/>
23:20:59:
23:20:59:  <!-- Slot Control -->
23:20:59:  <power v='full'/>
23:20:59:
23:20:59:  <!-- User Information -->
23:20:59:  <passkey v='********************************'/>
23:20:59:  <team v='110285'/>
23:20:59:  <user v='IsraeliRD'/>
23:20:59:
23:20:59:  <!-- Folding Slots -->
23:20:59:  <slot id='0' type='GPU'/>
23:20:59:  <slot id='1' type='CPU'>
23:20:59:    <cpus v='8'/>
23:20:59:  </slot>
23:20:59:</config>
23:20:59:FS01:Shutting core down
23:21:07:WU00:FS01:0xa4:Client no longer detected. Shutting down core
23:21:07:WU00:FS01:0xa4:
23:21:07:WU00:FS01:0xa4:Folding@home Core Shutdown: CLIENT_DIED
23:21:08:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
23:21:08:WU00:FS01:Starting
23:21:08:WARNING:WU00:FS01:Changed SMP threads from 6 to 8 this can cause some work units to fail
23:21:08:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 5016 -checkpoint 15 -np 8
23:21:08:WU00:FS01:Started FahCore on PID 6340
23:21:08:WU00:FS01:Core PID:6260
23:21:08:WU00:FS01:FahCore 0xa4 started
23:21:08:WU00:FS01:0xa4:
23:21:08:WU00:FS01:0xa4:*------------------------------*
23:21:08:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
23:21:08:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
23:21:08:WU00:FS01:0xa4:
23:21:08:WU00:FS01:0xa4:Preparing to commence simulation
23:21:08:WU00:FS01:0xa4:- Looking at optimizations...
23:21:08:WU00:FS01:0xa4:- Files status OK
23:21:08:WU00:FS01:0xa4:- Expanded 882324 -> 1469104 (decompressed 166.5 percent)
23:21:08:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=882324 data_size=1469104, decompressed_data_size=1469104 diff=0
23:21:08:WU00:FS01:0xa4:- Digital signature verified
23:21:08:WU00:FS01:0xa4:
23:21:08:WU00:FS01:0xa4:Project: 9005 (Run 91, Clone 28, Gen 49)
23:21:08:WU00:FS01:0xa4:
23:21:08:WU00:FS01:0xa4:Assembly optimizations on if available.
23:21:08:WU00:FS01:0xa4:Entering M.D.
23:21:12:Removing old file 'configs/config-20131203-135427.xml'
23:21:12:Saving configuration to config.xml
23:21:12:<config>
23:21:12:  <!-- Network -->
23:21:12:  <proxy v=':8080'/>
23:21:12:
23:21:12:  <!-- Slot Control -->
23:21:12:  <power v='full'/>
23:21:12:
23:21:12:  <!-- User Information -->
23:21:12:  <passkey v='********************************'/>
23:21:12:  <team v='110285'/>
23:21:12:  <user v='IsraeliRD'/>
23:21:12:
23:21:12:  <!-- Folding Slots -->
23:21:12:  <slot id='0' type='GPU'/>
23:21:12:  <slot id='1' type='CPU'>
23:21:12:    <cpus v='8'/>
23:21:12:  </slot>
23:21:12:</config>
23:21:14:WU00:FS01:0xa4:Using Gromacs checkpoints
23:21:14:WU00:FS01:0xa4:Mapping NT from 8 to 8
23:21:14:WU00:FS01:0xa4:Resuming from checkpoint
23:21:14:WU00:FS01:0xa4:Verified 00/wudata_01.log
23:21:14:WU00:FS01:0xa4:Verified 00/wudata_01.trr
23:21:14:WU00:FS01:0xa4:Verified 00/wudata_01.xtc
23:21:14:WU00:FS01:0xa4:Verified 00/wudata_01.edr
23:21:14:WU00:FS01:0xa4:Completed 53230 out of 250000 steps  (21%)
23:22:08:WU00:FS01:0xa4:Completed 55000 out of 250000 steps  (22%)
23:22:14:WU02:FS00:0x15:Completed   3200000 out of 40000000 steps (8%).
23:22:26:Removing old file 'configs/config-20131203-135437.xml'
23:22:26:Saving configuration to config.xml
23:22:26:<config>
23:22:26:  <!-- Network -->
23:22:26:  <proxy v=':8080'/>
23:22:26:
23:22:26:  <!-- Slot Control -->
23:22:26:  <power v='full'/>
23:22:26:
23:22:26:  <!-- User Information -->
23:22:26:  <passkey v='********************************'/>
23:22:26:  <team v='110285'/>
23:22:26:  <user v='IsraeliRD'/>
23:22:26:
23:22:26:  <!-- Folding Slots -->
23:22:26:  <slot id='0' type='GPU'/>
23:22:26:  <slot id='1' type='CPU'>
23:22:26:    <cpus v='6'/>
23:22:26:  </slot>
23:22:26:</config>
23:22:26:FS01:Shutting core down
23:22:28:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
23:22:28:WU00:FS01:Starting
23:22:28:WARNING:WU00:FS01:Changed SMP threads from 8 to 6 this can cause some work units to fail
23:22:28:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 5016 -checkpoint 15 -np 6
23:22:28:WU00:FS01:Started FahCore on PID 7848
23:22:28:WU00:FS01:Core PID:1640
23:22:28:WU00:FS01:FahCore 0xa4 started
23:22:29:WU00:FS01:0xa4:
23:22:29:WU00:FS01:0xa4:*------------------------------*
23:22:29:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
23:22:29:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
23:22:29:WU00:FS01:0xa4:
23:22:29:WU00:FS01:0xa4:Preparing to commence simulation
23:22:29:WU00:FS01:0xa4:- Looking at optimizations...
23:22:29:WU00:FS01:0xa4:- Files status OK
23:22:29:WU00:FS01:0xa4:- Expanded 882324 -> 1469104 (decompressed 166.5 percent)
23:22:29:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=882324 data_size=1469104, decompressed_data_size=1469104 diff=0
23:22:29:WU00:FS01:0xa4:- Digital signature verified
23:22:29:WU00:FS01:0xa4:
23:22:29:WU00:FS01:0xa4:Project: 9005 (Run 91, Clone 28, Gen 49)
23:22:29:WU00:FS01:0xa4:
23:22:29:WU00:FS01:0xa4:Assembly optimizations on if available.
23:22:29:WU00:FS01:0xa4:Entering M.D.
23:22:35:WU00:FS01:0xa4:Using Gromacs checkpoints
23:22:35:WU00:FS01:0xa4:Mapping NT from 6 to 6
23:22:35:WU00:FS01:0xa4:Resuming from checkpoint
23:22:35:WU00:FS01:0xa4:Verified 00/wudata_01.log
23:22:35:WU00:FS01:0xa4:Verified 00/wudata_01.trr
23:22:35:WU00:FS01:0xa4:Verified 00/wudata_01.xtc
23:22:35:WU00:FS01:0xa4:Verified 00/wudata_01.edr
23:22:35:WU00:FS01:0xa4:Completed 53230 out of 250000 steps  (21%)
23:23:14:Removing old file 'configs/config-20131203-135545.xml'
23:23:14:Saving configuration to config.xml
23:23:14:<config>
23:23:14:  <!-- Network -->
23:23:14:  <proxy v=':8080'/>
23:23:14:
23:23:14:  <!-- Slot Control -->
23:23:14:  <power v='full'/>
23:23:14:
23:23:14:  <!-- User Information -->
23:23:14:  <passkey v='********************************'/>
23:23:14:  <team v='110285'/>
23:23:14:  <user v='IsraeliRD'/>
23:23:14:
23:23:14:  <!-- Folding Slots -->
23:23:14:  <slot id='0' type='GPU'/>
23:23:14:  <slot id='1' type='CPU'>
23:23:14:    <cpus v='6'/>
23:23:14:  </slot>
23:23:14:</config>
23:23:35:WU00:FS01:0xa4:Completed 55000 out of 250000 steps  (22%)
23:24:54:WU02:FS00:0x15:Completed   3600000 out of 40000000 steps (9%).
23:24:59:WU00:FS01:0xa4:Completed 57500 out of 250000 steps  (23%)
23:26:23:WU00:FS01:0xa4:Completed 60000 out of 250000 steps  (24%)
23:27:34:WU02:FS00:0x15:Completed   4000000 out of 40000000 steps (10%).

...

01:08:55:WU00:FS01:0xa4:Completed 242500 out of 250000 steps  (97%)
01:10:18:WU00:FS01:0xa4:Completed 245000 out of 250000 steps  (98%)
01:11:04:WU02:FS00:0x15:Completed  19600000 out of 40000000 steps (49%).
01:11:41:WU00:FS01:0xa4:Completed 247500 out of 250000 steps  (99%)
01:11:42:WU01:FS01:Connecting to assign3.stanford.edu:8080
01:11:43:WU01:FS01:Assigned to work server 171.64.65.124
01:11:43:WU01:FS01:Requesting new work unit for slot 01: RUNNING cpu:6 from 171.64.65.124
01:11:43:WU01:FS01:Connecting to 171.64.65.124:8080
01:11:51:WU01:FS01:Downloading 862.96KiB
01:11:51:WU01:FS01:Download complete
01:11:51:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9005 run:5 clone:1 gen:56 core:0xa4 unit:0x0000003c664f2de452b80061a81fe752
01:13:03:WU00:FS01:0xa4:Completed 250000 out of 250000 steps  (100%)
01:13:04:WU00:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
01:13:14:WU00:FS01:0xa4:
01:13:14:WU00:FS01:0xa4:Finished Work Unit:
01:13:14:WU00:FS01:0xa4:- Reading up to 872304 from "00/wudata_01.trr": Read 872304
01:13:14:WU00:FS01:0xa4:trr file hash check passed.
01:13:14:WU00:FS01:0xa4:- Reading up to 799976 from "00/wudata_01.xtc": Read 799976
01:13:14:WU00:FS01:0xa4:xtc file hash check passed.
01:13:14:WU00:FS01:0xa4:edr file hash check passed.
01:13:14:WU00:FS01:0xa4:logfile size: 24373
01:13:14:WU00:FS01:0xa4:Leaving Run
01:13:14:WU00:FS01:0xa4:- Writing 1699141 bytes of core data to disk...
01:13:14:WU00:FS01:0xa4:Done: 1698629 -> 1646149 (compressed to 96.9 percent)
01:13:14:WU00:FS01:0xa4:  ... Done.
01:13:14:WU00:FS01:0xa4:- Shutting down core
01:13:14:WU00:FS01:0xa4:
01:13:14:WU00:FS01:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
01:13:15:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
01:13:15:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:9005 run:91 clone:28 gen:49 core:0xa4 unit:0x00000037664f2de452b80510765c2d88
01:13:15:WU00:FS01:Uploading 1.57MiB to 171.64.65.124
01:13:15:WU00:FS01:Connecting to 171.64.65.124:8080
01:13:15:WU01:FS01:Starting
01:13:15:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 5016 -checkpoint 15 -np 6
01:13:15:WU01:FS01:Started FahCore on PID 7176
01:13:15:WU01:FS01:Core PID:8080
01:13:15:WU01:FS01:FahCore 0xa4 started
01:13:15:WU01:FS01:0xa4:
01:13:15:WU01:FS01:0xa4:*------------------------------*
01:13:15:WU01:FS01:0xa4:Folding@Home Gromacs GB Core
01:13:15:WU01:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
01:13:15:WU01:FS01:0xa4:
01:13:15:WU01:FS01:0xa4:Preparing to commence simulation
01:13:15:WU01:FS01:0xa4:- Looking at optimizations...
01:13:15:WU01:FS01:0xa4:- Created dyn
01:13:15:WU01:FS01:0xa4:- Files status OK
01:13:15:WU01:FS01:0xa4:- Expanded 883155 -> 1469104 (decompressed 166.3 percent)
01:13:15:WU01:FS01:0xa4:Called DecompressByteArray: compressed_data_size=883155 data_size=1469104, decompressed_data_size=1469104 diff=0
01:13:15:WU01:FS01:0xa4:- Digital signature verified
01:13:15:WU01:FS01:0xa4:
01:13:15:WU01:FS01:0xa4:Project: 9005 (Run 5, Clone 1, Gen 56)
01:13:15:WU01:FS01:0xa4:
01:13:15:WU01:FS01:0xa4:Assembly optimizations on if available.
01:13:15:WU01:FS01:0xa4:Entering M.D.
01:13:21:WU01:FS01:0xa4:Mapping NT from 6 to 6
01:13:21:WU01:FS01:0xa4:Completed 0 out of 250000 steps  (0%)
01:13:24:WU00:FS01:Upload 3.98%
01:13:30:WU00:FS01:Upload 51.74%
01:13:37:WU00:FS01:Upload complete
01:13:37:WU00:FS01:Server responded WORK_ACK (400)
01:13:37:WU00:FS01:Final credit estimate, 1590.00 points
01:13:37:WU00:FS01:Cleaning up
01:13:43:WU02:FS00:0x15:Completed  20000000 out of 40000000 steps (50%).
01:14:44:WU01:FS01:0xa4:Completed 2500 out of 250000 steps  (1%)
01:16:06:WU01:FS01:0xa4:Completed 5000 out of 250000 steps  (2%)
01:16:23:WU02:FS00:0x15:Completed  20400000 out of 40000000 steps (51%).
01:17:28:WU01:FS01:0xa4:Completed 7500 out of 250000 steps  (3%)

...

02:13:29:WU01:FS01:0xa4:Completed 102500 out of 250000 steps  (41%)
02:14:58:WU01:FS01:0xa4:Completed 105000 out of 250000 steps  (42%)
02:15:21:WU02:FS00:0x15:Completed  28800000 out of 40000000 steps (72%).
02:16:38:WU01:FS01:0xa4:Completed 107500 out of 250000 steps  (43%)
02:18:23:WU01:FS01:0xa4:Completed 110000 out of 250000 steps  (44%)
02:18:27:WU02:FS00:0x15:Completed  29200000 out of 40000000 steps (73%).
02:20:19:WU01:FS01:0xa4:Completed 112500 out of 250000 steps  (45%)
02:21:52:WU02:FS00:0x15:Completed  29600000 out of 40000000 steps (74%).
02:22:02:WU01:FS01:0xa4:Completed 115000 out of 250000 steps  (46%)
02:23:53:WU01:FS01:0xa4:Completed 117500 out of 250000 steps  (47%)
02:25:20:WU02:FS00:0x15:Completed  30000000 out of 40000000 steps (75%).
02:25:41:WU01:FS01:0xa4:Completed 120000 out of 250000 steps  (48%)
02:27:06:WU01:FS01:0xa4:Completed 122500 out of 250000 steps  (49%)
02:28:02:WU02:FS00:0x15:Completed  30400000 out of 40000000 steps (76%).
02:28:31:WU01:FS01:0xa4:Completed 125000 out of 250000 steps  (50%)


And the crash happened at 50% as it seems.
After the BSOD:

Code: Select all
*********************** Log Started 2014-02-04T03:45:05Z ***********************
03:45:05:************************* Folding@home Client *************************
03:45:05:      Website: http://folding.stanford.edu/
03:45:05:    Copyright: (c) 2009-2014 Stanford University
03:45:05:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:45:05:         Args: --open-web-control
03:45:05:       Config: C:/Users/Matan/AppData/Roaming/FAHClient/config.xml
03:45:05:******************************** Build ********************************
03:45:05:      Version: 7.4.2
03:45:05:         Date: Jan 24 2014
03:45:05:         Time: 13:51:17
03:45:05:      SVN Rev: 4112
03:45:05:       Branch: fah/trunk/client
03:45:05:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
03:45:05:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
03:45:05:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
03:45:05:     Platform: win32 XP
03:45:05:         Bits: 32
03:45:05:         Mode: Release
03:45:05:******************************* System ********************************
03:45:05:          CPU: Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz
03:45:05:       CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
03:45:05:         CPUs: 8
03:45:05:       Memory: 15.94GiB
03:45:05:  Free Memory: 13.83GiB
03:45:05:      Threads: WINDOWS_THREADS
03:45:05:   OS Version: 6.1
03:45:05:  Has Battery: false
03:45:05:   On Battery: false
03:45:05:   UTC Offset: 11
03:45:05:          PID: 6632
03:45:05:          CWD: C:/Users/Matan/AppData/Roaming/FAHClient
03:45:05:           OS: Windows 7 Professional
03:45:05:      OS Arch: AMD64
03:45:05:         GPUs: 1
03:45:05:        GPU 0: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
03:45:05:         CUDA: 3.0
03:45:05:  CUDA Driver: 5050
03:45:05:Win32 Service: false
03:45:05:***********************************************************************
03:45:05:<config>
03:45:05:  <!-- Network -->
03:45:05:  <proxy v=':8080'/>
03:45:05:
03:45:05:  <!-- Slot Control -->
03:45:05:  <power v='full'/>
03:45:05:
03:45:05:  <!-- User Information -->
03:45:05:  <passkey v='********************************'/>
03:45:05:  <team v='110285'/>
03:45:05:  <user v='IsraeliRD'/>
03:45:05:
03:45:05:  <!-- Folding Slots -->
03:45:05:  <slot id='0' type='GPU'/>
03:45:05:  <slot id='1' type='CPU'>
03:45:05:    <cpus v='6'/>
03:45:05:  </slot>
03:45:05:</config>
03:45:05:Trying to access database...
03:45:05:Successfully acquired database lock
03:45:05:Enabled folding slot 00: READY gpu:0:GK104 [GeForce GTX 660 Ti]
03:45:05:Enabled folding slot 01: READY cpu:6
03:45:05:WU02:FS00:Starting
03:45:05:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 704 -lifeline 6632 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
03:45:05:WU02:FS00:Started FahCore on PID 6708
03:45:05:WU02:FS00:Core PID:6728
03:45:05:WU02:FS00:FahCore 0x15 started
03:45:05:WU01:FS01:Starting
03:45:05:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 6632 -checkpoint 15 -np 6
03:45:05:WU01:FS01:Started FahCore on PID 6756
03:45:05:WU01:FS01:Core PID:6796
03:45:05:WU01:FS01:FahCore 0xa4 started
03:45:06:WU02:FS00:0x15:
03:45:06:WU02:FS00:0x15:*------------------------------*
03:45:06:WU02:FS00:0x15:Folding@Home GPU Core
03:45:06:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
03:45:06:WU02:FS00:0x15:Build host             AmoebaRemote
03:45:06:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
03:45:06:WU02:FS00:0x15:Core                   15
03:45:06:WU02:FS00:0x15:
03:45:06:WU02:FS00:0x15:Window's signal control handler registered.
03:45:06:WU02:FS00:0x15:Preparing to commence simulation
03:45:06:WU02:FS00:0x15:- Ensuring status. Please wait.
03:45:06:WU01:FS01:0xa4:
03:45:06:WU01:FS01:0xa4:*------------------------------*
03:45:06:WU01:FS01:0xa4:Folding@Home Gromacs GB Core
03:45:06:WU01:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
03:45:06:WU01:FS01:0xa4:
03:45:06:WU01:FS01:0xa4:Preparing to commence simulation
03:45:06:WU01:FS01:0xa4:- Ensuring status. Please wait.
03:45:15:WU02:FS00:0x15:- Looking at optimizations...
03:45:15:WU02:FS00:0x15:- Working with standard loops on this execution.
03:45:15:WU02:FS00:0x15:- Previous termination of core was improper.
03:45:15:WU02:FS00:0x15:- Files status OK
03:45:15:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
03:45:15:WU02:FS00:0x15:- Expanded 80058 -> 307810 (decompressed 384.4 percent)
03:45:15:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=80058 data_size=307810, decompressed_data_size=307810 diff=0
03:45:15:WU02:FS00:0x15:- Digital signature verified
03:45:15:WU02:FS00:0x15:
03:45:15:WU02:FS00:0x15:Project: 7660 (Run 458, Clone 0, Gen 362)
03:45:15:WU02:FS00:0x15:
03:45:15:WU02:FS00:0x15:Entering M.D.
03:45:15:WU01:FS01:0xa4:- Looking at optimizations...
03:45:15:WU01:FS01:0xa4:- Working with standard loops on this execution.
03:45:15:WU01:FS01:0xa4:- Previous termination of core was improper.
03:45:15:WU01:FS01:0xa4:- Files status OK
03:45:15:WU01:FS01:0xa4:- Expanded 883155 -> 1469104 (decompressed 166.3 percent)
03:45:15:WU01:FS01:0xa4:Called DecompressByteArray: compressed_data_size=883155 data_size=1469104, decompressed_data_size=1469104 diff=0
03:45:15:WU01:FS01:0xa4:- Digital signature verified
03:45:15:WU01:FS01:0xa4:
03:45:15:WU01:FS01:0xa4:Project: 9005 (Run 5, Clone 1, Gen 56)
03:45:15:WU01:FS01:0xa4:
03:45:15:WU01:FS01:0xa4:Entering M.D.
03:45:17:WU02:FS00:0x15:Will resume from checkpoint file 02/wudata_01.ckp
03:45:17:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  1084040353 2098037389 688454802 3775256636 3968620388
03:45:17:WU02:FS00:0x15:GPU device id=0
03:45:17:WU02:FS00:0x15:Working on Protein
03:45:17:WU02:FS00:0x15:Client config unavailable.
03:45:17:WU02:FS00:0x15:Starting GUI Server
03:45:21:WU01:FS01:0xa4:Using Gromacs checkpoints
03:45:21:WU01:FS01:0xa4:Mapping NT from 6 to 6
03:45:21:WU01:FS01:0xa4:mdrun returned 255
03:45:21:WU01:FS01:0xa4:Going to send back what have done -- stepsTotalG=0
03:45:21:WU01:FS01:0xa4:Work fraction=0.0000 steps=0.
03:45:25:WU01:FS01:0xa4:logfile size=14548 infoLength=14548 edr=0 trr=25
03:45:25:WU01:FS01:0xa4:logfile size: 14548 info=14548 bed=0 hdr=25
03:45:25:WU01:FS01:0xa4:- Writing 15086 bytes of core data to disk...
03:45:25:WU01:FS01:0xa4:Done: 14574 -> 4934 (compressed to 33.8 percent)
03:45:25:WU01:FS01:0xa4:  ... Done.
03:45:25:WU01:FS01:0xa4:
03:45:25:WU01:FS01:0xa4:Folding@home Core Shutdown: UNSTABLE_MACHINE
03:45:25:WARNING:WU01:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
03:45:25:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:9005 run:5 clone:1 gen:56 core:0xa4 unit:0x0000003c664f2de452b80061a81fe752
03:45:25:WU01:FS01:Uploading 5.32KiB to 171.64.65.124
03:45:25:WU01:FS01:Connecting to 171.64.65.124:8080
03:45:26:WU00:FS01:Connecting to assign3.stanford.edu:8080
03:45:26:WU01:FS01:Upload complete
03:45:26:WU01:FS01:Server responded WORK_ACK (400)
03:45:26:WU01:FS01:Cleaning up
03:45:27:WU00:FS01:Assigned to work server 129.74.246.143
03:45:27:WU00:FS01:Requesting new work unit for slot 01: READY cpu:6 from 129.74.246.143
03:45:27:WU00:FS01:Connecting to 129.74.246.143:8080
03:45:38:WU00:FS01:Downloading 132.59KiB
03:45:38:WU00:FS01:Download complete
03:45:38:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:7046 run:0 clone:110 gen:4 core:0xa4 unit:0x000000210001329c4f395bf9f12f724d
03:45:38:WU00:FS01:Starting
03:45:38:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 6632 -checkpoint 15 -np 6
03:45:38:WU00:FS01:Started FahCore on PID 6808
03:45:38:WU00:FS01:Core PID:6848
03:45:38:WU00:FS01:FahCore 0xa4 started
03:45:38:WU00:FS01:0xa4:
03:45:38:WU00:FS01:0xa4:*------------------------------*
03:45:38:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
03:45:38:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
03:45:38:WU00:FS01:0xa4:
03:45:38:WU00:FS01:0xa4:Preparing to commence simulation
03:45:38:WU00:FS01:0xa4:- Looking at optimizations...
03:45:38:WU00:FS01:0xa4:- Created dyn
03:45:38:WU00:FS01:0xa4:- Files status OK
03:45:38:WU00:FS01:0xa4:- Expanded 135256 -> 305824 (decompressed 226.1 percent)
03:45:38:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=135256 data_size=305824, decompressed_data_size=305824 diff=0
03:45:38:WU00:FS01:0xa4:- Digital signature verified
03:45:38:WU00:FS01:0xa4:
03:45:38:WU00:FS01:0xa4:Project: 7046 (Run 0, Clone 110, Gen 4)
03:45:38:WU00:FS01:0xa4:
03:45:38:WU00:FS01:0xa4:Assembly optimizations on if available.
03:45:38:WU00:FS01:0xa4:Entering M.D.
03:45:44:WU00:FS01:0xa4:Mapping NT from 6 to 6
03:45:44:WU00:FS01:0xa4:Completed 0 out of 25000000 steps  (0%)
03:46:18:WU02:FS00:0x15:Resuming from checkpoint
03:46:18:WU02:FS00:0x15:fcCheckPointResume: retreived and current tpr file hash:
03:46:18:WU02:FS00:0x15:   0   1084040353   1084040353
03:46:18:WU02:FS00:0x15:   1   2098037389   2098037389
03:46:18:WU02:FS00:0x15:   2    688454802    688454802
03:46:18:WU02:FS00:0x15:   3   3775256636   3775256636
03:46:18:WU02:FS00:0x15:   4   3968620388   3968620388
03:46:18:WU02:FS00:0x15:fcCheckPointResume: file hashes same.
03:46:18:WU02:FS00:0x15:fcCheckPointResume: state restored.
03:46:18:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.log Verified 02/wudata_01.log
03:46:18:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.trr Verified 02/wudata_01.trr
03:46:18:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.xtc Verified 02/wudata_01.xtc
03:46:18:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.edr Verified 02/wudata_01.edr
03:46:18:WU02:FS00:0x15:fcCheckPointResume: state restored 2
03:46:18:WU02:FS00:0x15:Resumed from checkpoint
03:46:18:WU02:FS00:0x15:Setting checkpoint frequency: 400000
03:46:18:WU02:FS00:0x15:Completed  30400001 out of 40000000 steps (76%).
03:46:19:WARNING:WU02:FS00:Detected clock skew (1 mins 05 secs), adjusting time estimates
03:48:13:FS01:Paused
03:48:13:FS01:Shutting core down
03:48:18:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
03:48:46:FS00:Shutting core down
03:48:55:Removing old file 'configs/config-20131221-072123.xml'
03:48:55:Saving configuration to config.xml
03:48:55:<config>
03:48:55:  <!-- Network -->
03:48:55:  <proxy v=':8080'/>
03:48:55:
03:48:55:  <!-- Slot Control -->
03:48:55:  <power v='full'/>
03:48:55:
03:48:55:  <!-- User Information -->
03:48:55:  <passkey v='********************************'/>
03:48:55:  <team v='110285'/>
03:48:55:  <user v='IsraeliRD'/>
03:48:55:
03:48:55:  <!-- Folding Slots -->
03:48:55:  <slot id='0' type='GPU'/>
03:48:55:  <slot id='1' type='CPU'>
03:48:55:    <cpus v='6'/>
03:48:55:    <pause-on-start v='true'/>
03:48:55:  </slot>
03:48:55:</config>
03:48:55:Clean exit
03:48:55:WU02:FS00:0x15:Client no longer detected. Shutting down core
03:48:55:WU02:FS00:0x15:
03:48:55:WU02:FS00:0x15:Folding@home Core Shutdown: CLIENT_DIED


This time I manually shut down F@H and destroyed the WU. The lack of movement for nearly 2 minutes on the progress bar (not even 0.01%) and no CPU usage (!) made me fear for a second BSOD.

Code: Select all
*********************** Log Started 2014-02-04T03:49:44Z ***********************
03:49:44:************************* Folding@home Client *************************
03:49:44:      Website: http://folding.stanford.edu/
03:49:44:    Copyright: (c) 2009-2014 Stanford University
03:49:44:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:49:44:         Args: --open-web-control
03:49:44:       Config: C:/Users/Matan/AppData/Roaming/FAHClient/config.xml
03:49:44:******************************** Build ********************************
03:49:44:      Version: 7.4.2
03:49:44:         Date: Jan 24 2014
03:49:44:         Time: 13:51:17
03:49:44:      SVN Rev: 4112
03:49:44:       Branch: fah/trunk/client
03:49:44:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
03:49:44:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
03:49:44:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
03:49:44:     Platform: win32 XP
03:49:44:         Bits: 32
03:49:44:         Mode: Release
03:49:44:******************************* System ********************************
03:49:44:          CPU: Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz
03:49:44:       CPU ID: GenuineIntel Family 6 Model 60 Stepping 3
03:49:44:         CPUs: 8
03:49:44:       Memory: 15.94GiB
03:49:44:  Free Memory: 13.60GiB
03:49:44:      Threads: WINDOWS_THREADS
03:49:44:   OS Version: 6.1
03:49:44:  Has Battery: false
03:49:44:   On Battery: false
03:49:44:   UTC Offset: 11
03:49:44:          PID: 6964
03:49:44:          CWD: C:/Users/Matan/AppData/Roaming/FAHClient
03:49:44:           OS: Windows 7 Professional
03:49:44:      OS Arch: AMD64
03:49:44:         GPUs: 1
03:49:44:        GPU 0: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
03:49:44:         CUDA: 3.0
03:49:44:  CUDA Driver: 5050
03:49:44:Win32 Service: false
03:49:44:***********************************************************************
03:49:44:<config>
03:49:44:  <!-- Network -->
03:49:44:  <proxy v=':8080'/>
03:49:44:
03:49:44:  <!-- Slot Control -->
03:49:44:  <power v='full'/>
03:49:44:
03:49:44:  <!-- User Information -->
03:49:44:  <passkey v='********************************'/>
03:49:44:  <team v='110285'/>
03:49:44:  <user v='IsraeliRD'/>
03:49:44:
03:49:44:  <!-- Folding Slots -->
03:49:44:  <slot id='0' type='GPU'/>
03:49:44:  <slot id='1' type='CPU'>
03:49:44:    <cpus v='6'/>
03:49:44:    <pause-on-start v='true'/>
03:49:44:  </slot>
03:49:44:</config>
03:49:44:Trying to access database...
03:49:44:Successfully acquired database lock
03:49:44:Enabled folding slot 00: READY gpu:0:GK104 [GeForce GTX 660 Ti]
03:49:44:Enabled folding slot 01: PAUSED cpu:6 (by user)
03:49:44:WARNING:WU00:Missing data files, dumping
03:49:44:WU02:FS00:Starting
03:49:44:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 704 -lifeline 6964 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
03:49:44:WU02:FS00:Started FahCore on PID 1360
03:49:44:WU02:FS00:Core PID:6308
03:49:44:WU02:FS00:FahCore 0x15 started
03:49:44:WU00:FS01:Cleaning up
03:49:45:WU02:FS00:0x15:
03:49:45:WU02:FS00:0x15:*------------------------------*
03:49:45:WU02:FS00:0x15:Folding@Home GPU Core
03:49:45:WU02:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
03:49:45:WU02:FS00:0x15:Build host             AmoebaRemote
03:49:45:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
03:49:45:WU02:FS00:0x15:Core                   15
03:49:45:WU02:FS00:0x15:
03:49:45:WU02:FS00:0x15:Window's signal control handler registered.
03:49:45:WU02:FS00:0x15:Preparing to commence simulation
03:49:45:WU02:FS00:0x15:- Looking at optimizations...
03:49:45:WU02:FS00:0x15:- Files status OK
03:49:45:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
03:49:45:WU02:FS00:0x15:- Expanded 80058 -> 307810 (decompressed 384.4 percent)
03:49:45:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=80058 data_size=307810, decompressed_data_size=307810 diff=0
03:49:45:WU02:FS00:0x15:- Digital signature verified
03:49:45:WU02:FS00:0x15:
03:49:45:WU02:FS00:0x15:Project: 7660 (Run 458, Clone 0, Gen 362)
03:49:45:WU02:FS00:0x15:
03:49:45:WU02:FS00:0x15:Assembly optimizations on if available.
03:49:45:WU02:FS00:0x15:Entering M.D.
03:49:46:WU02:FS00:0x15:Will resume from checkpoint file 02/wudata_01.ckp
03:49:46:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  1084040353 2098037389 688454802 3775256636 3968620388
03:49:46:WU02:FS00:0x15:GPU device id=0
03:49:46:WU02:FS00:0x15:Working on Protein
03:49:46:WU02:FS00:0x15:Client config unavailable.
03:49:47:WU02:FS00:0x15:Starting GUI Server
03:49:51:FS01:Unpaused
03:49:51:WU00:FS01:Connecting to assign3.stanford.edu:8080
03:49:52:WU00:FS01:Assigned to work server 129.74.246.143
03:49:52:WU00:FS01:Requesting new work unit for slot 01: READY cpu:6 from 129.74.246.143
03:49:52:WU00:FS01:Connecting to 129.74.246.143:8080
03:49:54:WU00:FS01:Downloading 131.78KiB
03:49:54:WU00:FS01:Download complete
03:49:54:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:7045 run:0 clone:159 gen:6 core:0xa4 unit:0x0000003d0001329c4f395a20a563297c
03:49:54:WU00:FS01:Starting
03:49:54:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 6964 -checkpoint 15 -np 6
03:49:54:WU00:FS01:Started FahCore on PID 6508
03:49:54:WU00:FS01:Core PID:6704
03:49:54:WU00:FS01:FahCore 0xa4 started
03:49:55:WU00:FS01:0xa4:
03:49:55:WU00:FS01:0xa4:*------------------------------*
03:49:55:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
03:49:55:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
03:49:55:WU00:FS01:0xa4:
03:49:55:WU00:FS01:0xa4:Preparing to commence simulation
03:49:55:WU00:FS01:0xa4:- Looking at optimizations...
03:49:55:WU00:FS01:0xa4:- Created dyn
03:49:55:WU00:FS01:0xa4:- Files status OK
03:49:55:WU00:FS01:0xa4:- Expanded 134427 -> 304512 (decompressed 226.5 percent)
03:49:55:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=134427 data_size=304512, decompressed_data_size=304512 diff=0
03:49:55:WU00:FS01:0xa4:- Digital signature verified
03:49:55:WU00:FS01:0xa4:
03:49:55:WU00:FS01:0xa4:Project: 7045 (Run 0, Clone 159, Gen 6)
03:49:55:WU00:FS01:0xa4:
03:49:55:WU00:FS01:0xa4:Assembly optimizations on if available.
03:49:55:WU00:FS01:0xa4:Entering M.D.
03:50:01:WU00:FS01:0xa4:Mapping NT from 6 to 6
03:50:01:WU00:FS01:0xa4:Completed 0 out of 25000000 steps  (0%)
03:50:45:Removing old file 'configs/config-20131225-013216.xml'
03:50:45:Saving configuration to config.xml
03:50:45:<config>
03:50:45:  <!-- Network -->
03:50:45:  <proxy v=':8080'/>
03:50:45:
03:50:45:  <!-- Slot Control -->
03:50:45:  <power v='full'/>
03:50:45:
03:50:45:  <!-- User Information -->
03:50:45:  <passkey v='********************************'/>
03:50:45:  <team v='110285'/>
03:50:45:  <user v='IsraeliRD'/>
03:50:45:
03:50:45:  <!-- Folding Slots -->
03:50:45:  <slot id='0' type='GPU'/>
03:50:45:  <slot id='1' type='CPU'>
03:50:45:    <cpus v='6'/>
03:50:45:  </slot>
03:50:45:</config>
03:50:55:WU02:FS00:0x15:Resuming from checkpoint
03:50:55:WU02:FS00:0x15:fcCheckPointResume: retreived and current tpr file hash:
03:50:55:WU02:FS00:0x15:   0   1084040353   1084040353
03:50:55:WU02:FS00:0x15:   1   2098037389   2098037389
03:50:55:WU02:FS00:0x15:   2    688454802    688454802
03:50:55:WU02:FS00:0x15:   3   3775256636   3775256636
03:50:55:WU02:FS00:0x15:   4   3968620388   3968620388
03:50:55:WU02:FS00:0x15:fcCheckPointResume: file hashes same.
03:50:55:WU02:FS00:0x15:fcCheckPointResume: state restored.
03:50:55:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.log Verified 02/wudata_01.log
03:50:55:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.trr Verified 02/wudata_01.trr
03:50:55:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.xtc Verified 02/wudata_01.xtc
03:50:55:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.edr Verified 02/wudata_01.edr
03:50:55:WU02:FS00:0x15:fcCheckPointResume: state restored 2
03:50:55:WU02:FS00:0x15:Resumed from checkpoint
03:50:55:WU02:FS00:0x15:Setting checkpoint frequency: 400000
03:50:55:WU02:FS00:0x15:Completed  30400001 out of 40000000 steps (76%).
03:50:55:WARNING:WU02:FS00:Detected clock skew (1 mins 11 secs), adjusting time estimates
03:53:34:WU02:FS00:0x15:Completed  30800000 out of 40000000 steps (77%).
03:56:14:WU02:FS00:0x15:Completed  31200000 out of 40000000 steps (78%).
03:58:53:WU02:FS00:0x15:Completed  31600000 out of 40000000 steps (79%).
04:01:33:WU02:FS00:0x15:Completed  32000000 out of 40000000 steps (80%).
04:01:47:WU00:FS01:0xa4:Completed 250000 out of 25000000 steps  (1%)
04:04:12:WU02:FS00:0x15:Completed  32400000 out of 40000000 steps (81%).
04:06:52:WU02:FS00:0x15:Completed  32800000 out of 40000000 steps (82%).
04:09:31:WU02:FS00:0x15:Completed  33200000 out of 40000000 steps (83%).
04:12:11:WU02:FS00:0x15:Completed  33600000 out of 40000000 steps (84%).
04:13:33:WU00:FS01:0xa4:Completed 500000 out of 25000000 steps  (2%)
04:14:50:WU02:FS00:0x15:Completed  34000000 out of 40000000 steps (85%).
04:17:30:WU02:FS00:0x15:Completed  34400000 out of 40000000 steps (86%).
04:20:09:WU02:FS00:0x15:Completed  34800000 out of 40000000 steps (87%).
04:22:49:WU02:FS00:0x15:Completed  35200000 out of 40000000 steps (88%).
04:25:14:WU00:FS01:0xa4:Completed 750000 out of 25000000 steps  (3%)
04:25:28:WU02:FS00:0x15:Completed  35600000 out of 40000000 steps (89%).
04:28:08:WU02:FS00:0x15:Completed  36000000 out of 40000000 steps (90%).
04:30:47:WU02:FS00:0x15:Completed  36400000 out of 40000000 steps (91%).
04:33:26:WU02:FS00:0x15:Completed  36800000 out of 40000000 steps (92%).
04:36:06:WU02:FS00:0x15:Completed  37200000 out of 40000000 steps (93%).
04:36:45:WU00:FS01:0xa4:Completed 1000000 out of 25000000 steps  (4%)
04:38:45:WU02:FS00:0x15:Completed  37600000 out of 40000000 steps (94%).
04:41:25:WU02:FS00:0x15:Completed  38000000 out of 40000000 steps (95%).
04:44:04:WU02:FS00:0x15:Completed  38400000 out of 40000000 steps (96%).
04:46:44:WU02:FS00:0x15:Completed  38800000 out of 40000000 steps (97%).
04:48:55:WU00:FS01:0xa4:Completed 1250000 out of 25000000 steps  (5%)
04:49:22:WU02:FS00:0x15:Completed  39200000 out of 40000000 steps (98%).
04:52:02:WU02:FS00:0x15:Completed  39600000 out of 40000000 steps (99%).
04:52:03:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
04:52:04:WU01:FS00:Assigned to work server 171.64.65.105
04:52:04:WU01:FS00:Requesting new work unit for slot 00: RUNNING gpu:0:GK104 [GeForce GTX 660 Ti] from 171.64.65.105
04:52:04:WU01:FS00:Connecting to 171.64.65.105:8080
04:52:06:WU01:FS00:Downloading 78.87KiB
04:52:06:WU01:FS00:Download complete
04:52:06:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7660 run:246 clone:0 gen:393 core:0x15 unit:0x00000246664f2dd150f83f48def6587b
04:54:42:WU02:FS00:0x15:Completed  40000000 out of 40000000 steps (100%).
04:54:42:WU02:FS00:0x15:Finished fah_main status=0
04:54:42:WU02:FS00:0x15:Successful run
04:54:42:WU02:FS00:0x15:DynamicWrapper: Finished Work Unit: sleep=10000
04:54:52:WU02:FS00:0x15:Reserved 450628 bytes for xtc file; Cosm status=0
04:54:52:WU02:FS00:0x15:Allocated 450628 bytes for xtc file
04:54:52:WU02:FS00:0x15:- Reading up to 450628 from "02/wudata_01.xtc": Read 450628
04:54:52:WU02:FS00:0x15:Read 450628 bytes from xtc file; available packet space=785979836
04:54:52:WU02:FS00:0x15:xtc file hash check passed.
04:54:52:WU02:FS00:0x15:Reserved 28344 28344 785979836 bytes for arc file=<02/wudata_01.trr> Cosm status=0
04:54:52:WU02:FS00:0x15:Allocated 28344 bytes for arc file
04:54:52:WU02:FS00:0x15:- Reading up to 28344 from "02/wudata_01.trr": Read 28344
04:54:52:WU02:FS00:0x15:Read 28344 bytes from arc file; available packet space=785951492
04:54:52:WU02:FS00:0x15:trr file hash check passed.
04:54:52:WU02:FS00:0x15:Allocated 544 bytes for edr file
04:54:52:WU02:FS00:0x15:Read bedfile
04:54:52:WU02:FS00:0x15:edr file hash check passed.
04:54:52:WU02:FS00:0x15:Allocated 36951 bytes for logfile
04:54:52:WU02:FS00:0x15:Read logfile
04:54:52:WU02:FS00:0x15:GuardedRun: success in DynamicWrapper
04:54:52:WU02:FS00:0x15:GuardedRun: done
04:54:52:WU02:FS00:0x15:Run: GuardedRun completed.
04:54:54:WU02:FS00:0x15:+ Opened results file
04:54:54:WU02:FS00:0x15:- Writing 516979 bytes of core data to disk...
04:54:54:WU02:FS00:0x15:Done: 516467 -> 486552 (compressed to 94.2 percent)
04:54:54:WU02:FS00:0x15:  ... Done.
04:54:54:WU02:FS00:0x15:DeleteFrameFiles: successfully deleted file=02/wudata_01.ckp
04:54:54:WU02:FS00:0x15:Shutting down core
04:54:54:WU02:FS00:0x15:
04:54:54:WU02:FS00:0x15:Folding@home Core Shutdown: FINISHED_UNIT
04:54:55:WU02:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
04:54:55:WU02:FS00:Sending unit results: id:02 state:SEND error:NO_ERROR project:7660 run:458 clone:0 gen:362 core:0x15 unit:0x00000215664f2dd150f83f66a3bb1c3c
04:54:55:WU02:FS00:Uploading 475.65KiB to 171.64.65.105
04:54:55:WU02:FS00:Connecting to 171.64.65.105:8080
04:54:55:WU01:FS00:Starting
04:54:55:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Matan/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 704 -lifeline 6964 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
04:54:55:WU01:FS00:Started FahCore on PID 5104
04:54:55:WU01:FS00:Core PID:4628
04:54:55:WU01:FS00:FahCore 0x15 started
04:54:55:WU01:FS00:0x15:
04:54:55:WU01:FS00:0x15:*------------------------------*
04:54:55:WU01:FS00:0x15:Folding@Home GPU Core
04:54:55:WU01:FS00:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
04:54:55:WU01:FS00:0x15:Build host             AmoebaRemote
04:54:55:WU01:FS00:0x15:Board Type             NVIDIA/CUDA
04:54:55:WU01:FS00:0x15:Core                   15
04:54:55:WU01:FS00:0x15:
04:54:55:WU01:FS00:0x15:Window's signal control handler registered.
04:54:55:WU01:FS00:0x15:Preparing to commence simulation
04:54:55:WU01:FS00:0x15:- Looking at optimizations...
04:54:55:WU01:FS00:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
04:54:55:WU01:FS00:0x15:- Created dyn
04:54:55:WU01:FS00:0x15:- Files status OK
04:54:55:WU01:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
04:54:55:WU01:FS00:0x15:- Expanded 80255 -> 307810 (decompressed 383.5 percent)
04:54:55:WU01:FS00:0x15:Called DecompressByteArray: compressed_data_size=80255 data_size=307810, decompressed_data_size=307810 diff=0
04:54:55:WU01:FS00:0x15:- Digital signature verified
04:54:55:WU01:FS00:0x15:
04:54:55:WU01:FS00:0x15:Project: 7660 (Run 246, Clone 0, Gen 393)
04:54:55:WU01:FS00:0x15:
04:54:55:WU01:FS00:0x15:Assembly optimizations on if available.
04:54:55:WU01:FS00:0x15:Entering M.D.
04:54:57:WU01:FS00:0x15:Tpr hash 01/wudata_01.tpr:  2691453271 130386452 2316240890 801121561 2993768453
04:54:57:WU01:FS00:0x15:GPU device id=0
04:54:57:WU01:FS00:0x15:Working on Protein
04:54:57:WU01:FS00:0x15:Client config unavailable.
04:54:57:WU01:FS00:0x15:Starting GUI Server
04:55:00:WU02:FS00:Upload complete
04:55:00:WU02:FS00:Server responded WORK_ACK (400)
04:55:00:WU02:FS00:Final credit estimate, 4431.00 points
04:55:00:WU02:FS00:Cleaning up
04:56:06:WU01:FS00:0x15:Setting checkpoint frequency: 400000
04:56:06:WU01:FS00:0x15:Completed         3 out of 40000000 steps (0%).
04:58:45:WU01:FS00:0x15:Completed    400000 out of 40000000 steps (1%).
05:01:06:WU00:FS01:0xa4:Completed 1500000 out of 25000000 steps  (6%)
05:01:25:WU01:FS00:0x15:Completed    800000 out of 40000000 steps (2%).
05:03:28:FS01:Finishing
05:04:04:WU01:FS00:0x15:Completed   1200000 out of 40000000 steps (3%).
05:06:43:WU01:FS00:0x15:Completed   1600000 out of 40000000 steps (4%).
05:09:23:WU01:FS00:0x15:Completed   2000000 out of 40000000 steps (5%).
05:12:02:WU01:FS00:0x15:Completed   2400000 out of 40000000 steps (6%).
05:12:58:WU00:FS01:0xa4:Completed 1750000 out of 25000000 steps  (7%)
05:14:42:WU01:FS00:0x15:Completed   2800000 out of 40000000 steps (7%).
05:17:22:WU01:FS00:0x15:Completed   3200000 out of 40000000 steps (8%).
05:20:03:WU01:FS00:0x15:Completed   3600000 out of 40000000 steps (9%).
05:22:43:WU01:FS00:0x15:Completed   4000000 out of 40000000 steps (10%).
05:25:24:WU01:FS00:0x15:Completed   4400000 out of 40000000 steps (11%).
05:25:29:WU00:FS01:0xa4:Completed 2000000 out of 25000000 steps  (8%)
05:28:03:WU01:FS00:0x15:Completed   4800000 out of 40000000 steps (12%).


This is the program as of right now. Currently I'm thinking whether to let this run, or stop after the CPU WU finishes and revert back to 7.3.6.
7.4.2. hasn't been nice with this second freeze, plus CPU:6 instead of CPU:8 means less TPF too, whereas with 7.3.6 I had CPU:8.
Does leaving it as CPU:6 really makes the TPF for the GPU be faster? If yes, then I'll probably leave it at 6, but against those two crashes... ugh.
IsraeliRD
 
Posts: 13
Joined: Wed Nov 13, 2013 7:38 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby PantherX » Tue Feb 04, 2014 12:24 pm

IsraeliRD wrote:...Does leaving it as CPU:6 really makes the TPF for the GPU be faster? If yes, then I'll probably leave it at 6, but against those two crashes... ugh.

It depends on the WU assigned to your system:
FahCore_15 WUs -> They will hardly use any CPU Usage so using 8 CPUs for the CPU Slot would be fine
FahCore_17 WUs -> They will need 1 CPU per GPU so using 8 CPUs for the CPU Slot will over-subscribe your CPU which will cause a significant slow-down for the CPU WU and possibly the GPU WU too.

Since your GPU supports both types of WUs, if you want a set-and-forget configuration, use 6 CPUs for the CPU Slot which leaves 2 CPUs free for either your GPU or your OS if the system is a non-dedicated one. Alternatively, you can switch between 6 CPUs and 8 CPUs every time your GPU gets FahCore_15 WU so you would have to monitor your system closely.
User avatar
PantherX
Site Moderator
 
Posts: 6614
Joined: Wed Dec 23, 2009 9:33 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby bruce » Wed Feb 05, 2014 6:10 pm

The most important thing you can do is figure out how to avoid the BSOD. First guess: Overclocking or overheating.

If overclocked, I'd go back to stock clocks for a while.

If the computer has been running long enough to accumulate dust in the narrow heatsink passages, I'd give it a good cleaning with canned air. If you're willing to invalidate your warranty and you trust your computer-building skills, I'd remove the thermal "gum" between the CPU and the heatsink and replace it with a very thin layer of high-quality thermal compound. ETC. (If you don't trust your skills, any computer shop will do it for a fee.)

If your gpu is a reference design that blows the heat out the back, good. If not, be sure you have enough air circulating through the case to keep it from trapping the heat inside.
bruce
 
Posts: 21282
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: A few problems (faulty WUs/crash/hard-reset)

Postby IsraeliRD » Thu Feb 06, 2014 10:21 am

I don't overclock, and the system doesn't overheat. It actually got a good cleaning last month, and has plenty of air circulation. The GPU is steady at 62-63 degs and CPU is just as happy.

I decided to go back to 7.3.6, got a new WU for the CPU but then it got stuck for 10 minutes and made no progress despite task manager showing it running at 75% (CPU:6). I decided to dump it and got another one with the same result. I upgraded back to 7.4.2 and lo and behold, the WU started work instantly.
IsraeliRD
 
Posts: 13
Joined: Wed Nov 13, 2013 7:38 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby PantherX » Thu Feb 06, 2014 11:48 am

IsraeliRD wrote:...it got stuck for 10 minutes and made no progress despite task manager showing it running at 75% (CPU:6)...

Please note that it may take several minutes for a single percentage to be completed especially if other processes are using the CPU. Since you checked in Task Manager and found that FahCore_a* was using 75% of the CPU, it wasn't stuck, rather, it was busy folding. Generally, if the system is left overnight and no progress has been made, you may then call it as being stuck.

Nonetheless, glad that V7.4.2 is working out for you.
User avatar
PantherX
Site Moderator
 
Posts: 6614
Joined: Wed Dec 23, 2009 9:33 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby IsraeliRD » Thu Feb 06, 2014 1:15 pm

Checking my logs again, it was 14 minutes, not 10. P8566 (R1/C4/G222). 8577 (R1/C5/G232) did the same thing, and 8570 (R1/C4/G226) is when things straightened out.
When I relaunched F@H (P8566) the progress was still on 0 steps out of 500000, so I presumed nothing happened. Considering I always got 1% within 10 minutes, and NOTHING was sitting on the CPU, I presumed something messed up and I decided to avoid the BSOD altogether :)
IsraeliRD
 
Posts: 13
Joined: Wed Nov 13, 2013 7:38 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby Joe_H » Thu Feb 06, 2014 5:18 pm

If you look at the Project Summary page it shows that projects in that 85nn range are on the larger size. So they will take a bit of time before reaching 1%. Due to the way the client displays the estimates based on information cached from prior runs, if a WU from that particular project has not been processed before, then the client can not provide an estimate until a full frame has been completed.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Joe_H
Site Admin
 
Posts: 3820
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: A few problems (faulty WUs/crash/hard-reset)

Postby IsraeliRD » Thu Feb 06, 2014 9:02 pm

Ah ha! I thought it was just my client about to give me a BSOD heh. Thanks for the explanation :)
IsraeliRD
 
Posts: 13
Joined: Wed Nov 13, 2013 7:38 am

Re: A few problems (faulty WUs/crash/hard-reset)

Postby PantherX » Fri Feb 07, 2014 3:45 am

Generally speaking BSOD is usually cause by hardware issues and/or drivers (http://pcsupport.about.com/od/fixthepro ... errors.htm) (http://www.howtogeek.com/163452/everyth ... -of-death/).
User avatar
PantherX
Site Moderator
 
Posts: 6614
Joined: Wed Dec 23, 2009 9:33 am


Return to CPU Projects - released FAHCores _a4 & _a7

Who is online

Users browsing this forum: No registered users and 1 guest

cron