Problem with 8036

Moderators: Site Moderators, FAHC Science Team

Ripper36
Posts: 60
Joined: Sun Sep 18, 2011 8:55 am

Problem with 8036

Post by Ripper36 »

I am getting the old "UNKNOWN_ENUM (-1 = 0xffffffff)" error on my machine which had trouble previously.

The problem appears to be the same one Bruce reported on the Beta forum (I can't post there).

Code:

*********************** Log Started 2012-04-19T20:47:30Z ***********************
20:47:30:************************* Folding@home Client *************************
20:47:30:      Website: http://folding.stanford.edu/
20:47:30:    Copyright: (c) 2009-2012 Stanford University
20:47:30:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:47:30:         Args: --lifeline 2100 --command-port=36330
20:47:30:       Config: C:/Users/John/AppData/Roaming/FAHClient/config.xml
20:47:30:******************************** Build ********************************
20:47:30:      Version: 7.1.50
20:47:30:         Date: Mar 3 2012
20:47:30:         Time: 15:59:15
20:47:30:      SVN Rev: 3277
20:47:30:       Branch: fah/trunk/client
20:47:30:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
20:47:30:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
20:47:30:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
20:47:30:     Platform: win32 XP
20:47:30:         Bits: 32
20:47:30:         Mode: Release
20:47:30:******************************* System ********************************
20:47:30:          CPU: Intel(R) Core(TM)2 Quad CPU Q9450 @ 2.66GHz
20:47:30:       CPU ID: GenuineIntel Family 6 Model 23 Stepping 7
20:47:30:         CPUs: 4
20:47:30:       Memory: 4.00GiB
20:47:30:  Free Memory: 2.91GiB
20:47:30:      Threads: WINDOWS_THREADS
20:47:30:   On Battery: false
20:47:30:   UTC offset: 10
20:47:30:          PID: 3380
20:47:30:          CWD: C:/Users/John/AppData/Roaming/FAHClient
20:47:30:           OS: Windows 7 Ultimate
20:47:30:      OS Arch: AMD64
20:47:30:         GPUs: 1
20:47:30:        GPU 0: FERMI:1 GF110 [Geforce GTX 570 HD]
20:47:30:         CUDA: 2.0
20:47:30:  CUDA Driver: 4020
20:47:30:Win32 Service: false
20:47:30:***********************************************************************
20:47:30:<config>
20:47:30:  <service-description v='Folding@home Client'/>
20:47:30:  <service-restart v='true'/>
20:47:30:  <service-restart-delay v='5000'/>
20:47:30:
20:47:30:  <!-- Client Control -->
20:47:30:  <cycle-rate v='4'/>
20:47:30:  <cycles v='-1'/>
20:47:30:  <data-directory v='.'/>
20:47:30:  <disable-project-lookup v='false'/>
20:47:30:  <exec-directory v='C:\Program Files (x86)\FAHClient'/>
20:47:30:  <exit-when-done v='false'/>
20:47:30:  <threads v='4'/>
20:47:30:
20:47:30:  <!-- Configuration -->
20:47:30:  <config-rotate v='true'/>
20:47:30:  <config-rotate-dir v='configs'/>
20:47:30:  <config-rotate-max v='16'/>
20:47:30:
20:47:30:  <!-- Debugging -->
20:47:30:  <assignment-servers>
20:47:30:    assign3.stanford.edu:8080 assign4.stanford.edu:80
20:47:30:  </assignment-servers>
20:47:30:  <capture-directory v='capture'/>
20:47:30:  <capture-sockets v='false'/>
20:47:30:  <debug-sockets v='false'/>
20:47:30:  <exception-locations v='true'/>
20:47:30:  <gpu-assignment-servers>
20:47:30:    assign-GPU.stanford.edu:80 assign-GPU.stanford.edu:8080
20:47:30:  </gpu-assignment-servers>
20:47:30:  <stack-traces v='false'/>
20:47:30:
20:47:30:  <!-- Error Handling -->
20:47:30:  <max-slot-errors v='5'/>
20:47:30:  <max-unit-errors v='5'/>
20:47:30:
20:47:30:  <!-- FahCore Control -->
20:47:30:  <checkpoint v='11'/>
20:47:30:  <core-dir v='cores'/>
20:47:30:  <core-priority v='idle'/>
20:47:30:  <cpu-affinity v='false'/>
20:47:30:  <cpu-usage v='100'/>
20:47:30:  <no-assembly v='false'/>
20:47:30:
20:47:30:  <!-- Folding Slot Configuration -->
20:47:30:  <client-subtype v='STDCLI'/>
20:47:30:  <client-type v='normal'/>
20:47:30:  <cpu-species v='X86_PENTIUM_II'/>
20:47:30:  <cpu-type v='AMD64'/>
20:47:30:  <cpus v='-1'/>
20:47:30:  <cuda-index v='0'/>
20:47:30:  <gpu v='true'/>
20:47:30:  <gpu-usage v='90'/>
20:47:30:  <max-packet-size v='normal'/>
20:47:30:  <opencl-index v='0'/>
20:47:30:  <os-species v='UNKNOWN'/>
20:47:30:  <os-type v='WIN32'/>
20:47:30:  <project-key v='0'/>
20:47:30:  <smp v='true'/>
20:47:30:
20:47:30:  <!-- Logging -->
20:47:30:  <log v='log.txt'/>
20:47:30:  <log-color v='false'/>
20:47:30:  <log-crlf v='true'/>
20:47:30:  <log-date v='false'/>
20:47:30:  <log-date-periodically v='21600'/>
20:47:30:  <log-debug v='true'/>
20:47:30:  <log-domain v='false'/>
20:47:30:  <log-header v='true'/>
20:47:30:  <log-level v='true'/>
20:47:30:  <log-no-info-header v='true'/>
20:47:30:  <log-redirect v='false'/>
20:47:30:  <log-rotate v='true'/>
20:47:30:  <log-rotate-dir v='logs'/>
20:47:30:  <log-rotate-max v='16'/>
20:47:30:  <log-short-level v='false'/>
20:47:30:  <log-simple-domains v='true'/>
20:47:30:  <log-thread-id v='false'/>
20:47:30:  <log-thread-prefix v='true'/>
20:47:30:  <log-time v='true'/>
20:47:30:  <log-to-screen v='true'/>
20:47:30:  <log-truncate v='false'/>
20:47:30:  <verbosity v='5'/>
20:47:30:
20:47:30:  <!-- Network -->
20:47:30:  <proxy v=':8080'/>
20:47:30:  <proxy-enable v='false'/>
20:47:30:  <proxy-pass v=''/>
20:47:30:  <proxy-user v=''/>
20:47:30:
20:47:30:  <!-- Process Control -->
20:47:30:  <child v='false'/>
20:47:30:  <daemon v='false'/>
20:47:30:  <pid v='false'/>
20:47:30:  <pid-file v='Folding@home Client.pid'/>
20:47:30:  <respawn v='false'/>
20:47:30:  <service v='false'/>
20:47:30:
20:47:30:  <!-- Remote Command Server -->
20:47:30:  <command-address v='0.0.0.0'/>
20:47:30:  <command-allow v='127.0.0.1,192.168.1.0/24'/>
20:47:30:  <command-allow-no-pass v='127.0.0.1'/>
20:47:30:  <command-deny v='0.0.0.0/0'/>
20:47:30:  <command-deny-no-pass v='0.0.0.0/0'/>
20:47:30:  <command-port v='36330'/>
20:47:30:  <password v='*******'/>
20:47:30:
20:47:30:  <!-- Slot Control -->
20:47:30:  <max-shutdown-wait v='60'/>
20:47:30:  <pause-on-battery v='false'/>
20:47:30:  <pause-on-start v='false'/>
20:47:30:
20:47:30:  <!-- User Information -->
20:47:30:  <machine-id v='0'/>
20:47:30:  <passkey v='********************************'/>
20:47:30:  <team v='24'/>
20:47:30:  <user v='jrimmer'/>
20:47:30:
20:47:30:  <!-- Work Unit Control -->
20:47:30:  <dump-after-deadline v='true'/>
20:47:30:  <max-queue v='16'/>
20:47:30:  <max-units v='0'/>
20:47:30:  <next-unit-percentage v='98'/>
20:47:30:
20:47:30:  <!-- Folding Slots -->
20:47:30:  <slot id='0' type='GPU'/>
20:47:30:  <slot id='1' type='SMP'>
20:47:30:    <cpus v='-1'/>
20:47:30:  </slot>
20:47:30:</config>
20:47:31:Trying to access database...
20:47:31:Successfully acquired database lock
20:47:31:Enabled folding slot 00: READY gpu:0:"GF110 [Geforce GTX 570 HD]"
20:47:31:Enabled folding slot 01: READY smp:4
20:47:31:Started thread 1 on PID 3380
20:47:31:Started thread 5 on PID 3380
20:47:31:Started thread 3 on PID 3380
20:47:31:Started thread 6 on PID 3380
20:47:31:Started thread 4 on PID 3380
20:47:31:WU02:FS00:Starting
20:47:31:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/John/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 701 -lifeline 3380 -checkpoint 11 -gpu 0
20:47:34:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
20:47:34:Started thread 7 on PID 3380
20:47:35:WU02:FS00:Started FahCore on PID 2164
20:47:35:Started thread 8 on PID 3380
20:47:37:Server connection id=2 on 0.0.0.0:36330 from 192.168.1.3
20:47:37:Started thread 9 on PID 3380
20:47:38:WU02:FS00:Core PID:220
20:47:38:WU02:FS00:FahCore 0x15 started
20:47:39:Server connection id=3 on 0.0.0.0:36330 from 192.168.1.9
20:47:39:Started thread 10 on PID 3380
20:47:39:WU00:FS01:Starting
20:47:39:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/John/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 701 -lifeline 3380 -checkpoint 11 -np 4
20:47:39:WU02:FS00:0x15:
20:47:39:WU02:FS00:0x15:*------------------------------*
20:47:39:WU02:FS00:0x15:Folding@Home GPU Core
20:47:39:WU02:FS00:0x15:Version                2.22 (Thu Dec 8 17:08:05 PST 2011)
20:47:39:WU02:FS00:0x15:Build host             SimbiosNvdWin7
20:47:39:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
20:47:39:WU02:FS00:0x15:Core                   15
20:47:39:WU02:FS00:0x15:
20:47:39:WU02:FS00:0x15:Window's signal control handler registered.
20:47:39:WU02:FS00:0x15:Preparing to commence simulation
20:47:39:WU02:FS00:0x15:- Ensuring status. Please wait.
20:47:40:WU00:FS01:Started FahCore on PID 3676
20:47:40:Started thread 11 on PID 3380
20:47:43:WU00:FS01:Core PID:1612
20:47:43:WU00:FS01:FahCore 0xa4 started
20:47:43:WU00:FS01:0xa4:
20:47:43:WU00:FS01:0xa4:*------------------------------*
20:47:43:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
20:47:43:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
20:47:43:WU00:FS01:0xa4:
20:47:43:WU00:FS01:0xa4:Preparing to commence simulation
20:47:43:WU00:FS01:0xa4:- Looking at optimizations...
20:47:43:WU00:FS01:0xa4:- Files status OK
20:47:43:WU00:FS01:0xa4:- Expanded 881544 -> 1984368 (decompressed 225.1 percent)
20:47:43:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=881544 data_size=1984368, decompressed_data_size=1984368 diff=0
20:47:43:WU00:FS01:0xa4:- Digital signature verified
20:47:43:WU00:FS01:0xa4:
20:47:43:WU00:FS01:0xa4:Project: 8024 (Run 117, Clone 13, Gen 19)
20:47:43:WU00:FS01:0xa4:
20:47:43:WU00:FS01:0xa4:Assembly optimizations on if available.
20:47:43:WU00:FS01:0xa4:Entering M.D.
20:47:48:WU02:FS00:0x15:- Looking at optimizations...
20:47:48:WU02:FS00:0x15:- Working with standard loops on this execution.
20:47:48:WU02:FS00:0x15:- Previous termination of core was improper.
20:47:48:WU02:FS00:0x15:- Going to use standard loops.
20:47:48:WU02:FS00:0x15:- Files status OK
20:47:49:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
20:47:49:WU02:FS00:0x15:- Expanded 121860 -> 544090 (decompressed 446.4 percent)
20:47:49:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=121860 data_size=544090, decompressed_data_size=544090 diff=0
20:47:49:WU02:FS00:0x15:- Digital signature verified
20:47:49:WU02:FS00:0x15:
20:47:49:WU02:FS00:0x15:Project: 8036 (Run 4, Clone 38, Gen 4)
20:47:49:WU02:FS00:0x15:
20:47:49:WU02:FS00:0x15:Entering M.D.
20:47:49:WU00:FS01:0xa4:Using Gromacs checkpoints
20:47:50:WU00:FS01:0xa4:Mapping NT from 4 to 4 
20:47:51:WU02:FS00:0x15:Will resume from checkpoint file 02/wudata_01.ckp
20:47:51:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  2690400245 412859068 1927453023 1097432841 2413346664
20:47:51:WU02:FS00:0x15:GPU device info: vendor=0 device=0 name=<NA> match=0
20:47:51:WU02:FS00:0x15:Working on Protein
20:47:51:WU02:FS00:0x15:Client config unavailable.
20:47:53:WU00:FS01:0xa4:Resuming from checkpoint
20:47:53:WU00:FS01:0xa4:Verified 00/wudata_01.log
20:47:53:WU02:FS00:0x15:Starting GUI Server
20:47:53:WU00:FS01:0xa4:Verified 00/wudata_01.trr
20:47:53:WU00:FS01:0xa4:Verified 00/wudata_01.xtc
20:47:53:WU00:FS01:0xa4:Verified 00/wudata_01.edr
20:47:53:WU00:FS01:0xa4:Completed 62050 out of 250000 steps  (24%)
20:48:19:WU00:FS01:0xa4:Completed 62500 out of 250000 steps  (25%)
20:48:57:WU02:FS00:0x15:Resuming from checkpoint
20:48:57:WU02:FS00:0x15:fcCheckPointResume: retreived and current tpr file hash:
20:48:57:WU02:FS00:0x15:   0   2690400245   2690400245
20:48:57:WU02:FS00:0x15:   1    412859068    412859068
20:48:57:WU02:FS00:0x15:   2   1927453023   1927453023
20:48:57:WU02:FS00:0x15:   3   1097432841   1097432841
20:48:57:WU02:FS00:0x15:   4   2413346664   2413346664
20:48:57:WU02:FS00:0x15:fcCheckPointResume: file hashes same.
20:48:57:WU02:FS00:0x15:fcCheckPointResume: state restored.
20:48:57:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.log Verified 02/wudata_01.log
20:48:57:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.trr Verified 02/wudata_01.trr
20:48:57:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.xtc Verified 02/wudata_01.xtc
20:48:57:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.edr Verified 02/wudata_01.edr
20:48:57:WU02:FS00:0x15:fcCheckPointResume: state restored 2
20:48:57:WU02:FS00:0x15:Resumed from checkpoint
20:48:57:WU02:FS00:0x15:Setting checkpoint frequency: 250000
20:48:57:WU02:FS00:0x15:Completed   7250001 out of 25000000 steps (29%).
20:51:25:WU02:FS00:0x15:Completed   7500000 out of 25000000 steps (30%).
20:52:32:WU00:FS01:0xa4:Completed 65000 out of 250000 steps  (26%)
20:53:54:WU02:FS00:0x15:Completed   7750000 out of 25000000 steps (31%).
20:56:23:WU02:FS00:0x15:Completed   8000000 out of 25000000 steps (32%).
20:57:01:WU00:FS01:0xa4:Completed 67500 out of 250000 steps  (27%)
20:58:52:WU02:FS00:0x15:Completed   8250000 out of 25000000 steps (33%).
20:59:27:WU02:FS00:FahCore returned: UNKNOWN_ENUM (-1 = 0xffffffff)
20:59:27:WARNING:WU02:FS00:FahCore returned an unknown error code which probably indicates that it crashed
20:59:28:WU02:FS00:Starting
20:59:28:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/John/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 02 -suffix 01 -version 701 -lifeline 3380 -checkpoint 11 -gpu 0
20:59:28:WU02:FS00:Started FahCore on PID 4960
20:59:28:Started thread 12 on PID 3380
20:59:28:WU02:FS00:Core PID:5040
20:59:28:WU02:FS00:FahCore 0x15 started
20:59:28:WU02:FS00:0x15:
20:59:35:WU02:FS00:0x15:*------------------------------*
20:59:35:WU02:FS00:0x15:Folding@Home GPU Core
20:59:35:WU02:FS00:0x15:Version                2.22 (Thu Dec 8 17:08:05 PST 2011)
20:59:35:WU02:FS00:0x15:Build host             SimbiosNvdWin7
20:59:35:WU02:FS00:0x15:Board Type             NVIDIA/CUDA
20:59:35:WU02:FS00:0x15:Core                   15
20:59:35:WU02:FS00:0x15:
20:59:35:WU02:FS00:0x15:Window's signal control handler registered.
20:59:35:WU02:FS00:0x15:Preparing to commence simulation
20:59:35:WU02:FS00:0x15:- Ensuring status. Please wait.
20:59:37:WU02:FS00:0x15:- Looking at optimizations...
20:59:37:WU02:FS00:0x15:- Working with standard loops on this execution.
20:59:38:WU02:FS00:0x15:- Previous termination of core was improper.
20:59:38:WU02:FS00:0x15:- Going to use standard loops.
20:59:38:WU02:FS00:0x15:- Files status OK
20:59:38:WU02:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
20:59:38:WU02:FS00:0x15:- Expanded 121860 -> 544090 (decompressed 446.4 percent)
20:59:38:WU02:FS00:0x15:Called DecompressByteArray: compressed_data_size=121860 data_size=544090, decompressed_data_size=544090 diff=0
20:59:38:WU02:FS00:0x15:- Digital signature verified
20:59:38:WU02:FS00:0x15:
20:59:38:WU02:FS00:0x15:Project: 8036 (Run 4, Clone 38, Gen 4)
20:59:38:WU02:FS00:0x15:
20:59:38:WU02:FS00:0x15:Entering M.D.
20:59:40:WU02:FS00:0x15:Will resume from checkpoint file 02/wudata_01.ckp
20:59:40:WU02:FS00:0x15:Tpr hash 02/wudata_01.tpr:  2690400245 412859068 1927453023 1097432841 2413346664
20:59:40:WU02:FS00:0x15:GPU device info: vendor=0 device=0 name=<NA> match=0
20:59:40:WU02:FS00:0x15:Working on Protein
20:59:40:WU02:FS00:0x15:Client config unavailable.
20:59:40:WU02:FS00:0x15:Starting GUI Server
20:59:53:Server connection id=4 on 0.0.0.0:36330 from 192.168.1.3
20:59:53:Started thread 13 on PID 3380
21:00:47:WU02:FS00:0x15:Resuming from checkpoint
21:00:47:WU02:FS00:0x15:fcCheckPointResume: retreived and current tpr file hash:
21:00:47:WU02:FS00:0x15:   0   2690400245   2690400245
21:00:47:WU02:FS00:0x15:   1    412859068    412859068
21:00:47:WU02:FS00:0x15:   2   1927453023   1927453023
21:00:47:WU02:FS00:0x15:   3   1097432841   1097432841
21:00:47:WU02:FS00:0x15:   4   2413346664   2413346664
21:00:47:WU02:FS00:0x15:fcCheckPointResume: file hashes same.
21:00:47:WU02:FS00:0x15:fcCheckPointResume: state restored.
21:00:47:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.log Verified 02/wudata_01.log
21:00:47:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.trr Verified 02/wudata_01.trr
21:00:47:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.xtc Verified 02/wudata_01.xtc
21:00:47:WU02:FS00:0x15:fcCheckPointResume: name 02/wudata_01.edr Verified 02/wudata_01.edr
21:00:47:WU02:FS00:0x15:fcCheckPointResume: state restored 2
21:00:47:WU02:FS00:0x15:Resumed from checkpoint
21:00:47:WU02:FS00:0x15:Setting checkpoint frequency: 250000
21:00:47:WU02:FS00:0x15:Completed   8250001 out of 25000000 steps (33%).
21:00:47:WARNING:WU02:FS00:Detected clock skew, adjusting time estimates
21:01:05:WU00:FS01:0xa4:Completed 70000 out of 250000 steps  (28%)
21:01:06:WU02:FS00:FahCore returned: UNKNOWN_ENUM (-1 = 0xffffffff)
21:01:06:WARNING:WU02:FS00:FahCore returned an unknown error code which probably indicates that it crashed
21:01:06:WARNING:WU02:FS00:Too many errors, failing
21:01:06:WU02:FS00:Sending unit results: id:02 state:SEND error:FAILED project:8036 run:4 clone:38 gen:4 core:0x15 unit:0x000000066953ee2e4f8a6f19d13e102b
21:01:06:WU02:FS00:Connecting to 171.67.108.142:8080
21:01:07:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
21:01:07:WU02:FS00:Server responded WORK_QUIT (404)
21:01:07:WARNING:WU02:FS00:Server did not like results, dumping
21:01:07:WU02:FS00:Cleaning up
21:01:07:ERROR:WU02:FS00:Exception: Failed to remove directory './work/02': boost::filesystem::remove: The process cannot access the file because it is being used by another process: ".\work\02\wudata_01.edr"
21:01:07:WU02:FS00:Cleaning up
21:01:07:ERROR:WU02:FS00:Exception: Failed to remove directory './work/02': boost::filesystem:remove_all: Access is denied: ".\work\02"
21:01:08:WU01:FS00:News: Welcome to Folding@Home
21:01:08:WU01:FS00:Assigned to work server 171.67.108.142
21:01:08:WU01:FS00:Requesting new work unit for slot 00: READY gpu:0:"GF110 [Geforce GTX 570 HD]" from 171.67.108.142
21:01:08:WU01:FS00:Connecting to 171.67.108.142:8080
21:01:09:WU01:FS00:Downloading 119.33KiB
21:01:11:WU01:FS00:Download complete
21:01:11:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:8036 run:1 clone:82 gen:5 core:0x15 unit:0x000000076953ee2e4f8a6e4ee51f3d29
21:01:11:WU01:FS00:Starting
21:01:11:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/John/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 01 -suffix 01 -version 701 -lifeline 3380 -checkpoint 11 -gpu 0
21:01:11:WU01:FS00:Started FahCore on PID 2500
21:01:11:Started thread 14 on PID 3380
21:01:11:WU01:FS00:Core PID:4324
21:01:11:WU01:FS00:FahCore 0x15 started
21:01:12:WU01:FS00:0x15:
21:01:12:WU01:FS00:0x15:*------------------------------*
21:01:12:WU01:FS00:0x15:Folding@Home GPU Core
21:01:12:WU01:FS00:0x15:Version                2.22 (Thu Dec 8 17:08:05 PST 2011)
21:01:12:WU01:FS00:0x15:Build host             SimbiosNvdWin7
21:01:12:WU01:FS00:0x15:Board Type             NVIDIA/CUDA
21:01:12:WU01:FS00:0x15:Core                   15
21:01:12:WU01:FS00:0x15:
21:01:12:WU01:FS00:0x15:Window's signal control handler registered.
21:01:12:WU01:FS00:0x15:Preparing to commence simulation
21:01:12:WU01:FS00:0x15:- Looking at optimizations...
21:01:12:WU01:FS00:0x15:DeleteFrameFiles: successfully deleted file=01/wudata_01.ckp
21:01:12:WU01:FS00:0x15:- Created dyn
21:01:12:WU01:FS00:0x15:- Files status OK
21:01:12:WU01:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
21:01:12:WU01:FS00:0x15:- Expanded 121680 -> 544090 (decompressed 447.1 percent)
21:01:12:WU01:FS00:0x15:Called DecompressByteArray: compressed_data_size=121680 data_size=544090, decompressed_data_size=544090 diff=0
21:01:12:WU01:FS00:0x15:- Digital signature verified
21:01:12:WU01:FS00:0x15:
21:01:12:WU01:FS00:0x15:Project: 8036 (Run 1, Clone 82, Gen 5)
21:01:12:WU01:FS00:0x15:
21:01:12:WU01:FS00:0x15:Assembly optimizations on if available.
21:01:12:WU01:FS00:0x15:Entering M.D.
21:01:14:WU01:FS00:0x15:Tpr hash 01/wudata_01.tpr:  1967339865 3940803010 1407893090 4069358753 2448718741
21:01:14:WU01:FS00:0x15:GPU device info: vendor=0 device=0 name=<NA> match=0
21:01:14:WU01:FS00:0x15:Working on Protein
21:01:14:WU01:FS00:0x15:Client config unavailable.
21:01:14:WU01:FS00:0x15:Starting GUI Server
21:02:00:Server connection id=1 ended
21:02:00:Lost lifeline PID 2100, exiting
21:02:01:FS00:Shutting core down
21:02:01:FS01:Shutting core down
21:02:06:WU00:FS01:0xa4:Client no longer detected. Shutting down core 
21:02:06:WU00:FS01:0xa4:
21:02:06:WU00:FS01:0xa4:Folding@home Core Shutdown: CLIENT_DIED
21:02:12:Clean exit
21:02:12:WU01:FS00:0x15:Client no longer detected. Shutting down core 
21:02:12:WU01:FS00:0x15:
21:02:12:WU01:FS00:0x15:Folding@home Core Shutdown: CLIENT_DIED
I am folding 8036 without problems on three other GTX570 cards - it is only the card in the machine with the Core2 Quad Q9450 CPU that is having problems.

I have tried all the usual fixes. I will run a memory test to see if that turns up any clues, but in the meantime, is anyone else having problems with 8036?
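
For anyone who wants to tally these failures without scrolling through the whole log, here is a minimal sketch. It assumes the v7 log line format shown above; the names `RETURN_RE` and `tally_core_returns` are made up for illustration and are not part of FAHClient. The sample lines are copied from the log in this post; in practice you would read the text from log.txt instead.

```python
import re
from collections import Counter

# Sample lines copied from the log above.
SAMPLE_LOG = """\
20:58:52:WU02:FS00:0x15:Completed   8250000 out of 25000000 steps (33%).
20:59:27:WU02:FS00:FahCore returned: UNKNOWN_ENUM (-1 = 0xffffffff)
20:59:27:WARNING:WU02:FS00:FahCore returned an unknown error code which probably indicates that it crashed
21:01:06:WU02:FS00:FahCore returned: UNKNOWN_ENUM (-1 = 0xffffffff)
21:01:06:WARNING:WU02:FS00:Too many errors, failing
"""

# Matches lines such as
# "20:59:27:WU02:FS00:FahCore returned: UNKNOWN_ENUM (-1 = 0xffffffff)".
# WARNING lines are skipped because the work unit must follow the timestamp.
RETURN_RE = re.compile(
    r"^(?P<time>\d\d:\d\d:\d\d):(?P<wu>WU\d+):(?P<slot>FS\d+):"
    r"FahCore returned: (?P<name>\w+) \((?P<code>-?\d+) = 0x[0-9a-fA-F]+\)"
)


def tally_core_returns(log_text):
    """Count FahCore return codes per (slot, work unit, status name)."""
    counts = Counter()
    for line in log_text.splitlines():
        match = RETURN_RE.match(line)
        if match:
            counts[(match["slot"], match["wu"], match["name"])] += 1
    return counts


counts = tally_core_returns(SAMPLE_LOG)
for (slot, wu, name), n in sorted(counts.items()):
    print(f"{slot} {wu}: {name} x{n}")
```

Two identical returns on the same work unit in one slot, as in the sample, suggest the core is crashing repeatedly at about the same point after resuming from the checkpoint.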
GreyWhiskers
Posts: 660
Joined: Mon Oct 25, 2010 5:57 am
Hardware configuration: a) Main unit
Sandybridge in HAF922 w/200 mm side fan
--i7 2600K@4.2 GHz
--ASUS P8P67 DeluxeB3
--4GB ADATA 1600 RAM
--750W Corsair PS
--2Seagate Hyb 750&500 GB--WD Caviar Black 1TB
--EVGA 660GTX-Ti FTW - Signature 2 GPU@ 1241 Boost
--MSI GTX560Ti @900MHz
--Win7Home64; FAH V7.3.2; 327.23 drivers

b) 2004 HP a475c desktop, 1 core Pent 4 HT@3.2 GHz; Mem 2GB;HDD 160 GB;Zotac GT430PCI@900 MHz
WinXP SP3-32 FAH v7.3.6 301.42 drivers - GPU slot only

c) 2005 Toshiba M45-S551 laptop w/2 GB mem, 160GB HDD;Pent M 740 CPU @ 1.73 GHz
WinXP SP3-32 FAH v7.3.6 [Receiving Core A4 work units]
d) 2011 lappy-15.6"-1920x1080;i7-2860QM,2.5;IC Diamond Thermal Compound;GTX 560M 1,536MB u/c@700;16GB-1333MHz RAM;HDD:500GBHyb w/ 4GB SSD;Win7HomePrem64;320.18 drivers FAH 7.4.2ß
Location: Saratoga, California USA

Re: Problem with 8036

Post by GreyWhiskers »

I'm finding a problem with 8036 too - looks like the old "2D Clocks" problem. This is on my year-old i7 2600K desktop with GTX560Ti card.

Funny, I had never run into this before. I had a hardware issue, so rebuilt my Windows 7 environment from scratch yesterday. The Nvidia driver that came with the system in Mar 2011 was 266.44, and I never touched it until my rebuild. I loaded the "recommended" driver 296.10, which has been commented on by a number of folks here on the forum.

Anyway, here's what I know:
Win 7 Home Prem SP1 x64
Currently installed: Nvidia 296.10
GTX560Ti (MSI TwinFrozr packaging)
i7 2600K CPU
ASUS P8P67 Deluxe MOBO
4 GB Adata 1600 MHz ram (2x2GB)
Nvidia Control Panel: "Manage 3D Settings", "Power Management" set to "Prefer Maximum Performance".
Power Options control panel: High Performance; Turn off the display: Never.
Personalization control panel: Screen saver: None
Running Afterburner 2.2.0

Symptoms

- Seen issue with Project: 8013 (Run 249, Clone 18, Gen 13)
- currently getting 9:53 TPF, 5599 PPD, where typical is at least 14K PPD. See my history of one GTX560Ti GPU over one year - 4/1/11 to 4/6/12.
- starts up OK in selected core clock (I've tried between stock of 880 and my normal mode of 948 - seems to do the same thing for all settings)
- after a few minutes of folding on the GPU WU with Core 15 version 2.24,
--- the afterburner set clock stays where it was,
--- but the Core Clock in the hardware monitor steps down to something like 408. It's not immediate.
--- The total system power, shown in my CyberPower UPS gadget drops from, say, 300 to 180.
--- The GPU temp goes down to almost ambient, along with GPU fan speed.

- I've rebooted several times while folding - after the reboot, the core clock is at the "high" value for a very short time, then downshifts to the lower clock.

So, what's the current consensus around 8036 - drop the driver from 296.10 to something lower?
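
To put the numbers above in perspective, here is a rough back-of-the-envelope sketch. It assumes the credit per WU is fixed (no time-based bonus), so PPD is inversely proportional to TPF; that is an assumption, not something the client reports. The helper names `tpf_seconds` and `expected_tpf` are invented for this example.

```python
def tpf_seconds(tpf):
    """Convert a 'M:SS' time-per-frame string to seconds."""
    minutes, seconds = tpf.split(":")
    return int(minutes) * 60 + int(seconds)


def expected_tpf(observed_tpf_s, observed_ppd, normal_ppd):
    """TPF implied by normal_ppd, assuming PPD is proportional to 1/TPF."""
    return observed_tpf_s * observed_ppd / normal_ppd


observed = tpf_seconds("9:53")                 # 593 seconds per frame
normal = expected_tpf(observed, 5599, 14000)   # about 237 seconds per frame

print(f"slowdown factor: {14000 / 5599:.2f}x")
print(f"implied normal TPF: {int(normal // 60)}:{int(normal % 60):02d}")
```

Under that assumption, the card is running about 2.5 times slower than usual, which is the kind of drop one would expect if the core clock really is sitting at a 2D value for most of the run.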
GreyWhiskers
Posts: 660
Joined: Mon Oct 25, 2010 5:57 am

Re: Problem with 8036

Post by GreyWhiskers »

Quick update.

The core problem, as it seems to me now, is that this project is quite stressful on the GPU hardware (again, mine is the GTX560Ti MSI TwinFrozr variety). The remedy seems to have been downclocking the GPU from the 948 MHz I'd been running for a long time to 900 MHz to make it more stable. That seems to have done the trick - I uneventfully completed project:8036 run:5 clone:21 gen:9 at full 3D clocks, and am at 17% on the next Project: 8036 (Run 6, Clone 44, Gen 6).

Details:

-- There has been a lot said about having to manipulate the GPU fan settings in Afterburner, if that's what you use. And, I've done that - making the fan get up to 100% at about 65 deg C. I've never had to do that before. The stock fan control seems to have been fine for the thousands of WUs that GPU has folded over the last year.

-- I also wanted to see if changing the Nvidia driver from 296.10 to 285.62 would help. I still saw the periodic downshifts to 2D clocks, so that didn't make any difference.

-- Closer examination of the logs showed that I had a string of attempts to process one particular WU (with all the reboots, the logs are messed up so I don't have the PRCGs) - they all kept failing with an "unstable machine" error code. And, then project:8036 run:6 clone:91 gen:3 failed after 22%:

00:21:47:WU01:FS00:0x15:Run: exception thrown in GuardedRun -- cannot continue further.
00:21:47:WU01:FS00:0x15:Going to send back what have done -- stepsTotalG=25000000
---
00:21:52:WU01:FS00:0x15:Folding@home Core Shutdown: EARLY_UNIT_END
00:21:52:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
00:21:52:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:8036 run:6 clone:91 gen:3 core:0x15 unit:0x000000036953ee2e4f8a6facc094d9a9

-- It turns out that the downshift seems to have been triggered by an Unstable Machine failure. When the FAH client loaded the next WU after the EUE, which was the same old WU it had been failing on, it came back up in 2D clocks. Rebooting the computer brought back the 3d clocks, but I now see that the downshift was triggered by yet more Unstable machine errors.
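
Since the PRCGs are easy to lose when the logs get messy, here is a minimal sketch for pulling them out of whatever log fragments survive. It assumes the "Sending unit results" line format quoted above, and it assumes that a unit sent back cleanly is marked error:OK or error:NO_ERROR; that last part is a guess on my side. `SEND_RE` and `failed_units` are illustrative names only.

```python
import re

# Sample lines copied from the log excerpt above.
SAMPLE_LOG = """\
00:21:52:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
00:21:52:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:8036 run:6 clone:91 gen:3 core:0x15 unit:0x000000036953ee2e4f8a6facc094d9a9
"""

SEND_RE = re.compile(
    r"Sending unit results: .*?error:(?P<error>\w+) "
    r"project:(?P<project>\d+) run:(?P<run>\d+) "
    r"clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)

# Assumption: these two values mean the unit was returned without an error.
SUCCESS_STATES = {"OK", "NO_ERROR"}


def failed_units(log_text):
    """Return (error, project, run, clone, gen) for each unit sent back with an error."""
    results = []
    for line in log_text.splitlines():
        match = SEND_RE.search(line)
        if match and match["error"] not in SUCCESS_STATES:
            prcg = tuple(int(match[key]) for key in ("project", "run", "clone", "gen"))
            results.append((match["error"],) + prcg)
    return results


for error, project, run, clone, gen in failed_units(SAMPLE_LOG):
    print(f"{error}: project:{project} run:{run} clone:{clone} gen:{gen}")
```

Running the same function over each rotated log in the logs directory would give a list of every failed PRCG, which is the information usually asked for when reporting a bad WU here.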
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Problem with 8036

Post by bruce »

Preliminary indications are similar to what GreyWhiskers is reporting. Even though I'm not overclocking my GPU, it is having occasional hardware problems which result in the message UNKNOWN_ENUM (-1 = 0xffffffff). That is certainly not the most helpful message . . . (it doesn't suggest "Run diagnostics on your GPU" or anything like that, but my failures are not something that can be reproduced or diagnosed by the FAH software.)

Note that the -1 error has been with us in V6.
http://fahwiki.net/index.php/CoreStatus ... 0xffffffff
http://fahwiki.net/index.php/CoreStatus ... eturned_-1
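
As an aside on why the log prints "-1 = 0xffffffff": the two are the same 32-bit pattern read as a signed or an unsigned number. A small sketch of that arithmetic follows; the function names are only for illustration, and this says nothing about how the client itself stores the value.

```python
def as_unsigned32(value):
    """Reinterpret an integer as an unsigned 32-bit value."""
    return value & 0xFFFFFFFF


def as_signed32(value):
    """Reinterpret a 32-bit pattern as a signed (two's complement) value."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value >= 0x80000000 else value


print(hex(as_unsigned32(-1)))    # the -1 exit status seen as unsigned
print(as_signed32(0xFFFFFFFF))   # and back again
print(hex(as_unsigned32(114)))   # BAD_WORK_UNIT, for comparison
```

So the hex value carries no extra information; the message simply means the core exited with a status the client has no name for.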
GreyWhiskers
Posts: 660
Joined: Mon Oct 25, 2010 5:57 am

Re: Problem with 8036

Post by GreyWhiskers »

Downclocking to 900 MHz seems to have fixed the problem - I've completed two of the 8036s in the last 24 hours, and am 86% into the third, with no EUEs and no reversion to the 2D clocks. The PPD of 14958 on both of the completed WUs (remarkably consistent TPF, by the way) is well within the norms for this GTX560Ti card, even with the downclocking.

The temps seem very good - a steady 62 deg C as reported by Afterburner - with the fan at 90%. BTW on the temps - I've also had both side panels on the HAF922 case off because of other issues I've been having with my HDDs.
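
If anyone wants to check TPF consistency straight from log.txt rather than from a monitoring tool, here is a minimal sketch. It uses progress lines copied from the log in the first post of this thread as sample data, assumes each "Completed ... steps (N%)" line marks a frame boundary, and ignores midnight rollover. `PROGRESS_RE` and `frame_times` are names invented for this example.

```python
import re

# Progress lines copied from the log in the first post (GPU slot FS00 plus one SMP line).
SAMPLE_LOG = """\
20:51:25:WU02:FS00:0x15:Completed   7500000 out of 25000000 steps (30%).
20:52:32:WU00:FS01:0xa4:Completed 65000 out of 250000 steps  (26%)
20:53:54:WU02:FS00:0x15:Completed   7750000 out of 25000000 steps (31%).
20:56:23:WU02:FS00:0x15:Completed   8000000 out of 25000000 steps (32%).
20:58:52:WU02:FS00:0x15:Completed   8250000 out of 25000000 steps (33%).
"""

PROGRESS_RE = re.compile(
    r"^(\d\d):(\d\d):(\d\d):WU\d+:(FS\d+):0x\w+:"
    r"Completed\s+\d+ out of \d+ steps\s+\((\d+)%\)"
)


def frame_times(log_text, slot):
    """Seconds per 1% frame between consecutive progress lines of one slot."""
    stamps = []
    for line in log_text.splitlines():
        match = PROGRESS_RE.match(line)
        if match and match.group(4) == slot:
            hours, minutes, seconds = (int(match.group(i)) for i in (1, 2, 3))
            stamps.append((int(match.group(5)), hours * 3600 + minutes * 60 + seconds))
    return [
        (t2 - t1) / (p2 - p1)
        for (p1, t1), (p2, t2) in zip(stamps, stamps[1:])
        if p2 > p1
    ]


times = frame_times(SAMPLE_LOG, "FS00")
print(times)
```

A sudden jump in these per-frame times partway through a WU would be a hint that the card has dropped to 2D clocks, without needing to watch Afterburner.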
Ripper36
Posts: 60
Joined: Sun Sep 18, 2011 8:55 am

Re: Problem with 8036

Post by Ripper36 »

I do have a hardware problem, hopefully just memory, which I am having fixed now. Hopefully that is all, and 8036 just gives the memory a wilder workout!