9016 (Run 501, Clone 6, Gen 5) Bad WU?

Moderators: Site Moderators, FAHC Science Team

Kornflake
Posts: 44
Joined: Mon Dec 10, 2012 7:29 pm

9016 (Run 501, Clone 6, Gen 5) Bad WU?

Post by Kornflake »

My system is not overclocked and has been stable so far after a week or two of folding. It's a new system from Dell Alienware.

Relevant section showing failure:

Code:

17:23:19:WU01:FS00:Connecting to 171.67.108.200:8080
17:23:20:WU01:FS00:Assigned to work server 171.64.65.124
17:23:20:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:9 from 171.64.65.124
17:23:20:WU01:FS00:Connecting to 171.64.65.124:8080
17:23:21:WU01:FS00:Downloading 60.91KiB
17:23:21:WU01:FS00:Download complete
17:23:21:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9016 run:501 clone:6 gen:5 core:0xa4 unit:0x0000000a664f2de4549de988b0744e63
17:24:53:WU01:FS00:Starting
17:24:53:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 3536 -checkpoint 15 -np 9
17:24:53:WU01:FS00:Started FahCore on PID 509564
17:24:53:WU01:FS00:Core PID:506636
17:24:53:WU01:FS00:FahCore 0xa4 started
17:24:53:WU01:FS00:0xa4:
17:24:53:WU01:FS00:0xa4:*------------------------------*
17:24:53:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
17:24:53:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
17:24:53:WU01:FS00:0xa4:
17:24:53:WU01:FS00:0xa4:Preparing to commence simulation
17:24:53:WU01:FS00:0xa4:- Looking at optimizations...
17:24:53:WU01:FS00:0xa4:- Created dyn
17:24:53:WU01:FS00:0xa4:- Files status OK
17:24:53:WU01:FS00:0xa4:- Expanded 61860 -> 1397548 (decompressed 2259.2 percent)
17:24:53:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=61860 data_size=1397548, decompressed_data_size=1397548 diff=0
17:24:53:WU01:FS00:0xa4:- Digital signature verified
17:24:53:WU01:FS00:0xa4:
17:24:53:WU01:FS00:0xa4:Project: 9016 (Run 501, Clone 6, Gen 5)
17:24:53:WU01:FS00:0xa4:
17:24:53:WU01:FS00:0xa4:Assembly optimizations on if available.
17:24:53:WU01:FS00:0xa4:Entering M.D.
17:24:59:WU01:FS00:0xa4:Mapping NT from 9 to 9 
17:24:59:WU01:FS00:0xa4:mdrun returned 255
17:24:59:WU01:FS00:0xa4:Going to send back what have done -- stepsTotalG=250000
17:24:59:WU01:FS00:0xa4:Work fraction=0.0000 steps=250000.
17:25:03:WU01:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
17:25:03:WU01:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
17:25:03:WU01:FS00:0xa4:- Writing 640 bytes of core data to disk...
17:25:03:WU01:FS00:0xa4:Done: 128 -> 146 (compressed to 114.0 percent)
17:25:03:WU01:FS00:0xa4:  ... Done.
17:25:03:WU01:FS00:0xa4:
17:25:03:WU01:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
17:25:03:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
17:25:03:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:9016 run:501 clone:6 gen:5 core:0xa4 unit:0x0000000a664f2de4549de988b0744e63
17:25:03:WU01:FS00:Uploading 658B to 171.64.65.124
17:25:03:WU01:FS00:Connecting to 171.64.65.124:8080
17:25:03:WU01:FS00:Upload complete
17:25:04:WU01:FS00:Server responded WORK_ACK (400)
17:25:04:WU01:FS00:Cleaning up

Start of the log file:

Code:

*********************** Log Started 2014-12-21T01:41:21Z ***********************
01:41:21:************************* Folding@home Client *************************
01:41:21:      Website: http://folding.stanford.edu/
01:41:21:    Copyright: (c) 2009-2014 Stanford University
01:41:21:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
01:41:21:         Args: 
01:41:21:       Config: C:/ProgramData/FAHClient/config.xml
01:41:21:******************************** Build ********************************
01:41:21:      Version: 7.4.4
01:41:21:         Date: Mar 4 2014
01:41:21:         Time: 20:26:54
01:41:21:      SVN Rev: 4130
01:41:21:       Branch: fah/trunk/client
01:41:21:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
01:41:21:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
01:41:21:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
01:41:21:     Platform: win32 XP
01:41:21:         Bits: 32
01:41:21:         Mode: Release
01:41:21:******************************* System ********************************
01:41:21:          CPU: Intel(R) Core(TM) i7-5930K CPU @ 3.50GHz
01:41:21:       CPU ID: GenuineIntel Family 6 Model 63 Stepping 2
01:41:21:         CPUs: 12
01:41:21:       Memory: 31.89GiB
01:41:21:  Free Memory: 30.35GiB
01:41:21:      Threads: WINDOWS_THREADS
01:41:21:   OS Version: 6.2
01:41:21:  Has Battery: false
01:41:21:   On Battery: false
01:41:21:   UTC Offset: -5
01:41:21:          PID: 3536
01:41:21:          CWD: C:/ProgramData/FAHClient
01:41:21:           OS: Windows 8.1 Pro
01:41:21:      OS Arch: AMD64
01:41:21:         GPUs: 3
01:41:21:        GPU 0: NVIDIA:4 GM204 [GeForce GTX 980]
01:41:21:        GPU 1: NVIDIA:4 GM204 [GeForce GTX 980]
01:41:21:        GPU 2: NVIDIA:4 GM204 [GeForce GTX 980]
01:41:21:         CUDA: 5.2
01:41:21:  CUDA Driver: 6050
01:41:21:Win32 Service: false
01:41:21:***********************************************************************
01:41:21:<config>
01:41:21:  <!-- Network -->
01:41:21:  <proxy v=':8080'/>
01:41:21:
01:41:21:  <!-- Slot Control -->
01:41:21:  <power v='FULL'/>
01:41:21:
01:41:21:  <!-- User Information -->
01:41:21:  <passkey v='********************************'/>
01:41:21:  <team v='182919'/>
01:41:21:  <user v='Kornflake'/>
01:41:21:
01:41:21:  <!-- Folding Slots -->
01:41:21:  <slot id='0' type='CPU'>
01:41:21:    <paused v='true'/>
01:41:21:  </slot>
01:41:21:  <slot id='1' type='GPU'>
01:41:21:    <paused v='true'/>
01:41:21:  </slot>
01:41:21:  <slot id='2' type='GPU'>
01:41:21:    <paused v='true'/>
01:41:21:  </slot>
01:41:21:  <slot id='3' type='GPU'>
01:41:21:    <paused v='true'/>
01:41:21:  </slot>
01:41:21:</config>
01:41:21:Trying to access database...
01:41:21:Successfully acquired database lock
01:41:21:Enabled folding slot 00: PAUSED cpu:9 (by user)
01:41:21:Enabled folding slot 01: PAUSED gpu:0:GM204 [GeForce GTX 980] (by user)
01:41:21:Enabled folding slot 02: PAUSED gpu:1:GM204 [GeForce GTX 980] (by user)
01:41:21:Enabled folding slot 03: PAUSED gpu:2:GM204 [GeForce GTX 980] (by user)
02:44:22:FS00:Unpaused
02:44:22:FS01:Unpaused
02:44:22:FS02:Unpaused
02:44:22:FS03:Unpaused
02:44:22:WU03:FS02:Starting
02:44:22:WU03:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 03 -suffix 01 -version 704 -lifeline 3536 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
02:44:22:WU03:FS02:Started FahCore on PID 11548
02:44:23:WU03:FS02:Core PID:11968
02:44:23:WU03:FS02:FahCore 0x17 started
02:44:23:WU04:FS03:Starting
02:44:23:WU04:FS03:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_17.fah/FahCore_17.exe -dir 04 -suffix 01 -version 704 -lifeline 3536 -checkpoint 15 -gpu 2 -gpu-vendor nvidia
02:44:23:WU04:FS03:Started FahCore on PID 11456
02:44:23:WU04:FS03:Core PID:11060
02:44:23:WU04:FS03:FahCore 0x17 started
02:44:23:WU02:FS01:Starting
02:44:23:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 02 -suffix 01 -version 704 -lifeline 3536 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
02:44:23:WU02:FS01:Started FahCore on PID 7328
02:44:23:WU03:FS02:0x17:*********************** Log Started 2014-12-21T02:44:23Z ***********************
02:44:23:WU03:FS02:0x17:Project: 9201 (Run 684, Clone 2, Gen 88)
02:44:23:WU03:FS02:0x17:Unit: 0x000000946652edc45399f0f1c7a87b25
02:44:23:WU03:FS02:0x17:CPU: 0x00000000000000000000000000000000
02:44:23:WU03:FS02:0x17:Machine: 2
02:44:23:WU03:FS02:0x17:Digital signatures verified
02:44:23:WU03:FS02:0x17:Folding@home GPU core17
02:44:23:WU03:FS02:0x17:Version 0.0.52
02:44:23:WU03:FS02:0x17:  Found a checkpoint file
02:44:23:WU04:FS03:0x17:*********************** Log Started 2014-12-21T02:44:23Z ***********************
02:44:23:WU04:FS03:0x17:Project: 9201 (Run 284, Clone 3, Gen 92)
02:44:23:WU04:FS03:0x17:Unit: 0x000000826652edc45399e13adea1da94
02:44:23:WU04:FS03:0x17:CPU: 0x00000000000000000000000000000000
02:44:23:WU04:FS03:0x17:Machine: 3
02:44:23:WU04:FS03:0x17:Digital signatures verified
02:44:23:WU04:FS03:0x17:Folding@home GPU core17
02:44:23:WU04:FS03:0x17:Version 0.0.52
02:44:23:WU04:FS03:0x17:  Found a checkpoint file
02:44:24:WU02:FS01:Core PID:10744
02:44:24:WU02:FS01:FahCore 0x18 started
02:44:24:WU01:FS00:Starting
02:44:24:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe -dir 01 -suffix 01 -version 704 -lifeline 3536 -checkpoint 15 -np 9
02:44:24:WU01:FS00:Started FahCore on PID 12124
02:44:24:WU01:FS00:Core PID:6768
02:44:24:WU01:FS00:FahCore 0xa3 started
02:44:24:Saving configuration to config.xml
02:44:24:<config>
02:44:24:  <!-- Network -->
02:44:24:  <proxy v=':8080'/>
02:44:24:
02:44:24:  <!-- Slot Control -->
02:44:24:  <power v='FULL'/>
02:44:24:
02:44:24:  <!-- User Information -->
02:44:24:  <passkey v='********************************'/>
02:44:24:  <team v='182919'/>
02:44:24:  <user v='Kornflake'/>
02:44:24:
02:44:24:  <!-- Folding Slots -->
02:44:24:  <slot id='0' type='CPU'/>
02:44:24:  <slot id='1' type='GPU'/>
02:44:24:  <slot id='2' type='GPU'/>
02:44:24:  <slot id='3' type='GPU'/>
02:44:24:</config>
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 9016 (Run 501, Clone 6, Gen 5) Bad WU?

Post by Joe_H »

There are multiple failures reported for this WU in the database, so it does appear to be bad.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Kornflake
Posts: 44
Joined: Mon Dec 10, 2012 7:29 pm

Re: 9016 (Run 501, Clone 6, Gen 5) Bad WU?

Post by Kornflake »

Thanks for the report. Is this something I should look up myself somewhere, or should I post it here?
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 9016 (Run 501, Clone 6, Gen 5) Bad WU?

Post by Joe_H »

The database search is restricted to Pande Group (PG) members and forum moderators. If you get an apparent WU failure, just report it here as you did for this one, and one of the moderators will check its status.
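
For anyone gathering the details for such a report, here is a minimal Python sketch that scans client log lines for a BAD_WORK_UNIT warning and prints the project, run, clone and gen in the same form as this thread's title. It is not an official tool. The function name `find_bad_work_units` and both regular expressions are my own, and they only assume the line formats visible in the log posted above; other client versions may print these lines differently.

```python
import re

# Assumed format, copied from the log above:
# 17:25:03:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
RETURNED = re.compile(
    r"^(?P<time>\d\d:\d\d:\d\d):WARNING:(?P<wu>WU\d+):(?P<slot>FS\d+):"
    r"FahCore returned: (?P<status>\w+) \((?P<code>\d+) = 0x[0-9a-fA-F]+\)"
)

# Assumed format of the "project:... run:... clone:... gen:..." fields that
# appear on the "Received Unit" and "Sending unit results" lines.
PRCG = re.compile(
    r"^\d\d:\d\d:\d\d:(?P<wu>WU\d+):(?P<slot>FS\d+):.*"
    r"project:(?P<project>\d+) run:(?P<run>\d+) "
    r"clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)


def find_bad_work_units(lines):
    """Return one dict per BAD_WORK_UNIT warning found in the given log lines.

    Each warning is paired with the PRCG most recently seen for the same
    WU/slot pair; if none was seen yet, the PRCG is reported as "unknown".
    """
    last_prcg = {}  # (wu, slot) -> formatted PRCG string
    failures = []
    for line in lines:
        line = line.strip()
        m = PRCG.match(line)
        if m:
            last_prcg[(m["wu"], m["slot"])] = (
                f"{m['project']} (Run {m['run']}, "
                f"Clone {m['clone']}, Gen {m['gen']})"
            )
            continue
        m = RETURNED.match(line)
        if m and m["status"] == "BAD_WORK_UNIT":
            failures.append({
                "time": m["time"],
                "wu": m["wu"],
                "slot": m["slot"],
                "code": int(m["code"]),
                "prcg": last_prcg.get((m["wu"], m["slot"]), "unknown"),
            })
    return failures


# Three lines taken from the log posted at the top of this thread.
sample = [
    "17:23:21:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR "
    "project:9016 run:501 clone:6 gen:5 core:0xa4 "
    "unit:0x0000000a664f2de4549de988b0744e63",
    "17:24:59:WU01:FS00:0xa4:mdrun returned 255",
    "17:25:03:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)",
]

for failure in find_bad_work_units(sample):
    print(failure["prcg"])  # prints: 9016 (Run 501, Clone 6, Gen 5)
```

To run it on a real log, pass the open file object (or a list of its lines) to `find_bad_work_units`. If the log starts after the unit was downloaded, the PRCG will show as "unknown", and you would need to read it from the core's own "Project:" line instead.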
sryckbos
Pande Group Member
Posts: 116
Joined: Wed Jun 26, 2013 10:23 pm

Re: 9016 (Run 501, Clone 6, Gen 5) Bad WU?

Post by sryckbos »

Thanks for the heads up. Definitely looks like a bad one. Sorry about that! Shouldn't be a problem any more.