EUE: "FAULTY project:8020 run:4 clone:388 gen:60" at 91%

GreyWhiskers
Posts: 660
Joined: Mon Oct 25, 2010 5:57 am
Hardware configuration: a) Main unit
Sandybridge in HAF922 w/200 mm side fan
--i7 2600K@4.2 GHz
--ASUS P8P67 DeluxeB3
--4GB ADATA 1600 RAM
--750W Corsair PS
--2Seagate Hyb 750&500 GB--WD Caviar Black 1TB
--EVGA 660GTX-Ti FTW - Signature 2 GPU@ 1241 Boost
--MSI GTX560Ti @900MHz
--Win7Home64; FAH V7.3.2; 327.23 drivers

b) 2004 HP a475c desktop, 1 core Pent 4 HT@3.2 GHz; Mem 2GB;HDD 160 GB;Zotac GT430PCI@900 MHz
WinXP SP3-32 FAH v7.3.6 301.42 drivers - GPU slot only

c) 2005 Toshiba M45-S551 laptop w/2 GB mem, 160GB HDD;Pent M 740 CPU @ 1.73 GHz
WinXP SP3-32 FAH v7.3.6 [Receiving Core A4 work units]
d) 2011 lappy-15.6"-1920x1080;i7-2860QM,2.5;IC Diamond Thermal Compound;GTX 560M 1,536MB u/c@700;16GB-1333MHz RAM;HDD:500GBHyb w/ 4GB SSD;Win7HomePrem64;320.18 drivers FAH 7.4.2ß
Location: Saratoga, California USA

Post by GreyWhiskers »

My first EUE post in a while.

[edit] corrected EUE time.
After nearly two days of folding :cry:, this work unit hit an EUE at 91% (19:16:59 in the log below), and the log flags it as "FAULTY". The partial results did upload, but the log doesn't say whether any partial credit was awarded. If this was, in fact, a faulty WU, 5238.87 points of partial credit would be welcome after almost two days.
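For reference, the 5238.87 figure is just 91% of the project's base credit of 5757 (the value shown in the HFM extract at the bottom of this post). The sketch below reproduces it, and also shows the result for the 0.9109 work fraction that the core reports in the log. It assumes partial credit scales linearly with the fraction of work done; that is an assumption, and nothing in the log confirms how (or whether) the server actually awards it. The function name is made up for illustration.

```python
# Base credit for project 8020, as listed in the HFM extract in this post.
BASE_CREDIT = 5757


def partial_credit(base_credit, work_fraction):
    """Assumed linear scaling: credit proportional to the fraction of work done.

    This is a guess at the rule, not the server's documented behaviour.
    """
    return base_credit * work_fraction


# 91%, as shown on the last progress line before the EUE.
print(round(partial_credit(BASE_CREDIT, 0.91), 2))    # 5238.87

# 0.9109, as reported on the "Work fraction" log line.
print(round(partial_credit(BASE_CREDIT, 0.9109), 2))  # 5244.05
```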

FYI, at the bottom of the post I've included an extract from the HFM benchmarks viewer for project 8020 on my three Fermi cards: the new GT430, a workhorse GTX560Ti, and a laptop GTX560M.

This happened on my new GT430 (the PCI, non-Express bus version), which I just acquired to replace the worn-out AGP bus ATI HD4670 I had been folding on in my old HP for two years.

The log doesn't show it, but the card is running Nvidia driver set 280.26 and has seemed stable at a moderate 750 MHz overclock (vs. 700 stock). It has been producing about 2800-2900 PPD and has successfully completed a series of 13 Core 15 work units from projects 8020 (real horse-chokers for this GPU), 8008, and 8010 since June 11th.

I did notice that the GPU dropped down to 2D clocks when the EUE occurred. I will have to reboot the computer as soon as I finish this post, since rebooting is the only way I've found to restore the 3D clocks. (By the way, since I'm running Win XP, the Nvidia driver doesn't have the Power Management Mode control panel setting that the Win7 version has.) I do have the WinXP screen savers disabled, and I just turn the monitor on and off when I need to use the computer for something other than folding.

Code:

*********************** Log Started 2012-07-09T00:05:11Z ***********************

00:05:11:************************* Folding@home Client *************************
00:05:11:      Website: http://folding.stanford.edu/
00:05:11:    Copyright: (c) 2009-2012 Stanford University
00:05:11:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
00:05:11:         Args: --lifeline 4292 --command-port=36330
00:05:11:       Config: C:/Documents and Settings/Owner/Application
00:05:11:               Data/FAHClient/config.xml
00:05:11:******************************** Build ********************************
00:05:11:      Version: 7.1.52
00:05:11:         Date: Mar 20 2012
00:05:11:         Time: 19:37:42
00:05:11:      SVN Rev: 3515
00:05:11:       Branch: fah/trunk/client
00:05:11:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
00:05:11:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
00:05:11:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
00:05:11:     Platform: win32 XP
00:05:11:         Bits: 32
00:05:11:         Mode: Release
00:05:11:******************************* System ********************************
00:05:11:          CPU: Intel(R) Pentium(R) 4 CPU 3.20GHz
00:05:11:       CPU ID: GenuineIntel Family 15 Model 2 Stepping 9
00:05:11:         CPUs: 2
00:05:11:       Memory: 2.00GiB
00:05:11:  Free Memory: 1.02GiB
00:05:11:      Threads: WINDOWS_THREADS
00:05:11:   On Battery: false
00:05:11:   UTC offset: -7
00:05:11:          PID: 4276
00:05:11:          CWD: C:/Documents and Settings/Owner/Application Data/FAHClient
00:05:11:           OS: Microsoft Windows XP Service Pack 3
00:05:11:      OS Arch: X86
00:05:11:         GPUs: 1
00:05:11:        GPU 0: FERMI:1 GF108 [GeForce GT 430]
00:05:11:         CUDA: 2.1
00:05:11:  CUDA Driver: 4000
00:05:11:Win32 Service: false

...

18:01:55:Saving configuration to config.xml
18:01:55:<config>
18:01:55:  <!-- Folding Slot Configuration -->
18:01:55:  <extra-core-args v='-forceasm'/>
18:01:55:
18:01:55:  <!-- Logging -->
18:01:55:  <log-rotate-max v='60'/>
18:01:55:
18:01:55:  <!-- Network -->
18:01:55:  <proxy v=':8080'/>
18:01:55:
18:01:55:  <!-- Remote Command Server -->
18:01:55:  <password v='********************************'/>
18:01:55:
18:01:55:  <!-- User Information -->
18:01:55:  <passkey v='********************************'/>
18:01:55:  <user v='GreyWhiskers'/>
18:01:55:
18:01:55:  <!-- Work Unit Control -->
18:01:55:  <next-unit-percentage v='100'/>
18:01:55:
18:01:55:  <!-- Folding Slots -->
18:01:55:  <slot id='1' type='GPU'>
18:01:55:    <client-type v='beta'/>
18:01:55:    <core-priority v='low'/>
18:01:55:  </slot>
18:01:55:  <slot id='0' type='UNIPROCESSOR'/>
18:01:55:  <slot id='2' type='UNIPROCESSOR'/>
18:01:55:</config>

...

09:34:25:FS01:Unpaused
09:34:25:WU03:FS01:Starting
09:34:25:WU03:FS01:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" "C:/Documents and Settings/Owner/Application Data/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/NVIDIA/Fermi/beta/Core_15.fah/FahCore_15.exe" -dir 03 -suffix 01 -version 701 -lifeline 4276 -checkpoint 15 -gpu 0 -forceasm
09:34:25:WU03:FS01:Started FahCore on PID 2696
09:34:25:WU03:FS01:Core PID:3644
09:34:25:WU03:FS01:FahCore 0x15 started
09:34:27:WU03:FS01:0x15:
09:34:27:WU03:FS01:0x15:*------------------------------*
09:34:27:WU03:FS01:0x15:Folding@Home GPU Core
09:34:27:WU03:FS01:0x15:Version                2.25 (Wed May 9 17:03:01 EDT 2012)
09:34:27:WU03:FS01:0x15:Build host             AmoebaRemote
09:34:27:WU03:FS01:0x15:Board Type             NVIDIA/CUDA
09:34:27:WU03:FS01:0x15:Core                   15
09:34:27:WU03:FS01:0x15:
09:34:27:WU03:FS01:0x15:Window's signal control handler registered.
09:34:27:WU03:FS01:0x15:Preparing to commence simulation
09:34:27:WU03:FS01:0x15:- Assembly optimizations manually forced on.
09:34:27:WU03:FS01:0x15:- Not checking prior termination.
09:34:27:WU03:FS01:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
09:34:27:WU03:FS01:0x15:- Expanded 146012 -> 660994 (decompressed 452.6 percent)
09:34:27:WU03:FS01:0x15:Called DecompressByteArray: compressed_data_size=146012 data_size=660994, decompressed_data_size=660994 diff=0
09:34:27:WU03:FS01:0x15:- Digital signature verified
09:34:27:WU03:FS01:0x15:
09:34:27:WU03:FS01:0x15:Project: 8020 (Run 4, Clone 388, Gen 60)
09:34:27:WU03:FS01:0x15:
09:34:27:WU03:FS01:0x15:Assembly optimizations on if available.
09:34:27:WU03:FS01:0x15:Entering M.D.
09:34:29:WU03:FS01:0x15:Will resume from checkpoint file 03/wudata_01.ckp
09:34:29:WU03:FS01:0x15:Tpr hash 03/wudata_01.tpr:  2721984265 3637194864 3902196694 1657303368 878376608
09:34:29:WU03:FS01:0x15:GPU device id=0
09:34:30:WU03:FS01:0x15:Working on Gromacs Runs On Most of All Computer Systems
09:34:30:WU03:FS01:0x15:Client config unavailable.
09:34:30:WU03:FS01:0x15:Starting GUI Server
09:35:45:WU03:FS01:0x15:Resuming from checkpoint
09:35:45:WU03:FS01:0x15:fcCheckPointResume: retreived and current tpr file hash:
09:35:45:WU03:FS01:0x15:   0   2721984265   2721984265
09:35:45:WU03:FS01:0x15:   1   3637194864   3637194864
09:35:45:WU03:FS01:0x15:   2   3902196694   3902196694
09:35:45:WU03:FS01:0x15:   3   1657303368   1657303368
09:35:45:WU03:FS01:0x15:   4    878376608    878376608
09:35:45:WU03:FS01:0x15:fcCheckPointResume: file hashes same.
09:35:45:WU03:FS01:0x15:fcCheckPointResume: state restored.
09:35:45:WU03:FS01:0x15:fcCheckPointResume: name 03/wudata_01.log Verified 03/wudata_01.log
09:35:46:WU03:FS01:0x15:fcCheckPointResume: name 03/wudata_01.trr Verified 03/wudata_01.trr
09:35:46:WU03:FS01:0x15:fcCheckPointResume: name 03/wudata_01.xtc Verified 03/wudata_01.xtc
09:35:46:WU03:FS01:0x15:fcCheckPointResume: name 03/wudata_01.edr Verified 03/wudata_01.edr
09:35:46:WU03:FS01:0x15:fcCheckPointResume: state restored 2
09:35:46:WU03:FS01:0x15:Resumed from checkpoint
09:35:46:WU03:FS01:0x15:Setting checkpoint frequency: 250000
09:35:46:WU03:FS01:0x15:Completed   5500001 out of 25000000 steps (22%).

...
18:45:02:WU03:FS01:0x15:Completed  22500000 out of 25000000 steps (90%).
******************************** Date: 10/07/12 ********************************
19:14:25:WU03:FS01:0x15:Completed  22750000 out of 25000000 steps (91%).
19:16:59:WU03:FS01:0x15:Run: exception thrown in GuardedRun -- cannot continue further.
19:16:59:WU03:FS01:0x15:Going to send back what have done -- stepsTotalG=25000000
19:16:59:WU03:FS01:0x15:Work fraction=0.9109 steps=25000000.
19:17:03:WU03:FS01:0x15:logfile size=27945 infoLength=27945 edr=0 trr=23
19:17:03:WU03:FS01:0x15:+ Opened results file
19:17:03:WU03:FS01:0x15:- Writing 28481 bytes of core data to disk...
19:17:09:WU03:FS01:0x15:Done: 27969 -> 6421 (compressed to 22.9 percent)
19:17:09:WU03:FS01:0x15:  ... Done.
19:17:10:WU03:FS01:0x15:DeleteFrameFiles: successfully deleted file=03/wudata_01.ckp
19:17:14:WU03:FS01:0x15:
19:17:14:WU03:FS01:0x15:Folding@home Core Shutdown: EARLY_UNIT_END
19:17:15:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:17:15:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY project:8020 run:4 clone:388 gen:60 core:0x15 unit:0x000000466953ee2f4f967a17927e29ad
19:17:16:WU03:FS01:Uploading 6.77KiB to 171.67.108.143
19:17:16:WU03:FS01:Connecting to 171.67.108.143:8080
19:17:16:WU00:FS01:Connecting to assign-GPU.stanford.edu:80
19:17:16:WU00:FS01:News: Welcome to Folding@Home
19:17:16:WU00:FS01:Assigned to work server 171.67.108.143
19:17:16:WU03:FS01:Upload complete
19:17:16:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:"GF108 [GeForce GT 430]" from 171.67.108.143
19:17:16:WU00:FS01:Connecting to 171.67.108.143:8080
19:17:16:WU03:FS01:Server responded WORK_ACK (400)
19:17:17:WU03:FS01:Cleaning up
19:17:21:WU00:FS01:Downloading 142.69KiB
19:17:21:WU00:FS01:Download complete
19:17:21:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:OK project:8020 run:5 clone:25 gen:99 core:0x15 unit:0x000000746953ee2f4f967a2d64b6ceb4
19:17:24:WU00:FS01:Starting
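
For anyone who wants to pull the failing project/run/clone/gen out of a long log without scrolling, here is a minimal Python sketch. It relies only on the `error:… project:… run:… clone:… gen:…` field layout visible in the log lines above; the function names are hypothetical, and the sketch is not part of FAHClient or HFM.

```python
import re

# Field layout copied from the "Sending unit results" / "Received Unit" lines above.
UNIT_RE = re.compile(
    r"error:(?P<error>\w+) project:(?P<project>\d+) run:(?P<run>\d+) "
    r"clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)


def parse_unit(line):
    """Return the error/project/run/clone/gen fields of a log line, or None."""
    match = UNIT_RE.search(line)
    if match is None:
        return None
    # Keep the error code as text; convert the numeric fields to int.
    return {
        key: (value if key == "error" else int(value))
        for key, value in match.groupdict().items()
    }


def faulty_units(lines):
    """Yield every parsed unit whose error field is anything other than OK."""
    for line in lines:
        unit = parse_unit(line)
        if unit is not None and unit["error"] != "OK":
            yield unit


sample = [
    "19:17:15:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY "
    "project:8020 run:4 clone:388 gen:60 core:0x15 "
    "unit:0x000000466953ee2f4f967a17927e29ad",
    "19:17:21:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:OK "
    "project:8020 run:5 clone:25 gen:99 core:0x15 "
    "unit:0x000000746953ee2f4f967a2d64b6ceb4",
]

for unit in faulty_units(sample):
    print(unit)
```

Run against the two sample lines, only the FAULTY unit (project 8020, run 4, clone 388, gen 60) is reported; the freshly downloaded unit with `error:OK` is skipped.
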
HFM Benchmarks Viewer for Proj 8020

Code:

 Project ID: 8020
 Core: OPENMMGPU
 Credit: 5757
 
 Name: GT430 (PCI Non-express) on Pent 4/HT 3.2 GHz desktop Win XP
 Number of Frames Observed: 300

 Min. Time / Frame : 00:22:43 - 3,649.3 PPD
 Avg. Time / Frame : 00:29:05 - 2,850.5 PPD
 Cur. Time / Frame : 00:29:05 - 2,850.5 PPD

====================================
FYI: 
 Name: GTX560M Underclocked to @ 700 MHz on Sager8150 Laptop (Win7) 
 Number of Frames Observed: 300
 Min. Time / Frame : 00:13:19 - 6,225.3 PPD
 Avg. Time / Frame : 00:14:08 - 5,865.6 PPD
 Cur. Time / Frame : 00:14:34 - 5,691.1 PPD

 Name: GTX560Ti @ 900 MHz on i7 2600K DigiStorm desktop (Win7)
 Path: Al-pc-36330
 Number of Frames Observed: 300

 Min. Time / Frame : 00:05:27 - 15,211.2 PPD
 Avg. Time / Frame : 00:05:27 - 15,211.2 PPD
 Cur. Time / Frame : 00:05:28 - 15,164.8 PPD
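
The PPD figures above can be checked from the time per frame and the 5757 credit. The sketch below assumes a work unit is 100 frames (the log reports progress in 1% steps) and that the credit is a fixed value with no bonus; both are assumptions, and this is a reconstruction of the arithmetic rather than HFM's actual code.

```python
SECONDS_PER_DAY = 86400


def tpf_seconds(tpf):
    """Convert an HH:MM:SS time-per-frame string to seconds."""
    hours, minutes, seconds = (int(part) for part in tpf.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def ppd(credit, tpf, frames=100):
    """Points per day for a fixed-credit work unit of `frames` frames (assumed 100)."""
    seconds_per_work_unit = frames * tpf_seconds(tpf)
    return credit * SECONDS_PER_DAY / seconds_per_work_unit


print(round(ppd(5757, "00:29:05"), 1))  # GT430 average:    2850.5
print(round(ppd(5757, "00:14:08"), 1))  # GTX560M average:  5865.6
print(round(ppd(5757, "00:05:27"), 1))  # GTX560Ti average: 15211.2
```

These match the HFM numbers to one decimal place.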
Last edited by GreyWhiskers on Tue Jul 10, 2012 11:54 pm, edited 1 time in total.
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona

Re: EUE: "FAULTY project:8020 run:4 clone:388 gen:60" at 91%

Post by 7im »

Please upgrade to 301.xx drivers.