3 failed 9679

Moderators: Site Moderators, FAHC Science Team

rwh202
Posts: 425
Joined: Mon Nov 15, 2010 8:51 pm
Hardware configuration: 8x GTX 1080
3x GTX 1080 Ti
3x GTX 1060
Various other bits and pieces
Location: South Coast, UK

3 failed 9679

Post by rwh202 »

I was just going through my HFM log and there were 3 failed WUs in the 3000 or so logged recently - all 3 failures were project 9679 (46 from this project have completed successfully, including some previous gens of the same clones):

9679 (Run 0, Clone 47, Gen 95) (factory OC GTX 1060 Win 10 Pro)
9679 (Run 1, Clone 43, Gen 166) (stock GTX 980 Linux)
9679 (Run 0, Clone 45, Gen 178) (factory OC GTX 980 Linux)

Maybe this is a particularly sensitive project and my 99.9% successful completion is probably good enough, but if there's something to bring it up to 100% then I'm all ears!
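As a rough check on those figures, the two rates can be worked out directly. The counts come from the post above ("3000 or so" logged, 3 failures, 46 successful returns for project 9679); the variable names are only illustrative, and the total of 3000 is approximate.

```python
# Counts quoted in the post above; "3000 or so" is an approximation.
total_logged = 3000
failed = 3
p9679_completed = 46

# Overall success rate across everything in the HFM log.
overall_success = 1 - failed / total_logged

# Failure rate within project 9679 alone.
p9679_total = p9679_completed + failed
p9679_failure = failed / p9679_total

print(f"overall success rate: {overall_success:.1%}")
print(f"project 9679 failure rate: {p9679_failure:.1%} ({failed} of {p9679_total})")
```

On these numbers the overall rate is about 99.9%, while roughly 6% of the 9679 units failed, which is why the project stands out in the log.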

Log for one example:

15:59:30:WU01:FS02:Connecting to 171.67.108.45:80
15:59:31:WU01:FS02:Assigned to work server 171.67.108.155
15:59:31:WU01:FS02:Requesting new work unit for slot 02: RUNNING gpu:1:GM204 [GeForce GTX 980] from 171.67.108.155
15:59:31:WU01:FS02:Connecting to 171.67.108.155:8080
15:59:32:WU01:FS02:Downloading 390.26KiB
15:59:33:WU01:FS02:Download complete
15:59:33:WU01:FS02:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9679 run:0 clone:45 gen:178 core:0x18 unit:0x000000c2ab436c9b56de69bf1cb223eb
16:05:26:WU01:FS02:Starting
16:05:26:WU01:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18 -dir 01 -suffix 01 -version 704 -lifeline 1363 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
16:05:26:WU01:FS02:Started FahCore on PID 29176
16:05:26:WU01:FS02:Core PID:29180
16:05:26:WU01:FS02:FahCore 0x18 started
16:05:27:WU01:FS02:0x18:*********************** Log Started 2016-09-19T16:05:26Z ***********************
16:05:27:WU01:FS02:0x18:Project: 9679 (Run 0, Clone 45, Gen 178)
16:05:27:WU01:FS02:0x18:Unit: 0x000000c2ab436c9b56de69bf1cb223eb
16:05:27:WU01:FS02:0x18:CPU: 0x00000000000000000000000000000000
16:05:27:WU01:FS02:0x18:Machine: 2
16:05:27:WU01:FS02:0x18:Reading tar file core.xml
16:05:27:WU01:FS02:0x18:Reading tar file integrator.xml
16:05:27:WU01:FS02:0x18:Reading tar file state.xml
16:05:27:WU01:FS02:0x18:Reading tar file system.xml
16:05:27:WU01:FS02:0x18:Digital signatures verified
16:05:27:WU01:FS02:0x18:Folding@home GPU core18
16:05:27:WU01:FS02:0x18:Version 0.0.4
16:05:28:WU01:FS02:0x18:Completed 0 out of 2000000 steps (0%)
16:05:28:WU01:FS02:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
16:05:41:WU01:FS02:0x18:Completed 20000 out of 2000000 steps (1%)
16:05:53:WU01:FS02:0x18:Completed 40000 out of 2000000 steps (2%)
16:06:06:WU01:FS02:0x18:Completed 60000 out of 2000000 steps (3%)
16:06:19:WU01:FS02:0x18:Completed 80000 out of 2000000 steps (4%)
16:06:31:WU01:FS02:0x18:Completed 100000 out of 2000000 steps (5%)
16:06:45:WU01:FS02:0x18:Completed 120000 out of 2000000 steps (6%)
16:06:58:WU01:FS02:0x18:Completed 140000 out of 2000000 steps (7%)
16:07:10:WU01:FS02:0x18:Completed 160000 out of 2000000 steps (8%)
16:07:23:WU01:FS02:0x18:Completed 180000 out of 2000000 steps (9%)
16:07:36:WU01:FS02:0x18:Completed 200000 out of 2000000 steps (10%)
16:07:49:WU01:FS02:0x18:Completed 220000 out of 2000000 steps (11%)
16:08:01:WU01:FS02:0x18:Completed 240000 out of 2000000 steps (12%)
16:08:14:WU01:FS02:0x18:Completed 260000 out of 2000000 steps (13%)
16:08:27:WU01:FS02:0x18:Completed 280000 out of 2000000 steps (14%)
16:08:39:WU01:FS02:0x18:Completed 300000 out of 2000000 steps (15%)
16:08:52:WU01:FS02:0x18:Completed 320000 out of 2000000 steps (16%)
16:09:05:WU01:FS02:0x18:Completed 340000 out of 2000000 steps (17%)
16:09:18:WU01:FS02:0x18:Completed 360000 out of 2000000 steps (18%)
16:09:30:WU01:FS02:0x18:Completed 380000 out of 2000000 steps (19%)
16:09:43:WU01:FS02:0x18:Completed 400000 out of 2000000 steps (20%)
16:09:56:WU01:FS02:0x18:Completed 420000 out of 2000000 steps (21%)
16:10:09:WU01:FS02:0x18:Completed 440000 out of 2000000 steps (22%)
16:10:21:WU01:FS02:0x18:Completed 460000 out of 2000000 steps (23%)
16:10:34:WU01:FS02:0x18:Completed 480000 out of 2000000 steps (24%)
16:10:47:WU01:FS02:0x18:Completed 500000 out of 2000000 steps (25%)
16:10:47:WU01:FS02:0x18:Bad State detected... attempting to resume from last good checkpoint
16:10:59:WU01:FS02:0x18:Completed 420000 out of 2000000 steps (21%)
16:11:12:WU01:FS02:0x18:Completed 440000 out of 2000000 steps (22%)
16:11:24:WU01:FS02:0x18:Completed 460000 out of 2000000 steps (23%)
16:11:37:WU01:FS02:0x18:Completed 480000 out of 2000000 steps (24%)
16:11:50:WU01:FS02:0x18:Completed 500000 out of 2000000 steps (25%)
16:11:50:WU01:FS02:0x18:Bad State detected... attempting to resume from last good checkpoint
16:12:02:WU01:FS02:0x18:Completed 420000 out of 2000000 steps (21%)
16:12:15:WU01:FS02:0x18:Completed 440000 out of 2000000 steps (22%)
16:12:27:WU01:FS02:0x18:Completed 460000 out of 2000000 steps (23%)
16:12:40:WU01:FS02:0x18:Completed 480000 out of 2000000 steps (24%)
16:12:53:WU01:FS02:0x18:Completed 500000 out of 2000000 steps (25%)
16:12:53:WU01:FS02:0x18:Bad State detected... attempting to resume from last good checkpoint
16:12:53:WU01:FS02:0x18:Max number of retries reached. Aborting.
16:12:53:WU01:FS02:0x18:ERROR:exception: Max Retries Reached
16:12:53:WU01:FS02:0x18:Saving result file logfile_01.txt
16:12:53:WU01:FS02:0x18:Saving result file log.txt
16:12:53:WU01:FS02:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
16:12:53:WARNING:WU01:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:12:53:WU01:FS02:Sending unit results: id:01 state:SEND error:FAULTY project:9679 run:0 clone:45 gen:178 core:0x18 unit:0x000000c2ab436c9b56de69bf1cb223eb
16:12:53:WU01:FS02:Uploading 2.76KiB to 171.67.108.155
16:12:53:WU01:FS02:Connecting to 171.67.108.155:8080
16:12:55:WU01:FS02:Upload complete
16:12:56:WU01:FS02:Server responded WORK_ACK (400)
16:12:56:WU01:FS02:Cleaning up
System:

*********************** Log Started 2016-09-14T11:37:24Z ***********************
11:37:24:************************* Folding@home Client *************************
11:37:24:    Website: http://folding.stanford.edu/
11:37:24:  Copyright: (c) 2009-2014 Stanford University
11:37:24:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
11:37:24:       Args: --child --lifeline 1361 /etc/fahclient/config.xml --run-as
11:37:24:             fahclient --pid-file=/var/run/fahclient.pid --daemon
11:37:24:     Config: /etc/fahclient/config.xml
11:37:24:******************************** Build ********************************
11:37:24:    Version: 7.4.4
11:37:24:       Date: Mar 4 2014
11:37:24:       Time: 12:02:38
11:37:24:    SVN Rev: 4130
11:37:24:     Branch: fah/trunk/client
11:37:24:   Compiler: GNU 4.4.7
11:37:24:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
11:37:24:             -fno-unsafe-math-optimizations -msse2
11:37:24:   Platform: linux2 3.2.0-1-amd64
11:37:24:       Bits: 64
11:37:24:       Mode: Release
11:37:24:******************************* System ********************************
11:37:24:        CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
11:37:24:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
11:37:24:       CPUs: 8
11:37:24:     Memory: 7.78GiB
11:37:24:Free Memory: 7.21GiB
11:37:24:    Threads: POSIX_THREADS
11:37:24: OS Version: 3.13
11:37:24:Has Battery: false
11:37:24: On Battery: false
11:37:24: UTC Offset: 1
11:37:24:        PID: 1363
11:37:24:        CWD: /var/lib/fahclient
11:37:24:         OS: Linux 3.13.0-37-generic x86_64
11:37:24:    OS Arch: AMD64
11:37:24:       GPUs: 2
11:37:24:      GPU 0: NVIDIA:5 GM204 [GeForce GTX 980]
11:37:24:      GPU 1: NVIDIA:5 GM204 [GeForce GTX 980]
11:37:24:       CUDA: 5.2
11:37:24:CUDA Driver: 7000
11:37:24:***********************************************************************
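The failure pattern in the work-unit log above (three "Bad State detected" rewinds at the same step, then an abort) can be pulled out of a client log mechanically. The following is a minimal sketch, assuming the line formats shown in that log; `summarize` and the regular expressions are hypothetical helpers, not part of FAHClient.

```python
import re

BAD_STATE = "Bad State detected"
# Matches e.g. "FahCore returned: BAD_WORK_UNIT (114 = 0x72)"
RETURNED = re.compile(r"FahCore returned: (\w+) \((\d+) = (0x[0-9a-fA-F]+)\)")
# Matches e.g. "Completed 500000 out of 2000000 steps (25%)"
PROGRESS = re.compile(r"Completed (\d+) out of (\d+) steps")

def summarize(lines):
    """Count bad-state rewinds and report the furthest step and exit status."""
    summary = {"bad_states": 0, "max_step": 0, "status": None, "code": None}
    for line in lines:
        if BAD_STATE in line:
            summary["bad_states"] += 1
        progress = PROGRESS.search(line)
        if progress:
            summary["max_step"] = max(summary["max_step"], int(progress.group(1)))
        returned = RETURNED.search(line)
        if returned:
            name, dec, hexa = returned.groups()
            # The client prints the code in decimal and hex; check they agree.
            assert int(dec) == int(hexa, 16)
            summary["status"] = name
            summary["code"] = int(dec)
    return summary

# A few lines copied from the log above.
sample = [
    "16:10:47:WU01:FS02:0x18:Completed 500000 out of 2000000 steps (25%)",
    "16:10:47:WU01:FS02:0x18:Bad State detected... attempting to resume from last good checkpoint",
    "16:10:59:WU01:FS02:0x18:Completed 420000 out of 2000000 steps (21%)",
    "16:11:50:WU01:FS02:0x18:Bad State detected... attempting to resume from last good checkpoint",
    "16:12:53:WU01:FS02:0x18:Bad State detected... attempting to resume from last good checkpoint",
    "16:12:53:WARNING:WU01:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)",
]
result = summarize(sample)
print(result)
```

Running the same function over a whole log file makes it easy to see whether failures cluster on one slot or one project.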
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 3 failed 9679

Post by Joe_H »

There is no particular pattern in the results for these WUs. The first WU shows multiple failures in the database and no successful return. It may have been marked bad by the assignment code; the last return was over a week ago.

The second WU was completed successfully by the second system in the database.

The third had several failures reported before being completed successfully by a folder.

So beyond watching to see if a particular GPU has a higher failure rate, possibly due to differences in cooling in a multi-GPU installation, I cannot think of any suggestions that would improve the success rate.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
rwh202

Re: 3 failed 9679

Post by rwh202 »

Thanks for checking on the WUs.

It's kind of reassuring that others have had failures with the same WUs, but then it seems odd that others can complete them OK. I'm guessing that minor differences in code between the Linux and Windows clients, plus differences in hardware, could result in the error-checking code producing 'Bad State detected' on one type of system but not another?
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 3 failed 9679

Post by bruce »

There are two types of error checking. One happens after a successful upload, and such errors are pretty rare once you allow for somebody intentionally trying to submit falsified data. (That's almost unheard of here at FAH.) The other type is what you're talking about.

Small floating-point round-off errors do accumulate, but they can be compared to a value calculated by a different method. If the difference becomes too large, something serious is wrong and the WU is aborted. This takes care of calculations that become unstable due to heat or overclocking -- plus the occasional cosmic ray changing a value in VRAM. (Consumer-grade GPUs do not include memory parity checking.)

Then there's the occasional "bad WU" which reaches some unrealistic shape from which the software cannot proceed -- probably including your first WU. (Repeated failures get discarded, too.)
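To illustrate the general idea of comparing an accumulated value against one calculated by a different method, here is a toy sketch. It is not the FahCore's actual check, which this thread does not describe in detail: it assumes a single-precision running sum as the "fast" value and `math.fsum` as the independent reference, and the function names and tolerances are made up for the example.

```python
import math
import struct

def to_float32(x):
    """Round a Python float (double precision) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]

def accumulated_drift(values):
    """Relative gap between a single-precision running sum and a careful reference sum."""
    running = 0.0
    for v in values:
        # Emulate single-precision accumulation, rounding after every addition.
        running = to_float32(running + to_float32(v))
    reference = math.fsum(values)  # independent, accurately rounded sum
    return abs(running - reference) / max(abs(reference), 1e-30)

def is_bad_state(values, tolerance):
    """Flag the run when the two methods disagree by more than the tolerance."""
    return accumulated_drift(values) > tolerance

steps = [0.1] * 100_000
print(accumulated_drift(steps))                # small but non-zero drift
print(is_bad_state(steps, tolerance=0.5))      # ordinary round-off stays within a loose bound
print(is_bad_state(steps, tolerance=1e-12))    # an unrealistically tight bound trips
```

The point of the sketch is only that normal round-off stays inside a sensible tolerance, while a corrupted value (from heat, overclocking or a flipped bit) would push the two results far apart.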
Duce H_K_
Posts: 113
Joined: Mon Nov 09, 2015 3:52 pm
Hardware configuration: MoBo•Gigabye X99 UD4-CF F24
CPU•<UPD 20.05.2023>Xeon V3 2680 V4 14c28t 35Mb L3
RAM•DDR4 Hynix 2133 CL14 4*16 DualRank Quad channel
HDD•ST1000DM003 Sata3 NCQ
GFX•GT220
PSU•Chieftec GPS750C 80+ Gold after repair
Cooling•Air 2xDeepCool UF120

Internet•200Mbit/s FTTB↓ white dynamic, ERTH, router RB951G-2HnD

Other•Redmi 7A <runs WUProp :-/>
Location: Russia

Re: 3 failed 9679

Post by Duce H_K_ »

There is already a topic about 9679. My previous WU from this project completed OK on a manually overclocked GTX 970:

06:20:37:WU00:FS00:Connecting to 171.67.108.45:80
06:20:38:WU00:FS00:Assigned to work server 171.67.108.155
06:20:38:WU00:FS00:Requesting new work unit for slot 00: READY gpu:1:GM204 [GeForce GTX 970] from 171.67.108.155
06:20:38:WU00:FS00:Connecting to 171.67.108.155:8080
06:20:39:WU00:FS00:Downloading 388.85KiB
06:20:43:WU00:FS00:Download complete
06:20:43:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9679 run:1 clone:72 gen:79 core:0x18 unit:0x00000058ab436c9b56de69bff684d949
06:20:43:WU00:FS00:Starting
06:20:43:WU00:FS00:Running FahCore: C:\NO_UAC\FAHClient_7.4.15_x64/FAHCoreWrapper.exe E:\Docbase\FaH-workdir\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 00 -suffix 01 -version 704 -lifeline 4728 -checkpoint 6 -opencl-platform 1 -gpu-vendor nvidia -gpu 0 -forceasm -twait=80
06:20:44:WU00:FS00:Started FahCore on PID 3428
06:20:45:WU00:FS00:Core PID:5948
06:20:45:WU00:FS00:FahCore 0x18 started
06:20:49:WU00:FS00:0x18:*********************** Log Started 2016-10-10T06:20:49Z ***********************
06:20:49:WU00:FS00:0x18:Project: 9679 (Run 1, Clone 72, Gen 79)
06:20:49:WU00:FS00:0x18:Unit: 0x00000058ab436c9b56de69bff684d949
06:20:49:WU00:FS00:0x18:CPU: 0x00000000000000000000000000000000
06:20:49:WU00:FS00:0x18:Machine: 0
06:20:49:WU00:FS00:0x18:Reading tar file core.xml
06:20:49:WU00:FS00:0x18:Reading tar file integrator.xml
06:20:49:WU00:FS00:0x18:Reading tar file state.xml
06:20:49:WU00:FS00:0x18:Reading tar file system.xml
06:20:49:WU00:FS00:0x18:Digital signatures verified
06:20:49:WU00:FS00:0x18:Folding@home GPU core18
06:20:49:WU00:FS00:0x18:Version 0.0.4
06:21:02:WU00:FS00:0x18:Completed 0 out of 2000000 steps (0%)
06:21:02:WU00:FS00:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
06:21:19:WU00:FS00:0x18:Completed 20000 out of 2000000 steps (1%)
06:21:35:WU00:FS00:0x18:Completed 40000 out of 2000000 steps (2%)
06:21:51:WU00:FS00:0x18:Completed 60000 out of 2000000 steps (3%)
06:22:07:WU00:FS00:0x18:Completed 80000 out of 2000000 steps (4%)
06:22:23:WU00:FS00:0x18:Completed 100000 out of 2000000 steps (5%)
06:22:40:WU00:FS00:0x18:Completed 120000 out of 2000000 steps (6%)
06:22:56:WU00:FS00:0x18:Completed 140000 out of 2000000 steps (7%)
06:23:12:WU00:FS00:0x18:Completed 160000 out of 2000000 steps (8%)
06:23:28:WU00:FS00:0x18:Completed 180000 out of 2000000 steps (9%)
06:23:44:WU00:FS00:0x18:Completed 200000 out of 2000000 steps (10%)
06:24:02:WU00:FS00:0x18:Completed 220000 out of 2000000 steps (11%)
06:24:18:WU00:FS00:0x18:Completed 240000 out of 2000000 steps (12%)
06:24:34:WU00:FS00:0x18:Completed 260000 out of 2000000 steps (13%)
06:24:49:WU00:FS00:0x18:Completed 280000 out of 2000000 steps (14%)
06:25:05:WU00:FS00:0x18:Completed 300000 out of 2000000 steps (15%)
06:25:23:WU00:FS00:0x18:Completed 320000 out of 2000000 steps (16%)
06:25:39:WU00:FS00:0x18:Completed 340000 out of 2000000 steps (17%)
06:25:55:WU00:FS00:0x18:Completed 360000 out of 2000000 steps (18%)
06:26:11:WU00:FS00:0x18:Completed 380000 out of 2000000 steps (19%)
06:26:26:WU00:FS00:0x18:Completed 400000 out of 2000000 steps (20%)
06:26:43:WU00:FS00:0x18:Completed 420000 out of 2000000 steps (21%)
06:26:59:WU00:FS00:0x18:Completed 440000 out of 2000000 steps (22%)
06:27:15:WU00:FS00:0x18:Completed 460000 out of 2000000 steps (23%)
06:27:31:WU00:FS00:0x18:Completed 480000 out of 2000000 steps (24%)
06:27:46:WU00:FS00:0x18:Completed 500000 out of 2000000 steps (25%)
06:28:03:WU00:FS00:0x18:Completed 520000 out of 2000000 steps (26%)
06:28:19:WU00:FS00:0x18:Completed 540000 out of 2000000 steps (27%)
06:28:35:WU00:FS00:0x18:Completed 560000 out of 2000000 steps (28%)
06:28:51:WU00:FS00:0x18:Completed 580000 out of 2000000 steps (29%)
06:29:06:WU00:FS00:0x18:Completed 600000 out of 2000000 steps (30%)
06:29:24:WU00:FS00:0x18:Completed 620000 out of 2000000 steps (31%)
06:29:40:WU00:FS00:0x18:Completed 640000 out of 2000000 steps (32%)
06:29:55:WU00:FS00:0x18:Completed 660000 out of 2000000 steps (33%)
06:30:11:WU00:FS00:0x18:Completed 680000 out of 2000000 steps (34%)
06:30:27:WU00:FS00:0x18:Completed 700000 out of 2000000 steps (35%)
06:30:44:WU00:FS00:0x18:Completed 720000 out of 2000000 steps (36%)
06:31:00:WU00:FS00:0x18:Completed 740000 out of 2000000 steps (37%)
06:31:15:WU00:FS00:0x18:Completed 760000 out of 2000000 steps (38%)
06:31:31:WU00:FS00:0x18:Completed 780000 out of 2000000 steps (39%)
06:31:47:WU00:FS00:0x18:Completed 800000 out of 2000000 steps (40%)
06:32:04:WU00:FS00:0x18:Completed 820000 out of 2000000 steps (41%)
06:32:20:WU00:FS00:0x18:Completed 840000 out of 2000000 steps (42%)
06:32:35:WU00:FS00:0x18:Completed 860000 out of 2000000 steps (43%)
06:32:51:WU00:FS00:0x18:Completed 880000 out of 2000000 steps (44%)
06:33:07:WU00:FS00:0x18:Completed 900000 out of 2000000 steps (45%)
06:33:24:WU00:FS00:0x18:Completed 920000 out of 2000000 steps (46%)
06:33:40:WU00:FS00:0x18:Completed 940000 out of 2000000 steps (47%)
06:33:55:WU00:FS00:0x18:Completed 960000 out of 2000000 steps (48%)
06:34:11:WU00:FS00:0x18:Completed 980000 out of 2000000 steps (49%)
06:34:27:WU00:FS00:0x18:Completed 1000000 out of 2000000 steps (50%)
06:34:44:WU00:FS00:0x18:Completed 1020000 out of 2000000 steps (51%)
06:35:00:WU00:FS00:0x18:Completed 1040000 out of 2000000 steps (52%)
06:35:16:WU00:FS00:0x18:Completed 1060000 out of 2000000 steps (53%)
06:35:31:WU00:FS00:0x18:Completed 1080000 out of 2000000 steps (54%)
06:35:47:WU00:FS00:0x18:Completed 1100000 out of 2000000 steps (55%)
06:36:04:WU00:FS00:0x18:Completed 1120000 out of 2000000 steps (56%)
06:36:20:WU00:FS00:0x18:Completed 1140000 out of 2000000 steps (57%)
06:36:36:WU00:FS00:0x18:Completed 1160000 out of 2000000 steps (58%)
06:36:51:WU00:FS00:0x18:Completed 1180000 out of 2000000 steps (59%)
06:37:07:WU00:FS00:0x18:Completed 1200000 out of 2000000 steps (60%)
06:37:24:WU00:FS00:0x18:Completed 1220000 out of 2000000 steps (61%)
06:37:40:WU00:FS00:0x18:Completed 1240000 out of 2000000 steps (62%)
06:37:56:WU00:FS00:0x18:Completed 1260000 out of 2000000 steps (63%)
06:38:12:WU00:FS00:0x18:Completed 1280000 out of 2000000 steps (64%)
06:38:27:WU00:FS00:0x18:Completed 1300000 out of 2000000 steps (65%)
06:38:45:WU00:FS00:0x18:Completed 1320000 out of 2000000 steps (66%)
06:39:00:WU00:FS00:0x18:Completed 1340000 out of 2000000 steps (67%)
06:39:16:WU00:FS00:0x18:Completed 1360000 out of 2000000 steps (68%)
06:39:32:WU00:FS00:0x18:Completed 1380000 out of 2000000 steps (69%)
06:39:48:WU00:FS00:0x18:Completed 1400000 out of 2000000 steps (70%)
06:40:05:WU00:FS00:0x18:Completed 1420000 out of 2000000 steps (71%)
06:40:20:WU00:FS00:0x18:Completed 1440000 out of 2000000 steps (72%)
06:40:36:WU00:FS00:0x18:Completed 1460000 out of 2000000 steps (73%)
06:40:52:WU00:FS00:0x18:Completed 1480000 out of 2000000 steps (74%)
06:41:07:WU00:FS00:0x18:Completed 1500000 out of 2000000 steps (75%)
06:41:25:WU00:FS00:0x18:Completed 1520000 out of 2000000 steps (76%)
06:41:40:WU00:FS00:0x18:Completed 1540000 out of 2000000 steps (77%)
06:41:56:WU00:FS00:0x18:Completed 1560000 out of 2000000 steps (78%)
06:42:12:WU00:FS00:0x18:Completed 1580000 out of 2000000 steps (79%)
06:42:27:WU00:FS00:0x18:Completed 1600000 out of 2000000 steps (80%)
06:42:45:WU00:FS00:0x18:Completed 1620000 out of 2000000 steps (81%)
06:43:00:WU00:FS00:0x18:Completed 1640000 out of 2000000 steps (82%)
06:43:16:WU00:FS00:0x18:Completed 1660000 out of 2000000 steps (83%)
06:43:32:WU00:FS00:0x18:Completed 1680000 out of 2000000 steps (84%)
06:43:48:WU00:FS00:0x18:Completed 1700000 out of 2000000 steps (85%)
06:44:05:WU00:FS00:0x18:Completed 1720000 out of 2000000 steps (86%)
06:44:21:WU00:FS00:0x18:Completed 1740000 out of 2000000 steps (87%)
06:44:36:WU00:FS00:0x18:Completed 1760000 out of 2000000 steps (88%)
06:44:52:WU00:FS00:0x18:Completed 1780000 out of 2000000 steps (89%)
06:45:08:WU00:FS00:0x18:Completed 1800000 out of 2000000 steps (90%)
06:45:25:WU00:FS00:0x18:Completed 1820000 out of 2000000 steps (91%)
06:45:41:WU00:FS00:0x18:Completed 1840000 out of 2000000 steps (92%)
06:45:56:WU00:FS00:0x18:Completed 1860000 out of 2000000 steps (93%)
06:46:12:WU00:FS00:0x18:Completed 1880000 out of 2000000 steps (94%)
06:46:28:WU00:FS00:0x18:Completed 1900000 out of 2000000 steps (95%)
06:46:45:WU00:FS00:0x18:Completed 1920000 out of 2000000 steps (96%)
06:47:01:WU00:FS00:0x18:Completed 1940000 out of 2000000 steps (97%)
06:47:16:WU00:FS00:0x18:Completed 1960000 out of 2000000 steps (98%)
06:47:32:WU00:FS00:0x18:Completed 1980000 out of 2000000 steps (99%)
06:47:48:WU00:FS00:0x18:Completed 2000000 out of 2000000 steps (100%)
06:47:49:WU01:FS00:Connecting to 171.67.108.45:80
06:47:50:WU00:FS00:0x18:Saving result file logfile_01.txt
06:47:50:WU00:FS00:0x18:Saving result file checkpointState.xml
06:47:50:WU00:FS00:0x18:Saving result file checkpt.crc
06:47:50:WU00:FS00:0x18:Saving result file log.txt
06:47:50:WU00:FS00:0x18:Saving result file positions.xtc
06:47:50:WU00:FS00:0x18:Folding@home Core Shutdown: FINISHED_UNIT
06:47:50:WU01:FS00:Assigned to work server 171.67.108.104
06:47:51:WU01:FS00:Requesting new work unit for slot 00: RUNNING gpu:1:GM204 [GeForce GTX 970] from 171.67.108.104
06:47:51:WU01:FS00:Connecting to 171.67.108.104:8080
06:47:52:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
06:47:52:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:9679 run:1 clone:72 gen:79 core:0x18 unit:0x00000058ab436c9b56de69bff684d949
06:47:52:WU00:FS00:Uploading 755.84KiB to 171.67.108.155
06:47:52:WU00:FS00:Connecting to 171.67.108.155:8080
06:47:54:WU00:FS00:Upload complete
06:47:54:WU00:FS00:Server responded WORK_ACK (400)
06:47:54:WU00:FS00:Final credit estimate, 4592.00 points
06:47:54:WU00:FS00:Cleaning up
Then I caught 9679 again:

06:48:12:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
06:48:12:WU01:FS00:Connecting to 171.67.108.104:80
06:48:16:ERROR:WU01:FS00:Exception: Failed to connect to 171.67.108.104:80: No connection could be made because the target machine actively refused it.
06:48:16:WU01:FS00:Connecting to 171.67.108.45:80
06:48:17:WU01:FS00:Assigned to work server 171.67.108.155
06:48:17:WU01:FS00:Requesting new work unit for slot 00: READY gpu:1:GM204 [GeForce GTX 970] from 171.67.108.155
06:48:17:WU01:FS00:Connecting to 171.67.108.155:8080
06:48:18:WU01:FS00:Downloading 389.33KiB
06:48:22:WU01:FS00:Download complete
06:48:22:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9679 run:1 clone:65 gen:158 core:0x18 unit:0x000000b5ab436c9b56de69bf452c1290
06:48:22:WU01:FS00:Starting
06:48:22:WU01:FS00:Running FahCore: C:\NO_UAC\FAHClient_7.4.15_x64/FAHCoreWrapper.exe E:\Docbase\FaH-workdir\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 01 -suffix 01 -version 704 -lifeline 4728 -checkpoint 6 -opencl-platform 1 -gpu-vendor nvidia -gpu 0 -forceasm -twait=80
06:48:22:WU01:FS00:Started FahCore on PID 2908
06:48:22:WU01:FS00:Core PID:4452
06:48:22:WU01:FS00:FahCore 0x18 started
06:48:24:WU01:FS00:0x18:*********************** Log Started 2016-10-10T06:48:23Z ***********************
06:48:24:WU01:FS00:0x18:Project: 9679 (Run 1, Clone 65, Gen 158)
06:48:24:WU01:FS00:0x18:Unit: 0x000000b5ab436c9b56de69bf452c1290
06:48:24:WU01:FS00:0x18:CPU: 0x00000000000000000000000000000000
06:48:24:WU01:FS00:0x18:Machine: 0
06:48:24:WU01:FS00:0x18:Reading tar file core.xml
06:48:24:WU01:FS00:0x18:Reading tar file integrator.xml
06:48:24:WU01:FS00:0x18:Reading tar file state.xml
06:48:24:WU01:FS00:0x18:Reading tar file system.xml
06:48:24:WU01:FS00:0x18:Digital signatures verified
06:48:24:WU01:FS00:0x18:Folding@home GPU core18
06:48:24:WU01:FS00:0x18:Version 0.0.4
06:48:29:WU01:FS00:0x18:Completed 0 out of 2000000 steps (0%)
06:48:29:WU01:FS00:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
06:48:46:WU01:FS00:0x18:Completed 20000 out of 2000000 steps (1%)
06:49:02:WU01:FS00:0x18:Completed 40000 out of 2000000 steps (2%)
06:49:18:WU01:FS00:0x18:Completed 60000 out of 2000000 steps (3%)
06:49:33:WU01:FS00:0x18:Completed 80000 out of 2000000 steps (4%)
06:49:49:WU01:FS00:0x18:Completed 100000 out of 2000000 steps (5%)
06:50:06:WU01:FS00:0x18:Completed 120000 out of 2000000 steps (6%)
06:50:22:WU01:FS00:0x18:Completed 140000 out of 2000000 steps (7%)
06:50:38:WU01:FS00:0x18:Completed 160000 out of 2000000 steps (8%)
06:50:53:WU01:FS00:0x18:Completed 180000 out of 2000000 steps (9%)
06:51:09:WU01:FS00:0x18:Completed 200000 out of 2000000 steps (10%)
06:51:26:WU01:FS00:0x18:Completed 220000 out of 2000000 steps (11%)
06:51:42:WU01:FS00:0x18:Completed 240000 out of 2000000 steps (12%)
06:51:58:WU01:FS00:0x18:Completed 260000 out of 2000000 steps (13%)
06:52:14:WU01:FS00:0x18:Completed 280000 out of 2000000 steps (14%)
06:52:29:WU01:FS00:0x18:Completed 300000 out of 2000000 steps (15%)
06:52:29:WU01:FS00:0x18:Bad State detected... attempting to resume from last good checkpoint
06:52:45:WU01:FS00:0x18:Completed 220000 out of 2000000 steps (11%)
06:53:01:WU01:FS00:0x18:Completed 240000 out of 2000000 steps (12%)
06:53:17:WU01:FS00:0x18:Completed 260000 out of 2000000 steps (13%)
06:53:32:WU01:FS00:0x18:Completed 280000 out of 2000000 steps (14%)
06:53:48:WU01:FS00:0x18:Completed 300000 out of 2000000 steps (15%)
06:53:48:WU01:FS00:0x18:Bad State detected... attempting to resume from last good checkpoint
06:54:04:WU01:FS00:0x18:Completed 220000 out of 2000000 steps (11%)
06:54:20:WU01:FS00:0x18:Completed 240000 out of 2000000 steps (12%)
06:54:36:WU01:FS00:0x18:Completed 260000 out of 2000000 steps (13%)
06:54:52:WU01:FS00:0x18:Completed 280000 out of 2000000 steps (14%)
06:55:08:WU01:FS00:0x18:Completed 300000 out of 2000000 steps (15%)
06:55:08:WU01:FS00:0x18:Bad State detected... attempting to resume from last good checkpoint
06:55:08:WU01:FS00:0x18:Max number of retries reached. Aborting.
06:55:08:WU01:FS00:0x18:ERROR:exception: Max Retries Reached
06:55:08:WU01:FS00:0x18:Saving result file logfile_01.txt
06:55:08:WU01:FS00:0x18:Saving result file log.txt
06:55:08:WU01:FS00:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
06:55:09:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
06:55:09:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:9679 run:1 clone:65 gen:158 core:0x18 unit:0x000000b5ab436c9b56de69bf452c1290
06:55:09:WU01:FS00:Uploading 2.85KiB to 171.67.108.155
06:55:09:WU01:FS00:Connecting to 171.67.108.155:8080
06:55:10:WU00:FS00:Connecting to 171.67.108.45:80
06:55:10:WU01:FS00:Upload complete
06:55:10:WU01:FS00:Server responded WORK_ACK (400)
06:55:10:WU01:FS00:Cleaning up
After the first bad state message I decreased the GPU clock to 1480 MHz, but it didn't help.
   510 290 819 pts earned in Folding@home project
Joe_H

Re: 3 failed 9679

Post by Joe_H »

There are several failure reports in the database for Project: 9679 (Run 1, Clone 65, Gen 158), including yours showing partial completion. A final report shows the WU was successfully completed by one folder. This particular WU may have been close to a critical point; I checked, and the next few Gens were completed with only one error report.
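A WU is identified by its project, run, clone and gen, so anyone comparing their own results against reports like these can extract that key from the client's "Sending unit results" lines. This is a minimal sketch, assuming the line format seen in the logs in this thread; `parse_result_line` and `prcg` are hypothetical helper names, and the three sample lines are copied from the posts above.

```python
import re

FIELD = re.compile(r"(\w+):(\S+)")

def parse_result_line(line):
    """Extract the key:value fields that follow 'Sending unit results:'."""
    _, _, tail = line.partition("Sending unit results:")
    return dict(FIELD.findall(tail))

def prcg(fields):
    """Build the (project, run, clone, gen) key for one work unit."""
    return tuple(int(fields[k]) for k in ("project", "run", "clone", "gen"))

lines = [
    "16:12:53:WU01:FS02:Sending unit results: id:01 state:SEND error:FAULTY project:9679 run:0 clone:45 gen:178 core:0x18 unit:0x000000c2ab436c9b56de69bf1cb223eb",
    "06:47:52:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:9679 run:1 clone:72 gen:79 core:0x18 unit:0x00000058ab436c9b56de69bff684d949",
    "06:55:09:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:9679 run:1 clone:65 gen:158 core:0x18 unit:0x000000b5ab436c9b56de69bf452c1290",
]

# Map each work unit to the outcome the client reported for it.
outcomes = {}
for line in lines:
    fields = parse_result_line(line)
    outcomes[prcg(fields)] = fields["error"]

for key, error in sorted(outcomes.items()):
    print(key, error)
```

Keying on the full tuple rather than the project number alone keeps a single difficult WU from being mistaken for a problem with the whole project.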
bruce

Re: 3 failed 9679

Post by bruce »

"Bad State detected...." is a report implying that a calculation error was detected. The most common causes of calculation errors are excessive overclocking or overheating. If this happens on a different WU, remove the overclocking and see if that fixes it.

The previous topic was when p9679 was being beta tested. It's a released project now, so a new topic is fine.
Duce H_K_

Re: 3 failed 9679

Post by Duce H_K_ »

Thank God my system completed it normally after downclocking the GTX 970 by a few megahertz.

14:26:43:WU01:FS00:Connecting to 171.67.108.45:80
14:26:43:WU01:FS00:Assigned to work server 171.67.108.155
14:26:43:WU01:FS00:Requesting new work unit for slot 00: RUNNING gpu:0:GM204 [GeForce GTX 970] from 171.67.108.155
14:26:43:WU01:FS00:Connecting to 171.67.108.155:8080
14:26:45:WU01:FS00:Downloading 390.88KiB
14:26:46:WU01:FS00:Download complete
14:26:46:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9679 run:0 clone:79 gen:96 core:0x18 unit:0x00000069ab436c9b56de69bf3d80a24d
14:26:46:WU01:FS00:Starting
14:26:46:WU01:FS00:Running FahCore: C:\NO_UAC\FAHClient_7.4.15_x64/FAHCoreWrapper.exe E:\Docbase\FaH-workdir\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 01 -suffix 01 -version 704 -lifeline 4884 -checkpoint 6 -opencl-platform 1 -gpu-vendor nvidia -gpu 1 -forceasm -twait=80
14:26:46:WU01:FS00:Started FahCore on PID 4056
14:26:46:WU01:FS00:Core PID:3620
14:26:46:WU01:FS00:FahCore 0x18 started
14:26:48:WU01:FS00:0x18:*********************** Log Started 2016-10-17T14:26:47Z ***********************
14:26:48:WU01:FS00:0x18:Project: 9679 (Run 0, Clone 79, Gen 96)
14:26:48:WU01:FS00:0x18:Unit: 0x00000069ab436c9b56de69bf3d80a24d
14:26:48:WU01:FS00:0x18:CPU: 0x00000000000000000000000000000000
14:26:48:WU01:FS00:0x18:Machine: 0
14:26:48:WU01:FS00:0x18:Reading tar file core.xml
14:26:48:WU01:FS00:0x18:Reading tar file integrator.xml
14:26:48:WU01:FS00:0x18:Reading tar file state.xml
14:26:48:WU01:FS00:0x18:Reading tar file system.xml
14:26:48:WU01:FS00:0x18:Digital signatures verified
14:26:48:WU01:FS00:0x18:Folding@home GPU core18
14:26:48:WU01:FS00:0x18:Version 0.0.4
14:26:54:WU01:FS00:0x18:Completed 0 out of 2000000 steps (0%)
14:26:54:WU01:FS00:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
14:27:17:WU01:FS00:0x18:Completed 20000 out of 2000000 steps (1%)
14:27:40:WU01:FS00:0x18:Completed 40000 out of 2000000 steps (2%)
14:28:02:WU01:FS00:0x18:Completed 60000 out of 2000000 steps (3%)
14:28:24:WU01:FS00:0x18:Completed 80000 out of 2000000 steps (4%)
14:28:46:WU01:FS00:0x18:Completed 100000 out of 2000000 steps (5%)
14:29:10:WU01:FS00:0x18:Completed 120000 out of 2000000 steps (6%)
14:29:32:WU01:FS00:0x18:Completed 140000 out of 2000000 steps (7%)
14:29:55:WU01:FS00:0x18:Completed 160000 out of 2000000 steps (8%)
14:30:17:WU01:FS00:0x18:Completed 180000 out of 2000000 steps (9%)
14:30:39:WU01:FS00:0x18:Completed 200000 out of 2000000 steps (10%)
14:31:03:WU01:FS00:0x18:Completed 220000 out of 2000000 steps (11%)
14:31:25:WU01:FS00:0x18:Completed 240000 out of 2000000 steps (12%)
14:31:47:WU01:FS00:0x18:Completed 260000 out of 2000000 steps (13%)
14:32:10:WU01:FS00:0x18:Completed 280000 out of 2000000 steps (14%)
14:32:32:WU01:FS00:0x18:Completed 300000 out of 2000000 steps (15%)
14:32:56:WU01:FS00:0x18:Completed 320000 out of 2000000 steps (16%)
14:33:18:WU01:FS00:0x18:Completed 340000 out of 2000000 steps (17%)
14:33:40:WU01:FS00:0x18:Completed 360000 out of 2000000 steps (18%)
14:34:03:WU01:FS00:0x18:Completed 380000 out of 2000000 steps (19%)
14:34:25:WU01:FS00:0x18:Completed 400000 out of 2000000 steps (20%)
14:34:49:WU01:FS00:0x18:Completed 420000 out of 2000000 steps (21%)
14:35:11:WU01:FS00:0x18:Completed 440000 out of 2000000 steps (22%)
14:35:33:WU01:FS00:0x18:Completed 460000 out of 2000000 steps (23%)
14:35:56:WU01:FS00:0x18:Completed 480000 out of 2000000 steps (24%)
14:36:18:WU01:FS00:0x18:Completed 500000 out of 2000000 steps (25%)
14:36:41:WU01:FS00:0x18:Completed 520000 out of 2000000 steps (26%)
14:37:04:WU01:FS00:0x18:Completed 540000 out of 2000000 steps (27%)
14:37:26:WU01:FS00:0x18:Completed 560000 out of 2000000 steps (28%)
14:37:48:WU01:FS00:0x18:Completed 580000 out of 2000000 steps (29%)
14:38:10:WU01:FS00:0x18:Completed 600000 out of 2000000 steps (30%)
14:38:34:WU01:FS00:0x18:Completed 620000 out of 2000000 steps (31%)
14:38:56:WU01:FS00:0x18:Completed 640000 out of 2000000 steps (32%)
14:39:19:WU01:FS00:0x18:Completed 660000 out of 2000000 steps (33%)
14:39:41:WU01:FS00:0x18:Completed 680000 out of 2000000 steps (34%)
14:40:03:WU01:FS00:0x18:Completed 700000 out of 2000000 steps (35%)
14:40:27:WU01:FS00:0x18:Completed 720000 out of 2000000 steps (36%)
14:40:50:WU01:FS00:0x18:Completed 740000 out of 2000000 steps (37%)
14:41:12:WU01:FS00:0x18:Completed 760000 out of 2000000 steps (38%)
14:41:34:WU01:FS00:0x18:Completed 780000 out of 2000000 steps (39%)
14:41:56:WU01:FS00:0x18:Completed 800000 out of 2000000 steps (40%)
14:42:20:WU01:FS00:0x18:Completed 820000 out of 2000000 steps (41%)
14:42:42:WU01:FS00:0x18:Completed 840000 out of 2000000 steps (42%)
14:43:05:WU01:FS00:0x18:Completed 860000 out of 2000000 steps (43%)
14:43:27:WU01:FS00:0x18:Completed 880000 out of 2000000 steps (44%)
14:43:49:WU01:FS00:0x18:Completed 900000 out of 2000000 steps (45%)
14:44:13:WU01:FS00:0x18:Completed 920000 out of 2000000 steps (46%)
14:44:35:WU01:FS00:0x18:Completed 940000 out of 2000000 steps (47%)
14:44:57:WU01:FS00:0x18:Completed 960000 out of 2000000 steps (48%)
14:45:20:WU01:FS00:0x18:Completed 980000 out of 2000000 steps (49%)
14:45:42:WU01:FS00:0x18:Completed 1000000 out of 2000000 steps (50%)
14:46:06:WU01:FS00:0x18:Completed 1020000 out of 2000000 steps (51%)
14:46:28:WU01:FS00:0x18:Completed 1040000 out of 2000000 steps (52%)
14:46:50:WU01:FS00:0x18:Completed 1060000 out of 2000000 steps (53%)
14:47:13:WU01:FS00:0x18:Completed 1080000 out of 2000000 steps (54%)
14:47:35:WU01:FS00:0x18:Completed 1100000 out of 2000000 steps (55%)
14:47:59:WU01:FS00:0x18:Completed 1120000 out of 2000000 steps (56%)
14:48:21:WU01:FS00:0x18:Completed 1140000 out of 2000000 steps (57%)
14:48:43:WU01:FS00:0x18:Completed 1160000 out of 2000000 steps (58%)
14:49:06:WU01:FS00:0x18:Completed 1180000 out of 2000000 steps (59%)
14:49:28:WU01:FS00:0x18:Completed 1200000 out of 2000000 steps (60%)
14:49:52:WU01:FS00:0x18:Completed 1220000 out of 2000000 steps (61%)
14:50:14:WU01:FS00:0x18:Completed 1240000 out of 2000000 steps (62%)
14:50:36:WU01:FS00:0x18:Completed 1260000 out of 2000000 steps (63%)
14:50:59:WU01:FS00:0x18:Completed 1280000 out of 2000000 steps (64%)
14:51:21:WU01:FS00:0x18:Completed 1300000 out of 2000000 steps (65%)
14:51:45:WU01:FS00:0x18:Completed 1320000 out of 2000000 steps (66%)
14:52:07:WU01:FS00:0x18:Completed 1340000 out of 2000000 steps (67%)
14:52:29:WU01:FS00:0x18:Completed 1360000 out of 2000000 steps (68%)
14:52:52:WU01:FS00:0x18:Completed 1380000 out of 2000000 steps (69%)
14:53:14:WU01:FS00:0x18:Completed 1400000 out of 2000000 steps (70%)
14:53:38:WU01:FS00:0x18:Completed 1420000 out of 2000000 steps (71%)
14:54:00:WU01:FS00:0x18:Completed 1440000 out of 2000000 steps (72%)
14:54:23:WU01:FS00:0x18:Completed 1460000 out of 2000000 steps (73%)
14:54:45:WU01:FS00:0x18:Completed 1480000 out of 2000000 steps (74%)
14:55:07:WU01:FS00:0x18:Completed 1500000 out of 2000000 steps (75%)
14:55:31:WU01:FS00:0x18:Completed 1520000 out of 2000000 steps (76%)
14:55:53:WU01:FS00:0x18:Completed 1540000 out of 2000000 steps (77%)
14:56:15:WU01:FS00:0x18:Completed 1560000 out of 2000000 steps (78%)
14:56:38:WU01:FS00:0x18:Completed 1580000 out of 2000000 steps (79%)
14:57:00:WU01:FS00:0x18:Completed 1600000 out of 2000000 steps (80%)
14:57:24:WU01:FS00:0x18:Completed 1620000 out of 2000000 steps (81%)
14:57:46:WU01:FS00:0x18:Completed 1640000 out of 2000000 steps (82%)
14:58:08:WU01:FS00:0x18:Completed 1660000 out of 2000000 steps (83%)
14:58:30:WU01:FS00:0x18:Completed 1680000 out of 2000000 steps (84%)
14:58:52:WU01:FS00:0x18:Completed 1700000 out of 2000000 steps (85%)
14:59:16:WU01:FS00:0x18:Completed 1720000 out of 2000000 steps (86%)
14:59:38:WU01:FS00:0x18:Completed 1740000 out of 2000000 steps (87%)
15:00:01:WU01:FS00:0x18:Completed 1760000 out of 2000000 steps (88%)
15:00:23:WU01:FS00:0x18:Completed 1780000 out of 2000000 steps (89%)
15:00:45:WU01:FS00:0x18:Completed 1800000 out of 2000000 steps (90%)
15:01:09:WU01:FS00:0x18:Completed 1820000 out of 2000000 steps (91%)
15:01:31:WU01:FS00:0x18:Completed 1840000 out of 2000000 steps (92%)
15:01:53:WU01:FS00:0x18:Completed 1860000 out of 2000000 steps (93%)
15:02:15:WU01:FS00:0x18:Completed 1880000 out of 2000000 steps (94%)
15:02:38:WU01:FS00:0x18:Completed 1900000 out of 2000000 steps (95%)
15:03:01:WU01:FS00:0x18:Completed 1920000 out of 2000000 steps (96%)
15:03:24:WU01:FS00:0x18:Completed 1940000 out of 2000000 steps (97%)
15:03:46:WU01:FS00:0x18:Completed 1960000 out of 2000000 steps (98%)
15:04:08:WU01:FS00:0x18:Completed 1980000 out of 2000000 steps (99%)
15:04:30:WU01:FS00:0x18:Completed 2000000 out of 2000000 steps (100%)
15:04:32:WU01:FS00:0x18:Saving result file logfile_01.txt
15:04:32:WU01:FS00:0x18:Saving result file checkpointState.xml
15:04:32:WU01:FS00:0x18:Saving result file checkpt.crc
15:04:32:WU01:FS00:0x18:Saving result file log.txt
15:04:32:WU01:FS00:0x18:Saving result file positions.xtc
15:04:32:WU01:FS00:0x18:Folding@home Core Shutdown: FINISHED_UNIT
15:04:33:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
15:04:33:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:9679 run:0 clone:79 gen:96 core:0x18 unit:0x00000069ab436c9b56de69bf3d80a24d
15:04:33:WU01:FS00:Uploading 757.72KiB to 171.67.108.155
15:04:33:WU01:FS00:Connecting to 171.67.108.155:8080
15:04:35:WU01:FS00:Upload complete
15:04:35:WU01:FS00:Server responded WORK_ACK (400)
15:04:35:WU01:FS00:Final credit estimate, 3897.00 points
15:04:35:WU01:FS00:Cleaning up
It seems FahCore 0x21 tolerates higher overclocks, and the extra heat that comes with them, more stably than 0x18. At least that holds for Maxwell cards. The logged WU drew only ~50% of max TDP, which resulted in just 155.5K PPD. The video card stayed cool enough to push its clock up slightly (1491<->1503 MHz). I was afraid this clock fluctuation could lead to a poorly crunched WU and bad states, but this time it didn't happen. Sorry for my probably incorrect English.
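As a rough cross-check of a PPD figure like the one above, the progress lines in a log can be turned into an average time per 1% and then into points per day. This is only a minimal sketch: the function names `seconds_per_percent` and `estimate_ppd` are made up for illustration, the credit value has to be taken from the "Final credit estimate" line by hand, and it assumes a clean run with no bad-state rollbacks between the first and last progress line.

```python
import re
from datetime import datetime

# Matches e.g. "14:30:17:WU01:FS00:0x18:Completed 180000 out of 2000000 steps (9%)"
PROGRESS = re.compile(
    r"^(\d\d:\d\d:\d\d):WU\d+:FS\d+:0x\w+:Completed (\d+) out of (\d+) steps"
)

def seconds_per_percent(lines):
    """Average wall-clock seconds per 1% between the first and last progress line."""
    points = []
    for line in lines:
        m = PROGRESS.match(line)
        if m:
            stamp = datetime.strptime(m.group(1), "%H:%M:%S")
            percent = 100.0 * int(m.group(2)) / int(m.group(3))
            points.append((stamp, percent))
    if len(points) < 2 or points[-1][1] <= points[0][1]:
        raise ValueError("need at least two progress lines with increasing progress")
    (t0, p0), (t1, p1) = points[0], points[-1]
    elapsed = (t1 - t0).total_seconds()
    if elapsed < 0:
        # Log timestamps carry no date, so assume a single midnight rollover.
        elapsed += 86400
    return elapsed / (p1 - p0)

def estimate_ppd(credit, tpf_seconds):
    """Points per day if every WU earned `credit` and took 100 * tpf_seconds."""
    return credit * 86400.0 / (100.0 * tpf_seconds)
```

Fed the 9% line (14:30:17) and the 100% line (15:04:30) from the log above together with the 3897-point credit, this gives about 22.6 s per percent and roughly 149K PPD. That is in the same range as the quoted figure; the exact number depends on how the monitoring tool measures frame time.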
   510 290 819 pts earned in Folding@home project
Duce H_K_
Posts: 113
Joined: Mon Nov 09, 2015 3:52 pm
Hardware configuration: MoBo•Gigabyte X99 UD4-CF F24
CPU•<UPD 20.05.2023>Xeon V3 2680 V4 14c28t 35Mb L3
RAM•DDR4 Hynix 2133 CL14 4*16 DualRank Quad channel
HDD•ST1000DM003 Sata3 NCQ
GFX•GT220
PSU•Chieftec GPS750C 80+ Gold after repair
Cooling•Air 2xDeepCool UF120

Internet•200Mbit/s FTTB↓ white dynamic, ERTH, router RB951G-2HnD

Other•Redmi 7A <runs WUProp :-/>
Location: Russia

Re: 3 failed 9679

Post by Duce H_K_ »

__________________________________________________________________________________________

Code: Select all

******************************* Date: 2016-10-28 *******************************
05:39:01:WU01:FS01:Connecting to 171.67.108.45:80
05:39:03:WU01:FS01:Assigned to work server 171.67.108.155
05:39:03:WU01:FS01:Requesting new work unit for slot 01: READY gpu:1:GP104 [GeForce GTX 1070] from 171.67.108.155
05:39:03:WU01:FS01:Connecting to 171.67.108.155:8080
05:39:04:WU01:FS01:Downloading 394.33KiB
05:39:07:WU01:FS01:Download complete
05:39:07:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9678 run:1 clone:62 gen:71 core:0x18 unit:0x0000005dab436c9b56de69bf85733cad
05:39:07:WU01:FS01:Starting
05:39:07:WU01:FS01:Running FahCore: C:\NO_UAC\FAHClient_7.4.15_x64/FAHCoreWrapper.exe E:\Docbase\FaH-workdir\cores/web.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_18.fah/FahCore_18.exe -dir 01 -suffix 01 -version 704 -lifeline 5092 -checkpoint 6 -opencl-platform 1 -gpu-vendor nvidia -gpu 0 -forceasm -twait=80
05:39:08:WU01:FS01:Started FahCore on PID 4760
05:39:09:WU01:FS01:Core PID:192
05:39:09:WU01:FS01:FahCore 0x18 started
05:39:11:WU01:FS01:0x18:*********************** Log Started 2016-10-28T05:39:10Z ***********************
05:39:11:WU01:FS01:0x18:Project: 9678 (Run 1, Clone 62, Gen 71)
05:39:11:WU01:FS01:0x18:Unit: 0x0000005dab436c9b56de69bf85733cad
05:39:11:WU01:FS01:0x18:CPU: 0x00000000000000000000000000000000
05:39:11:WU01:FS01:0x18:Machine: 1
05:39:11:WU01:FS01:0x18:Reading tar file core.xml
05:39:11:WU01:FS01:0x18:Reading tar file integrator.xml
05:39:11:WU01:FS01:0x18:Reading tar file state.xml
05:39:11:WU01:FS01:0x18:Reading tar file system.xml
05:39:11:WU01:FS01:0x18:Digital signatures verified
05:39:11:WU01:FS01:0x18:Folding@home GPU core18
05:39:11:WU01:FS01:0x18:Version 0.0.4
05:39:17:WU01:FS01:0x18:Completed 0 out of 2000000 steps (0%)
05:39:17:WU01:FS01:0x18:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
05:39:29:WU01:FS01:0x18:Completed 20000 out of 2000000 steps (1%)
05:39:39:WU01:FS01:0x18:Completed 40000 out of 2000000 steps (2%)
05:39:49:WU01:FS01:0x18:Completed 60000 out of 2000000 steps (3%)
05:39:59:WU01:FS01:0x18:Completed 80000 out of 2000000 steps (4%)
05:40:09:WU01:FS01:0x18:Completed 100000 out of 2000000 steps (5%)
05:40:21:WU01:FS01:0x18:Completed 120000 out of 2000000 steps (6%)
05:40:31:WU01:FS01:0x18:Completed 140000 out of 2000000 steps (7%)
05:40:41:WU01:FS01:0x18:Completed 160000 out of 2000000 steps (8%)
05:40:51:WU01:FS01:0x18:Completed 180000 out of 2000000 steps (9%)
05:41:01:WU01:FS01:0x18:Completed 200000 out of 2000000 steps (10%)
05:41:01:WU01:FS01:0x18:Bad State detected... attempting to resume from last good checkpoint
05:41:11:WU01:FS01:0x18:Completed 120000 out of 2000000 steps (6%)
05:41:21:WU01:FS01:0x18:Completed 140000 out of 2000000 steps (7%)
05:41:30:WU01:FS01:0x18:Completed 160000 out of 2000000 steps (8%)
05:41:40:WU01:FS01:0x18:Completed 180000 out of 2000000 steps (9%)
05:41:50:WU01:FS01:0x18:Completed 200000 out of 2000000 steps (10%)
05:41:50:WU01:FS01:0x18:Bad State detected... attempting to resume from last good checkpoint
05:42:00:WU01:FS01:0x18:Completed 120000 out of 2000000 steps (6%)
05:42:10:WU01:FS01:0x18:Completed 140000 out of 2000000 steps (7%)
05:42:20:WU01:FS01:0x18:Completed 160000 out of 2000000 steps (8%)
05:42:30:WU01:FS01:0x18:Completed 180000 out of 2000000 steps (9%)
05:42:40:WU01:FS01:0x18:Completed 200000 out of 2000000 steps (10%)
05:42:40:WU01:FS01:0x18:Bad State detected... attempting to resume from last good checkpoint
05:42:40:WU01:FS01:0x18:Max number of retries reached. Aborting.
05:42:40:WU01:FS01:0x18:ERROR:exception: Max Retries Reached
05:42:40:WU01:FS01:0x18:Saving result file logfile_01.txt
05:42:40:WU01:FS01:0x18:Saving result file log.txt
05:42:40:WU01:FS01:0x18:Folding@home Core Shutdown: BAD_WORK_UNIT
05:42:41:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
05:42:41:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:9678 run:1 clone:62 gen:71 core:0x18 unit:0x0000005dab436c9b56de69bf85733cad
05:42:41:WU01:FS01:Uploading 2.86KiB to 171.67.108.155
05:42:41:WU01:FS01:Connecting to 171.67.108.155:8080
05:42:41:WU01:FS01:Upload complete
05:42:41:WU01:FS01:Server responded WORK_ACK (400)
05:42:41:WU01:FS01:Cleaning up
got 22 points according to EOC stats.
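The failure pattern in the log above (three "Bad State detected" lines, each right after the 200000-step line, then an abort) can be pulled out of a log automatically. The following is a minimal sketch, not part of any Folding@home tool: `summarize_bad_states` is a hypothetical helper, and it relies only on the message strings that appear verbatim in this log.

```python
import re

PROGRESS = re.compile(r"Completed (\d+) out of (\d+) steps")

def summarize_bad_states(lines):
    """Count bad-state rollbacks and record the last completed step before each."""
    retries = 0
    failing_steps = []
    last_step = None
    aborted = False
    for line in lines:
        m = PROGRESS.search(line)
        if m:
            last_step = int(m.group(1))
        elif "Bad State detected" in line:
            retries += 1
            failing_steps.append(last_step)
        elif "Max Retries Reached" in line or "BAD_WORK_UNIT" in line:
            aborted = True
    return {"retries": retries, "failing_steps": failing_steps, "aborted": aborted}

if __name__ == "__main__":
    sample = [
        "05:41:01:WU01:FS01:0x18:Completed 200000 out of 2000000 steps (10%)",
        "05:41:01:WU01:FS01:0x18:Bad State detected... attempting to resume from last good checkpoint",
        "05:41:11:WU01:FS01:0x18:Completed 120000 out of 2000000 steps (6%)",
    ]
    print(summarize_bad_states(sample))
```

A failure that repeats at the same step after every rollback is at least consistent with a reproducible problem rather than a one-off glitch, but on its own it does not say whether the WU, the driver, or the overclock is at fault.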
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 3 failed 9679

Post by bruce »

Duce H_K_ wrote:got 22 points according to EOC stats.
Could be. You didn't specify which drivers you're running. Perhaps it's the drivers; perhaps it's the overclocking.