Project: 7810 (Run 0, Clone 66, Gen 402)

Moderators: Site Moderators, FAHC Science Team

Post Reply
ChristianVirtual
Posts: 1596
Joined: Tue May 28, 2013 12:14 pm
Location: Tokyo

Project: 7810 (Run 0, Clone 66, Gen 402)

Post by ChristianVirtual »

One Faulty 7810, also after 72% as folding_hoomer before ...

Latest NV driver 331.20

Code: Select all

*********************** Log Started 2013-11-08T04:30:28Z ***********************
04:30:28:************************* Folding@home Client *************************
04:30:28:    Website: http://folding.stanford.edu/
04:30:28:  Copyright: (c) 2009-2013 Stanford University
04:30:28:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
04:30:28:       Args: --child --lifeline 1113 /etc/fahclient/config.xml --run-as
04:30:28:             fahclient --pid-file=/var/run/fahclient.pid --daemon
04:30:28:     Config: /etc/fahclient/config.xml
04:30:28:******************************** Build ********************************
04:30:28:    Version: 7.3.6
04:30:28:       Date: Feb 18 2013
04:30:28:       Time: 07:24:08
04:30:28:    SVN Rev: 3923
04:30:28:     Branch: fah/trunk/client
04:30:28:   Compiler: GNU 4.4.7
04:30:28:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
04:30:28:             -fno-unsafe-math-optimizations -msse2
04:30:28:   Platform: linux2 3.2.0-1-amd64
04:30:28:       Bits: 64
04:30:28:       Mode: Release
04:30:28:******************************* System ********************************
04:30:28:        CPU: Intel(R) Core(TM) i7-2600S CPU @ 2.80GHz
04:30:28:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
04:30:28:       CPUs: 8
04:30:28:     Memory: 7.74GiB
04:30:28:Free Memory: 7.38GiB
04:30:28:    Threads: POSIX_THREADS
04:30:28:Has Battery: false
04:30:28: On Battery: false
04:30:28: UTC offset: 9
04:30:28:        PID: 1243
04:30:28:        CWD: /var/lib/fahclient
04:30:28:         OS: Linux 3.8.0-32-generic x86_64
04:30:28:    OS Arch: AMD64
04:30:28:       GPUs: 3
04:30:28:      GPU 0: NVIDIA:3 GK110 [GeForce GTX 780]
04:30:28:      GPU 1: NVIDIA:3 GK110 [GeForce GTX 780]
04:30:28:      GPU 2: NVIDIA:3 GK104 [GeForce GTX 660 Ti]
04:30:28:       CUDA: 3.5
04:30:28:CUDA Driver: 6000
04:30:28:***********************************************************************
04:30:28:<config>
04:30:28:  <!-- Folding Slot Configuration -->
04:30:28:  <gpu v='true'/>
04:30:28:  <power v='full'/>
04:30:28:
04:30:28:  <!-- HTTP Server -->
04:30:28:
04:30:28:  <!-- Logging -->
04:30:28:  <log-rotate-max v='1024'/>
04:30:28:
04:30:28:  <!-- Network -->
04:30:28:  <proxy v=':8080'/>
04:30:28:
04:30:28:  <!-- User Information -->
04:30:28:  <team v='3446'/>
04:30:28:  <user v='ChristianFAH'/>
04:30:28:
04:30:28:  <!-- Folding Slots -->
04:30:28:  <slot id='0' type='CPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <cpus v='4'/>
04:30:28:    <next-unit-percentage v='99'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:  <slot id='1' type='GPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:  <slot id='2' type='GPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:  <slot id='3' type='GPU'>
04:30:28:    <client-type v='beta'/>
04:30:28:    <pause-on-start v='true'/>
04:30:28:  </slot>
04:30:28:</config>
04:30:28:Switching to user fahclient
04:30:28:Trying to access database...
04:30:28:Successfully acquired database lock
04:30:28:Enabled folding slot 00: PAUSED cpu:4 (paused)
04:30:28:Enabled folding slot 01: PAUSED gpu:0:GK110 [GeForce GTX 780] (paused)
04:30:28:Enabled folding slot 02: PAUSED gpu:1:GK110 [GeForce GTX 780] (paused)
04:30:28:Enabled folding slot 03: PAUSED gpu:2:GK104 [GeForce GTX 660 Ti] (paused)
04:36:01:FS01:Unpaused

...

18:44:02:WU01:FS02:0x17:Saving result file log.txt
18:44:02:WU01:FS02:0x17:Saving result file positions.xtc
18:44:03:WU01:FS02:0x17:Folding@home Core Shutdown: FINISHED_UNIT
18:44:03:WU01:FS02:FahCore returned: FINISHED_UNIT (100 = 0x64)
18:44:03:WU01:FS02:Sending unit results: id:01 state:SEND error:NO_ERROR project:7810 run:0 clone:694 gen:292 core:0x17 unit:0x0000013e0a3b1e8651d34d8137690c39
18:44:03:WU01:FS02:Uploading 5.74MiB to 171.64.65.98
18:44:03:WU01:FS02:Connecting to 171.64.65.98:8080
18:44:06:WU00:FS02:Download 29.89%
18:44:12:WU00:FS02:Download 56.78%
18:44:12:WU01:FS02:Upload complete
18:44:12:WU01:FS02:Server responded WORK_ACK (400)
18:44:12:WU01:FS02:Final credit estimate, 16751.00 points
18:44:12:WU01:FS02:Cleaning up
18:44:18:WU00:FS02:Download 92.65%
18:44:18:WU00:FS02:Download complete


18:44:18:WU00:FS02:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:66 gen:402 core:0x17 unit:0x000001b50a3b1e8651d34661a08b8f39
18:44:18:WU00:FS02:Starting
18:44:18:WU00:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/beta/Core_17.fah/FahCore_17 -dir 00 -suffix 01 -version 703 -lifeline 1243 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
18:44:18:WU00:FS02:Started FahCore on PID 7316
18:44:18:WU00:FS02:Core PID:7320
18:44:18:WU00:FS02:FahCore 0x17 started
18:44:19:WU00:FS02:0x17:*********************** Log Started 2013-11-10T18:44:18Z ***********************
18:44:19:WU00:FS02:0x17:Project: 7810 (Run 0, Clone 66, Gen 402)
18:44:19:WU00:FS02:0x17:Unit: 0x000001b50a3b1e8651d34661a08b8f39
18:44:19:WU00:FS02:0x17:CPU: 0x00000000000000000000000000000000
18:44:19:WU00:FS02:0x17:Machine: 2
18:44:19:WU00:FS02:0x17:Reading tar file state.xml
18:44:19:WU00:FS02:0x17:Reading tar file system.xml
18:44:19:WU00:FS02:0x17:Reading tar file integrator.xml
18:44:19:WU00:FS02:0x17:Reading tar file core.xml
18:44:19:WU00:FS02:0x17:Digital signatures verified
18:44:38:WU00:FS02:0x17:Completed 0 out of 2000000 steps (0%)
18:46:12:WU00:FS02:0x17:Completed 20000 out of 2000000 steps (1%)
18:47:43:WU00:FS02:0x17:Completed 40000 out of 2000000 steps (2%)
18:48:22:WU02:FS01:0x17:Completed 880000 out of 2000000 steps (44%)
18:49:17:WU00:FS02:0x17:Completed 60000 out of 2000000 steps (3%)
18:49:56:WU02:FS01:0x17:Completed 900000 out of 2000000 steps (45%)
18:50:48:WU00:FS02:0x17:Completed 80000 out of 2000000 steps (4%)
18:51:32:WU02:FS01:0x17:Completed 920000 out of 2000000 steps (46%)
18:52:20:WU00:FS02:0x17:Completed 100000 out of 2000000 steps (5%)
18:53:06:WU02:FS01:0x17:Completed 940000 out of 2000000 steps (47%)
18:53:54:WU00:FS02:0x17:Completed 120000 out of 2000000 steps (6%)
18:54:43:WU02:FS01:0x17:Completed 960000 out of 2000000 steps (48%)
18:55:25:WU00:FS02:0x17:Completed 140000 out of 2000000 steps (7%)
18:56:17:WU02:FS01:0x17:Completed 980000 out of 2000000 steps (49%)
18:56:59:WU00:FS02:0x17:Completed 160000 out of 2000000 steps (8%)

...

19:32:24:WU00:FS02:0x17:Completed 620000 out of 2000000 steps (31%)
19:32:38:WU02:FS01:0x17:Completed 1440000 out of 2000000 steps (72%)
19:33:56:WU00:FS02:0x17:Completed 640000 out of 2000000 steps (32%)
19:34:15:WU02:FS01:0x17:Completed 1460000 out of 2000000 steps (73%)
19:34:49:WU00:FS02:0x17:Bad State detected... attempting to resume from last good checkpoint
19:35:48:WU02:FS01:0x17:Completed 1480000 out of 2000000 steps (74%)
19:36:20:WU00:FS02:0x17:Completed 620000 out of 2000000 steps (31%)
19:37:22:WU02:FS01:0x17:Completed 1500000 out of 2000000 steps (75%)
19:37:52:WU00:FS02:0x17:Completed 640000 out of 2000000 steps (32%)
19:38:45:WU00:FS02:0x17:Bad State detected... attempting to resume from last good checkpoint
19:38:59:WU02:FS01:0x17:Completed 1520000 out of 2000000 steps (76%)
19:40:17:WU00:FS02:0x17:Completed 620000 out of 2000000 steps (31%)
19:40:32:WU02:FS01:0x17:Completed 1540000 out of 2000000 steps (77%)
19:41:48:WU00:FS02:0x17:Completed 640000 out of 2000000 steps (32%)
19:42:09:WU02:FS01:0x17:Completed 1560000 out of 2000000 steps (78%)
19:42:42:WU00:FS02:0x17:Bad State detected... attempting to resume from last good checkpoint
19:42:42:WU00:FS02:0x17:Max number of retries reached. Aborting.
19:42:42:WU00:FS02:0x17:ERROR:exception: Max Retries Reached
19:42:42:WU00:FS02:0x17:Saving result file logfile_01.txt
19:42:42:WU00:FS02:0x17:Saving result file badStateCheckpoint_416496834
19:42:42:WU00:FS02:0x17:Saving result file badStateCheckpoint_798946464
19:42:43:WU00:FS02:0x17:Saving result file badStateCheckpoint_8351929
19:42:43:WU00:FS02:0x17:Saving result file badStateForceGroup0_416496834Core.xml
19:42:44:WU00:FS02:0x17:Saving result file badStateForceGroup0_416496834Ref.xml
19:42:45:WU00:FS02:0x17:Saving result file badStateForceGroup0_798946464Core.xml
19:42:46:WU00:FS02:0x17:Saving result file badStateForceGroup0_798946464Ref.xml
19:42:47:WU00:FS02:0x17:Saving result file badStateForceGroup0_8351929Core.xml
19:42:48:WU00:FS02:0x17:Saving result file badStateForceGroup0_8351929Ref.xml
19:42:48:WU00:FS02:0x17:Saving result file badStateForceGroup1_416496834Core.xml
19:42:49:WU00:FS02:0x17:Saving result file badStateForceGroup1_416496834Ref.xml
19:42:50:WU00:FS02:0x17:Saving result file badStateForceGroup1_798946464Core.xml
19:42:51:WU00:FS02:0x17:Saving result file badStateForceGroup1_798946464Ref.xml
19:42:52:WU00:FS02:0x17:Saving result file badStateForceGroup1_8351929Core.xml
19:42:52:WU00:FS02:0x17:Saving result file badStateForceGroup1_8351929Ref.xml
19:42:53:WU00:FS02:0x17:Saving result file badStateForceGroup2_416496834Core.xml
19:42:54:WU00:FS02:0x17:Saving result file badStateForceGroup2_416496834Ref.xml
19:42:55:WU00:FS02:0x17:Saving result file badStateForceGroup2_798946464Core.xml
19:42:55:WU00:FS02:0x17:Saving result file badStateForceGroup2_798946464Ref.xml
19:42:56:WU00:FS02:0x17:Saving result file badStateForceGroup2_8351929Core.xml
19:42:57:WU00:FS02:0x17:Saving result file badStateForceGroup2_8351929Ref.xml
19:42:58:WU00:FS02:0x17:Saving result file log.txt
19:42:58:WU00:FS02:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
[93m19:42:58:WARNING:WU00:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)[0m
19:42:58:WU00:FS02:Sending unit results: id:00 state:SEND error:FAULTY project:7810 run:0 clone:66 gen:402 core:0x17 unit:0x000001b50a3b1e8651d34661a08b8f39
19:42:58:WU00:FS02:Uploading 37.33MiB to 171.64.65.98
19:42:58:WU00:FS02:Connecting to 171.64.65.98:8080
19:42:58:WU01:FS02:Connecting to assign-GPU.stanford.edu:80
19:42:59:WU01:FS02:News: Welcome to Folding@Home
19:42:59:WU01:FS02:Assigned to work server 171.64.65.98
19:42:59:WU01:FS02:Requesting new work unit for slot 02: READY gpu:1:GK110 [GeForce GTX 780] from 171.64.65.98
19:42:59:WU01:FS02:Connecting to 171.64.65.98:8080
19:42:59:WU01:FS02:Downloading 2.07MiB
19:43:04:WU00:FS02:Upload 17.75%
19:43:05:WU01:FS02:Download 24.10%
19:43:10:WU00:FS02:Upload 34.99%
19:43:11:WU01:FS02:Download 60.25%
19:43:16:WU00:FS02:Upload 50.73%
19:43:17:WU01:FS02:Download 84.36%
19:43:21:WU01:FS02:Download complete
19:43:21:WU01:FS02:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:465 gen:423 core:0x17 unit:0x000001c70a3b1e8651d34ae9e3c92149
19:43:21:WU01:FS02:Starting
19:43:21:WU01:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/beta/Core_17.fah/FahCore_17 -dir 01 -suffix 01 -version 703 -lifeline 1243 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
19:43:21:WU01:FS02:Started FahCore on PID 10881
19:43:21:WU01:FS02:Core PID:10885
19:43:21:WU01:FS02:FahCore 0x17 started
19:43:22:WU01:FS02:0x17:*********************** Log Started 2013-11-10T19:43:21Z ***********************
19:43:22:WU01:FS02:0x17:Project: 7810 (Run 0, Clone 465, Gen 423)
19:43:22:WU01:FS02:0x17:Unit: 0x000001c70a3b1e8651d34ae9e3c92149
19:43:22:WU01:FS02:0x17:CPU: 0x00000000000000000000000000000000
19:43:22:WU01:FS02:0x17:Machine: 2
19:43:22:WU01:FS02:0x17:Reading tar file state.xml
19:43:22:WU01:FS02:0x17:Reading tar file system.xml
19:43:22:WU01:FS02:0x17:Reading tar file integrator.xml
19:43:22:WU01:FS02:0x17:Reading tar file core.xml
19:43:22:WU01:FS02:0x17:Digital signatures verified
19:43:22:WU00:FS02:Upload 61.28%
19:43:28:WU00:FS02:Upload 71.83%
19:43:34:WU00:FS02:Upload 83.55%
19:43:40:WU00:FS02:Upload 95.10%
19:43:41:WU01:FS02:0x17:Completed 0 out of 2000000 steps (0%)
19:43:42:WU02:FS01:0x17:Completed 1580000 out of 2000000 steps (79%)
19:43:43:WU00:FS02:Upload complete
19:43:43:WU00:FS02:Server responded WORK_ACK (400)
19:43:43:WU00:FS02:Cleaning up
19:45:14:WU01:FS02:0x17:Completed 20000 out of 2000000 steps (1%)
Before and after that WU the GPU was working ok; bad WU ?
ImageImage
Please contribute your logs to http://ppd.fahmm.net
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by P5-133XL »

Looks bad, It has failed on seven people so far. I marked it bad, so it won't be given out any more.
Image
ChristianVirtual
Posts: 1596
Joined: Tue May 28, 2013 12:14 pm
Location: Tokyo

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by ChristianVirtual »

Thank you for the quick action P5-133XL; good to know it will not come back to others, too.
ImageImage
Please contribute your logs to http://ppd.fahmm.net
Hans
Posts: 15
Joined: Mon Sep 02, 2013 9:21 am

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by Hans »

I'm still folding one and 7811 is doing the same .
ChristianVirtual
Posts: 1596
Joined: Tue May 28, 2013 12:14 pm
Location: Tokyo

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by ChristianVirtual »

Which Run/Clone/Gen ?

Can you share relevant parts from log file incl. config ? Makes it easier to get checked ...
ImageImage
Please contribute your logs to http://ppd.fahmm.net
Hans
Posts: 15
Joined: Mon Sep 02, 2013 9:21 am

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by Hans »

18:56:28:WU01:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
18:56:28:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
18:56:28:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:7811 run:0 clone:110 gen:536 core:0x17 unit:0x000002490a3b1e8651db472bbc4d0986

4 hours 16 mins 67658 7810 (0, 262, 516)
1 hours 00 mins 115082 7811 (0, 188, 538)
ChristianVirtual
Posts: 1596
Joined: Tue May 28, 2013 12:14 pm
Location: Tokyo

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by ChristianVirtual »

What GPU and Os are you using ? Any overclocked version ? Can you share the first part of the log file with with configuration ? Just remove your password or passkeys if visible.

As for checking the projects: this needs a helping hand of a mod to check if done by others ...
ImageImage
Please contribute your logs to http://ppd.fahmm.net
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by P5-133XL »

Someone else successfully completed project:7811 run:0 clone:110 gen:536

The problem is on your end rather than a faulty WU.
Image
Hans
Posts: 15
Joined: Mon Sep 02, 2013 9:21 am

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by Hans »

Shure whatever :)
The faulty WU is info from the logfile so ?
I've finnished also a lot of the 7810 and 7811.

So both wu's can have similar problems. ??
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by bruce »

Unfortunately the BAD_WORK_UNIT characterization does not guarantee that the WU was corrupt when it was downloaded. Things can happen to them after they're downloaded. Two somethat common reports: (1) An AV program decides something looks like a virus and "protects" your system by corrupting some of FAH's data. (2) Dust or overclocking or fan limitations push the temperatures in a system too close to unstable (or a voltage glitch or ??) and a hardware error occurs. Though it may not crash the OS, corrupt data may be introduced in the calculations which are detected by FAH's quality assurance software.

Failures of two WUs that did not fail on other hardware is not definitive proof of anything, but I'd take it as a strong indication that something in your system is too close to instability.
Hans
Posts: 15
Joined: Mon Sep 02, 2013 9:21 am

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by Hans »

It's brand new machine and working 24/7 ....

Another 7811 last night.

Code: Select all

03:15:35:WU02:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/admin/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_17.fah/FahCore_17.exe -dir 02 -suffix 01 -version 703 -lifeline 4744 -checkpoint 15 -gpu 1 -gpu-vendor ati
03:15:35:WU02:FS00:Started FahCore on PID 4392
03:15:35:WU02:FS00:Core PID:2236
03:15:35:WU02:FS00:FahCore 0x17 started
03:15:35:WU02:FS00:0x17:*********************** Log Started 2013-11-19T03:15:35Z ***********************
03:15:35:WU02:FS00:0x17:Project: 7811 (Run 0, Clone 524, Gen 491)
03:15:35:WU02:FS00:0x17:Unit: 0x000002180a3b1e8651db4ad9849b66d4
03:15:35:WU02:FS00:0x17:CPU: 0x00000000000000000000000000000000
03:15:35:WU02:FS00:0x17:Machine: 0
03:15:35:WU02:FS00:0x17:Reading tar file state.xml
03:15:35:WU02:FS00:0x17:Reading tar file system.xml
03:15:36:WU02:FS00:0x17:Reading tar file integrator.xml
03:15:36:WU02:FS00:0x17:Reading tar file core.xml
03:15:36:WU02:FS00:0x17:Digital signatures verified
03:15:41:WU01:FS00:Upload 7.59%
03:15:47:WU01:FS00:Upload 15.18%
03:15:53:WU01:FS00:Upload 22.77%
03:15:59:WU01:FS00:Upload 29.28%
03:16:01:WU02:FS00:0x17:Completed 0 out of 2000000 steps (0%)
03:16:05:WU01:FS00:Upload 36.87%
03:16:11:WU01:FS00:Upload 44.46%
03:16:17:WU01:FS00:Upload 52.05%
03:16:23:WU01:FS00:Upload 59.64%
03:16:29:WU01:FS00:Upload 67.24%
03:16:35:WU01:FS00:Upload 74.83%
03:16:41:WU01:FS00:Upload 82.42%
03:16:42:WU00:FS01:0x17:Completed 840000 out of 2000000 steps (42%)
03:16:47:WU01:FS00:Upload 88.92%
03:16:53:WU01:FS00:Upload 96.52%
03:17:00:WU01:FS00:Upload complete
03:17:00:WU01:FS00:Server responded WORK_ACK (400)
03:17:00:WU01:FS00:Final credit estimate, 13907.00 points
03:17:00:WU01:FS00:Cleaning up
03:17:46:WU02:FS00:0x17:Completed 20000 out of 2000000 steps (1%)
03:19:00:WU00:FS01:0x17:Completed 860000 out of 2000000 steps (43%)
03:19:25:WU02:FS00:0x17:Completed 40000 out of 2000000 steps (2%)
03:21:10:WU00:FS01:0x17:Completed 880000 out of 2000000 steps (44%)
03:21:11:WU02:FS00:0x17:Completed 60000 out of 2000000 steps (3%)
03:22:50:WU02:FS00:0x17:Completed 80000 out of 2000000 steps (4%)
03:23:19:WU00:FS01:0x17:Completed 900000 out of 2000000 steps (45%)
03:24:29:WU02:FS00:0x17:Completed 100000 out of 2000000 steps (5%)
03:25:38:WU00:FS01:0x17:Completed 920000 out of 2000000 steps (46%)
03:26:14:WU02:FS00:0x17:Completed 120000 out of 2000000 steps (6%)
03:27:47:WU00:FS01:0x17:Completed 940000 out of 2000000 steps (47%)
03:27:53:WU02:FS00:0x17:Completed 140000 out of 2000000 steps (7%)
03:29:39:WU02:FS00:0x17:Completed 160000 out of 2000000 steps (8%)
03:30:06:WU00:FS01:0x17:Completed 960000 out of 2000000 steps (48%)
03:31:18:WU02:FS00:0x17:Completed 180000 out of 2000000 steps (9%)
03:32:15:WU00:FS01:0x17:Completed 980000 out of 2000000 steps (49%)
03:32:57:WU02:FS00:0x17:Completed 200000 out of 2000000 steps (10%)
03:33:11:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
03:34:01:WU02:FS00:0x17:Completed 160000 out of 2000000 steps (8%)
03:34:25:WU00:FS01:0x17:Completed 1000000 out of 2000000 steps (50%)
03:35:39:WU02:FS00:0x17:Completed 180000 out of 2000000 steps (9%)
03:36:43:WU00:FS01:0x17:Completed 1020000 out of 2000000 steps (51%)
03:37:18:WU02:FS00:0x17:Completed 200000 out of 2000000 steps (10%)
03:37:33:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
03:38:22:WU02:FS00:0x17:Completed 160000 out of 2000000 steps (8%)
03:38:53:WU00:FS01:0x17:Completed 1040000 out of 2000000 steps (52%)
03:40:01:WU02:FS00:0x17:Completed 180000 out of 2000000 steps (9%)
03:41:12:WU00:FS01:0x17:Completed 1060000 out of 2000000 steps (53%)
03:41:40:WU02:FS00:0x17:Completed 200000 out of 2000000 steps (10%)
03:41:54:WU02:FS00:0x17:Bad State detected... attempting to resume from last good checkpoint
03:41:54:WU02:FS00:0x17:Max number of retries reached. Aborting.
03:41:54:WU02:FS00:0x17:ERROR:exception: Max Retries Reached
03:41:54:WU02:FS00:0x17:Saving result file logfile_01.txt
03:41:54:WU02:FS00:0x17:Saving result file badStateCheckpoint_18467
03:41:54:WU02:FS00:0x17:Saving result file badStateCheckpoint_41
03:41:54:WU02:FS00:0x17:Saving result file badStateCheckpoint_6334
03:41:55:WU02:FS00:0x17:Saving result file badStateForceGroup0_18467Core.xml
03:41:55:WU02:FS00:0x17:Saving result file badStateForceGroup0_18467Ref.xml
03:41:56:WU02:FS00:0x17:Saving result file badStateForceGroup0_41Core.xml
03:41:57:WU02:FS00:0x17:Saving result file badStateForceGroup0_41Ref.xml
03:41:58:WU02:FS00:0x17:Saving result file badStateForceGroup0_6334Core.xml
03:41:59:WU02:FS00:0x17:Saving result file badStateForceGroup0_6334Ref.xml
03:41:59:WU02:FS00:0x17:Saving result file badStateForceGroup1_18467Core.xml
03:42:00:WU02:FS00:0x17:Saving result file badStateForceGroup1_18467Ref.xml
03:42:01:WU02:FS00:0x17:Saving result file badStateForceGroup1_41Core.xml
03:42:01:WU02:FS00:0x17:Saving result file badStateForceGroup1_41Ref.xml
03:42:02:WU02:FS00:0x17:Saving result file badStateForceGroup1_6334Core.xml
03:42:03:WU02:FS00:0x17:Saving result file badStateForceGroup1_6334Ref.xml
03:42:03:WU02:FS00:0x17:Saving result file badStateForceGroup2_18467Core.xml
03:42:04:WU02:FS00:0x17:Saving result file badStateForceGroup2_18467Ref.xml
03:42:04:WU02:FS00:0x17:Saving result file badStateForceGroup2_41Core.xml
03:42:05:WU02:FS00:0x17:Saving result file badStateForceGroup2_41Ref.xml
03:42:05:WU02:FS00:0x17:Saving result file badStateForceGroup2_6334Core.xml
03:42:06:WU02:FS00:0x17:Saving result file badStateForceGroup2_6334Ref.xml
03:42:07:WU02:FS00:0x17:Saving result file log.txt
03:42:07:WU02:FS00:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
03:42:07:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
03:42:07:WU02:FS00:Sending unit results: id:02 state:SEND error:FAULTY project:7811 run:0 clone:524 gen:491 core:0x17 unit:0x000002180a3b1e8651db4ad9849b66d4
03:42:07:WU02:FS00:Uploading 28.24MiB to 171.64.65.98
03:42:07:WU02:FS00:Connecting to 171.64.65.98:8080
03:42:07:WU01:FS00:Connecting to assign-GPU.stanford.edu:80
03:42:08:WU01:FS00:News: Welcome to Folding@Home
03:42:08:WU01:FS00:Assigned to work server 171.64.65.98
03:42:08:WU01:FS00:Requesting new work unit for slot 00: READY gpu:1:Tahiti [Radeon HD 7900 Series] from 171.64.65.98
03:42:08:WU01:FS00:Connecting to 171.64.65.98:8080
03:42:09:WU01:FS00:Downloading 2.09MiB
03:42:13:WU01:FS00:Download complete
03:42:13:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7810 run:0 clone:153 gen:437 core:0x17 unit:0x000001d00a3b1e8651d3475d01d47255
and hell I don't know :)
bollix47
Posts: 2942
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by bollix47 »

It's not unusual for a core 17 work unit to fail if the GPU is overclocked. If that's the case try returning the memory clock back to stock to see if that helps. The memory clock speed has very little effect on folding performance and reducing it can result in fewer failures.

Currently your latest work unit has failed for a number of folders but that's not necessarily an indication of a bad work unit for core 17. I have seen this happen numerous times, especially with Run 0, but eventually someone does complete the work successfully.
Hans
Posts: 15
Joined: Mon Sep 02, 2013 9:21 am

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by Hans »

Thank you all.

i think its safe to say that the 7810 WU is stil available and therefor not faulty.
My new compu is doing mainly 7810 at the time and some 7811.
I'll monitor the log file more closely for e while cause i think there where no problems with
the 9800 WU's on my side the past months.
Clocking i did not do on purpose ( folding is the only game the pc plays and i know overclocking to be of no use)but i am a little in doubt about the catalys control center ??
Should it continue i'll try using one gpu to see.
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Project: 7810 (Run 0, Clone 66, Gen 402)

Post by PantherX »

Hans wrote:...Clocking i did not do on purpose ( folding is the only game the pc plays and i know overclocking to be of no use)but i am a little in doubt about the catalys control center ??...
Generally speaking, overclocking your GPU to a folding stable setting would increase your PPD. However, the increase would vary on the amount of OC and the Project. For F@H, overclocking the Shaders is more useful than overclocking the memory. Please note that overclocking would require you to monitor your GPU for instabilities for future projects since they may be more computationally intensive and thus, could push your once stable OC into the unstable zone. For a (mostly) hassle free option, use the vendor stock settings.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Post Reply