Project: 9406 Run:726 Clone:0 Gen:38 FAULTY from start

Moderators: Site Moderators, FAHC Science Team

Nicolas_orleans
Posts: 106
Joined: Wed Aug 08, 2012 3:08 am


Post by Nicolas_orleans »

Log:

15:31:59:WU01:FS02:Connecting to 171.67.108.201:80
15:32:00:WU01:FS02:Assigned to work server 171.64.65.56
15:32:00:WU01:FS02:Requesting new work unit for slot 02: RUNNING gpu:1:GK104 [GeForce GTX 770] from 171.64.65.56
15:32:00:WU01:FS02:Connecting to 171.64.65.56:8080
15:32:01:WU01:FS02:Downloading 4.73MiB
15:32:07:WU01:FS02:Download 69.98%
15:32:08:WU01:FS02:Download complete
15:32:08:WU01:FS02:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9406 run:726 clone:0 gen:38 core:0x17 unit:0x000000380a3b1e5c533e4fbf57d2c050
15:32:08:WU01:FS02:Starting
15:32:08:WU01:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/web.stanford.edu/~pande/Linux/AMD64/NVIDIA/Fermi/beta/Core_17.fah/FahCore_17 -dir 01 -suffix 01 -version 704 -lifeline 1012 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
15:32:08:WU01:FS02:Started FahCore on PID 3403
15:32:08:WU01:FS02:Core PID:3407
15:32:08:WU01:FS02:FahCore 0x17 started
15:32:09:WU01:FS02:0x17:*********************** Log Started 2014-05-28T15:32:08Z ***********************
15:32:09:WU01:FS02:0x17:Project: 9406 (Run 726, Clone 0, Gen 38)
15:32:09:WU01:FS02:0x17:Unit: 0x000000380a3b1e5c533e4fbf57d2c050
15:32:09:WU01:FS02:0x17:CPU: 0x00000000000000000000000000000000
15:32:09:WU01:FS02:0x17:Machine: 2
15:32:09:WU01:FS02:0x17:Reading tar file state.xml
15:32:09:WU01:FS02:0x17:Reading tar file system.xml
15:32:09:WU01:FS02:0x17:Reading tar file integrator.xml
15:32:09:WU01:FS02:0x17:Reading tar file core.xml
15:32:09:WU01:FS02:0x17:Digital signatures verified
15:34:47:WU01:FS02:0x17:ERROR:exception: Potential energy error of 14.2157, threshold of 10
15:34:47:WU01:FS02:0x17:ERROR:Reference Potential Energy: -846644 | Given Potential Energy: -846658
15:34:47:WU01:FS02:0x17:Saving result file logfile_01.txt
15:34:47:WU01:FS02:0x17:Saving result file badStateCheckpoint_1443051361
15:34:48:WU01:FS02:0x17:Saving result file badStateForceGroup0_1443051361Core.xml
15:34:50:WU01:FS02:0x17:Saving result file badStateForceGroup0_1443051361Ref.xml
15:34:53:WU01:FS02:0x17:Saving result file badStateForceGroup1_1443051361Core.xml
15:34:56:WU01:FS02:0x17:Saving result file badStateForceGroup1_1443051361Ref.xml
15:34:58:WU01:FS02:0x17:Saving result file badStateForceGroup2_1443051361Core.xml
15:35:00:WU01:FS02:0x17:Saving result file badStateForceGroup2_1443051361Ref.xml
15:35:02:WU01:FS02:0x17:Saving result file log.txt
15:35:02:WU01:FS02:0x17:Folding@home Core Shutdown: BAD_WORK_UNIT
15:35:02:WARNING:WU01:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
15:35:02:WU01:FS02:Sending unit results: id:01 state:SEND error:FAULTY project:9406 run:726 clone:0 gen:38 core:0x17 unit:0x000000380a3b1e5c533e4fbf57d2c050
15:35:02:WU01:FS02:Uploading 27.42MiB to 171.64.65.56
15:35:02:WU01:FS02:Connecting to 171.64.65.56:8080
15:37:09:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
15:37:09:WU01:FS02:Connecting to 171.64.65.56:80
15:39:17:WARNING:WU01:FS02:Exception: Failed to send results to work server: Failed to connect to 171.64.65.56:80: Connection timed out
15:39:17:WU01:FS02:Trying to send results to collection server
15:39:17:WU01:FS02:Uploading 27.42MiB to 171.65.103.160
15:39:17:WU01:FS02:Connecting to 171.65.103.160:8080
15:39:23:WU01:FS02:Upload 1.82%
15:39:30:WU01:FS02:Upload 4.10%
15:39:36:WU01:FS02:Upload 5.70%
15:39:43:WU01:FS02:Upload 7.75%
15:39:50:WU01:FS02:Upload 10.03%
15:39:56:WU01:FS02:Upload 11.63%
15:40:02:WU01:FS02:Upload 13.45%
15:40:08:WU01:FS02:Upload 15.27%
15:40:14:WU01:FS02:Upload 17.10%
15:40:20:WU01:FS02:Upload 18.92%
15:40:26:WU01:FS02:Upload 20.74%
15:40:33:WU01:FS02:Upload 22.57%
15:40:39:WU01:FS02:Upload 24.39%
15:40:46:WU01:FS02:Upload 25.99%
15:40:53:WU01:FS02:Upload 28.27%
15:41:00:WU01:FS02:Upload 30.32%
15:41:07:WU01:FS02:Upload 32.37%
15:41:13:WU01:FS02:Upload 34.19%
15:41:20:WU01:FS02:Upload 36.25%
15:41:26:WU01:FS02:Upload 38.07%
15:41:32:WU01:FS02:Upload 39.89%
15:41:39:WU01:FS02:Upload 41.72%
15:41:45:WU01:FS02:Upload 43.54%
15:41:51:WU01:FS02:Upload 45.37%
15:41:57:WU01:FS02:Upload 46.96%
15:42:03:WU01:FS02:Upload 48.78%
15:42:10:WU01:FS02:Upload 50.61%
15:42:18:WU01:FS02:Upload 52.89%
15:42:24:WU01:FS02:Upload 54.71%
15:42:30:WU01:FS02:Upload 56.31%
15:42:36:WU01:FS02:Upload 58.13%
15:42:42:WU01:FS02:Upload 59.96%
15:42:48:WU01:FS02:Upload 61.78%
15:42:55:WU01:FS02:Upload 63.60%
15:43:01:WU01:FS02:Upload 65.65%
15:43:08:WU01:FS02:Upload 67.48%
15:43:14:WU01:FS02:Upload 69.30%
15:43:21:WU01:FS02:Upload 71.13%
15:43:27:WU01:FS02:Upload 72.95%
15:43:33:WU01:FS02:Upload 74.77%
15:43:39:WU01:FS02:Upload 76.60%
15:43:45:WU01:FS02:Upload 78.42%
15:43:52:WU01:FS02:Upload 79.79%
15:43:58:WU01:FS02:Upload 81.16%
15:44:04:WU01:FS02:Upload 82.75%
15:44:10:WU01:FS02:Upload 84.58%
15:44:16:WU01:FS02:Upload 86.40%
15:44:22:WU01:FS02:Upload 87.77%
15:44:28:WU01:FS02:Upload 89.59%
15:44:34:WU01:FS02:Upload 91.41%
15:44:40:WU01:FS02:Upload 93.24%
15:44:46:WU01:FS02:Upload 94.61%
15:44:52:WU01:FS02:Upload 96.66%
15:44:58:WU01:FS02:Upload 98.25%
15:45:07:WU01:FS02:Upload complete
15:45:07:WU01:FS02:Server responded WORK_ACK (400)
15:45:07:WU01:FS02:Cleaning up
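For readers following along, the failure in the log above comes down to two ERROR lines: the core reports a potential energy error of 14.2157 against a threshold of 10, along with a reference and a given energy. The following is a minimal sketch that pulls those numbers out of the log text and compares them. The function name `parse_energy_error` is hypothetical, and the assumption that the check is a plain "error greater than threshold" comparison is inferred from the log wording, not from the core's source.

```python
import re

# The two ERROR lines, copied verbatim from the log above.
LOG = """\
15:34:47:WU01:FS02:0x17:ERROR:exception: Potential energy error of 14.2157, threshold of 10
15:34:47:WU01:FS02:0x17:ERROR:Reference Potential Energy: -846644 | Given Potential Energy: -846658
"""

def parse_energy_error(log):
    """Extract the reported error, threshold, and both energies (hypothetical helper)."""
    err = re.search(
        r"Potential energy error of ([-\d.]+), threshold of ([-\d.]+)", log)
    pair = re.search(
        r"Reference Potential Energy: ([-\d.]+) \| Given Potential Energy: ([-\d.]+)", log)
    if not (err and pair):
        return None  # the log does not contain this kind of failure
    return {
        "error": float(err.group(1)),
        "threshold": float(err.group(2)),
        "reference": float(pair.group(1)),
        "given": float(pair.group(2)),
    }

info = parse_energy_error(LOG)
# Difference recomputed from the printed (rounded) energies.
printed_diff = abs(info["reference"] - info["given"])
# Assumption: the unit is rejected when the reported error exceeds the threshold.
exceeds = info["error"] > info["threshold"]
print(info, printed_diff, exceeds)
```

The difference recomputed from the printed energies is slightly smaller than the reported 14.2157, presumably because the log rounds the energies to whole numbers.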
System: no known cooling or stability issues (only 8 WU failures in 10 months of running almost 24/7).

*********************** Log Started 2014-05-27T08:44:09Z ***********************
08:44:09:************************* Folding@home Client *************************
08:44:09:    Website: http://folding.stanford.edu/
08:44:09:  Copyright: (c) 2009-2014 Stanford University
08:44:09:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
08:44:09:       Args: --child --lifeline 1010 /etc/fahclient/config.xml --run-as
08:44:09:             fahclient --pid-file=/var/run/fahclient.pid --daemon
08:44:09:     Config: /etc/fahclient/config.xml
08:44:09:******************************** Build ********************************
08:44:09:    Version: 7.4.4
08:44:09:       Date: Mar 4 2014
08:44:09:       Time: 12:02:38
08:44:09:    SVN Rev: 4130
08:44:09:     Branch: fah/trunk/client
08:44:09:   Compiler: GNU 4.4.7
08:44:09:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
08:44:09:             -fno-unsafe-math-optimizations -msse2
08:44:09:   Platform: linux2 3.2.0-1-amd64
08:44:09:       Bits: 64
08:44:09:       Mode: Release
08:44:09:******************************* System ********************************
08:44:09:        CPU: Intel(R) Celeron(R) CPU G1610 @ 2.60GHz
08:44:09:     CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
08:44:09:       CPUs: 2
08:44:09:     Memory: 3.81GiB
08:44:09:Free Memory: 3.37GiB
08:44:09:    Threads: POSIX_THREADS
08:44:09: OS Version: 3.8
08:44:09:Has Battery: false
08:44:09: On Battery: false
08:44:09: UTC Offset: 2
08:44:09:        PID: 1012
08:44:09:        CWD: /var/lib/fahclient
08:44:09:         OS: Linux 3.8.0-27-generic x86_64
08:44:09:    OS Arch: AMD64
08:44:09:       GPUs: 2
08:44:09:      GPU 0: NVIDIA:3 GK104 [GeForce GTX 770]
08:44:09:      GPU 1: NVIDIA:3 GK104 [GeForce GTX 770]
08:44:09:       CUDA: 3.0
08:44:09:CUDA Driver: 5050
08:44:09:***********************************************************************
08:44:09:<config>
08:44:09:  <!-- Client Control -->
08:44:09:  <fold-anon v='true'/>
08:44:09:
08:44:09:  <!-- HTTP Server -->
08:44:09:  <allow v='127.0.0.1,192.168.1.32'/>
08:44:09:
08:44:09:  <!-- Network -->
08:44:09:  <proxy v=':8080'/>
08:44:09:
08:44:09:  <!-- Remote Command Server -->
08:44:09:  <password v='******'/>
08:44:09:
08:44:09:  <!-- Slot Control -->
08:44:09:  <power v='full'/>
08:44:09:
08:44:09:  <!-- User Information -->
08:44:09:  <passkey v='********************************'/>
08:44:09:  <team v='33'/>
08:44:09:  <user v='Nicolas_orleans'/>
08:44:09:
08:44:09:  <!-- Folding Slots -->
08:44:09:  <slot id='1' type='GPU'>
08:44:09:    <client-type v='beta'/>
08:44:09:    <gpu-index v='0'/>
08:44:09:    <next-unit-percentage v='100'/>
08:44:09:  </slot>
08:44:09:  <slot id='2' type='GPU'>
08:44:09:    <client-type v='beta'/>
08:44:09:    <gpu-index v='1'/>
08:44:09:    <next-unit-percentage v='100'/>
08:44:09:  </slot>
08:44:09:</config>
MSI Z77A-GD55 - i5-3550 - 16 Go RAM - GTX 980 Ti Hybrid @1461 MHz + GTX 770 @ 1124 MHz + GTX 750 Ti @ 1306 MHz - Ubuntu 16.10
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 9406 Run:726 Clone:0 Gen:38 FAULTY from start

Post by bruce »

According to your log, your client attempted to report the bad WU to the Work Server 171.64.65.56 but failed. (The server was down briefly this morning.) It then successfully sent the results to the Collection Server at 171.65.103.160. That report does not appear in the Moderator Database yet, nor has the WU been reassigned and reported by anyone else. We'll have to wait until the records are updated before we can tell you anything else about that WU.
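The upload sequence described here (Work Server on port 8080, retry on port 80, then the Collection Server) can be read straight out of the client log. Below is a rough sketch that lists each connection attempt and marks the one that ended in "Upload complete". The helper name `upload_attempts` is made up for illustration, and the matching relies only on the line formats visible in the log excerpt from the first post; other client versions may word these lines differently.

```python
import re

# Upload-related lines copied from the log in the first post
# (progress percentage lines omitted for brevity).
LOG = """\
15:35:02:WU01:FS02:Uploading 27.42MiB to 171.64.65.56
15:35:02:WU01:FS02:Connecting to 171.64.65.56:8080
15:37:09:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
15:37:09:WU01:FS02:Connecting to 171.64.65.56:80
15:39:17:WARNING:WU01:FS02:Exception: Failed to send results to work server: Failed to connect to 171.64.65.56:80: Connection timed out
15:39:17:WU01:FS02:Trying to send results to collection server
15:39:17:WU01:FS02:Uploading 27.42MiB to 171.65.103.160
15:39:17:WU01:FS02:Connecting to 171.65.103.160:8080
15:45:07:WU01:FS02:Upload complete
15:45:07:WU01:FS02:Server responded WORK_ACK (400)
"""

def upload_attempts(log):
    """Return each 'Connecting to host:port' attempt in order (hypothetical helper).

    An attempt is marked ok only if 'Upload complete' appears before the
    next connection attempt.
    """
    attempts = []
    for line in log.splitlines():
        m = re.search(r":Connecting to (\d+\.\d+\.\d+\.\d+):(\d+)$", line)
        if m:
            attempts.append({"host": m.group(1), "port": int(m.group(2)), "ok": False})
        elif line.endswith("Upload complete") and attempts:
            attempts[-1]["ok"] = True
    return attempts

attempts = upload_attempts(LOG)
for a in attempts:
    status = "ok" if a["ok"] else "failed"
    print(a["host"], a["port"], status)
```

On this excerpt the sketch finds three attempts, with only the last one (the Collection Server) succeeding, which matches the sequence described above.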
bollix47
Posts: 2941
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 9406 Run:726 Clone:0 Gen:38 FAULTY from start

Post by bollix47 »

Another folder was able to complete the WU successfully:

Hi xxxx (team xxxx),
Your WU (P9406 R726 C0 G38) was added to the stats database on 2014-06-01 06:04:59 for 36834 points of credit.