failure to connect to the collection server

Moderators: Site Moderators, PandeGroup

failure to connect to the collection server

Postby djgibbons » Sun Jan 28, 2018 2:43 pm

I have a NVidia Quadro K4200 running driver 390.65. The current job is 100% complete but there is a failure to connect to the collection server. The log has this information:
14:03:00:WU01:FS00:Trying to send results to collection server
14:03:00:WU01:FS00:Uploading 94.63KiB to 171.67.108.25
14:03:00:WU01:FS00:Connecting to 171.67.108.25:8080
14:03:21:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
14:03:21:WU01:FS00:Connecting to 171.67.108.25:80
14:03:42:ERROR:WU01:FS00:Exception: Failed to connect to 171.67.108.25:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

This has been going on for many days now. The GPU job is stuck in SEND status. Apparently there is hundreds of billions of years left to complete the work.

1) Is this a normal state?
2) Is there a way to help the SEND take place?

Duncan
djgibbons
 
Posts: 19
Joined: Tue Sep 24, 2013 11:02 pm

Re: failure to connect to the collection server

Postby Joe_H » Sun Jan 28, 2018 5:54 pm

You have shown too little of the log entry, what is the WU that is being returned and what is the WS it also is being uploaded to? The CS is the fallback, so returns are first tried to the WS before the CS.

171.67.108.25 is not shown on the Server Status page, so it is either an old CS that was configured for the project at some time in the past, or a new one that has not been added yet to the page. The best that I can tell is that it is not accepting connections, I tried the address in a browser window. We need the WU and WS information to check if the WS is down.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Joe_H
Site Admin
 
Posts: 4210
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: failure to connect to the collection server

Postby bruce » Mon Jan 29, 2018 6:43 am

See the signature block below (in this post).

Do you connect through a proxy serveror is this a normal home connection?
bruce
 
Posts: 21679
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: failure to connect to the collection server

Postby djgibbons » Tue Jan 30, 2018 12:58 am

This is a normal home connection. Log below with (xx%) lines and repetition deleted as needed:

Code: Select all
*********************** Log Started 2018-01-21T18:00:24Z ***********************
18:00:24:************************* Folding@home Client *************************
18:00:24:      Website: http://folding.stanford.edu/
18:00:24:    Copyright: (c) 2009-2014 Stanford University
18:00:24:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
18:00:24:         Args: --open-web-control
18:00:24:       Config: C:/Users/Duncan/AppData/Roaming/FAHClient/config.xml
18:00:24:******************************** Build ********************************
18:00:24:      Version: 7.4.4
18:00:24:         Date: Mar 4 2014
18:00:24:         Time: 20:26:54
18:00:24:      SVN Rev: 4130
18:00:24:       Branch: fah/trunk/client
18:00:24:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
18:00:24:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
18:00:24:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
18:00:24:     Platform: win32 XP
18:00:24:         Bits: 32
18:00:24:         Mode: Release
18:00:24:******************************* System ********************************
18:00:24:          CPU: Intel(R) Core(TM) i7 CPU 870 @ 2.93GHz
18:00:24:       CPU ID: GenuineIntel Family 6 Model 30 Stepping 5
18:00:24:         CPUs: 8
18:00:24:       Memory: 7.99GiB
18:00:24:  Free Memory: 5.76GiB
18:00:24:      Threads: WINDOWS_THREADS
18:00:24:   OS Version: 6.2
18:00:24:  Has Battery: false
18:00:24:   On Battery: false
18:00:24:   UTC Offset: -5
18:00:24:          PID: 9192
18:00:24:          CWD: C:/Users/Duncan/AppData/Roaming/FAHClient
18:00:24:           OS: Windows 10 Pro
18:00:24:      OS Arch: AMD64
18:00:24:         GPUs: 1
18:00:24:        GPU 0: NVIDIA:3 GK104 [Quadro K4200]
18:00:24:         CUDA: 3.0
18:00:24:  CUDA Driver: 9010
18:00:24:Win32 Service: false
18:00:24:***********************************************************************
18:00:25:<config>
18:00:25:  <!-- Network -->
18:00:25:  <proxy v=':8080'/>
18:00:25:
18:00:25:  <!-- Slot Control -->
18:00:25:  <pause-on-battery v='false'/>
18:00:25:  <power v='LIGHT'/>
18:00:25:
18:00:25:  <!-- User Information -->
18:00:25:  <user v='Duncan_Gibbons'/>
18:00:25:
18:00:25:  <!-- Folding Slots -->
18:00:25:  <slot id='0' type='GPU'>
18:00:25:    <idle v='true'/>
18:00:25:  </slot>
18:00:25:  <slot id='1' type='CPU'>
18:00:25:    <idle v='true'/>
18:00:25:  </slot>
18:00:25:</config>
18:00:25:Trying to access database...
18:00:25:Successfully acquired database lock
18:00:25:Enabled folding slot 00: PAUSED gpu:0:GK104 [Quadro K4200] (waiting for idle)
18:00:25:Enabled folding slot 01: PAUSED cpu:4 (waiting for idle)
18:00:25:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
18:00:25:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
18:00:25:WU01:FS00:Connecting to 171.67.108.11:8080
18:00:34:WU02:FS01:Starting
18:00:34:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
18:00:34:WU02:FS01:Started FahCore on PID 11364
18:00:34:WU02:FS01:Core PID:11396
18:00:34:WU02:FS01:FahCore 0xa4 started
18:00:35:WU02:FS01:0xa4:
18:00:35:WU02:FS01:0xa4:*------------------------------*
18:00:35:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
18:00:35:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
18:00:35:WU02:FS01:0xa4:
18:00:35:WU02:FS01:0xa4:Preparing to commence simulation
18:00:35:WU02:FS01:0xa4:- Looking at optimizations...
18:00:35:WU02:FS01:0xa4:- Files status OK
18:00:35:WU02:FS01:0xa4:- Expanded 739106 -> 1929672 (decompressed 261.0 percent)
18:00:35:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=739106 data_size=1929672, decompressed_data_size=1929672 diff=0
18:00:35:WU02:FS01:0xa4:- Digital signature verified
18:00:35:WU02:FS01:0xa4:
18:00:35:WU02:FS01:0xa4:Project: 14005 (Run 0, Clone 524, Gen 6)
18:00:35:WU02:FS01:0xa4:
18:00:35:WU02:FS01:0xa4:Assembly optimizations on if available.
18:00:35:WU02:FS01:0xa4:Entering M.D.
18:00:41:WU02:FS01:0xa4:Using Gromacs checkpoints
18:00:41:WU02:FS01:0xa4:Mapping NT from 4 to 4
18:00:41:WU02:FS01:0xa4:Resuming from checkpoint
18:00:41:WU02:FS01:0xa4:Verified 02/wudata_01.log
18:00:41:WU02:FS01:0xa4:Verified 02/wudata_01.trr
18:00:41:WU02:FS01:0xa4:Verified 02/wudata_01.xtc
18:00:41:WU02:FS01:0xa4:Verified 02/wudata_01.edr
18:00:41:WU02:FS01:0xa4:Completed 52790 out of 2500000 steps  (2%)

18:01:26:Removing old file 'configs/config-20171230-195107.xml'
18:01:26:Saving configuration to config.xml
18:01:26:<config>
18:01:26:  <!-- Network -->
18:01:26:  <proxy v=':8080'/>
18:01:26:
18:01:26:  <!-- Slot Control -->
18:01:26:  <pause-on-battery v='false'/>
18:01:26:  <power v='LIGHT'/>
18:01:26:
18:01:26:  <!-- User Information -->
18:01:26:  <user v='Duncan_Gibbons'/>
18:01:26:
18:01:26:  <!-- Folding Slots -->
18:01:26:  <slot id='0' type='GPU'/>
18:01:26:  <slot id='1' type='CPU'/>
18:01:26:</config>
18:01:28:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
18:01:28:WU01:FS00:Connecting to 171.67.108.25:80
18:01:49:ERROR:WU01:FS00:Exception: Failed to connect to 171.67.108.25:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

******************************* Date: 2018-01-22 *******************************

02:40:14:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:40:15:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:40:15:WU01:FS00:Connecting to 171.67.108.11:8080
02:40:36:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:40:36:WU01:FS00:Connecting to 171.67.108.11:80
02:40:57:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

******************************* Date: 2018-01-23 *******************************

02:02:15:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:02:15:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:02:15:WU01:FS00:Connecting to 171.67.108.11:8080
02:02:36:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:02:36:WU01:FS00:Connecting to 171.67.108.11:80
02:02:57:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

15:59:50:WU00:FS01:Connecting to 171.67.108.45:8080
15:59:51:WU00:FS01:Assigned to work server 155.247.166.219
15:59:51:WU00:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.219
15:59:51:WU00:FS01:Connecting to 155.247.166.219:8080
15:59:51:WU00:FS01:Downloading 141.38KiB
15:59:51:WU00:FS01:Download complete
15:59:51:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13743 run:20 clone:18 gen:47 core:0xa7 unit:0x000000300002894b59d5a2cae00f1174
16:28:10:WU02:FS01:0xa4:Completed 2500000 out of 2500000 steps  (100%)
16:28:11:WU02:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
16:28:21:WU02:FS01:0xa4:
16:28:21:WU02:FS01:0xa4:Finished Work Unit:
16:28:21:WU02:FS01:0xa4:- Reading up to 4508208 from "02/wudata_01.trr": Read 4508208
16:28:21:WU02:FS01:0xa4:trr file hash check passed.
16:28:21:WU02:FS01:0xa4:- Reading up to 501844 from "02/wudata_01.xtc": Read 501844
16:28:21:WU02:FS01:0xa4:xtc file hash check passed.
16:28:21:WU02:FS01:0xa4:edr file hash check passed.
16:28:21:WU02:FS01:0xa4:logfile size: 65250
16:28:21:WU02:FS01:0xa4:Leaving Run
16:28:26:WU02:FS01:0xa4:- Writing 5112370 bytes of core data to disk...
16:28:27:WU02:FS01:0xa4:Done: 5111858 -> 4341216 (compressed to 84.9 percent)
16:28:27:WU02:FS01:0xa4:  ... Done.
16:28:27:WU02:FS01:0xa4:- Shutting down core
16:28:27:WU02:FS01:0xa4:
16:28:27:WU02:FS01:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
16:28:28:WU02:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
16:28:28:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:14005 run:0 clone:524 gen:6 core:0xa4 unit:0x000000060002894c59e4cffd66316bdc
16:28:28:WU02:FS01:Uploading 4.14MiB to 155.247.166.220
16:28:28:WU00:FS01:Starting
16:28:28:WU02:FS01:Connecting to 155.247.166.220:8080
16:28:28:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
16:28:28:WU00:FS01:Started FahCore on PID 3296
16:28:28:WU00:FS01:Core PID:5824
16:28:28:WU00:FS01:FahCore 0xa7 started
16:28:29:WU00:FS01:0xa7:*********************** Log Started 2018-01-23T16:28:29Z ***********************
16:28:29:WU00:FS01:0xa7:************************** Gromacs Folding@home Core ***************************
16:28:29:WU00:FS01:0xa7:       Type: 0xa7
16:28:29:WU00:FS01:0xa7:       Core: Gromacs
16:28:29:WU00:FS01:0xa7:    Website: http://folding.stanford.edu/
16:28:29:WU00:FS01:0xa7:  Copyright: (c) 2009-2016 Stanford University
16:28:29:WU00:FS01:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
16:28:29:WU00:FS01:0xa7:       Args: -dir 00 -suffix 01 -version 704 -lifeline 3296 -checkpoint 15 -np 4
16:28:29:WU00:FS01:0xa7:     Config: <none>
16:28:29:WU00:FS01:0xa7:************************************ Build *************************************
16:28:29:WU00:FS01:0xa7:    Version: 0.0.16
16:28:29:WU00:FS01:0xa7:       Date: Oct 31 2017
16:28:29:WU00:FS01:0xa7:       Time: 14:04:33
16:28:29:WU00:FS01:0xa7: Repository: Git
16:28:29:WU00:FS01:0xa7:   Revision: 2f0a8a3d0b0698be48154fe99a0216f289060932
16:28:29:WU00:FS01:0xa7:     Branch: master
16:28:29:WU00:FS01:0xa7:   Compiler: Visual C++ 2008
16:28:29:WU00:FS01:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
16:28:29:WU00:FS01:0xa7:   Platform: win32 10
16:28:29:WU00:FS01:0xa7:       Bits: 64
16:28:29:WU00:FS01:0xa7:       Mode: Release
16:28:29:WU00:FS01:0xa7:       SIMD: sse2
16:28:29:WU00:FS01:0xa7:************************************ System ************************************
16:28:29:WU00:FS01:0xa7:        CPU: Unknown
16:28:29:WU00:FS01:0xa7:     CPU ID:
16:28:29:WU00:FS01:0xa7:       CPUs: 8
16:28:29:WU00:FS01:0xa7:     Memory: 7.99GiB
16:28:29:WU00:FS01:0xa7:Free Memory: 5.73GiB
16:28:29:WU00:FS01:0xa7:    Threads: WINDOWS_THREADS
16:28:29:WU00:FS01:0xa7: OS Version: 6.2
16:28:29:WU00:FS01:0xa7:Has Battery: false
16:28:29:WU00:FS01:0xa7: On Battery: false
16:28:29:WU00:FS01:0xa7: UTC Offset: -5
16:28:29:WU00:FS01:0xa7:        PID: 5824
16:28:29:WU00:FS01:0xa7:        CWD: C:\Users\Duncan\AppData\Roaming\FAHClient\work
16:28:29:WU00:FS01:0xa7:         OS: Windows 10 Pro
16:28:29:WU00:FS01:0xa7:    OS Arch: AMD64
16:28:29:WU00:FS01:0xa7:********************************************************************************
16:28:29:WU00:FS01:0xa7:Project: 13743 (Run 20, Clone 18, Gen 47)
16:28:29:WU00:FS01:0xa7:Unit: 0x000000300002894b59d5a2cae00f1174
16:28:29:WU00:FS01:0xa7:Reading tar file core.xml
16:28:29:WU00:FS01:0xa7:Reading tar file frame47.tpr
16:28:29:WU00:FS01:0xa7:Digital signatures verified
16:28:29:WU00:FS01:0xa7:Calling: mdrun -s frame47.tpr -o frame47.trr -cpt 15 -nt 4
16:28:29:WU00:FS01:0xa7:Steps: first=117500000 total=2500000
16:28:30:WU00:FS01:0xa7:Completed 1 out of 2500000 steps (0%)
16:28:33:WU02:FS01:Upload complete
16:28:33:WU02:FS01:Server responded WORK_ACK (400)
16:28:33:WU02:FS01:Final credit estimate, 3666.00 points
16:28:33:WU02:FS01:Cleaning up
16:32:32:WU00:FS01:0xa7:Completed 25000 out of 2500000 steps (1%)
16:36:37:WU00:FS01:0xa7:Completed 50000 out of 2500000 steps (2%)

******************************* Date: 2018-01-23 *******************************

20:02:15:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
20:02:15:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
20:02:15:WU01:FS00:Connecting to 171.67.108.11:8080
20:02:36:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
20:02:36:WU01:FS00:Connecting to 171.67.108.11:80
20:02:57:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

23:19:53:WU00:FS01:0xa7:Completed 2475000 out of 2500000 steps (99%)
23:19:54:WU02:FS01:Connecting to 171.67.108.45:8080
23:19:55:WU02:FS01:Assigned to work server 155.247.166.219
23:19:55:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.219
23:19:55:WU02:FS01:Connecting to 155.247.166.219:8080
23:19:56:WU02:FS01:Downloading 1.08MiB
23:19:56:WU02:FS01:Download complete
23:19:56:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8635 run:1 clone:131 gen:152 core:0xa4 unit:0x000000a60002894b582b55aec1ce2c57
23:25:58:WU00:FS01:0xa7:Completed 2500000 out of 2500000 steps (100%)
23:25:59:WU00:FS01:0xa7:Saving result file ..\logfile_01.txt
23:25:59:WU00:FS01:0xa7:Saving result file frame47.trr
23:25:59:WU00:FS01:0xa7:Saving result file md.log
23:25:59:WU00:FS01:0xa7:Saving result file science.log
23:25:59:WU00:FS01:0xa7:Saving result file traj_comp.xtc
23:26:00:WU00:FS01:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
23:26:01:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
23:26:01:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:13743 run:20 clone:18 gen:47 core:0xa7 unit:0x000000300002894b59d5a2cae00f1174
23:26:01:WU00:FS01:Uploading 1.66MiB to 155.247.166.219
23:26:01:WU02:FS01:Starting
23:26:01:WU00:FS01:Connecting to 155.247.166.219:8080
23:26:01:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
23:26:01:WU02:FS01:Started FahCore on PID 10716
23:26:01:WU02:FS01:Core PID:7728
23:26:01:WU02:FS01:FahCore 0xa4 started
23:26:02:WU02:FS01:0xa4:
23:26:02:WU02:FS01:0xa4:*------------------------------*
23:26:02:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
23:26:02:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
23:26:02:WU02:FS01:0xa4:
23:26:02:WU02:FS01:0xa4:Preparing to commence simulation
23:26:02:WU02:FS01:0xa4:- Looking at optimizations...
23:26:02:WU02:FS01:0xa4:- Created dyn
23:26:02:WU02:FS01:0xa4:- Files status OK
23:26:02:WU02:FS01:0xa4:- Expanded 1128446 -> 2619636 (decompressed 232.1 percent)
23:26:02:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=1128446 data_size=2619636, decompressed_data_size=2619636 diff=0
23:26:02:WU02:FS01:0xa4:- Digital signature verified
23:26:02:WU02:FS01:0xa4:
23:26:02:WU02:FS01:0xa4:Project: 8635 (Run 1, Clone 131, Gen 152)
23:26:02:WU02:FS01:0xa4:
23:26:02:WU02:FS01:0xa4:Assembly optimizations on if available.
23:26:02:WU02:FS01:0xa4:Entering M.D.
23:26:03:WU00:FS01:Upload complete
23:26:03:WU00:FS01:Server responded WORK_ACK (400)
23:26:03:WU00:FS01:Final credit estimate, 731.00 points
23:26:03:WU00:FS01:Cleaning up
23:26:08:WU02:FS01:0xa4:Mapping NT from 4 to 4
23:26:08:WU02:FS01:0xa4:Completed 0 out of 1250000 steps  (0%)
23:50:46:WU02:FS01:0xa4:Completed 12500 out of 1250000 steps  (1%)
00:13:04:WU02:FS01:0xa4:Completed 25000 out of 1250000 steps  (2%)

******************************* Date: 2018-01-24 *******************************

02:02:15:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:02:15:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:02:15:WU01:FS00:Connecting to 171.67.108.11:8080
02:02:36:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:02:36:WU01:FS00:Connecting to 171.67.108.11:80
02:02:57:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

******************************* Date: 2018-01-25 *******************************

13:24:56:WU02:FS01:0xa4:Completed 1237500 out of 1250000 steps  (99%)
13:24:56:WU00:FS01:Connecting to 171.67.108.45:8080
13:24:56:WU00:FS01:Assigned to work server 155.247.166.219
13:24:56:WU00:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.219
13:24:57:WU00:FS01:Connecting to 155.247.166.219:8080
13:24:57:WU00:FS01:Downloading 177.35KiB
13:24:57:WU00:FS01:Download complete
13:24:57:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13740 run:113 clone:18 gen:33 core:0xa7 unit:0x000000210002894b59d62fd21bf8948e
13:47:23:WU02:FS01:0xa4:Completed 1250000 out of 1250000 steps  (100%)
13:47:25:WU02:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
13:47:35:WU02:FS01:0xa4:
13:47:35:WU02:FS01:0xa4:Finished Work Unit:
13:47:35:WU02:FS01:0xa4:- Reading up to 5122368 from "02/wudata_01.trr": Read 5122368
13:47:35:WU02:FS01:0xa4:trr file hash check passed.
13:47:35:WU02:FS01:0xa4:- Reading up to 186448 from "02/wudata_01.xtc": Read 186448
13:47:35:WU02:FS01:0xa4:xtc file hash check passed.
13:47:35:WU02:FS01:0xa4:edr file hash check passed.
13:47:35:WU02:FS01:0xa4:logfile size: 44813
13:47:35:WU02:FS01:0xa4:Leaving Run
13:47:38:WU02:FS01:0xa4:- Writing 5373329 bytes of core data to disk...
13:47:39:WU02:FS01:0xa4:Done: 5372817 -> 4419403 (compressed to 82.2 percent)
13:47:39:WU02:FS01:0xa4:  ... Done.
13:47:40:WU02:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
13:47:41:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:8635 run:1 clone:131 gen:152 core:0xa4 unit:0x000000a60002894b582b55aec1ce2c57
13:47:41:WU02:FS01:Uploading 4.22MiB to 155.247.166.219
13:47:41:WU00:FS01:Starting
13:47:41:WU02:FS01:Connecting to 155.247.166.219:8080
13:47:41:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
13:47:41:WU00:FS01:Started FahCore on PID 9484
13:47:41:WU00:FS01:Core PID:11160
13:47:41:WU00:FS01:FahCore 0xa7 started
13:47:41:WU00:FS01:0xa7:*********************** Log Started 2018-01-25T13:47:41Z ***********************
13:47:41:WU00:FS01:0xa7:************************** Gromacs Folding@home Core ***************************
13:47:41:WU00:FS01:0xa7:       Type: 0xa7
13:47:41:WU00:FS01:0xa7:       Core: Gromacs
13:47:41:WU00:FS01:0xa7:    Website: http://folding.stanford.edu/
13:47:41:WU00:FS01:0xa7:  Copyright: (c) 2009-2016 Stanford University
13:47:41:WU00:FS01:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:47:41:WU00:FS01:0xa7:       Args: -dir 00 -suffix 01 -version 704 -lifeline 9484 -checkpoint 15 -np 4
13:47:42:WU00:FS01:0xa7:     Config: <none>
13:47:42:WU00:FS01:0xa7:************************************ Build *************************************
13:47:42:WU00:FS01:0xa7:    Version: 0.0.16
13:47:42:WU00:FS01:0xa7:       Date: Oct 31 2017
13:47:42:WU00:FS01:0xa7:       Time: 14:04:33
13:47:42:WU00:FS01:0xa7: Repository: Git
13:47:42:WU00:FS01:0xa7:   Revision: 2f0a8a3d0b0698be48154fe99a0216f289060932
13:47:42:WU00:FS01:0xa7:     Branch: master
13:47:42:WU00:FS01:0xa7:   Compiler: Visual C++ 2008
13:47:42:WU00:FS01:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
13:47:42:WU00:FS01:0xa7:   Platform: win32 10
13:47:42:WU00:FS01:0xa7:       Bits: 64
13:47:42:WU00:FS01:0xa7:       Mode: Release
13:47:42:WU00:FS01:0xa7:       SIMD: sse2
13:47:42:WU00:FS01:0xa7:************************************ System ************************************
13:47:42:WU00:FS01:0xa7:        CPU: Unknown
13:47:42:WU00:FS01:0xa7:     CPU ID:
13:47:42:WU00:FS01:0xa7:       CPUs: 8
13:47:42:WU00:FS01:0xa7:     Memory: 7.99GiB
13:47:42:WU00:FS01:0xa7:Free Memory: 5.83GiB
13:47:42:WU00:FS01:0xa7:    Threads: WINDOWS_THREADS
13:47:42:WU00:FS01:0xa7: OS Version: 6.2
13:47:42:WU00:FS01:0xa7:Has Battery: false
13:47:42:WU00:FS01:0xa7: On Battery: false
13:47:42:WU00:FS01:0xa7: UTC Offset: -5
13:47:42:WU00:FS01:0xa7:        PID: 11160
13:47:42:WU00:FS01:0xa7:        CWD: C:\Users\Duncan\AppData\Roaming\FAHClient\work
13:47:42:WU00:FS01:0xa7:         OS: Windows 10 Pro
13:47:42:WU00:FS01:0xa7:    OS Arch: AMD64
13:47:42:WU00:FS01:0xa7:********************************************************************************
13:47:42:WU00:FS01:0xa7:Project: 13740 (Run 113, Clone 18, Gen 33)
13:47:42:WU00:FS01:0xa7:Unit: 0x000000210002894b59d62fd21bf8948e
13:47:42:WU00:FS01:0xa7:Reading tar file core.xml
13:47:42:WU00:FS01:0xa7:Reading tar file frame33.tpr
13:47:42:WU00:FS01:0xa7:Digital signatures verified
13:47:42:WU00:FS01:0xa7:Calling: mdrun -s frame33.tpr -o frame33.trr -cpt 15 -nt 4
13:47:42:WU00:FS01:0xa7:Steps: first=82500000 total=2500000
13:47:42:WU00:FS01:0xa7:Completed 1 out of 2500000 steps (0%)
13:47:45:WU02:FS01:Upload complete
13:47:46:WU02:FS01:Server responded WORK_ACK (400)
13:47:46:WU02:FS01:Final credit estimate, 2888.00 points
13:47:46:WU02:FS01:Cleaning up
13:51:42:WU00:FS01:0xa7:Completed 25000 out of 2500000 steps (1%)
13:55:58:WU00:FS01:0xa7:Completed 50000 out of 2500000 steps (2%)

20:36:43:WU00:FS01:0xa7:Completed 2475000 out of 2500000 steps (99%)
20:36:44:WU02:FS01:Connecting to 171.67.108.45:8080
20:36:45:WU02:FS01:Assigned to work server 155.247.166.220
20:36:45:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.220
20:36:45:WU02:FS01:Connecting to 155.247.166.220:8080
20:36:46:WU02:FS01:Downloading 344.40KiB
20:36:46:WU02:FS01:Download complete
20:36:47:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8616 run:1264 clone:0 gen:2 core:0xa4 unit:0x000000030002894c57b79616d56f633a
20:40:48:WU00:FS01:0xa7:Completed 2500000 out of 2500000 steps (100%)
20:40:49:WU00:FS01:0xa7:Saving result file ..\logfile_01.txt
20:40:49:WU00:FS01:0xa7:Saving result file frame33.trr
20:40:49:WU00:FS01:0xa7:Saving result file md.log
20:40:49:WU00:FS01:0xa7:Saving result file science.log
20:40:49:WU00:FS01:0xa7:Saving result file traj_comp.xtc
20:40:50:WU00:FS01:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
20:40:50:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
20:40:50:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:13740 run:113 clone:18 gen:33 core:0xa7 unit:0x000000210002894b59d62fd21bf8948e
20:40:50:WU00:FS01:Uploading 1.66MiB to 155.247.166.219
20:40:50:WU00:FS01:Connecting to 155.247.166.219:8080
20:40:50:WU02:FS01:Starting
20:40:50:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
20:40:51:WU02:FS01:Started FahCore on PID 2668
20:40:51:WU02:FS01:Core PID:1612
20:40:51:WU02:FS01:FahCore 0xa4 started
20:40:51:WU02:FS01:0xa4:
20:40:51:WU02:FS01:0xa4:*------------------------------*
20:40:51:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
20:40:51:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
20:40:51:WU02:FS01:0xa4:
20:40:51:WU02:FS01:0xa4:Preparing to commence simulation
20:40:51:WU02:FS01:0xa4:- Looking at optimizations...
20:40:51:WU02:FS01:0xa4:- Created dyn
20:40:51:WU02:FS01:0xa4:- Files status OK
20:40:51:WU02:FS01:0xa4:- Expanded 352151 -> 597116 (decompressed 169.5 percent)
20:40:51:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=352151 data_size=597116, decompressed_data_size=597116 diff=0
20:40:51:WU02:FS01:0xa4:- Digital signature verified
20:40:51:WU02:FS01:0xa4:
20:40:51:WU02:FS01:0xa4:Project: 8616 (Run 1264, Clone 0, Gen 2)
20:40:51:WU02:FS01:0xa4:
20:40:51:WU02:FS01:0xa4:Assembly optimizations on if available.
20:40:51:WU02:FS01:0xa4:Entering M.D.
20:40:52:WU00:FS01:Upload complete
20:40:52:WU00:FS01:Server responded WORK_ACK (400)
20:40:52:WU00:FS01:Final credit estimate, 739.00 points
20:40:53:WU00:FS01:Cleaning up
20:40:57:WU02:FS01:0xa4:Mapping NT from 4 to 4
20:40:57:WU02:FS01:0xa4:Completed 0 out of 2500000 steps  (0%)
20:51:34:WU02:FS01:0xa4:Completed 25000 out of 2500000 steps  (1%)

******************************* Date: 2018-01-26 *******************************
02:02:16:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:02:16:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:02:16:WU01:FS00:Connecting to 171.67.108.11:8080
02:02:37:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:02:37:WU01:FS00:Connecting to 171.67.108.11:80
02:02:58:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

14:14:42:WU02:FS01:0xa4:Completed 2475000 out of 2500000 steps  (99%)
14:14:43:WU00:FS01:Connecting to 171.67.108.45:8080
14:14:44:WU00:FS01:Assigned to work server 155.247.166.219
14:14:44:WU00:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.219
14:14:44:WU00:FS01:Connecting to 155.247.166.219:8080
14:14:44:WU00:FS01:Downloading 142.08KiB
14:14:44:WU00:FS01:Download complete
14:14:45:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13749 run:168 clone:17 gen:27 core:0xa7 unit:0x0000001d0002894b59d5492bbe04f025
14:29:01:WU02:FS01:0xa4:Completed 2500000 out of 2500000 steps  (100%)
14:29:01:WU02:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
14:29:11:WU02:FS01:0xa4:
14:29:11:WU02:FS01:0xa4:Finished Work Unit:
14:29:11:WU02:FS01:0xa4:- Reading up to 1953936 from "02/wudata_01.trr": Read 1953936
14:29:11:WU02:FS01:0xa4:trr file hash check passed.
14:29:11:WU02:FS01:0xa4:- Reading up to 2462032 from "02/wudata_01.xtc": Read 2462032
14:29:11:WU02:FS01:0xa4:xtc file hash check passed.
14:29:11:WU02:FS01:0xa4:edr file hash check passed.
14:29:11:WU02:FS01:0xa4:logfile size: 61350
14:29:11:WU02:FS01:0xa4:Leaving Run
14:29:13:WU02:FS01:0xa4:- Writing 4508498 bytes of core data to disk...
14:29:14:WU02:FS01:0xa4:Done: 4507986 -> 4191674 (compressed to 92.9 percent)
14:29:15:WU02:FS01:0xa4:  ... Done.
14:29:15:WU02:FS01:0xa4:- Shutting down core
14:29:15:WU02:FS01:0xa4:
14:29:15:WU02:FS01:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
14:29:15:WU02:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
14:29:16:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:8616 run:1264 clone:0 gen:2 core:0xa4 unit:0x000000030002894c57b79616d56f633a
14:29:16:WU02:FS01:Uploading 4.00MiB to 155.247.166.220
14:29:16:WU00:FS01:Starting
14:29:16:WU02:FS01:Connecting to 155.247.166.220:8080
14:29:16:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a7.fah/FahCore_a7.exe -dir 00 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
14:29:16:WU00:FS01:Started FahCore on PID 5640
14:29:16:WU00:FS01:Core PID:972
14:29:16:WU00:FS01:FahCore 0xa7 started
14:29:17:WU00:FS01:0xa7:*********************** Log Started 2018-01-26T14:29:16Z ***********************
14:29:17:WU00:FS01:0xa7:************************** Gromacs Folding@home Core ***************************
14:29:17:WU00:FS01:0xa7:       Type: 0xa7
14:29:17:WU00:FS01:0xa7:       Core: Gromacs
14:29:17:WU00:FS01:0xa7:    Website: http://folding.stanford.edu/
14:29:17:WU00:FS01:0xa7:  Copyright: (c) 2009-2016 Stanford University
14:29:17:WU00:FS01:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
14:29:17:WU00:FS01:0xa7:       Args: -dir 00 -suffix 01 -version 704 -lifeline 5640 -checkpoint 15 -np 4
14:29:17:WU00:FS01:0xa7:     Config: <none>
14:29:17:WU00:FS01:0xa7:************************************ Build *************************************
14:29:17:WU00:FS01:0xa7:    Version: 0.0.16
14:29:17:WU00:FS01:0xa7:       Date: Oct 31 2017
14:29:17:WU00:FS01:0xa7:       Time: 14:04:33
14:29:17:WU00:FS01:0xa7: Repository: Git
14:29:17:WU00:FS01:0xa7:   Revision: 2f0a8a3d0b0698be48154fe99a0216f289060932
14:29:17:WU00:FS01:0xa7:     Branch: master
14:29:17:WU00:FS01:0xa7:   Compiler: Visual C++ 2008
14:29:17:WU00:FS01:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
14:29:17:WU00:FS01:0xa7:   Platform: win32 10
14:29:17:WU00:FS01:0xa7:       Bits: 64
14:29:17:WU00:FS01:0xa7:       Mode: Release
14:29:17:WU00:FS01:0xa7:       SIMD: sse2
14:29:17:WU00:FS01:0xa7:************************************ System ************************************
14:29:17:WU00:FS01:0xa7:        CPU: Unknown
14:29:17:WU00:FS01:0xa7:     CPU ID:
14:29:17:WU00:FS01:0xa7:       CPUs: 8
14:29:17:WU00:FS01:0xa7:     Memory: 7.99GiB
14:29:17:WU00:FS01:0xa7:Free Memory: 4.30GiB
14:29:17:WU00:FS01:0xa7:    Threads: WINDOWS_THREADS
14:29:17:WU00:FS01:0xa7: OS Version: 6.2
14:29:17:WU00:FS01:0xa7:Has Battery: false
14:29:17:WU00:FS01:0xa7: On Battery: false
14:29:17:WU00:FS01:0xa7: UTC Offset: -5
14:29:17:WU00:FS01:0xa7:        PID: 972
14:29:17:WU00:FS01:0xa7:        CWD: C:\Users\Duncan\AppData\Roaming\FAHClient\work
14:29:17:WU00:FS01:0xa7:         OS: Windows 10 Pro
14:29:17:WU00:FS01:0xa7:    OS Arch: AMD64
14:29:17:WU00:FS01:0xa7:********************************************************************************
14:29:17:WU00:FS01:0xa7:Project: 13749 (Run 168, Clone 17, Gen 27)
14:29:17:WU00:FS01:0xa7:Unit: 0x0000001d0002894b59d5492bbe04f025
14:29:17:WU00:FS01:0xa7:Reading tar file core.xml
14:29:17:WU00:FS01:0xa7:Reading tar file frame27.tpr
14:29:17:WU00:FS01:0xa7:Digital signatures verified
14:29:17:WU00:FS01:0xa7:Calling: mdrun -s frame27.tpr -o frame27.trr -cpt 15 -nt 4
14:29:17:WU00:FS01:0xa7:Steps: first=67500000 total=2500000
14:29:18:WU00:FS01:0xa7:Completed 1 out of 2500000 steps (0%)
14:29:20:WU02:FS01:Upload complete
14:29:20:WU02:FS01:Server responded WORK_ACK (400)
14:29:20:WU02:FS01:Final credit estimate, 1593.00 points
14:29:20:WU02:FS01:Cleaning up
14:34:54:WU00:FS01:0xa7:Completed 25000 out of 2500000 steps (1%)
14:40:29:WU00:FS01:0xa7:Completed 50000 out of 2500000 steps (2%)

******************************* Date: 2018-01-26 *******************************

23:34:43:WU00:FS01:0xa7:Completed 2475000 out of 2500000 steps (99%)
23:34:44:WU02:FS01:Connecting to 171.67.108.45:8080
23:34:45:WU02:FS01:Assigned to work server 155.247.166.219
23:34:45:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.219
23:34:45:WU02:FS01:Connecting to 155.247.166.219:8080
23:34:46:WU02:FS01:Downloading 1.08MiB
23:34:46:WU02:FS01:Download complete
23:34:47:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8631 run:5 clone:20 gen:154 core:0xa4 unit:0x000000ad0002894b57f6ecd5c0c214ff
23:38:48:WU00:FS01:0xa7:Completed 2500000 out of 2500000 steps (100%)
23:38:49:WU00:FS01:0xa7:Saving result file ..\logfile_01.txt
23:38:49:WU00:FS01:0xa7:Saving result file frame27.trr
23:38:49:WU00:FS01:0xa7:Saving result file md.log
23:38:49:WU00:FS01:0xa7:Saving result file science.log
23:38:49:WU00:FS01:0xa7:Saving result file traj_comp.xtc
23:38:49:WU00:FS01:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
23:38:50:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
23:38:50:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:13749 run:168 clone:17 gen:27 core:0xa7 unit:0x0000001d0002894b59d5492bbe04f025
23:38:50:WU00:FS01:Uploading 1.66MiB to 155.247.166.219
23:38:50:WU02:FS01:Starting
23:38:50:WU00:FS01:Connecting to 155.247.166.219:8080
23:38:50:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
23:38:50:WU02:FS01:Started FahCore on PID 8912
23:38:50:WU02:FS01:Core PID:10204
23:38:50:WU02:FS01:FahCore 0xa4 started
23:38:51:WU02:FS01:0xa4:
23:38:51:WU02:FS01:0xa4:*------------------------------*
23:38:51:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
23:38:51:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
23:38:51:WU02:FS01:0xa4:
23:38:51:WU02:FS01:0xa4:Preparing to commence simulation
23:38:51:WU02:FS01:0xa4:- Looking at optimizations...
23:38:51:WU02:FS01:0xa4:- Created dyn
23:38:51:WU02:FS01:0xa4:- Files status OK
23:38:51:WU02:FS01:0xa4:- Expanded 1129657 -> 2621880 (decompressed 232.0 percent)
23:38:51:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=1129657 data_size=2621880, decompressed_data_size=2621880 diff=0
23:38:51:WU02:FS01:0xa4:- Digital signature verified
23:38:51:WU02:FS01:0xa4:
23:38:51:WU02:FS01:0xa4:Project: 8631 (Run 5, Clone 20, Gen 154)
23:38:51:WU02:FS01:0xa4:
23:38:51:WU02:FS01:0xa4:Assembly optimizations on if available.
23:38:51:WU02:FS01:0xa4:Entering M.D.
23:38:52:WU00:FS01:Upload complete
23:38:52:WU00:FS01:Server responded WORK_ACK (400)
23:38:52:WU00:FS01:Final credit estimate, 650.00 points
23:38:52:WU00:FS01:Cleaning up
23:38:57:WU02:FS01:0xa4:Mapping NT from 4 to 4
23:38:57:WU02:FS01:0xa4:Completed 0 out of 1250000 steps  (0%)
00:00:48:WU02:FS01:0xa4:Completed 12500 out of 1250000 steps  (1%)
00:22:46:WU02:FS01:0xa4:Completed 25000 out of 1250000 steps  (2%)

******************************* Date: 2018-01-27 *******************************
02:02:17:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:02:17:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:02:17:WU01:FS00:Connecting to 171.67.108.11:8080
02:02:38:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:02:38:WU01:FS00:Connecting to 171.67.108.11:80
02:02:59:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

******************************* Date: 2018-01-28 *******************************
02:02:17:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:02:17:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:02:17:WU01:FS00:Connecting to 171.67.108.11:8080
02:02:38:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:02:38:WU01:FS00:Connecting to 171.67.108.11:80
02:03:00:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

12:30:27:WU02:FS01:0xa4:Completed 1237500 out of 1250000 steps  (99%)
12:30:28:WU00:FS01:Connecting to 171.67.108.45:8080
12:30:29:WU00:FS01:Assigned to work server 171.67.108.158
12:30:29:WU00:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 171.67.108.158
12:30:29:WU00:FS01:Connecting to 171.67.108.158:8080
12:30:31:WU00:FS01:Downloading 806.39KiB
12:30:31:WU00:FS01:Download complete
12:30:31:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9033 run:556 clone:2 gen:1044 core:0xa4 unit:0x00000477ab436c9e5698313dd620dd6d
12:53:07:WU02:FS01:0xa4:Completed 1250000 out of 1250000 steps  (100%)
12:53:08:WU02:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
12:53:18:WU02:FS01:0xa4:
12:53:18:WU02:FS01:0xa4:Finished Work Unit:
12:53:18:WU02:FS01:0xa4:- Reading up to 5134080 from "02/wudata_01.trr": Read 5134080
12:53:18:WU02:FS01:0xa4:trr file hash check passed.
12:53:18:WU02:FS01:0xa4:- Reading up to 188604 from "02/wudata_01.xtc": Read 188604
12:53:18:WU02:FS01:0xa4:xtc file hash check passed.
12:53:18:WU02:FS01:0xa4:edr file hash check passed.
12:53:18:WU02:FS01:0xa4:logfile size: 45280
12:53:18:WU02:FS01:0xa4:Leaving Run
12:53:20:WU02:FS01:0xa4:- Writing 5387664 bytes of core data to disk...
12:53:22:WU02:FS01:0xa4:Done: 5387152 -> 4430892 (compressed to 82.2 percent)
12:53:22:WU02:FS01:0xa4:  ... Done.
12:53:23:WU02:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
12:53:23:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:8631 run:5 clone:20 gen:154 core:0xa4 unit:0x000000ad0002894b57f6ecd5c0c214ff
12:53:23:WU02:FS01:Uploading 4.23MiB to 155.247.166.219
12:53:23:WU00:FS01:Starting
12:53:23:WU02:FS01:Connecting to 155.247.166.219:8080
12:53:23:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
12:53:23:WU00:FS01:Started FahCore on PID 9272
12:53:23:WU00:FS01:Core PID:3768
12:53:23:WU00:FS01:FahCore 0xa4 started
12:53:24:WU00:FS01:0xa4:
12:53:24:WU00:FS01:0xa4:*------------------------------*
12:53:24:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
12:53:24:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
12:53:24:WU00:FS01:0xa4:
12:53:24:WU00:FS01:0xa4:Preparing to commence simulation
12:53:24:WU00:FS01:0xa4:- Looking at optimizations...
12:53:24:WU00:FS01:0xa4:- Created dyn
12:53:24:WU00:FS01:0xa4:- Files status OK
12:53:24:WU00:FS01:0xa4:- Expanded 825228 -> 1401332 (decompressed 169.8 percent)
12:53:24:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=825228 data_size=1401332, decompressed_data_size=1401332 diff=0
12:53:24:WU00:FS01:0xa4:- Digital signature verified
12:53:24:WU00:FS01:0xa4:
12:53:24:WU00:FS01:0xa4:Project: 9033 (Run 556, Clone 2, Gen 1044)
12:53:24:WU00:FS01:0xa4:
12:53:24:WU00:FS01:0xa4:Assembly optimizations on if available.
12:53:24:WU00:FS01:0xa4:Entering M.D.
12:53:28:WU02:FS01:Upload complete
12:53:28:WU02:FS01:Server responded WORK_ACK (400)
12:53:28:WU02:FS01:Final credit estimate, 2945.00 points
12:53:28:WU02:FS01:Cleaning up
12:53:30:WU00:FS01:0xa4:Mapping NT from 4 to 4
12:53:30:WU00:FS01:0xa4:Completed 0 out of 250000 steps  (0%)
12:59:31:WU00:FS01:0xa4:Completed 2500 out of 250000 steps  (1%)
13:05:33:WU00:FS01:0xa4:Completed 5000 out of 250000 steps  (2%)

******************************* Date: 2018-01-28 *******************************

14:28:33:26:127.0.0.1:New Web connection

******************************* Date: 2018-01-28 *******************************
20:02:18:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
20:02:18:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
20:02:18:WU01:FS00:Connecting to 171.67.108.11:8080
20:02:39:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
20:02:39:WU01:FS00:Connecting to 171.67.108.11:80
20:03:00:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

23:29:47:WU00:FS01:0xa4:Completed 247500 out of 250000 steps  (99%)
23:29:48:WU02:FS01:Connecting to 171.67.108.45:8080
23:29:49:WU02:FS01:Assigned to work server 155.247.166.219
23:29:49:WU02:FS01:Requesting new work unit for slot 01: RUNNING cpu:4 from 155.247.166.219
23:29:49:WU02:FS01:Connecting to 155.247.166.219:8080
23:29:49:WU02:FS01:Downloading 867.45KiB
23:29:50:WU02:FS01:Download complete
23:29:50:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:8633 run:5 clone:48 gen:244 core:0xa4 unit:0x0000010c0002894b57f6f35e36d7262a
23:35:54:WU00:FS01:0xa4:Completed 250000 out of 250000 steps  (100%)
23:35:56:WU00:FS01:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
23:36:06:WU00:FS01:0xa4:
23:36:06:WU00:FS01:0xa4:Finished Work Unit:
23:36:06:WU00:FS01:0xa4:- Reading up to 811392 from "00/wudata_01.trr": Read 811392
23:36:06:WU00:FS01:0xa4:trr file hash check passed.
23:36:06:WU00:FS01:0xa4:- Reading up to 745728 from "00/wudata_01.xtc": Read 745728
23:36:06:WU00:FS01:0xa4:xtc file hash check passed.
23:36:06:WU00:FS01:0xa4:edr file hash check passed.
23:36:06:WU00:FS01:0xa4:logfile size: 25142
23:36:06:WU00:FS01:0xa4:Leaving Run
23:36:08:WU00:FS01:0xa4:- Writing 1584750 bytes of core data to disk...
23:36:08:WU00:FS01:0xa4:Done: 1584238 -> 1537577 (compressed to 97.0 percent)
23:36:08:WU00:FS01:0xa4:  ... Done.
23:36:09:WU00:FS01:0xa4:- Shutting down core
23:36:09:WU00:FS01:0xa4:
23:36:09:WU00:FS01:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
23:36:09:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
23:36:09:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:9033 run:556 clone:2 gen:1044 core:0xa4 unit:0x00000477ab436c9e5698313dd620dd6d
23:36:09:WU00:FS01:Uploading 1.47MiB to 171.67.108.158
23:36:09:WU02:FS01:Starting
23:36:09:WU00:FS01:Connecting to 171.67.108.158:8080
23:36:09:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Duncan/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 704 -lifeline 9192 -checkpoint 15 -np 4
23:36:09:WU02:FS01:Started FahCore on PID 11504
23:36:10:WU02:FS01:Core PID:1312
23:36:10:WU02:FS01:FahCore 0xa4 started
23:36:10:WU02:FS01:0xa4:
23:36:10:WU02:FS01:0xa4:*------------------------------*
23:36:10:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
23:36:10:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
23:36:10:WU02:FS01:0xa4:
23:36:10:WU02:FS01:0xa4:Preparing to commence simulation
23:36:10:WU02:FS01:0xa4:- Looking at optimizations...
23:36:10:WU02:FS01:0xa4:- Created dyn
23:36:10:WU02:FS01:0xa4:- Files status OK
23:36:10:WU02:FS01:0xa4:- Expanded 887760 -> 2072336 (decompressed 233.4 percent)
23:36:10:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=887760 data_size=2072336, decompressed_data_size=2072336 diff=0
23:36:10:WU02:FS01:0xa4:- Digital signature verified
23:36:10:WU02:FS01:0xa4:
23:36:10:WU02:FS01:0xa4:Project: 8633 (Run 5, Clone 48, Gen 244)
23:36:10:WU02:FS01:0xa4:
23:36:10:WU02:FS01:0xa4:Assembly optimizations on if available.
23:36:10:WU02:FS01:0xa4:Entering M.D.
23:36:12:WU00:FS01:Upload complete
23:36:12:WU00:FS01:Server responded WORK_ACK (400)
23:36:12:WU00:FS01:Final credit estimate, 760.00 points
23:36:12:WU00:FS01:Cleaning up
23:36:16:WU02:FS01:0xa4:Mapping NT from 4 to 4
23:36:16:WU02:FS01:0xa4:Completed 0 out of 1250000 steps  (0%)
23:53:13:WU02:FS01:0xa4:Completed 12500 out of 1250000 steps  (1%)
00:10:20:WU02:FS01:0xa4:Completed 25000 out of 1250000 steps  (2%)

******************************* Date: 2018-01-29 *******************************
02:02:18:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
02:02:18:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
02:02:18:WU01:FS00:Connecting to 171.67.108.11:8080
02:02:39:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
02:02:39:WU01:FS00:Connecting to 171.67.108.11:80
02:03:00:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

******************************* Date: 2018-01-29 *******************************
20:02:18:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:5771 run:13 clone:150 gen:3784 core:0x11 unit:0x7293eb0553c5269f0ec80096000d168b
20:02:18:WU01:FS00:Uploading 94.63KiB to 171.67.108.11
20:02:18:WU01:FS00:Connecting to 171.67.108.11:8080
20:02:39:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
20:02:39:WU01:FS00:Connecting to 171.67.108.11:80
20:03:00:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 171.67.108.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.


Mod edit: added code tags to log file listing
djgibbons
 
Posts: 19
Joined: Tue Sep 24, 2013 11:02 pm

Re: failure to connect to the collection server

Postby Joe_H » Tue Jan 30, 2018 2:44 am

It looks like you updated an older machine and changed the video card to a Quadro K4200, but it still had an old WU from years in the past. The WU trying to be uploaded is project:5771 run:13 clone:150 gen:3784 and was run on Core_11. That would have been assigned over 3 1/2 years ago, all Core_11 projects are now done. So both the WS and CS for that project are no longer in service. The folding client should have discarded the WU as it is long past its expiration, but that might be due to it being not correctly detecting that due to the 3+ years of being out of date.

From the log it looks like you haven't enabled folding o the GPU, or it is sending the wrong configuration information to the Assignment Server and not getting a WU. Probably the best way to fix this is to set the client to Finish, and uninstall it with its data after the current work is completed. Then reinstall the client.

One additional note, I did not see a passkey configured. You may want to get one and set that as part of your reinstalling of the client.
Joe_H
Site Admin
 
Posts: 4210
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: failure to connect to the collection server

Postby bruce » Tue Jan 30, 2018 2:56 am

In addition to Joe_H's suggestions, there's one more thing I'd suggest:

> 18:00:25: <slot id='1' type='CPU'>
> 18:00:25: <idle v='true'/>
> 18:00:25: </slot>

I have not found a good reason to use the "idle" option for a CPU slot. Operating systems do an excellent job of managing CPU priority. There's no perceptible lag introduced when foreground tasks supersede FAH's CPU processing whereas when the idle state comes an goes over the course of normal foreground usage there is a measurable loss in throughput.

I wish I could make a similar claim for a GPU slot but GPUs cannot be interrupted quickly and cleanly so the idle setting is a useful setting for slower GPUs -- depending on whether you notice screen-lag or not.
bruce
 
Posts: 21679
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: failure to connect to the collection server

Postby djgibbons » Tue Jan 30, 2018 10:41 pm

Thank you, Gentlemen. I will try these things to see if it clears the issue.
djgibbons
 
Posts: 19
Joined: Tue Sep 24, 2013 11:02 pm

Re: failure to connect to the collection server

Postby djgibbons » Fri Feb 02, 2018 1:01 am

The uninstall/re-install seems to have worked. Regarding the 'waiting for idle' status on the GPU job, does the screen saver affect idle status? Or does this simply mean that graceful GPU interrupts are being used?
djgibbons
 
Posts: 19
Joined: Tue Sep 24, 2013 11:02 pm

Re: failure to connect to the collection server

Postby bruce » Fri Feb 02, 2018 2:38 am

Your OS sets a IDLE status indicator whenever it detects the right conditions. Both screen-savers and FAH query the status of that indicator.
bruce
 
Posts: 21679
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.


Return to Issues with a specific server

Who is online

Users browsing this forum: No registered users and 5 guests

cron