Client Randomly Getting Stuck On "Connecting to"

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

Post Reply
MrFrizzy
Posts: 123
Joined: Fri Feb 14, 2020 4:48 am

Client Randomly Getting Stuck On "Connecting to"

Post by MrFrizzy »

I've noticed an odd behavior that I need some insight on.

I have a Windows 8.1 machine running a GTX 980 Ti and randomly the client will get stuck on "Connecting to" some server and never times out, gets an assignment, or gets a WU. I have seen it happen with an assignment server and a few work servers. Pausing the client or removing and then readding the GPU slot does not fix the issue, I have to close the client (sometimes it continues to run and has to be ended through the task manager) and then reopen for it to start again. I had a separate issue happen with the machine and ended up blowing out the RAID setup and reinstalling Windows, but the issue has come up twice since then.

Below are the logs. Any thoughts? It seems to potentially be network based, but why would it work most of the time and then just randomly get stuck without timing out? I tried turning on the various debugging options, but then the client would never get a WU from the servers so I turned them off.

April 4th:

Code: Select all

17:17:08:WU00:FS01:0x22:Completed 495000 out of 500000 steps (99%)
17:17:08:WU01:FS01:Connecting to 65.254.110.245:8080
17:17:08:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:17:08:WU01:FS01:Connecting to 18.218.241.186:80
17:17:08:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': 10002: Received short response, expected 272 bytes, got 0
17:17:08:ERROR:WU01:FS01:Exception: Could not get an assignment
17:17:08:WU01:FS01:Connecting to 65.254.110.245:8080
17:17:08:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:17:08:WU01:FS01:Connecting to 18.218.241.186:80
17:17:08:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': 10002: Received short response, expected 272 bytes, got 0
17:17:08:ERROR:WU01:FS01:Exception: Could not get an assignment
17:18:09:WU01:FS01:Connecting to 65.254.110.245:8080
17:18:09:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
17:18:09:WU01:FS01:Connecting to 18.218.241.186:80
17:18:10:WARNING:WU01:FS01:Failed to get assignment from '18.218.241.186:80': 10002: Received short response, expected 272 bytes, got 0
17:18:10:ERROR:WU01:FS01:Exception: Could not get an assignment
17:19:06:WU00:FS01:0x22:Completed 500000 out of 500000 steps (100%)
17:19:33:WU00:FS01:0x22:Saving result file ..\logfile_01.txt
17:19:33:WU00:FS01:0x22:Saving result file checkpointState.xml
17:19:34:WU00:FS01:0x22:Saving result file checkpt.crc
17:19:34:WU00:FS01:0x22:Saving result file positions.xtc
17:19:34:WU00:FS01:0x22:Saving result file science.log
17:19:34:WU00:FS01:0x22:Folding@home Core Shutdown: FINISHED_UNIT
17:19:35:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
17:19:35:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:14414 run:0 clone:225 gen:1 core:0x22 unit:0x000000010d5262775e821476c761e756
17:19:35:WU00:FS01:Uploading 109.84MiB to 13.82.98.119
17:19:35:WU00:FS01:Connecting to 13.82.98.119:8080
17:19:41:WU00:FS01:Upload 7.45%
17:19:46:WU01:FS01:Connecting to 65.254.110.245:8080
17:19:46:WU01:FS01:Assigned to work server 140.163.4.231
17:19:46:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 140.163.4.231
17:19:46:WU01:FS01:Connecting to 140.163.4.231:8080
17:19:47:WU00:FS01:Upload 16.44%
17:19:53:WU00:FS01:Upload 25.15%
17:19:59:WU00:FS01:Upload 33.74%
17:20:05:WU00:FS01:Upload 42.33%
17:20:07:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
17:20:07:WU01:FS01:Connecting to 140.163.4.231:80
17:20:11:WU00:FS01:Upload 50.58%
17:20:17:WU00:FS01:Upload 59.63%
17:20:23:WU00:FS01:Upload 68.62%
17:20:29:WU00:FS01:Upload 77.55%
17:20:35:WU00:FS01:Upload 85.86%
17:20:41:WU00:FS01:Upload 92.58%
17:20:47:WU00:FS01:Upload complete
17:20:47:WU00:FS01:Server responded WORK_ACK (400)
17:20:47:WU00:FS01:Final credit estimate, 132550.00 points
17:20:47:WU00:FS01:Cleaning up
April 7th:

Code: Select all

05:38:30:WU00:FS01:Connecting to 65.254.110.245:8080
05:38:30:WU00:FS01:Assigned to work server 128.252.203.10
05:38:30:WU00:FS01:Requesting new work unit for slot 01: RUNNING gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 128.252.203.10
05:38:30:WU00:FS01:Connecting to 128.252.203.10:8080
05:38:51:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
05:38:51:WU00:FS01:Connecting to 128.252.203.10:80
05:39:21:WU01:FS01:0x22:Completed 8000000 out of 8000000 steps (100%)
05:39:23:WU01:FS01:0x22:Saving result file ..\logfile_01.txt
05:39:23:WU01:FS01:0x22:Saving result file checkpointState.xml
05:39:23:WU01:FS01:0x22:Saving result file checkpt.crc
05:39:23:WU01:FS01:0x22:Saving result file positions.xtc
05:39:23:WU01:FS01:0x22:Saving result file science.log
05:39:23:WU01:FS01:0x22:Folding@home Core Shutdown: FINISHED_UNIT
05:39:24:WU01:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
05:39:24:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:14549 run:0 clone:182 gen:7 core:0x22 unit:0x000000090d5262775e863e39a6b52a0c
05:39:24:WU01:FS01:Uploading 22.07MiB to 13.82.98.119
05:39:24:WU01:FS01:Connecting to 13.82.98.119:8080
05:39:30:WU01:FS01:Upload 36.53%
05:39:36:WU01:FS01:Upload 73.63%
05:39:40:WU01:FS01:Upload complete
05:39:40:WU01:FS01:Server responded WORK_ACK (400)
05:39:40:WU01:FS01:Final credit estimate, 102915.00 points
05:39:40:WU01:FS01:Cleaning up
******************************* Date: 2020-04-07 *******************************
18:57:05:FS01:Paused
April 10th:

Code: Select all

01:05:29:WU00:FS01:0x22:Completed 1980000 out of 2000000 steps (99%)
01:05:30:WU01:FS01:Connecting to 65.254.110.245:8080
01:05:30:WARNING:WU01:FS01:Failed to get assignment from '65.254.110.245:8080': No WUs available for this configuration
01:05:30:WU01:FS01:Connecting to 18.218.241.186:80
01:08:02:WU00:FS01:0x22:Completed 2000000 out of 2000000 steps (100%)
01:08:12:WU00:FS01:0x22:Saving result file ..\logfile_01.txt
01:08:12:WU00:FS01:0x22:Saving result file checkpointState.xml
01:08:12:WU00:FS01:0x22:Saving result file checkpt.crc
01:08:12:WU00:FS01:0x22:Saving result file positions.xtc
01:08:12:WU00:FS01:0x22:Saving result file science.log
01:08:13:WU00:FS01:0x22:Folding@home Core Shutdown: FINISHED_UNIT
01:08:13:WU00:FS01:FahCore returned: FINISHED_UNIT (100 = 0x64)
01:08:13:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:14430 run:0 clone:36 gen:4 core:0x22 unit:0x000000040d5262775e8b4d5daf9877e0
01:08:13:WU00:FS01:Uploading 43.99MiB to 13.82.98.119
01:08:13:WU00:FS01:Connecting to 13.82.98.119:8080
01:08:19:WU00:FS01:Upload 14.92%
01:08:25:WU00:FS01:Upload 24.44%
01:08:31:WU00:FS01:Upload 39.22%
01:08:37:WU00:FS01:Upload 53.71%
01:08:43:WU00:FS01:Upload 63.66%
01:08:49:WU00:FS01:Upload 75.59%
01:08:55:WU00:FS01:Upload 89.09%
01:08:59:WU00:FS01:Upload complete
01:08:59:WU00:FS01:Server responded WORK_ACK (400)
01:08:59:WU00:FS01:Final credit estimate, 178331.00 points
01:08:59:WU00:FS01:Cleaning up
April 12th:

Code: Select all

04:55:54:WU00:FS01:Connecting to 65.254.110.245:8080
04:55:54:WU00:FS01:Assigned to work server 128.252.203.10
04:55:54:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 128.252.203.10
04:55:54:WU00:FS01:Connecting to 128.252.203.10:8080
04:56:15:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
04:56:15:WU00:FS01:Connecting to 128.252.203.10:80
******************************* Date: 2020-04-12 *******************************
19:36:25:Saving configuration to config.xml
April 16th:

Code: Select all

2020-04-15:01:15:04:WU02:FS01:Connecting to 65.254.110.245:8080
2020-04-15:01:15:04:WU02:FS01:Assigned to work server 52.224.109.74
2020-04-15:01:15:04:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 52.224.109.74
2020-04-15:01:15:04:WU02:FS01:Connecting to 52.224.109.74:8080
2020-04-15:01:15:25:WARNING:WU02:FS01:WorkServer connection failed on port 8080 trying 80
2020-04-15:01:15:25:WU02:FS01:Connecting to 52.224.109.74:80
2020-04-15:01:17:44:WU01:FS00:0xa7:Completed 1950000 out of 2500000 steps (78%)
2020-04-15:01:20:42:WU01:FS00:0xa7:Completed 1975000 out of 2500000 steps (79%)
2020-04-15:01:23:43:WU01:FS00:0xa7:Completed 2000000 out of 2500000 steps (80%)
2020-04-15:01:26:34:WU01:FS00:0xa7:Completed 2025000 out of 2500000 steps (81%)
2020-04-15:01:29:16:WU01:FS00:0xa7:Completed 2050000 out of 2500000 steps (82%)
2020-04-15:01:32:05:WU01:FS00:0xa7:Completed 2075000 out of 2500000 steps (83%)
2020-04-15:01:34:57:WU01:FS00:0xa7:Completed 2100000 out of 2500000 steps (84%)
2020-04-15:01:37:45:WU01:FS00:0xa7:Completed 2125000 out of 2500000 steps (85%)
2020-04-15:01:38:51:FS01:Finishing
2020-04-15:01:40:39:WU01:FS00:0xa7:Completed 2150000 out of 2500000 steps (86%)
2020-04-15:01:43:18:Removing old file 'configs/config-20200414-191119.xml'
2020-04-15:01:43:18:Saving configuration to config.xml
2020-04-15:01:43:18:<config>
2020-04-15:01:43:18:  <!-- Folding Slot Configuration -->
2020-04-15:01:43:18:  <client-type v='beta'/>
2020-04-15:01:43:18:
2020-04-15:01:43:18:  <!-- Logging -->
2020-04-15:01:43:18:  <log-date v='true'/>
2020-04-15:01:43:18:  <log-date-periodically v=''/>
2020-04-15:01:43:18:  <verbosity v='5'/>
2020-04-15:01:43:18:
2020-04-15:01:43:18:  <!-- Network -->
2020-04-15:01:43:18:  <proxy v=':8080'/>
2020-04-15:01:43:18:
2020-04-15:01:43:18:  <!-- Slot Control -->
2020-04-15:01:43:18:  <pause-on-start v='true'/>
2020-04-15:01:43:18:  <power v='full'/>
2020-04-15:01:43:18:
2020-04-15:01:43:18:  <!-- User Information -->
2020-04-15:01:43:18:  <passkey v='********************************'/>
2020-04-15:01:43:18:  <team v='223518'/>
2020-04-15:01:43:18:  <user v='MrFrizzy'/>
2020-04-15:01:43:18:
2020-04-15:01:43:18:  <!-- Folding Slots -->
2020-04-15:01:43:18:  <slot id='0' type='CPU'>
2020-04-15:01:43:18:    <cpus v='6'/>
2020-04-15:01:43:18:  </slot>
2020-04-15:01:43:18:  <slot id='1' type='GPU'>
2020-04-15:01:43:18:    <core-priority v='low'/>
2020-04-15:01:43:18:  </slot>
2020-04-15:01:43:18:</config>
2020-04-15:01:43:18:FS00:Shutting core down
2020-04-15:01:43:18:WU01:FS00:0xa7:WARNING:Console control signal 1 on PID 5972
2020-04-15:01:43:18:WU01:FS00:0xa7:Exiting, please wait. . .
2020-04-15:01:43:19:WU01:FS00:0xa7:Folding@home Core Shutdown: INTERRUPTED
2020-04-15:01:43:19:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
2020-04-15:01:43:19:WU01:FS00:Starting
2020-04-15:01:43:19:WARNING:WU01:FS00:Changed SMP threads from 5 to 6 this can cause some work units to fail
2020-04-15:01:43:19:WARNING:WU01:FS00:AS lowered CPUs from 6 to 5
2020-04-15:01:43:19:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\VuVault\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/avx/Core_a7.fah/FahCore_a7.exe -dir 01 -suffix 01 -version 705 -lifeline 2904 -checkpoint 15 -np 5
2020-04-15:01:43:19:WU01:FS00:Started FahCore on PID 5644
2020-04-15:01:43:19:Started thread 15 on PID 2904
2020-04-15:01:43:19:WU01:FS00:Core PID:2784
2020-04-15:01:43:19:WU01:FS00:FahCore 0xa7 started
2020-04-15:01:43:20:WU01:FS00:0xa7:*********************** Log Started 2020-04-15T01:43:19Z ***********************
2020-04-15:01:43:20:WU01:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
2020-04-15:01:43:20:WU01:FS00:0xa7:       Type: 0xa7
2020-04-15:01:43:20:WU01:FS00:0xa7:       Core: Gromacs
2020-04-15:01:43:20:WU01:FS00:0xa7:       Args: -dir 01 -suffix 01 -version 705 -lifeline 5644 -checkpoint 15 -np 5
2020-04-15:01:43:20:WU01:FS00:0xa7:************************************ CBang *************************************
2020-04-15:01:43:20:WU01:FS00:0xa7:       Date: Oct 26 2019
2020-04-15:01:43:20:WU01:FS00:0xa7:       Time: 01:38:25
2020-04-15:01:43:20:WU01:FS00:0xa7:   Revision: c46a1a011a24143739ac7218c5a435f66777f62f
2020-04-15:01:43:20:WU01:FS00:0xa7:     Branch: master
2020-04-15:01:43:20:WU01:FS00:0xa7:   Compiler: Visual C++ 2008
2020-04-15:01:43:20:WU01:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
2020-04-15:01:43:20:WU01:FS00:0xa7:   Platform: win32 10
2020-04-15:01:43:20:WU01:FS00:0xa7:       Bits: 64
2020-04-15:01:43:20:WU01:FS00:0xa7:       Mode: Release
2020-04-15:01:43:20:WU01:FS00:0xa7:************************************ System ************************************
2020-04-15:01:43:20:WU01:FS00:0xa7:        CPU: Intel(R) Xeon(R) CPU E5-2609 v3 @ 1.90GHz
2020-04-15:01:43:20:WU01:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 63 Stepping 2
2020-04-15:01:43:20:WU01:FS00:0xa7:       CPUs: 6
2020-04-15:01:43:20:WU01:FS00:0xa7:     Memory: 7.93GiB
2020-04-15:01:43:20:WU01:FS00:0xa7:Free Memory: 5.88GiB
2020-04-15:01:43:20:WU01:FS00:0xa7:    Threads: WINDOWS_THREADS
2020-04-15:01:43:20:WU01:FS00:0xa7: OS Version: 6.2
2020-04-15:01:43:20:WU01:FS00:0xa7:Has Battery: false
2020-04-15:01:43:20:WU01:FS00:0xa7: On Battery: false
2020-04-15:01:43:20:WU01:FS00:0xa7: UTC Offset: -5
2020-04-15:01:43:20:WU01:FS00:0xa7:        PID: 2784
2020-04-15:01:43:20:WU01:FS00:0xa7:        CWD: C:\Users\VuVault\AppData\Roaming\FAHClient\work
2020-04-15:01:43:20:WU01:FS00:0xa7:******************************** Build - libFAH ********************************
2020-04-15:01:43:20:WU01:FS00:0xa7:    Version: 0.0.18
2020-04-15:01:43:20:WU01:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
2020-04-15:01:43:20:WU01:FS00:0xa7:  Copyright: 2019 foldingathome.org
2020-04-15:01:43:20:WU01:FS00:0xa7:   Homepage: https://foldingathome.org/
2020-04-15:01:43:20:WU01:FS00:0xa7:       Date: Oct 26 2019
2020-04-15:01:43:20:WU01:FS00:0xa7:       Time: 01:52:30
2020-04-15:01:43:20:WU01:FS00:0xa7:   Revision: c1e3513b1bc0c16013668f2173ee969e5995b38e
2020-04-15:01:43:20:WU01:FS00:0xa7:     Branch: master
2020-04-15:01:43:20:WU01:FS00:0xa7:   Compiler: Visual C++ 2008
2020-04-15:01:43:20:WU01:FS00:0xa7:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
2020-04-15:01:43:20:WU01:FS00:0xa7:   Platform: win32 10
2020-04-15:01:43:20:WU01:FS00:0xa7:       Bits: 64
2020-04-15:01:43:20:WU01:FS00:0xa7:       Mode: Release
2020-04-15:01:43:20:WU01:FS00:0xa7:************************************ Build *************************************
2020-04-15:01:43:20:WU01:FS00:0xa7:       SIMD: avx_256
2020-04-15:01:43:20:WU01:FS00:0xa7:********************************************************************************
2020-04-15:01:43:20:WU01:FS00:0xa7:Project: 14380 (Run 52, Clone 0, Gen 4)
2020-04-15:01:43:20:WU01:FS00:0xa7:Unit: 0x00000004455e42075e929e325c292609
2020-04-15:01:43:20:WU01:FS00:0xa7:Digital signatures verified
2020-04-15:01:43:20:WU01:FS00:0xa7:Reducing thread count from 5 to 4 to avoid domain decomposition by a prime number > 3
2020-04-15:01:43:20:WU01:FS00:0xa7:Calling: mdrun -s frame4.tpr -o frame4.trr -cpi state.cpt -cpt 15 -nt 4
2020-04-15:01:43:20:WU01:FS00:0xa7:Steps: first=0 total=2500000
2020-04-15:01:43:20:WU01:FS00:0xa7:Completed 2172310 out of 2500000 steps (86%)
2020-04-15:01:43:40:WU01:FS00:0xa7:Completed 2175000 out of 2500000 steps (87%)
2020-04-15:01:43:42:Removing old file 'configs/config-20200414-191126.xml'
2020-04-15:01:43:42:Saving configuration to config.xml
2020-04-15:01:43:42:<config>
2020-04-15:01:43:42:  <!-- Folding Slot Configuration -->
2020-04-15:01:43:42:  <client-type v='beta'/>
2020-04-15:01:43:42:
2020-04-15:01:43:42:  <!-- Logging -->
2020-04-15:01:43:42:  <log-date v='true'/>
2020-04-15:01:43:42:  <log-date-periodically v=''/>
2020-04-15:01:43:42:  <verbosity v='5'/>
2020-04-15:01:43:42:
2020-04-15:01:43:42:  <!-- Network -->
2020-04-15:01:43:42:  <proxy v=':8080'/>
2020-04-15:01:43:42:
2020-04-15:01:43:42:  <!-- Slot Control -->
2020-04-15:01:43:42:  <pause-on-start v='true'/>
2020-04-15:01:43:42:  <power v='full'/>
2020-04-15:01:43:42:
2020-04-15:01:43:42:  <!-- User Information -->
2020-04-15:01:43:42:  <passkey v='********************************'/>
2020-04-15:01:43:42:  <team v='223518'/>
2020-04-15:01:43:42:  <user v='MrFrizzy'/>
2020-04-15:01:43:42:
2020-04-15:01:43:42:  <!-- Folding Slots -->
2020-04-15:01:43:42:  <slot id='0' type='CPU'>
2020-04-15:01:43:42:    <cpus v='6'/>
2020-04-15:01:43:42:  </slot>
2020-04-15:01:43:42:  <slot id='1' type='GPU'>
2020-04-15:01:43:42:    <core-priority v='low'/>
2020-04-15:01:43:42:  </slot>
2020-04-15:01:43:42:</config>
2020-04-15:01:46:30:WU01:FS00:0xa7:Completed 2200000 out of 2500000 steps (88%)
2020-04-15:01:49:15:WU01:FS00:0xa7:Completed 2225000 out of 2500000 steps (89%)
2020-04-15:01:52:10:WU01:FS00:0xa7:Completed 2250000 out of 2500000 steps (90%)
2020-04-15:01:54:58:WU01:FS00:0xa7:Completed 2275000 out of 2500000 steps (91%)
2020-04-15:01:57:43:WU01:FS00:0xa7:Completed 2300000 out of 2500000 steps (92%)
2020-04-15:02:00:28:WU01:FS00:0xa7:Completed 2325000 out of 2500000 steps (93%)
2020-04-15:02:03:25:WU01:FS00:0xa7:Completed 2350000 out of 2500000 steps (94%)
2020-04-15:02:06:19:WU01:FS00:0xa7:Completed 2375000 out of 2500000 steps (95%)
2020-04-15:02:09:00:WU01:FS00:0xa7:Completed 2400000 out of 2500000 steps (96%)
2020-04-15:02:11:44:WU01:FS00:0xa7:Completed 2425000 out of 2500000 steps (97%)
2020-04-15:02:14:32:WU01:FS00:0xa7:Completed 2450000 out of 2500000 steps (98%)
2020-04-15:02:17:22:WU01:FS00:0xa7:Completed 2475000 out of 2500000 steps (99%)
2020-04-15:02:20:15:WU01:FS00:0xa7:Completed 2500000 out of 2500000 steps (100%)
2020-04-15:02:20:16:WU01:FS00:0xa7:Saving result file ..\logfile_01.txt
2020-04-15:02:20:16:WU01:FS00:0xa7:Saving result file dhdl.xvg
2020-04-15:02:20:16:WU01:FS00:0xa7:Saving result file frame4.trr
2020-04-15:02:20:16:WU01:FS00:0xa7:Saving result file md.log
2020-04-15:02:20:16:WU01:FS00:0xa7:Saving result file science.log
2020-04-15:02:20:16:WU01:FS00:0xa7:Saving result file traj_comp.xtc
2020-04-15:02:20:16:WU01:FS00:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
2020-04-15:02:20:17:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
2020-04-15:02:20:17:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14380 run:52 clone:0 gen:4 core:0xa7 unit:0x00000004455e42075e929e325c292609
2020-04-15:02:20:17:WU01:FS00:Uploading 8.35MiB to 69.94.66.7
2020-04-15:02:20:17:WU01:FS00:Connecting to 69.94.66.7:8080
2020-04-15:02:20:23:WU01:FS00:Upload 28.43%
2020-04-15:02:20:29:WU01:FS00:Upload 59.86%
2020-04-15:02:20:35:WU01:FS00:Upload 92.78%
2020-04-15:02:20:36:WU01:FS00:Upload complete
2020-04-15:02:20:36:WU01:FS00:Server responded WORK_ACK (400)
2020-04-15:02:20:36:WU01:FS00:Final credit estimate, 2558.00 points
2020-04-15:02:20:36:WU01:FS00:Cleaning up
April 17th:

Code: Select all

02:49:23:WU00:FS01:Connecting to 18.218.241.186:80
02:49:23:WARNING:WU00:FS01:Failed to get assignment from '18.218.241.186:80': 10002: Received short response, expected 272 bytes, got 0
02:49:23:ERROR:WU00:FS01:Exception: Could not get an assignment
02:49:39:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
02:49:39:WU01:FS01:Connecting to 128.252.203.2:80
02:49:39:WU01:FS01:Upload 0.11%
02:49:50:WU01:FS01:Upload 0.22%
02:49:51:WARNING:WU01:FS01:Exception: Failed to send results to work server: Transfer failed
02:49:51:WU01:FS01:Trying to send results to collection server
02:49:51:WU01:FS01:Uploading 56.46MiB to 52.224.109.74
02:49:51:WU01:FS01:Connecting to 52.224.109.74:8080
02:49:57:WU01:FS01:Upload 11.29%
02:50:03:WU01:FS01:Upload 18.93%
02:50:09:WU01:FS01:Upload 32.55%
02:50:15:WU01:FS01:Upload 42.73%
02:50:21:WU01:FS01:Upload 54.91%
02:50:27:WU01:FS01:Upload 64.87%
02:50:33:WU01:FS01:Upload 71.62%
02:50:39:WU01:FS01:Upload 82.58%
02:50:45:WU01:FS01:Upload 94.09%
02:50:57:WU01:FS01:Upload complete
02:50:57:WU01:FS01:Server responded WORK_ACK (400)
02:50:57:WU01:FS01:Final credit estimate, 129614.00 points
02:50:57:WU01:FS01:Cleaning up
02:52:00:WU00:FS01:Connecting to 65.254.110.245:8080
02:52:00:WU00:FS01:Assigned to work server 128.252.203.2
02:52:00:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 128.252.203.2
02:52:00:WU00:FS01:Connecting to 128.252.203.2:8080
02:52:21:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
02:52:21:WU00:FS01:Connecting to 128.252.203.2:80
******************************* Date: 2020-04-18 *******************************
03:27:07:FS01:Paused
S1: AMD R5 3600 & Sapphire RX 5700 XT Reference @2.1GHz under water
S2: Intel Xeon E5-2620v3 & MSI GTX 1650

RX 5700 XT Project & PPD Tracking Spreadsheet

Image
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Client Randomly Getting Stuck On "Connecting to"

Post by PantherX »

I believe that what you encounter on a "regular" basis is something that we did struggle to fully resolve it: https://github.com/FoldingAtHome/fah-issues/issues/983

I will link to this topic for reference.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
MrFrizzy
Posts: 123
Joined: Fri Feb 14, 2020 4:48 am

Re: Client Randomly Getting Stuck On "Connecting to"

Post by MrFrizzy »

Thank you, PantherX!

Funnily enough, I got this to happen again immediately after restarting the client. Below is the full log (paused at the end to add a new timestamp).

Is there anything I can do with debugging on my end to try and figure out what is going on? I can run ProcMon and capture logs if that is of any help.

Code: Select all

*********************** Log Started 2020-04-18T03:47:49Z ***********************
03:47:49:************************* Folding@home Client *************************
03:47:49:        Website: https://foldingathome.org/
03:47:49:      Copyright: (c) 2009-2018 foldingathome.org
03:47:49:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:47:49:           Args: 
03:47:49:         Config: C:\Users\VuVault\AppData\Roaming\FAHClient\config.xml
03:47:49:******************************** Build ********************************
03:47:49:        Version: 7.5.1
03:47:49:           Date: May 11 2018
03:47:49:           Time: 13:06:32
03:47:49:     Repository: Git
03:47:49:       Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
03:47:49:         Branch: master
03:47:49:       Compiler: Visual C++ 2008
03:47:49:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
03:47:49:       Platform: win32 10
03:47:49:           Bits: 32
03:47:49:           Mode: Release
03:47:49:******************************* System ********************************
03:47:49:            CPU: Intel(R) Xeon(R) CPU E5-2609 v3 @ 1.90GHz
03:47:49:         CPU ID: GenuineIntel Family 6 Model 63 Stepping 2
03:47:49:           CPUs: 6
03:47:49:         Memory: 7.93GiB
03:47:49:    Free Memory: 5.42GiB
03:47:49:        Threads: WINDOWS_THREADS
03:47:49:     OS Version: 6.2
03:47:49:    Has Battery: false
03:47:49:     On Battery: false
03:47:49:     UTC Offset: -7
03:47:49:            PID: 6016
03:47:49:            CWD: C:\Users\VuVault\AppData\Roaming\FAHClient
03:47:49:             OS: Windows 8.1 Pro
03:47:49:        OS Arch: AMD64
03:47:49:           GPUs: 1
03:47:49:          GPU 0: Bus:3 Slot:0 Func:0 NVIDIA:6 GM200 [GeForce GTX 980 Ti] 5632
03:47:49:  CUDA Device 0: Platform:0 Device:0 Bus:3 Slot:0 Compute:5.2 Driver:11.0
03:47:49:OpenCL Device 0: Platform:0 Device:0 Bus:3 Slot:0 Compute:1.2 Driver:445.87
03:47:49:  Win32 Service: false
03:47:49:***********************************************************************
03:47:49:<config>
03:47:49:  <service-description v='Folding@home Client'/>
03:47:49:  <service-restart v='true'/>
03:47:49:  <service-restart-delay v='5000'/>
03:47:49:
03:47:49:  <!-- Client Control -->
03:47:49:  <client-threads v='6'/>
03:47:49:  <cycle-rate v='4'/>
03:47:49:  <cycles v='-1'/>
03:47:49:  <data-directory v='.'/>
03:47:49:  <disable-sleep-when-active v='true'/>
03:47:49:  <exec-directory v='C:\Program Files (x86)\FAHClient'/>
03:47:49:  <exit-when-done v='false'/>
03:47:49:  <fold-anon v='false'/>
03:47:49:  <open-web-control v='false'/>
03:47:49:
03:47:49:  <!-- Configuration -->
03:47:49:  <config-rotate v='true'/>
03:47:49:  <config-rotate-dir v='configs'/>
03:47:49:  <config-rotate-max v='16'/>
03:47:49:
03:47:49:  <!-- Debugging -->
03:47:49:  <assignment-servers>
03:47:49:    assign1.foldingathome.org:8080 assign2.foldingathome.org:80
03:47:49:  </assignment-servers>
03:47:49:  <auth-as v='true'/>
03:47:49:  <capture-directory v='capture'/>
03:47:49:  <capture-on-error v='false'/>
03:47:49:  <capture-packets v='false'/>
03:47:49:  <capture-requests v='false'/>
03:47:49:  <capture-responses v='false'/>
03:47:49:  <capture-sockets v='false'/>
03:47:49:  <core-exec v='FahCore_$type'/>
03:47:49:  <core-wrapper-exec v='FAHCoreWrapper'/>
03:47:49:  <debug-sockets v='false'/>
03:47:49:  <exception-locations v='true'/>
03:47:49:  <stack-traces v='false'/>
03:47:49:
03:47:49:  <!-- Error Handling -->
03:47:49:  <max-slot-errors v='10'/>
03:47:49:  <max-unit-errors v='5'/>
03:47:49:
03:47:49:  <!-- Folding Core -->
03:47:49:  <checkpoint v='15'/>
03:47:49:  <core-dir v='cores'/>
03:47:49:  <core-priority v='idle'/>
03:47:49:  <cpu-affinity v='false'/>
03:47:49:  <cpu-usage v='100'/>
03:47:49:  <gpu-usage v='100'/>
03:47:49:  <no-assembly v='false'/>
03:47:49:
03:47:49:  <!-- Folding Slot Configuration -->
03:47:49:  <cause v='ANY'/>
03:47:49:  <client-subtype v='STDCLI'/>
03:47:49:  <client-type v='beta'/>
03:47:49:  <cpu-species v='X86_PENTIUM_II'/>
03:47:49:  <cpu-type v='AMD64'/>
03:47:49:  <cpus v='-1'/>
03:47:49:  <disable-viz v='false'/>
03:47:49:  <gpu v='true'/>
03:47:49:  <max-packet-size v='normal'/>
03:47:49:  <os-species v='WIN_8'/>
03:47:49:  <os-type v='WIN32'/>
03:47:49:  <project-key v='0'/>
03:47:49:  <smp v='true'/>
03:47:49:
03:47:49:  <!-- GUI -->
03:47:49:  <gui-enabled v='true'/>
03:47:49:
03:47:49:  <!-- HTTP Server -->
03:47:49:  <allow v='127.0.0.1'/>
03:47:49:  <connection-timeout v='60'/>
03:47:49:  <deny v='0/0'/>
03:47:49:  <http-addresses v='0:7396'/>
03:47:49:  <https-addresses v=''/>
03:47:49:  <max-connect-time v='900'/>
03:47:49:  <max-connections v='800'/>
03:47:49:  <max-request-length v='52428800'/>
03:47:49:  <min-connect-time v='300'/>
03:47:49:
03:47:49:  <!-- Logging -->
03:47:49:  <log v='log.txt'/>
03:47:49:  <log-color v='false'/>
03:47:49:  <log-crlf v='true'/>
03:47:49:  <log-date v='false'/>
03:47:49:  <log-date-periodically v='3600'/>
03:47:49:  <log-domain v='false'/>
03:47:49:  <log-header v='true'/>
03:47:49:  <log-level v='true'/>
03:47:49:  <log-no-info-header v='true'/>
03:47:49:  <log-redirect v='false'/>
03:47:49:  <log-rotate v='true'/>
03:47:49:  <log-rotate-dir v='logs'/>
03:47:49:  <log-rotate-max v='16'/>
03:47:49:  <log-short-level v='false'/>
03:47:49:  <log-simple-domains v='true'/>
03:47:49:  <log-thread-id v='false'/>
03:47:49:  <log-thread-prefix v='true'/>
03:47:49:  <log-time v='true'/>
03:47:49:  <log-to-screen v='true'/>
03:47:49:  <log-truncate v='false'/>
03:47:49:  <verbosity v='5'/>
03:47:49:
03:47:49:  <!-- Network -->
03:47:49:  <proxy v=':8080'/>
03:47:49:  <proxy-enable v='false'/>
03:47:49:  <proxy-pass v=''/>
03:47:49:  <proxy-user v=''/>
03:47:49:
03:47:49:  <!-- Process Control -->
03:47:49:  <child v='false'/>
03:47:49:  <daemon v='false'/>
03:47:49:  <pid v='false'/>
03:47:49:  <pid-file v='Folding@home Client.pid'/>
03:47:49:  <respawn v='false'/>
03:47:49:  <service v='false'/>
03:47:49:
03:47:49:  <!-- Remote Command Server -->
03:47:49:  <command-address v='0.0.0.0'/>
03:47:49:  <command-allow-no-pass v='127.0.0.1'/>
03:47:49:  <command-deny-no-pass v='0/0'/>
03:47:49:  <command-enable v='true'/>
03:47:49:  <command-port v='36330'/>
03:47:49:
03:47:49:  <!-- Slot Control -->
03:47:49:  <idle v='false'/>
03:47:49:  <max-shutdown-wait v='60'/>
03:47:49:  <pause-on-battery v='true'/>
03:47:49:  <pause-on-start v='true'/>
03:47:49:  <paused v='false'/>
03:47:49:  <power v='full'/>
03:47:49:
03:47:49:  <!-- User Information -->
03:47:49:  <machine-id v='0'/>
03:47:49:  <passkey v='********************************'/>
03:47:49:  <team v='223518'/>
03:47:49:  <user v='MrFrizzy'/>
03:47:49:
03:47:49:  <!-- Web Server -->
03:47:49:  <web-allow v='127.0.0.1'/>
03:47:49:  <web-deny v='0/0'/>
03:47:49:  <web-enable v='true'/>
03:47:49:
03:47:49:  <!-- Web Server Sessions -->
03:47:49:  <session-cookie v='sid'/>
03:47:49:  <session-lifetime v='86400'/>
03:47:49:  <session-timeout v='3600'/>
03:47:49:
03:47:49:  <!-- Work Unit Control -->
03:47:49:  <dump-after-deadline v='true'/>
03:47:49:  <max-queue v='16'/>
03:47:49:  <max-units v='0'/>
03:47:49:  <next-unit-percentage v='99'/>
03:47:49:  <stall-detection-enabled v='false'/>
03:47:49:  <stall-percent v='5'/>
03:47:49:  <stall-timeout v='1800'/>
03:47:49:
03:47:49:  <!-- Folding Slots -->
03:47:49:  <slot id='1' type='GPU'>
03:47:49:    <core-priority v='low'/>
03:47:49:    <paused v='true'/>
03:47:49:  </slot>
03:47:49:</config>
03:47:49:Trying to access database...
03:47:49:Successfully acquired database lock
03:47:49:Enabled folding slot 01: PAUSED gpu:0:GM200 [GeForce GTX 980 Ti] 5632 (by user)
03:47:49:Started thread 8 on PID 6016
03:47:49:Started thread 6 on PID 6016
03:47:49:Started thread 5 on PID 6016
03:47:49:Started thread 4 on PID 6016
03:47:49:Started thread 7 on PID 6016
03:47:49:Started thread 9 on PID 6016
03:47:57:Started thread 10 on PID 6016
03:48:10:FS01:Unpaused
03:48:10:WU00:FS01:Connecting to 65.254.110.245:8080
03:48:10:WU00:FS01:Assigned to work server 128.252.203.2
03:48:10:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GM200 [GeForce GTX 980 Ti] 5632 from 128.252.203.2
03:48:10:WU00:FS01:Connecting to 128.252.203.2:8080
03:48:31:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
03:48:31:WU00:FS01:Connecting to 128.252.203.2:80
03:48:50:Removing old file 'configs/config-20200414-210203.xml'
03:48:50:Saving configuration to config.xml
03:48:50:<config>
03:48:50:  <!-- Folding Slot Configuration -->
03:48:50:  <client-type v='beta'/>
03:48:50:
03:48:50:  <!-- Logging -->
03:48:50:  <log-date-periodically v='3600'/>
03:48:50:  <verbosity v='5'/>
03:48:50:
03:48:50:  <!-- Network -->
03:48:50:  <proxy v=':8080'/>
03:48:50:
03:48:50:  <!-- Slot Control -->
03:48:50:  <pause-on-start v='true'/>
03:48:50:  <power v='full'/>
03:48:50:
03:48:50:  <!-- User Information -->
03:48:50:  <passkey v='********************************'/>
03:48:50:  <team v='223518'/>
03:48:50:  <user v='MrFrizzy'/>
03:48:50:
03:48:50:  <!-- Folding Slots -->
03:48:50:  <slot id='1' type='GPU'>
03:48:50:    <core-priority v='low'/>
03:48:50:  </slot>
03:48:50:</config>
03:49:28:Started thread 11 on PID 6016
03:59:07:FS01:Paused
S1: AMD R5 3600 & Sapphire RX 5700 XT Reference @2.1GHz under water
S2: Intel Xeon E5-2620v3 & MSI GTX 1650

RX 5700 XT Project & PPD Tracking Spreadsheet

Image
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Client Randomly Getting Stuck On "Connecting to"

Post by PantherX »

Can you please:
1) Use verbosity level 3 since 5 makes it very difficult to read and troubleshoot.
2) Considered upgrading to V7.6.9 which is the next stable version: viewtopic.php?f=24&t=34466

While I don't know if this solves your issue or not, it would be good to update once you have finished all the WUs by setting your client to finish.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
MrFrizzy
Posts: 123
Joined: Fri Feb 14, 2020 4:48 am

Re: Client Randomly Getting Stuck On "Connecting to"

Post by MrFrizzy »

PantherX wrote:Can you please:
1) Use verbosity level 3 since 5 makes it very difficult to read and troubleshoot.
2) Considered upgrading to V7.6.9 which is the next stable version: viewtopic.php?f=24&t=34466
Ah, yeah I have dropped it back to 3. I initially did that when I enabled all of the debugging flags, but then I couldn't get any WUs. I have updated to the latest release and will continue to monitor!
S1: AMD R5 3600 & Sapphire RX 5700 XT Reference @2.1GHz under water
S2: Intel Xeon E5-2620v3 & MSI GTX 1650

RX 5700 XT Project & PPD Tracking Spreadsheet

Image
MrFrizzy
Posts: 123
Joined: Fri Feb 14, 2020 4:48 am

Re: Client Randomly Getting Stuck On "Connecting to"

Post by MrFrizzy »

MrFrizzy wrote:I have updated to the latest release and will continue to monitor!
So i noticed that every single attempt to connect to the servers was met with the message about receiving a short response. After several hours of that, I tried to uninstall and reinstall; same thing. Rebooted; same thing. Did some windows updates, rebooted; same thing. Uninstalled, rebooted, and reinstalled; same thing. I then proceeded to uninstalled with removing the data, rebooted, then reinstalled and now the GPU isn't detected. Rebooted again; not detected. Uninstalled with removing data, rebooted, reinstalled; still not detected. Uninstalled with removing data, rebooted, installed the old version; still not detected. Did a clean reinstall of the latest Nvidia drivers, rebooted; still not detected.

So I don't know what is going on now, but sounds like I have more research to do to figure out how to fix this. Hopefully I can do everything remotely as I won't get physical access to the machine again until sometime next week.

EDIT: My last idea was to check the gpus.txt file and found that it was 0KB. I copied the file from my working machine and now it sees the GPU with no problem. If I try to get an assignment, I get the "received short response" message from both assignment servers, the same message that started me trying to uninstall and reinstall so much. As soon as I turn my VPN on, I immediately get an assignment but continue to have problems getting a WU. I will leave the VPN on overnight and see what happens.
S1: AMD R5 3600 & Sapphire RX 5700 XT Reference @2.1GHz under water
S2: Intel Xeon E5-2620v3 & MSI GTX 1650

RX 5700 XT Project & PPD Tracking Spreadsheet

Image
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Client Randomly Getting Stuck On "Connecting to"

Post by PantherX »

Can you please post the log file if possible? Do you have any firewall settings (hardware/software) that could cause issues. For troubleshooting Server connectivity issues, please view this thread: viewtopic.php?f=18&t=17794
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Post Reply