Problem with stats. [Lots of 0 point WUs]

Moderators: Site Moderators, FAHC Science Team

Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: Problem with stats server

Post by Tobit »

Different problem but related. 485 failed uploads of this unit to 128.143.199.96 and it recording stats each time tells me it's related to the WS somehow. As I posted above, I suspect the WS is seeing a partial connection/upload and reporting something to the stats server but the client never fully transmits the entire results file back so it stays in queue to repeat the process.
artoar_11
Posts: 657
Joined: Sun Nov 22, 2009 8:42 pm
Hardware configuration: AMD R7 3700X @ 4.0 GHz; ASUS ROG STRIX X470-F GAMING; DDR4 2x8GB @ 3.0 GHz; GByte RTX 3060 Ti @ 1890 MHz; Fortron-550W 80+ bronze; Win10 Pro/64
Location: Bulgaria/Team #224497/artoar11_ALL_....

Re: Problem with stats server

Post by artoar_11 »

Tobit wrote:In the log file I posted, I suspect what is happening is:

Client tries to upload to 128.143.199.96
Server has been in and out of reject state for awhile
Client makes a long enough connection that the WS thinks it receives a WU and records 0 credit or partial credit
Client ultimately timesout transmitting results
Keeps in queue to try again
Rinse - Repeat
If I remember correctly of SMP partial credit not given.
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: Problem with stats server

Post by Tobit »

artoar_11 wrote:If I remember correctly of SMP partial credit not given.
Based on the weird stats I've been seeing, that doesn't agree in this instance.
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: Problem with stats server

Post by Tobit »

Bruce, do you have enough log file data now? I have two more logs from KMac available but it's essentially the same data I already posted. A quote from KMac - "My total upload failure count from 3 machines is around 1000 in the last 12 hours. That is almost exactly how many zero point completed WUs I have in the same timeframe."
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Problem with stats server

Post by bruce »

Tobit wrote:Bruce, do you have enough log file data now? I have two more logs from KMac available but it's essentially the same data I already posted. A quote from KMac - "My total upload failure count from 3 machines is around 1000 in the last 12 hours. That is almost exactly how many zero point completed WUs I have in the same timeframe."
No.

I still can' tell for sure which PRCG is causing the problem and/or which server they are downloading from. Has KMac's machine been rebooted or (temporarily) taken off-line yet?
KMac
Posts: 31
Joined: Thu Feb 17, 2011 6:50 pm

Re: Problem with stats server

Post by KMac »

I shut down two machines and am watching the remaining problem one currently at 73%.

Code: Select all

--- Opening Log file [October 6 16:25:48 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\FAH\SMP
Executable: C:\Program Files (x86)\FAH\FAH.exe
Arguments: -oneunit -forceasm -smp -verbosity 9 

[16:25:48] - Ask before connecting: No
[16:25:48] - User name: KMac (Team 33)
[16:25:48] - User ID: 7F87BEF1323CE7CF
[16:25:48] - Machine ID: 2
[16:25:48] 
[16:25:48] Loaded queue successfully.
[16:25:48] - Preparing to get new work unit...
[16:25:48] - Autosending finished units... [October 6 16:25:48 UTC]
[16:25:48] Trying to send all finished work units
[16:25:48] Cleaning up work directory
[16:25:48] Project: 6995 (Run 0, Clone 38, Gen 399)
[16:25:48] + Attempting to get work packet


[16:25:48] Passkey found
[16:25:48] + Attempting to send results [October 6 16:25:48 UTC]
[16:25:48] - Will indicate memory of 16359 MB
[16:25:48] - Reading file work/wuresults_01.dat from core
[16:25:48] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[16:25:48] - Connecting to assignment server
[16:25:48]   (Read 3520864 bytes from disk)
[16:25:48] Connecting to http://assign.stanford.edu:8080/
[16:25:48] Connecting to http://128.143.199.96:8080/
[16:25:49] Posted data.
[16:25:49] Initial: 43AB; - Successful: assigned to (171.67.108.58).
[16:25:49] + News From Folding@Home: Welcome to Folding@Home
[16:25:49] Loaded queue successfully.
[16:25:49] Sent data
[16:25:49] Connecting to http://171.67.108.58:8080/
[16:25:50] - Couldn't send HTTP request to server
[16:25:50] + Could not connect to Work Server (results)
[16:25:50]     (128.143.199.96:8080)
[16:25:50] + Retrying using alternative port
[16:25:50] Connecting to http://128.143.199.96:80/
[16:25:50] Posted data.
[16:25:50] Initial: 0000; - Receiving payload (expected size: 544510)
[16:25:51] - Couldn't send HTTP request to server
[16:25:51] + Could not connect to Work Server (results)
[16:25:51]     (128.143.199.96:80)
[16:25:51] - Error: Could not transmit unit 01 (completed October 6) to work server.
[16:25:51] - 439 failed uploads of this unit.


[16:25:51] + Attempting to send results [October 6 16:25:51 UTC]
[16:25:51] - Reading file work/wuresults_01.dat from core
[16:25:51]   (Read 3520864 bytes from disk)
[16:25:51] Connecting to http://130.237.165.141:8080/
[16:25:51] - Downloaded at ~531 kB/s
[16:25:51] - Averaged speed for that direction ~642 kB/s
[16:25:51] + Received work.
[16:25:51] + Closed connections
[16:25:51] 
[16:25:51] + Processing work unit
[16:25:51] A4 will attempt to use 8 threads.
[16:25:51] Core required: FahCore_a4.exe
[16:25:51] Core found.
[16:25:51] Working on queue slot 03 [October 6 16:25:51 UTC]
[16:25:51] + Working ...
[16:25:51] - Calling '.\FahCore_a4.exe -dir work/ -nice 19 -suffix 03 -np 8 -nocpulock -checkpoint 3 -forceasm -verbose -lifeline 4140 -version 634'

[16:25:51] 
[16:25:51] *------------------------------*
[16:25:51] Folding@Home Gromacs GB Core
[16:25:51] Version 2.27 (Dec. 15, 2010)
[16:25:51] 
[16:25:51] Preparing to commence simulation
[16:25:51] - Assembly optimizations manually forced on.
[16:25:51] - Not checking prior termination.
[16:25:51] - Expanded 543998 -> 1305456 (decompressed 239.9 percent)
[16:25:51] Called DecompressByteArray: compressed_data_size=543998 data_size=1305456, decompressed_data_size=1305456 diff=0
[16:25:51] - Digital signature verified
[16:25:51] 
[16:25:51] Project: 8001 (Run 105, Clone 110, Gen 26)
[16:25:51] 
[16:25:51] Assembly optimizations on if available.
[16:25:51] Entering M.D.
[16:25:58] Mapping NT from 8 to 8 
[16:25:58] Completed 0 out of 250000 steps  (0%)
[16:26:37] Completed 2500 out of 250000 steps  (1%)
[16:26:38] Posted data.
[16:26:38] Initial: 0000; - Uploaded at ~73 kB/s
[16:26:38] - Averaged speed for that direction ~66 kB/s
[16:26:38] - Server does not have record of this unit. Will try again later.
[16:26:38]   Could not transmit unit 01 to Collection server; keeping in queue.
[16:26:38] + Sent 0 of 1 completed units to the server
[16:26:38] - Autosend completed


--- Opening Log file [October 6 16:32:45 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\FAH\SMP
Executable: C:\Program Files (x86)\FAH\FAH.exe
Arguments: -oneunit -forceasm -smp -verbosity 9 

[16:32:45] - Ask before connecting: No
[16:32:45] - User name: KMac (Team 33)
[16:32:45] - User ID: 7F87BEF1323CE7CF
[16:32:45] - Machine ID: 2
[16:32:45] 
[16:32:46] Loaded queue successfully.
[16:32:46] 
[16:32:46] - Autosending finished units... [October 6 16:32:46 UTC]
[16:32:46] + Processing work unit
[16:32:46] Trying to send all finished work units
[16:32:46] A4 will attempt to use 8 threads.
[16:32:46] Project: 6995 (Run 0, Clone 38, Gen 399)
[16:32:46] Core required: FahCore_a4.exe
[16:32:46] Core found.


[16:32:46] + Attempting to send results [October 6 16:32:46 UTC]
[16:32:46] - Reading file work/wuresults_01.dat from core
[16:32:46] Working on queue slot 03 [October 6 16:32:46 UTC]
[16:32:46] + Working ...
[16:32:46] - Calling '.\FahCore_a4.exe -dir work/ -nice 19 -suffix 03 -np 8 -nocpulock -checkpoint 3 -forceasm -verbose -lifeline 6916 -version 634'

[16:32:46]   (Read 3520864 bytes from disk)
[16:32:46] Connecting to http://128.143.199.96:8080/
[16:32:46] 
[16:32:46] *------------------------------*
[16:32:46] Folding@Home Gromacs GB Core
[16:32:46] Version 2.27 (Dec. 15, 2010)
[16:32:46] 
[16:32:46] Preparing to commence simulation
[16:32:46] - Ensuring status. Please wait.
[16:32:49] - Couldn't send HTTP request to server
[16:32:49] + Could not connect to Work Server (results)
[16:32:49]     (128.143.199.96:8080)
[16:32:49] + Retrying using alternative port
[16:32:49] Connecting to http://128.143.199.96:80/
[16:32:50] - Couldn't send HTTP request to server
[16:32:50] + Could not connect to Work Server (results)
[16:32:50]     (128.143.199.96:80)
[16:32:50] - Error: Could not transmit unit 01 (completed October 6) to work server.
[16:32:50] - 440 failed uploads of this unit.


[16:32:50] + Attempting to send results [October 6 16:32:50 UTC]
[16:32:50] - Reading file work/wuresults_01.dat from core
[16:32:50]   (Read 3520864 bytes from disk)
[16:32:50] Connecting to http://130.237.165.141:8080/
[16:32:55] - Assembly optimizations manually forced on.
[16:32:55] - Not checking prior termination.
[16:32:55] - Expanded 543998 -> 1305456 (decompressed 239.9 percent)
[16:32:55] Called DecompressByteArray: compressed_data_size=543998 data_size=1305456, decompressed_data_size=1305456 diff=0
[16:32:55] - Digital signature verified
[16:32:55] 
[16:32:55] Project: 8001 (Run 105, Clone 110, Gen 26)
[16:32:55] 
[16:32:55] Assembly optimizations on if available.
[16:32:55] Entering M.D.
[16:33:02] Mapping NT from 8 to 8 
[16:33:21] Completed 0 out of 250000 steps  (0%)
[16:34:21] Posted data.
[16:34:21] Initial: 0000; - Uploaded at ~37 kB/s
[16:34:21] - Averaged speed for that direction ~60 kB/s
[16:34:21] - Server does not have record of this unit. Will try again later.
[16:34:21]   Could not transmit unit 01 to Collection server; keeping in queue.
[16:34:21] + Sent 0 of 1 completed units to the server
[16:34:21] - Autosend completed


--- Opening Log file [October 6 16:52:45 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\FAH\SMP
Executable: C:\Program Files (x86)\FAH\FAH.exe
Arguments: -oneunit -forceasm -smp -verbosity 9 

[16:52:45] - Ask before connecting: No
[16:52:45] - User name: KMac (Team 33)
[16:52:45] - User ID: 7F87BEF1323CE7CF
[16:52:45] - Machine ID: 2
[16:52:45] 
[16:52:45] Loaded queue successfully.
[16:52:45] 
[16:52:45] - Autosending finished units... [October 6 16:52:45 UTC]
[16:52:45] + Processing work unit
[16:52:45] Trying to send all finished work units
[16:52:45] A4 will attempt to use 8 threads.
[16:52:45] Project: 6995 (Run 0, Clone 38, Gen 399)
[16:52:45] Core required: FahCore_a4.exe


[16:52:45] Core found.
[16:52:45] + Attempting to send results [October 6 16:52:45 UTC]
[16:52:45] - Reading file work/wuresults_01.dat from core
[16:52:45] Working on queue slot 03 [October 6 16:52:45 UTC]
[16:52:45] + Working ...
[16:52:45] - Calling '.\FahCore_a4.exe -dir work/ -nice 19 -suffix 03 -np 8 -nocpulock -checkpoint 3 -forceasm -verbose -lifeline 6428 -version 634'

[16:52:45]   (Read 3520864 bytes from disk)
[16:52:45] Connecting to http://128.143.199.96:8080/
[16:52:45] 
[16:52:45] *------------------------------*
[16:52:45] Folding@Home Gromacs GB Core
[16:52:45] Version 2.27 (Dec. 15, 2010)
[16:52:45] 
[16:52:45] Preparing to commence simulation
[16:52:45] - Ensuring status. Please wait.
[16:52:46] - Couldn't send HTTP request to server
[16:52:46] + Could not connect to Work Server (results)
[16:52:46]     (128.143.199.96:8080)
[16:52:46] + Retrying using alternative port
[16:52:46] Connecting to http://128.143.199.96:80/
[16:52:48] - Couldn't send HTTP request to server
[16:52:48] + Could not connect to Work Server (results)
[16:52:48]     (128.143.199.96:80)
[16:52:48] - Error: Could not transmit unit 01 (completed October 6) to work server.
[16:52:48] - 441 failed uploads of this unit.


[16:52:48] + Attempting to send results [October 6 16:52:48 UTC]
[16:52:48] - Reading file work/wuresults_01.dat from core
[16:52:48]   (Read 3520864 bytes from disk)
[16:52:48] Connecting to http://130.237.165.141:8080/
[16:52:55] - Assembly optimizations manually forced on.
[16:52:55] - Not checking prior termination.
[16:52:55] - Expanded 543998 -> 1305456 (decompressed 239.9 percent)
[16:52:55] Called DecompressByteArray: compressed_data_size=543998 data_size=1305456, decompressed_data_size=1305456 diff=0
[16:52:55] - Digital signature verified
[16:52:55] 
[16:52:55] Project: 8001 (Run 105, Clone 110, Gen 26)
[16:52:55] 
[16:52:55] Assembly optimizations on if available.
[16:52:55] Entering M.D.
[16:53:01] Using Gromacs checkpoints
[16:53:01] Mapping NT from 8 to 8 
[16:54:03] Posted data.
[16:54:03] Initial: 0000; - Uploaded at ~45 kB/s
[16:54:03] - Averaged speed for that direction ~57 kB/s
[16:54:03] - Server does not have record of this unit. Will try again later.
[16:54:03]   Could not transmit unit 01 to Collection server; keeping in queue.
[16:54:03] + Sent 0 of 1 completed units to the server
[16:54:03] - Autosend completed
[16:54:26] Resuming from checkpoint
[16:54:26] Verified work/wudata_03.log
[16:54:26] Verified work/wudata_03.trr
[16:54:26] Verified work/wudata_03.xtc
[16:54:26] Verified work/wudata_03.edr
[16:54:40] Completed 80 out of 250000 steps  (0%)
[17:07:55] Completed 2500 out of 250000 steps  (1%)
[17:17:22] Completed 5000 out of 250000 steps  (2%)
[17:21:37] Completed 7500 out of 250000 steps  (3%)
[17:34:40] Completed 10000 out of 250000 steps  (4%)
[17:37:51] Killing all core threads
[17:37:51] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[17:37:51] ***** Got a SIGTERM signal (2)
[17:37:51] Killing all core threads
[17:37:51] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [October 6 20:53:39 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\FAH\SMP
Executable: C:\Program Files (x86)\FAH\FAH.exe
Arguments: -oneunit -forceasm -smp -verbosity 9 

[20:53:39] - Ask before connecting: No
[20:53:39] - User name: KMac (Team 33)
[20:53:39] - User ID: 7F87BEF1323CE7CF
[20:53:39] - Machine ID: 2
[20:53:39] 
[20:53:40] Loaded queue successfully.
[20:53:40] 
[20:53:40] + Processing work unit
[20:53:40] A4 will attempt to use 8 threads.
[20:53:40] Core required: FahCore_a4.exe
[20:53:40] - Autosending finished units... [October 6 20:53:40 UTC]
[20:53:40] Core found.
[20:53:40] Trying to send all finished work units
[20:53:40] Project: 6995 (Run 0, Clone 38, Gen 399)


[20:53:40] + Attempting to send results [October 6 20:53:40 UTC]
[20:53:40] - Reading file work/wuresults_01.dat from core
[20:53:40] Working on queue slot 03 [October 6 20:53:40 UTC]
[20:53:40] + Working ...
[20:53:40] - Calling '.\FahCore_a4.exe -dir work/ -nice 19 -suffix 03 -np 8 -nocpulock -checkpoint 3 -forceasm -verbose -lifeline 7052 -version 634'

[20:53:40]   (Read 3520864 bytes from disk)
[20:53:40] Connecting to http://128.143.199.96:8080/
[20:53:40] 
[20:53:40] *------------------------------*
[20:53:40] Folding@Home Gromacs GB Core
[20:53:40] Version 2.27 (Dec. 15, 2010)
[20:53:40] 
[20:53:40] Preparing to commence simulation
[20:53:40] - Ensuring status. Please wait.
[20:53:41] - Couldn't send HTTP request to server
[20:53:41] + Could not connect to Work Server (results)
[20:53:41]     (128.143.199.96:8080)
[20:53:41] + Retrying using alternative port
[20:53:41] Connecting to http://128.143.199.96:80/
[20:53:42] - Couldn't send HTTP request to server
[20:53:42] + Could not connect to Work Server (results)
[20:53:42]     (128.143.199.96:80)
[20:53:42] - Error: Could not transmit unit 01 (completed October 6) to work server.
[20:53:42] - 442 failed uploads of this unit.


[20:53:42] + Attempting to send results [October 6 20:53:42 UTC]
[20:53:42] - Reading file work/wuresults_01.dat from core
[20:53:42]   (Read 3520864 bytes from disk)
[20:53:42] Connecting to http://130.237.165.141:8080/
[20:53:49] - Assembly optimizations manually forced on.
[20:53:49] - Not checking prior termination.
[20:53:49] - Expanded 543998 -> 1305456 (decompressed 239.9 percent)
[20:53:49] Called DecompressByteArray: compressed_data_size=543998 data_size=1305456, decompressed_data_size=1305456 diff=0
[20:53:49] - Digital signature verified
[20:53:49] 
[20:53:49] Project: 8001 (Run 105, Clone 110, Gen 26)
[20:53:49] 
[20:53:49] Assembly optimizations on if available.
[20:53:49] Entering M.D.
[20:53:56] Using Gromacs checkpoints
[20:53:56] Mapping NT from 8 to 8 
[20:53:56] Resuming from checkpoint
[20:53:56] Verified work/wudata_03.log
[20:53:56] Verified work/wudata_03.trr
[20:53:56] Verified work/wudata_03.xtc
[20:53:56] Verified work/wudata_03.edr
[20:53:56] Completed 10000 out of 250000 steps  (4%)
[20:54:11] Posted data.
[20:54:11] Initial: 0000; - Uploaded at ~118 kB/s
[20:54:11] - Averaged speed for that direction ~69 kB/s
[20:54:11] - Server does not have record of this unit. Will try again later.
[20:54:11]   Could not transmit unit 01 to Collection server; keeping in queue.
[20:54:11] + Sent 0 of 1 completed units to the server
[20:54:11] - Autosend completed
[20:54:34] Completed 12500 out of 250000 steps  (5%)
[20:55:11] Completed 15000 out of 250000 steps  (6%)
[20:55:49] Completed 17500 out of 250000 steps  (7%)
[20:56:30] Completed 20000 out of 250000 steps  (8%)
[20:57:12] Completed 22500 out of 250000 steps  (9%)
[20:57:50] Completed 25000 out of 250000 steps  (10%)
[20:58:27] Completed 27500 out of 250000 steps  (11%)
[20:59:05] Completed 30000 out of 250000 steps  (12%)
[20:59:52] Completed 32500 out of 250000 steps  (13%)
[21:00:30] Completed 35000 out of 250000 steps  (14%)
[21:01:09] Completed 37500 out of 250000 steps  (15%)
[21:01:48] Completed 40000 out of 250000 steps  (16%)
[21:02:27] Completed 42500 out of 250000 steps  (17%)
[21:03:10] Completed 45000 out of 250000 steps  (18%)
[21:03:50] Completed 47500 out of 250000 steps  (19%)
[21:04:33] Completed 50000 out of 250000 steps  (20%)
[21:05:14] Completed 52500 out of 250000 steps  (21%)
[21:05:57] Completed 55000 out of 250000 steps  (22%)
[21:06:38] Completed 57500 out of 250000 steps  (23%)
[21:07:16] Completed 60000 out of 250000 steps  (24%)
[21:07:54] Completed 62500 out of 250000 steps  (25%)
[21:08:31] Completed 65000 out of 250000 steps  (26%)
[21:09:10] Completed 67500 out of 250000 steps  (27%)
[21:09:57] Completed 70000 out of 250000 steps  (28%)
[21:10:47] Completed 72500 out of 250000 steps  (29%)
[21:11:32] Completed 75000 out of 250000 steps  (30%)
[21:12:14] Completed 77500 out of 250000 steps  (31%)
[21:12:53] Completed 80000 out of 250000 steps  (32%)
[21:13:31] Completed 82500 out of 250000 steps  (33%)
[21:14:16] Completed 85000 out of 250000 steps  (34%)
[21:14:55] Completed 87500 out of 250000 steps  (35%)
[21:15:40] Completed 90000 out of 250000 steps  (36%)
[21:16:22] Completed 92500 out of 250000 steps  (37%)
[21:17:03] Completed 95000 out of 250000 steps  (38%)
[21:17:40] Completed 97500 out of 250000 steps  (39%)
[21:18:17] Completed 100000 out of 250000 steps  (40%)
[21:18:57] Completed 102500 out of 250000 steps  (41%)
[21:19:37] Completed 105000 out of 250000 steps  (42%)
[21:20:16] Completed 107500 out of 250000 steps  (43%)
[21:20:58] Completed 110000 out of 250000 steps  (44%)
[21:21:39] Completed 112500 out of 250000 steps  (45%)
[21:22:16] Completed 115000 out of 250000 steps  (46%)
[21:22:54] Completed 117500 out of 250000 steps  (47%)
[21:23:33] Completed 120000 out of 250000 steps  (48%)
[21:24:18] Completed 122500 out of 250000 steps  (49%)
[21:25:02] Completed 125000 out of 250000 steps  (50%)
[21:25:40] Completed 127500 out of 250000 steps  (51%)
[21:26:19] Completed 130000 out of 250000 steps  (52%)
[21:26:56] Completed 132500 out of 250000 steps  (53%)
[21:27:34] Completed 135000 out of 250000 steps  (54%)
[21:28:13] Completed 137500 out of 250000 steps  (55%)
[21:28:50] Completed 140000 out of 250000 steps  (56%)
[21:29:28] Completed 142500 out of 250000 steps  (57%)
[21:30:09] Completed 145000 out of 250000 steps  (58%)
[21:30:48] Completed 147500 out of 250000 steps  (59%)
[21:31:26] Completed 150000 out of 250000 steps  (60%)
[21:32:06] Completed 152500 out of 250000 steps  (61%)
[21:32:47] Completed 155000 out of 250000 steps  (62%)
[21:33:25] Completed 157500 out of 250000 steps  (63%)
[21:34:10] Completed 160000 out of 250000 steps  (64%)
[21:34:47] Completed 162500 out of 250000 steps  (65%)
[21:35:25] Completed 165000 out of 250000 steps  (66%)
[21:36:07] Completed 167500 out of 250000 steps  (67%)
[21:36:45] Completed 170000 out of 250000 steps  (68%)
[21:37:22] Completed 172500 out of 250000 steps  (69%)
[21:38:01] Completed 175000 out of 250000 steps  (70%)
[21:38:40] Completed 177500 out of 250000 steps  (71%)
[21:39:18] Completed 180000 out of 250000 steps  (72%)
[21:39:55] Completed 182500 out of 250000 steps  (73%)
Nathan_P
Posts: 1180
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 x5670@3.2 Ghz, , 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 E5-2665@2.3 Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: Problem with stats. [Lots of 0 point WUs]

Post by Nathan_P »

I have a similar problem on my v6.34 client, i've only just started getting theproblem though so my WU count is currently normal

Here's the log

Code: Select all

[16:05:58] Completed 400000 out of 500000 steps  (80%)
[16:07:56] Completed 405000 out of 500000 steps  (81%)
[16:09:55] Completed 410000 out of 500000 steps  (82%)
[16:11:56] Completed 415000 out of 500000 steps  (83%)
[16:13:54] Completed 420000 out of 500000 steps  (84%)
[16:15:54] Completed 425000 out of 500000 steps  (85%)
[16:17:54] Completed 430000 out of 500000 steps  (86%)
[16:19:55] Completed 435000 out of 500000 steps  (87%)
[16:21:56] Completed 440000 out of 500000 steps  (88%)
[16:23:56] Completed 445000 out of 500000 steps  (89%)
[16:25:55] Completed 450000 out of 500000 steps  (90%)
[16:27:55] Completed 455000 out of 500000 steps  (91%)
[16:29:55] Completed 460000 out of 500000 steps  (92%)
[16:32:34] Completed 465000 out of 500000 steps  (93%)
[16:34:36] Completed 470000 out of 500000 steps  (94%)
[16:36:37] Completed 475000 out of 500000 steps  (95%)
[16:38:37] Completed 480000 out of 500000 steps  (96%)
[16:40:40] Completed 485000 out of 500000 steps  (97%)
[16:42:42] Completed 490000 out of 500000 steps  (98%)
[16:44:42] Completed 495000 out of 500000 steps  (99%)
[16:46:43] Completed 500000 out of 500000 steps  (100%)
[16:46:45] DynamicWrapper: Finished Work Unit: sleep=10000
[16:46:55] 
[16:46:55] Finished Work Unit:
[16:46:55] - Reading up to 3697200 from "work/wudata_06.trr": Read 3697200
[16:46:55] trr file hash check passed.
[16:46:55] edr file hash check passed.
[16:46:55] logfile size: 58567
[16:46:55] Leaving Run
[16:46:59] - Writing 3791095 bytes of core data to disk...
[16:47:00] Done: 3790583 -> 3519596 (compressed to 92.8 percent)
[16:47:00]   ... Done.
[16:47:01] - Shutting down core
[16:47:01] 
[16:47:01] Folding@home Core Shutdown: FINISHED_UNIT
[16:47:04] CoreStatus = 64 (100)
[16:47:04] Unit 6 finished with 86 percent of time to deadline remaining.
[16:47:04] Updated performance fraction: 0.928540
[16:47:04] Sending work to server
[16:47:04] Project: 6997 (Run 0, Clone 96, Gen 371)


[16:47:04] + Attempting to send results [October 6 16:47:04 UTC]
[16:47:04] - Reading file work/wuresults_06.dat from core
[16:47:04]   (Read 3520108 bytes from disk)
[16:47:04] Connecting to http://128.143.199.96:8080/
[16:47:05] - Couldn't send HTTP request to server
[16:47:05] + Could not connect to Work Server (results)
[16:47:05]     (128.143.199.96:8080)
[16:47:05] + Retrying using alternative port
[16:47:05] Connecting to http://128.143.199.96:80/
[16:47:07] - Couldn't send HTTP request to server
[16:47:07] + Could not connect to Work Server (results)
[16:47:07]     (128.143.199.96:80)
[16:47:07] - Error: Could not transmit unit 06 (completed October 6) to work server.
[16:47:07] - 1 failed uploads of this unit.
[16:47:07]   Keeping unit 06 in queue.
[16:47:07] Trying to send all finished work units
[16:47:07] Project: 6997 (Run 0, Clone 96, Gen 371)


[16:47:07] + Attempting to send results [October 6 16:47:07 UTC]
[16:47:07] - Reading file work/wuresults_06.dat from core
[16:47:07]   (Read 3520108 bytes from disk)
[16:47:07] Connecting to http://128.143.199.96:8080/
[16:47:08] - Couldn't send HTTP request to server
[16:47:08] + Could not connect to Work Server (results)
[16:47:08]     (128.143.199.96:8080)
[16:47:08] + Retrying using alternative port
[16:47:08] Connecting to http://128.143.199.96:80/
[16:47:10] - Couldn't send HTTP request to server
[16:47:10] + Could not connect to Work Server (results)
[16:47:10]     (128.143.199.96:80)
[16:47:10] - Error: Could not transmit unit 06 (completed October 6) to work server.
[16:47:10] - 2 failed uploads of this unit.


[16:47:10] + Attempting to send results [October 6 16:47:10 UTC]
[16:47:10] - Reading file work/wuresults_06.dat from core
[16:47:10]   (Read 3520108 bytes from disk)
[16:47:10] Connecting to http://130.237.165.141:8080/
[16:48:35] Posted data.
[16:48:35] Initial: 0000; - Uploaded at ~40 kB/s
[16:48:35] - Averaged speed for that direction ~38 kB/s
[16:48:35] - Server does not have record of this unit. Will try again later.
[16:48:35]   Could not transmit unit 06 to Collection server; keeping in queue.
[16:48:35] + Sent 0 of 1 completed units to the server
[16:48:35] - Preparing to get new work unit...
[16:48:35] Cleaning up work directory
[16:48:35] + Attempting to get work packet
[16:48:35] Passkey found
[16:48:35] - Will indicate memory of 12279 MB
[16:48:35] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 12, Stepping: 0
[16:48:35] - Connecting to assignment server
[16:48:35] Connecting to http://assign.stanford.edu:8080/
[16:48:40] Posted data.
[16:48:40] Initial: 8F80; - Successful: assigned to (128.143.231.202).
[16:48:40] + News From Folding@Home: Welcome to Folding@Home
[16:48:40] Loaded queue successfully.
[16:48:40] Sent data
[16:48:40] Connecting to http://128.143.231.202:8080/
[16:48:42] Posted data.
[16:48:42] Initial: 0000; - Receiving payload (expected size: 3808051)
[16:49:00] - Downloaded at ~206 kB/s
[16:49:00] - Averaged speed for that direction ~203 kB/s
[16:49:00] + Received work.
[16:49:00] Trying to send all finished work units
[16:49:00] Project: 6997 (Run 0, Clone 96, Gen 371)


[16:49:00] + Attempting to send results [October 6 16:49:00 UTC]
[16:49:00] - Reading file work/wuresults_06.dat from core
[16:49:00]   (Read 3520108 bytes from disk)
[16:49:00] Connecting to http://128.143.199.96:8080/
[16:49:14] - Couldn't send HTTP request to server
[16:49:14] + Could not connect to Work Server (results)
[16:49:14]     (128.143.199.96:8080)
[16:49:14] + Retrying using alternative port
[16:49:14] Connecting to http://128.143.199.96:80/
[16:49:15] - Couldn't send HTTP request to server
[16:49:15] + Could not connect to Work Server (results)
[16:49:15]     (128.143.199.96:80)
[16:49:15] - Error: Could not transmit unit 06 (completed October 6) to work server.
[16:49:15] - 3 failed uploads of this unit.


[16:49:15] + Attempting to send results [October 6 16:49:15 UTC]
[16:49:15] - Reading file work/wuresults_06.dat from core
[16:49:15]   (Read 3520108 bytes from disk)
[16:49:15] Connecting to http://130.237.165.141:8080/
[16:50:40] Posted data.
[16:50:40] Initial: 0000; - Uploaded at ~40 kB/s
[16:50:40] - Averaged speed for that direction ~38 kB/s
[16:50:40] - Server does not have record of this unit. Will try again later.
[16:50:40]   Could not transmit unit 06 to Collection server; keeping in queue.
[16:50:40] + Sent 0 of 1 completed units to the server
[16:50:40] + Closed connections
[16:50:40] 
[16:50:40] + Processing work unit
[16:50:40] Core required: FahCore_a3.exe
[16:50:40] Core found.
[16:50:40] Working on queue slot 07 [October 6 16:50:40 UTC]
[16:50:40] + Working ...
[16:50:40] - Calling '.\FahCore_a3.exe -dir work/ -nice 19 -suffix 07 -np 24 -checkpoint 15 -verbose -lifeline 5536 -version 634'

[16:50:44] 
[16:50:44] *------------------------------*
[16:50:44] Folding@Home Gromacs SMP Core
[16:50:44] Version 2.27 (Dec. 15, 2010)
[16:50:44] 
[16:50:44] Preparing to commence simulation
[16:50:44] - Looking at optimizations...
[16:50:44] - Created dyn
[16:50:44] - Files status OK
[16:50:44] - Expanded 3807539 -> 4136808 (decompressed 108.6 percent)
[16:50:44] Called DecompressByteArray: compressed_data_size=3807539 data_size=4136808, decompressed_data_size=4136808 diff=0
[16:50:44] - Digital signature verified
[16:50:44] 
[16:50:44] Project: 6098 (Run 1, Clone 34, Gen 239)
[16:50:44] 
[16:50:44] Assembly optimizations on if available.
[16:50:44] Entering M.D.
[16:50:51] Mapping NT from 24 to 24 
[16:50:51] Completed 0 out of 500000 steps  (0%)
[16:56:25] Completed 5000 out of 500000 steps  (1%)
[17:02:00] Completed 10000 out of 500000 steps  (2%)
[17:07:34] Completed 15000 out of 500000 steps  (3%)
[17:13:10] Completed 20000 out of 500000 steps  (4%)
[17:18:47] Completed 25000 out of 500000 steps  (5%)
[17:24:21] Completed 30000 out of 500000 steps  (6%)
[17:29:54] Completed 35000 out of 500000 steps  (7%)
[17:36:10] Completed 40000 out of 500000 steps  (8%)
[17:41:45] Completed 45000 out of 500000 steps  (9%)
[17:47:19] Completed 50000 out of 500000 steps  (10%)
[17:52:55] Completed 55000 out of 500000 steps  (11%)
[17:58:31] Completed 60000 out of 500000 steps  (12%)
[18:04:04] Completed 65000 out of 500000 steps  (13%)
[18:09:38] Completed 70000 out of 500000 steps  (14%)
[18:15:11] Completed 75000 out of 500000 steps  (15%)
[18:20:43] Completed 80000 out of 500000 steps  (16%)
[18:26:14] Completed 85000 out of 500000 steps  (17%)
[18:32:25] Completed 90000 out of 500000 steps  (18%)
[18:38:05] Completed 95000 out of 500000 steps  (19%)
[18:43:44] Completed 100000 out of 500000 steps  (20%)
[18:49:20] Completed 105000 out of 500000 steps  (21%)
[18:54:56] Completed 110000 out of 500000 steps  (22%)
[19:00:30] Completed 115000 out of 500000 steps  (23%)
[19:06:05] Completed 120000 out of 500000 steps  (24%)
[19:11:39] Completed 125000 out of 500000 steps  (25%)
[19:17:15] Completed 130000 out of 500000 steps  (26%)
[19:22:51] Completed 135000 out of 500000 steps  (27%)
[19:28:27] Completed 140000 out of 500000 steps  (28%)
[19:30:55] - Autosending finished units... [October 6 19:30:55 UTC]
[19:30:55] Trying to send all finished work units
[19:30:55] Project: 6997 (Run 0, Clone 96, Gen 371)


[19:30:55] + Attempting to send results [October 6 19:30:55 UTC]
[19:30:55] - Reading file work/wuresults_06.dat from core
[19:30:55]   (Read 3520108 bytes from disk)
[19:30:55] Connecting to http://128.143.199.96:8080/
[19:30:56] - Couldn't send HTTP request to server
[19:30:56] + Could not connect to Work Server (results)
[19:30:56]     (128.143.199.96:8080)
[19:30:56] + Retrying using alternative port
[19:30:56] Connecting to http://128.143.199.96:80/
[19:30:58] - Couldn't send HTTP request to server
[19:30:58] + Could not connect to Work Server (results)
[19:30:58]     (128.143.199.96:80)
[19:30:58] - Error: Could not transmit unit 06 (completed October 6) to work server.
[19:30:58] - 4 failed uploads of this unit.


[19:30:58] + Attempting to send results [October 6 19:30:58 UTC]
[19:30:58] - Reading file work/wuresults_06.dat from core
[19:30:58]   (Read 3520108 bytes from disk)
[19:30:58] Connecting to http://130.237.165.141:8080/
[19:32:23] Posted data.
[19:32:23] Initial: 0000; - Uploaded at ~40 kB/s
[19:32:23] - Averaged speed for that direction ~38 kB/s
[19:32:23] - Server does not have record of this unit. Will try again later.
[19:32:23]   Could not transmit unit 06 to Collection server; keeping in queue.
[19:32:23] + Sent 0 of 1 completed units to the server
[19:32:23] - Autosend completed
[19:34:05] Completed 145000 out of 500000 steps  (29%)
[19:39:42] Completed 150000 out of 500000 steps  (30%)
[19:45:22] Completed 155000 out of 500000 steps  (31%)
[19:50:58] Completed 160000 out of 500000 steps  (32%)
[19:56:33] Completed 165000 out of 500000 steps  (33%)
[20:02:07] Completed 170000 out of 500000 steps  (34%)
[20:07:43] Completed 175000 out of 500000 steps  (35%)
[20:13:19] Completed 180000 out of 500000 steps  (36%)
[20:18:56] Completed 185000 out of 500000 steps  (37%)
[20:24:33] Completed 190000 out of 500000 steps  (38%)
[20:30:06] Completed 195000 out of 500000 steps  (39%)
[20:36:24] Completed 200000 out of 500000 steps  (40%)
[20:42:00] Completed 205000 out of 500000 steps  (41%)
[20:47:34] Completed 210000 out of 500000 steps  (42%)
[20:53:11] Completed 215000 out of 500000 steps  (43%)
[20:58:43] Completed 220000 out of 500000 steps  (44%)
[21:04:16] Completed 225000 out of 500000 steps  (45%)
[21:09:51] Completed 230000 out of 500000 steps  (46%)
[21:15:26] Completed 235000 out of 500000 steps  (47%)
[21:21:03] Completed 240000 out of 500000 steps  (48%)
[21:26:37] Completed 245000 out of 500000 steps  (49%)
[21:33:08] Completed 250000 out of 500000 steps  (50%)
[21:40:49] Completed 255000 out of 500000 steps  (51%)
[21:48:46] Completed 260000 out of 500000 steps  (52%)
Image
Tobit
Posts: 342
Joined: Thu Apr 17, 2008 2:35 pm
Location: Manchester, NH USA

Re: Problem with stats. [Lots of 0 point WUs]

Post by Tobit »

Nathan_P wrote:I have a similar problem on my v6.34 client, i've only just started getting theproblem though so my WU count is currently normal
Likely a different problem. The same WS is in play but two different problems.. very frustrating.
KMac
Posts: 31
Joined: Thu Feb 17, 2011 6:50 pm

Re: Problem with stats. [Lots of 0 point WUs]

Post by KMac »

Upon completion, the client goes into another neverending return loop.
If I stop and restart the client, it downloads a new WU, processes normally and then starts another return loop.
I am stopping all Windows SPM clients until resolved.

Code: Select all

[21:40:34] Completed 185000 out of 250000 steps  (74%)
[21:41:12] Completed 187500 out of 250000 steps  (75%)
[21:41:50] Completed 190000 out of 250000 steps  (76%)
[21:42:30] Completed 192500 out of 250000 steps  (77%)
[21:43:09] Completed 195000 out of 250000 steps  (78%)
[21:43:48] Completed 197500 out of 250000 steps  (79%)
[21:44:31] Completed 200000 out of 250000 steps  (80%)
[21:45:08] Completed 202500 out of 250000 steps  (81%)
[21:45:45] Completed 205000 out of 250000 steps  (82%)
[21:46:24] Completed 207500 out of 250000 steps  (83%)
[21:47:03] Completed 210000 out of 250000 steps  (84%)
[21:47:46] Completed 212500 out of 250000 steps  (85%)
[21:48:24] Completed 215000 out of 250000 steps  (86%)
[21:49:05] Completed 217500 out of 250000 steps  (87%)
[21:49:47] Completed 220000 out of 250000 steps  (88%)
[21:50:25] Completed 222500 out of 250000 steps  (89%)
[21:51:03] Completed 225000 out of 250000 steps  (90%)
[21:51:47] Completed 227500 out of 250000 steps  (91%)
[21:52:28] Completed 230000 out of 250000 steps  (92%)
[21:53:08] Completed 232500 out of 250000 steps  (93%)
[21:53:51] Completed 235000 out of 250000 steps  (94%)
[21:54:32] Completed 237500 out of 250000 steps  (95%)
[21:55:13] Completed 240000 out of 250000 steps  (96%)
[21:55:57] Completed 242500 out of 250000 steps  (97%)
[21:56:36] Completed 245000 out of 250000 steps  (98%)
[21:57:21] Completed 247500 out of 250000 steps  (99%)
[21:58:00] Completed 250000 out of 250000 steps  (100%)
[21:58:00] DynamicWrapper: Finished Work Unit: sleep=10000
[21:58:10] 
[21:58:10] Finished Work Unit:
[21:58:10] - Reading up to 769992 from "work/wudata_03.trr": Read 769992
[21:58:10] trr file hash check passed.
[21:58:10] - Reading up to 457012 from "work/wudata_03.xtc": Read 457012
[21:58:10] xtc file hash check passed.
[21:58:10] edr file hash check passed.
[21:58:10] logfile size: 26269
[21:58:10] Leaving Run
[21:58:14] - Writing 1258677 bytes of core data to disk...
[21:58:14] Done: 1258165 -> 1195168 (compressed to 94.9 percent)
[21:58:14]   ... Done.
[21:58:14] - Shutting down core
[21:58:14] 
[21:58:14] Folding@home Core Shutdown: FINISHED_UNIT
[21:58:18] CoreStatus = 64 (100)
[21:58:18] Unit 3 finished with 93 percent of time to deadline remaining.
[21:58:18] Updated performance fraction: 0.955077
[21:58:18] Sending work to server
[21:58:18] Project: 8001 (Run 105, Clone 110, Gen 26)


[21:58:18] + Attempting to send results [October 6 21:58:18 UTC]
[21:58:18] - Reading file work/wuresults_03.dat from core
[21:58:18]   (Read 1195680 bytes from disk)
[21:58:18] Connecting to http://171.67.108.58:8080/
[21:58:28] Posted data.
[21:58:28] Initial: 0000; - Uploaded at ~116 kB/s
[21:58:28] - Averaged speed for that direction ~79 kB/s
[21:58:28] + Results successfully sent
[21:58:28] Thank you for your contribution to Folding@Home.
[21:58:28] + Number of Units Completed: 202

[21:58:32] Trying to send all finished work units
[21:58:32] Project: 6995 (Run 0, Clone 38, Gen 399)


[21:58:32] + Attempting to send results [October 6 21:58:32 UTC]
[21:58:32] - Reading file work/wuresults_01.dat from core
[21:58:32]   (Read 3520864 bytes from disk)
[21:58:32] Connecting to http://128.143.199.96:8080/
[21:58:33] - Couldn't send HTTP request to server
[21:58:33] + Could not connect to Work Server (results)
[21:58:33]     (128.143.199.96:8080)
[21:58:33] + Retrying using alternative port
[21:58:33] Connecting to http://128.143.199.96:80/
[21:58:35] - Couldn't send HTTP request to server
[21:58:35] + Could not connect to Work Server (results)
[21:58:35]     (128.143.199.96:80)
[21:58:35] - Error: Could not transmit unit 01 (completed October 6) to work server.
[21:58:35] - 443 failed uploads of this unit.


[21:58:35] + Attempting to send results [October 6 21:58:35 UTC]
[21:58:35] - Reading file work/wuresults_01.dat from core
[21:58:35]   (Read 3520864 bytes from disk)
[21:58:35] Connecting to http://130.237.165.141:8080/
[21:59:13] Posted data.
[21:59:13] Initial: 0000; - Uploaded at ~90 kB/s
[21:59:13] - Averaged speed for that direction ~81 kB/s
[21:59:13] - Server does not have record of this unit. Will try again later.
[21:59:13]   Could not transmit unit 01 to Collection server; keeping in queue.
[21:59:13] + Sent 0 of 1 completed units to the server
[21:59:43] Trying to send all finished work units
[21:59:43] Project: 6995 (Run 0, Clone 38, Gen 399)


[21:59:43] + Attempting to send results [October 6 21:59:43 UTC]
[21:59:43] - Reading file work/wuresults_01.dat from core
[21:59:43]   (Read 3520864 bytes from disk)
[21:59:43] Connecting to http://128.143.199.96:8080/
[21:59:44] - Couldn't send HTTP request to server
[21:59:44] + Could not connect to Work Server (results)
[21:59:44]     (128.143.199.96:8080)
[21:59:44] + Retrying using alternative port
[21:59:44] Connecting to http://128.143.199.96:80/
[21:59:45] - Couldn't send HTTP request to server
[21:59:45] + Could not connect to Work Server (results)
[21:59:45]     (128.143.199.96:80)
[21:59:45] - Error: Could not transmit unit 01 (completed October 6) to work server.
[21:59:45] - 444 failed uploads of this unit.


[21:59:45] + Attempting to send results [October 6 21:59:45 UTC]
[21:59:45] - Reading file work/wuresults_01.dat from core
[21:59:45]   (Read 3520864 bytes from disk)
[21:59:45] Connecting to http://130.237.165.141:8080/
[22:00:22] Posted data.
[22:00:35] Killing all core threads
[22:00:35] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[22:00:35] ***** Got a SIGTERM signal (2)
[22:00:35] Killing all core threads
[22:00:35] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.


--- Opening Log file [October 6 22:01:09 UTC] 


# Windows SMP Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files (x86)\FAH\SMP
Executable: C:\Program Files (x86)\FAH\FAH.exe
Arguments: -oneunit -forceasm -smp -verbosity 9 

[22:01:09] - Ask before connecting: No
[22:01:09] - User name: KMac (Team 33)
[22:01:09] - User ID: 7F87BEF1323CE7CF
[22:01:09] - Machine ID: 2
[22:01:09] 
[22:01:09] Loaded queue successfully.
[22:01:09] - Preparing to get new work unit...
[22:01:09] - Autosending finished units... [October 6 22:01:09 UTC]
[22:01:09] Cleaning up work directory
[22:01:09] Trying to send all finished work units
[22:01:09] Project: 6995 (Run 0, Clone 38, Gen 399)
[22:01:09] + Attempting to get work packet
[22:01:09] Passkey found


[22:01:09] - Will indicate memory of 16359 MB
[22:01:09] + Attempting to send results [October 6 22:01:09 UTC]
[22:01:09] - Detect CPU.[22:01:09] - Reading file work/wuresults_01.dat from core
 Vendor: GenuineIntel, Family: 6, Model: 10, Stepping: 7
[22:01:09] - Connecting to assignment server
[22:01:09]   (Read 3520864 bytes from disk)
[22:01:09] Connecting to http://assign.stanford.edu:8080/
[22:01:09] Connecting to http://128.143.199.96:8080/
[22:01:10] Posted data.
[22:01:10] Initial: 43AB; - Successful: assigned to (171.67.108.58).
[22:01:10] + News From Folding@Home: Welcome to Folding@Home
[22:01:10] Loaded queue successfully.
[22:01:10] Sent data
[22:01:10] Connecting to http://171.67.108.58:8080/
[22:01:10] - Couldn't send HTTP request to server
[22:01:10] + Could not connect to Work Server (results)
[22:01:10]     (128.143.199.96:8080)
[22:01:10] + Retrying using alternative port
[22:01:10] Connecting to http://128.143.199.96:80/
[22:01:10] Posted data.
[22:01:10] Initial: 0000; - Receiving payload (expected size: 545127)
[22:01:11] - Downloaded at ~532 kB/s
[22:01:11] - Averaged speed for that direction ~614 kB/s
[22:01:11] + Received work.
[22:01:11] + Closed connections
[22:01:11] 
[22:01:11] + Processing work unit
[22:01:11] A4 will attempt to use 8 threads.
[22:01:11] Core required: FahCore_a4.exe
[22:01:11] Core found.
[22:01:11] Working on queue slot 04 [October 6 22:01:11 UTC]
[22:01:11] + Working ...
[22:01:11] - Calling '.\FahCore_a4.exe -dir work/ -nice 19 -suffix 04 -np 8 -nocpulock -checkpoint 3 -forceasm -verbose -lifeline 6636 -version 634'

[22:01:11] - Couldn't send HTTP request to server
[22:01:11] + Could not connect to Work Server (results)
[22:01:11]     (128.143.199.96:80)
[22:01:11] - Error: Could not transmit unit 01 (completed October 6) to work server.
[22:01:11] - 444 failed uploads of this unit.


[22:01:11] + Attempting to send results [October 6 22:01:11 UTC]
[22:01:11] - Reading file work/wuresults_01.dat from core
[22:01:11]   (Read 3520864 bytes from disk)
[22:01:11] Connecting to http://130.237.165.141:8080/
[22:01:11] 
[22:01:11] *------------------------------*
[22:01:11] Folding@Home Gromacs GB Core
[22:01:11] Version 2.27 (Dec. 15, 2010)
[22:01:11] 
[22:01:11] Preparing to commence simulation
[22:01:11] - Assembly optimizations manually forced on.
[22:01:11] - Not checking prior termination.
[22:01:11] - Expanded 544615 -> 1305600 (decompressed 239.7 percent)
[22:01:11] Called DecompressByteArray: compressed_data_size=544615 data_size=1305600, decompressed_data_size=1305600 diff=0
[22:01:11] - Digital signature verified
[22:01:11] 
[22:01:11] Project: 8001 (Run 62, Clone 37, Gen 220)
[22:01:11] 
[22:01:11] Assembly optimizations on if available.
[22:01:11] Entering M.D.
[22:01:18] Mapping NT from 8 to 8 
[22:01:18] Completed 0 out of 250000 steps  (0%)
[22:01:44] Killing all core threads
[22:01:44] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown at user request.
[22:01:44] ***** Got a SIGTERM signal (2)
[22:01:44] Killing all core threads
[22:01:44] Could not get process id information.  Please kill core process manually

Folding@Home Client Shutdown.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Problem with stats. [Lots of 0 point WUs]

Post by bruce »

Interesting.

A WU finishes an successfully uploads to 171.67.108.58.
A WU that failed to upload to 128.143.199.96 earlier is retried and fails again, keeping it in queue to be retried later.
The concept of "later" SHOULD start out with a short wait which grows progressively longer as the number of retries increases.
That concept may be different in V6 and V7 but I'm not sure.

Thanks for the report.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Problem with stats. [Lots of 0 point WUs]

Post by bruce »

I have been asked (in a Private Message) if it's worth noting that the project(s) that are not uploading are no longer listed on psummary. There's nothing private about that question so I'm replying here in a public forum.

Certainly.

You already know the Work Server has a problem since it is not accepting your upload. It's HIGHTLY likely that the same server is no longer distributing new WUs.

The Psummary pages are automatically generated from the projects that are currently being distributed (as of maybe an hour or two ago). They will appear on Psummary again when they're being distributed again.
Gary480six
Posts: 91
Joined: Mon Jan 21, 2008 6:42 pm

Re: Problem with stats. [Lots of 0 point WUs]

Post by Gary480six »

I am also one of those people who had a P6951 stuck in my work queue.. and was getting zero point "work units" recorded for that user.

I see that server 128.143.199.96 has been brought back on line and my P6951 work unit has finally been sent back. (around 4:00 am Eastern)

Will Stanford make an effort to correct the faulty data that the stats server collected?

Because of this problem, about 69 zero point "work units" were generated under the username Ella~B. (team 40098)

This is a single, 6-core system and a new username I created 16 days ago - so it's pretty easy to track the stats.

The P6951 finished Saturday morning, but the server was down - so it was put on hold in the queue. Which, as you said, would initially try often to return the work - extending the range each time until it was only trying once every six hours or so. However... with 128.143.199.96 being down, the PC began getting assigned P8001 type work units, which only take 1.5 hours for my system to complete. And each time Those work units finished, the client also tried to re-send the completed P6951.

And as others have said, each time an attempt was made to send that P6951 work unit back to Stanford, one or more zero point "work unit" credits were generated.

Windows 7 Ultimate 64-bit with Client 6.34 on an AMD Phenom II X6 1090T Processor
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Problem with stats. [Lots of 0 point WUs]

Post by 7im »

Not likely, no, according to policy.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Re: Problem with stats. [Lots of 0 point WUs]

Post by P5-133XL »

They know about the issue. You need to wait to see what PG says.
Image
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: Problem with stats. [Lots of 0 point WUs]

Post by kasson »

I don't want to make any promises yet, but we're looking at it. I'm coordinating with both the WS maintainer and the stats team to investigate.
Post Reply