failure to upload 206.223.170.146

Moderators: Site Moderators, FAHC Science Team

failure to upload 206.223.170.146

Postby verdeva » Fri Nov 20, 2020 8:19 pm

Things have been going great for the past couple of weeks, but this just cropped up today:

Code: Select all
18:47:17:WU01:FS00:0xa7:Completed 242500 out of 250000 steps (97%)
18:48:04:WU01:FS00:0xa7:Completed 245000 out of 250000 steps (98%)
18:48:50:WU01:FS00:0xa7:Completed 247500 out of 250000 steps (99%)
18:49:37:WU01:FS00:0xa7:Completed 250000 out of 250000 steps (100%)
18:49:38:WU01:FS00:0xa7:Saving result file ../logfile_01.txt
18:49:38:WU01:FS00:0xa7:Saving result file frame208.trr
18:49:38:WU01:FS00:0xa7:Saving result file frame208.xtc
18:49:38:WU01:FS00:0xa7:Saving result file md.log
18:49:38:WU01:FS00:0xa7:Saving result file science.log
18:49:38:WU01:FS00:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
18:49:38:WU01:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
18:49:38:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14255 run:0 clone:2047 gen:208 core:0xa7 unit:0x000000f3cedfaa9200000000000007ff
18:49:38:WU01:FS00:Uploading 2.85MiB to 206.223.170.146
18:49:38:WU01:FS00:Connecting to 206.223.170.146:8080
18:51:48:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
18:51:48:WU01:FS00:Connecting to 206.223.170.146:80
18:53:59:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 206.223.170.146:80: Connection timed out
18:53:59:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14255 run:0 clone:2047 gen:208 core:0xa7 unit:0x000000f3cedfaa9200000000000007ff
18:53:59:WU01:FS00:Uploading 2.85MiB to 206.223.170.146
18:53:59:WU01:FS00:Connecting to 206.223.170.146:8080
18:56:10:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
18:56:10:WU01:FS00:Connecting to 206.223.170.146:80
18:58:21:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 206.223.170.146:80: Connection timed out
18:58:21:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14255 run:0 clone:2047 gen:208 core:0xa7 unit:0x000000f3cedfaa9200000000000007ff
18:58:21:WU01:FS00:Uploading 2.85MiB to 206.223.170.146
18:58:21:WU01:FS00:Connecting to 206.223.170.146:8080
19:00:32:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
19:00:32:WU01:FS00:Connecting to 206.223.170.146:80
19:02:43:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 206.223.170.146:80: Connection timed out
19:02:44:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14255 run:0 clone:2047 gen:208 core:0xa7 unit:0x000000f3cedfaa9200000000000007ff
19:02:44:WU01:FS00:Uploading 2.85MiB to 206.223.170.146
19:02:44:WU01:FS00:Connecting to 206.223.170.146:8080
19:04:54:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
19:04:54:WU01:FS00:Connecting to 206.223.170.146:80
19:07:05:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 206.223.170.146:80: Connection timed out
19:07:06:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14255 run:0 clone:2047 gen:208 core:0xa7 unit:0x000000f3cedfaa9200000000000007ff
19:07:06:WU01:FS00:Uploading 2.85MiB to 206.223.170.146
19:07:06:WU01:FS00:Connecting to 206.223.170.146:8080
19:09:16:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
19:09:16:WU01:FS00:Connecting to 206.223.170.146:80
19:11:28:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 206.223.170.146:80: Connection timed out
19:11:28:WU01:FS00:Sending unit results: id:01 state:SEND error:NO_ERROR project:14255 run:0 clone:2047 gen:208 core:0xa7 unit:0x000000f3cedfaa9200000000000007ff
19:11:28:WU01:FS00:Uploading 2.85MiB to 206.223.170.146
19:11:28:WU01:FS00:Connecting to 206.223.170.146:8080
19:13:39:WARNING:WU01:FS00:WorkServer connection failed on port 8080 trying 80
19:13:39:WU01:FS00:Connecting to 206.223.170.146:80
19:15:50:WARNING:WU01:FS00:Exception: Failed to send results to work server: Failed to connect to 206.223.170.146:80: Connection timed out
verdeva
 
Posts: 21
Joined: Mon Dec 03, 2007 2:40 pm
Location: Seattle, WA

Re: failure to upload 206.223.170.146

Postby Neil-B » Fri Nov 20, 2020 8:24 pm

That server is currently showing as down on server status page ... the client will periodically try to upload the wu results until the server is up again and the upload can be recieved
1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro, Quadro M1000M 2GB, FAH 7.6.21
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro, GTX 750Ti 2GB, FAH 7.6.21
Neil-B
 
Posts: 1503
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: failure to upload 206.223.170.146

Postby Badsinger » Sat Nov 21, 2020 3:37 am

Its been 24 attempts now to upload for me. Out of curiosity, should a second server fail, as the gpu continues to fold, will it save both tasks or is one lost?
Badsinger
 
Posts: 6
Joined: Tue May 19, 2020 10:01 am

Re: failure to upload 206.223.170.146

Postby Neil-B » Sat Nov 21, 2020 10:49 am

It will save both WU .. it will keep retrying until either it uploads or until the expiration deadline is reached at which point the client will dump the wu .. hopefully the server will be up and receiving before then but it kind of depends why the server is down and it is the weekend so fixes tend to be slower
Neil-B
 
Posts: 1503
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: failure to upload 206.223.170.146

Postby psaam0001 » Sat Nov 21, 2020 3:49 pm

I also have a WU needing to be returned as completed, but I'm not going to sweat it.

Stay calm... Keep folding!!!

Paul
psaam0001
 
Posts: 154
Joined: Mon May 18, 2020 3:02 am

Re: failure to upload 206.223.170.146

Postby Neil-B » Sat Nov 21, 2020 3:54 pm

I have seen a post on discord which confirms the server is under maintenance .. it is simply a matter of waiting .. there is nothing you can do to get it to upload until the server is back up
Neil-B
 
Posts: 1503
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: failure to upload 206.223.170.146

Postby MoelTryfan » Sat Nov 21, 2020 4:29 pm

They'd better get a move on. My WU expires at 1826 today.
MoelTryfan
 
Posts: 10
Joined: Sun Apr 19, 2020 12:00 pm

Re: failure to upload 206.223.170.146

Postby Gnomuz » Sat Nov 21, 2020 6:22 pm

Same here with a WU (Project: 14254 (Run 0, Clone 2459, Gen 172)) finished yesterday at 11/20/2020 19:11. F@H has been retrying to upload the results for more 23 hours now.
Folding keeps on running and uploading to other servers, so let's be patient. But I admit it's easier for me as the expiration is on 11/27/2020 18:17, which gives them more time to fix the issue than MoelTryfan :wink:
Gnomuz
 
Posts: 4
Joined: Sat Nov 21, 2020 6:07 pm

Re: failure to upload 206.223.170.146

Postby ViTe » Sat Nov 21, 2020 6:51 pm

I have the same issue and my WU pretty close to time out. I think to have a different collection server should be mandatory.
ViTe
 
Posts: 20
Joined: Tue Feb 14, 2012 3:22 am

Re: failure to upload 206.223.170.146

Postby Neil-B » Sat Nov 21, 2020 7:19 pm

It doesn't always work that way and isnt always possible or actually the best thing for the science .. reaching timeout isnt when the client dumps wu it is the expiration deadline .. CS can sometimes cause more issues than benefits
Neil-B
 
Posts: 1503
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: failure to upload 206.223.170.146

Postby PFM » Sat Nov 21, 2020 9:33 pm

Any ETA on this ?
uploads failing for me too on this server....
PFM
 
Posts: 33
Joined: Sat Jan 03, 2009 4:14 am
Location: Bay Area, USA

Re: failure to upload 206.223.170.146

Postby Neil-B » Sat Nov 21, 2020 9:52 pm

unfortunately fah doesn't tend to do etas .. rest assured they will get it up as quick as they can .. but it is the weekend and that does slow things down especially with the current pandemic .. the researchers will be even more gutted than the folders - but sometimes these things just happen
Neil-B
 
Posts: 1503
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: failure to upload 206.223.170.146

Postby tmccarty729 » Sat Nov 21, 2020 11:33 pm

Stuck sending to 170.146 also.
tmccarty729
 
Posts: 1
Joined: Sat Nov 21, 2020 11:31 pm

Re: failure to upload 206.223.170.146

Postby Neil-B » Sun Nov 22, 2020 10:39 am

Server still under maintenance
Neil-B
 
Posts: 1503
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: failure to upload 206.223.170.146

Postby MoelTryfan » Sun Nov 22, 2020 4:35 pm

Still down at 1535 UTC
MoelTryfan
 
Posts: 10
Joined: Sun Apr 19, 2020 12:00 pm

Next

Return to Issues with a specific server

Who is online

Users browsing this forum: No registered users and 2 guests

cron