155.247.166.220 error uploading WU

Moderators: Site Moderators, FAHC Science Team

jinjonBoo
Posts: 8
Joined: Thu Jan 17, 2013 4:30 am

155.247.166.220 error uploading WU

Post by jinjonBoo »

hi there.

FAHControl shows "Collection Server" as 0.0.0.0.
After WU is finished, it shows this error:

"15:06:29:WU00:FS00:Uploading 25.70MiB to 155.247.166.220
15:06:29:WU00:FS00:Connecting to 155.247.166.220:8080
15:06:30:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
15:06:30:WU00:FS00:Connecting to 155.247.166.220:80
15:06:51:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.220:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond."

Any ideas?

Thanks
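
The log above shows the client trying port 8080 and then falling back to port 80. A minimal Python sketch of the same kind of check, which may help tell a local firewall problem apart from a server outage. The function name `first_reachable_port` is hypothetical, and this is not how the FAH client itself is implemented:

```python
import socket


def first_reachable_port(host, ports=(8080, 80), timeout=5.0):
    """Return the first port that accepts a TCP connection, or None.

    Ports are tried in order, mirroring the 8080-then-80 fallback
    visible in the client log. This only tests TCP reachability; it
    does not speak the work server's upload protocol.
    """
    for port in ports:
        try:
            # create_connection raises OSError (including timeouts)
            # when the host does not answer on this port.
            with socket.create_connection((host, port), timeout=timeout):
                return port
        except OSError:
            continue
    return None


# Example call (performs real network I/O, so it is left commented out):
# first_reachable_port("155.247.166.220")
```

If this returns `None` for the work server but succeeds for other hosts, the problem is more likely on the server side than on the local machine.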
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: error uploading WU

Post by bruce »

Yes, that server is having problems right now. The client should retry until the server is fixed.

What project was that?
(The owner of the project needs to add a setting for the Collection Server for a long-term fix.)
parkut
Posts: 364
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Re: 155.247.166.220 error uploading WU

Post by parkut »

I have two WUs stuck on upload: Failed to connect to 155.247.166.220

Machine-107 .. project:13795 run:18 clone:12 gen:36
Machine-104 .. project:14004 run:0 clone:5 gen:34

Both show Collection server 0.0.0.0
jinjonBoo
Posts: 8
Joined: Thu Jan 17, 2013 4:30 am

Re: error uploading WU

Post by jinjonBoo »

bruce wrote:Yes, that server is having problems right now. The client should retry until the server is fixed.

What project was that?
(The owner of the project needs to add a setting for the Collection Server for a long-term fix.)

Project seems to be 13797; here is the log:

Code:

14:21:01:WU00:FS00:0xa7:Completed 2500000 out of 2500000 steps (100%)
14:21:02:WU00:FS00:0xa7:Saving result file ..\logfile_01.txt
14:21:02:WU00:FS00:0xa7:Saving result file ener.edr
14:21:02:WU00:FS00:0xa7:Saving result file frame0.trr
14:21:05:WU00:FS00:0xa7:Saving result file md.log
14:21:05:WU00:FS00:0xa7:Saving result file pullf.xvg
14:21:05:WU00:FS00:0xa7:Saving result file pullx.xvg
14:21:05:WU00:FS00:0xa7:Saving result file science.log
14:21:05:WU00:FS00:0xa7:Saving result file traj_comp.xtc
14:21:05:WU00:FS00:0xa7:Folding@home Core Shutdown: FINISHED_UNIT
14:21:06:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
14:21:06:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:13797 run:17 clone:18 gen:0 core:0xa7 unit:0x000000000002894c5a13683b0c19086c
14:21:06:WU00:FS00:Uploading 25.70MiB to 155.247.166.220
14:21:06:WU00:FS00:Connecting to 155.247.166.220:8080
14:21:07:WARNING:WU00:FS00:WorkServer connection failed on port 8080 trying 80
14:21:07:WU00:FS00:Connecting to 155.247.166.220:80
14:21:28:WARNING:WU00:FS00:Exception: Failed to send results to work server: Failed to connect to 155.247.166.220:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
14:21:28:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:13797 run:17 clone:18 gen:0 core:0xa7 unit:0x000000000002894c5a13683b0c19086c
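
When reporting a stuck WU, the project/run/clone/gen values are what the moderators ask for. A small Python sketch that pulls them out of a "Sending unit results" line like the one above. The helper name `parse_prcg` is hypothetical, and the pattern is based only on the log lines shown in this thread:

```python
import re

# Matches the "project:N run:N clone:N gen:N" part of a client log line.
PRCG_RE = re.compile(
    r"project:(?P<project>\d+) run:(?P<run>\d+) "
    r"clone:(?P<clone>\d+) gen:(?P<gen>\d+)"
)


def parse_prcg(line):
    """Return the PRCG values as a dict of ints, or None if absent."""
    match = PRCG_RE.search(line)
    if match is None:
        return None
    return {name: int(value) for name, value in match.groupdict().items()}


line = (
    "14:21:06:WU00:FS00:Sending unit results: id:00 state:SEND "
    "error:NO_ERROR project:13797 run:17 clone:18 gen:0 core:0xa7 "
    "unit:0x000000000002894c5a13683b0c19086c"
)
print(parse_prcg(line))
# prints {'project': 13797, 'run': 17, 'clone': 18, 'gen': 0}
```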
jinjonBoo
Posts: 8
Joined: Thu Jan 17, 2013 4:30 am

Re: 155.247.166.220 error uploading WU

Post by jinjonBoo »

still failing to upload the data.... any feedback on what's going on?
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 155.247.166.220 error uploading WU

Post by bruce »

The Voelz Lab at Temple U. has had more than its share of troubles. When something gets fixed, it often breaks again. I don't have enough information to figure out what the problem is, but it's not uncommon for the campus networking security folks to have different ideas about what needs to be done than the scientists actually running the servers. The red tape required to resolve such issues can be very complex.

Right now, that server is listed as DOWN. Since your WU was assigned without a Collection Server, the WU has to wait until it can get through to that particular server. We can hope that it's down temporarily to make some permanent change that will improve its apparent unreliability, but I don't have any facts, either way.
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 155.247.166.220 error uploading WU

Post by Joe_H »

Is this still the same WU you reported issues with uploading on Wednesday? That server was brought back up later in the day on Wednesday, and accepted thousands of WU's from that time until it went down Saturday morning.

So if it is the same WU, then you have some problem in addition to the WS being down.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
jinjonBoo
Posts: 8
Joined: Thu Jan 17, 2013 4:30 am

Re: 155.247.166.220 error uploading WU

Post by jinjonBoo »

hey all, i don't keep the FAH Client running 24/7, so i wouldn't know if the server was up.
i tried today and i managed to upload the WU, although it showed "WARNING:WU02:FS01:Past final deadline 2018-02-12T15:05:08Z, dumping", hope my folding didn't go to waste :):

Thanks for the help!
Regards

<code>
14:06:58:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:13797 run:17 clone:18 gen:0 core:0xa7 unit:0x000000000002894c5a13683b0c19086c
14:06:58:WARNING:WU02:FS01:Past final deadline 2018-02-12T15:05:08Z, dumping
14:06:58:WU00:FS00:Uploading 25.70MiB to 155.247.166.220
14:06:58:WU00:FS00:Connecting to 155.247.166.220:8080
14:06:58:WU02:FS01:Cleaning up
14:07:04:WU00:FS00:Upload 8.76%
14:07:10:WU00:FS00:Upload 16.54%
14:07:16:WU00:FS00:Upload 24.08%
14:07:22:WU00:FS00:Upload 35.02%
14:07:28:WU00:FS00:Upload 43.05%
14:07:34:WU00:FS00:Upload 50.83%
14:07:40:WU00:FS00:Upload 60.32%
14:07:46:WU00:FS00:Upload 74.67%
14:07:52:WU00:FS00:Upload 84.40%
14:07:58:WU00:FS00:Upload 97.53%
14:08:02:WU00:FS00:Upload complete
14:08:02:WU00:FS00:Server responded WORK_ACK (400)
14:08:02:WU00:FS00:Final credit estimate, 2214.00 points
14:08:02:WU00:FS00:Cleaning up
</code>
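
For anyone curious about the transfer speed, the timestamps in the log above are enough for a rough estimate. A short Python sketch, assuming the upload starts at the "Uploading" line and ends at "Upload complete", and that both fall on the same day (no midnight wrap is handled). The helper name `seconds_between` is hypothetical:

```python
from datetime import datetime


def seconds_between(start, end, fmt="%H:%M:%S"):
    """Elapsed seconds between two same-day log timestamps."""
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return delta.total_seconds()


size_mib = 25.70  # from "Uploading 25.70MiB to 155.247.166.220"
elapsed = seconds_between("14:06:58", "14:08:02")
rate = size_mib / elapsed
print(f"{elapsed:.0f} s, {rate:.2f} MiB/s")
# prints 64 s, 0.40 MiB/s
```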
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 155.247.166.220 error uploading WU

Post by bruce »

jinjonBoo wrote:i tried today and i managed to upload the WU, although it showed "WARNING:WU02:FS01:Past final deadline 2018-02-12T15:05:08Z, dumping", hope my folding didn't go to waste :):

It depends on your definition of "going to waste".

From a points perspective, you did receive a small credit, but the points always decrease when you hold on to a WU longer than necessary.

From a scientific perspective, a WU that is returned after the deadline is rarely useful.

FAH generates a new WU whenever a WU is completed, and to guarantee that progress is being made, the deadlines are strictly enforced. When a WU passes the Preferred Deadline, it's assumed to be lost, so it is duplicated and reassigned to someone else who will (hopefully) complete it in a reasonable amount of time. They earn full credit for the WU. If, as in your case, the WU is returned later, partial credit may or may not be awarded.

Here are the particulars on your WU. In this case, a completed copy of the WU had already been returned by another donor on 2017-11-05, so the analysis of that trajectory could be continued. When your copy arrived on 2018-02-15, it was no longer needed, so it was discarded.

Code:

Hi xxxxxx (team yyyy),  Days taken to complete WU: 0.17
Your WU (P13797 R17 C18 G0) was added to the stats database on 2017-11-05 03:06:15 for 16892.8 points of credit.

Hi jinjonBoo (team 223650),  Days taken to complete WU: 9.06
Your WU (P13797 R17 C18 G0) was added to the stats database on 2018-02-15 06:18:17 for 2214.3 points of credit.

For those who shut down their computer for long periods of time, we recommend that you set the client to FINISH the active WU(s) and wait until that process can be completed before shutting down.
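
To put numbers on the figures above, here is a small Python sketch that works out how late a return was relative to a deadline and what fraction of the on-time credit a late return earned. The function names are hypothetical. The deadline shown comes from a warning logged for a different slot (WU02) in the same log and is used purely as an example date, and treating the stats-database timestamp as UTC is an assumption:

```python
from datetime import datetime, timezone


def lateness(deadline, returned):
    """Time elapsed between a deadline and the moment of return."""
    return returned - deadline


def credit_fraction(late_points, full_points):
    """Share of the on-time credit that a late return received."""
    return late_points / full_points


# "Past final deadline 2018-02-12T15:05:08Z" (logged for WU02).
deadline = datetime(2018, 2, 12, 15, 5, 8, tzinfo=timezone.utc)
# Stats entry "2018-02-15 06:18:17"; UTC is assumed here.
returned = datetime(2018, 2, 15, 6, 18, 17, tzinfo=timezone.utc)

late_by = lateness(deadline, returned)
fraction = credit_fraction(2214.3, 16892.8)
print(late_by.days, f"{fraction:.1%}")
# prints 2 13.1%
```

The roughly 13% figure is just the ratio of the two credits in the stats output; it is not a statement of how the points formula works.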