aws1 and aws2.foldingathome.org both not responding

Moderators: Site Moderators, FAHC Science Team

aws1 and aws2.foldingathome.org both not responding

Postby tbonse » Fri May 01, 2020 1:40 am

Both of these servers (3.133.76.19 and 3.21.157.11 show an uptime of 1 hour, but are not accepting communication to upload completed jobs.

Code: Select all
******************************* Date: 2020-05-01 *******************************
00:32:33:WU01:FS02:0x22:Completed 7840000 out of 8000000 steps (98%)
00:33:35:WU01:FS02:0x22:Completed 7920000 out of 8000000 steps (99%)
00:34:36:WU01:FS02:0x22:Completed 8000000 out of 8000000 steps (100%)
00:34:37:WU01:FS02:0x22:Saving result file ..\logfile_01.txt
00:34:37:WU01:FS02:0x22:Saving result file checkpointState.xml
00:34:37:WU01:FS02:0x22:Saving result file checkpt.crc
00:34:37:WU01:FS02:0x22:Saving result file positions.xtc
00:34:38:WU01:FS02:0x22:Saving result file science.log
00:34:38:WU01:FS02:0x22:Folding@home Core Shutdown: FINISHED_UNIT
00:34:39:WU01:FS02:FahCore returned: FINISHED_UNIT (100 = 0x64)
00:34:39:WU01:FS02:Sending unit results: id:01 state:SEND error:NO_ERROR project:16433 run:1408 clone:3 gen:3 core:0x22 unit:0x0000000803854c135e9a4efe84a09c2f
00:34:39:WU01:FS02:Uploading 59.00MiB to 3.133.76.19
00:34:39:WU01:FS02:Connecting to 3.133.76.19:8080
00:35:00:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
00:35:00:WU01:FS02:Connecting to 3.133.76.19:80
00:35:22:WARNING:WU01:FS02:Exception: Failed to send results to work server: Failed to connect to 3.133.76.19:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
00:35:22:WU01:FS02:Trying to send results to collection server
00:35:22:WU01:FS02:Uploading 59.00MiB to 3.21.157.11
00:35:22:WU01:FS02:Connecting to 3.21.157.11:8080
00:35:43:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
00:35:43:WU01:FS02:Connecting to 3.21.157.11:80
00:36:05:ERROR:WU01:FS02:Exception: Failed to connect to 3.21.157.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
00:36:05:WU01:FS02:Sending unit results: id:01 state:SEND error:NO_ERROR project:16433 run:1408 clone:3 gen:3 core:0x22 unit:0x0000000803854c135e9a4efe84a09c2f
00:36:05:WU01:FS02:Uploading 59.00MiB to 3.133.76.19
00:36:05:WU01:FS02:Connecting to 3.133.76.19:8080
00:36:26:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
00:36:26:WU01:FS02:Connecting to 3.133.76.19:80
00:36:48:WARNING:WU01:FS02:Exception: Failed to send results to work server: Failed to connect to 3.133.76.19:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
00:36:48:WU01:FS02:Trying to send results to collection server
00:36:48:WU01:FS02:Uploading 59.00MiB to 3.21.157.11
00:36:48:WU01:FS02:Connecting to 3.21.157.11:8080
00:37:09:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
00:37:09:WU01:FS02:Connecting to 3.21.157.11:80
00:37:31:ERROR:WU01:FS02:Exception: Failed to connect to 3.21.157.11:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
00:37:31:WU01:FS02:Sending unit results: id:01 state:SEND error:NO_ERROR project:16433 run:1408 clone:3 gen:3 core:0x22 unit:0x0000000803854c135e9a4efe84a09c2f
00:37:31:WU01:FS02:Uploading 59.00MiB to 3.133.76.19
00:37:31:WU01:FS02:Connecting to 3.133.76.19:8080
00:37:52:WARNING:WU01:FS02:WorkServer connection failed on port 8080 trying 80
tbonse
 
Posts: 25
Joined: Wed Apr 15, 2020 11:32 am

Re: aws1 and aws2.foldingathome.org both not responding

Postby tbonse » Fri May 01, 2020 1:43 am

Update:

3.133.76.19:80 did finally start responding.

I believe the assessment from a prior thread about these cloud servers being poorly suited to the task of supporting the FAH work was very apt. It seems that just about every time there is a server malfunctioning, it is either an Azure or AWS server.
tbonse
 
Posts: 25
Joined: Wed Apr 15, 2020 11:32 am

Re: aws1 and aws2.foldingathome.org both not responding

Postby ChristianVirtual » Fri May 01, 2020 3:55 am

aws2 still not collaborative

Code: Select all
02:51:52:ERROR:WU00:FS01:Exception: Failed to connect to 3.21.157.11:80: Connection timed out
ImageImage
Please contribute your logs to http://ppd.fahmm.net
User avatar
ChristianVirtual
 
Posts: 1596
Joined: Tue May 28, 2013 1:14 pm
Location: Tokyo

Re: aws1 and aws2.foldingathome.org both not responding

Postby Crunchtimer » Tue May 05, 2020 6:58 am

Hi!
I believe I've had all of the aforementioned problems with AWS GPU server and it seems all be down to GPU driver issues for me.
Getting CPU-only-AWS-servers crunching, I never had problems only following a guide on Medium on EC2 and Folding@home by Julien Simon, installing Fahclient and Fahcontrol + config.xml-file; using the download links on the Faolding@home support-page for Linux manual installtion.

However getting GPU going for a g4dn.xlarge was something else. I finally follow the exect steps mentioned in one of the responses to the guide linked above, and got it working with an Amazon LInux 2. It's a huge difference running GPU.

Now my only problem is that AWS are not responding to my 'Limit Increase: EC2 Instances' request for addition G4 vCPU limit.
The first time it only took a couple of hours to get the increase of 4 vCPU, but now I've waited +24hours with nothing but the initial automatic reply.

Good luck everyone!
Crunchtimer
 
Posts: 45
Joined: Tue May 05, 2020 6:34 am

Re: aws1 and aws2.foldingathome.org both not responding

Postby anandhanju » Tue May 05, 2020 10:53 am

Hi Crunchtimer, welcome to Folding and to the forum.

The issues being discussed here in this topic relate to the two Folding@home work servers hosted on AWS, which were having connection issues when assigning work or accepting results. This doesn't involve running the Folding client (CPU or GPU) on AWS virtual machines, which I believe is what you're doing.
anandhanju
 
Posts: 516
Joined: Mon Dec 03, 2007 5:33 am
Location: Australia

Re: aws1 and aws2.foldingathome.org both not responding

Postby Crunchtimer » Tue May 05, 2020 6:20 pm

Hi, yes you're right I didn't check close enough.
However, you will get the same error message even though the assignment servers are up due to wrong GPU-drivers.

I guess I jinxed everything posting here as my GPU has been unable to get assignments all day :(

Code: Select all
17:07:11:WU00:FS01:Connecting to 128.252.203.10:8080
17:07:26:WU00:FS01:Downloading 50.73MiB
17:08:00:WU00:FS01:Download 0.74%
17:08:47:WU00:FS01:Connecting to 65.254.110.245:80
17:08:48:WARNING:WU00:FS01:Failed to get assignment from '65.254.110.245:80': Empty work server assignment
17:08:48:WU00:FS01:Connecting to 18.218.241.186:80
17:08:48:WARNING:WU00:FS01:Failed to get assignment from '18.218.241.186:80': Empty work server assignment
17:08:48:ERROR:WU00:FS01:Exception: Could not get an assignment


One of the server are not even in the serverstats list.
What to do? Wait?

So setting up a GPU server today for the first time musn't be easy ......
Crunchtimer
 
Posts: 45
Joined: Tue May 05, 2020 6:34 am

Re: aws1 and aws2.foldingathome.org both not responding

Postby Neil-B » Tue May 05, 2020 8:16 pm

Crunchtimer wrote:One of the server are not even in the serverstats list.


It is - but just under a different guise :) ... It is an Assignment Server see … https://foldingforum.org/viewtopic.php?f=18&t=34034&p=323083&hilit=assignment+server+private+ip#p323085
1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent, Quadro K420 1GB, FAH 7.6.13
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro, Quadro M1000M 2GB, FAH 7.6.13
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro, GTX 750Ti 2GB, FAH 7.6.13
Neil-B
 
Posts: 1342
Joined: Sun Mar 22, 2020 6:52 pm
Location: UK

Re: aws1 and aws2.foldingathome.org both not responding

Postby Crunchtimer » Tue May 05, 2020 8:38 pm

Neil-B wrote:
Crunchtimer wrote:One of the server are not even in the serverstats list.


It is - but just under a different guise :) ... It is an Assignment Server see … https://foldingforum.org/viewtopic.php?f=18&t=34034&p=323083&hilit=assignment+server+private+ip#p323085


Ah, I see thanks! Well now I've upgraded from 7.4.4. to 7.6.13 and then it uses FQDN assign1-4.foldingathome.org:80 instead of ip.
Still Failed to get an assignment until I killed the process for FahCore_a7 as it wouldn´t let me control it anymore.
A reboot later and GPU working again, magic!
Crunchtimer
 
Posts: 45
Joined: Tue May 05, 2020 6:34 am

Can't send result - 3.133.76.19

Postby ppbering » Wed May 20, 2020 4:58 pm

Hi,
New french guy here, and I have an issue trying to send result to the 3.133.76.19 server.
Tried opening the adress trhu firefox and no result neither.
Is there a problem with this server ?

Thanks

Here are the logs :
Code: Select all
15:48:26:WU00:FS01:Upload 1.44%
15:48:26:WARNING:WU00:FS01:Exception: Failed to send results to work server: Transfer failed
15:54:04:WU00:FS01:Sending unit results: id:00 state:SEND error:NO_ERROR project:14439 run:0 clone:1858 gen:24 core:0x22 unit:0x0000002803854c135ea0a3014b7d5a75
15:54:04:WU00:FS01:Uploading 78.07MiB to 3.133.76.19
15:54:04:WU00:FS01:Connecting to 3.133.76.19:8080
15:54:25:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
15:54:25:WU00:FS01:Connecting to 3.133.76.19:80
15:54:46:WARNING:WU00:FS01:Exception: Failed to send results to work server: Failed to connect to 3.133.76.19:80: Une tentative de connexion a échoué car le parti connecté n’a pas répondu convenablement au-delà d’une certaine durée ou une connexion établie a échoué car l’hôte de connexion n’a pas répondu.


Regards
ppbering
 
Posts: 2
Joined: Wed May 20, 2020 4:52 pm

Re: Can't send result - 3.133.76.19

Postby ppbering » Wed May 20, 2020 6:30 pm

You can close this topic because for my client it's OK, the upload is complete.
Thanks guys.
ppbering
 
Posts: 2
Joined: Wed May 20, 2020 4:52 pm

Re: Can't send result - 3.133.76.19

Postby PantherX » Wed May 20, 2020 8:25 pm

Welcome to the F@H Forum ppbering,

I am glad that the issue is resolved for you. Please note that for future reference, when a F@H Server is under load, it might not accept new connections but your client will try to send the WU later so you can leave your client running. If after several hours, the issue isn't resolved, then you can search the Forum to see if there are recent posts or not about the Server in question.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
User avatar
PantherX
Site Moderator
 
Posts: 6588
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud


Return to Issues with a specific server

Who is online

Users browsing this forum: ipkh and 0 guests

cron