Is plfah1-1.mskcc.org (140.163.4.231) down?

Moderators: Site Moderators, FAHC Science Team

Post Reply
bronozoj
Posts: 6
Joined: Sun Mar 15, 2020 2:23 pm

Is plfah1-1.mskcc.org (140.163.4.231) down?

Post by bronozoj »

Code: Select all

12:29:42:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11748 run:0 clone:5487 gen:10 core:0x22 unit:0x0000001a8ca304e75e6bafe323dd07eb
12:29:42:WU01:FS01:Uploading 12.58MiB to 140.163.4.231
12:29:42:WU01:FS01:Connecting to 140.163.4.231:8080
12:30:04:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
12:30:04:WU01:FS01:Connecting to 140.163.4.231:80
12:30:19:WU01:FS01:Upload 0.50%
12:31:01:WU01:FS01:Upload 0.99%
12:31:01:WARNING:WU01:FS01:Exception: Failed to send results to work server: Transfer failed
12:31:19:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11748 run:0 clone:5487 gen:10 core:0x22 unit:0x0000001a8ca304e75e6bafe323dd07eb
12:31:19:WU01:FS01:Uploading 12.58MiB to 140.163.4.231
12:31:19:WU01:FS01:Connecting to 140.163.4.231:8080
12:32:02:WU01:FS01:Upload 0.99%
12:32:02:WARNING:WU01:FS01:Exception: Failed to send results to work server: Transfer failed
12:33:56:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11748 run:0 clone:5487 gen:10 core:0x22 unit:0x0000001a8ca304e75e6bafe323dd07eb
12:33:56:WU01:FS01:Uploading 12.58MiB to 140.163.4.231
12:33:56:WU01:FS01:Connecting to 140.163.4.231:8080
12:35:36:WU01:FS01:Upload 0.99%
12:35:36:WARNING:WU01:FS01:Exception: Failed to send results to work server: Transfer failed
12:38:11:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11748 run:0 clone:5487 gen:10 core:0x22 unit:0x0000001a8ca304e75e6bafe323dd07eb
12:38:11:WU01:FS01:Uploading 12.58MiB to 140.163.4.231
12:38:11:WU01:FS01:Connecting to 140.163.4.231:8080
12:38:32:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
12:38:32:WU01:FS01:Connecting to 140.163.4.231:80
12:38:53:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 140.163.4.231:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
12:45:02:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11748 run:0 clone:5487 gen:10 core:0x22 unit:0x0000001a8ca304e75e6bafe323dd07eb
12:45:02:WU01:FS01:Uploading 12.58MiB to 140.163.4.231
12:45:02:WU01:FS01:Connecting to 140.163.4.231:8080
12:45:23:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
12:45:23:WU01:FS01:Connecting to 140.163.4.231:80
12:45:44:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 140.163.4.231:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
12:56:07:WU01:FS01:Sending unit results: id:01 state:SEND error:NO_ERROR project:11748 run:0 clone:5487 gen:10 core:0x22 unit:0x0000001a8ca304e75e6bafe323dd07eb
12:56:07:WU01:FS01:Uploading 12.58MiB to 140.163.4.231
12:56:07:WU01:FS01:Connecting to 140.163.4.231:8080
12:56:28:WARNING:WU01:FS01:WorkServer connection failed on port 8080 trying 80
12:56:28:WU01:FS01:Connecting to 140.163.4.231:80
12:56:50:WARNING:WU01:FS01:Exception: Failed to send results to work server: Failed to connect to 140.163.4.231:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
This seems to suggest that the work server is either unreachable or is not responding. Other applications work fine and communicate with other fah servers
Neil-B
Posts: 2027
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Is plfah1-1.mskcc.org (140.163.4.231) down?

Post by Neil-B »

Not showing so at the moment https://apps.foldingathome.org/serverstats and looks like still OK for storage so might just be overload at the moment.
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)
bronozoj
Posts: 6
Joined: Sun Mar 15, 2020 2:23 pm

Re: Is plfah1-1.mskcc.org (140.163.4.231) down?

Post by bronozoj »

Is there any other way to upload a work unit? It continues to fail with the same error until now and its nearing its expiration (2020-04-16T08:31:28Z). Connecting with a VPN to multiple locations make no difference and trying to ping the server results to 100% dropped packets. The server does have a warning that the collection server is not connected. Can this affect the ability to upload finished work units?
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: Is plfah1-1.mskcc.org (140.163.4.231) down?

Post by PantherX »

bronozoj wrote:Is there any other way to upload a work unit?...
Unfortunately there isn't.
bronozoj wrote:...It continues to fail with the same error until now and its nearing its expiration (2020-04-16T08:31:28Z)...
Today is 2020-04-10T01:16:50Z which means 6 days until it expires. I am hopeful that the issue will be resolved before then
bronozoj wrote:...The server does have a warning that the collection server is not connected. Can this affect the ability to upload finished work units?
The "warning" isn't really a warning per se. The configuration of a CS is entirely optional and depends on the researcher. Not having a CS for a WS means that if the WS is unable to accept the completed WUs, no CS can collect it. If a WS has a CS, then the completed WU can be uploaded to the CS if the WS fails.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
iceman1992
Posts: 527
Joined: Fri Mar 23, 2012 5:16 pm

Re: Is plfah1-1.mskcc.org (140.163.4.231) down?

Post by iceman1992 »

Do the assignment servers check if the work servers are up? I keep getting assigned to 140.163.4.231, when it's currently down.
Post Reply