Re: Project: 9704 (Run 26, Clone 1, Gen 177)

Posted: Fri Jul 08, 2016 2:32 pm
by Joe_H
ChristianVirtual wrote:What looks strange in the server log are the changes from classic to GPU and back
That is normal, and depends on what projects are being served by that WS and other settings.

Re: Project: 9704 (Run 26, Clone 1, Gen 177)

Posted: Fri Jul 08, 2016 2:34 pm
by Joe_H
ChristianVirtual wrote:171.64.65.98, also the server for a core 18 failure in the other thread viewtopic.php?nomobile=1&f=74&t=28948
You misread the log file, the error was reported for this same server and project - not a Core_18.

Re: Project: 9704 (Run 26, Clone 1, Gen 177)

Posted: Fri Jul 08, 2016 2:47 pm
by ChristianVirtual
Joe_H wrote:
ChristianVirtual wrote:What looks strange in the server log are the changes from classic to GPU and back
That is normal, and depends on what projects are being served by that WS and other settings.
Joe_H wrote:
ChristianVirtual wrote:171.64.65.98, also the server for a core 18 failure in the other thread viewtopic.php?nomobile=1&f=74&t=28948
You misread the log file, the error was reported for this same server and project - not a Core_18.
Mea culpa :oops:

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Fri Jul 08, 2016 3:38 pm
by Joe_H
I have merged the two topics about problems with Project 9704 and WS 171.64.65.98. I will notify the project leader about the issue.

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Fri Jul 08, 2016 4:24 pm
by mpharrigan
Sorry, everyone. I'm working to fix this now.

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Fri Jul 08, 2016 4:47 pm
by mpharrigan
Should be fixed

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Fri Jul 08, 2016 5:00 pm
by rwh202
mpharrigan wrote:Should be fixed
Wow, thanks for the quick response. I'll report back if I process any more 9704.

Re: Project: 9704 (Run 26, Clone 1, Gen 177)

Posted: Sat Jul 09, 2016 2:59 am
by _r2w_ben
For future reference, notice in the log file that WU RCV is 0 from Wed Jul 6 23:00:11 PDT 2016 to Fri Jul 8 09:40:16 PDT 2016. A long run of zero received WUs like this is probably a good indicator of a server-side problem.
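
A check like this could in principle be automated. The following is only a rough sketch in Python, not how the work server software does anything: the (timestamp, WU RCV) sample layout, the function name, and the 6-hour threshold are all assumptions made for illustration, and only the two zero-count timestamps are taken from the log mentioned above.

```python
from datetime import datetime, timedelta

def find_zero_receive_windows(samples, min_duration=timedelta(hours=6)):
    """Return (start, end) pairs during which the received-WU count stayed at 0
    for at least min_duration.

    `samples` is a time-ordered list of (datetime, wu_received) tuples.
    This layout is hypothetical; the real server log format may differ.
    """
    windows = []
    start = None      # first timestamp of the current run of zeros
    last_zero = None  # most recent timestamp in the current run of zeros
    for ts, wu_rcv in samples:
        if wu_rcv == 0:
            if start is None:
                start = ts
            last_zero = ts
        else:
            # A non-zero sample ends the run; keep it only if it was long enough.
            if start is not None and last_zero - start >= min_duration:
                windows.append((start, last_zero))
            start = None
    # Handle a run of zeros that lasts until the end of the samples.
    if start is not None and last_zero - start >= min_duration:
        windows.append((start, last_zero))
    return windows

# Illustrative data: the two zero samples use the timestamps quoted above;
# the non-zero samples before and after are made up.
samples = [
    (datetime(2016, 7, 6, 22, 0, 0), 12),   # made-up
    (datetime(2016, 7, 6, 23, 0, 11), 0),
    (datetime(2016, 7, 8, 9, 40, 16), 0),
    (datetime(2016, 7, 8, 17, 0, 0), 5),    # made-up
]

for start, end in find_zero_receive_windows(samples):
    print("No WUs received from", start, "to", end)
```

The threshold matters: a server that is deliberately placed in Reject status will also show zeros, so a real check would need to take the server status into account before raising an alarm.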

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Sat Jul 09, 2016 8:01 am
by rwh202
All looks good - seen a few 9704 successfully uploading to 171.64.65.98

For future reference, I'm wondering whether donor reports are the only way that project owners will get notified of problems? Does the work server code provide them with any automated monitoring and notification of issues? If not, is it a feature request? As _r2w_ben mentions, there are a number of stats reported (and probably many more not currently reported) that could be used to identify and flag up issues much earlier than waiting for multiple corroborative reports on here and relying on a mod spotting it and raising it to PG.

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Sat Jul 09, 2016 1:59 pm
by Joe_H
As I understand it, a number of conditions are already being automatically monitored, and checks on additional items have been requested. But those additional checks would show up in a future version of the WS software. As for the time frame mentioned in which 0 WUs were received, that includes the period from 8:00 PDT during which the server status was Reject. That is when the problem was noticed and the server was placed in that status to prevent further dumping while the problem was fixed.

Re: Project: 9704 [WS - 171.64.65.98]

Posted: Sat Jul 09, 2016 3:02 pm
by 7im
Server status monitoring tools with failure notification have been a perpetual feature request since before I started folding more than a decade ago. Considering the speed of technology advancements, there really is no excuse for not having this feature already. A hard drive should never fill up and cause folding to stop.