Project: 6904 (Run 2, Clone 13, Gen 22)

The most demanding Projects are only available to a small percentage of very high-end servers.

Moderators: Site Moderators, PandeGroup

Project: 6904 (Run 2, Clone 13, Gen 22)

Postby filu » Mon Sep 19, 2011 6:07 am

Can someone explain to me what happened with the results? It seems that the results were sent only knows where. I've lost 3.5 days on this project to see the statistics today 0 points. :?

Code: Select all
[01:42:49] Completed 250000 out of 250000 steps  (100%)
[01:43:11] DynamicWrapper: Finished Work Unit: sleep=10000
[01:43:21]
[01:43:21] Finished Work Unit:
[01:43:21] - Reading up to 121544064 from "work/wudata_07.trr": Read 121544064
[01:43:22] trr file hash check passed.
[01:43:22] - Reading up to 108715704 from "work/wudata_07.xtc": Read 108715704
[01:43:23] xtc file hash check passed.
[01:43:23] edr file hash check passed.
[01:43:23] logfile size: 218625
[01:43:23] Leaving Run
[01:43:25] - Writing 230651385 bytes of core data to disk...
[01:44:13] Done: 230650873 -> 222319631 (compressed to 3.2 percent)
[01:44:13]   ... Done.
[01:44:33] - Shutting down core
[01:44:33]
[01:44:33] Folding@home Core Shutdown: FINISHED_UNIT
[01:44:35] CoreStatus = 64 (100)
[01:44:35] Unit 7 finished with 78 percent of time to deadline remaining.
[01:44:35] Updated performance fraction: 0.777327
[01:44:35] Sending work to server
[01:44:35] Project: 6904 (Run 2, Clone 13, Gen 22)


[01:44:35] + Attempting to send results [September 19 01:44:35 UTC]
[01:44:35] - Reading file work/wuresults_07.dat from core
[01:44:35]   (Read 222320143 bytes from disk)
[01:44:35] Connecting to http://130.237.232.237:8080/
[02:17:32] Posted data.
[02:17:32] Initial: 0000; - Uploaded at ~109 kB/s
[02:17:32] - Averaged speed for that direction ~108 kB/s
[02:17:32] - Server reports problem with unit.
[02:17:32] Trying to send all finished work units
[02:17:32] + No unsent completed units remaining.
[02:17:32] - Preparing to get new work unit...
[02:17:32] Cleaning up work directory
[02:17:34] + Attempting to get work packet
[02:17:34] Passkey found
[02:17:34] - Will indicate memory of 12033 MB
Image
i7-2600K@4.8 Asus P8P67 EVO 2x2GB GTX480
i7-920@4.0 GA-EX58-UD5 3x2GB 2xGTX560Ti
2x Xeon 5620 6x 2GB
2x Xeon 5645 6x 2GB
filu
 
Posts: 76
Joined: Mon Aug 03, 2009 9:33 am
Location: Krzeszyce, Poland

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby filu » Mon Sep 19, 2011 8:25 am

I used qfix and I received the following message.
Code: Select all
Found results <work/wuresults_07.dat>: proj 12080, run 52393, clone 51681, gen 19963
   -- queue entry: proj 6904, run 2, clone 13, gen 22
   -- doesn't match queue entry


It seems that file the results of a mess. Is there a tool to fix it?
filu
 
Posts: 76
Joined: Mon Aug 03, 2009 9:33 am
Location: Krzeszyce, Poland

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby 7im » Mon Sep 19, 2011 4:37 pm

The WU doesn't go anywhere when there is a problem detected.

[02:17:32] - Server reports problem with unit.

It is simply deleted.

[02:17:32] Cleaning up work directory

Back off the overclock a little, and the WU's will finish more reliably.
Please do not mistake my brevity as dispassion or condescension. I recognize the time you spend reading the forum is time you could use elsewhere, so my short responses save you time. Please do not hesitate to ask for clarification if I was too terse.
User avatar
7im
 
Posts: 13336
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby filu » Mon Sep 19, 2011 5:55 pm

7im wrote:Back off the overclock a little, and the WU's will finish more reliably.

I'm sorry but this is a server (2x Xeon 5645, SuperMicro X8DTL-i, 6x 2GB Kingston 1333 ECC), and the motherboard does not allow to overclock CPUs.
So this solution is out.
filu
 
Posts: 76
Joined: Mon Aug 03, 2009 9:33 am
Location: Krzeszyce, Poland

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby 7im » Mon Sep 19, 2011 6:09 pm

Let's have a mod check to see if someone else has completed this same work unit.

If yes, you may need to run diags on the hardware, network, etc. If not, then write it off as a bad work unit. It happens... Sorry.
User avatar
7im
 
Posts: 13336
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby sortofageek » Mon Sep 19, 2011 7:11 pm

So far, the database shows only one entry, which was not successful, for Project: 6904 (Run 2, Clone 13, Gen 22).
User avatar
sortofageek
Site Admin
 
Posts: 2862
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby filu » Wed Sep 21, 2011 7:43 pm

An hour ago I sent back the results of the p6903, so a hardware error eliminated.
The server was not reset.
filu
 
Posts: 76
Joined: Mon Aug 03, 2009 9:33 am
Location: Krzeszyce, Poland

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby sortofageek » Wed Sep 21, 2011 10:39 pm

Yes, Project: 6904 (Run 2, Clone 13, Gen 22) looks like a bad WU. I just can't prove it at this point.
User avatar
sortofageek
Site Admin
 
Posts: 2862
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix

Re: Project: 6904 (Run 2, Clone 13, Gen 22)

Postby sortofageek » Mon Oct 10, 2011 7:52 pm

Two different donors did complete Project: 6904 (Run 2, Clone 13, Gen 22) successfully, so it couldn't have been a bad WU.
User avatar
sortofageek
Site Admin
 
Posts: 2862
Joined: Fri Nov 30, 2007 8:06 pm
Location: Team Helix


Return to SMP with bigadv

Who is online

Users browsing this forum: No registered users and 1 guest