Page 1 of 1

Problem with WU - Project 3061 (Run 2, Clone 5, Gen 7)

Posted: Thu Dec 27, 2007 9:16 am
by klasseng
Project 3061 (Run 2, Clone 5, Gen 7)

This WU will run to 24% and then crash out:
[08:35:32] Completed 1200000 out of 5000000 steps (24 percent)
[08:35:32]
[08:35:32] Writing local files
[08:35:32] Completed 1200000 out of 5000000 steps (24 percent)
[08:35:32] Extra SSE boost OK.
[08:50:32] Timered checkpoint triggered.
[08:53:53] Warning: long 1-4 interactions
[08:53:55]
[08:53:55] Folding@home Core Shutdown: INTERRUPTED
[08:53:59] CoreStatus = 0 (0)
[08:53:59] Client-core communications error: ERROR 0x0
[08:53:59] Deleting current work unit & continuing...
It was happening on my 8 core (2 X quad core) MacPro.
Mac OS X 10.4.11
fah6
I restarted the WU and it crashed again. I moved the WU to a Core Duo Mac Mini. Same result.

If someone wants more info, there's a zip file of all the working files at:
http://www.medianorthpro.com/fah/3061.zip 6.3MB

Re: Problem with Project 3061

Posted: Thu Dec 27, 2007 12:10 pm
by gwildperson
klasseng wrote:Warning: long 1-4 interactions
The message long 1-4 interactions is generally associated with protein atoms that are positioned incorrectly and cannot be simulated. Some WUs will inevitably reach conditions like this and if it's because of your hardware, it's probably not going to repeat exactly. If it's because of the configuration itself, the WUs with this error should be rare and should be ignored. We all get a few like that.

Re: Problem with Project 3061

Posted: Thu Dec 27, 2007 2:59 pm
by klasseng
So we don't need to report these?

peace,
Grant

Re: Problem with Project 3061

Posted: Thu Dec 27, 2007 3:07 pm
by gwildperson
klasseng wrote:So we don't need to report these?
I didn't say that ....

It's a good idea to report anything that says Deleting current work unit & continuing... but don't be surprised if you process the same WU 3 or 4 times before getting something new.

Re: Problem with Project 3061

Posted: Fri Dec 28, 2007 12:07 am
by 7im
Did this work unit stop in the same place each time? or differnt % completed?

Re: Problem with Project 3061

Posted: Fri Dec 28, 2007 1:00 am
by klasseng
same % - three times, 2X on one machine, 1X on the second

Re: Problem with Project 3061

Posted: Fri Dec 28, 2007 4:38 am
by 7im
It's most likely a bad work unit, by your description of the symptoms. And no one else has completed this WU either, so that's my diagnosis. Move on to the next WU. Thanks for the report.