Project 3059 (Run 4, Clone 75, Gen 25)

Moderators: Site Moderators, FAHC Science Team

Project 3059 (Run 4, Clone 75, Gen 25)

Postby DocJonz » Thu Dec 06, 2007 9:43 pm

Have had issues with Project 3059 (Run 4, Clone 75, Gen 25) - it has cropped up on a couple of my 64-bit Linux boxes and has dropped out right at the start each time - here's one of them....

Code: Select all
[17:31:26] *------------------------------*
[17:31:26] Folding@Home Gromacs SMP Core
[17:31:26] Version 1.74 (November 27, 2006)
[17:31:26]
[17:31:26] Preparing to commence simulation
[17:31:26] - Ensuring status. Please wait.
[17:31:43] - Assembly optimizations manually forced on.
[17:31:43] - Not checking prior termination.
[17:31:43] - Expanded 530964 -> 2922581 (decompressed 550.4 percent)
[17:31:43] - Starting from initial work packet
[17:31:43]
[17:31:43] Project: 3059 (Run 4, Clone 75, Gen 25)
[17:31:43]
[17:31:43] Assembly optimizations on if available.
[17:31:43] Entering M.D.
NNODES=4, MYRANK=1, HOSTNAME=AkasaC2Q2
NNODES=4, MYRANK=2, HOSTNAME=AkasaC2Q2
NNODES=4, MYRANK=3, HOSTNAME=AkasaC2Q2
NNODES=4, MYRANK=0, HOSTNAME=AkasaC2Q2
NODEID=1 argc=15
NODEID=3 argc=15
NODEID=2 argc=15
NODEID=0 argc=15
      Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
             Copyright (c) 2001-2004, The GROMACS development team,
            check out http://www.gromacs.org for more information.

        This inclusion of Gromacs code in the Folding@Home Core is under
        a special license (see http://folding.stanford.edu/gromacs.html)
         specially granted to Stanford by the copyright holders. If you
          are interested in using Gromacs, visit www.gromacs.org where
                you can download a free version of Gromacs under
         the terms of the GNU General Public License (GPL) as published
       by the Free Software Foundation; either version 2 of the License,
                     or (at your option) any later version.

[17:31:49] Protein: 19884 p3059_supervillin_newbox
[17:31:49] Writing local files
starting mdrun '19884 p3059_supervillin_newbox'
5000000 steps,  10000.0 ps.

[17:31:49] Extra SSE boost OK.
[17:31:49]
[17:31:49] Extra SSE boost OK.
[17:31:49] Warning:  long 1-4 interactions
[17:31:49] Writing local files
[17:31:49] Completed 0 out of 5000000 steps  (0 percent)
[0]0:Return code = 0, signaled with Segmentation fault
[0]1:Return code = 0, signaled with Segmentation fault
[0]2:Return code = 0, signaled with Segmentation fault
[0]3:Return code = 0, signaled with Segmentation fault
[17:31:53] CoreStatus = 0 (0)
[17:31:53] Client-core communications error: ERROR 0x0
[17:31:53] - Attempting to download new core...
[17:31:53] + Downloading new core: FahCore_a1.exe
[17:31:53] Downloading core (/~pande/Linux/x86//Core_a1.fah from www.stanford.edu)
[17:31:55] Initial: AFDE; + 10240 bytes downloaded
[17:31:55] Initial: B54E; + 20480 bytes downloaded

Image
User avatar
DocJonz
 
Posts: 211
Joined: Thu Dec 06, 2007 7:31 pm
Location: United Kingdom

Postby uncle_fungus » Fri Dec 07, 2007 3:09 pm

So far no one has returned this WU, successfully or otherwise.
User avatar
uncle_fungus
Site Admin
 
Posts: 1288
Joined: Fri Nov 30, 2007 10:37 am
Location: Oxfordshire, UK

Postby Flathead74 » Fri Dec 07, 2007 3:28 pm

Members of our team have received this WU ten different times,
with the same result every time.

Guess what, the Fahlog report is identical to that of DocJonz.

The following link points to a list of WUs that has been troublesome to our team.

http://www.fold4life.com/forum/viewtopic.php?t=886
Flathead74
 
Posts: 266
Joined: Sun Dec 02, 2007 7:08 pm
Location: Central New York

Postby DocJonz » Fri Dec 07, 2007 11:04 pm

One of my other Linux boxes has tried to pull this WU down today - exactly the same outcome as before ....

Image
User avatar
DocJonz
 
Posts: 211
Joined: Thu Dec 06, 2007 7:31 pm
Location: United Kingdom

Postby kasson » Sun Dec 09, 2007 12:54 am

Thanks for the reports. We're looking at ways to deal with such problems better; the concept is easy, but the implementation is hard, so it may take a while. :(
kasson
Pande Group Member
 
Posts: 1459
Joined: Thu Nov 29, 2007 10:37 pm

Postby Flathead74 » Mon Dec 17, 2007 4:21 am

Project: 3059 (Run 4, Clone 75, Gen 25)

Got this WU again today.

Same results as always, immediate failure.
Flathead74
 
Posts: 266
Joined: Sun Dec 02, 2007 7:08 pm
Location: Central New York


Return to Issues with a specific WU

Who is online

Users browsing this forum: No registered users and 2 guests

cron