Project: 2671 (Run 3, Clone 91, Gen 120)

Moderators: Site Moderators, FAHC Science Team

Post Reply
skinnykid63
Posts: 8
Joined: Mon Jun 23, 2008 2:11 pm

Project: 2671 (R3, C91, G120)

Post by skinnykid63 »

Code: Select all

[12:16:04] Folding@Home Gromacs SMP Core
[12:16:04] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[12:16:04] 
[12:16:04] Preparing to commence simulation
[12:16:04] - Ensuring status. Please wait.
[12:16:05] Called DecompressByteArray: compressed_data_size=4843329 data_size=24004881, decompressed_data_size=24004881 diff=0
[12:16:05] - Digital signature verified
[12:16:05] 
[12:16:05] Project: 2671 (Run 3, Clone 91, Gen 120)
[12:16:05] 
[12:16:05] Assembly optimizations on if available.
[12:16:05] Entering M.D.
[12:16:16] Run 3, Clone 91, Gen 120)
[12:16:16] 
[12:16:16] Entering M.D.
[12:16:40] Completed 0 out of 250000 steps  (0%)
[12:16:40] 
[12:16:40] Folding@home Core Shutdown: INTERRUPTED
[12:16:44] CoreStatus = 66 (102)
[12:16:44] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[12:16:44] Killing all core threads

Folding@Home Client Shutdown.
The fah terminal also says there was a segfault in one of the cores.
BrokenWolf
Posts: 126
Joined: Sat Aug 02, 2008 3:08 am

Re: Project: 2671 (R3, C91, G120)

Post by BrokenWolf »

I just had this fail on one of my folders as well

Code: Select all

NOTE: The tpr file used for this simulation is in an old format, for less memory usage and possibly more performance create a new tpr file with an up to date version of grompp

Making 1D domain decomposition 1 x 1 x 4
starting mdrun '22878 system in water'
30250000 steps,  60500.0 ps (continuing from step 30000000,  60000.0 ps).

t = 60000.003 ps: Water molecule starting at atom 65638 can not be settled.
Check for bad contacts and/or reduce the timestep.

t = 60000.005 ps: Water molecule starting at atom 127297 can not be settled.
Check for bad contacts and/or reduce the timestep.

t = 60000.005 ps: Water molecule starting at atom 42760 can not be settled.
Check for bad contacts and/or reduce the timestep.
[01:30:57] lding@home Core Shutdown: INTERRUPTED
application called MPI_Abort(MPI_COMM_WORLD, 102) - process 0
[0]0:Return code = 102
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Segmentation fault
[01:31:01] CoreStatus = 66 (102)
[01:31:01] + Shutdown requested by user. Exiting.***** Got a SIGTERM signal (15)
[01:31:01] Killing all core threads

Folding@Home Client Shutdown.
Image
parkut
Posts: 364
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Re: Project: 2671 (R3, C91, G120)

Post by parkut »

I got this bad WU also. Crashes immediately on startup.
shunter
Posts: 84
Joined: Sun Apr 06, 2008 8:22 am
Location: Hertfordshire, United Kingdom

Re: Project: 2671 (R3, C91, G120)

Post by shunter »

Now downloaded this bad WU 3 times in last 3 days - fails immediately and now having to monitor download on this pc everytime or when I wake up. Happened at 2.00 am today so I lost 6 hours of folding. Please arrange for its removal or correction ASAP.
Many thanks
Shunter
Image
parkut
Posts: 364
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Re: Project: 2671 (R3, C91, G120)

Post by parkut »

This is an unusually stubborn WU. I've had it assigned and re-assigned on several different machines, sometimes at the same time!

(It's still bad)
BrokenWolf
Posts: 126
Joined: Sat Aug 02, 2008 3:08 am

Project: 2671 (Run 3, Clone 91, Gen 120)

Post by BrokenWolf »

I got this one on_3_ of my folding systems. This is at least the second time I have reported this WU. Can we please do something about this work unit?

Code: Select all

Making 1D domain decomposition 1 x 1 x 4
starting mdrun '22878 system in water'
30250000 steps,  60500.0 ps (continuing from step 30000000,  60000.0 ps).

t = 60000.003 ps: Water molecule starting at atom 65638 can not be settled.
Check for bad contacts and/or reduce the timestep.

t = 60000.005 ps: Water molecule starting at atom 42760 can not be settled.
Check for bad contacts and/or reduce the timestep.

t = 60000.005 ps: Water molecule starting at atom 127297 can not be settled.
Check for bad contacts and/or reduce the timestep.
[01:45:07] lding@home Core Shutdown: INTERRUPTED
application called MPI_Abort(MPI_COMM_WORLD, 102) - process 0
Please kill this WU.

Thanks, BW
Image
Post Reply