Project: 2619 (Run 2, Clone 524, Gen 2)


Post by Phantom

Just a note that this work unit "stalled" at 22% during its first attempt. When I noticed the stall, Activity Monitor showed four idle FahCore_a2.exe processes and one mpiexec process, so I shut the work unit down manually (unfortunately losing almost 22 hours of available folding power; see the FahLog snippet below). I was later able to resume the work unit from the last checkpoint. I thought I'd mention this stall for the benefit of others, and to get it on record in case a trend develops.

Configuration: a dedicated folding Mac mini (Mac OS X 10.5, 2.33 GHz Intel Core 2 Duo, 2 GB of SDRAM). This processor has successfully folded 170 non-Project 2619 work units without incident, so I do not suspect the processor. Before this work unit, it successfully folded Project 2619 (Run 2, Clone 524, Gen 0) and was then assigned Project 2619 (Run 2, Clone 524, Gen 1), which 0x1'ed at 43% and was automatically deleted. The processor was then reassigned the same WU, and this time it ran through to completion before being assigned the follow-on work unit that is the subject of this topic, Project 2619 (Run 2, Clone 524, Gen 2).

(Autosend attempts were edited out of the log below.)

[01:20:20] - Preparing to get new work unit...
[01:20:20] + Attempting to get work packet
[01:20:20] - Will indicate memory of 2048 MB
[01:20:20] - Connecting to assignment server
[01:20:20] Connecting to http://assign.stanford.edu:8080/
[01:20:20] Posted data.
[01:20:20] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[01:20:20] + News From Folding@Home: Welcome to Folding@Home
[01:20:20] Loaded queue successfully.
[01:20:20] Connecting to http://171.64.65.56:8080/
[01:20:28] Posted data.
[01:20:28] Initial: 0000; - Receiving payload (expected size: 9149575)
[01:20:52] - Downloaded at ~372 kB/s
[01:20:52] - Averaged speed for that direction ~592 kB/s
[01:20:52] + Received work.
[01:20:52] Trying to send all finished work units
[01:20:52] + No unsent completed units remaining.
[01:20:52] + Closed connections
[01:20:52] 
[01:20:52] + Processing work unit
[01:20:52] Core required: FahCore_a2.exe
[01:20:52] Core found.
[01:20:52] Working on Unit 06 [March 31 01:20:52]
[01:20:52] + Working ...
[01:20:52] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 06 -checkpoint 30 -forceasm -verbose -lifeline 365 -version 601'

[01:20:52] 
[01:20:52] *------------------------------*
[01:20:52] Folding@Home Gromacs SMP Core
[01:20:52] Version 1.91 (2007)
[01:20:52] 
[01:20:52] Preparing to commence simulation
[01:20:52] - Ensuring status. Please wait.
[01:21:09] - Assembly optimizations manually forced on.
[01:21:09] - Not checking prior termination.
[01:21:09] Error: Work unit read from disk is invalid
[01:21:09] Finalizing output
[01:21:14] - Expanded 9149063 -> 48331685 (decompressed 58.8 percent)
[01:21:15] 
[01:21:15] Project: 2619 (Run 2, Clone 524, Gen 2)
[01:21:15] 
[01:21:16] Assembly optimizations on if available.
[01:21:16] Entering M.D.
[01:21:27] Completed 0 out of 125000 steps  (0)
[01:31:09] Completed 630 out of 125000 steps  (1)
[01:50:23] Completed 1880 out of 125000 steps  (2)
[02:09:39] Completed 3130 out of 125000 steps  (3)
[02:28:54] Completed 4380 out of 125000 steps  (4)
[02:48:08] Completed 5630 out of 125000 steps  (5)
[03:07:24] Completed 6880 out of 125000 steps  (6)
[03:26:39] Completed 8130 out of 125000 steps  (7)
[03:45:55] Completed 9380 out of 125000 steps  (8)
[04:05:09] Completed 10630 out of 125000 steps  (9)
[04:24:24] Completed 11880 out of 125000 steps  (10)
[04:43:40] Completed 13130 out of 125000 steps  (11)
[05:02:56] Completed 14380 out of 125000 steps  (12)
[05:22:11] Completed 15630 out of 125000 steps  (13)
[05:41:27] Completed 16880 out of 125000 steps  (14)
[06:00:42] Completed 18130 out of 125000 steps  (15)
[06:19:58] Completed 19380 out of 125000 steps  (16)
[06:39:14] Completed 20630 out of 125000 steps  (17)
[06:58:31] Completed 21880 out of 125000 steps  (18)
[07:17:49] Completed 23130 out of 125000 steps  (19)
[07:37:06] Completed 24380 out of 125000 steps  (20)
[07:56:28] Completed 25630 out of 125000 steps  (21)
[08:15:50] Completed 26880 out of 125000 steps  (22)
[08:46:00] Timer requesting checkpoint
[09:16:01] Timer requesting checkpoint
[09:46:01] Timer requesting checkpoint
[10:16:01] Timer requesting checkpoint
[10:46:01] Timer requesting checkpoint
[11:16:01] Timer requesting checkpoint
[11:46:01] Timer requesting checkpoint
[12:16:01] Timer requesting checkpoint
[12:46:01] Timer requesting checkpoint
[13:16:01] Timer requesting checkpoint
[13:46:01] Timer requesting checkpoint
[14:16:01] Timer requesting checkpoint
[14:46:01] Timer requesting checkpoint
[15:16:01] Timer requesting checkpoint
[15:46:01] Timer requesting checkpoint
[16:16:01] Timer requesting checkpoint
[16:46:01] Timer requesting checkpoint
[17:16:01] Timer requesting checkpoint
[17:46:01] Timer requesting checkpoint
[18:16:01] Timer requesting checkpoint
[18:46:01] Timer requesting checkpoint
[19:16:01] Timer requesting checkpoint
[19:46:01] Timer requesting checkpoint
[20:16:01] Timer requesting checkpoint
[20:46:01] Timer requesting checkpoint
[21:16:01] Timer requesting checkpoint
[21:46:01] Timer requesting checkpoint
[22:16:01] Timer requesting checkpoint
[22:46:01] Timer requesting checkpoint
[23:16:01] Timer requesting checkpoint
[23:46:01] Timer requesting checkpoint
[00:16:01] Timer requesting checkpoint
[00:46:01] Timer requesting checkpoint
[01:16:01] Timer requesting checkpoint
[01:46:01] Timer requesting checkpoint
[02:16:01] Timer requesting checkpoint
[02:46:01] Timer requesting checkpoint
[03:16:01] Timer requesting checkpoint
[03:46:01] Timer requesting checkpoint
[04:16:01] Timer requesting checkpoint
[04:46:01] Timer requesting checkpoint
[05:16:01] Timer requesting checkpoint
[05:46:01] Timer requesting checkpoint
[06:03:20] ***** Got a SIGTERM signal (15)
[06:03:20] Killing all core threads

Folding@Home Client Shutdown.
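
For anyone who wants to catch this kind of stall sooner, here is a rough sketch of a log check in Python. It is not part of the client. It assumes only the line format visible in the log above ("[HH:MM:SS]" timestamps and "Completed N out of M steps" progress lines), and it assumes that a timestamp going backwards means the clock passed midnight. The function name find_stall and the one-hour threshold are my own choices for illustration.

```python
import re
from datetime import timedelta

# Line formats are taken from the log excerpt above; nothing else is assumed.
STAMP = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\]\s?(.*)$")
PROGRESS = re.compile(r"Completed (\d+) out of (\d+) steps")


def find_stall(lines, threshold=timedelta(hours=1)):
    """Report a stall if the log kept running for longer than `threshold`
    after its last "Completed ... steps" line.

    Returns a dict with the last step count, the total steps, and the idle
    time, or None if no stall is detected. (Hypothetical helper.)
    """
    day = 0               # number of midnight rollovers seen so far
    prev_secs = None      # seconds-of-day of the previous timestamped line
    last_progress = None  # (elapsed, steps, total) of the last progress line
    last_seen = None      # elapsed time of the last timestamped line

    for line in lines:
        m = STAMP.match(line.strip())
        if not m:
            continue  # skip blank or untimestamped lines
        h, mi, s, msg = m.groups()
        secs = int(h) * 3600 + int(mi) * 60 + int(s)
        # Assumption: a timestamp earlier than the previous one means a new day.
        if prev_secs is not None and secs < prev_secs:
            day += 1
        prev_secs = secs
        last_seen = timedelta(days=day, seconds=secs)

        p = PROGRESS.search(msg)
        if p:
            last_progress = (last_seen, int(p.group(1)), int(p.group(2)))

    if last_progress is None:
        return None
    idle = last_seen - last_progress[0]
    if idle < threshold:
        return None
    return {"steps": last_progress[1], "total": last_progress[2], "idle": idle}


if __name__ == "__main__":
    import sys

    # Usage: python find_stall.py FAHlog.txt
    with open(sys.argv[1], errors="replace") as fh:
        result = find_stall(fh)
    print(result if result else "No stall detected.")
```

On the log above, the last progress line is at [08:15:50] and the last line is the SIGTERM at [06:03:20] the following day, so this check would report the gap between those two timestamps. A threshold of one hour is comfortable here because each 1% took roughly 19 minutes on this machine; slower hardware would need a larger value.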
Hope this helps.