2653 (Run 33, Clone 81, Gen 23) Running on 3 cores [Solved]

Moderators: Site Moderators, FAHC Science Team

2653 (Run 33, Clone 81, Gen 23) Running on 3 cores [Solved]

Postby Cajun_Don » Sat Dec 15, 2007 1:40 pm

This WU starts with four cores running for a while. then 1 core has 00 % load and 1 core has 50 % load, with the 2 other cores fluctuating with 50 % load.

When that 1 core does not have a load, no more checkpoints are issued every 15 minutes.

OS: Windows XP SP 2 Home,
MB: MSI K9N4 SLI
CPU: AMD Athlon 64 X2 6000+
Ram: Mushkin Enhance 2 Gb DDR2 800 Dual Channel

When I shutdown the client, it states
"[11:39:43] Killing all core threads
[11:39:43] Killing SMP core threads
[11:39:43] Killing 3 cores
[11:39:43] Killing core 0
[11:39:43] Killing core 1
[11:39:43] Killing core 2"

I shutdown the client and rebooted and still going to 3 core running after one checkpoint.

Code: Select all

--- Opening Log file [December 15 11:20:24]


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 5.91beta5

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\FAH\Win SMP
Service: C:\FAH\Win SMP\fah.exe
Arguments: -svcstart -verbosity 9 –forceasm

Launched as a service.
Entered C:\FAH\Win SMP to do work.

[11:20:24] - Ask before connecting: No
[11:20:24] - User name: Cajun_Don (Team 15)
[11:20:24] - User ID: D49DAA43E8935A1
[11:20:24] - Machine ID: 1
[11:20:24]
[11:20:24] Work directory not found. Creating...
[11:20:24] Could not open work queue, generating new queue...
[11:20:24] - Preparing to get new work unit...
[11:20:24] + Attempting to get work packet
[11:20:24] - Autosending finished units...
[11:20:24] - Will indicate memory of 2047 MB
[11:20:24] Trying to send all finished work units
[11:20:24] + No unsent completed units remaining.
[11:20:24] - Connecting to assignment server
[11:20:24] - Autosend completed
[11:20:24] Connecting to http://assign.stanford.edu:8080/
[11:20:25] Posted data.
[11:20:25] Initial: 40AB; - Successful: assigned to (171.64.65.64).
[11:20:25] + News From Folding@Home: Welcome to Folding@Home
[11:20:25] Loaded queue successfully.
[11:20:25] Connecting to http://171.64.65.64:8080/
[11:20:28] Posted data.
[11:20:28] Initial: 0000; - Receiving payload (expected size: 2944905)
[11:20:36] - Downloaded at ~359 kB/s
[11:20:36] - Averaged speed for that direction ~359 kB/s
[11:20:36] + Received work.
[11:20:36] + Closed connections
[11:20:36]
[11:20:36] + Processing work unit
[11:20:36] Core required: FahCore_a1.exe
[11:20:36] Core found.
[11:20:36] Working on Unit 01 [December 15 11:20:36]
[11:20:36] + Working ...
[11:20:36] - Calling 'mpiexec -channel auto -np 4 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -service -verbose -lifeline 2536 -version 591'

[11:20:36]
[11:20:36] *------------------------------*
[11:20:36] Folding@Home Gromacs SMP Core
[11:20:36] Version 1.74 (March 10, 2007)
[11:20:36]
[11:20:36] Preparing to commence simulation
[11:20:36] - Ensuring status. Please wait.
[11:20:39] - Starting from initial work packet
[11:20:39]
[11:20:39] Project: 2653 (Run 33, Clone 81, Gen 23)
[11:20:39]
[11:20:39] Assembly optimizations on if available.
[11:20:39] Entering M.D.
[11:20:57]  percent)
[11:20:58] - Failed to delete work/wudata_01.pdo
[11:20:58] Warning:  check for stray files
[11:20:58] - Starting from initial work packet
[11:20:58]
[11:20:58] Project: 2653 (Run 33, Clone 81, Gen 23)
[11:20:58]
[11:20:58] les
[11:20:58] - Starting from initial work packet
[11:20:58]
[11:20:58] Project: 2653 (Run 33, Clone 81, Gen 23)
[11:20:58]
[11:20:58] packet
[11:20:58]
[11:20:58] Project: 2653 (Run 33, Clone 81, Gen 23)
[11:20:58]
[11:20:59] Entering M.D.
[11:21:05] Rejecting checkpoint
[11:21:06] Protein: Protein in POPC
[11:21:06] Writing local files
[11:21:07] Extra SSE boost OK.
[11:21:08] Writing local files
[11:21:08] Completed 0 out of 500000 steps  (0 percent)
[11:39:43] Service stop request received.
[11:39:43] ***** Got a SIGTERM signal (2)
[11:39:43] Killing all core threads
[11:39:43] Killing SMP core threads
[11:39:43] Killing 3 cores
[11:39:43] Killing core 0
[11:39:43] Killing core 1
[11:39:43] Killing core 2

Folding@Home Client Shutdown.


--- Opening Log file [December 15 11:40:46]


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 5.91beta5

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\FAH\Win SMP
Executable: C:\FAH\Win SMP\fah.exe
Arguments: -forceasm -verbosity 9

Warning:
 By using the -forceasm flag, you are overriding
 safeguards in the program. If you did not intend to
 do this, please restart the program without -forceasm.
 If work units are not completing fully (and particularly
 if your machine is overclocked), then please discontinue
 use of the flag.

[11:40:46] - Ask before connecting: No
[11:40:46] - User name: Cajun_Don (Team 15)
[11:40:46] - User ID: D49DAA43E8935A1
[11:40:46] - Machine ID: 1
[11:40:46]
[11:40:47] Loaded queue successfully.
[11:40:47]
[11:40:47] - Autosending finished units...
[11:40:47] + Processing work unit
[11:40:47] Trying to send all finished work units
[11:40:47] Core required: FahCore_a1.exe
[11:40:47] + No unsent completed units remaining.
[11:40:47] - Autosend completed
[11:40:47] Core found.
[11:40:47] Working on Unit 01 [December 15 11:40:47]
[11:40:47] + Working ...
[11:40:47] - Calling 'mpiexec -channel auto -np 4 FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -forceasm -verbose -lifeline 3272 -version 591'

[11:40:47]
[11:40:47] *------------------------------*
[11:40:47] Folding@Home Gromacs SMP Core
[11:40:47] Version 1.74 (March 10, 2007)
[11:40:47]
[11:40:47] Preparing to commence simulation
[11:40:47] - Ensuring status. Please wait.
[11:41:04] - Assembly optimizations manually forced on.
[11:41:04] - Not checking prior termination.
[11:41:09] - Expanded 2944393 -> 15216508 (decompressed 516.7 percent)
[11:41:11]
[11:41:11] Project: 2653 (Run 33, Clone 81, Gen 23)
[11:41:11]
[11:41:11] Assembly optimizations on if available.
[11:41:11] Entering M.D.
[11:41:17] Calling FAH init
[11:41:18] in POPC
[11:41:18] Writing local files
[11:41:18]  checkpoint)
[11:41:18] Read checkpoint
[11:41:19] Protein: Protein in POPC
[11:41:19] Writing local files
[11:41:20] Extra SSE boost OK.
[11:41:20] Writing local files
[11:41:20] Completed 0 out of 500000 steps  (0 percent)
[11:56:20] Timered checkpoint triggered.
[12:14:24] Killing all core threads
[12:14:24] Killing SMP core threads
[12:14:24] Killing 3 cores
[12:14:24] Killing core 0
[12:14:24] Killing core 1
[12:14:24] Killing core 2

Folding@Home Client Shutdown at user request.
[12:14:24] ***** Got a SIGTERM signal (2)
[12:14:24] Killing all core threads
[12:14:24] Killing SMP core threads
[12:14:24] Killing 3 cores
[12:14:24] Killing core 0
[12:14:24] Killing core 1
[12:14:24] Killing core 2

Folding@Home Client Shutdown.
Last edited by Cajun_Don on Sun Dec 16, 2007 7:38 am, edited 1 time in total.
Image

Have a great day.
User avatar
Cajun_Don
 
Posts: 89
Joined: Sun Dec 02, 2007 8:05 pm
Location: Cajun Country, Louisiana

Postby toTOW » Sat Dec 15, 2007 4:41 pm

We previously saw this behaviour on this forum ... did you search the forum for it to see if it's the same WU :?:
Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.

FAH-Addict : latest news, tests and reviews about Folding@Home project.

Image
User avatar
toTOW
Site Moderator
 
Posts: 5652
Joined: Sun Dec 02, 2007 11:38 am
Location: Bordeaux, France

Postby Cajun_Don » Sun Dec 16, 2007 7:34 am

Problem resolved by doing, a clean install of Windows XP, after trying everything else to keep it running on 4 cores for over 5 hours.

Happy Holidays :D
User avatar
Cajun_Don
 
Posts: 89
Joined: Sun Dec 02, 2007 8:05 pm
Location: Cajun Country, Louisiana


Return to Issues with a specific WU

Who is online

Users browsing this forum: No registered users and 2 guests

cron