Project: 3064 (Run 4, Clone 42, Gen 16) Bad Core type?

Moderators: Site Moderators, FAHC Science Team

GTron
Posts: 53
Joined: Wed Dec 05, 2007 3:47 pm
Location: Denver area, Colorado

Post by GTron

I ran into an unusual error uploading this completed WU (at least, I haven't seen it before). After two failed connections to the work server (171.64.65.63:8080), the client uploaded the results to the collection server (171.64.122.76:8080) at 07:51:47 and logged "Core type used on unit not what server demands" followed by "Successfully sent unit 08 to Collection server". It then downloaded the same WU again (but NOT the core). After the second completion of the WU, the upload was accepted by 171.64.65.63:8080. I think the a1 core was last downloaded in February.
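For anyone who wants to check their own FAHlog.txt for the same pattern, here is a rough Python sketch. It is not part of the client; the function names (`upload_outcomes`, `repeated_units`) and the outcome labels are made up for illustration, and the message patterns are taken only from the lines visible in the log excerpt in this post, so other client versions may word them differently.

```python
import re
from collections import Counter

# Patterns below are assumptions based on this one log excerpt.
STAMP = re.compile(r"\[(\d{2}:\d{2}:\d{2})\]")
CONNECT = re.compile(r"\[\d{2}:\d{2}:\d{2}\]\s*Connecting to http://([^/:\s]+):(\d+)/")
PROJECT = re.compile(
    r"\[\d{2}:\d{2}:\d{2}\]\s*Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)"
)

# Message fragment -> hypothetical label used only in this sketch.
OUTCOMES = {
    "Results successfully sent": "accepted",
    "Couldn't send HTTP request to server": "connect_failed",
    "Core type used on unit not what server demands": "core_type_mismatch",
}


def upload_outcomes(lines):
    """Return (time, server, outcome) for each connection followed by a known outcome."""
    events = []
    server = None  # most recent "Connecting to" target without an outcome yet
    for line in lines:
        m = CONNECT.match(line)
        if m:
            server = f"{m.group(1)}:{m.group(2)}"
            continue
        if server is None:
            continue
        for fragment, label in OUTCOMES.items():
            if fragment in line:
                stamp = STAMP.match(line)
                events.append((stamp.group(1) if stamp else "", server, label))
                server = None
                break
    return events


def repeated_units(lines):
    """Return {(project, run, clone, gen): count} for units started more than once."""
    counts = Counter()
    for line in lines:
        m = PROJECT.match(line)
        if m:
            counts[tuple(int(g) for g in m.groups())] += 1
    return {unit: n for unit, n in counts.items() if n > 1}
```

To use it, something like `lines = open("FAHlog.txt").read().splitlines()` and then calling both functions on `lines` should do. A `core_type_mismatch` event followed by the same Project/Run/Clone/Gen showing up in `repeated_units` would match what happened here.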

Snipped FAHlog.txt follows:

# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 6.02beta

                          http://folding.stanford.edu

###############################################################################
###############################################################################
  ***  snip  ***
[17:56:39] Completed 2500000 out of 2500000 steps  (100 percent)
[17:56:39] Writing final coordinates.
[17:56:39] Past main M.D. loop
[17:56:39] Will end MPI now
[17:57:39] 
[17:57:39] Finished Work Unit:
[17:57:39] - Reading up to 1601592 from "work/wudata_07.arc": Read 1601592
[17:57:39] - Reading up to 488460 from "work/wudata_07.xtc": Read 488460
[17:57:39] goefile size: 0
[17:57:39] logfile size: 76420
[17:57:39] Leaving Run
[17:57:39] - Writing 2268316 bytes of core data to disk...
[17:57:39]   ... Done.
[17:57:40] - Shutting down core
[17:57:40] 
[17:57:40] Folding@home Core Shutdown: FINISHED_UNIT
[17:57:45] CoreStatus = 64 (100)
[17:57:45] Unit 7 finished with 84 percent of time to deadline remaining.
[17:57:45] Updated performance fraction: 0.838102
[17:57:45] Sending work to server


[17:57:45] + Attempting to send results
[17:57:45] - Reading file work/wuresults_07.dat from core
[17:57:45]   (Read 2268316 bytes from disk)
[17:57:45] Connecting to http://171.64.65.63:8080/
[17:58:36] Posted data.
[17:58:37] Initial: 0000; - Uploaded at ~42 kB/s
[17:58:37] - Averaged speed for that direction ~42 kB/s
[17:58:37] + Results successfully sent
[17:58:37] Thank you for your contribution to Folding@Home.
[17:58:37] + Number of Units Completed: 328

[18:02:46] - Warning: Could not delete all work unit files (7): Core returned invalid code
[18:02:46] Trying to send all finished work units
[18:02:46] + No unsent completed units remaining.
[18:02:46] - Preparing to get new work unit...
[18:02:46] + Attempting to get work packet
[18:02:46] - Will indicate memory of 1536 MB
[18:02:46] - Connecting to assignment server
[18:02:46] Connecting to http://assign.stanford.edu:8080/
[18:02:47] Posted data.
[18:02:47] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[18:02:47] + News From Folding@Home: Welcome to Folding@Home
[18:02:47] Loaded queue successfully.
[18:02:47] Connecting to http://171.64.65.63:8080/
[18:02:48] Posted data.
[18:02:48] Initial: 0000; - Receiving payload (expected size: 608081)
[18:02:50] - Downloaded at ~296 kB/s
[18:02:50] - Averaged speed for that direction ~385 kB/s
[18:02:50] + Received work.
[18:02:50] Trying to send all finished work units
[18:02:50] + No unsent completed units remaining.
[18:02:50] + Closed connections
[18:02:50] 
[18:02:50] + Processing work unit
[18:02:50] Core required: FahCore_a1.exe
[18:02:50] Core found.
[18:02:50] Working on Unit 08 [April 12 18:02:50]
[18:02:50] + Working ...
[18:02:50] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 3265 -version 602'

[18:02:50] 
[18:02:50] *------------------------------*
[18:02:50] Folding@Home Gromacs SMP Core
[18:02:50] Version 1.74 (November 27, 2006)
[18:02:50] 
[18:02:50] Preparing to commence simulation
[18:02:50] - Ensuring status. Please wait.
[18:03:07] - Assembly optimizations manually forced on.
[18:03:07] - Not checking prior termination.
[18:03:07] - Expanded 607569 -> 3255941 (decompressed 535.8 percent)
[18:03:07] - Starting from initial work packet
[18:03:07] 
[18:03:07] Project: 3064 (Run 4, Clone 42, Gen 16)
[18:03:07] 
[18:03:07] Assembly optimizations on if available.
[18:03:07] Entering M.D.
[18:03:13] Protein: p3064_lambdaProtein: p3064_lambda5_2003Extra SSE boost OK.
[18:03:13] 
[18:03:13] Extra SSE boost OK.
[18:03:13] Writing local files
[18:03:13] Completed 0 out of 5000000 steps  (0 percent)
[18:11:25] Writing local files
[18:11:25] Completed 50000 out of 5000000 steps  (1 percent)
[18:19:38] Writing local files
[18:19:38] Completed 100000 out of 5000000 steps  (2 percent)
  ***  snip  ***
[07:41:33] Writing local files
[07:41:33] Completed 4950000 out of 5000000 steps  (99 percent)
[07:49:50] Writing local files
[07:49:50] Completed 5000000 out of 5000000 steps  (100 percent)
[07:49:50] Writing final coordinates.
[07:49:50] Past main M.D. loop
[07:49:50] Will end MPI now
[07:50:50] 
[07:50:50] Finished Work Unit:
[07:50:50] - Reading up to 516624 from "work/wudata_08.arc": Read 516624
[07:50:50] - Reading up to 972092 from "work/wudata_08.xtc": Read 972092
[07:50:50] goefile size: 0
[07:50:50] logfile size: 136716
[07:50:50] Leaving Run
[07:50:52] - Writing 1826476 bytes of core data to disk...
[07:50:52]   ... Done.
[07:50:52] - Shutting down core
[07:50:52] 
[07:50:52] Folding@home Core Shutdown: FINISHED_UNIT
[07:50:56] CoreStatus = 64 (100)
[07:50:56] Unit 8 finished with 84 percent of time to deadline remaining.
[07:50:56] Updated performance fraction: 0.838533
[07:50:56] Sending work to server


[07:50:56] + Attempting to send results
[07:50:56] - Reading file work/wuresults_08.dat from core
[07:50:56]   (Read 1826476 bytes from disk)
[07:50:56] Connecting to http://171.64.65.63:8080/
[07:51:04] - Couldn't send HTTP request to server
[07:51:04] + Could not connect to Work Server (results)
[07:51:04]     (171.64.65.63:8080)
[07:51:04] - Error: Could not transmit unit 08 (completed April 13) to work server.
[07:51:04] - 1 failed uploads of this unit.
[07:51:04]   Keeping unit 08 in queue.
[07:51:04] Trying to send all finished work units


[07:51:04] + Attempting to send results
[07:51:04] - Reading file work/wuresults_08.dat from core
[07:51:04]   (Read 1826476 bytes from disk)
[07:51:04] Connecting to http://171.64.65.63:8080/
[07:51:05] - Couldn't send HTTP request to server
[07:51:05] + Could not connect to Work Server (results)
[07:51:05]     (171.64.65.63:8080)
[07:51:05] - Error: Could not transmit unit 08 (completed April 13) to work server.
[07:51:05] - 2 failed uploads of this unit.


[07:51:05] + Attempting to send results
[07:51:05] - Reading file work/wuresults_08.dat from core
[07:51:05]   (Read 1826476 bytes from disk)
[07:51:05] Connecting to http://171.64.122.76:8080/
[07:51:47] Posted data.
[07:51:47] Initial: 0000; - Uploaded at ~42 kB/s
[07:51:47] - Averaged speed for that direction ~42 kB/s
[07:51:47] - Core type used on unit not what server demands.
[07:51:47]   Successfully sent unit 08 to Collection server.
[07:51:47] + Sent 1 of 1 completed units to the server
[07:51:47] - Preparing to get new work unit...
[07:51:47] + Attempting to get work packet
[07:51:47] - Will indicate memory of 1536 MB
[07:51:47] - Connecting to assignment server
[07:51:47] Connecting to http://assign.stanford.edu:8080/
[07:51:48] Posted data.
[07:51:48] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[07:51:48] + News From Folding@Home: Welcome to Folding@Home
[07:51:48] Loaded queue successfully.
[07:51:48] Connecting to http://171.64.65.63:8080/
[07:51:49] Posted data.
[07:51:49] Initial: 0000; - Receiving payload (expected size: 608081)
[07:51:50] - Downloaded at ~593 kB/s
[07:51:50] - Averaged speed for that direction ~427 kB/s
[07:51:50] + Received work.
[07:51:50] Trying to send all finished work units
[07:51:50] + No unsent completed units remaining.
[07:51:50] + Closed connections
[07:51:50] 
[07:51:50] + Processing work unit
[07:51:50] Core required: FahCore_a1.exe
[07:51:50] Core found.
[07:51:50] Working on Unit 09 [April 13 07:51:50]
[07:51:50] + Working ...
[07:51:50] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 09 -checkpoint 15 -forceasm -verbose -lifeline 3265 -version 602'

[07:51:50] 
[07:51:50] *------------------------------*
[07:51:50] Folding@Home Gromacs SMP Core
[07:51:50] Version 1.74 (November 27, 2006)
[07:51:50] 
[07:51:50] Preparing to commence simulation
[07:51:50] - Ensuring status. Please wait.
[07:51:50] - Starting from initial work packet
[07:51:50] 
[07:51:50] Project: 3064 (Run 4, Clone 42, Gen 16)
[07:51:50] 
[07:51:50] Assembly optimizations on if available.
[07:51:50] Entering M.D.
[07:52:07]  on if available.
[07:52:07] Entering M.D.
[07:52:13] 064_lambda5_2003
[07:52:13] Writing local files
[07:52:13] Extra SSE boost OK.
[07:52:13]  boost OK.
[07:52:14] al files
[07:52:14] Completed 0 out of 5000000 steps  (0 percent)
[08:00:33] Writing local files
[08:00:33] Completed 50000 out of 5000000 steps  (1 percent)
[08:08:47] Writing local files
[08:08:47] Completed 100000 out of 5000000 steps  (2 percent)
  ***  snip  ***
[21:31:10] Writing local files
[21:31:10] Completed 4950000 out of 5000000 steps  (99 percent)
[21:39:22] Writing local files
[21:39:22] Completed 5000000 out of 5000000 steps  (100 percent)
[21:39:22] Writing final coordinates.
[21:39:22] Past main M.D. loop
[21:39:22] Will end MPI now
[21:40:22] 
[21:40:22] Finished Work Unit:
[21:40:22] - Reading up to 516624 from "work/wudata_09.arc": Read 516624
[21:40:22] - Reading up to 972092 from "work/wudata_09.xtc": Read 972092
[21:40:22] goefile size: 0
[21:40:22] logfile size: 136718
[21:40:22] Leaving Run
[21:40:25] - Writing 1826478 bytes of core data to disk...
[21:40:25]   ... Done.
[21:40:25] - Shutting down core
[21:40:25] 
[21:40:25] Folding@home Core Shutdown: FINISHED_UNIT
[21:40:30] CoreStatus = 64 (100)
[21:40:30] Unit 9 finished with 84 percent of time to deadline remaining.
[21:40:30] Updated performance fraction: 0.838856
[21:40:30] Sending work to server


[21:40:30] + Attempting to send results
[21:40:30] - Reading file work/wuresults_09.dat from core
[21:40:30]   (Read 1826478 bytes from disk)
[21:40:30] Connecting to http://171.64.65.63:8080/
[21:41:14] Posted data.
[21:41:14] Initial: 0000; - Uploaded at ~39 kB/s
[21:41:15] - Averaged speed for that direction ~42 kB/s
[21:41:15] + Results successfully sent
[21:41:15] Thank you for your contribution to Folding@Home.
[21:41:15] + Number of Units Completed: 329

[21:45:19] - Warning: Could not delete all work unit files (9): Core returned invalid code
[21:45:19] Trying to send all finished work units
[21:45:19] + No unsent completed units remaining.
[21:45:19] - Preparing to get new work unit...
[21:45:19] + Attempting to get work packet
[21:45:19] - Will indicate memory of 1536 MB
[21:45:19] - Connecting to assignment server
[21:45:19] Connecting to http://assign.stanford.edu:8080/
[21:45:19] Posted data.
[21:45:19] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[21:45:19] + News From Folding@Home: Welcome to Folding@Home
[21:45:19] Loaded queue successfully.
[21:45:19] Connecting to http://171.64.65.63:8080/
[21:45:22] Posted data.
[21:45:22] Initial: 0000; - Receiving payload (expected size: 1639537)
[21:45:26] - Downloaded at ~400 kB/s
[21:45:26] - Averaged speed for that direction ~421 kB/s
[21:45:26] + Received work.
[21:45:26] Trying to send all finished work units
[21:45:26] + No unsent completed units remaining.
[21:45:26] + Closed connections
[21:45:26] 
[21:45:26] + Processing work unit
[21:45:26] Core required: FahCore_a1.exe
[21:45:26] Core found.
[21:45:26] Working on Unit 00 [April 13 21:45:26]
[21:45:26] + Working ...
[21:45:26] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 00 -checkpoint 15 -forceasm -verbose -lifeline 3265 -version 602'

[21:45:26] 
[21:45:26] *------------------------------*
[21:45:26] Folding@Home Gromacs SMP Core
[21:45:26] Version 1.74 (November 27, 2006)
[21:45:26] 
[21:45:26] Preparing to commence simulation
[21:45:26] - Ensuring status. Please wait.
[21:45:43] - Assembly optimizations manually forced on.
[21:45:43] - Not checking prior termination.
[21:45:43] - Expanded 1639025 -> 9524377 (decompressed 581.1 percent)
[21:45:43] - Starting from initial work packet
[21:45:43] 
[21:45:43] Project: 3065 (Run 1, Clone 54, Gen 17)
[21:45:43] 
[21:45:43] Assembly optimizations on if available.
[21:45:43] Entering M.D.
[21:45:49] Protein: 66728 p3065_Protein: 66728 p3065_lambda5_99sb_bigExtra SSE boost OK.
[21:45:49] 
[21:45:49] Extra SSE boost OK.
[21:45:49] Writing local files
[21:45:50] Completed 0 out of 2500000 steps  (0 percent)

Perhaps this is more likely a server issue than a problem with the core?

Greg