Project: 3062 (Run 1, Clone 5, Gen 56) 0x0 error

Moderators: Site Moderators, FAHC Science Team

parkut
Posts: 364
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Project: 3062 (Run 1, Clone 5, Gen 56) 0x0 error

Post by parkut »

Please let me know if reporting these failures is useful.

This WU failed three times in a row at 27%, after which the client downloaded a new FahCore_a1.exe. Elsewhere on this forum, I read that someone had success completing work units by restarting before the failure point, so at 2% into the fourth attempt I halted fah6 and resumed. The fourth attempt then failed at 10%, at which point the client seems to have given up and fetched a different WU, which ran to completion.

Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz
cpu MHz : 2394.000
cache size : 4096 KB
Memory: 975.96 MB physical, 1.94 GB virtual

Selections from the log file (in reverse order, newest entries first)

[12:21:03] + Number of Units Completed: 51
[12:21:03] Thank you for your contribution to Folding@Home.
[12:21:03] + Results successfully sent
[12:20:26] Sending work to server
[12:20:26] Updated performance fraction: 0.779774
[12:20:26] Unit 9 finished with 80 percent of time to deadline remaining.
[12:20:26] CoreStatus = 64 (100)
[12:20:22] Folding@home Core Shutdown: FINISHED_UNIT
[12:19:21] Will end MPI now
[12:19:21] Past main M.D. loop
[12:19:21] Writing final coordinates.
[12:19:21] Completed 5000000 out of 5000000 steps (100 percent)
[19:16:34] Project: 3062 (Run 3, Clolone 1, Gen 71)

==== snip ====

[19:11:30] Deleting current work unit & continuing...
[19:11:30] Client-core communications error: ERROR 0x0
[19:11:30] CoreStatus = 0 (0)
[19:11:26] Warning: long 1-4 interactions
[19:08:56] Completed 500000 out of 5000000 steps (10 percent)
[19:08:56] Writing local files
[18:58:39] Completed 450000 out of 5000000 steps (9 percent)
[18:58:39] Writing local files
[18:48:25] Completed 400000 out of 5000000 steps (8 percent)
[18:48:25] Writing local files
[18:38:06] Completed 350000 out of 5000000 steps (7 percent)
[18:38:06] Writing local files
[18:27:57] Completed 300000 out of 5000000 steps (6 percent)
[18:27:57] Writing local files
[18:17:47] Completed 250000 out of 5000000 steps (5 percent)
[18:17:47] Writing local files
[18:07:34] Completed 200000 out of 5000000 steps (4 percent)
[18:07:34] Writing local files
[17:57:16] Completed 150000 out of 5000000 steps (3 percent)
[17:57:15] Writing local files
[17:46:59] Extra SSE boost OK.
[17:46:59] 000 steps (2 percent)
[17:46:59] Writing local fProteCompleted 100000 out of 5000000 steps (2 pCompleteExtra SSE boost OK.
[17:46:59] otein: p3062_lambda5_99sb
[17:46:59] .
[17:46:53] Entering M.D.
[17:46:53]
[17:46:53] lone 5, Gen 56)
[17:46:53] Project: Entering M.D.
[17:46:53] Project: 3062 (Run 1, Clone 5,
[17:46:53] 5 (d- Expanded 607001 -> 326604
[17:46:36] Entering M.D.
[17:46:36] Assembly optimizations on if available.
[17:46:36]
[17:46:36] Project: 3062 (Run 1, Clone 5, Gen 56)
[17:46:36]
[17:46:36] - Ensuring status. Please wait.
[17:46:36] Preparing to commence simulation
[17:46:36]
[17:46:36] Version 1.74 (November 27, 2006)
[17:46:36] Folding@Home Gromacs SMP Core
[17:46:36] *------------------------------*
[17:46:36]

[17:46:36] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 27013 -version 601'
[17:46:36] + Working ...
[17:46:36] Working on Unit 08 [March 6 17:46:36]
[17:46:36] - Autosend completed
[17:46:36] + No unsent completed units remaining.
[17:46:36] Trying to send all finished work units
[17:46:36] - Autosending finished units...
[17:46:36] Core found.
[17:46:36] Core required: FahCore_a1.exe
[17:46:36] + Processing work unit
[17:46:36]
[17:46:36] Loaded queue successfully.
[17:46:35]
[17:46:35] - Machine ID: 1
[17:46:35] - User ID: 6F59A7931F7298EF
[17:46:35] - User name: parkut (Team 4)
[17:46:35] - Ask before connecting: No

Arguments: -verbosity 9 -smp
Executable: ./fah6
Launch directory: /root/fah6

###############################################################################
###############################################################################

http://folding.stanford.edu

Folding@Home Client Version 6.01beta2

###############################################################################
# SMP Client ##################################################################

--- Opening Log file [March 6 17:46:35]

4 cores detected

Folding@Home Client Shutdown.

[17:46:05] Killing all core threads
[17:46:05] ***** Got a SIGTERM signal (15)
[17:38:59] Completed 100000 out of 5000000 steps (2 percent)
[17:38:59] Writing local files
[17:28:48] Completed 50000 out of 5000000 steps (1 percent)
[17:28:48] Writing local files
[17:18:27] Completed 0 out of 5000000 steps (0 percent)
[17:18:27] Writing local files
[17:18:27] Extra SSE boost OK.
[17:18:27]
[17:18:27] t OK.
[17:18:20] Entering M.D.
[17:18:20]
[17:18:20] lone 5, Gen 56)
[17:18:20] Entering M.D.
[17:18:20]
[17:18:20] lone 5, Gen 56)
[17:18:20] Project: Entering M.D.
[17:18:20] - Previous termination of core w- Expanded 607001 -> 3266045 (d- Expanded 607001 -> 326604- Starting from initial work pa- Starting from initial work pa- Sta
[17:18:20] - Working with standard loops on this execution.
[17:18:20] - Looking at optimizations...
[17:18:03] - Ensuring status. Please wait.
[17:18:03] Preparing to commence simulation
[17:18:03]
[17:18:03] Version 1.74 (November 27, 2006)
[17:18:03] Folding@Home Gromacs SMP Core
[17:18:03] *------------------------------*
[17:18:03]

[17:18:03] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -verbose -lifeline 13946 -version 601'
[17:18:03] + Working ...
[17:18:03] Working on Unit 08 [March 6 17:18:03]
[17:18:03] Core found.
[17:18:03] Core required: FahCore_a1.exe
[17:18:03] + Processing work unit
[17:18:03]
[17:17:58] + Closed connections
[17:17:58] + Received work.
[17:17:58] - Averaged speed for that direction ~96 kB/s
[17:17:58] - Downloaded at ~84 kB/s
[17:17:51] Initial: 0000; - Receiving payload (expected size: 607513)
[17:17:51] Posted data.
[17:17:50] Connecting to http://171.64.65.63:8080/
[17:17:50] Loaded queue successfully.
[17:17:49] + News From Folding@Home: Welcome to Folding@Home
[17:17:49] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[17:17:49] Posted data.
[17:17:49] Connecting to http://assign.stanford.edu:8080/
[17:17:49] - Connecting to assignment server
[17:17:49] - Will indicate memory of 975 MB
[17:17:49] + Attempting to get work packet
[17:17:49] - Preparing to get new work unit...
[17:17:49] + No unsent completed units remaining.
[17:17:49] Trying to send all finished work units
[17:17:49] - Warning: Could not delete all work unit files (7): Core returned invalid code
[17:13:28] Deleting current work unit & continuing...
[17:13:28] + Core successfully engaged
[17:13:28] Decompressed FahCore_a1.exe (3625104 bytes) successfully
[17:13:27] Trying to unzip core FahCore_a1.exe
[17:13:27]
[17:13:27] Signature is VALID
[17:13:27] Verifying core Core_a1.fah...
[17:13:27] Initial: D4FE; + 1490945 bytes downloaded
[17:13:27] Initial: 3A56; + 1484800 bytes downloaded
[17:13:27] Initial: 8542; + 1474560 bytes downloaded
[17:13:27] Initial: 789E; + 1464320 bytes downloaded
[17:13:27] Initial: 21E9; + 1454080 bytes downloaded
[17:13:27] Initial: B2A3; + 1443840 bytes downloaded
[17:13:27] Initial: 1282; + 1433600 bytes downloaded
[17:13:26] Initial: 4800; + 1423360 bytes downloaded
[17:13:26] Initial: 755F; + 1413120 bytes downloaded
[17:13:26] Initial: 3131; + 1402880 bytes downloaded
[17:13:26] Initial: 0FEB; + 1392640 bytes downloaded
[17:13:26] Initial: 7D25; + 1382400 bytes downloaded
[17:13:26] Initial: AEB7; + 1372160 bytes downloaded
[17:13:26] Initial: 30B7; + 1361920 bytes downloaded
[17:13:26] Initial: 924C; + 1351680 bytes downloaded
[17:13:26] Initial: 3596; + 1341440 bytes downloaded
[17:13:26] Initial: BBE3; + 1331200 bytes downloaded
[17:13:26] Initial: E5AF; + 1320960 bytes downloaded
[17:13:26] Initial: 9AAC; + 1310720 bytes downloaded
[17:13:26] Initial: C1E7; + 1300480 bytes downloaded
[17:13:26] Initial: 756D; + 1290240 bytes downloaded
[17:13:26] Initial: 0896; + 1280000 bytes downloaded
[17:13:26] Initial: AB37; + 1269760 bytes downloaded
[17:13:26] Initial: 3023; + 1259520 bytes downloaded
[17:13:26] Initial: F54E; + 1249280 bytes downloaded
[17:13:26] Initial: FBF2; + 1239040 bytes downloaded
[17:13:26] Initial: FA59; + 1228800 bytes downloaded
[17:13:26] Initial: A7F3; + 1218560 bytes downloaded
[17:13:26] Initial: 471E; + 1208320 bytes downloaded
[17:13:26] Initial: 0848; + 1198080 bytes downloaded
[17:13:26] Initial: 7654; + 1187840 bytes downloaded
[17:13:26] Initial: A564; + 1177600 bytes downloaded
[17:13:26] Initial: A7D8; + 1167360 bytes downloaded
[17:13:26] Initial: FCB5; + 1157120 bytes downloaded
[17:13:25] Initial: 3E65; + 1146880 bytes downloaded
[17:13:25] Initial: 0057; + 1136640 bytes downloaded
[17:13:25] Initial: D284; + 1126400 bytes downloaded
[17:13:25] Initial: ED39; + 1116160 bytes downloaded
[17:13:25] Initial: ADFC; + 1105920 bytes downloaded
[17:13:25] Initial: 338A; + 1095680 bytes downloaded
[17:13:25] Initial: 0943; + 1085440 bytes downloaded
[17:13:25] Initial: 28DC; + 1075200 bytes downloaded
[17:13:25] Initial: A5EF; + 1064960 bytes downloaded
[17:13:25] Initial: C5B8; + 1054720 bytes downloaded
[17:13:25] Initial: D9BC; + 1044480 bytes downloaded
[17:13:25] Initial: 4B95; + 1034240 bytes downloaded
[17:13:25] Initial: 3030; + 1024000 bytes downloaded
[17:13:25] Initial: 3B6B; + 1013760 bytes downloaded
[17:13:25] Initial: 84AC; + 1003520 bytes downloaded
[17:13:25] Initial: 3EEE; + 993280 bytes downloaded
[17:13:25] Initial: 96E5; + 983040 bytes downloaded
[17:13:25] Initial: 1083; + 972800 bytes downloaded
[17:13:25] Initial: 2070; + 962560 bytes downloaded
[17:13:25] Initial: 1050; + 952320 bytes downloaded
[17:13:25] Initial: E495; + 942080 bytes downloaded
[17:13:25] Initial: 6BC9; + 931840 bytes downloaded
[17:13:25] Initial: 2945; + 921600 bytes downloaded
[17:13:25] Initial: FA4E; + 911360 bytes downloaded
[17:13:25] Initial: 83FC; + 901120 bytes downloaded
[17:13:25] Initial: EB07; + 890880 bytes downloaded
[17:13:24] Initial: 4811; + 880640 bytes downloaded
[17:13:24] Initial: 5298; + 870400 bytes downloaded
[17:13:24] Initial: 3BEA; + 860160 bytes downloaded
[17:13:24] Initial: 83A3; + 849920 bytes downloaded
[17:13:24] Initial: A12A; + 839680 bytes downloaded
[17:13:24] Initial: A625; + 829440 bytes downloaded
[17:13:24] Initial: FAC6; + 819200 bytes downloaded
[17:13:23] Initial: E268; + 808960 bytes downloaded
[17:13:23] Initial: 4D75; + 798720 bytes downloaded
[17:13:23] Initial: 4B97; + 788480 bytes downloaded
[17:13:23] Initial: E5E1; + 778240 bytes downloaded
[17:13:23] Initial: EE35; + 768000 bytes downloaded
[17:13:23] Initial: 1110; + 757760 bytes downloaded
[17:13:23] Initial: 7681; + 747520 bytes downloaded
[17:13:23] Initial: CDA1; + 737280 bytes downloaded
[17:13:23] Initial: 130C; + 727040 bytes downloaded
[17:13:23] Initial: 9BD0; + 716800 bytes downloaded
[17:13:23] Initial: 8D45; + 706560 bytes downloaded
[17:13:23] Initial: C563; + 696320 bytes downloaded
[17:13:23] Initial: EA59; + 686080 bytes downloaded
[17:13:23] Initial: 8D4D; + 675840 bytes downloaded
[17:13:23] Initial: 6C54; + 665600 bytes downloaded
[17:13:23] Initial: 89C7; + 655360 bytes downloaded
[17:13:23] Initial: 3C06; + 645120 bytes downloaded
[17:13:23] Initial: 2BC3; + 634880 bytes downloaded
[17:13:23] Initial: 0F31; + 624640 bytes downloaded
[17:13:22] Initial: 5A89; + 614400 bytes downloaded
[17:13:22] Initial: 0B15; + 604160 bytes downloaded
[17:13:22] Initial: CAC0; + 593920 bytes downloaded
[17:13:22] Initial: 515D; + 583680 bytes downloaded
[17:13:22] Initial: 49ED; + 573440 bytes downloaded
[17:13:22] Initial: 9058; + 563200 bytes downloaded
[17:13:22] Initial: 9672; + 552960 bytes downloaded
[17:13:22] Initial: 66F9; + 542720 bytes downloaded
[17:13:22] Initial: 525A; + 532480 bytes downloaded
[17:13:22] Initial: 3113; + 522240 bytes downloaded
[17:13:21] Initial: 46B2; + 512000 bytes downloaded
[17:13:21] Initial: 31B5; + 501760 bytes downloaded
[17:13:21] Initial: 11F4; + 491520 bytes downloaded
[17:13:21] Initial: 0206; + 481280 bytes downloaded
[17:13:21] Initial: B1D5; + 471040 bytes downloaded
[17:13:21] Initial: 33AA; + 460800 bytes downloaded
[17:13:21] Initial: 9B7A; + 450560 bytes downloaded
[17:13:21] Initial: 414B; + 440320 bytes downloaded
[17:13:21] Initial: CAFE; + 430080 bytes downloaded
[17:13:21] Initial: 2869; + 419840 bytes downloaded
[17:13:21] Initial: 90E1; + 409600 bytes downloaded
[17:13:20] Initial: A818; + 399360 bytes downloaded
[17:13:20] Initial: BF10; + 389120 bytes downloaded
[17:13:20] Initial: 6A85; + 378880 bytes downloaded
[17:13:20] Initial: B0AA; + 368640 bytes downloaded
[17:13:20] Initial: B290; + 358400 bytes downloaded
[17:13:20] Initial: 611B; + 348160 bytes downloaded
[17:13:20] Initial: BB3B; + 337920 bytes downloaded
[17:13:20] Initial: 91D7; + 327680 bytes downloaded
[17:13:20] Initial: 9D5D; + 317440 bytes downloaded
[17:13:20] Initial: B97B; + 307200 bytes downloaded
[17:13:20] Initial: DE6D; + 296960 bytes downloaded
[17:13:20] Initial: 820A; + 286720 bytes downloaded
[17:13:20] Initial: EA6B; + 276480 bytes downloaded
[17:13:20] Initial: 6E3D; + 266240 bytes downloaded
[17:13:19] Initial: 6400; + 256000 bytes downloaded
[17:13:19] Initial: AAA5; + 245760 bytes downloaded
[17:13:19] Initial: 9F05; + 235520 bytes downloaded
[17:13:19] Initial: 8193; + 225280 bytes downloaded
[17:13:19] Initial: A37E; + 215040 bytes downloaded
[17:13:19] Initial: AE1E; + 204800 bytes downloaded
[17:13:19] Initial: 4C66; + 194560 bytes downloaded
[17:13:19] Initial: AEEC; + 184320 bytes downloaded
[17:13:19] Initial: 237E; + 174080 bytes downloaded
[17:13:19] Initial: DB18; + 163840 bytes downloaded
[17:13:19] Initial: 221C; + 153600 bytes downloaded
[17:13:19] Initial: CD6C; + 143360 bytes downloaded
[17:13:19] Initial: 5EBD; + 133120 bytes downloaded
[17:13:19] Initial: C249; + 122880 bytes downloaded
[17:13:18] Initial: 1B1E; + 112640 bytes downloaded
[17:13:18] Initial: 820B; + 102400 bytes downloaded
[17:13:18] Initial: F7AC; + 92160 bytes downloaded
[17:13:18] Initial: D218; + 81920 bytes downloaded
[17:13:17] Initial: 3141; + 71680 bytes downloaded
[17:13:17] Initial: EBA8; + 61440 bytes downloaded
[17:13:17] Initial: C6C3; + 51200 bytes downloaded
[17:13:17] Initial: 9F08; + 40960 bytes downloaded
[17:13:17] Initial: D6C2; + 30720 bytes downloaded
[17:13:17] Initial: B54E; + 20480 bytes downloaded
[17:13:17] Initial: AFDE; + 10240 bytes downloaded
[17:13:16] Downloading core (/~pande/Linux/x86//Core_a1.fah from http://www.stanford.edu)
[17:13:16] + Downloading new core: FahCore_a1.exe
[17:13:16] - Attempting to download new core...
[17:13:16] Client-core communications error: ERROR 0x0
[17:13:16] CoreStatus = 0 (0)
[17:13:12] Warning: long 1-4 interactions
[17:08:39] Completed 1350000 out of 5000000 steps (27 percent)
[17:08:39] Writing local files
[16:58:30] Completed 1300000 out of 5000000 steps (26 percent)
[16:58:30] Writing local files
[16:48:14] Completed 1250000 out of 5000000 steps (25 percent)
[16:48:14] Writing local files
[16:37:58] Completed 1200000 out of 5000000 steps (24 percent)
[16:37:58] Writing local files
[16:27:43] Completed 1150000 out of 5000000 steps (23 percent)
[16:27:43] Writing local files
[16:17:29] Completed 1100000 out of 5000000 steps (22 percent)
[16:17:29] Writing local files
[16:07:15] Completed 1050000 out of 5000000 steps (21 percent)
[16:07:15] Writing local files
[15:57:02] Completed 1000000 out of 5000000 steps (20 percent)
[15:57:02] Writing local files
[15:46:46] Completed 950000 out of 5000000 steps (19 percent)
[15:46:46] Writing local files
[15:36:37] Completed 900000 out of 5000000 steps (18 percent)
[15:36:37] Writing local files
[15:26:21] Completed 850000 out of 5000000 steps (17 percent)
[15:26:21] Writing local files
[15:16:07] Completed 800000 out of 5000000 steps (16 percent)
[15:16:07] Writing local files
[15:05:54] Completed 750000 out of 5000000 steps (15 percent)
[15:05:54] Writing local files
[14:55:42] Completed 700000 out of 5000000 steps (14 percent)
[14:55:42] Writing local files
[14:45:37] Completed 650000 out of 5000000 steps (13 percent)
[14:45:37] Writing local files
[14:35:21] Completed 600000 out of 5000000 steps (12 percent)
[14:35:20] Writing local files
[14:25:09] Completed 550000 out of 5000000 steps (11 percent)
[14:25:09] Writing local files
[14:14:51] Completed 500000 out of 5000000 steps (10 percent)
[14:14:51] Writing local files
[14:04:42] Completed 450000 out of 5000000 steps (9 percent)
[14:04:42] Writing local files
[13:54:30] Completed 400000 out of 5000000 steps (8 percent)
[13:54:30] Writing local files
[13:44:13] Completed 350000 out of 5000000 steps (7 percent)
[13:44:13] Writing local files
[13:33:57] Completed 300000 out of 5000000 steps (6 percent)
[13:33:57] Writing local files
[13:23:42] Completed 250000 out of 5000000 steps (5 percent)
[13:23:42] Writing local files
[13:13:27] Completed 200000 out of 5000000 steps (4 percent)
[13:13:27] Writing local files
[13:03:08] Completed 150000 out of 5000000 steps (3 percent)
[13:03:08] Writing local files
[12:52:53] Completed 100000 out of 5000000 steps (2 percent)
[12:52:53] Writing local files
[12:43:17] - Autosend completed
[12:43:17] + No unsent completed units remaining.
[12:43:17] Trying to send all finished work units
[12:43:17] - Autosending finished units...
[12:42:29] Completed 50000 out of 5000000 steps (1 percent)
[12:42:29] Writing local files
[12:32:14] Completed 0 out of 5000000 steps (0 percent)
[12:32:14] Writing local files
[12:32:14] Extra SSE boost OK.
[12:32:14]
[12:32:14] ambda5_99sbExtra SSE boost OK.
[12:32:14] a SSE boost OK.
[12:32:14] Rejecting checkpoint
[12:32:08] Entering M.D.
[12:32:08]
[12:32:08] lone 5, Gen 56)
[12:32:08] Project: Entering M.D.
[12:32:08]
[12:32:08] packet
[12:32:08] Project: 3062 (Run 1, Clo- Starting fromEntering M.D.
[12:32:08]
[12:32:08] - Starting from initial work packet
[12:32:08] - Previous termination of core w- Expanded 607001 -> 3266045 (decompressed 538.0 percent)
[12:32:08] - Working with standard loops on this execution.
[12:32:08] - Looking at optimizations...
[12:31:51] - Ensuring status. Please wait.
[12:31:51] Preparing to commence simulation
[12:31:51]
[12:31:51] Version 1.74 (November 27, 2006)
[12:31:51] Folding@Home Gromacs SMP Core
[12:31:51] *------------------------------*
[12:31:51]

[12:31:50] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 07 -checkpoint 15 -verbose -lifeline 13946 -version 601'
[12:31:50] + Working ...
[12:31:50] Working on Unit 07 [March 6 12:31:50]
[12:31:50] Core found.
[12:31:50] Core required: FahCore_a1.exe
[12:31:50] + Processing work unit
[12:31:50]
[12:31:45] + Closed connections
[12:31:45] + Received work.
[12:31:45] - Averaged speed for that direction ~99 kB/s
[12:31:45] - Downloaded at ~65 kB/s
[12:31:36] Initial: 0000; - Receiving payload (expected size: 607513)
[12:31:36] Posted data.
[12:31:34] Connecting to http://171.64.65.63:8080/
[12:31:34] Loaded queue successfully.
[12:31:34] + News From Folding@Home: Welcome to Folding@Home
[12:31:34] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[12:31:34] Posted data.
[12:31:34] Connecting to http://assign.stanford.edu:8080/
[12:31:34] - Connecting to assignment server
[12:31:34] - Will indicate memory of 975 MB
[12:31:34] + Attempting to get work packet
[12:31:34] - Preparing to get new work unit...
[12:31:34] + No unsent completed units remaining.
[12:31:34] Trying to send all finished work units
[12:31:34] - Warning: Could not delete all work unit files (6): Core returned invalid code
[12:27:13] Deleting current work unit & continuing...
[12:27:13] Client-core communications error: ERROR 0x0
[12:27:13] CoreStatus = 0 (0)

or (at your option) any later version.
by the Free Software Foundation; either version 2 of the License,
the terms of the GNU General Public License (GPL) as published
you can download a free version of Gromacs under
are interested in using Gromacs, visit http://www.gromacs.org where
specially granted to Stanford by the copyright holders. If you
a special license (see http://folding.stanford.edu/gromacs.html)
This inclusion of Gromacs code in the Folding@Home Core is under

check out http://www.gromacs.org for more information.
Copyright (c) 2001-2004, The GROMACS development team,
Copyright (c) 1991-2000, University of Groningen, The Netherlands.
Written by David van der Spoel, Erik Lindahl, Berk Hess, and others.
[12:27:09] Warning: long 1-4 interactions
[12:22:39] Completed 1350000 out of 5000000 steps (27 percent)
[12:22:39] Writing local files
[12:12:24] Completed 1300000 out of 5000000 steps (26 percent)
[12:12:24] Writing local files
[12:02:07] Completed 1250000 out of 5000000 steps (25 percent)
[12:02:07] Writing local files
[11:51:52] Completed 1200000 out of 5000000 steps (24 percent)
[11:51:52] Writing local files
[11:41:42] Completed 1150000 out of 5000000 steps (23 percent)
[11:41:42] Writing local files
[11:31:33] Completed 1100000 out of 5000000 steps (22 percent)
[11:31:33] Writing local files
[11:21:20] Completed 1050000 out of 5000000 steps (21 percent)
[11:21:20] Writing local files
[11:11:09] Completed 1000000 out of 5000000 steps (20 percent)
[11:11:09] Writing local files
[11:00:54] Completed 950000 out of 5000000 steps (19 percent)
[11:00:54] Writing local files
[10:50:43] Completed 900000 out of 5000000 steps (18 percent)
[10:50:43] Writing local files
[10:40:35] Completed 850000 out of 5000000 steps (17 percent)
[10:40:35] Writing local files
[10:30:31] Completed 800000 out of 5000000 steps (16 percent)
[10:30:31] Writing local files
[10:20:18] Completed 750000 out of 5000000 steps (15 percent)
[10:20:18] Writing local files
[10:10:04] Completed 700000 out of 5000000 steps (14 percent)
[10:10:04] Writing local files
[09:59:58] Completed 650000 out of 5000000 steps (13 percent)
[09:59:58] Writing local files
[09:49:43] Completed 600000 out of 5000000 steps (12 percent)
[09:49:43] Writing local files
[09:39:30] Completed 550000 out of 5000000 steps (11 percent)
[09:39:30] Writing local files
[09:29:16] Completed 500000 out of 5000000 steps (10 percent)
[09:29:16] Writing local files
[09:19:06] Completed 450000 out of 5000000 steps (9 percent)
[09:19:06] Writing local files
[09:08:50] Completed 400000 out of 5000000 steps (8 percent)
[09:08:50] Writing local files
[08:58:43] Completed 350000 out of 5000000 steps (7 percent)
[08:58:43] Writing local files
[08:48:34] Completed 300000 out of 5000000 steps (6 percent)
[08:48:34] Writing local files
[08:38:22] Completed 250000 out of 5000000 steps (5 percent)
[08:38:22] Writing local files
[08:28:10] Completed 200000 out of 5000000 steps (4 percent)
[08:28:10] Writing local files
[08:18:04] Completed 150000 out of 5000000 steps (3 percent)
[08:18:04] Writing local files
[08:07:49] Completed 100000 out of 5000000 steps (2 percent)
[08:07:49] Writing local files
[07:57:30] Completed 50000 out of 5000000 steps (1 percent)
[07:57:30] Writing local files
[07:47:15] Completed 0 out of 5000000 steps (0 percent)
[07:47:15] Writing local files
[07:47:15] Extra SSE boost OK.
[07:47:15] files
[07:47:09] Entering M.D.
[07:47:09]
[07:47:09] lone 5, Gen 56)
[07:47:09] Project: Entering M.D.
[07:47:09]
[07:47:09] - Previous termination of core w- Expanded 607001 -> 3266045 (d- Expanded 607001 -> 326604- Starting from initial work pa- Starting from initial work pa- Starting from initial work packet
[07:47:09] - Working with standard loops on this execution.
[07:47:09] - Looking at optimizations...
[07:46:52] - Ensuring status. Please wait.
[07:46:52] Preparing to commence simulation
[07:46:52]
[07:46:52] Version 1.74 (November 27, 2006)
[07:46:52] Folding@Home Gromacs SMP Core
[07:46:52] *------------------------------*
[07:46:52]

[07:46:52] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 06 -checkpoint 15 -verbose -lifeline 13946 -version 601'
[07:46:52] + Working ...
[07:46:52] Working on Unit 06 [March 6 07:46:52]
[07:46:52] Core found.
[07:46:52] Core required: FahCore_a1.exe
[07:46:52] + Processing work unit
[07:46:52]
[07:46:47] + Closed connections
[07:46:47] + Received work.
[07:46:47] - Averaged speed for that direction ~107 kB/s
[07:46:47] - Downloaded at ~59 kB/s
[07:46:37] Initial: 0000; - Receiving payload (expected size: 607513)
[07:46:37] Posted data.
[07:46:36] Connecting to http://171.64.65.63:8080/
[07:46:36] Loaded queue successfully.
[07:46:36] + News From Folding@Home: Welcome to Folding@Home
[07:46:36] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[07:46:36] Posted data.
[07:46:35] Connecting to http://assign.stanford.edu:8080/
[07:46:35] - Connecting to assignment server
[07:46:35] - Will indicate memory of 975 MB
[07:46:35] + Attempting to get work packet
[07:46:35] - Preparing to get new work unit...
[07:46:35] + No unsent completed units remaining.
[07:46:35] Trying to send all finished work units
[07:46:35] - Warning: Could not delete all work unit files (5): Core returned invalid code
[07:42:09] Deleting current work unit & continuing...
[07:42:09] Client-core communications error: ERROR 0x0
[07:42:09] CoreStatus = 0 (0)
[07:42:05] Warning: long 1-4 interactions
[07:37:33] Completed 1350000 out of 5000000 steps (27 percent)
[07:37:33] Writing local files
[07:27:19] Completed 1300000 out of 5000000 steps (26 percent)
[07:27:19] Writing local files
[07:17:10] Completed 1250000 out of 5000000 steps (25 percent)
[07:17:10] Writing local files
[07:07:00] Completed 1200000 out of 5000000 steps (24 percent)
[07:07:00] Writing local files
[06:56:47] Completed 1150000 out of 5000000 steps (23 percent)
[06:56:47] Writing local files
[06:46:27] Completed 1100000 out of 5000000 steps (22 percent)
[06:46:27] Writing local files
[06:43:17] - Autosend completed
[06:43:17] + No unsent completed units remaining.
[06:43:17] Trying to send all finished work units
[06:43:17] - Autosending finished units...
[06:36:11] Completed 1050000 out of 5000000 steps (21 percent)
[06:36:11] Writing local files
[06:25:53] Completed 1000000 out of 5000000 steps (20 percent)
[06:25:53] Writing local files
[06:15:37] Completed 950000 out of 5000000 steps (19 percent)
[06:15:37] Writing local files
[06:05:26] Completed 900000 out of 5000000 steps (18 percent)
[06:05:25] Writing local files
[05:55:13] Completed 850000 out of 5000000 steps (17 percent)
[05:55:13] Writing local files
[05:45:06] Completed 800000 out of 5000000 steps (16 percent)
[05:45:06] Writing local files
[05:34:55] Completed 750000 out of 5000000 steps (15 percent)
[05:34:55] Writing local files
[05:24:41] Completed 700000 out of 5000000 steps (14 percent)
[05:24:41] Writing local files
[05:14:31] Completed 650000 out of 5000000 steps (13 percent)
[05:14:31] Writing local files
[05:04:20] Completed 600000 out of 5000000 steps (12 percent)
[05:04:20] Writing local files
[04:54:07] Completed 550000 out of 5000000 steps (11 percent)
[04:54:07] Writing local files
[04:43:51] Completed 500000 out of 5000000 steps (10 percent)
[04:43:51] Writing local files
[04:33:43] Completed 450000 out of 5000000 steps (9 percent)
[04:33:43] Writing local files
[04:23:29] Completed 400000 out of 5000000 steps (8 percent)
[04:23:29] Writing local files
[04:13:14] Completed 350000 out of 5000000 steps (7 percent)
[04:13:14] Writing local files
[04:03:00] Completed 300000 out of 5000000 steps (6 percent)
[04:02:59] Writing local files
[03:52:46] Completed 250000 out of 5000000 steps (5 percent)
[03:52:46] Writing local files
[03:42:22] Completed 200000 out of 5000000 steps (4 percent)
[03:42:22] Writing local files
[03:32:07] Completed 150000 out of 5000000 steps (3 percent)
[03:32:07] Writing local files
[03:21:50] Completed 100000 out of 5000000 steps (2 percent)
[03:21:50] Writing local files
[03:11:34] Completed 50000 out of 5000000 steps (1 percent)
[03:11:34] les
[03:01:19] a SSE boost OK.
[03:01:19] Completed 0 out of 5000000 steps (0 percent)
[03:01:19] files
[03:01:12] Entering M.D.
[03:01:12]
[03:01:12] lone 5, Gen 56)
[03:01:12] Project: Entering M.D.
[03:01:12]
[03:01:12] tial work pa- Starting from initial work packet
[03:00:56] Entering M.D.
[03:00:56] Assembly optimizations on if available.
[03:00:56]
[03:00:56] Project: 3062 (Run 1, Clone 5, Gen 56)
[03:00:56]
[03:00:56] - Starting from initial work packet
[03:00:55] - Expanded 607001 -> 3266045 (decompressed 538.0 percent)
[03:00:55] - Ensuring status. Please wait.
[03:00:55] Preparing to commence simulation
[03:00:55]
[03:00:55] Version 1.74 (November 27, 2006)
[03:00:55] Folding@Home Gromacs SMP Core
[03:00:55] *------------------------------*
[03:00:55]

[03:00:55] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 05 -checkpoint 15 -verbose -lifeline 13946 -version 601'
[03:00:55] + Working ...
[03:00:55] Working on Unit 05 [March 6 03:00:55]
[03:00:55] Core found.
[03:00:55] Core required: FahCore_a1.exe
[03:00:55] + Processing work unit
[03:00:55]
[03:00:55] + Closed connections
[03:00:55] + No unsent completed units remaining.
[03:00:55] Trying to send all finished work units
[03:00:55] + Received work.
[03:00:55] - Averaged speed for that direction ~119 kB/s
[03:00:55] - Downloaded at ~148 kB/s
[03:00:51] Initial: 0000; - Receiving payload (expected size: 607513)
[03:00:51] Posted data.
[03:00:50] Connecting to http://171.64.65.63:8080/
[03:00:50] Loaded queue successfully.
[03:00:50] + News From Folding@Home: Welcome to Folding@Home
[03:00:50] Initial: 40AB; - Successful: assigned to (171.64.65.63).
[03:00:50] Posted data.
[03:00:49] Connecting to http://assign.stanford.edu:8080/
[03:00:49] - Connecting to assignment server
[03:00:49] - Will indicate memory of 975 MB
[03:00:49] + Attempting to get work packet
[03:00:49] - Preparing to get new work unit...
[03:00:49] + No unsent completed units remaining.
[03:00:49] Trying to send all finished work units
[03:00:49] - Warning: Could not delete all work unit files (4): Core returned invalid code

[02:56:40] + Number of Units Completed: 50
[02:56:40] Thank you for your contribution to Folding@Home.
[02:56:40] + Results successfully sent
[02:56:40] - Averaged speed for that direction ~58 kB/s
[02:56:40] Initial: 0000; - Uploaded at ~50 kB/s
[02:56:39] Posted data.
[02:56:05] Connecting to http://171.64.65.63:8080/
[02:56:05] (Read 1824951 bytes from disk)
[02:56:05] - Reading file work/wuresults_04.dat from core
[02:56:05] + Attempting to send results


[02:56:05] Sending work to server
[02:56:05] Updated performance fraction: 0.775522
[02:56:05] Unit 4 finished with 80 percent of time to deadline remaining.
[02:56:05] CoreStatus = 64 (100)
[02:56:00] Folding@home Core Shutdown: FINISHED_UNIT
[02:56:00]
[02:56:00] - Shutting down core
[02:56:00] ... Done.
[02:56:00] - Writing 1824951 bytes of core data to disk...
[02:55:59] Leaving Run
[02:55:59] logfile size: 136723
[02:55:59] goefile size: 0
[02:55:59] - Reading up to 970272 from "work/wudata_04.xtc": Read 970272
[02:55:59] - Reading up to 516912 from "work/wudata_04.arc": Read 516912
[02:55:59] Finished Work Unit:
[02:55:59]
[02:54:59] Will end MPI now
[02:54:59] Past main M.D. loop
[02:54:59] Writing final coordinates.
[02:54:59] Completed 5000000 out of 5000000 steps (100 percent)
[02:54:59] Writing local files
él Mero
Posts: 49
Joined: Sun Dec 02, 2007 1:14 pm

Re: Project: 3062 (Run 1, Clone 5, Gen 56) 0x0 error

Post by él Mero »

You may want to try what Bruce suggests here.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 3062 (Run 1, Clone 5, Gen 56) 0x0 error

Post by bruce »

parkut wrote: Please let me know if reporting these failures is useful.
It's really hard for me to read your FAHlog file because it's in the opposite order to what I expect, but once I figured that out, it's okay.
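
For anyone else who finds a newest-first log hard to follow, here is a minimal sketch of how a pasted log could be flipped back into chronological order. It assumes the paste is simply the original file with its lines in reverse, so reversing the lines again restores it; the function name `chronological` is only illustrative and is not part of any Folding@Home tool.

```python
def chronological(reversed_log_text):
    """Return the log text with its line order reversed.

    Assumes the input is a plain-text log pasted newest-first,
    one entry per line (an assumption about how the paste was made).
    """
    lines = reversed_log_text.splitlines()
    return "\n".join(reversed(lines))


if __name__ == "__main__":
    pasted = (
        "[12:27:13] Client-core communications error: ERROR 0x0\n"
        "[12:27:09] Warning: long 1-4 interactions\n"
        "[12:22:39] Completed 1350000 out of 5000000 steps (27 percent)"
    )
    print(chronological(pasted))
```

Multi-line blocks without timestamps (such as the Gromacs license notice) come out in the right order too, since they were reversed line by line along with everything else.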

As a general rule, Stanford does not need the failure reports. There is some value if the error is a new one and a discussion here can go into enough detail to figure out ways to correct the error. Obviously we've seen errors like this before, since you found the suggestion to restart a WU that is expected to hit "Warning: long 1-4 interactions". In this case, you confirmed that restarting CHANGES what will happen, but it doesn't always fix it: the long 1-4 interaction warning came sooner (at 10% rather than 27%). I don't think that has been reported before. I'm also not sure if it helps, but you never know.
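
To compare where each attempt failed without reading the whole log by eye, something like the following sketch might help. It assumes the log lines look like the ones posted in this thread ("Completed N out of M steps (P percent)" and "Client-core communications error"); the function name `failure_points` and the `newest_first` flag are hypothetical, chosen only for this example.

```python
import re

# Patterns based on the log lines quoted in this thread.
COMPLETED = re.compile(r"Completed (\d+) out of (\d+) steps\s+\((\d+) percent\)")
ERROR_MARK = "Client-core communications error"


def failure_points(lines, newest_first=True):
    """Return the last reported percent before each client-core error.

    lines: iterable of log lines.
    newest_first: True if the log was pasted in reverse order, as here.
    An entry is None if no progress line was seen before that error.
    """
    ordered = list(lines)
    if newest_first:
        ordered.reverse()  # walk the log in chronological order

    failures = []
    last_percent = None
    for line in ordered:
        match = COMPLETED.search(line)
        if match:
            last_percent = int(match.group(3))
        elif ERROR_MARK in line:
            failures.append(last_percent)
            last_percent = None  # the next attempt starts over
    return failures


if __name__ == "__main__":
    sample = [
        "[19:11:30] Client-core communications error: ERROR 0x0",
        "[19:11:26] Warning: long 1-4 interactions",
        "[19:08:56] Completed 500000 out of 5000000 steps (10 percent)",
        "[17:13:16] Client-core communications error: ERROR 0x0",
        "[17:13:12] Warning: long 1-4 interactions",
        "[17:08:39] Completed 1350000 out of 5000000 steps (27 percent)",
    ]
    print(failure_points(sample))
```

Run against the full log above, a list like this makes the pattern easy to see: three failures at the same point, then an earlier one after the restart.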