2 Dead 8101 WU's out of 7 or so

Moderators: Site Moderators, FAHC Science Team

Post Reply
PinHead
Posts: 285
Joined: Tue Jan 24, 2012 3:43 am
Hardware configuration: Quad Q9550 2.83 contains the GPU 57xx - running SMP and GPU
Quad Q6700 2.66 running just SMP
2P 32core Interlagos SMP on linux

2 Dead 8101 WU's out of 7 or so

Post by PinHead »

Anyone else having issues with the 8101 completing but the server rejects it?

1st - No OC on this boxen and plenty of Noctua's to cool it.
2nd - [22:16:49] Project: 8101 (Run 11, Clone 4, Gen 57) - No Points
[14:41:44] Project: 8101 (Run 0, Clone 8, Gen 3) - No Points
3rd - To my knowledge, no points were awarded for either of these, they were just completly fold and then ditched.

1st WU Log

Code: Select all

[20:41:56] Project: 6903 (Run 0, Clone 7, Gen 88)


[20:41:56] + Attempting to send results [May 2 20:41:56 UTC]
[20:41:56] Connecting to http://130.237.232.237:8080/
[22:15:10] + Results successfully sent
[22:15:10] Thank you for your contribution to Folding@Home.
[22:15:10] + Number of Units Completed: 41

[22:15:17] - Preparing to get new work unit...
[22:15:17] Cleaning up work directory
[22:15:17] + Attempting to get work packet
[22:15:17] Passkey found
[22:15:17] - Connecting to assignment server
[22:15:17] Connecting to http://assign.stanford.edu:8080/
[22:15:17] - Successful: assigned to (128.143.231.201).
[22:15:17] + News From Folding@Home: Welcome to Folding@Home
[22:15:18] Loaded queue successfully.
[22:15:18] Connecting to http://128.143.231.201:8080/
[22:16:45] + Received work.
[22:16:45] + Closed connections
[22:16:45] 
[22:16:45] + Processing work unit
[22:16:45] Core required: FahCore_a5.exe
[22:16:45] Core found.
[22:16:45] Working on queue slot 02 [May 2 22:16:45 UTC]
[22:16:45] + Working ...
[22:16:45] 
[22:16:45] *------------------------------*
[22:16:45] Folding@Home Gromacs SMP Core
[22:16:45] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:16:45] 
[22:16:45] Preparing to commence simulation
[22:16:45] - Looking at optimizations...
[22:16:45] - Created dyn
[22:16:45] - Files status OK
[22:16:49] - Expanded 30311890 -> 33158016 (decompressed 109.3 percent)
[22:16:49] Called DecompressByteArray: compressed_data_size=30311890 data_size=33158016, decompressed_data_size=33158016 diff=0
[22:16:49] - Digital signature verified
[22:16:49] 
[22:16:49] Project: 8101 (Run 11, Clone 4, Gen 57)
[22:16:49] 
[22:16:49] Assembly optimizations on if available.
[22:16:49] Entering M.D.
[22:16:56] Mapping NT from 32 to 32 
[22:17:02] Completed 0 out of 250000 steps  (0%)
[22:43:50] Completed 2500 out of 250000 steps  (1%)
[23:10:40] Completed 5000 out of 250000 steps  (2%)
[23:37:29] Completed 7500 out of 250000 steps  (3%)
[00:04:14] Completed 10000 out of 250000 steps  (4%)
[00:31:02] Completed 12500 out of 250000 steps  (5%)
[00:57:51] Completed 15000 out of 250000 steps  (6%)
[01:24:37] Completed 17500 out of 250000 steps  (7%)
[01:51:28] Completed 20000 out of 250000 steps  (8%)
[02:18:19] Completed 22500 out of 250000 steps  (9%)
[02:45:05] Completed 25000 out of 250000 steps  (10%)
[03:11:55] Completed 27500 out of 250000 steps  (11%)
[03:38:46] Completed 30000 out of 250000 steps  (12%)
[04:05:34] Completed 32500 out of 250000 steps  (13%)
[04:32:26] Completed 35000 out of 250000 steps  (14%)
[04:59:18] Completed 37500 out of 250000 steps  (15%)
[05:26:08] Completed 40000 out of 250000 steps  (16%)
[05:53:01] Completed 42500 out of 250000 steps  (17%)
[06:19:55] Completed 45000 out of 250000 steps  (18%)
[06:46:45] Completed 47500 out of 250000 steps  (19%)
[07:13:36] Completed 50000 out of 250000 steps  (20%)
[07:40:26] Completed 52500 out of 250000 steps  (21%)
[08:07:17] Completed 55000 out of 250000 steps  (22%)
[08:34:03] Completed 57500 out of 250000 steps  (23%)
[09:00:55] Completed 60000 out of 250000 steps  (24%)
[09:27:46] Completed 62500 out of 250000 steps  (25%)
[09:54:33] Completed 65000 out of 250000 steps  (26%)
[10:21:24] Completed 67500 out of 250000 steps  (27%)
[10:48:15] Completed 70000 out of 250000 steps  (28%)
[11:15:03] Completed 72500 out of 250000 steps  (29%)
[11:41:54] Completed 75000 out of 250000 steps  (30%)
[12:08:43] Completed 77500 out of 250000 steps  (31%)
[12:35:29] Completed 80000 out of 250000 steps  (32%)
[13:02:18] Completed 82500 out of 250000 steps  (33%)
[13:29:07] Completed 85000 out of 250000 steps  (34%)
[13:55:52] Completed 87500 out of 250000 steps  (35%)
[14:22:45] Completed 90000 out of 250000 steps  (36%)
[14:49:37] Completed 92500 out of 250000 steps  (37%)
[15:16:24] Completed 95000 out of 250000 steps  (38%)
[15:43:13] Completed 97500 out of 250000 steps  (39%)
[16:10:02] Completed 100000 out of 250000 steps  (40%)
[16:36:50] Completed 102500 out of 250000 steps  (41%)
[17:03:40] Completed 105000 out of 250000 steps  (42%)
[17:30:30] Completed 107500 out of 250000 steps  (43%)
[17:57:21] Completed 110000 out of 250000 steps  (44%)
[18:24:04] Completed 112500 out of 250000 steps  (45%)
[18:50:54] Completed 115000 out of 250000 steps  (46%)
[19:17:46] Completed 117500 out of 250000 steps  (47%)
[19:44:30] Completed 120000 out of 250000 steps  (48%)
[20:11:18] Completed 122500 out of 250000 steps  (49%)
[20:38:06] Completed 125000 out of 250000 steps  (50%)
[21:04:50] Completed 127500 out of 250000 steps  (51%)
[21:31:39] Completed 130000 out of 250000 steps  (52%)
[21:58:27] Completed 132500 out of 250000 steps  (53%)
[22:25:13] Completed 135000 out of 250000 steps  (54%)
[22:52:03] Completed 137500 out of 250000 steps  (55%)
[23:18:55] Completed 140000 out of 250000 steps  (56%)
[23:45:42] Completed 142500 out of 250000 steps  (57%)
[00:12:31] Completed 145000 out of 250000 steps  (58%)
[00:39:19] Completed 147500 out of 250000 steps  (59%)
[01:06:03] Completed 150000 out of 250000 steps  (60%)
[01:32:52] Completed 152500 out of 250000 steps  (61%)
[01:59:41] Completed 155000 out of 250000 steps  (62%)
[02:26:28] Completed 157500 out of 250000 steps  (63%)
[02:53:18] Completed 160000 out of 250000 steps  (64%)
[03:20:10] Completed 162500 out of 250000 steps  (65%)
[03:46:58] Completed 165000 out of 250000 steps  (66%)
[04:13:49] Completed 167500 out of 250000 steps  (67%)
[04:40:41] Completed 170000 out of 250000 steps  (68%)
[05:07:32] Completed 172500 out of 250000 steps  (69%)
[05:34:19] Completed 175000 out of 250000 steps  (70%)
[06:01:10] Completed 177500 out of 250000 steps  (71%)
[06:28:02] Completed 180000 out of 250000 steps  (72%)
[06:54:47] Completed 182500 out of 250000 steps  (73%)
[07:21:36] Completed 185000 out of 250000 steps  (74%)
[07:48:25] Completed 187500 out of 250000 steps  (75%)
[08:15:10] Completed 190000 out of 250000 steps  (76%)
[08:42:00] Completed 192500 out of 250000 steps  (77%)
[09:08:52] Completed 195000 out of 250000 steps  (78%)
[09:35:40] Completed 197500 out of 250000 steps  (79%)
[10:02:31] Completed 200000 out of 250000 steps  (80%)
[10:29:23] Completed 202500 out of 250000 steps  (81%)
[10:56:08] Completed 205000 out of 250000 steps  (82%)
[11:22:59] Completed 207500 out of 250000 steps  (83%)
[11:49:49] Completed 210000 out of 250000 steps  (84%)
[12:16:37] Completed 212500 out of 250000 steps  (85%)
[12:43:26] Completed 215000 out of 250000 steps  (86%)
[13:10:15] Completed 217500 out of 250000 steps  (87%)
[13:37:01] Completed 220000 out of 250000 steps  (88%)
[14:03:51] Completed 222500 out of 250000 steps  (89%)
[14:30:38] Completed 225000 out of 250000 steps  (90%)
[14:57:28] Completed 227500 out of 250000 steps  (91%)
[15:24:13] Completed 230000 out of 250000 steps  (92%)
[15:51:01] Completed 232500 out of 250000 steps  (93%)
[16:17:50] Completed 235000 out of 250000 steps  (94%)
[16:44:37] Completed 237500 out of 250000 steps  (95%)
[17:11:28] Completed 240000 out of 250000 steps  (96%)
[17:38:17] Completed 242500 out of 250000 steps  (97%)
[18:05:03] Completed 245000 out of 250000 steps  (98%)
[18:31:53] Completed 247500 out of 250000 steps  (99%)
[18:58:41] Completed 250000 out of 250000 steps  (100%)
[18:58:58] DynamicWrapper: Finished Work Unit: sleep=10000
[18:59:08] 
[18:59:08] Finished Work Unit:
[18:59:08] - Reading up to 64340496 from "work/wudata_02.trr": Read 64340496
[18:59:08] trr file hash check passed.
[18:59:08] - Reading up to 31555816 from "work/wudata_02.xtc": Read 31555816
[18:59:09] xtc file hash check passed.
[18:59:09] edr file hash check passed.
[18:59:09] logfile size: 198345
[18:59:09] Leaving Run
[18:59:12] - Writing 96255533 bytes of core data to disk...
[18:59:40] Done: 96255021 -> 91526516 (compressed to 5.8 percent)
[18:59:41]   ... Done.
[18:59:52] - Shutting down core
[18:59:52] 
[18:59:52] Folding@home Core Shutdown: FINISHED_UNIT
[18:59:54] CoreStatus = 64 (100)
[18:59:54] Updated performance fraction: 0.695070
[18:59:54] Sending work to server
[18:59:54] Project: 8101 (Run 11, Clone 4, Gen 57)


[18:59:54] + Attempting to send results [May 4 18:59:54 UTC]
[18:59:54] Connecting to http://128.143.231.201:8080/
[19:38:40] - Server reports problem with unit.
[19:38:40] - Preparing to get new work unit...
[19:38:40] Cleaning up work directory
[19:38:40] + Attempting to get work packet
[19:38:40] Passkey found
[19:38:40] - Connecting to assignment server
[19:38:40] Connecting to http://assign.stanford.edu:8080/
[19:38:40] - Successful: assigned to (128.143.231.201).
[19:38:40] + News From Folding@Home: Welcome to Folding@Home
[19:38:40] Loaded queue successfully.
[19:38:40] Connecting to http://128.143.231.201:8080/
[19:40:51] + Received work.
[19:40:51] + Closed connections

2nd WU Log

Code: Select all

[14:39:31] Connecting to http://128.143.231.201:8080/
[14:41:40] + Received work.
[14:41:40] + Closed connections
[14:41:40] 
[14:41:40] + Processing work unit
[14:41:40] Core required: FahCore_a5.exe
[14:41:40] Core found.
[14:41:40] Working on queue slot 05 [May 8 14:41:40 UTC]
[14:41:40] + Working ...
thekraken: The Kraken 0.6 (compiled Sat Feb 25 16:09:42 EST 2012 by wuhog@wuhog)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 18919
thekraken: Logging to thekraken.log
[14:41:40] 
[14:41:40] *------------------------------*
[14:41:40] Folding@Home Gromacs SMP Core
[14:41:40] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[14:41:40] 
[14:41:40] Preparing to commence simulation
[14:41:40] - Looking at optimizations...
[14:41:40] - Created dyn
[14:41:40] - Files status OK
[14:41:44] - Expanded 30315626 -> 33158016 (decompressed 109.3 percent)
[14:41:44] Called DecompressByteArray: compressed_data_size=30315626 data_size=33158016, decompressed_data_size=33158016 diff=0
[14:41:44] - Digital signature verified
[14:41:44] 
[14:41:44] Project: 8101 (Run 0, Clone 8, Gen 3)
[14:41:44] 
[14:41:44] Assembly optimizations on if available.
[14:41:44] Entering M.D.
                         :-)  G  R  O  M  A  C  S  (-:

                   Groningen Machine for Chemical Simulation

                            :-)  VERSION 4.5.3  (-:

        Written by Emile Apol, Rossen Apostolov, Herman J.C. Berendsen,
      Aldert van Buuren, Pär Bjelkmar, Rudi van Drunen, Anton Feenstra, 
        Gerrit Groenhof, Peter Kasson, Per Larsson, Pieter Meulenhoff, 
           Teemu Murtola, Szilard Pall, Sander Pronk, Roland Schulz, 
                Michael Shirts, Alfons Sijbers, Peter Tieleman,

               Berk Hess, David van der Spoel, and Erik Lindahl.

       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
            Copyright (c) 2001-2010, The GROMACS development team at
        Uppsala University & The Royal Institute of Technology, Sweden.
            check out http://www.gromacs.org for more information.


                               :-)  Gromacs  (-:

Reading file work/wudata_05.tpr, VERSION 4.5.4-dev-20110530-cc815 (single precision)
[14:41:51] Mapping NT from 32 to 32 
Starting 32 threads
Making 2D domain decomposition 8 x 4 x 1
starting mdrun 'FP_membrane in water'
1000000 steps,   4000.0 ps (continuing from step 750000,   3000.0 ps).
[14:41:56] Completed 0 out of 250000 steps  (0%)
[15:08:42] Completed 2500 out of 250000 steps  (1%)
[15:35:28] Completed 5000 out of 250000 steps  (2%)
[16:02:13] Completed 7500 out of 250000 steps  (3%)
[16:28:57] Completed 10000 out of 250000 steps  (4%)
[16:55:42] Completed 12500 out of 250000 steps  (5%)
[17:22:27] Completed 15000 out of 250000 steps  (6%)
[17:49:09] Completed 17500 out of 250000 steps  (7%)
[18:15:54] Completed 20000 out of 250000 steps  (8%)
[18:42:39] Completed 22500 out of 250000 steps  (9%)
[19:09:20] Completed 25000 out of 250000 steps  (10%)
[19:36:03] Completed 27500 out of 250000 steps  (11%)
[20:02:49] Completed 30000 out of 250000 steps  (12%)
[20:29:25] Completed 32500 out of 250000 steps  (13%)
[20:56:05] Completed 35000 out of 250000 steps  (14%)
[21:22:49] Completed 37500 out of 250000 steps  (15%)
[21:49:29] Completed 40000 out of 250000 steps  (16%)
[22:16:13] Completed 42500 out of 250000 steps  (17%)
[22:42:56] Completed 45000 out of 250000 steps  (18%)
[23:09:34] Completed 47500 out of 250000 steps  (19%)
[23:36:18] Completed 50000 out of 250000 steps  (20%)
[00:03:01] Completed 52500 out of 250000 steps  (21%)
[00:29:39] Completed 55000 out of 250000 steps  (22%)
[00:56:23] Completed 57500 out of 250000 steps  (23%)
[01:23:09] Completed 60000 out of 250000 steps  (24%)
[01:49:50] Completed 62500 out of 250000 steps  (25%)
[02:16:36] Completed 65000 out of 250000 steps  (26%)
[02:43:19] Completed 67500 out of 250000 steps  (27%)
[03:10:00] Completed 70000 out of 250000 steps  (28%)
[03:36:45] Completed 72500 out of 250000 steps  (29%)
[04:03:31] Completed 75000 out of 250000 steps  (30%)
[04:30:14] Completed 77500 out of 250000 steps  (31%)
[04:57:00] Completed 80000 out of 250000 steps  (32%)
[05:23:48] Completed 82500 out of 250000 steps  (33%)
[05:50:30] Completed 85000 out of 250000 steps  (34%)
[06:17:15] Completed 87500 out of 250000 steps  (35%)
[06:43:59] Completed 90000 out of 250000 steps  (36%)
[07:10:44] Completed 92500 out of 250000 steps  (37%)
[07:37:28] Completed 95000 out of 250000 steps  (38%)
[08:04:13] Completed 97500 out of 250000 steps  (39%)
[08:30:53] Completed 100000 out of 250000 steps  (40%)
[08:57:41] Completed 102500 out of 250000 steps  (41%)
[09:24:28] Completed 105000 out of 250000 steps  (42%)
[09:51:08] Completed 107500 out of 250000 steps  (43%)
[10:17:54] Completed 110000 out of 250000 steps  (44%)
[10:44:38] Completed 112500 out of 250000 steps  (45%)
[11:11:23] Completed 115000 out of 250000 steps  (46%)
[11:38:16] Completed 117500 out of 250000 steps  (47%)
[12:05:10] Completed 120000 out of 250000 steps  (48%)
[12:31:57] Completed 122500 out of 250000 steps  (49%)
[12:58:51] Completed 125000 out of 250000 steps  (50%)
[13:25:40] Completed 127500 out of 250000 steps  (51%)
[13:52:26] Completed 130000 out of 250000 steps  (52%)
[14:19:10] Completed 132500 out of 250000 steps  (53%)
[14:45:58] Completed 135000 out of 250000 steps  (54%)
[15:12:44] Completed 137500 out of 250000 steps  (55%)
[15:39:29] Completed 140000 out of 250000 steps  (56%)
[16:06:21] Completed 142500 out of 250000 steps  (57%)
[16:33:10] Completed 145000 out of 250000 steps  (58%)
[16:59:54] Completed 147500 out of 250000 steps  (59%)
[17:26:39] Completed 150000 out of 250000 steps  (60%)
[17:53:25] Completed 152500 out of 250000 steps  (61%)
[18:20:06] Completed 155000 out of 250000 steps  (62%)
[18:46:53] Completed 157500 out of 250000 steps  (63%)
[19:13:37] Completed 160000 out of 250000 steps  (64%)
[19:40:19] Completed 162500 out of 250000 steps  (65%)
[20:07:04] Completed 165000 out of 250000 steps  (66%)
[20:33:47] Completed 167500 out of 250000 steps  (67%)
[21:00:33] Completed 170000 out of 250000 steps  (68%)
[21:27:24] Completed 172500 out of 250000 steps  (69%)
[21:54:12] Completed 175000 out of 250000 steps  (70%)
[22:20:58] Completed 177500 out of 250000 steps  (71%)
[22:47:45] Completed 180000 out of 250000 steps  (72%)
[23:14:34] Completed 182500 out of 250000 steps  (73%)
[23:41:16] Completed 185000 out of 250000 steps  (74%)
[00:08:03] Completed 187500 out of 250000 steps  (75%)
[00:34:53] Completed 190000 out of 250000 steps  (76%)
[01:01:40] Completed 192500 out of 250000 steps  (77%)
[01:28:30] Completed 195000 out of 250000 steps  (78%)
[01:55:16] Completed 197500 out of 250000 steps  (79%)
[02:21:59] Completed 200000 out of 250000 steps  (80%)
[02:48:45] Completed 202500 out of 250000 steps  (81%)
[03:15:30] Completed 205000 out of 250000 steps  (82%)
[03:42:18] Completed 207500 out of 250000 steps  (83%)
[04:09:03] Completed 210000 out of 250000 steps  (84%)
[04:35:51] Completed 212500 out of 250000 steps  (85%)
[05:02:40] Completed 215000 out of 250000 steps  (86%)
[05:29:24] Completed 217500 out of 250000 steps  (87%)
[05:56:17] Completed 220000 out of 250000 steps  (88%)
[06:23:07] Completed 222500 out of 250000 steps  (89%)
[06:49:53] Completed 225000 out of 250000 steps  (90%)
[07:16:44] Completed 227500 out of 250000 steps  (91%)
[07:43:37] Completed 230000 out of 250000 steps  (92%)
[08:10:26] Completed 232500 out of 250000 steps  (93%)
[08:37:15] Completed 235000 out of 250000 steps  (94%)
[09:04:07] Completed 237500 out of 250000 steps  (95%)
[09:30:53] Completed 240000 out of 250000 steps  (96%)
[09:57:41] Completed 242500 out of 250000 steps  (97%)
[10:24:29] Completed 245000 out of 250000 steps  (98%)
[10:51:13] Completed 247500 out of 250000 steps  (99%)
[11:18:04] Completed 250000 out of 250000 steps  (100%)

Writing final coordinates.

 Average load imbalance: 12.8 %
 Part of the total run time spent waiting due to load imbalance: 3.4 %


	Parallel run - timing based on wallclock.

               NODE (s)   Real (s)      (%)
       Time: 160585.072 160585.072    100.0
                       1d20h36:25
               (Mnbf/s)   (GFlops)   (ns/day)  (hour/ns)
Performance:    675.550     35.009      0.538     44.607

Thanx for Using GROMACS - Have a Nice Day

[11:18:21] DynamicWrapper: Finished Work Unit: sleep=10000
[11:18:31] 
[11:18:31] Finished Work Unit:
[11:18:31] - Reading up to 64340496 from "work/wudata_05.trr": Read 64340496
[11:18:32] trr file hash check passed.
[11:18:32] - Reading up to 31615032 from "work/wudata_05.xtc": Read 31615032
[11:18:32] xtc file hash check passed.
[11:18:32] edr file hash check passed.
[11:18:32] logfile size: 197240
[11:18:32] Leaving Run
[11:18:35] - Writing 96313644 bytes of core data to disk...
[11:19:05] Done: 96313132 -> 91592866 (compressed to 5.9 percent)
[11:19:05]   ... Done.
[11:19:17] - Shutting down core
[11:19:17] 
[11:19:17] Folding@home Core Shutdown: FINISHED_UNIT
[11:19:18] CoreStatus = 64 (100)
[11:19:18] Updated performance fraction: 0.616484
[11:19:18] Sending work to server
[11:19:18] Project: 8101 (Run 0, Clone 8, Gen 3)


[11:19:18] + Attempting to send results [May 10 11:19:18 UTC]
[11:19:18] Connecting to http://128.143.231.201:8080/
[11:57:20] - Server reports problem with unit.
[11:57:20] - Preparing to get new work unit...
[11:57:20] Cleaning up work directory
[11:57:20] + Attempting to get work packet
[11:57:20] Passkey found
[11:57:20] - Connecting to assignment server
[11:57:20] Connecting to http://assign.stanford.edu:8080/
[11:57:21] - Successful: assigned to (128.143.231.201).
[11:57:21] + News From Folding@Home: Welcome to Folding@Home
[11:57:21] Loaded queue successfully.
[11:57:21] Connecting to http://128.143.231.201:8080/
[11:59:32] + Received work.
[11:59:32] + Closed connections

Thanks in advance for any info / help
-alias-
Posts: 121
Joined: Sun Feb 22, 2009 1:20 pm

Re: 2 Dead 8101 WU's out of 7 or so

Post by -alias- »

I have handed over more then 25 of this 8101 WUs and I have not experienced that the server has reported errors on them. Since end of April it has come down only these 8101 WUs, which is very tough for the machines and they use 7 -10% more power than the 69xx did. The heat from the machines are so much greater now than before that I must have some windows open. You should check whether there it may have been too hot for your computer, which may have caused irregularities in the folding, despite the fact that you have good cooling. I have the impression that these new WUene is not as tolerate, such that a lot or too many stop and start can make them "sick".

In the beginning, as they start coming down, I had several WUs that got out because the computer crashed, or were not delivered "healthy" enough to be accepted. but after I got tuned / adjusted the machines down to this WUs level it has has gone well. But I am now looking forward to see other more normal WUs then this devil WU. My G34 rigs like the 8101 much better than the SR-2, which I had to clock down a lot.
Post Reply