Native Linux SMP questions

Moderators: Site Moderators, PandeGroup

Native Linux SMP questions

Postby TheWolf » Sat Dec 01, 2012 4:27 am

Ok so I'm running native Linux on one of my rigs folding BigAdv WU's.
It gets 100% done & looks to hang at this spot, I waited a long time before Ctrl+C.
I did a quick back up before restarting, once I restart it starts out back at 0% on the same WU.
Project: 8101 (Run 18, Clone 1, Gen 71) have I lost 49.6 hours of work? qfix didn't do anything to help.
Help me out here?


Code: Select all
[01:11:37] Completed 250000 out of 250000 steps  (100%)
[01:11:49] DynamicWrapper: Finished Work Unit: sleep=10000
[01:11:59]
[01:11:59] Finished Work Unit:
[01:11:59] - Reading up to 64340496 from "work/wudata_02.trr": Read 64340496
[01:11:59] trr file hash check passed.
[01:11:59] - Reading up to 31677056 from "work/wudata_02.xtc": Read 31677056
[01:11:59] xtc file hash check passed.
[01:11:59] edr file hash check passed.
[01:11:59] logfile size: 221087
[01:11:59] Leaving Run
[01:12:03] - Writing 96399515 bytes of core data to disk...
[01:12:22] Done: 96399003 -> 91639782 (compressed to 5.9 percent)
[01:12:22]   ... Done.


Here is the complete log. Think I worked a couple small units before this one, but this is the complete log from when this was started. This is on a new clean install of Linux, so I had to stop & start a few time as I had other things or reboots to do.
Code: Select all

--- Opening Log file [November 28 22:06:26 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/thewolf/fah
Executable: ./fah6
Arguments: -bigadv -smp 24 -verbosity 9

[22:06:26] - Ask before connecting: No
[22:06:26] - User name: TheWolf (Team 111065)
[22:06:26] - User ID: 7161C4375E45BADA
[22:06:26] - Machine ID: 1
[22:06:26]
[22:06:26] Loaded queue successfully.
[22:06:26]
[22:06:26] + Processing work unit
[22:06:26] - Autosending finished units... [November 28 22:06:26 UTC]
[22:06:26] Trying to send all finished work units
[22:06:26] Core required: FahCore_a5.exe
[22:06:26] + No unsent completed units remaining.
[22:06:26] Core found.
[22:06:26] - Autosend completed
[22:06:26] Working on queue slot 02 [November 28 22:06:26 UTC]
[22:06:26] + Working ...
[22:06:26] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 24 -checkpoint 6 -verbose -lifeline 3557 -version 634'

[22:06:26]
[22:06:26] *------------------------------*
[22:06:26] Folding@Home Gromacs SMP Core
[22:06:26] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:06:26]
[22:06:26] Preparing to commence simulation
[22:06:26] - Looking at optimizations...
[22:06:26] - Files status OK
[22:06:29] - Expanded 30307691 -> 33158020 (decompressed 109.4 percent)
[22:06:29] Called DecompressByteArray: compressed_data_size=30307691 data_size=33158020, decompressed_data_size=33158020 diff=0
[22:06:29] - Digital signature verified
[22:06:29]
[22:06:29] Project: 8101 (Run 18, Clone 1, Gen 71)
[22:06:29]
[22:06:29] Assembly optimizations on if available.
[22:06:29] Entering M.D.
[22:06:36] Mapping NT from 24 to 24
[22:06:47] Completed 0 out of 250000 steps  (0%)
[22:37:50] Completed 2500 out of 250000 steps  (1%)
[23:06:56] Completed 5000 out of 250000 steps  (2%)
[23:35:59] Completed 7500 out of 250000 steps  (3%)
[00:10:03] Completed 10000 out of 250000 steps  (4%)
[00:44:26] ***** Got an Activate signal (2)
[00:44:26] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [November 29 00:53:21 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/thewolf/fah
Executable: ./fah6
Arguments: -betateam -smp 24 -verbosity 9

[00:53:21] - Ask before connecting: No
[00:53:21] - User name: TheWolf (Team 111065)
[00:53:21] - User ID: 7161C4375E45BADA
[00:53:21] - Machine ID: 1
[00:53:21]
[00:53:21] Loaded queue successfully.
[00:53:21]
[00:53:21] + Processing work unit
[00:53:21] Core required: FahCore_a5.exe
[00:53:21] Core found.
[00:53:21] - Autosending finished units... [00:53:21]
[00:53:21] Trying to send all finished work units
[00:53:21] + No unsent completed units remaining.
[00:53:21] - Autosend completed
[00:53:21] Working on queue slot 02 [November 29 00:53:21 UTC]
[00:53:21] + Working ...
[00:53:21] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 24 -checkpoint 6 -verbose -lifeline 3011 -version 634'

[00:53:21]
[00:53:21] *------------------------------*
[00:53:21] Folding@Home Gromacs SMP Core
[00:53:21] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[00:53:21]
[00:53:21] Preparing to commence simulation
[00:53:21] - Looking at optimizations...
[00:53:21] - Files status OK
[00:53:24] - Expanded 30307691 -> 33158020 (decompressed 109.4 percent)
[00:53:24] Called DecompressByteArray: compressed_data_size=30307691 data_size=33158020, decompressed_data_size=33158020 diff=0
[00:53:24] - Digital signature verified
[00:53:24]
[00:53:24] Project: 8101 (Run 18, Clone 1, Gen 71)
[00:53:24]
[00:53:25] Assembly optimizations on if available.
[00:53:25] Entering M.D.
[00:53:31] Using Gromacs checkpoints
[00:53:33] Mapping NT from 24 to 24
[00:59:29] Resuming from checkpoint
[00:59:30] Verified work/wudata_02.log
[00:59:38] Verified work/wudata_02.trr
[00:59:39] Verified work/wudata_02.xtc
[00:59:39] Verified work/wudata_02.edr
[00:59:40] Completed 12085 out of 250000 steps  (4%)
[01:08:23] Completed 12500 out of 250000 steps  (5%)
[01:45:00] Completed 15000 out of 250000 steps  (6%)
[02:15:40] Completed 17500 out of 250000 steps  (7%)
[02:46:20] Completed 20000 out of 250000 steps  (8%)
[03:17:00] Completed 22500 out of 250000 steps  (9%)
[03:47:41] Completed 25000 out of 250000 steps  (10%)
[03:53:47] ***** Got an Activate signal (2)
[03:53:47] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [November 29 03:57:42 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/thewolf/fah
Executable: ./fah6
Arguments: -betateam -smp 24 -verbosity 9

[03:57:42] - Ask before connecting: No
[03:57:42] - User name: TheWolf (Team 111065)
[03:57:42] - User ID: 7161C4375E45BADA
[03:57:42] - Machine ID: 1
[03:57:42]
[03:57:42] Loaded queue successfully.
[03:57:42]
[03:57:42] - Autosending finished units... [November 29 03:57:42 UTC]
[03:57:42] + Processing work unit
[03:57:42] Trying to send all finished work units
[03:57:42] Core required: FahCore_a5.exe
[03:57:42] + No unsent completed units remaining.
[03:57:42] Core found.
[03:57:42] - Autosend completed
[03:57:42] Working on queue slot 02 [November 29 03:57:42 UTC]
[03:57:42] + Working ...
[03:57:42] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 24 -checkpoint 6 -verbose -lifeline 2687 -version 634'

[03:57:42]
[03:57:42] *------------------------------*
[03:57:42] Folding@Home Gromacs SMP Core
[03:57:42] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[03:57:42]
[03:57:42] Preparing to commence simulation
[03:57:42] - Looking at optimizations...
[03:57:42] - Files status OK
[03:57:45] - Expanded 30307691 -> 33158020 (decompressed 109.4 percent)
[03:57:45] Called DecompressByteArray: compressed_data_size=30307691 data_size=33158020, decompressed_data_size=33158020 diff=0
[03:57:46] - Digital signature verified
[03:57:46]
[03:57:46] Project: 8101 (Run 18, Clone 1, Gen 71)
[03:57:46]
[03:57:46] Assembly optimizations on if available.
[03:57:46] Entering M.D.
[03:57:52] Using Gromacs checkpoints
[03:57:54] Mapping NT from 24 to 24
[03:58:27] Resuming from checkpoint
[03:58:28] Verified work/wudata_02.log
[03:58:31] Verified work/wudata_02.trr
[03:58:32] Verified work/wudata_02.xtc
[03:58:32] Verified work/wudata_02.edr
[03:58:32] Completed 25480 out of 250000 steps  (10%)
[04:32:56] Completed 27500 out of 250000 steps  (11%)
[04:51:43] ***** Got an Activate signal (2)
[04:51:43] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [November 29 05:11:37 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/thewolf/fah
Executable: ./fah6
Arguments: -betateam -smp 24 -verbosity 9

[05:11:37] - Ask before connecting: No
[05:11:37] - User name: TheWolf (Team 111065)
[05:11:37] - User ID: 7161C4375E45BADA
[05:11:37] - Machine ID: 1
[05:11:37]
[05:11:37] Loaded queue successfully.
[05:11:37]
[05:11:37] + Processing work unit
[05:11:37] Core required: FahCore_a5.exe
[05:11:37] - Autosending finished units... [05:11:37]
[05:11:37] Core found.
[05:11:37] Trying to send all finished work units
[05:11:37] + No unsent completed units remaining.
[05:11:37] - Autosend completed
[05:11:38] Working on queue slot 02 [November 29 05:11:38 UTC]
[05:11:38] + Working ...
[05:11:38] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 24 -checkpoint 6 -verbose -lifeline 2694 -version 634'

[05:11:38]
[05:11:38] *------------------------------*
[05:11:38] Folding@Home Gromacs SMP Core
[05:11:38] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[05:11:38]
[05:11:38] Preparing to commence simulation
[05:11:38] - Looking at optimizations...
[05:11:38] - Files status OK
[05:11:41] - Expanded 30307691 -> 33158020 (decompressed 109.4 percent)
[05:11:41] Called DecompressByteArray: compressed_data_size=30307691 data_size=33158020, decompressed_data_size=33158020 diff=0
[05:11:41] - Digital signature verified
[05:11:41]
[05:11:41] Project: 8101 (Run 18, Clone 1, Gen 71)
[05:11:41]
[05:11:41] Assembly optimizations on if available.
[05:11:41] Entering M.D.
[05:11:47] Using Gromacs checkpoints
[05:11:50] Mapping NT from 24 to 24
[05:12:23] Resuming from checkpoint
[05:12:26] Verified work/wudata_02.log
[05:12:28] Verified work/wudata_02.trr
[05:12:29] Verified work/wudata_02.xtc
[05:12:29] Verified work/wudata_02.edr
[05:12:30] Completed 28290 out of 250000 steps  (11%)
[05:38:22] Completed 30000 out of 250000 steps  (12%)
[06:10:37] Completed 32500 out of 250000 steps  (13%)
[06:40:05] Completed 35000 out of 250000 steps  (14%)
[07:09:32] Completed 37500 out of 250000 steps  (15%)
[07:39:00] Completed 40000 out of 250000 steps  (16%)
[08:08:28] Completed 42500 out of 250000 steps  (17%)
[08:37:55] Completed 45000 out of 250000 steps  (18%)
[09:07:22] Completed 47500 out of 250000 steps  (19%)
[09:36:49] Completed 50000 out of 250000 steps  (20%)
[10:06:12] Completed 52500 out of 250000 steps  (21%)
[10:35:39] Completed 55000 out of 250000 steps  (22%)
[11:05:05] Completed 57500 out of 250000 steps  (23%)
[11:11:37] - Autosending finished units... [November 29 11:11:37 UTC]
[11:11:37] Trying to send all finished work units
[11:11:37] + No unsent completed units remaining.
[11:11:37] - Autosend completed
[11:34:30] Completed 60000 out of 250000 steps  (24%)
[12:03:57] Completed 62500 out of 250000 steps  (25%)
[12:33:23] Completed 65000 out of 250000 steps  (26%)
[13:02:48] Completed 67500 out of 250000 steps  (27%)
[13:32:16] Completed 70000 out of 250000 steps  (28%)
[14:01:51] Completed 72500 out of 250000 steps  (29%)
[14:31:17] Completed 75000 out of 250000 steps  (30%)
[15:00:45] Completed 77500 out of 250000 steps  (31%)
[15:30:10] Completed 80000 out of 250000 steps  (32%)
[15:59:38] Completed 82500 out of 250000 steps  (33%)
[16:29:07] Completed 85000 out of 250000 steps  (34%)
[16:58:36] Completed 87500 out of 250000 steps  (35%)
[17:11:37] - Autosending finished units... [November 29 17:11:37 UTC]
[17:11:37] Trying to send all finished work units
[17:11:37] + No unsent completed units remaining.
[17:11:37] - Autosend completed
[17:28:05] Completed 90000 out of 250000 steps  (36%)
[18:03:34] Completed 92500 out of 250000 steps  (37%)
[18:35:08] Completed 95000 out of 250000 steps  (38%)
[19:05:33] Completed 97500 out of 250000 steps  (39%)
[19:35:02] Completed 100000 out of 250000 steps  (40%)
[20:04:31] Completed 102500 out of 250000 steps  (41%)
[20:33:59] Completed 105000 out of 250000 steps  (42%)
[21:03:29] Completed 107500 out of 250000 steps  (43%)
[21:32:58] Completed 110000 out of 250000 steps  (44%)
[22:02:28] Completed 112500 out of 250000 steps  (45%)
[22:31:57] Completed 115000 out of 250000 steps  (46%)
[23:01:25] Completed 117500 out of 250000 steps  (47%)
[23:11:37] - Autosending finished units... [November 29 23:11:37 UTC]
[23:11:37] Trying to send all finished work units
[23:11:37] + No unsent completed units remaining.
[23:11:37] - Autosend completed
[23:30:54] Completed 120000 out of 250000 steps  (48%)
[00:00:21] Completed 122500 out of 250000 steps  (49%)
[00:29:49] Completed 125000 out of 250000 steps  (50%)
[00:59:17] Completed 127500 out of 250000 steps  (51%)
[01:28:47] Completed 130000 out of 250000 steps  (52%)
[01:58:17] Completed 132500 out of 250000 steps  (53%)
[02:27:48] Completed 135000 out of 250000 steps  (54%)
[02:57:18] Completed 137500 out of 250000 steps  (55%)
[03:26:47] Completed 140000 out of 250000 steps  (56%)
[03:56:16] Completed 142500 out of 250000 steps  (57%)
[04:25:44] Completed 145000 out of 250000 steps  (58%)
[04:55:12] Completed 147500 out of 250000 steps  (59%)
[05:11:37] - Autosending finished units... [November 30 05:11:37 UTC]
[05:11:37] Trying to send all finished work units
[05:11:37] + No unsent completed units remaining.
[05:11:37] - Autosend completed
[05:24:41] Completed 150000 out of 250000 steps  (60%)
[05:54:07] Completed 152500 out of 250000 steps  (61%)
[06:23:37] Completed 155000 out of 250000 steps  (62%)
[06:53:05] Completed 157500 out of 250000 steps  (63%)
[07:22:34] Completed 160000 out of 250000 steps  (64%)
[07:52:03] Completed 162500 out of 250000 steps  (65%)
[08:21:31] Completed 165000 out of 250000 steps  (66%)
[08:50:58] Completed 167500 out of 250000 steps  (67%)
[09:20:27] Completed 170000 out of 250000 steps  (68%)
[09:49:56] Completed 172500 out of 250000 steps  (69%)
[10:19:24] Completed 175000 out of 250000 steps  (70%)
[10:48:54] Completed 177500 out of 250000 steps  (71%)
[11:11:37] - Autosending finished units... [November 30 11:11:37 UTC]
[11:11:37] Trying to send all finished work units
[11:11:37] + No unsent completed units remaining.
[11:11:37] - Autosend completed
[11:18:20] Completed 180000 out of 250000 steps  (72%)
[11:47:48] Completed 182500 out of 250000 steps  (73%)
[12:17:18] Completed 185000 out of 250000 steps  (74%)
[12:46:48] Completed 187500 out of 250000 steps  (75%)
[13:16:20] Completed 190000 out of 250000 steps  (76%)
[13:45:57] Completed 192500 out of 250000 steps  (77%)
[14:15:26] Completed 195000 out of 250000 steps  (78%)
[14:44:55] Completed 197500 out of 250000 steps  (79%)
[15:14:27] Completed 200000 out of 250000 steps  (80%)
[15:44:00] Completed 202500 out of 250000 steps  (81%)
[16:13:35] Completed 205000 out of 250000 steps  (82%)
[16:43:12] Completed 207500 out of 250000 steps  (83%)
[17:11:37] - Autosending finished units... [November 30 17:11:37 UTC]
[17:11:37] Trying to send all finished work units
[17:11:37] + No unsent completed units remaining.
[17:11:37] - Autosend completed
[17:12:49] Completed 210000 out of 250000 steps  (84%)
[17:42:24] Completed 212500 out of 250000 steps  (85%)
[18:12:03] Completed 215000 out of 250000 steps  (86%)
[18:41:43] Completed 217500 out of 250000 steps  (87%)
[19:11:24] Completed 220000 out of 250000 steps  (88%)
[19:41:05] Completed 222500 out of 250000 steps  (89%)
[20:10:47] Completed 225000 out of 250000 steps  (90%)
[20:40:28] Completed 227500 out of 250000 steps  (91%)
[21:10:11] Completed 230000 out of 250000 steps  (92%)
[21:39:54] Completed 232500 out of 250000 steps  (93%)
[22:09:35] Completed 235000 out of 250000 steps  (94%)
[22:39:15] Completed 237500 out of 250000 steps  (95%)
[23:10:45] Completed 240000 out of 250000 steps  (96%)
[23:11:37] - Autosending finished units... [November 30 23:11:37 UTC]
[23:11:37] Trying to send all finished work units
[23:11:37] + No unsent completed units remaining.
[23:11:37] - Autosend completed
[23:40:25] Completed 242500 out of 250000 steps  (97%)
[00:09:58] Completed 245000 out of 250000 steps  (98%)
[00:39:30] Completed 247500 out of 250000 steps  (99%)
[01:11:37] Completed 250000 out of 250000 steps  (100%)
[01:11:49] DynamicWrapper: Finished Work Unit: sleep=10000
[01:11:59]
[01:11:59] Finished Work Unit:
[01:11:59] - Reading up to 64340496 from "work/wudata_02.trr": Read 64340496
[01:11:59] trr file hash check passed.
[01:11:59] - Reading up to 31677056 from "work/wudata_02.xtc": Read 31677056
[01:11:59] xtc file hash check passed.
[01:11:59] edr file hash check passed.
[01:11:59] logfile size: 221087
[01:11:59] Leaving Run
[01:12:03] - Writing 96399515 bytes of core data to disk...
[01:12:22] Done: 96399003 -> 91639782 (compressed to 5.9 percent)
[01:12:22]   ... Done.


--- Opening Log file [December 1 02:17:42 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/thewolf/fah
Executable: ./fah6
Arguments: -send all -smp 24 -verbosity 9

[02:17:42] - Ask before connecting: No
[02:17:42] - User name: TheWolf (Team 111065)
[02:17:42] - User ID: 7161C4375E45BADA
[02:17:42] - Machine ID: 1
[02:17:42]
[02:17:42] Loaded queue successfully.
[02:17:42] Attempting to return result(s) to server...
[02:17:42] Trying to send all finished work units
[02:17:42] + No unsent completed units remaining.
[02:17:42] ***** Got a SIGTERM signal (15)
[02:17:42] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [December 1 02:18:56 UTC]


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/thewolf/fah
Executable: ./fah6
Arguments: -betateam -smp 24 -verbosity 9

[02:18:56] - Ask before connecting: No
[02:18:56] - User name: TheWolf (Team 111065)
[02:18:56] - User ID: 7161C4375E45BADA
[02:18:56] - Machine ID: 1
[02:18:56]
[02:18:56] Loaded queue successfully.
[02:18:56]
[02:18:56] + Processing work unit
[02:18:56] Core required: FahCore_a5.exe
[02:18:56] Core found.
[02:18:56] - Autosending finished units... [02:18:56]
[02:18:56] Trying to send all finished work units
[02:18:56] + No unsent completed units remaining.
[02:18:56] - Autosend completed
[02:18:56] Working on queue slot 02 [December 1 02:18:56 UTC]
[02:18:56] + Working ...
[02:18:56] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 02 -np 24 -checkpoint 6 -verbose -lifeline 7715 -version 634'

[02:18:56]
[02:18:56] *------------------------------*
[02:18:56] Folding@Home Gromacs SMP Core
[02:18:56] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[02:18:56]
[02:18:56] Preparing to commence simulation
[02:18:56] - Ensuring status. Please wait.
[02:19:06] - Looking at optimizations...
[02:19:06] - Working with standard loops on this execution.
[02:19:06] - Previous termination of core was improper.
[02:19:06] - Files status OK
[02:19:08] - Expanded 30307691 -> 33158020 (decompressed 109.4 percent)
[02:19:08] Called DecompressByteArray: compressed_data_size=30307691 data_size=33158020, decompressed_data_size=33158020 diff=0
[02:19:09] - Digital signature verified
[02:19:09]
[02:19:09] Project: 8101 (Run 18, Clone 1, Gen 71)
[02:19:09]
[02:19:09] Entering M.D.
[02:19:15] Mapping NT from 24 to 24
[02:19:27] Completed 0 out of 250000 steps  (0%)
TheWolf
 
Posts: 267
Joined: Thu Jan 24, 2008 10:34 am

Re: Native Linux SMP questions

Postby TheWolf » Sat Dec 01, 2012 4:47 am

add more info:
I had this happen once before but in vbox Linux folding, but just let it slide.
But I can't keep letting these rigs run for 2+ day and not be able to return the work.
The wuresults_02.dat file in the backup folder is 91.6 MB in size.
Does this sound about right & could this some how be uploaded to get credit for?

I already tried qfix & -send all but nothing seems to be tided to this unit to tell it to send.
Wondering if the wuresults_02.dat file along with any other file that might be needed can be zipped up & sent to someone at FaH in to get credit?
I'd like to get this done asap if possible as we all know the QRB it inching away, plus no one should need to have to rework this WU.
Last edited by TheWolf on Sat Dec 01, 2012 5:03 am, edited 1 time in total.
TheWolf
 
Posts: 267
Joined: Thu Jan 24, 2008 10:34 am

Re: Native Linux SMP questions

Postby Joe_H » Sat Dec 01, 2012 4:51 am

You may not have waited long enough before using Ctrl-C. The core would not have finished writing out the results until a "- Shutting down core" message shows up in the log after that "[01:12:22] ... Done." line in your log. That can take a while for a bigadv WU. Also, what filesystem are you using on your Linux install? If ext4, there are settings that cause it to take even longer to write the file out to disk.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Joe_H
Super Moderator
 
Posts: 3329
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: Native Linux SMP questions

Postby TheWolf » Sat Dec 01, 2012 5:01 am

Latest Linux Mint 14. So you saying I probably didn't wait long enough so the WU is lost?
Would have been nice to have known in advance it might take more than 10 mins. to write.
I waited at that screen for about 10 mins. before CTRL+C. That seems mighty long, then another
15 mins for the upload to complete. We looking at 30 mins. of lost folding time with just these two things.
If my other post ever shows up there is more info in it. They have my post restricted for approval. roflol
TheWolf
 
Posts: 267
Joined: Thu Jan 24, 2008 10:34 am

Re: Native Linux SMP questions

Postby TheWolf » Sat Dec 01, 2012 5:19 am

Do you think this will work in my situation?

viewtopic.php?f=44&t=3889
TheWolf
 
Posts: 267
Joined: Thu Jan 24, 2008 10:34 am

Re: Native Linux SMP questions

Postby Joe_H » Sat Dec 01, 2012 5:31 am

I don't know of a way to use the backup file to submit the results and get credit, hopefully if someone knows of a way they will post it. But I think it is lost. As for the filesystem settings, someone who knows the specifics of filesystem defaults for a Mint install will have to weigh in on those. But there is a long thread here about use of ext4 with folding on the V6 client and long times to write out the results.

Another topic covers barrier settings on Linux filesystems and the effect on write time. You can read it in this topic. Hope these get you started on getting the write time down.

Edit - added:
Do you think this will work in my situation?

viewtopic.php?f=44&t=3889

Maybe, I have not used that method and don't know if it still works with the current V6 clients.
Joe_H
Super Moderator
 
Posts: 3329
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: Native Linux SMP questions

Postby 7im » Sat Dec 01, 2012 5:43 am

Yes, the WU is lost. qfix doesn't work any more.

The BIG part of BIGADV indicates the much larger files sizes and wait times to both finish the WU and upload the WU. The BIG points indicate the same. That should not come as a surprise. However, if using the EXT4 file system, the waits will be VERY long.

Many of the BA install guides on team forums talk about the extended wait times using EXT4 files system and how to resolve it. It's even covered here in this forum. Linux client delay after completing unit See also: http://foldingforum.org/viewtopic.php?p=205082#p205082
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: Native Linux SMP questions

Postby Jesse_V » Sat Dec 01, 2012 5:46 am

Joe_H wrote:As for the filesystem settings, someone who knows the specifics of filesystem defaults for a Mint install will have to weigh in on those.

I run Mint. IIRC, ext4 is default.
User avatar
Jesse_V
 
Posts: 2893
Joined: Mon Jul 18, 2011 4:44 am
Location: Logan, Utah, USA

Re: Native Linux SMP questions

Postby TheWolf » Sat Dec 01, 2012 6:12 am

TheWolf wrote:Just wanted to let everyone know that the above link I posted did & does still work.
I think it would be nice if a note was added to that link if in a case like mine where
using a back copy to use "./fah6 -send all" on the re-start of folding. This will let it close
after the upload has finished but will not download a new work-unit on the start of F@H.
Since I thought it might be lost I did a dump & started clean. I had already started a new WU.
I then did a backup a second time before start the above TuT, of the new WU that was at 25% by now.

So not thinking I just started the old backup with just "./fah6" this in turn started the upload but also downloaded a new WU.
So it would be great if the help topic was edit to reflect to use "./fah6 -send all" on a re-start, in case were you are working
in a situation like mine.

Now how to finish two WU's in time with out having to dump one or the other. lol
I'll get this worked out too ;-)


7im wrote:Yes, the WU is lost. qfix doesn't work any more.


Sorry you are wrong, it does indeed still work on V6. See my post above. :ewink:
Someone should add my findings to viewtopic.php?f=44&t=3889 to help others or make a more up to date topic on the subject.
I will say I didn't think it was going to work as I waited 10 mins after the use of ./fah6 -delete 0x but when doing a CTRL+C
it gave me the rest & the error message. I then ran ./qfix and it then said -- requeued for upload. Upon a restart of folding the upload started.

Code: Select all
[0]0:Return code = 18
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[16:11:11] - Failed to delete the requested work unit

Folding@Home Client Shutdown.


Edit by Mod: Your findings have been added per your request.
TheWolf
 
Posts: 267
Joined: Thu Jan 24, 2008 10:34 am

Re: Native Linux SMP questions

Postby bollix47 » Sat Dec 01, 2012 12:48 pm

FYI

Qfix source and binaries were updated in July of 2012 to be compatible with the newer queue format in version 6.

qfix.c : qfix.c (14.01 KB)
Description : Dick Howells original qfix.c with a fix for the queue.dat files with queue version v6.00
Modified : Thu Jul 5 10:50:49 2012
Image
bollix47
 
Posts: 3345
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Native Linux SMP questions

Postby 7im » Sat Dec 01, 2012 1:19 pm

Good to learn qfix was finally fixed again. A few months ago, it wouldn't have helped.
User avatar
7im
 
Posts: 15147
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: Native Linux SMP questions

Postby TheWolf » Sat Dec 01, 2012 1:25 pm

bollix47 wrote:FYI

Qfix source and binaries were updated in July of 2012 to be compatible with the newer queue format in version 6.

qfix.c : qfix.c (14.01 KB)
Description : Dick Howells original qfix.c with a fix for the queue.dat files with queue version v6.00
Modified : Thu Jul 5 10:50:49 2012

Glad it was and I found the newer version! Without this the work unit would have had to be redone by someone else at a later date.
Wasn't any since in my trying to do it from the start again as I was already cutting close to the QRB deadline @ 49.6 hours.
I'm sure I will probably not get full credit since it took a few hours for me to located the TuT/download qfix, install, get it fix
and another 15 or more mins to get it uploaded. But at least it was returned in a fairly timely manner.
BTW I am very new to Linux so I was very lucky to get it fixed that quick, for those of you that are rolling your eyes. :roll: :P :wink:

Thanks for the help and your post guys.
TheWolf
 
Posts: 267
Joined: Thu Jan 24, 2008 10:34 am


Return to Linux CPU V6 Client

Who is online

Users browsing this forum: No registered users and 1 guest

cron