P10424 R4584 C0 G53 - slow?

Moderators: Site Moderators, PandeGroup

P10424 R4584 C0 G53 - slow?

Postby Gary480six » Mon Oct 27, 2014 5:31 pm

Wondering if anybody else is having trouble with this work unit or others in this Run/Clone/Gen?

I've Folded a few of these P10424 work units and they usually run about 2:00 a segment.

This P10424 R4584 C0 G53 is running over 20 minutes a segment.

A friend is Folding a different P10424 work unit on similar hardware and the same V. 6.34 client and pushing out 2:20 segments.

My stock i7-2500K has been a solid performer and my diagnostic software does not show CPU throttling or any issue that would explain the drastic drop in production with this specific P10424 work unit.

Thanks
Gary480six
 
Posts: 67
Joined: Mon Jan 21, 2008 6:42 pm

Re: P10424 R4584 C0 G53 - slow?

Postby bruce » Mon Oct 27, 2014 7:39 pm

There no such thing as "others" in a single Project/Run/Clone/Gen.

Also, that WU has not been returned by anyone.

As expected, the previous Gen was completed. The fact that it was completed in 0.59 days really proves nothing because there's no way to know what hardware was used to process it.
bruce
 
Posts: 22873
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: P10424 R4584 C0 G53 - slow?

Postby Joe_H » Mon Oct 27, 2014 8:52 pm

You can post your log file, perhaps something will show as to why this particular WU is running slow. Occasionally bad WU's have been sent out with the wrong number of steps, but the only way to tell is by comparing the number of steps shown in the log with the number given for other WU's from the same project. There are other possible reasons that a particular WU will run slow, but at this point only conjecture is possible.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Joe_H
Site Admin
 
Posts: 4598
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: P10424 R4584 C0 G53 - slow?

Postby Gooders » Tue Oct 28, 2014 2:19 am

Code: Select all
******************************* Date: 2014-10-27 *******************************
10:12:34:WARNING:WU01:FS01:Exception: Failed to send results to work server: Transfer failed
14:15:24:WARNING:WU01:FS01:Failed to get assignment from '171.67.108.200:80': Failed to connect to 171.67.108.200:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
******************************* Date: 2014-10-27 *******************************
16:33:59:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:39:21:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:39:43:WARNING:WU03:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:39:58:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:40:20:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:40:33:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:40:55:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:41:10:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:41:31:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:41:45:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:42:07:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:42:21:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:42:42:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:42:56:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:43:33:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': 10002: Received short response, expected 272 bytes, got 0
16:43:47:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:44:08:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:44:23:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:44:44:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:44:59:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:48:39:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
18:23:26:WARNING:WU02:FS01:Failed to get assignment from '171.67.108.200:80': Failed to connect to 171.67.108.200:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
******************************* Date: 2014-10-27 *******************************
02:06:47:ERROR:WU01:FS00:Exception: Server did not assign work unit
******************************* Date: 2014-10-27 *******************************
16:33:59:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:39:21:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:39:43:WARNING:WU03:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:39:58:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:40:20:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:40:33:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:40:55:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:41:10:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:41:31:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:41:45:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:42:07:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:42:21:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:42:42:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:42:56:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:43:33:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': 10002: Received short response, expected 272 bytes, got 0
16:43:47:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:44:08:WARNING:WU02:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:44:23:WARNING:WU02:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:44:44:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
16:44:59:WARNING:WU00:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:48:39:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.200:8080': Failed to connect to 171.67.108.200:8080: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
18:23:26:WARNING:WU02:FS01:Failed to get assignment from '171.67.108.200:80': Failed to connect to 171.67.108.200:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
******************************* Date: 2014-10-27 *******************************
02:06:47:ERROR:WU01:FS00:Exception: Server did not assign work unit
02:13:05:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:23:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:40:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:56:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:14:12:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:14:28:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:14:46:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)


I keep getting errors downloading the 10424 WU, not sure if i have ever done one before, its not one i reconise however
Image
Gooders
 
Posts: 89
Joined: Sun Jan 12, 2014 8:17 pm
Location: UK

Re: P10424 R4584 C0 G53 - slow?

Postby Gooders » Tue Oct 28, 2014 2:20 am

Code: Select all
02:06:45:WU01:FS00:Connecting to 171.67.108.200:8080
02:06:46:WU01:FS00:Assigned to work server 143.89.28.72
02:06:46:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:20 from 143.89.28.72
02:06:46:WU01:FS00:Connecting to 143.89.28.72:8080
02:06:47:ERROR:WU01:FS00:Exception: Server did not assign work unit
02:06:47:WU01:FS00:Connecting to 171.67.108.200:8080
02:06:48:WU01:FS00:Assigned to work server 171.64.65.79
02:06:48:WU01:FS00:Requesting new work unit for slot 00: RUNNING cpu:20 from 171.64.65.79
02:06:48:WU01:FS00:Connecting to 171.64.65.79:8080
02:06:49:WU01:FS00:Downloading 249.84KiB
02:06:51:WU01:FS00:Download complete
02:06:51:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:10424 run:42180 clone:0 gen:23 core:0xa4 unit:0x000000240a3b1e734cd02ab5bcc79935
02:08:27:WU02:FS01:0x17:Completed 4400000 out of 5000000 steps (88%)
02:12:36:WU00:FS00:0xa3:Completed 500000 out of 500000 steps  (100%)
02:12:38:WU00:FS00:0xa3:DynamicWrapper: Finished Work Unit: sleep=10000
02:12:48:WU00:FS00:0xa3:
02:12:48:WU00:FS00:0xa3:Finished Work Unit:
02:12:48:WU00:FS00:0xa3:- Reading up to 12092904 from "00/wudata_01.trr": Read 12092904
02:12:48:WU00:FS00:0xa3:trr file hash check passed.
02:12:48:WU00:FS00:0xa3:edr file hash check passed.
02:12:48:WU00:FS00:0xa3:logfile size: 55946
02:12:48:WU00:FS00:0xa3:Leaving Run
02:12:51:WU00:FS00:0xa3:- Writing 12182526 bytes of core data to disk...
02:12:53:WU00:FS00:0xa3:Done: 12182014 -> 11285581 (compressed to 92.6 percent)
02:12:53:WU00:FS00:0xa3:  ... Done.
02:12:54:WU00:FS00:0xa3:- Shutting down core
02:12:54:WU00:FS00:0xa3:
02:12:54:WU00:FS00:0xa3:Folding@home Core Shutdown: FINISHED_UNIT
02:12:55:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
02:12:55:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:6095 run:14 clone:50 gen:54 core:0xa3 unit:0x0000003f0a3b1e594f25c77fc42a9e0c
02:12:55:WU00:FS00:Uploading 10.76MiB to 128.143.231.202
02:12:55:WU00:FS00:Connecting to 128.143.231.202:8080
02:12:55:WU01:FS00:Starting
02:12:55:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:12:55:WU01:FS00:Started FahCore on PID 15228
02:12:55:WU01:FS00:Core PID:15240
02:12:55:WU01:FS00:FahCore 0xa4 started
02:12:55:WU01:FS00:0xa4:
02:12:55:WU01:FS00:0xa4:*------------------------------*
02:12:55:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
02:12:55:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:12:55:WU01:FS00:0xa4:
02:12:55:WU01:FS00:0xa4:Preparing to commence simulation
02:12:55:WU01:FS00:0xa4:- Looking at optimizations...
02:12:55:WU01:FS00:0xa4:- Created dyn
02:12:55:WU01:FS00:0xa4:- Files status OK
02:12:55:WU01:FS00:0xa4:- Expanded 255323 -> 397584 (decompressed 155.7 percent)
02:12:55:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=255323 data_size=397584, decompressed_data_size=397584 diff=0
02:12:55:WU01:FS00:0xa4:- Digital signature verified
02:12:55:WU01:FS00:0xa4:
02:12:55:WU01:FS00:0xa4:Project: 10424 (Run 42180, Clone 0, Gen 23)
02:12:55:WU01:FS00:0xa4:
02:12:55:WU01:FS00:0xa4:Assembly optimizations on if available.
02:12:55:WU01:FS00:0xa4:Entering M.D.
02:13:01:WU00:FS00:Upload 9.87%
02:13:01:WU01:FS00:0xa4:Mapping NT from 20 to 20
02:13:01:WU01:FS00:0xa4:mdrun returned 255
02:13:01:WU01:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:13:01:WU01:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:13:05:WU01:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:13:05:WU01:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:13:05:WU01:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:13:05:WU01:FS00:0xa4:Done: 129 -> 144 (compressed to 111.6 percent)
02:13:05:WU01:FS00:0xa4:  ... Done.
02:13:05:WU01:FS00:0xa4:
02:13:05:WU01:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:13:05:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:05:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:10424 run:42180 clone:0 gen:23 core:0xa4 unit:0x000000240a3b1e734cd02ab5bcc79935
02:13:05:WU01:FS00:Uploading 656B to 171.64.65.79
02:13:05:WU01:FS00:Connecting to 171.64.65.79:8080
02:13:07:WU03:FS00:Connecting to 171.67.108.200:8080
02:13:07:WU00:FS00:Upload 15.68%
02:13:07:WU01:FS00:Upload complete
02:13:07:WU01:FS00:Server responded WORK_ACK (400)
02:13:07:WU01:FS00:Cleaning up
02:13:08:WU03:FS00:Assigned to work server 171.64.65.79
02:13:08:WU03:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.64.65.79
02:13:08:WU03:FS00:Connecting to 171.64.65.79:8080
02:13:10:WU03:FS00:Downloading 250.16KiB
02:13:12:WU03:FS00:Download complete
02:13:13:WU03:FS00:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:10424 run:34257 clone:1 gen:5 core:0xa4 unit:0x000000060a3b1e734cd00df11deff5ec
02:13:13:WU03:FS00:Starting
02:13:13:WU03:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 03 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:13:13:WU03:FS00:Started FahCore on PID 9484
02:13:13:WU03:FS00:Core PID:14096
02:13:13:WU03:FS00:FahCore 0xa4 started
02:13:13:WU00:FS00:Upload 21.49%
02:13:13:WU03:FS00:0xa4:
02:13:13:WU03:FS00:0xa4:*------------------------------*
02:13:13:WU03:FS00:0xa4:Folding@Home Gromacs GB Core
02:13:13:WU03:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:13:13:WU03:FS00:0xa4:
02:13:13:WU03:FS00:0xa4:Preparing to commence simulation
02:13:13:WU03:FS00:0xa4:- Looking at optimizations...
02:13:13:WU03:FS00:0xa4:- Created dyn
02:13:13:WU03:FS00:0xa4:- Files status OK
02:13:13:WU03:FS00:0xa4:- Expanded 255654 -> 397584 (decompressed 155.5 percent)
02:13:13:WU03:FS00:0xa4:Called DecompressByteArray: compressed_data_size=255654 data_size=397584, decompressed_data_size=397584 diff=0
02:13:13:WU03:FS00:0xa4:- Digital signature verified
02:13:13:WU03:FS00:0xa4:
02:13:13:WU03:FS00:0xa4:Project: 10424 (Run 34257, Clone 1, Gen 5)
02:13:13:WU03:FS00:0xa4:
02:13:13:WU03:FS00:0xa4:Assembly optimizations on if available.
02:13:13:WU03:FS00:0xa4:Entering M.D.
02:13:19:WU03:FS00:0xa4:Mapping NT from 20 to 20
02:13:19:WU03:FS00:0xa4:mdrun returned 255
02:13:19:WU03:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:13:19:WU03:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:13:19:WU00:FS00:Upload 27.29%
02:13:23:WU03:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:13:23:WU03:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:13:23:WU03:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:13:23:WU03:FS00:0xa4:Done: 129 -> 145 (compressed to 112.4 percent)
02:13:23:WU03:FS00:0xa4:  ... Done.
02:13:23:WU03:FS00:0xa4:
02:13:23:WU03:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:13:23:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:23:WU03:FS00:Sending unit results: id:03 state:SEND error:FAULTY project:10424 run:34257 clone:1 gen:5 core:0xa4 unit:0x000000060a3b1e734cd00df11deff5ec
02:13:23:WU03:FS00:Uploading 657B to 171.64.65.79
02:13:23:WU03:FS00:Connecting to 171.64.65.79:8080
02:13:24:WU01:FS00:Connecting to 171.67.108.200:8080
02:13:25:WU01:FS00:Assigned to work server 171.64.65.79
02:13:25:WU01:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.64.65.79
02:13:25:WU01:FS00:Connecting to 171.64.65.79:8080
02:13:25:WU00:FS00:Upload 33.10%
02:13:25:WU03:FS00:Upload complete
02:13:25:WU03:FS00:Server responded WORK_ACK (400)
02:13:25:WU03:FS00:Cleaning up
02:13:26:WU01:FS00:Downloading 250.37KiB
02:13:29:WU01:FS00:Download complete
02:13:29:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:10424 run:39057 clone:0 gen:25 core:0xa4 unit:0x000000250a3b1e734cd01f768482c5e3
02:13:29:WU01:FS00:Starting
02:13:29:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:13:29:WU01:FS00:Started FahCore on PID 14404
02:13:29:WU01:FS00:Core PID:14456
02:13:29:WU01:FS00:FahCore 0xa4 started
02:13:29:WU01:FS00:0xa4:
02:13:29:WU01:FS00:0xa4:*------------------------------*
02:13:29:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
02:13:29:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:13:29:WU01:FS00:0xa4:
02:13:29:WU01:FS00:0xa4:Preparing to commence simulation
02:13:29:WU01:FS00:0xa4:- Looking at optimizations...
02:13:29:WU01:FS00:0xa4:- Created dyn
02:13:29:WU01:FS00:0xa4:- Files status OK
02:13:29:WU01:FS00:0xa4:- Expanded 255862 -> 397584 (decompressed 155.3 percent)
02:13:29:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=255862 data_size=397584, decompressed_data_size=397584 diff=0
02:13:29:WU01:FS00:0xa4:- Digital signature verified
02:13:29:WU01:FS00:0xa4:
02:13:29:WU01:FS00:0xa4:Project: 10424 (Run 39057, Clone 0, Gen 25)
02:13:29:WU01:FS00:0xa4:
02:13:29:WU01:FS00:0xa4:Assembly optimizations on if available.
02:13:29:WU01:FS00:0xa4:Entering M.D.
02:13:31:WU00:FS00:Upload 38.91%
02:13:35:WU01:FS00:0xa4:Mapping NT from 20 to 20
02:13:35:WU01:FS00:0xa4:mdrun returned 255
02:13:35:WU01:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:13:35:WU01:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:13:37:WU00:FS00:Upload 44.13%
02:13:39:WU01:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:13:39:WU01:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:13:39:WU01:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:13:39:WU01:FS00:0xa4:Done: 129 -> 144 (compressed to 111.6 percent)
02:13:39:WU01:FS00:0xa4:  ... Done.
02:13:39:WU01:FS00:0xa4:
02:13:39:WU01:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:13:39:WU02:FS01:0x17:Completed 4450000 out of 5000000 steps (89%)
02:13:40:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:40:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:10424 run:39057 clone:0 gen:25 core:0xa4 unit:0x000000250a3b1e734cd01f768482c5e3
02:13:40:WU01:FS00:Uploading 656B to 171.64.65.79
02:13:40:WU01:FS00:Connecting to 171.64.65.79:8080
02:13:40:WU03:FS00:Connecting to 171.67.108.200:8080
02:13:40:WU01:FS00:Upload complete
02:13:40:WU01:FS00:Server responded WORK_ACK (400)
02:13:41:WU01:FS00:Cleaning up
02:13:41:WU03:FS00:Assigned to work server 171.64.65.79
02:13:41:WU03:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.64.65.79
02:13:41:WU03:FS00:Connecting to 171.64.65.79:8080
02:13:42:WU03:FS00:Downloading 249.87KiB
02:13:43:WU00:FS00:Upload 49.94%
02:13:45:WU03:FS00:Download complete
02:13:45:WU03:FS00:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:10424 run:19289 clone:0 gen:4 core:0xa4 unit:0x000000070a3b1e734ccfd7f6c6e3df50
02:13:45:WU03:FS00:Starting
02:13:45:WU03:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 03 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:13:45:WU03:FS00:Started FahCore on PID 14700
02:13:45:WU03:FS00:Core PID:14676
02:13:45:WU03:FS00:FahCore 0xa4 started
02:13:46:WU03:FS00:0xa4:
02:13:46:WU03:FS00:0xa4:*------------------------------*
02:13:46:WU03:FS00:0xa4:Folding@Home Gromacs GB Core
02:13:46:WU03:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:13:46:WU03:FS00:0xa4:
02:13:46:WU03:FS00:0xa4:Preparing to commence simulation
02:13:46:WU03:FS00:0xa4:- Looking at optimizations...
02:13:46:WU03:FS00:0xa4:- Created dyn
02:13:46:WU03:FS00:0xa4:- Files status OK
02:13:46:WU03:FS00:0xa4:- Expanded 255357 -> 397584 (decompressed 155.6 percent)
02:13:46:WU03:FS00:0xa4:Called DecompressByteArray: compressed_data_size=255357 data_size=397584, decompressed_data_size=397584 diff=0
02:13:46:WU03:FS00:0xa4:- Digital signature verified
02:13:46:WU03:FS00:0xa4:
02:13:46:WU03:FS00:0xa4:Project: 10424 (Run 19289, Clone 0, Gen 4)
02:13:46:WU03:FS00:0xa4:
02:13:46:WU03:FS00:0xa4:Assembly optimizations on if available.
02:13:46:WU03:FS00:0xa4:Entering M.D.
02:13:49:WU00:FS00:Upload 55.75%
02:13:51:WU03:FS00:0xa4:Mapping NT from 20 to 20
02:13:51:WU03:FS00:0xa4:mdrun returned 255
02:13:51:WU03:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:13:51:WU03:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:13:55:WU00:FS00:Upload 60.97%
02:13:55:WU03:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:13:55:WU03:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:13:55:WU03:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:13:55:WU03:FS00:0xa4:Done: 129 -> 145 (compressed to 112.4 percent)
02:13:55:WU03:FS00:0xa4:  ... Done.
02:13:55:WU03:FS00:0xa4:
02:13:55:WU03:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:13:56:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:13:56:WU03:FS00:Sending unit results: id:03 state:SEND error:FAULTY project:10424 run:19289 clone:0 gen:4 core:0xa4 unit:0x000000070a3b1e734ccfd7f6c6e3df50
02:13:56:WU03:FS00:Uploading 657B to 171.64.65.79
02:13:56:WU03:FS00:Connecting to 171.64.65.79:8080
02:13:56:WU01:FS00:Connecting to 171.67.108.200:8080
02:13:57:WU03:FS00:Upload complete
02:13:57:WU03:FS00:Server responded WORK_ACK (400)
02:13:57:WU03:FS00:Cleaning up
02:13:57:WU01:FS00:Assigned to work server 171.64.65.79
02:13:57:WU01:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.64.65.79
02:13:57:WU01:FS00:Connecting to 171.64.65.79:8080
02:13:59:WU01:FS00:Downloading 249.68KiB
02:14:01:WU00:FS00:Upload 66.78%
02:14:01:WU01:FS00:Download complete
02:14:01:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:10424 run:14040 clone:0 gen:7 core:0xa4 unit:0x0000000a0a3b1e734ccfc5705e617953
02:14:01:WU01:FS00:Starting
02:14:01:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:14:01:WU01:FS00:Started FahCore on PID 14796
02:14:01:WU01:FS00:Core PID:12784
02:14:01:WU01:FS00:FahCore 0xa4 started
02:14:02:WU01:FS00:0xa4:
02:14:02:WU01:FS00:0xa4:*------------------------------*
02:14:02:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
02:14:02:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:14:02:WU01:FS00:0xa4:
02:14:02:WU01:FS00:0xa4:Preparing to commence simulation
02:14:02:WU01:FS00:0xa4:- Looking at optimizations...
02:14:02:WU01:FS00:0xa4:- Created dyn
02:14:02:WU01:FS00:0xa4:- Files status OK
02:14:02:WU01:FS00:0xa4:- Expanded 255163 -> 397584 (decompressed 155.8 percent)
02:14:02:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=255163 data_size=397584, decompressed_data_size=397584 diff=0
02:14:02:WU01:FS00:0xa4:- Digital signature verified
02:14:02:WU01:FS00:0xa4:
02:14:02:WU01:FS00:0xa4:Project: 10424 (Run 14040, Clone 0, Gen 7)
02:14:02:WU01:FS00:0xa4:
02:14:02:WU01:FS00:0xa4:Assembly optimizations on if available.
02:14:02:WU01:FS00:0xa4:Entering M.D.
02:14:07:WU00:FS00:Upload 72.00%
02:14:07:WU01:FS00:0xa4:Mapping NT from 20 to 20
02:14:07:WU01:FS00:0xa4:mdrun returned 255
02:14:07:WU01:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:14:07:WU01:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:14:11:WU01:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:14:11:WU01:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:14:11:WU01:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:14:11:WU01:FS00:0xa4:Done: 129 -> 144 (compressed to 111.6 percent)
02:14:11:WU01:FS00:0xa4:  ... Done.
02:14:11:WU01:FS00:0xa4:
02:14:11:WU01:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:14:12:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:14:12:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:10424 run:14040 clone:0 gen:7 core:0xa4 unit:0x0000000a0a3b1e734ccfc5705e617953
02:14:12:WU01:FS00:Uploading 656B to 171.64.65.79
02:14:12:WU01:FS00:Connecting to 171.64.65.79:8080
02:14:12:WU03:FS00:Connecting to 171.67.108.200:8080
02:14:13:WU00:FS00:Upload 77.81%
02:14:13:WU01:FS00:Upload complete
02:14:13:WU01:FS00:Server responded WORK_ACK (400)
02:14:13:WU01:FS00:Cleaning up
02:14:13:WU03:FS00:Assigned to work server 171.64.65.79
02:14:13:WU03:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.64.65.79
02:14:13:WU03:FS00:Connecting to 171.64.65.79:8080
02:14:14:WU03:FS00:Downloading 249.50KiB
02:14:17:WU03:FS00:Download complete
02:14:17:WU03:FS00:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:10424 run:10797 clone:0 gen:3 core:0xa4 unit:0x000000060a3b1e734ccfba2e1ba05dfe
02:14:17:WU03:FS00:Starting
02:14:17:WU03:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 03 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:14:17:WU03:FS00:Started FahCore on PID 13592
02:14:17:WU03:FS00:Core PID:5636
02:14:17:WU03:FS00:FahCore 0xa4 started
02:14:18:WU03:FS00:0xa4:
02:14:18:WU03:FS00:0xa4:*------------------------------*
02:14:18:WU03:FS00:0xa4:Folding@Home Gromacs GB Core
02:14:18:WU03:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:14:18:WU03:FS00:0xa4:
02:14:18:WU03:FS00:0xa4:Preparing to commence simulation
02:14:18:WU03:FS00:0xa4:- Looking at optimizations...
02:14:18:WU03:FS00:0xa4:- Created dyn
02:14:18:WU03:FS00:0xa4:- Files status OK
02:14:18:WU03:FS00:0xa4:- Expanded 254980 -> 397584 (decompressed 155.9 percent)
02:14:18:WU03:FS00:0xa4:Called DecompressByteArray: compressed_data_size=254980 data_size=397584, decompressed_data_size=397584 diff=0
02:14:18:WU03:FS00:0xa4:- Digital signature verified
02:14:18:WU03:FS00:0xa4:
02:14:18:WU03:FS00:0xa4:Project: 10424 (Run 10797, Clone 0, Gen 3)
02:14:18:WU03:FS00:0xa4:
02:14:18:WU03:FS00:0xa4:Assembly optimizations on if available.
02:14:18:WU03:FS00:0xa4:Entering M.D.
02:14:19:WU00:FS00:Upload 83.62%
02:14:24:WU03:FS00:0xa4:Mapping NT from 20 to 20
02:14:24:WU03:FS00:0xa4:mdrun returned 255
02:14:24:WU03:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:14:24:WU03:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:14:25:WU00:FS00:Upload 89.42%
02:14:28:WU03:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:14:28:WU03:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:14:28:WU03:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:14:28:WU03:FS00:0xa4:Done: 129 -> 145 (compressed to 112.4 percent)
02:14:28:WU03:FS00:0xa4:  ... Done.
02:14:28:WU03:FS00:0xa4:
02:14:28:WU03:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:14:28:WARNING:WU03:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:14:28:WU03:FS00:Sending unit results: id:03 state:SEND error:FAULTY project:10424 run:10797 clone:0 gen:3 core:0xa4 unit:0x000000060a3b1e734ccfba2e1ba05dfe
02:14:28:WU03:FS00:Uploading 657B to 171.64.65.79
02:14:28:WU03:FS00:Connecting to 171.64.65.79:8080
02:14:28:WU01:FS00:Connecting to 171.67.108.200:8080
02:14:29:WU03:FS00:Upload complete
02:14:29:WU03:FS00:Server responded WORK_ACK (400)
02:14:29:WU03:FS00:Cleaning up
02:14:29:WU01:FS00:Assigned to work server 171.64.65.79
02:14:29:WU01:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.64.65.79
02:14:29:WU01:FS00:Connecting to 171.64.65.79:8080
02:14:31:WU01:FS00:Downloading 249.87KiB
02:14:31:WU00:FS00:Upload 95.23%
02:14:35:WU01:FS00:Download complete
02:14:35:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:10424 run:10769 clone:0 gen:3 core:0xa4 unit:0x000000050a3b1e734ccfba13665e797d
02:14:35:WU01:FS00:Starting
02:14:35:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:14:35:WU01:FS00:Started FahCore on PID 13944
02:14:35:WU01:FS00:Core PID:13112
02:14:35:WU01:FS00:FahCore 0xa4 started
02:14:36:WU01:FS00:0xa4:
02:14:36:WU01:FS00:0xa4:*------------------------------*
02:14:36:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
02:14:36:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:14:36:WU01:FS00:0xa4:
02:14:36:WU01:FS00:0xa4:Preparing to commence simulation
02:14:36:WU01:FS00:0xa4:- Looking at optimizations...
02:14:36:WU01:FS00:0xa4:- Created dyn
02:14:36:WU01:FS00:0xa4:- Files status OK
02:14:36:WU01:FS00:0xa4:- Expanded 255350 -> 397584 (decompressed 155.7 percent)
02:14:36:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=255350 data_size=397584, decompressed_data_size=397584 diff=0
02:14:36:WU01:FS00:0xa4:- Digital signature verified
02:14:36:WU01:FS00:0xa4:
02:14:36:WU01:FS00:0xa4:Project: 10424 (Run 10769, Clone 0, Gen 3)
02:14:36:WU01:FS00:0xa4:
02:14:36:WU01:FS00:0xa4:Assembly optimizations on if available.
02:14:36:WU01:FS00:0xa4:Entering M.D.
02:14:42:WU01:FS00:0xa4:Mapping NT from 20 to 20
02:14:42:WU01:FS00:0xa4:mdrun returned 255
02:14:42:WU01:FS00:0xa4:Going to send back what have done -- stepsTotalG=2000000
02:14:42:WU01:FS00:0xa4:Work fraction=0.0000 steps=2000000.
02:14:42:WU00:FS00:Upload complete
02:14:43:WU00:FS00:Server responded WORK_ACK (400)
02:14:43:WU00:FS00:Final credit estimate, 17015.00 points
02:14:43:WU00:FS00:Cleaning up
02:14:46:WU01:FS00:0xa4:logfile size=0 infoLength=0 edr=0 trr=25
02:14:46:WU01:FS00:0xa4:logfile size: 0 info=0 bed=0 hdr=25
02:14:46:WU01:FS00:0xa4:- Writing 641 bytes of core data to disk...
02:14:46:WU01:FS00:0xa4:Done: 129 -> 144 (compressed to 111.6 percent)
02:14:46:WU01:FS00:0xa4:  ... Done.
02:14:46:WU01:FS00:0xa4:
02:14:46:WU01:FS00:0xa4:Folding@home Core Shutdown: EARLY_UNIT_END
02:14:46:WARNING:WU01:FS00:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
02:14:46:WU01:FS00:Sending unit results: id:01 state:SEND error:FAULTY project:10424 run:10769 clone:0 gen:3 core:0xa4 unit:0x000000050a3b1e734ccfba13665e797d
02:14:46:WU01:FS00:Uploading 656B to 171.64.65.79
02:14:46:WU01:FS00:Connecting to 171.64.65.79:8080
02:14:46:WU00:FS00:Connecting to 171.67.108.200:8080
02:14:47:WU01:FS00:Upload complete
02:14:47:WU01:FS00:Server responded WORK_ACK (400)
02:14:47:WU01:FS00:Cleaning up
02:14:47:WU00:FS00:Assigned to work server 171.67.108.60
02:14:47:WU00:FS00:Requesting new work unit for slot 00: READY cpu:20 from 171.67.108.60
02:14:47:WU00:FS00:Connecting to 171.67.108.60:8080
02:14:48:WU00:FS00:Downloading 532.58KiB
02:14:51:WU00:FS00:Download complete
02:14:52:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9500 run:4415 clone:1 gen:900 core:0xa4 unit:0x000003b06652edcc536433136907dde5
02:14:52:WU00:FS00:Starting
02:14:52:WU00:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 704 -lifeline 5756 -checkpoint 10 -np 20
02:14:52:WU00:FS00:Started FahCore on PID 6384
02:14:52:WU00:FS00:Core PID:13012
02:14:52:WU00:FS00:FahCore 0xa4 started
02:14:52:WU00:FS00:0xa4:
02:14:52:WU00:FS00:0xa4:*------------------------------*
02:14:52:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
02:14:52:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
02:14:52:WU00:FS00:0xa4:
02:14:52:WU00:FS00:0xa4:Preparing to commence simulation
02:14:52:WU00:FS00:0xa4:- Looking at optimizations...
02:14:52:WU00:FS00:0xa4:- Created dyn
02:14:52:WU00:FS00:0xa4:- Files status OK
02:14:52:WU00:FS00:0xa4:- Expanded 544848 -> 1306412 (decompressed 239.7 percent)
02:14:52:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=544848 data_size=1306412, decompressed_data_size=1306412 diff=0
02:14:52:WU00:FS00:0xa4:- Digital signature verified
02:14:52:WU00:FS00:0xa4:
02:14:52:WU00:FS00:0xa4:Project: 9500 (Run 4415, Clone 1, Gen 900)
02:14:52:WU00:FS00:0xa4:
02:14:52:WU00:FS00:0xa4:Assembly optimizations on if available.
02:14:52:WU00:FS00:0xa4:Entering M.D.
02:14:58:WU00:FS00:0xa4:Mapping NT from 20 to 20
02:14:58:WU00:FS00:0xa4:Completed 0 out of 250000 steps  (0%)
02:15:21:WU00:FS00:0xa4:Completed 2500 out of 250000 steps  (1%)
Gooders
 
Posts: 89
Joined: Sun Jan 12, 2014 8:17 pm
Location: UK

Re: P10424 R4584 C0 G53 - slow?

Postby Gary480six » Tue Oct 28, 2014 1:41 pm

Bruce - I understand that each individual work unit is only sent out once (with certain exceptions), but if the number of atoms or the K factor is accidentally changed from one Run to another, it will affect more than just my little work unit.
Which is why I reported it.

The work unit P10424 R4584 C0 G53 should now be reported as being returned - for 656 points.

When I picked up this P10424 work unit:

Code: Select all
[17:12:28] + Attempting to get work packet
[17:12:28] Passkey found
[17:12:28] - Connecting to assignment server
[17:12:28] - Successful: assigned to (171.64.65.79).
[17:12:28] + News From Folding@Home:
[17:12:28] Loaded queue successfully.
[17:12:29] + Closed connections
[17:12:29]
[17:12:29] + Processing work unit
[17:12:29] A4 will attempt to use 8 threads.
[17:12:29] Core required: FahCore_a4.exe
[17:12:29] Core found.
[17:12:29] Working on queue slot 09 [October 26 17:12:29 UTC]
[17:12:29] + Working ...
[17:12:29]
[17:12:29] *------------------------------*
[17:12:29] Folding@Home Gromacs GB Core
[17:12:29] Version 2.27 (Dec. 15, 2010)
[17:12:29]
[17:12:29] Preparing to commence simulation
[17:12:29] - Looking at optimizations...
[17:12:29] - Created dyn
[17:12:29] - Files status OK
[17:12:29] - Expanded 31869 -> 397584 (decompressed 1247.5 percent)
[17:12:29] Called DecompressByteArray: compressed_data_size=31869 data_size=397584, decompressed_data_size=397584 diff=0
[17:12:29] - Digital signature verified
[17:12:29]
[17:12:29] Project: 10424 (Run 4584, Clone 0, Gen 53)
[17:12:29]
[17:12:29] Assembly optimizations on if available.
[17:12:29] Entering M.D.
[17:12:35] Mapping NT from 8 to 8
[17:12:35] Completed 0 out of 2000000 steps  (0%)
[17:34:04] Completed 20000 out of 2000000 steps  (1%)
[17:55:37] Completed 40000 out of 2000000 steps  (2%)
[18:17:00] Completed 60000 out of 2000000 steps  (3%)
[18:38:34] Completed 80000 out of 2000000 steps  (4%)
[19:00:07] Completed 100000 out of 2000000 steps  (5%)
[19:21:45] Completed 120000 out of 2000000 steps  (6%)


After I tried a reboot:

Code: Select all
Launch directory: C:\Users\Compaq 64\FAH
Executable: C:\Users\Compaq 64\FAH\FAH6.34-win32-SMP.exe
Arguments: -smp

[16:45:16] - Ask before connecting: No
[16:45:17] - User name: Gary480six (xxxxx)
[16:45:17] - User ID: XXX
[16:45:17] - Machine ID: 5
[16:45:17]
[16:45:17] Loaded queue successfully.
[16:45:17]
[16:45:17] + Processing work unit
[16:45:17] A4 will attempt to use 8 threads.
[16:45:17] Core required: FahCore_a4.exe
[16:45:17] Core found.
[16:45:17] Working on queue slot 09 [October 27 16:45:17 UTC]
[16:45:17] + Working ...
[16:45:18]
[16:45:18] *------------------------------*
[16:45:18] Folding@Home Gromacs GB Core
[16:45:18] Version 2.27 (Dec. 15, 2010)
[16:45:18]
[16:45:18] Preparing to commence simulation
[16:45:18] - Ensuring status. Please wait.
[16:45:27] - Looking at optimizations...
[16:45:27] - Working with standard loops on this execution.
[16:45:27] - Previous termination of core was improper.
[16:45:27] - Files status OK
[16:45:27] - Expanded 31869 -> 397584 (decompressed 1247.5 percent)
[16:45:27] Called DecompressByteArray: compressed_data_size=31869 data_size=397584, decompressed_data_size=397584 diff=0
[16:45:27] - Digital signature verified
[16:45:27]
[16:45:27] Project: 10424 (Run 4584, Clone 0, Gen 53)
[16:45:27]
[16:45:27] Entering M.D.
[16:45:33] Using Gromacs checkpoints
[16:45:33] Mapping NT from 8 to 8
[16:45:34] Resuming from checkpoint
[16:45:34] Verified work/wudata_09.log
[16:45:34] Verified work/wudata_09.trr
[16:45:34] Verified work/wudata_09.xtc
[16:45:34] Verified work/wudata_09.edr
[16:45:34] Completed 1295570 out of 2000000 steps  (64%)
[16:50:21] Completed 1300000 out of 2000000 steps  (65%)
[17:12:17] Completed 1320000 out of 2000000 steps  (66%)
[17:33:37] Completed 1340000 out of 2000000 steps  (67%)
[17:55:10] Completed 1360000 out of 2000000 steps  (68%)


As a follow up, this PC has since completed one P7504 work unit @ 18,900PPD and is working on a new P6096 work unit @ 19,900PPD - so I feel that the problem with that P10424 is not in my hardware.

p.s. the posting by Gooders seems more related to the 171.67.108.200 assignment server issues - can his post be moved so folks are not confused?
Gary480six
 
Posts: 67
Joined: Mon Jan 21, 2008 6:42 pm

Re: P10424 R4584 C0 G53 - slow?

Postby Gooders » Tue Oct 28, 2014 4:18 pm

Thanks for clearing that up garry, i saw your post to do with the same work unit...
Gooders
 
Posts: 89
Joined: Sun Jan 12, 2014 8:17 pm
Location: UK

Re: P10424 R4584 C0 G53 - slow?

Postby davidcoton » Tue Oct 28, 2014 4:27 pm

@Gooders. It's not the same WU. The WU is defined by project, run, clone, generation (PRCG). That is unique and (except in the case of failures) only issued once.
Your WU is the same project but not the same WU. Fold on.... :)
Image
davidcoton
 
Posts: 940
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Re: P10424 R4584 C0 G53 - slow?

Postby Joe_H » Tue Oct 28, 2014 6:18 pm

The posts by gooders were off topic, but they at least help a little by giving a value for the number of steps for WU's from this project. So at least that value was correct for the WU reported by Gary480six. Generally an improperly set value for steps affects only one WU at a time. The k-factor is set for the project, so that is not an issue here. Possibly the WU was a bad one, but not enough information to tell for certain and how it was bad.

As for the problems reported for WU's downloaded by gooders, possibly they could be related to the core/thread setting of 20. A small number of projects have shown issues with values that are multiples of the prime number 5. Another possibility is that the simulation just could not be decomposed to that large a value. Either of these could apply, the first two WU's in the log file submitted by gooders have been completed successfully by others. But the third has not.
Joe_H
Site Admin
 
Posts: 4598
Joined: Tue Apr 21, 2009 4:41 pm
Location: W. MA

Re: P10424 R4584 C0 G53 - slow?

Postby bruce » Tue Oct 28, 2014 7:34 pm

Gary480six wrote:The work unit P10424 R4584 C0 G53 should now be reported as being returned - for 656 points.

Correct:
Hi Gary480six (team 40098),
Your WU (P10424 R4584 C0 G53) was added to the stats database on 2014-10-27 23:06:35 for 656.205 points of credit.
Days taken to complete WU: 1.54

From the log you posted:
[17:12:35] Completed 0 out of 2000000 steps (0%)
...
[19:00:07] Completed 100000 out of 2000000 steps (5%)

107:32 for 5% yields 2150m40s for 100% or 1.49 days, which is consistent with 1.54 days reported above.

What does Task Manager show (details for FAHClient and other tasks running an appreciable amount of CPU time?
bruce
 
Posts: 22873
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: P10424 R4584 C0 G53 - slow?

Postby Gary480six » Wed Oct 29, 2014 2:29 pm

Bruce,

That i7-2600K box is on and Folding 24/7. It is a dedicated Folding box with no other duties.

The Task manager currently shows the a3 core using 99% of the CPU resources for the P8809 it is working on.

As I said... since that P10424 finished, the PC has completed several a3 work units without problems and at full output. (18-19,000PPD)

Also mentioned before, this same system did complete a different P10424 work unit several weeks ago - again, without issues.

Right now, all I can think to do is delete the a4 core from my PC and let it reinstall the next time I get any a4 work units.

If nobody else reports any problems with the P10424 work units then Great. I'll just chalk it up to 'these things happen'.
Gary480six
 
Posts: 67
Joined: Mon Jan 21, 2008 6:42 pm

Re: P10424 R4584 C0 G53 - slow?

Postby Gooders » Wed Oct 29, 2014 3:24 pm

Thanks for those that helped me, Sorry garry i asumed as my project number was the same it was the same... *stupid english person learn to read*

Ive set mine to 23 to see if that fixes issues :)
Gooders
 
Posts: 89
Joined: Sun Jan 12, 2014 8:17 pm
Location: UK

Re: P10424 R4584 C0 G53 - slow?

Postby bruce » Wed Oct 29, 2014 3:59 pm

Gooders wrote:Thanks for those that helped me, Sorry garry i asumed as my project number was the same it was the same... *stupid english person learn to read*

Ive set mine to 23 to see if that fixes issues :)


The definition of the problem has to do with an increasing number of failures for what has been called "large prime" factors. Unfortunately the definition of "large" is somewhat uncertain.

The factors of 20 are 5x2x2. The factors of 23 are 23x1. Obviously 23 is a larger prime factor than 5 so your change will either make things worse or FAH will recognize a problem with 23 and exclude that setting.

Personally, I'd set it to 24. That violates the recommendation/rule of allocating a CPU to support your GPU, but under the circumstances, it's probably better to violate that rule than to struggle with finding some other number of cores that actually works better.
bruce
 
Posts: 22873
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: P10424 R4584 C0 G53 - slow?

Postby Gooders » Thu Oct 30, 2014 1:57 am

cheers bruce, will change it now... only worry is that gpu folding will slow right down? That makes the bulk of my points...
Gooders
 
Posts: 89
Joined: Sun Jan 12, 2014 8:17 pm
Location: UK

Re: P10424 R4584 C0 G53 - slow?

Postby davidcoton » Thu Oct 30, 2014 9:21 am

The usual recommendation is to note the PPD for each slot (and thus the total) in each config you try, and choose the best one. Remember that you need to test for some time (days) over either the same projects or a good mix for the comparisons to be valid.

You could also try CPU:16 plus CPU:6. Please post your results -- there is a lack of good quantitative info for this type of tuning.
davidcoton
 
Posts: 940
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Next

Return to Issues with a specific WU

Who is online

Users browsing this forum: No registered users and 3 guests

cron