I have <next-unit-percentage v='98'/> set, and the problem is that whenever the client requests a new WU, it is assigned the very same WU it is currently folding. The download is then aborted ("Have already seen this work unit"), and I have to wait until the current WU has uploaded before I can get a genuinely new one. I guess I am missing something fairly obvious here, but what? Thanks for any help!
Code:
19:02:37:WU00:FS00:0xa4:Completed 240000 out of 250000 steps (96%)
19:02:55:WU00:FS00:0xa4:Completed 242500 out of 250000 steps (97%)
19:03:13:WU00:FS00:0xa4:Completed 245000 out of 250000 steps (98%)
19:03:14:WU01:FS00:Connecting to assign3.stanford.edu:8080
19:03:15:WU01:FS00:News: Welcome to Folding@Home
19:03:15:WU01:FS00:Assigned to work server 171.67.108.59
19:03:15:WU01:FS00:Requesting new work unit for slot 00: RUNNING smp:12 from 171.67.108.59
19:03:15:WU01:FS00:Connecting to 171.67.108.59:8080
19:03:15:ERROR:WU01:FS00:Exception: Have already seen this work unit 0x000001336652edcb4ee90096190ce999 aborting download
19:03:15:WU01:FS00:Connecting to assign3.stanford.edu:8080
19:03:16:WU01:FS00:News: Welcome to Folding@Home
19:03:16:WU01:FS00:Assigned to work server 171.67.108.59
19:03:16:WU01:FS00:Requesting new work unit for slot 00: RUNNING smp:12 from 171.67.108.59
19:03:16:WU01:FS00:Connecting to 171.67.108.59:8080
19:03:17:ERROR:WU01:FS00:Exception: Have already seen this work unit 0x000001336652edcb4ee90096190ce999 aborting download
19:03:31:WU00:FS00:0xa4:Completed 247500 out of 250000 steps (99%)
19:03:49:WU00:FS00:0xa4:Completed 250000 out of 250000 steps (100%)
19:03:49:WU00:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
19:03:59:WU00:FS00:0xa4:
19:03:59:WU00:FS00:0xa4:Finished Work Unit:
19:03:59:WU00:FS00:0xa4:- Reading up to 768912 from "00/wudata_01.trr": Read 768912
19:03:59:WU00:FS00:0xa4:trr file hash check passed.
19:03:59:WU00:FS00:0xa4:- Reading up to 455748 from "00/wudata_01.xtc": Read 455748
19:03:59:WU00:FS00:0xa4:xtc file hash check passed.
19:03:59:WU00:FS00:0xa4:edr file hash check passed.
19:03:59:WU00:FS00:0xa4:logfile size: 22407
19:03:59:WU00:FS00:0xa4:Leaving Run
19:04:04:WU00:FS00:0xa4:- Writing 1252471 bytes of core data to disk...
19:04:05:WU00:FS00:0xa4:Done: 1251959 -> 1192311 (compressed to 95.2 percent)
19:04:05:WU00:FS00:0xa4: ... Done.
19:04:15:WU01:FS00:Connecting to assign3.stanford.edu:8080
19:04:16:WU01:FS00:News: Welcome to Folding@Home
19:04:16:WU01:FS00:Assigned to work server 171.67.108.59
19:04:16:WU01:FS00:Requesting new work unit for slot 00: RUNNING smp:12 from 171.67.108.59
19:04:16:WU01:FS00:Connecting to 171.67.108.59:8080
19:04:17:ERROR:WU01:FS00:Exception: Have already seen this work unit 0x000001336652edcb4ee90096190ce999 aborting download
19:04:49:WU00:FS00:0xa4:- Shutting down core
19:04:49:WU00:FS00:0xa4:
19:04:49:WU00:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
19:04:54:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
19:04:54:WU00:FS00:Sending unit results: id:00 state:SEND error:OK project:8004 run:81 clone:4 gen:201 core:0xa4 unit:0x000001336652edcb4ee90096190ce999
19:04:54:WU00:FS00:Uploading 1.14MiB to 171.67.108.59
19:04:54:WU00:FS00:Connecting to 171.67.108.59:8080
19:04:58:WU00:FS00:Upload complete
19:04:58:WU00:FS00:Server responded WORK_ACK (400)
19:04:58:WU00:FS00:Final credit estimate, 976.00 points
19:04:58:WU00:FS00:Cleaning up
19:05:53:WU01:FS00:Connecting to assign3.stanford.edu:8080
19:05:53:WU01:FS00:News: Welcome to Folding@Home
19:05:53:WU01:FS00:Assigned to work server 171.67.108.59
19:05:53:WU01:FS00:Requesting new work unit for slot 00: READY smp:12 from 171.67.108.59
19:05:53:WU01:FS00:Connecting to 171.67.108.59:8080
19:05:54:WU01:FS00:Downloading 532.10KiB
19:05:57:WU01:FS00:Download complete
19:05:57:WU01:FS00:Received Unit: id:01 state:DOWNLOAD error:OK project:8004 run:111 clone:24 gen:127 core:0xa4 unit:0x000000ad6652edcb4ee9018079a84a14
19:05:57:WU01:FS00:Starting
19:05:57:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 701 -lifeline 1101 -checkpoint 60 -np 12
19:05:57:WU01:FS00:Started FahCore on PID 2132
19:05:57:Started thread 10 on PID 1101
19:05:57:WU01:FS00:Core PID:2136
19:05:57:WU01:FS00:FahCore 0xa4 started
19:05:58:WU01:FS00:0xa4:
19:05:58:WU01:FS00:0xa4:*------------------------------*
19:05:58:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
19:05:58:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
19:05:58:WU01:FS00:0xa4:
19:05:58:WU01:FS00:0xa4:Preparing to commence simulation
19:05:58:WU01:FS00:0xa4:- Looking at optimizations...
19:05:58:WU01:FS00:0xa4:- Created dyn
19:05:58:WU01:FS00:0xa4:- Files status OK
19:05:58:WU01:FS00:0xa4:- Expanded 544358 -> 1305024 (decompressed 239.7 percent)
19:05:58:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=544358 data_size=1305024, decompressed_data_size=1305024 diff=0
19:05:58:WU01:FS00:0xa4:- Digital signature verified
19:05:58:WU01:FS00:0xa4:
19:05:58:WU01:FS00:0xa4:Project: 8004 (Run 111, Clone 24, Gen 127)
19:05:58:WU01:FS00:0xa4:
19:05:58:WU01:FS00:0xa4:Assembly optimizations on if available.
19:05:58:WU01:FS00:0xa4:Entering M.D.
19:06:04:WU01:FS00:0xa4:Completed 0 out of 250000 steps (0%)
19:06:22:WU01:FS00:0xa4:Completed 2500 out of 250000 steps (1%)
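To make the pattern in the log above easier to see, here is a rough Python sketch that counts how often each unit ID was rejected and checks whether that same ID shows up later in a "Sending unit results" line. The regular expressions are only derived from the lines quoted in this post, and the `summarize` helper is a made-up name, not part of FAHClient; other client versions may format these lines differently.

```python
import re
from collections import Counter

# Patterns are assumptions based only on the log lines quoted in this post.
REJECTED = re.compile(r"Have already seen this work unit (0x[0-9a-f]+)")
SENT = re.compile(r"^(\d\d:\d\d:\d\d):.*Sending unit results:.*unit:(0x[0-9a-f]+)")


def summarize(log_text):
    """Return (rejections, sent): a Counter of rejected unit IDs and a
    dict mapping each uploaded unit ID to the timestamp of its upload line."""
    rejections = Counter()
    sent = {}
    for line in log_text.splitlines():
        m = REJECTED.search(line)
        if m:
            rejections[m.group(1)] += 1
            continue
        m = SENT.search(line)
        if m:
            sent[m.group(2)] = m.group(1)
    return rejections, sent


# Four lines copied from the log above.
SAMPLE = """\
19:03:15:ERROR:WU01:FS00:Exception: Have already seen this work unit 0x000001336652edcb4ee90096190ce999 aborting download
19:03:17:ERROR:WU01:FS00:Exception: Have already seen this work unit 0x000001336652edcb4ee90096190ce999 aborting download
19:04:17:ERROR:WU01:FS00:Exception: Have already seen this work unit 0x000001336652edcb4ee90096190ce999 aborting download
19:04:54:WU00:FS00:Sending unit results: id:00 state:SEND error:OK project:8004 run:81 clone:4 gen:201 core:0xa4 unit:0x000001336652edcb4ee90096190ce999
"""

rejections, sent = summarize(SAMPLE)
for unit, count in rejections.items():
    note = f"uploaded at {sent[unit]}" if unit in sent else "no upload seen in this log"
    print(f"{unit}: rejected {count} time(s), {note}")
```

On the sample lines this reports three rejections of the unit that WU00 uploads at 19:04:54, which matches what I see: the server keeps offering the unit that is still in progress until its results have been sent.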
Here is my config.xml:
Code:
<config>
  <!-- FahCore Control -->
  <checkpoint v='60'/>
  <core-priority v='low'/>

  <!-- Folding Slot Configuration -->
  <max-packet-size v='big'/>

  <!-- Logging -->
  <verbosity v='5'/>

  <!-- Network -->
  <proxy v=':8080'/>

  <!-- User Information -->
  <passkey v='*********************'/>
  <user v='svanefalk'/>

  <!-- Work Unit Control -->
  <next-unit-percentage v='98'/>

  <!-- Folding Slots -->
  <slot id='0' type='SMP'>
    <cpus v='-1'/>
    <max-packet-size v='big'/>
    <next-unit-percentage v='98'/>
  </slot>
</config>
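Since next-unit-percentage appears twice in this file (once globally and once inside the slot), a quick way to double-check which values are actually set is to read the file with Python's standard XML parser. This is only a sketch: `next_unit_settings` is a made-up helper name, the embedded XML is a trimmed, well-formed copy of the config above with the passkey left out, and it says nothing about how FAHClient itself resolves a global value against a per-slot one.

```python
import xml.etree.ElementTree as ET

# Trimmed, well-formed copy of the settings above (passkey omitted on purpose).
CONFIG = """\
<config>
  <next-unit-percentage v='98'/>
  <slot id='0' type='SMP'>
    <cpus v='-1'/>
    <next-unit-percentage v='98'/>
  </slot>
</config>
"""


def next_unit_settings(xml_text):
    """Return (global_value, per_slot), where per_slot maps slot id to value.

    Values are returned as the raw attribute strings; a missing global
    setting is reported as None.
    """
    root = ET.fromstring(xml_text)
    top = root.find("next-unit-percentage")
    global_value = top.get("v") if top is not None else None
    per_slot = {}
    for slot in root.findall("slot"):
        opt = slot.find("next-unit-percentage")
        if opt is not None:
            per_slot[slot.get("id")] = opt.get("v")
    return global_value, per_slot


global_value, per_slot = next_unit_settings(CONFIG)
print("global:", global_value)
print("per slot:", per_slot)
```

For the config above, both the global entry and slot 0 come back as 98, so the early request at 98% in the log is consistent with either setting.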