No CPU work units since 25may2015

Moderators: Site Moderators, FAHC Science Team

Post Reply
0db
Posts: 4
Joined: Fri Feb 20, 2015 3:24 am

No CPU work units since 25may2015

Post by 0db »

My client got stuck downloading a CPU work unit on 25may & hasn't loaded any since. GPU work units are still downloading & finishing OK. Here's part of the log:

Code: Select all

20:32:52:WU00:FS00:0xa4:DynamicWrapper: Finished Work Unit: sleep=10000
20:33:02:WU00:FS00:0xa4:
20:33:02:WU00:FS00:0xa4:Finished Work Unit:
20:33:02:WU00:FS00:0xa4:- Reading up to 905376 from "00/wudata_01.trr": Read 905376
20:33:02:WU00:FS00:0xa4:trr file hash check passed.
20:33:02:WU00:FS00:0xa4:- Reading up to 829596 from "00/wudata_01.xtc": Read 829596
20:33:02:WU00:FS00:0xa4:xtc file hash check passed.
20:33:02:WU00:FS00:0xa4:edr file hash check passed.
20:33:02:WU00:FS00:0xa4:logfile size: 23188
20:33:02:WU00:FS00:0xa4:Leaving Run
20:33:04:WU00:FS00:0xa4:- Writing 1760648 bytes of core data to disk...
20:33:05:WU00:FS00:0xa4:Done: 1760136 -> 1703018 (compressed to 96.7 percent)
20:33:05:WU00:FS00:0xa4:  ... Done.
20:33:07:WU00:FS00:0xa4:- Shutting down core
20:33:07:WU00:FS00:0xa4:
20:33:07:WU00:FS00:0xa4:Folding@home Core Shutdown: FINISHED_UNIT
20:33:08:WU00:FS00:FahCore returned: FINISHED_UNIT (100 = 0x64)
20:33:08:WU00:FS00:Sending unit results: id:00 state:SEND error:NO_ERROR project:9012 run:860 clone:0 gen:3 core:0xa4 unit:0x00000008ab40417c554e969e0adf27b0
20:33:08:WU00:FS00:Uploading 1.62MiB to 171.64.65.124
20:33:08:WU00:FS00:Connecting to 171.64.65.124:8080
20:33:14:WU00:FS00:Upload 19.24%
20:33:20:WU00:FS00:Upload 42.32%
20:33:26:WU00:FS00:Upload 61.55%
20:33:32:WU00:FS00:Upload 84.64%
20:33:37:WU00:FS00:Upload complete
20:33:37:WU00:FS00:Server responded WORK_ACK (400)
20:33:37:WU00:FS00:Final credit estimate, 2214.00 points
20:33:37:WU00:FS00:Cleaning up
******************************* Date: 2015-05-24 *******************************
******************************* Date: 2015-05-25 *******************************
******************************* Date: 2015-05-25 *******************************
16:22:18:WU00:FS00:Connecting to 171.67.108.200:8080
16:22:19:WU00:FS00:Assigned to work server 155.247.166.219
16:22:19:WU00:FS00:Requesting new work unit for slot 00: RUNNING cpu:11 from 155.247.166.219
16:22:19:WU00:FS00:Connecting to 155.247.166.219:8080
16:22:31:WU00:FS00:Downloading 116.94KiB
16:22:38:WU00:FS00:Download 54.73%
******************************* Date: 2015-05-25 *******************************
******************************* Date: 2015-05-25 *******************************
******************************* Date: 2015-05-26 *******************************
******************************* Date: 2015-05-26 *******************************
******************************* Date: 2015-05-26 *******************************
******************************* Date: 2015-05-27 *******************************
******************************* Date: 2015-05-27 *******************************
******************************* Date: 2015-05-27 *******************************
******************************* Date: 2015-05-27 *******************************
******************************* Date: 2015-05-28 *******************************
******************************* Date: 2015-05-28 *******************************
******************************* Date: 2015-05-28 *******************************
******************************* Date: 2015-05-28 *******************************
******************************* Date: 2015-05-29 *******************************
******************************* Date: 2015-05-29 *******************************
******************************* Date: 2015-05-29 *******************************
******************************* Date: 2015-05-29 *******************************
******************************* Date: 2015-05-30 *******************************
******************************* Date: 2015-05-30 *******************************
******************************* Date: 2015-05-30 *******************************
******************************* Date: 2015-05-31 *******************************
******************************* Date: 2015-05-31 *******************************
******************************* Date: 2015-05-31 *******************************
******************************* Date: 2015-06-01 *******************************
******************************* Date: 2015-06-01 *******************************
******************************* Date: 2015-06-01 *******************************
******************************* Date: 2015-06-01 *******************************
******************************* Date: 2015-06-02 *******************************
******************************* Date: 2015-06-02 *******************************
******************************* Date: 2015-06-02 *******************************
******************************* Date: 2015-06-02 *******************************
******************************* Date: 2015-06-03 *******************************
******************************* Date: 2015-06-03 *******************************
******************************* Date: 2015-06-03 *******************************
******************************* Date: 2015-06-03 *******************************
******************************* Date: 2015-06-04 *******************************
******************************* Date: 2015-06-04 *******************************
******************************* Date: 2015-06-04 *******************************
******************************* Date: 2015-06-04 *******************************
******************************* Date: 2015-06-05 *******************************
******************************* Date: 2015-06-05 *******************************
******************************* Date: 2015-06-05 *******************************
******************************* Date: 2015-06-06 *******************************
******************************* Date: 2015-06-06 *******************************
******************************* Date: 2015-06-06 *******************************
Suggestions to get folding again? Thanks for the help.
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: No CPU work units since 25may2015

Post by 7im »

Don't use the CPU 11 setting. Fah excludes prime numbers. Try 12 or 10.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
davidcoton
Posts: 1102
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Re: No CPU work units since 25may2015

Post by davidcoton »

It seems to have hung during a download. Try pause and unpause for the slot. If that doesn't work, restart the client. (The easiest way to do that may be to reboot the PC. Other ways exist but are OS dependant.)
Image
0db
Posts: 4
Joined: Fri Feb 20, 2015 3:24 am

Re: No CPU work units since 25may2015

Post by 0db »

@davidcoton: Thanks, rebooting got it going again.

@7im: Thanks for the reply & help, but I've always been using the CPU 11 setting. The Fah installer set that & I didn't change it. My involvement with the Fah software is minimalist - I don't tinker with it, just installed it & let it run as much as possible.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: No CPU work units since 25may2015

Post by bruce »

The client has a known problem when there is a communications error during a download or upload ... it never recovers without a reboot. We do hope that this will be fixed in a future version of the client but for now, a reboot (or a restart of FAHClient) is the only answer.

The problem of Gromacs failing when using 11 CPUs is also known, but I think you'll find that the software altered that value. The FAHCore actually uses 10, even when it's set to 11, avoiding that problem.
Post Reply