WU Stuck at 99.9%

It seems that a lot of GPU problems revolve around specific versions of drivers. Though AMD has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

Post Reply
enthrop
Posts: 3
Joined: Tue Aug 07, 2012 3:31 pm

WU Stuck at 99.9%

Post by enthrop »

Hi - I'm not actually sure where to post, there seems to be a lot of subforums, but I hope this place works.

I'm actually new to GPU folding, and the other day I received a WU that was going to take 2 days. Someone told me to use the -beta version of GPU core instead, so I did that. It worked yesterday, but today, I've been staring at a project 8900 WU that's been at 99.9% for a very long time!

Here's how it looks from the GUI:
Image

But, when I look at the logs, I see only this:

Code: Select all

12:03:11:WU01:FS00:0x17:Completed 0 out of 2500000 steps (0%)
12:08:42:WU01:FS00:0x17:Completed 25000 out of 2500000 steps (1%)
12:13:58:WU01:FS00:0x17:Completed 50000 out of 2500000 steps (2%)
12:19:32:WU01:FS00:0x17:Completed 75000 out of 2500000 steps (3%)
12:24:49:WU01:FS00:0x17:Completed 100000 out of 2500000 steps (4%)
12:30:23:WU01:FS00:0x17:Completed 125000 out of 2500000 steps (5%)
12:35:39:WU01:FS00:0x17:Completed 150000 out of 2500000 steps (6%)
12:41:12:WU01:FS00:0x17:Completed 175000 out of 2500000 steps (7%)
12:46:28:WU01:FS00:0x17:Completed 200000 out of 2500000 steps (8%)
12:52:02:WU01:FS00:0x17:Completed 225000 out of 2500000 steps (9%)
12:57:18:WU01:FS00:0x17:Completed 250000 out of 2500000 steps (10%)
13:02:52:WU01:FS00:0x17:Completed 275000 out of 2500000 steps (11%)
13:08:09:WU01:FS00:0x17:Completed 300000 out of 2500000 steps (12%)
13:13:43:WU01:FS00:0x17:Completed 325000 out of 2500000 steps (13%)
13:18:59:WU01:FS00:0x17:Completed 350000 out of 2500000 steps (14%)
13:24:33:WU01:FS00:0x17:Completed 375000 out of 2500000 steps (15%)
13:29:50:WU01:FS00:0x17:Completed 400000 out of 2500000 steps (16%)
13:35:23:WU01:FS00:0x17:Completed 425000 out of 2500000 steps (17%)
13:40:40:WU01:FS00:0x17:Completed 450000 out of 2500000 steps (18%)
13:46:14:WU01:FS00:0x17:Completed 475000 out of 2500000 steps (19%)
13:51:32:WU01:FS00:0x17:Completed 500000 out of 2500000 steps (20%)
13:57:06:WU01:FS00:0x17:Completed 525000 out of 2500000 steps (21%)
14:02:23:WU01:FS00:0x17:Completed 550000 out of 2500000 steps (22%)
14:07:57:WU01:FS00:0x17:Completed 575000 out of 2500000 steps (23%)
14:13:12:WU01:FS00:0x17:Completed 600000 out of 2500000 steps (24%)
14:18:45:WU01:FS00:0x17:Completed 625000 out of 2500000 steps (25%)
14:24:02:WU01:FS00:0x17:Completed 650000 out of 2500000 steps (26%)
14:29:35:WU01:FS00:0x17:Completed 675000 out of 2500000 steps (27%)
14:34:51:WU01:FS00:0x17:Completed 700000 out of 2500000 steps (28%)
14:40:21:WU01:FS00:0x17:Completed 725000 out of 2500000 steps (29%)
14:45:27:WU01:FS00:0x17:Completed 750000 out of 2500000 steps (30%)
14:50:44:WU01:FS00:0x17:Completed 775000 out of 2500000 steps (31%)
14:55:49:WU01:FS00:0x17:Completed 800000 out of 2500000 steps (32%)
15:01:20:WU01:FS00:0x17:Completed 825000 out of 2500000 steps (33%)
15:06:36:WU01:FS00:0x17:Completed 850000 out of 2500000 steps (34%)
15:12:08:WU01:FS00:0x17:Completed 875000 out of 2500000 steps (35%)
15:17:24:WU01:FS00:0x17:Completed 900000 out of 2500000 steps (36%)
15:22:57:WU01:FS00:0x17:Completed 925000 out of 2500000 steps (37%)
15:28:13:WU01:FS00:0x17:Completed 950000 out of 2500000 steps (38%)
15:33:46:WU01:FS00:0x17:Completed 975000 out of 2500000 steps (39%)
15:38:56:WU01:FS00:0x17:Completed 1000000 out of 2500000 steps (40%)
15:44:14:WU01:FS00:0x17:Completed 1025000 out of 2500000 steps (41%)
15:49:19:WU01:FS00:0x17:Completed 1050000 out of 2500000 steps (42%)
15:54:37:WU01:FS00:0x17:Completed 1075000 out of 2500000 steps (43%)
15:59:42:WU01:FS00:0x17:Completed 1100000 out of 2500000 steps (44%)
16:05:15:WU01:FS00:0x17:Completed 1125000 out of 2500000 steps (45%)
16:10:32:WU01:FS00:0x17:Completed 1150000 out of 2500000 steps (46%)
16:16:04:WU01:FS00:0x17:Completed 1175000 out of 2500000 steps (47%)
16:21:19:WU01:FS00:0x17:Completed 1200000 out of 2500000 steps (48%)
16:26:47:WU01:FS00:0x17:Completed 1225000 out of 2500000 steps (49%)
16:31:52:WU01:FS00:0x17:Completed 1250000 out of 2500000 steps (50%)
16:37:11:WU01:FS00:0x17:Completed 1275000 out of 2500000 steps (51%)
16:42:17:WU01:FS00:0x17:Completed 1300000 out of 2500000 steps (52%)
16:47:35:WU01:FS00:0x17:Completed 1325000 out of 2500000 steps (53%)
16:52:04:FS00:Shutting core down
16:52:04:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 5520
16:52:04:WU01:FS00:0x17:Exiting, please wait. . .
16:52:04:WU01:FS00:0x17:Folding@home Core Shutdown: INTERRUPTED
16:52:04:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
******************************* Date: 2013-06-15 *******************************
17:15:01:WU01:FS00:Starting
17:15:01:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
17:15:01:WU01:FS00:Started FahCore on PID 6400
17:15:01:WU01:FS00:Core PID:5336
17:15:01:WU01:FS00:FahCore 0x17 started
17:15:01:WU01:FS00:0x17:*********************** Log Started 2013-06-15T17:15:01Z ***********************
17:15:01:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
17:15:01:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
17:15:01:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
17:15:01:WU01:FS00:0x17:Machine: 0
17:15:01:WU01:FS00:0x17:Digital signatures verified
17:15:01:WU01:FS00:0x17:  Found a checkpoint file
17:16:58:FS00:Shutting core down
17:16:59:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 5336
17:16:59:WU01:FS00:0x17:Exiting, please wait. . .
17:17:00:WU01:FS00:0x17:Completed 1300000 out of 2500000 steps (52%)
17:17:00:WU01:FS00:0x17:Lost lifeline PID 6400, exiting
17:17:00:WU01:FS00:0x17:ERROR:103: Lost client lifeline
17:17:00:WU01:FS00:0x17:Folding@home Core Shutdown: CLIENT_DIED
17:17:00:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
19:17:25:WU01:FS00:Starting
19:17:25:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
19:17:25:WU01:FS00:Started FahCore on PID 6044
19:17:25:WU01:FS00:Core PID:4992
19:17:25:WU01:FS00:FahCore 0x17 started
19:17:25:WU01:FS00:0x17:*********************** Log Started 2013-06-15T19:17:25Z ***********************
19:17:25:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
19:17:25:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
19:17:25:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
19:17:25:WU01:FS00:0x17:Machine: 0
19:17:25:WU01:FS00:0x17:Digital signatures verified
19:17:25:WU01:FS00:0x17:  Found a checkpoint file
19:19:33:WU01:FS00:0x17:Completed 1300000 out of 2500000 steps (52%)
19:26:27:WU01:FS00:0x17:Completed 1325000 out of 2500000 steps (53%)
20:25:32:FS00:Shutting core down
20:25:32:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 4992
20:25:32:WU01:FS00:0x17:Exiting, please wait. . .
20:26:33:WARNING:FS00:Killing WU01
20:26:33:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:30:46:WU01:FS00:Starting
21:30:46:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
21:30:46:WU01:FS00:Started FahCore on PID 6280
21:30:46:WU01:FS00:Core PID:4724
21:30:46:WU01:FS00:FahCore 0x17 started
21:30:47:WU01:FS00:0x17:*********************** Log Started 2013-06-15T21:30:46Z ***********************
21:30:47:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
21:30:47:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
21:30:47:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
21:30:47:WU01:FS00:0x17:Machine: 0
21:30:47:WU01:FS00:0x17:Digital signatures verified
21:30:47:WU01:FS00:0x17:  Found a checkpoint file
21:33:10:WU01:FS00:0x17:Completed 1300000 out of 2500000 steps (52%)
21:36:03:FS00:Shutting core down
21:36:03:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 4724
21:36:03:WU01:FS00:0x17:Exiting, please wait. . .
21:36:03:WU01:FS00:0x17:Folding@home Core Shutdown: INTERRUPTED
21:36:04:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:49:38:WU01:FS00:Starting
21:49:38:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
21:49:38:WU01:FS00:Started FahCore on PID 5116
21:49:38:WU01:FS00:Core PID:6252
21:49:38:WU01:FS00:FahCore 0x17 started
21:49:39:WU01:FS00:0x17:*********************** Log Started 2013-06-15T21:49:38Z ***********************
21:49:39:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
21:49:39:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
21:49:39:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
21:49:39:WU01:FS00:0x17:Machine: 0
21:49:39:WU01:FS00:0x17:Digital signatures verified
21:49:39:WU01:FS00:0x17:  Found a checkpoint file
21:49:49:FS00:Shutting core down
21:49:49:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 6252
21:49:49:WU01:FS00:0x17:Exiting, please wait. . .
21:50:50:WARNING:FS00:Killing WU01
21:50:50:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
22:03:28:WU01:FS00:Starting
22:03:28:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
22:03:28:WU01:FS00:Started FahCore on PID 5592
22:03:28:WU01:FS00:Core PID:6708
22:03:28:WU01:FS00:FahCore 0x17 started
22:03:29:WU01:FS00:0x17:*********************** Log Started 2013-06-15T22:03:28Z ***********************
22:03:29:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
22:03:29:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
22:03:29:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
22:03:29:WU01:FS00:0x17:Machine: 0
22:03:29:WU01:FS00:0x17:Digital signatures verified
22:03:29:WU01:FS00:0x17:  Found a checkpoint file
22:04:25:FS00:Shutting core down
22:04:25:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 6708
22:04:25:WU01:FS00:0x17:Exiting, please wait. . .
22:05:26:WARNING:FS00:Killing WU01
22:05:26:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
22:14:28:WU01:FS00:Starting
22:14:28:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
22:14:28:WU01:FS00:Started FahCore on PID 4188
22:14:28:WU01:FS00:Core PID:5416
22:14:28:WU01:FS00:FahCore 0x17 started
22:14:29:WU01:FS00:0x17:*********************** Log Started 2013-06-15T22:14:28Z ***********************
22:14:29:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
22:14:29:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
22:14:29:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
22:14:29:WU01:FS00:0x17:Machine: 0
22:14:29:WU01:FS00:0x17:Digital signatures verified
22:14:29:WU01:FS00:0x17:  Found a checkpoint file
22:14:57:FS00:Shutting core down
22:14:57:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 5416
22:14:57:WU01:FS00:0x17:Exiting, please wait. . .
22:15:58:WARNING:FS00:Killing WU01
22:15:58:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
******************************* Date: 2013-06-15 *******************************
00:24:35:WU01:FS00:Starting
00:24:35:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
00:24:35:WU01:FS00:Started FahCore on PID 3628
00:24:35:WU01:FS00:Core PID:1812
00:24:35:WU01:FS00:FahCore 0x17 started
00:24:36:WU01:FS00:0x17:*********************** Log Started 2013-06-16T00:24:35Z ***********************
00:24:36:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
00:24:36:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
00:24:36:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
00:24:36:WU01:FS00:0x17:Machine: 0
00:24:36:WU01:FS00:0x17:Digital signatures verified
00:24:36:WU01:FS00:0x17:  Found a checkpoint file
00:26:43:WU01:FS00:0x17:Completed 1300000 out of 2500000 steps (52%)
01:37:27:FS00:Shutting core down
01:37:27:WU01:FS00:0x17:WARNING:Console control signal 1 on PID 1812
01:37:27:WU01:FS00:0x17:Exiting, please wait. . .
03:51:45:WARNING:FS00:Killing WU01
03:51:46:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
03:52:02:WU01:FS00:Starting
03:52:02:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/xxx/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/ATI/R600/beta/Core_17.fah/FahCore_17.exe -dir 01 -suffix 01 -version 703 -lifeline 5924 -checkpoint 15 -gpu 0 -gpu-vendor ati
03:52:02:WU01:FS00:Started FahCore on PID 2708
03:52:02:WU01:FS00:Core PID:6028
03:52:02:WU01:FS00:FahCore 0x17 started
03:52:02:WU01:FS00:0x17:*********************** Log Started 2013-06-16T03:52:02Z ***********************
03:52:02:WU01:FS00:0x17:Project: 8900 (Run 295, Clone 0, Gen 26)
03:52:02:WU01:FS00:0x17:Unit: 0x00000020028c126651a668372c528d70
03:52:02:WU01:FS00:0x17:CPU: 0x00000000000000000000000000000000
03:52:02:WU01:FS00:0x17:Machine: 0
03:52:02:WU01:FS00:0x17:Digital signatures verified
03:52:02:WU01:FS00:0x17:  Found a checkpoint file
03:53:53:WU01:FS00:0x17:Completed 1300000 out of 2500000 steps (52%)
03:59:11:WU01:FS00:0x17:Completed 1325000 out of 2500000 steps (53%)
Sorry for the many restarts - I had it set to use GPU only when idle, and I walk to/from computer rather often. It's confusing why GUI says 99.9% though, and logs have been at 53% for past ~12 hours.

[edit] Image not working, but it shows my GPU WU Progress at 99.99% with ETA of 3 secs.. and it's been showing that for past hour.
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: WU Stuck at 99.9%

Post by 7im »

Sorry, but whoever suggested you use the beta flag should have warned you about the consequences of running potentially unstable work units. You also need to be a member of the beta team to get support for beta work. Or remove the beta flag and we can help.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
foldy
Posts: 2061
Joined: Sat Dec 01, 2012 3:43 pm
Hardware configuration: Folding@Home Client 7.6.13 (1 GPU slots)
Windows 7 64bit
Intel Core i5 2500k@4Ghz
Nvidia gtx 1080ti driver 441

Re: WU Stuck at 99.9%

Post by foldy »

Probably it is the standby/hibernate/resume problem ?
viewtopic.php?f=88&t=23926&start=15
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: WU Stuck at 99.9%

Post by bruce »

foldy wrote:Probably it is the standby/hibernate/resume problem ?
viewtopic.php?f=88&t=23926&start=15
Possibly ... possibly not.

Note that 03:51:46 to 03:52:02 represents 16 seconds which under normal circumstances is probably long enough to sync the files, depending on how the OS handles the cache.
Post Reply