13001 (327, 2, 79) 99.99% and hangs

Moderators: Site Moderators, FAHC Science Team

Post Reply
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

My apologies if I've posted this in the wrong place for help. Feel free to move it if necessary.

I've recently got my GPU folding by upgrading my drivers (currently running 14.12 drivers.) It processed the first project fine, total point count of 17,000+ over 10 days. Second project gave me 8,000 points over 4 days.

I got a new project (13000) about 4 days ago. When it started out, the estimated time for completion was about 10 days, at roughly 17,000 points. The third day (yesterday) into this project , I noticed that it had dropped down to 4 days left for completion. Last night, it really sped up, showing about 2 hours for completion, and total point count started showing 152,000+. I've NEVER seen point counts that high. Well, the two hours pass, progress shows 99.99% and hangs. I let it process for a few more hours last night thinking a collection server might be down etc. But it still showed 99.99% when I shut down my system for the night.

I was hoping that what I thought was an error, would correct itself overnight with a system shutdown. System starts out today, progress starts out at 32-33% with time to finish less than 2 hours. Still shows 152,000+ PPD. Progress gets to 99.99% and hangs again. I assumed it was a bad WU, so I deleted it and retrieved another.

I got 13001 (327, 2, 79) a few hours ago, it starts fine, shows roughly 2 hours to complete, and 152,000+ PPD. Two hours pass, 99.99% and hangs. I can see now in the log that it's processing the WU, and it just passed 3% a short time ago. The GPU is active, but I find it really strange that it zips through progress and then hangs at 99.99%.

Is this an error? Should I be concerned? Am I overreacting?

My system is i5 750, Radeon 5750, Win 7 64bit SP1, 16GB ram. Stock clocks on both CPU and GPU. If any additional information is needed, please let me know.

Code: Select all

*********************** Log Started 2015-02-13T21:27:57Z ***********************
21:27:58:WU02:FS01:Cleaning up
21:27:59:WU00:FS01:Connecting to 171.67.108.200:80
21:28:00:WU00:FS01:Assigned to work server 140.163.4.231
21:28:00:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:Juniper [Radeon HD 5700/6750] from 140.163.4.231
21:28:00:WU00:FS01:Connecting to 140.163.4.231:8080
21:28:00:WU00:FS01:Downloading 4.84MiB
21:28:06:WU00:FS01:Download 62.02%
21:28:08:WU00:FS01:Download complete
21:28:08:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13001 run:327 clone:2 gen:79 core:0x17 unit:0x0000009a538b3db75328ac6a663613c8
21:28:08:WU00:FS01:Starting
21:28:08:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 3632 -checkpoint 15 -gpu 0 -gpu-vendor ati
21:28:08:WU00:FS01:Started FahCore on PID 1744
21:28:08:WU00:FS01:Core PID:4504
21:28:08:WU00:FS01:FahCore 0x17 started
21:28:09:WU00:FS01:0x17:*********************** Log Started 2015-02-13T21:28:09Z ***********************
21:28:09:WU00:FS01:0x17:Project: 13001 (Run 327, Clone 2, Gen 79)
21:28:09:WU00:FS01:0x17:Unit: 0x0000009a538b3db75328ac6a663613c8
21:28:09:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
21:28:09:WU00:FS01:0x17:Machine: 1
21:28:09:WU00:FS01:0x17:Reading tar file state.xml
21:28:10:WU00:FS01:0x17:Reading tar file system.xml
21:28:11:WU00:FS01:0x17:Reading tar file integrator.xml
21:28:11:WU00:FS01:0x17:Reading tar file core.xml
21:28:11:WU00:FS01:0x17:Digital signatures verified
21:28:11:WU00:FS01:0x17:Folding@home GPU core17
21:28:11:WU00:FS01:0x17:Version 0.0.52
21:29:05:FS01:Paused
21:29:05:FS01:Shutting core down
21:29:05:WU00:FS01:0x17:WARNING:Console control signal 1 on PID 4504
21:29:05:WU00:FS01:0x17:Exiting, please wait. . .
21:29:48:FS01:Unpaused
21:31:02:FS01:Paused
21:31:02:FS01:Shutting core down
21:32:03:WARNING:FS01:Killing WU00
21:32:03:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
21:41:27:FS01:Unpaused
21:41:27:WU00:FS01:Starting
21:41:27:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/ProgramData/FAHClient/cores/web.stanford.edu/~pande/Win32/AMD64/ATI/R600/Core_17.fah/FahCore_17.exe -dir 00 -suffix 01 -version 704 -lifeline 3632 -checkpoint 15 -gpu 0 -gpu-vendor ati
21:41:27:WU00:FS01:Started FahCore on PID 1560
21:41:27:WU00:FS01:Core PID:1332
21:41:27:WU00:FS01:FahCore 0x17 started
21:41:28:WU00:FS01:0x17:*********************** Log Started 2015-02-13T21:41:27Z ***********************
21:41:28:WU00:FS01:0x17:Project: 13001 (Run 327, Clone 2, Gen 79)
21:41:28:WU00:FS01:0x17:Unit: 0x0000009a538b3db75328ac6a663613c8
21:41:28:WU00:FS01:0x17:CPU: 0x00000000000000000000000000000000
21:41:28:WU00:FS01:0x17:Machine: 1
21:41:28:WU00:FS01:0x17:Reading tar file state.xml
21:41:29:WU00:FS01:0x17:Reading tar file system.xml
21:41:30:WU00:FS01:0x17:Reading tar file integrator.xml
21:41:30:WU00:FS01:0x17:Reading tar file core.xml
21:41:30:WU00:FS01:0x17:Digital signatures verified
21:41:30:WU00:FS01:0x17:Folding@home GPU core17
21:41:30:WU00:FS01:0x17:Version 0.0.52
21:45:28:WU00:FS01:0x17:Completed 0 out of 5000000 steps (0%)
21:45:28:WU00:FS01:0x17:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
22:56:07:WU00:FS01:0x17:Completed 50000 out of 5000000 steps (1%)
00:05:30:WU00:FS01:0x17:Completed 100000 out of 5000000 steps (2%)
01:14:08:WU00:FS01:0x17:Completed 150000 out of 5000000 steps (3%)
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

It's strange. I had been keeping the log screen up for a few hours, when I finally went back to the status screen, it was roughly 50% progress. It cycled through 100% and then finally kicked in to what I would consider normal progress. It's currently at 6.11% with 4.53 days remaining with 3545 PPD.

I just knew that 152,000+ PPD was too good to be true.

I'm guessing 3500 PPD would be about normal for my older 5750 card?
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Joe_H »

If your log stops at the 3% mark and does not continue, then it probably would be an indication that the folding core crashed due to a GPU reset. The display in the FAHControl or Web Control window is known to keep incrementing up in such a situation. Stopping and restarting should get folding going again. You would have to check the Windows Event log to see if there was a reset.

Causes for a GPU reset can vary. Most often they are caused by overheating, overclocking or by using MS Remote Desktop to access a PC. Other causes might be evident from the Event log. You would have to check into the first two. If the cause was using MS RDT, remotely accessing your PC with something based on a VNC client will not cause a reset.

The folding client does take several percent of completion of a WU to provide an accurate ETA for a project that has not been processed on an installation before. After a restart it also can take a bit before the estimates settle down. That also affects the reported PPD estimate. I entered your logged times for the few percent shown into a bonus calculator and came up with about 5000 PPD.

P.S. Unless you need the 14.12 drivers for a game or other application, the 14.9 drivers are reported to run folding better.

P.P.S. You posted your update as I was writing this. So some of this may refer to the previous WU and the things you saw then.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

Thanks for the reply Joe_H. It's really a mystery to me. It cycled through 100% progress that second time, picked back up at 3%-4% (6% when I noticed it) where it should have been, and hasn't missed a beat since. I did check the Windows event log, but didn't see anything indicating a reset. No overclock on the GPU, or the CPU. Who knows what it could have been. It's the strangest thing I've seen while folding though.

I've seen others commenting on using the 14.9 drivers, and will make the switch when this project completes in a couple of days. I'm only averaging about 3,500 PPD on the GPU, and 1,000 PDD on the CPU, so I hope the 14.9 drivers will give me a slight bump.

Thank you again for the input and suggestions.

Yakbreeder
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by bruce »

Welcome to foldingforum.org, Yakbreeder.

The appearance of 99% complete during the initial stages of a new project is a known anomaly in how the client estimates when it will finish the WU. Don't believe anything you see until more of a WU has been completed.

If you don't have a passkey, get one. I presume you have not completed 10 bonus WUs yet using a passkey. Bonus points do not accrue until you've accomplished that milestone.
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

Thanks for the welcome Bruce.

The strange part of that particular project was that it ran fine for about 3 days. Time for completion was initially ten days. On day three, it dropped down to four days remaining, and then dropped to two hours, within an hours time. Ran up to 99.99% progress, and then hung at that point.

Anyway, I know now, that it was still processing, and as you eluded to an "anomaly." I just hate that I cleared that project when it wasn't complete.

I'm not sure if I can get a passkey or not. I'm folding for a deceased friend, in his memory, using his account name. He got me into folding years ago, when both of our fathers were diagnosed with cancer, at roughly the same time. Both of our fathers have since passed from cancer. I doubt passkey was even used before my friend passed about 3 years ago? I don't remember him mentioning it before he passed, if it did exist. I guess I can always try it with one of my emails and see what happens.

While my system is outdated, I'll gladly contribute what I can to the cause. It's very important to me, having lost many family members to cancer over the years.

Thank you again for the welcome, and thanks for letting me about the known anomaly.

Yakbreeder
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Joe_H »

Yes, you can get a passkey. Use your own email and have it sent to you. What gets qualified is a passkey and username pair, once that has completed 10 WU's the QRB will be credited. So even if your friend had a passkey or not, you don't have to match it going forward.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

Thanks for the info Joe_H.

I'll get one shortly. Do I need to pause Folding, input the passkey and then resume, or can I enter it on the fly?

Thanks

Yakbreeder
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by bruce »

Enter it on the fly, but restart the client before the next WU is downloaded.
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

Thanks Bruce.
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

Of course I think of one more question after I log off.

Anyway, using the same Folding user name, would I be able to use the same passkey on my desktop, as well as my laptop?
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Joe_H »

Yes, you can use it on multiple installations of the folding client on different systems. The passkey can also be used with a different username if you wanted to fold for yourself or a different friend. In the official stats you can also do a search based on just the passkey to see all points awarded to WU's turned in and credited to that passkey.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by bruce »

Searching the official stats can be done either by name or by passkey or by {name+passkey} so plan how you want to see your stats reported.

If you change EITHER the passkey or the name, you'll have to complete another 10 WUs to (re-)qualify that particular pair so it makes some sense to minimize the number of combinations you use.
Yakbreeder
Posts: 16
Joined: Sat Feb 14, 2015 12:36 am
Hardware configuration: Intel i5 750
Asus 7P755D-E
16GB Corsair 1333 memory
Sapphire ATI Radeon 5750 Vapor X
OCZ ModXStream Pro 600 PSU
Windows 7 64 bit, Home Premium SP1
Coolermaster Centurion case with 6- 120mm fans
Location: KTRI

Re: 13001 (327, 2, 79) 99.99% and hangs

Post by Yakbreeder »

Thank you both for the information.
Post Reply