OS Hang/Lockup after 2h~

It seems that a lot of GPU problems revolve around specific versions of drivers. Though AMD has their own support structure, you can often learn from information reported by others who fold.


Jaxinc
Posts: 7
Joined: Sun Aug 09, 2015 10:24 pm
Hardware configuration: W7P64, Gigabyte GA-990FXA-UD3, Gigabyte 7970Ghz, AMD FX-4100, G.Skill 16gb 1600, WD Black, Thermaltake 850w
Location: Alabama

OS Hang/Lockup after 2h~

Post by Jaxinc »

After over a YEAR of running FaH in my current configuration my system has randomly started to freeze with FaH running... ONLY when FaH is running does this occur. I have gone through EVERY log on my PC and nothing shows a danged thing and I'm about to pull hair. I am currently unable to run FaH due to this issue.

I have updated/rolled back drivers, no effect.
Uninstalled, reinstalled FaH, no effect.
Disabled the screen saver on the assumption that it was the cause (the screen would go black and never come back), no effect.
Power saver is completely off.

Apparently when this issue started... I lost the ability to detect 'idle', so FaH would not run at idle and simply remained on standby... I didn't discover this until it dumped two sets of packets that were never done. Before posting I wiped all cache and logs, uninstalled and reinstalled FaH, and removed ANY programs that were previously throwing errors in the logs (WinZip, PrimoRamDisk). I currently have a HDD failing, but it's an external with NO running software at all. No other hardware issues that I can tell. I have run SeaDisk, RamDisk, etc... no errors.

Specs
Win7 Pro 64
Gigabyte GA-990FXA
Gigabyte 7970Ghz
Sapphire R7-240(StandAlone)
AMD FX-4100
G.Skill Sniper 4x4g 1600
WD Black 250gb 7200

It has run fine on this rig for over a year... The lack of ANY errors in the logs, no BSOD, and no driver crashing is leaving me VERY confused. It simply freezes and requires a hard shut down after that.

Ideas? I can only think of hardware failure at this point.
Nathan_P
Posts: 1180
Joined: Wed Apr 01, 2009 9:22 pm
Hardware configuration: Asus Z8NA D6C, 2 x5670@3.2 Ghz, 12gb Ram, GTX 980ti, AX650 PSU, win 10 (daily use)

Asus Z87 WS, Xeon E3-1230L v3, 8gb ram, KFA GTX 1080, EVGA 750ti , AX760 PSU, Mint 18.2 OS

Not currently folding
Asus Z9PE- D8 WS, 2 E5-2665@2.3 Ghz, 16Gb 1.35v Ram, Ubuntu (Fold only)
Asus Z9PA, 2 Ivy 12 core, 16gb Ram, H folding appliance (fold only)
Location: Jersey, Channel islands

Re: OS Hang/Lockup after 2h~

Post by Nathan_P »

Folding pushes the hardware more than almost anything else. I would start with memory: take out 2 sticks and run FAH. If you get no lock ups, then problem solved. If you do get a lock up, try swapping the sticks around until you get no lock ups. If you still get lock ups after that, then it's something else.

Oh - what power supply are you using?
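
A minimal sketch of the stick-swapping procedure above, in case it helps to keep track of which combinations have been tried. Everything in it is an assumption for illustration: the stick labels, the helper names `isolation_plan` and `suspects`, and the sample outcomes are made up, and a run that finishes without a lockup only makes a stick less suspicious, it does not prove the stick is good.

```python
from itertools import combinations


def isolation_plan(sticks, group_size=2):
    """List every group of `group_size` sticks that could be tested together."""
    return list(combinations(sticks, group_size))


def suspects(results):
    """Return sticks that never appeared in a run that finished without a lockup.

    `results` maps a tuple of installed sticks to True if that run locked up.
    """
    seen = {stick for group in results for stick in group}
    cleared = {
        stick
        for group, locked_up in results.items()
        if not locked_up
        for stick in group
    }
    return sorted(seen - cleared)


if __name__ == "__main__":
    sticks = ["A", "B", "C", "D"]  # hypothetical labels for the four DIMMs
    print(isolation_plan(sticks))
    # Hypothetical outcomes, not real test data.
    print(suspects({("A", "B"): False, ("C", "D"): True, ("A", "C"): False}))
```

In the made-up outcomes, sticks A, B and C each take part in at least one stable run, so D is the only one left as a suspect. If every combination locks up, the list contains all sticks, which points away from memory and toward something else.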
Jaxinc
Posts: 7
Joined: Sun Aug 09, 2015 10:24 pm
Hardware configuration: W7P64, Gigabyte GA-990FXA-UD3, Gigabyte 7970Ghz, AMD FX-4100, G.Skill 16gb 1600, WD Black, Thermaltake 850w
Location: Alabama

Re: OS Hang/Lockup after 2h~

Post by Jaxinc »

Thermaltake 850watt Grand Modular

I've run it set to medium for the afternoon without incident...
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: OS Hang/Lockup after 2h~

Post by bruce »

When was the last time you cleaned the dust out of the heatsinks?
foldy
Posts: 2061
Joined: Sat Dec 01, 2012 3:43 pm
Hardware configuration: Folding@Home Client 7.6.13 (1 GPU slots)
Windows 7 64bit
Intel Core i5 2500k@4Ghz
Nvidia gtx 1080ti driver 441

Re: OS Hang/Lockup after 2h~

Post by foldy »

I had the same problem with GPU folding and, like bruce suggested, cleaned the dust out of the GPU.
This lowered the GPU temperature but was not enough - it kept crashing.
So I removed the GPU heatsink completely and renewed the thermal paste.
This did not lower the GPU temperature any further - but now it's stable again.
JimF
Posts: 652
Joined: Thu Jan 21, 2010 2:03 pm

Re: OS Hang/Lockup after 2h~

Post by JimF »

If you are overclocking, it is not just a question of temperatures. Overclocked GPUs can fail even at ordinary temps. And work units are not all the same; they can work fine for a long time and then you can get a hard one that errors out (or worse).

But you mention HDDs; I am getting to be an unwilling expert on the subject. I have replaced two spinning-platter drives and an SSD for storing videos, along with three SATA cables in the past year (two SATA cables just yesterday). None of them were the OS drive or cable; they were just for backup and storage. But they all caused freezes and BSODs. So disconnect all that you can and maybe isolate the problem further.