BSODs on GPU jobs

It seems that a lot of GPU problems revolve around specific versions of drivers. Though AMD has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

Post Reply
Vester
Posts: 79
Joined: Sun Dec 02, 2007 4:29 pm
Hardware configuration: Gigabyte Aorus Z590 Pro AX, Intel i9-10850K, 32GB Crucial Ballistix DDR4-2600, Samsung NVMe EVO 980 Pro 256GB, CoolerMaster liquid cooler ML360, Nvidia Titan X (Pascal), Dell Nvidia RTX 3080 10GB 4Y12V, Pop!_OS.
Location: Fair Play, SC

BSODs on GPU jobs

Post by Vester »

I have had three BSODs on GPU jobs in the last two days related to fah.exe. I have re-imaged my HD to one week ago, removed F@H, and tested my computer and found no problems on my end. I am running an HPE-h9-1135 Phoenix with AMD FX-8350 processor, AMD HD 7790 video card, 16 GB RAM, and Windows 7 Home Premium 64b. I am reinstalling F@h in a few minutes and see how it goes.

[edit]Driver Packaging Version 14.10.1006-140417a-171099C
Catalyst Version 14.4
Provider Advanced Micro Devices, Inc.
2D Driver Version 8.01.01.1390
Direct3D Version 9.14.10.01029
OpenGL Version 6.14.10.12874
AMD Catalyst Control Center Version 2014.0417.2226.38446
AMD Audio Driver Version 7.12.0.7718[/edit]
JimF
Posts: 652
Joined: Thu Jan 21, 2010 2:03 pm

Re: BSODs on GPU jobs

Post by JimF »

My experience with BSODs usually implicates hardware problems, which can include over-heating. Are you overclocking either the GPU or the CPU? Warmer weather could bring out latent problems. Less likely, but also possible, are memory errors (either in the DDR3 main memory, or the video card memory).
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: BSODs on GPU jobs

Post by bruce »

(Even without overclocking,) when you added the GPU card, did you do anything to make sure that the fans would be able handle the increased heat without an increase in internal temperatures?
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: BSODs on GPU jobs

Post by PantherX »

I do hope that your PSU is able to power the system once loaded. BTW, have you considered running other stability benchmarks to see if this is a F@H only issue or a stability issue in general? Finally, did you choose the updated version of 14.4 WHQL drivers or the initial release?
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Vester
Posts: 79
Joined: Sun Dec 02, 2007 4:29 pm
Hardware configuration: Gigabyte Aorus Z590 Pro AX, Intel i9-10850K, 32GB Crucial Ballistix DDR4-2600, Samsung NVMe EVO 980 Pro 256GB, CoolerMaster liquid cooler ML360, Nvidia Titan X (Pascal), Dell Nvidia RTX 3080 10GB 4Y12V, Pop!_OS.
Location: Fair Play, SC

Re: BSODs on GPU jobs

Post by Vester »

I have an 800 watt power supply. This CPU has never been overclocked. CPU temp (liquid cooler) is 52C and GPU temp is 60C. I run without the side cover. This morning I uninstalled the video drivers and Catalyst Control Center and reinstalled using a new download of 14-4-win7-win8-win8.1-64-dd-ccc-whql.exe. It has been running satisfactorily since. This is an error I found in Event Viewer | Applications:
Faulting application name: FahCore_a3.exe, version: 0.0.0.0, time stamp: 0x4d4720af
Faulting module name: FahCore_a3.exe, version: 0.0.0.0, time stamp: 0x4d4720af
Exception code: 0xc0000005
Fault offset: 0x00261bf6
Faulting process id: 0xb2c
Faulting application start time: 0x01cf8db6bf51d914
Faulting application path: C:\Users\Vester\AppData\Roaming\FAHClient\cores\web.stanford.edu\~pande\Win32\AMD64\Core_a3.fah\FahCore_a3.exe
Faulting module path: C:\Users\Vester\AppData\Roaming\FAHClient\cores\web.stanford.edu\~pande\Win32\AMD64\Core_a3.fah\FahCore_a3.exe
Report Id: ca1dcce0-fa2b-11e3-a5d1-9cb70d9c72c4
The message could be related to the recovery using a six day old image of the drive.

After the BSOD early today, I restarted (before computer testing/reinstalling) and had a popup when the desktop loaded that said Fah had faulted. We'll see how it goes. I was a moderator on another project and I often said, "The client won't break your computer but it will reveal computer problems." I've been sorting this computer for 20 months. Thanks.
JimF
Posts: 652
Joined: Thu Jan 21, 2010 2:03 pm

Re: BSODs on GPU jobs

Post by JimF »

I don't run CPU jobs, but that error could be a result of the crash rather than the cause. I practically never see errors in the GPU jobs and don't even think of them as a possible cause, though it could happen of course. Let us know if you discover any hardware problems (not due to temperature though!).
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Re: BSODs on GPU jobs

Post by P5-133XL »

The 0xc00000005 is a conventional memory access error and the FAHCore_A3 is a CPU folding core so I'm not sure why the BSOD would have anything to do with the GPU unless we are dealing with power issues where the video card is pulling so much current that it is dropping the voltage for the CPU/MB/RAM. May I suggest that you check your PS rails to make sure that the video card is on a separate rail from everything else.
Image
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: BSODs on GPU jobs

Post by bruce »

Vester wrote:I have had three BSODs on GPU jobs in the last two days related to fah.exe. I have re-imaged my HD to one week ago, removed F@H, and tested my computer and found no problems on my end.
You do realize that there's a false assumption there. You didn't test the computer under identical conditions. Put simply, a computer running FAH is drawing more power and generating more heat and undergoing more double-checking of results than a computer that's not running FAH.
davidcoton
Posts: 1102
Joined: Wed Nov 05, 2008 3:19 pm
Location: Cambridge, UK

Re: BSODs on GPU jobs

Post by davidcoton »

Why do you think the BSODs were related to GPU jobs? The one Win log extract relates to a CPU job (Core_A3). Do you have any other error logs?

Memory access errors are often the result of failing RAM, or sometimes the motherboard circuitry. Run a memory check, preferably for several hours. Check memory is correctly seated. If errors persist, remove or swap memory. I have seen machines fail with two identical sticks, but work with either one alone. Otherwise, check power distribution and voltages on load as P5-133XL suggests above. Could also be a local hotspot near the memory, but this is unlikely with the temp you noted.

However, if the machine is now working properly after driver update, and if that was the only change since faults were appearing, there could have been a problem in the previous driver version that FAH revealed. I don't think the Win log records enough information to determine the actual faulty software module -- at least not in a form mere mortals can interpret.
Image
Vester
Posts: 79
Joined: Sun Dec 02, 2007 4:29 pm
Hardware configuration: Gigabyte Aorus Z590 Pro AX, Intel i9-10850K, 32GB Crucial Ballistix DDR4-2600, Samsung NVMe EVO 980 Pro 256GB, CoolerMaster liquid cooler ML360, Nvidia Titan X (Pascal), Dell Nvidia RTX 3080 10GB 4Y12V, Pop!_OS.
Location: Fair Play, SC

Re: BSODs on GPU jobs

Post by Vester »

OK, accept this statement: "This computer crashed three times while crunching seven CPU jobs and one GPU job. I am done. I don't have time for it anymore. I have a new boat. Bye."
DeeGee
Posts: 61
Joined: Thu Oct 02, 2008 1:15 pm
Hardware configuration: Asus Crosshair Hero VIII, AMD Ryzen 3950x, 2x8GB 3600MHz DDR4, Radeon VII, Win10
Asus Crosshair Hero VII, Amd Ryzen 3900x, 2x16GB 3200MHz DDR4, GeForce 980 TI, Kubuntu 19.10
Location: Finland

Re: BSODs on GPU jobs

Post by DeeGee »

Although this is bit late, all of my BSODs have happened on 14.xx series drivers. Even the newest 14.6's cause BSOD together with folding and firefox/flash. 13.12 drivers have been bluescreen free for me, so I'll probably will be going back to them.
JimF
Posts: 652
Joined: Thu Jan 21, 2010 2:03 pm

Re: BSODs on GPU jobs

Post by JimF »

I have been Folding on two HD 7790s on the same motherboard with the 14.4 drivers (Win7 64-bit) since they came out with no problems. A couple of days ago I switched the cards from a Haswell machine to an Ivy Bridge machine. These are the Power Color cards, and they run hotter than most cards (typically 70 to 80 C), but with a side case fan on them that has not been a problem thus far.
Post Reply