EUE, Unstable Machine, Faulty Work Unit

It seems that a lot of GPU problems revolve around specific versions of drivers. Though NVidia has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

Post Reply
hiigaran
Posts: 134
Joined: Thu Nov 17, 2011 6:01 pm

EUE, Unstable Machine, Faulty Work Unit

Post by hiigaran »

So I get this a little too frequently:

Code: Select all

10:05:15:WU00:FS00:0x15:Run: exception thrown in GuardedRun -- cannot continue further. 
10:05:15:WU00:FS00:0x15:Going to send back what have done -- stepsTotalG=40000000 
10:05:15:WU00:FS00:0x15:Work fraction=0.1632 steps=40000000. 
10:05:19:WU00:FS00:0x15:logfile size=15131 infoLength=15131 edr=0 trr=23 
10:05:19:WU00:FS00:0x15:+ Opened results file 
10:05:19:WU00:FS00:0x15:- Writing 15667 bytes of core data to disk... 
10:05:20:WU00:FS00:0x15:Done: 15155 -> 4835 (compressed to 31.9 percent) 
10:05:20:WU00:FS00:0x15:  ... Done. 
10:05:20:WU00:FS00:0x15:DeleteFrameFiles: successfully deleted file=00/wudata_01.ckp 
10:05:20:WU00:FS00:0x15: 
10:05:20:WU00:FS00:0x15:Folding@home Core Shutdown: UNSTABLE_MACHINE 
10:05:20:WARNING:WU00:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a) 
10:05:20:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:7626 run:260 clone:0 gen:130 core:0x15 unit:0x000000a7664f2dd14fe61b3d5d802508
GTX 670, no overclock, no issues in games, and temperatures are never over 75. Yet I always get work units dumped after a driver crash, both before, and after upgrading drivers (currently 310.90).

Ideas?
Last edited by bruce on Tue Feb 12, 2013 1:16 am, edited 1 time in total.
Reason: Added [code] tags
art_l_j_PlanetAMD64
Posts: 472
Joined: Sun May 30, 2010 2:28 pm

Re: EUE, Unstable Machine, Faulty Work Unit

Post by art_l_j_PlanetAMD64 »

hiigaran wrote:GTX 670, no overclock, no issues in games, and temperatures are never over 75. Yet I always get work units dumped after a driver crash, both before, and after upgrading drivers (currently 310.90).

Ideas?
The 310.xx drivers are a known problem on the NVidia GeForce Forums, please see this information about it. Many of those users are able to restore normal operation of their system by going back to version 306.97.

I use version 306.97 on all of my Windows 7 folding machines, which you can see here.

I would run version 306.97 on everything, but that version is not available for Windows XP (it has 306.81), and the Debian Linux/Wine/GPU3 systems require an older driver (275.43), to match the v6.41 GPU3 client and the v2 Linux kernel.

Folding is much more stressful on a GPU than anything else, so it is not unheard-of to have a situation like yours, where Folding is the only application experiencing a problem.

Art
art_l_j_PlanetAMD64
Over 1.04 Billion Total Points
Over 185,000 Work Units
Over 3,800,000 PPD
Overall rank (if points are combined) 20 of 1721690
In memory of my Mother May 12th 1923 - February 10th 2012
hiigaran
Posts: 134
Joined: Thu Nov 17, 2011 6:01 pm

Re: EUE, Unstable Machine, Faulty Work Unit

Post by hiigaran »

See, that's the problem. I don't think the drivers are the issue. Yes, I'm having the issue on the 310.90 drivers, but I have also had the issue on two older drivers as well. I don't remember what specific versions they were, but I do know they had different major versions, and the last time I updated was probably half a year ago.
bollix47
Posts: 2941
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: EUE, Unstable Machine, Faulty Work Unit

Post by bollix47 »

Have you tried increasing the fan speed using a utility like MSIAfterburner?

The current temperature on my 670 is 59C but I have manually adjusted the fan to 70%. I too was getting driver crashes before decreasing the temperature. Although the max temperature spec for a 670 is 97C I think you may be having a heat problem at 75C. I know I've had GPUs that produced folding problems when the temperature got anywhere near 70C while others worked fine at that temperature or even much higher.

Also, have a look at this thread for similar error code.

Run a memory test.

Clean install.
hiigaran
Posts: 134
Joined: Thu Nov 17, 2011 6:01 pm

Re: EUE, Unstable Machine, Faulty Work Unit

Post by hiigaran »

I already use Afterburner for a custom fan profile. its at around 70-75% between 70 and 76C, but once it goes past that, it kicks itself up to 80% and beyond on a steep ramp. I've got it set to force a fan update every 5 seconds. If I leave the computer and the drivers crash, the fan speeds would revert to defaults without the forcing.
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4 4x512GB Ram, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for aprox. 70K PPD
Location: Salem. OR USA

Re: EUE, Unstable Machine, Faulty Work Unit

Post by P5-133XL »

It really sounds like you have a flaky card. If it is stock, you may have to underclock to make it more reliable. I agree that you should run that Memory test. It is not a perfect test, but it may help you verify a possible HW issue.
Image
art_l_j_PlanetAMD64
Posts: 472
Joined: Sun May 30, 2010 2:28 pm

Re: EUE, Unstable Machine, Faulty Work Unit

Post by art_l_j_PlanetAMD64 »

bollix47 wrote:Have you tried increasing the fan speed using a utility like MSI Afterburner?

The current temperature on my 670 is 59C but I have manually adjusted the fan to 70%. I too was getting driver crashes before decreasing the temperature. Although the max temperature spec for a 670 is 97C I think you may be having a heat problem at 75C. I know I've had GPUs that produced folding problems when the temperature got anywhere near 70C while others worked fine at that temperature or even much higher.

Also, have a look at this thread for similar error code.

Run a memory test.

Clean install.
This is very good advice, I run all of my fan controls manually. Call me a worrywart, but I get nervous when any of my GPUs goes over 65C! :egeek: And that's even on the 762x WUs.
art_l_j_PlanetAMD64
Over 1.04 Billion Total Points
Over 185,000 Work Units
Over 3,800,000 PPD
Overall rank (if points are combined) 20 of 1721690
In memory of my Mother May 12th 1923 - February 10th 2012
hiigaran
Posts: 134
Joined: Thu Nov 17, 2011 6:01 pm

Re: EUE, Unstable Machine, Faulty Work Unit

Post by hiigaran »

I still haven't gotten around to this, but I've been having other issues with F@H. I used to run the v7 client as a service, without any issues for either my SMP or GPU slots. However, two days ago, I turned on the computer, and noticed a distinct lack of fan noise. FAHControl was not in my systray, and the service had somehow disappeared from my services list. So I tried to open it manually from the installation directory (ie, clicking FAHControl). It opened, but it remained stuck on 'connecting'. So I uninstalled and reinstalled in the same way I had it before, with it starting as a service, and starting FAHControl on startup. SMP slot works fine, but the GPU slot is stuck on 'ready'. According to the logs, there is a GPU memtest error, but what I don't understand is why it would say that when I chose 'install as a service' in the installation, when after reinstalling again, I chose the recommended option (the first one...I forgot the name of it), and it works just fine.

Just a whole bunch of strange occurrences. Might explain the driver crashes I was having, because now that I'm not running it as a service, I haven't been getting them. But how did the service get removed in the first place?

Technology...
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: EUE, Unstable Machine, Faulty Work Unit

Post by PantherX »

If you are using Windows XP, then the GPU can fold if F@H has been installed as a service. However, if you are running Windows Vista or higher, then you can not install F@H as a Service and expect the GPU to fold. This is a security feature from Microsoft. However, the CPU can continue to fold even if you install F@H as a service in all Windows OS.

The default installation is to start F@H at boot time (not as a service).

You may want to post the log showing the error and the configuration of your system and client (initial section of the log). Furthermore, you can always do a fresh installation with default settings (Finish any WU if you have been assigned then uninstall the software and select the option to delete the data files) and see if that helps you.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
hiigaran
Posts: 134
Joined: Thu Nov 17, 2011 6:01 pm

Re: EUE, Unstable Machine, Faulty Work Unit

Post by hiigaran »

Just an update on this, I disabled the beta slot option on my GPU, and I haven't had any issues since.
Post Reply