Crashes lately

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

Post Reply
mezhaka
Posts: 3
Joined: Wed Mar 28, 2012 3:10 pm

Crashes lately

Post by mezhaka »

I do see the

Code: Select all

FahCore returned an unknown error code which probably indicates that it crashed
for about last month. Is there any way to cope this?
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona
Contact:

Re: Crashes lately

Post by 7im »

Hello mezhaka, welcome to the forum.

There are more than one possible cause. Please post the ***System*** section of the log file, and then please describe the OS and HW config of this machine. Please also post a bit more more the log before and after the error, so we can see what the client was doing when this failed. Thanks.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
scottwilkins
Posts: 6
Joined: Tue May 13, 2008 3:23 am

Re: Crashes lately

Post by scottwilkins »

I saw this when installed as a service. I un-installed and re-installed using normal configuration and it works perfectly now.
This is my sig...
mezhaka
Posts: 3
Joined: Wed Mar 28, 2012 3:10 pm

Re: Crashes lately

Post by mezhaka »

7im wrote:Hello mezhaka, welcome to the forum.

There are more than one possible cause. Please post the ***System*** section of the log file, and then please describe the OS and HW config of this machine. Please also post a bit more more the log before and after the error, so we can see what the client was doing when this failed. Thanks.
I am no sure what do you mean by ***System*** section, but hope this is it:

Code: Select all

02:11:55:WU00:FS00:Running FahCore: "E:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/anton/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/NVIDIA/Fermi/Core_15.fah/FahCore_15.exe -dir 00 -suffix 01 -version 701 -lifeline 7668 -checkpoint 15 -gpu 0
02:11:55:WU00:FS00:Started FahCore on PID 5912
02:11:55:WU00:FS00:Core PID:6356
02:11:55:WU00:FS00:FahCore 0x15 started
02:11:56:WU00:FS00:0x15:
02:11:56:WU00:FS00:0x15:*------------------------------*
02:11:56:WU00:FS00:0x15:Folding@Home GPU Core
02:11:56:WU00:FS00:0x15:Version                2.22 (Thu Dec 8 17:08:05 PST 2011)
02:11:56:WU00:FS00:0x15:Build host             SimbiosNvdWin7
02:11:56:WU00:FS00:0x15:Board Type             NVIDIA/CUDA
02:11:56:WU00:FS00:0x15:Core                   15
02:11:56:WU00:FS00:0x15:
02:11:56:WU00:FS00:0x15:Window's signal control handler registered.
02:11:56:WU00:FS00:0x15:Preparing to commence simulation
02:11:56:WU00:FS00:0x15:- Ensuring status. Please wait.
02:12:05:WU00:FS00:0x15:- Looking at optimizations...
02:12:05:WU00:FS00:0x15:- Working with standard loops on this execution.
02:12:05:WU00:FS00:0x15:- Previous termination of core was improper.
02:12:05:WU00:FS00:0x15:- Going to use standard loops.
02:12:05:WU00:FS00:0x15:- Files status OK
02:12:05:WU00:FS00:0x15:sizeof(CORE_PACKET_HDR) = 512 file=<>
02:12:05:WU00:FS00:0x15:- Expanded 119778 -> 542246 (decompressed 452.7 percent)
02:12:05:WU00:FS00:0x15:Called DecompressByteArray: compressed_data_size=119778 data_size=542246, decompressed_data_size=542246 diff=0
02:12:05:WU00:FS00:0x15:- Digital signature verified
02:12:05:WU00:FS00:0x15:
02:12:05:WU00:FS00:0x15:Project: 8034 (Run 8, Clone 0, Gen 17)
02:12:05:WU00:FS00:0x15:
02:12:05:WU00:FS00:0x15:Entering M.D.
02:12:07:WU00:FS00:0x15:Tpr hash 00/wudata_01.tpr:  3722898689 317970079 1029131593 2824764440 902500743
02:12:07:WU00:FS00:0x15:GPU device info: vendor=0 device=0 name=<NA> match=0
02:12:07:WU00:FS00:0x15:Working on Protein
02:12:07:WU00:FS00:0x15:Client config unavailable.
02:12:07:WU00:FS00:FahCore returned: UNKNOWN_ENUM (-1 = 0xffffffff)
02:12:07:WARNING:WU00:FS00:FahCore returned an unknown error code which probably indicates that it crashed
02:12:07:WARNING:WU00:FS00:Too many errors, failing
02:12:07:WU00:FS00:Sending unit results: id:00 state:SEND error:FAILED project:8034 run:8 clone:0 gen:17 core:0x15 unit:0x000000126953ee2e4f72de226aa1212a
02:12:07:WU00:FS00:Connecting to 171.67.108.142:8080
02:12:08:WU00:FS00:Server responded WORK_QUIT (404)
02:12:08:WARNING:WU00:FS00:Server did not like results, dumping
02:12:08:WU00:FS00:Cleaning up

I am on 64-bit Windows 7. Here's the System Summary:

OS Name Microsoft Windows 7 Professional
Version 6.1.7601 Service Pack 1 Build 7601
Processor Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz, 3401 Mhz, 4 Core(s), 8 Logical Processor(s)
BIOS Version/Date American Megatrends Inc. 1606, 4/26/2011
SMBIOS Version 2.6
Windows Directory C:\Windows
System Directory C:\Windows\system32
Boot Device \Device\HarddiskVolume1
Locale United States
Hardware Abstraction Layer Version = "6.1.7601.17514"
Time Zone W. Europe Daylight Time
Installed Physical Memory (RAM) 8.00 GB
Total Physical Memory 7.98 GB
Available Physical Memory 1.51 GB
Total Virtual Memory 16.0 GB
Available Virtual Memory 9.49 GB
Page File Space 7.98 GB
bollix47
Posts: 2941
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Crashes lately

Post by bollix47 »

If you Refresh the log the system section including config can be seen by scrolling up to the top.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Crashes lately

Post by bruce »

bollix47 wrote:If you Refresh the log the system section including config can be seen by scrolling up to the top.
. . . or by clicking on the System Info tab in FAHControl.
bollix47
Posts: 2941
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Crashes lately

Post by bollix47 »

True but not easy to copy/paste and doesn't include the config section. At least I haven't found a way to copy/paste it on my setup. Otherwise, it is the easiest and fastest way to see how your system is being recognized by v7.
Ripper36
Posts: 60
Joined: Sun Sep 18, 2011 8:55 am

Re: Crashes lately

Post by Ripper36 »

There have been very many experiences of this UNKNOWN_ENUM error when running 803x projects on NVIDIA GPUs. I have had many, on specific machines, but not on others, and not all the time - is it a sporadic error for you, or one that occurs always?

Things to check: that your PSU is giving the GPU enough power (these are very power-intensive units); that you have both display and CPU power saving options turned OFF; try running without folding on the CPU, or SMP set to 4 or 6, to test if that makes a difference.

If you look through the forum you will find many more suggestions. Some hold the view that it is the NVIDIA driver, and rolling back to 280.26 helps, but I have found that that doesn't make a difference. Good luck!
Image
mezhaka
Posts: 3
Joined: Wed Mar 28, 2012 3:10 pm

Re: Crashes lately

Post by mezhaka »

So this is the System Info from the Log:

Code: Select all

08:43:14:******************************** Build ********************************
08:43:14:      Version: 7.1.52
08:43:14:         Date: Mar 20 2012
08:43:14:         Time: 19:37:42
08:43:14:      SVN Rev: 3515
08:43:14:       Branch: fah/trunk/client
08:43:14:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
08:43:14:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
08:43:14:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
08:43:14:     Platform: win32 XP
08:43:14:         Bits: 32
08:43:14:         Mode: Release
08:43:14:******************************* System ********************************
08:43:14:          CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
08:43:14:       CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
08:43:14:         CPUs: 8
08:43:14:       Memory: 7.98GiB
08:43:14:  Free Memory: 6.25GiB
08:43:14:      Threads: WINDOWS_THREADS
08:43:14:   On Battery: false
08:43:14:   UTC offset: 2
08:43:14:          PID: 4272
08:43:14:          CWD: C:/Users/anton/AppData/Roaming/FAHClient
08:43:14:           OS: Windows 7 Professional
08:43:14:      OS Arch: AMD64
08:43:14:         GPUs: 1
08:43:14:        GPU 0: FERMI:1 GF116 [GeForce GTX 550 Ti]
08:43:14:         CUDA: 2.1
08:43:14:  CUDA Driver: 4020
08:43:14:Win32 Service: false
08:43:14:***********************************************************************
Concerning the power -- I have no idea how to check if it's "enough" power, i.e. how much is enough?

In general I am not that interested to investigate into this problem on my own. Last months my client stays idle.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Crashes lately

Post by bruce »

mezhaka wrote:Concerning the power -- I have no idea how to check if it's "enough" power, i.e. how much is enough?

In general I am not that interested to investigate into this problem on my own. Last months my client stays idle.
I'm not sure how we can help you. Stasnford does not have technicians who will diagnose problems with your hardward. We can guess that you GPU is overclocked or more power is being used than can be provided by your power supply or the heat in the vicinity of the GPU is more than it's happy with, but without your active participation, all we can do is offer suggestions. When a GPU is unstable under load, the cause cannot be determined remotely without you giving us enough information to determine and fix the cause.
Post Reply