Page 1 of 1

UNKNOWN_ENUM (-1073740791 = 0xc0000409)

Posted: Fri Sep 25, 2020 6:04 pm
by gs60
Hello,

This is my first wu failure (which is good!). Just posting the issue here incase there is a problem with the wu, but I suspect the problem was an unceremonious shutdown of windows 10. The system was off when I came back from lunch. A review of the system log in the event viewer did not show any errors prior to the restart so windows was in the dark as to what happened (what's new about that, lol). I'm going to assume a power failure of some kind. The system is not yet on a ups, but that will change next time I'm in town. Here are the fah logs just in case.

Code: Select all

*********************** Log Started 2020-09-25T17:35:42Z ***********************
17:35:42:Trying to access database...
17:35:42:Successfully acquired database lock
17:35:42:Read GPUs.txt
17:35:42:Enabled folding slot 00: READY cpu:10
17:35:42:Enabled folding slot 01: READY gpu:0:Ellesmere XT [Radeon RX 470/480/570/580/590]
17:35:42:****************************** FAHClient ******************************
17:35:42:        Version: 7.6.13
17:35:42:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
17:35:42:      Copyright: 2020 foldingathome.org
17:35:42:       Homepage: https://foldingathome.org/
17:35:42:           Date: Apr 27 2020
17:35:42:           Time: 21:21:01
17:35:42:       Revision: 5a652817f46116b6e135503af97f18e094414e3b
17:35:42:         Branch: master
17:35:42:       Compiler: Visual C++ 2008
17:35:42:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
17:35:42:       Platform: win32 10
17:35:42:           Bits: 32
17:35:42:           Mode: Release
17:35:42:         Config: C:\Users\gary\AppData\Roaming\FAHClient\config.xml
17:35:42:******************************** CBang ********************************
17:35:42:           Date: Apr 24 2020
17:35:42:           Time: 17:07:55
17:35:42:       Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
17:35:42:         Branch: master
17:35:42:       Compiler: Visual C++ 2008
17:35:42:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
17:35:42:       Platform: win32 10
17:35:42:           Bits: 32
17:35:42:           Mode: Release
17:35:42:******************************* System ********************************
17:35:42:            CPU: AMD Ryzen 5 3600 6-Core Processor
17:35:42:         CPU ID: AuthenticAMD Family 23 Model 113 Stepping 0
17:35:42:           CPUs: 12
17:35:42:         Memory: 7.93GiB
17:35:42:    Free Memory: 6.22GiB
17:35:42:        Threads: WINDOWS_THREADS
17:35:42:     OS Version: 6.2
17:35:42:    Has Battery: false
17:35:42:     On Battery: false
17:35:42:     UTC Offset: -7
17:35:42:            PID: 8288
17:35:42:            CWD: C:\Users\gary\AppData\Roaming\FAHClient
17:35:42:  Win32 Service: false
17:35:42:             OS: Windows 10 Home
17:35:42:        OS Arch: AMD64
17:35:42:           GPUs: 1
17:35:42:          GPU 0: Bus:8 Slot:0 Func:0 AMD:5 Ellesmere XT [Radeon RX
17:35:42:                 470/480/570/580/590]
17:35:42:           CUDA: Not detected: Failed to open dynamic library 'nvcuda.dll': The
17:35:42:                 specified module could not be found.
17:35:42:
17:35:42:OpenCL Device 0: Platform:0 Device:0 Bus:8 Slot:0 Compute:1.2 Driver:3110.7
17:35:42:******************************* libFAH ********************************
17:35:42:           Date: Apr 15 2020
17:35:42:           Time: 14:53:14
17:35:42:       Revision: 216968bc7025029c841ed6e36e81a03a316890d3
17:35:42:         Branch: master
17:35:42:       Compiler: Visual C++ 2008
17:35:42:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
17:35:42:       Platform: win32 10
17:35:42:           Bits: 32
17:35:42:           Mode: Release
17:35:42:***********************************************************************
17:35:42:<config>
17:35:42:  <!-- HTTP Server -->
17:35:42:  <allow v='127.0.0.1 192.168.43.0/24'/>
17:35:42:
17:35:42:  <!-- Network -->
17:35:42:  <proxy v=':8080'/>
17:35:42:
17:35:42:  <!-- Remote Command Server -->
17:35:42:  <command-allow-no-pass v='127.0.0.1 192.168.43.0/24'/>
17:35:42:
17:35:42:  <!-- User Information -->
17:35:42:  <passkey v='*****'/>
17:35:42:  <team v='259095'/>
17:35:42:  <user v='Gary_And_Shirley'/>
17:35:42:
17:35:42:  <!-- Folding Slots -->
17:35:42:  <slot id='0' type='CPU'/>
17:35:42:  <slot id='1' type='GPU'/>
17:35:42:</config>

...

17:38:46:WU01:FS00:0xa7:Calling: mdrun -s frame547.tpr -o frame547.trr -cpi state.cpt -cpt 15 -nt 10
17:38:46:WU01:FS00:0xa7:ERROR:Guru Meditation #66a7f55ebf5efa77.7f32d71c2bc399c3 (6081.7376) '01/01/pullx.xvg'
17:38:47:WARNING:WU01:FS00:FahCore returned an unknown error code which probably indicates that it crashed
17:38:47:WARNING:WU01:FS00:FahCore returned: UNKNOWN_ENUM (-1073740791 = 0xc0000409)
17:38:47:WARNING:WU01:FS00:Too many errors, failing
17:38:47:WU01:FS00:Sending unit results: id:01 state:SEND error:FAILED project:14379 run:2580 clone:0 gen:547 core:0xa7 unit:0x00000260455e42075e932f5dfabd5088


I read on this thread, which makes it more seem like an external event. viewtopic.php?nomobile=1&f=96&t=30634

* Update, I confirmed with the girlfriend that the power took a quick drop and must have occurred when fahcore was taking a checkpoint so it was unable to recover from an earlier good point. Sorry to bother ya'll!

Thank you.

Re: UNKNOWN_ENUM (-1073740791 = 0xc0000409)

Posted: Fri Sep 25, 2020 9:39 pm
by JohnChodera
Thanks for the update, and let us know if you end up with something like this again!

~ John Chodera // MSKCC

Re: UNKNOWN_ENUM (-1073740791 = 0xc0000409)

Posted: Sat Sep 26, 2020 5:51 am
by bruce
"ERROR:Guru Meditation" means that a file was corrupt. An unceremonious shutdown will do that and your UPS will give the system to do a clean shutdown if (or when) it happens again. Files are generally cached which means power needs do stay on a little longer after the notification of a pending shutdown is delivered --- until the in-core buffers can be synced to the HD.

Re: UNKNOWN_ENUM (-1073740791 = 0xc0000409)

Posted: Sat Sep 26, 2020 8:10 am
by foldy
Maybe FAHcore could also keep the previous checkpoint until next checkpoint completed successfully?

Re: UNKNOWN_ENUM (-1073740791 = 0xc0000409)

Posted: Sat Sep 26, 2020 9:21 am
by PantherX
Considering that FahCore_a7 has been superseded by FahCore_a8, it might be able to have such a feature implemented in it.