project:14445 run:0 clone:1930 gen:2 DUMPED

Moderators: Site Moderators, FAHC Science Team

esfishox
Posts: 5
Joined: Fri Mar 27, 2020 4:16 am
Hardware configuration: Gigabyte Windforce RTX 4080 on Ubuntu 22.04
Gigabyte WC RTX 3080 LHR on Windows 11

Post by esfishox »

WARNING:Unexpected exit from science code
I'm trying to understand why this WU was dumped. Is it a problem with the WU itself? It's a bummer that it made it to 98% before dumping. This machine is not overclocked or underclocked, and it runs at stock voltage.

Code: Select all

15:17:31:WU00:FS01:Download complete
15:17:31:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:14445 run:0 clone:1930 gen:2 core:0x22 unit:0x0000000303854c135ea7b890a5de189e
15:17:31:WU00:FS01:Starting
15:17:31:WU00:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 00 -suffix 01 -version 706 -lifeline 1589 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
15:17:31:WU00:FS01:Started FahCore on PID 25599
15:17:31:WU00:FS01:Core PID:25603
15:17:31:WU00:FS01:FahCore 0x22 started
15:17:31:WU00:FS01:0x22:*********************** Log Started 2020-05-08T15:17:31Z ***********************
15:17:31:WU00:FS01:0x22:*************************** Core22 Folding@home Core ***************************
15:17:31:WU00:FS01:0x22:       Type: 0x22
15:17:31:WU00:FS01:0x22:       Core: Core22
15:17:31:WU00:FS01:0x22:    Website: https://foldingathome.org/
15:17:31:WU00:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
15:17:31:WU00:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
15:17:31:WU00:FS01:0x22:             <rafal.wiewiora@choderalab.org>
15:17:31:WU00:FS01:0x22:       Args: -dir 00 -suffix 01 -version 706 -lifeline 25599 -checkpoint 15
15:17:31:WU00:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
15:17:31:WU00:FS01:0x22:             0 -gpu 0
15:17:31:WU00:FS01:0x22:     Config: <none>
15:17:31:WU00:FS01:0x22:************************************ Build *************************************
15:17:31:WU00:FS01:0x22:    Version: 0.0.5
15:17:31:WU00:FS01:0x22:       Date: Apr 22 2020
15:17:31:WU00:FS01:0x22:       Time: 03:57:11
15:17:31:WU00:FS01:0x22: Repository: Git
15:17:31:WU00:FS01:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
15:17:31:WU00:FS01:0x22:     Branch: HEAD
15:17:31:WU00:FS01:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
15:17:31:WU00:FS01:0x22:    Options: -std=c++11 -O3 -funroll-loops
15:17:31:WU00:FS01:0x22:   Platform: linux2 4.19.76-linuxkit
15:17:31:WU00:FS01:0x22:       Bits: 64
15:17:31:WU00:FS01:0x22:       Mode: Release
15:17:31:WU00:FS01:0x22:************************************ System ************************************
15:17:31:WU00:FS01:0x22:        CPU: Intel(R) Core(TM) i5-2500K CPU @ 3.30GHz
15:17:31:WU00:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
15:17:31:WU00:FS01:0x22:       CPUs: 4
15:17:31:WU00:FS01:0x22:     Memory: 31.33GiB
15:17:31:WU00:FS01:0x22:Free Memory: 28.62GiB
15:17:31:WU00:FS01:0x22:    Threads: POSIX_THREADS
15:17:31:WU00:FS01:0x22: OS Version: 4.15
15:17:31:WU00:FS01:0x22:Has Battery: false
15:17:31:WU00:FS01:0x22: On Battery: false
15:17:31:WU00:FS01:0x22: UTC Offset: 0
15:17:31:WU00:FS01:0x22:        PID: 25603
15:17:31:WU00:FS01:0x22:        CWD: /var/lib/fahclient/work
15:17:31:WU00:FS01:0x22:         OS: Linux 4.15.0-96-generic x86_64
15:17:31:WU00:FS01:0x22:    OS Arch: AMD64
15:17:31:WU00:FS01:0x22:********************************************************************************
15:17:31:WU00:FS01:0x22:Project: 14445 (Run 0, Clone 1930, Gen 2)
15:17:31:WU00:FS01:0x22:Unit: 0x0000000303854c135ea7b890a5de189e
15:17:31:WU00:FS01:0x22:Reading tar file core.xml
15:17:31:WU00:FS01:0x22:Reading tar file integrator.xml
15:17:31:WU00:FS01:0x22:Reading tar file state.xml
15:17:32:WU00:FS01:0x22:Reading tar file system.xml
15:17:32:WU00:FS01:0x22:Digital signatures verified
15:17:32:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
15:17:32:WU00:FS01:0x22:Version 0.0.5
15:17:36:WU01:FS01:Upload 53.12%
15:17:42:WU01:FS01:Upload 57.92%
15:17:47:WU00:FS01:0x22:Completed 0 out of 2000000 steps (0%)
15:17:47:WU00:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
...
22:31:30:WU00:FS01:0x22:Completed 1940000 out of 2000000 steps (97%)
22:35:58:WU00:FS01:0x22:Completed 1960000 out of 2000000 steps (98%)
22:36:02:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
22:36:02:WU00:FS01:Starting
22:36:02:WU00:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 00 -suffix 01 -version 706 -lifeline 1589 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
22:36:02:WU00:FS01:Started FahCore on PID 32646
22:36:02:WU00:FS01:Core PID:32650
22:36:02:WU00:FS01:FahCore 0x22 started
22:36:03:WU00:FS01:0x22:*********************** Log Started 2020-05-08T22:36:02Z ***********************
22:36:03:WU00:FS01:0x22:*************************** Core22 Folding@home Core ***************************
22:36:03:WU00:FS01:0x22:       Type: 0x22
22:36:03:WU00:FS01:0x22:       Core: Core22
22:36:03:WU00:FS01:0x22:    Website: https://foldingathome.org/
22:36:03:WU00:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
22:36:03:WU00:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
22:36:03:WU00:FS01:0x22:             <rafal.wiewiora@choderalab.org>
22:36:03:WU00:FS01:0x22:       Args: -dir 00 -suffix 01 -version 706 -lifeline 32646 -checkpoint 15
22:36:03:WU00:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
22:36:03:WU00:FS01:0x22:             0 -gpu 0
22:36:03:WU00:FS01:0x22:     Config: <none>
22:36:03:WU00:FS01:0x22:************************************ Build *************************************
22:36:03:WU00:FS01:0x22:    Version: 0.0.5
22:36:03:WU00:FS01:0x22:       Date: Apr 22 2020
22:36:03:WU00:FS01:0x22:       Time: 03:57:11
22:36:03:WU00:FS01:0x22: Repository: Git
22:36:03:WU00:FS01:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
22:36:03:WU00:FS01:0x22:     Branch: HEAD
22:36:03:WU00:FS01:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
22:36:03:WU00:FS01:0x22:    Options: -std=c++11 -O3 -funroll-loops
22:36:03:WU00:FS01:0x22:   Platform: linux2 4.19.76-linuxkit
22:36:03:WU00:FS01:0x22:       Bits: 64
22:36:03:WU00:FS01:0x22:       Mode: Release
22:36:03:WU00:FS01:0x22:************************************ System ************************************
22:36:03:WU00:FS01:0x22:        CPU: Intel(R) Core(TM) i5-2500K CPU @ 3.30GHz
22:36:03:WU00:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
22:36:03:WU00:FS01:0x22:       CPUs: 4
22:36:03:WU00:FS01:0x22:     Memory: 31.33GiB
22:36:03:WU00:FS01:0x22:Free Memory: 28.52GiB
22:36:03:WU00:FS01:0x22:    Threads: POSIX_THREADS
22:36:03:WU00:FS01:0x22: OS Version: 4.15
22:36:03:WU00:FS01:0x22:Has Battery: false
22:36:03:WU00:FS01:0x22: On Battery: false
22:36:03:WU00:FS01:0x22: UTC Offset: 0
22:36:03:WU00:FS01:0x22:        PID: 32650
22:36:03:WU00:FS01:0x22:        CWD: /var/lib/fahclient/work
22:36:03:WU00:FS01:0x22:         OS: Linux 4.15.0-96-generic x86_64
22:36:03:WU00:FS01:0x22:    OS Arch: AMD64
22:36:03:WU00:FS01:0x22:********************************************************************************
22:36:03:WU00:FS01:0x22:Project: 14445 (Run 0, Clone 1930, Gen 2)
22:36:03:WU00:FS01:0x22:Unit: 0x0000000303854c135ea7b890a5de189e
22:36:03:WU00:FS01:0x22:Digital signatures verified
22:36:03:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
22:36:03:WU00:FS01:0x22:Version 0.0.5
22:36:03:WU00:FS01:0x22:  Found a checkpoint file
22:36:08:WU00:FS01:0x22:ERROR:Guru Meditation #0.1d0116116a4922b (0.52530076) '00/01/checkpointState.xml'
22:36:08:WU00:FS01:0x22:WARNING:Unexpected exit() call
22:36:08:WU00:FS01:0x22:WARNING:Unexpected exit from science code
22:36:08:WU00:FS01:0x22:Saving result file ../logfile_01.txt
22:36:08:WU00:FS01:0x22:Saving result file checkpointState.xml
22:36:08:WU00:FS01:0x22:ERROR:Guru Meditation #0.1d0116116a4922b (0.52530076) '00/01/checkpointState.xml'
22:36:08:WARNING:WU00:FS01:FahCore returned: BAD_FRAME_CHECKSUM (112 = 0x70)
22:36:08:WARNING:WU00:FS01:Fatal error, dumping
22:36:08:WU00:FS01:Sending unit results: id:00 state:SEND error:DUMPED project:14445 run:0 clone:1930 gen:2 core:0x22 unit:0x0000000303854c135ea7b890a5de189e
22:36:08:WU00:FS01:Connecting to 3.133.76.19:8080
22:38:18:WARNING:WU00:FS01:WorkServer connection failed on port 8080 trying 80
22:38:18:WU00:FS01:Connecting to 3.133.76.19:80
22:39:03:WU00:FS01:Server responded WORK_ACK (400)
22:39:03:WU00:FS01:Cleaning up
Thanks!! Mike
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: project:14445 run:0 clone:1930 gen:2 DUMPED

Post by PantherX »

From what I can tell, everything was fine until 98%. At that point, something interrupted the FahCore and it restarted. One possibility is that the FahCore was interrupted while a checkpoint was still being written, which would leave a corrupted checkpoint that the WU can't resume from. That would fit the BAD_FRAME_CHECKSUM error on '00/01/checkpointState.xml' that appears right after the restart.

22:35:58:WU00:FS01:0x22:Completed 1960000 out of 2000000 steps (98%)
22:36:02:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
22:36:02:WU00:FS01:Starting
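
If you want to pull just these events out of a long log, a small script can help. The following is a minimal sketch in Python, assuming the v7 client log layout shown in this thread. The function name `summarize` and both regular expressions are my own illustration and are not part of FAHClient.

```python
import re

# Client lines that report how the FahCore exited, for example:
#   22:36:02:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
#   22:36:08:WARNING:WU00:FS01:FahCore returned: BAD_FRAME_CHECKSUM (112 = 0x70)
RETURN_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):(?:WARNING:)?WU\d+:FS\d+:"
    r"FahCore returned: (?P<status>\w+) \((?P<dec>\d+) = (?P<hex>0x[0-9a-fA-F]+)\)"
)

# Core lines that report progress, for example:
#   22:35:58:WU00:FS01:0x22:Completed 1960000 out of 2000000 steps (98%)
PROGRESS_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):WU\d+:FS\d+:0x[0-9a-fA-F]+:"
    r"Completed (?P<done>\d+) out of (?P<total>\d+) steps"
)


def summarize(lines):
    """Return (time, event) tuples for progress updates and core exits."""
    events = []
    for line in lines:
        line = line.strip()
        m = RETURN_RE.match(line)
        if m:
            events.append((m["time"], f"core exit: {m['status']} ({m['dec']})"))
            continue
        m = PROGRESS_RE.match(line)
        if m:
            # Integer percentage, rounded down like the log itself.
            pct = 100 * int(m["done"]) // int(m["total"])
            events.append((m["time"], f"progress: {pct}%"))
    return events


sample = """\
22:35:58:WU00:FS01:0x22:Completed 1960000 out of 2000000 steps (98%)
22:36:02:WU00:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
22:36:08:WARNING:WU00:FS01:FahCore returned: BAD_FRAME_CHECKSUM (112 = 0x70)
""".splitlines()

for time, event in summarize(sample):
    print(time, event)
```

Run against the full log, this should make it easy to see whether an INTERRUPTED exit is followed within seconds by a checkpoint-related failure, which is the pattern in this WU.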

So far, you're the only donor who has returned this WU, so let's wait and see what happens when it is reassigned: https://apps.foldingathome.org/wu#proje ... 1930&gen=2