Project: 16435 (Run 478, Clone 0, Gen 12), Guru Meditation

Moderators: Site Moderators, FAHC Science Team

Nuitari
Posts: 80
Joined: Sun Jun 09, 2019 4:03 am
Hardware configuration: 1x Nvidia 1050ti
1x Nvidia 1660Super
1x Nvidia GTX 660
1x Nvidia 1060 3gb
1x AMD rx570
2x AMD rx560
1x AMD Ryzen 7 PRO 1700
1x AMD Ryzen 7 3700X
1x AMD Phenom II
1x AMD A8-9600
1x Intel i5-4590S

Post by Nuitari »

17:41:02:WU02:FS03:0x22:*********************** Log Started 2020-05-04T17:41:02Z ***********************
17:41:02:WU02:FS03:0x22:*************************** Core22 Folding@home Core ***************************
17:41:02:WU02:FS03:0x22:       Type: 0x22
17:41:02:WU02:FS03:0x22:       Core: Core22
17:41:02:WU02:FS03:0x22:    Website: https://foldingathome.org/
17:41:02:WU02:FS03:0x22:  Copyright: (c) 2009-2018 foldingathome.org
17:41:02:WU02:FS03:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
17:41:02:WU02:FS03:0x22:             <rafal.wiewiora@choderalab.org>
17:41:02:WU02:FS03:0x22:       Args: -dir 02 -suffix 01 -version 706 -lifeline 19226 -checkpoint 15
17:41:02:WU02:FS03:0x22:             -gpu-vendor amd -opencl-platform 0 -opencl-device 3 -gpu 3
17:41:02:WU02:FS03:0x22:     Config: <none>
17:41:02:WU02:FS03:0x22:************************************ Build *************************************
17:41:02:WU02:FS03:0x22:    Version: 0.0.5
17:41:02:WU02:FS03:0x22:       Date: Apr 22 2020
17:41:02:WU02:FS03:0x22:       Time: 03:57:11
17:41:02:WU02:FS03:0x22: Repository: Git
17:41:02:WU02:FS03:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
17:41:02:WU02:FS03:0x22:     Branch: HEAD
17:41:02:WU02:FS03:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
17:41:02:WU02:FS03:0x22:    Options: -std=c++11 -O3 -funroll-loops
17:41:02:WU02:FS03:0x22:   Platform: linux2 4.19.76-linuxkit
17:41:02:WU02:FS03:0x22:       Bits: 64
17:41:02:WU02:FS03:0x22:       Mode: Release
17:41:02:WU02:FS03:0x22:************************************ System ************************************
17:41:02:WU02:FS03:0x22:        CPU: AMD Phenom(tm) II X4 925 Processor
17:41:02:WU02:FS03:0x22:     CPU ID: AuthenticAMD Family 16 Model 4 Stepping 2
17:41:02:WU02:FS03:0x22:       CPUs: 4
17:41:02:WU02:FS03:0x22:     Memory: 23.45GiB
17:41:02:WU02:FS03:0x22:Free Memory: 20.06GiB
17:41:02:WU02:FS03:0x22:    Threads: POSIX_THREADS
17:41:02:WU02:FS03:0x22: OS Version: 5.6
17:41:02:WU02:FS03:0x22:Has Battery: false
17:41:02:WU02:FS03:0x22: On Battery: false
17:41:02:WU02:FS03:0x22: UTC Offset: -4
17:41:02:WU02:FS03:0x22:        PID: 19230
17:41:02:WU02:FS03:0x22:        CWD: /root/fahclient_aurora/work
17:41:02:WU02:FS03:0x22:         OS: Linux 5.6.7-050607-generic x86_64
17:41:02:WU02:FS03:0x22:    OS Arch: AMD64
17:41:02:WU02:FS03:0x22:********************************************************************************
17:41:02:WU02:FS03:0x22:Project: 16435 (Run 478, Clone 0, Gen 12)
17:41:02:WU02:FS03:0x22:Unit: 0x0000001503854c135e9a4efb8181bda4
17:41:02:WU02:FS03:0x22:Reading tar file core.xml
17:41:02:WU02:FS03:0x22:Reading tar file integrator.xml
17:41:02:WU02:FS03:0x22:Reading tar file state.xml
17:41:02:WU02:FS03:0x22:Reading tar file system.xml
17:41:02:WU02:FS03:0x22:Digital signatures verified
17:41:02:WU02:FS03:0x22:Folding@home GPU Core22 Folding@home Core
17:41:02:WU02:FS03:0x22:Version 0.0.5
...
05:21:03:WU02:FS03:0x22:Completed 4450000 out of 5000000 steps (89%)
...
05:27:56:WU02:FS03:0x22:ERROR:Guru Meditation #6090dbd12a409924.205459311311919f (104734720.104733300) '02/01/positions.xtc'
05:27:56:WU02:FS03:0x22:WARNING:Unexpected exit() call
05:27:56:WU02:FS03:0x22:WARNING:Unexpected exit from science code
05:27:56:WU02:FS03:0x22:Saving result file ../logfile_01.txt
05:27:56:WU02:FS03:0x22:Saving result file checkpointState.xml
05:27:56:WU02:FS03:0x22:Saving result file checkpt.crc
05:27:56:WU02:FS03:0x22:Saving result file positions.xtc
05:27:56:WARNING:WU02:FS03:FahCore returned: BAD_FRAME_CHECKSUM (112 = 0x70)
05:27:56:WARNING:WU02:FS03:Fatal error, dumping
05:27:57:WU02:FS03:Sending unit results: id:02 state:SEND error:DUMPED project:16435 run:478 clone:0 gen:12 core:0x22 unit:0x0000001503854c135e9a4efb8181bda4
05:27:57:WU02:FS03:Uploading 9.69MiB to 3.133.76.19
05:27:57:WU02:FS03:Connecting to 3.133.76.19:8080
05:29:12:WU02:FS03:Upload 3.23%
05:29:18:WU02:FS03:Upload 69.68%
05:29:20:WU02:FS03:Upload complete
05:29:20:WU02:FS03:Server responded WORK_QUIT (404)
05:29:20:WARNING:WU02:FS03:Server did not like results, dumping
Not sure what happened there. No overclocking or anything on that card.
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Project: 16435 (Run 478, Clone 0, Gen 12), Guru Meditati

Post by Joe_H »

05:27:56:WU02:FS03:0x22:ERROR:Guru Meditation #6090dbd12a409924.205459311311919f (104734720.104733300) '02/01/positions.xtc'
This appears in the log at the point the folding core is in the process of opening the previous checkpoint. It indicates something either corrupted one of the files needed, or prevented it from being opened. The error message is being passed through from the OS. A search in the console logs for the time in question may provide more information. Also look to see if you can find a translation for the error code given.
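As a rough illustration of searching a log around the time in question, here is a minimal Python sketch. It assumes each line starts with an `HH:MM:SS:` stamp like the FAHClient log quoted above; the function name `lines_near` and the 120-second window are made up for this example, and system logs in other formats would need a different pattern.

```python
import re
from datetime import datetime, timedelta

# Matches ANSI colour codes such as the ones wrapped around WARNING lines.
ANSI = re.compile(r"\x1b\[[0-9;]*m")
# Matches the leading HH:MM:SS stamp used in the log excerpt above.
STAMP = re.compile(r"^(\d{2}:\d{2}:\d{2}):")


def lines_near(lines, when, window_seconds=120):
    """Yield log lines stamped within window_seconds of `when` (HH:MM:SS).

    Hypothetical helper: it ignores midnight wrap-around and skips any
    line that does not begin with a time stamp.
    """
    target = datetime.strptime(when, "%H:%M:%S")
    window = timedelta(seconds=window_seconds)
    for raw in lines:
        line = ANSI.sub("", raw.rstrip("\n"))
        match = STAMP.match(line)
        if not match:
            continue
        stamp = datetime.strptime(match.group(1), "%H:%M:%S")
        if abs(stamp - target) <= window:
            yield line


if __name__ == "__main__":
    sample = [
        "05:21:03:WU02:FS03:0x22:Completed 4450000 out of 5000000 steps (89%)",
        "\x1b[93m05:27:56:WARNING:WU02:FS03:Fatal error, dumping\x1b[0m",
        "05:29:20:WU02:FS03:Upload complete",
    ]
    for hit in lines_near(sample, "05:27:56"):
        print(hit)
```

Passing an open file object instead of the sample list works the same way, since the function only iterates over lines.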

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Nuitari
Posts: 80
Joined: Sun Jun 09, 2019 4:03 am
Hardware configuration: 1x Nvidia 1050ti
1x Nvidia 1660Super
1x Nvidia GTX 660
1x Nvidia 1060 3gb
1x AMD rx570
2x AMD rx560
1x AMD Ryzen 7 PRO 1700
1x AMD Ryzen 7 3700X
1x AMD Phenom II
1x AMD A8-9600
1x Intel i5-4590S

Re: Project: 16435 (Run 478, Clone 0, Gen 12), Guru Meditati

Post by Nuitari »

I eventually found the issue later in the logs, when a series of WUs failed close to one another: I had run out of disk space.

It's a repurposed mining rig that doesn't have a hard drive. I chucked in an old USB key for the work folder, but the 2 GB filled up. I've moved it to pure NFS like the other rig. Not ideal, because that means I have to keep the NFS server up at all times.

This is probably related to file descriptors not being properly managed when the client forks the process to launch a WU.
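For anyone hitting the same disk-space problem, a quick way to check the free space on the work folder before blaming the hardware is sketched below in Python. This is only an illustration: the helper names and the 500 MiB threshold are arbitrary assumptions, not anything the client itself uses, and the path should be whatever your own work directory is.

```python
import shutil


def free_mib(path):
    """Return the free space, in MiB, on the filesystem that holds `path`."""
    return shutil.disk_usage(path).free / (1024 * 1024)


def has_room(path, needed_mib=500):
    """True if at least `needed_mib` MiB are free (threshold is an assumption)."""
    return free_mib(path) >= needed_mib


if __name__ == "__main__":
    work_dir = "."  # replace with your own work folder
    print(f"{free_mib(work_dir):.0f} MiB free, enough room: {has_room(work_dir)}")
```

Running something like this from cron on a small work volume would have flagged the 2 GB key filling up before several WUs were dumped in a row.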