Not sure where to put this

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

Post Reply
dickster
Posts: 41
Joined: Sun Mar 01, 2009 4:33 pm

Not sure where to put this

Post by dickster »

I have a machine running Mint 17 and an AMD R9 280 GPU. Not running the CPU part out of the F@H control, just the GPU slot. It runs right up to 99.99% and freezes there. If I reboot it goes back to 43.xx% every time and runs back to 99.99%. It never finishes a work unit. Running FAH 7.4.4 control.

Log right after my last reboot.

Code: Select all

*********************** Log Started 2019-07-22T17:58:32Z ***********************
17:58:32:************************* Folding@home Client *************************
17:58:32:    Website: http://folding.stanford.edu/
17:58:32:  Copyright: (c) 2009-2014 Stanford University
17:58:32:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
17:58:32:       Args: --child --lifeline 1292 /etc/fahclient/config.xml --run-as
17:58:32:             fahclient --pid-file=/var/run/fahclient.pid --daemon
17:58:32:     Config: /etc/fahclient/config.xml
17:58:32:******************************** Build ********************************
17:58:32:    Version: 7.4.4
17:58:32:       Date: Mar 4 2014
17:58:32:       Time: 12:02:38
17:58:32:    SVN Rev: 4130
17:58:32:     Branch: fah/trunk/client
17:58:32:   Compiler: GNU 4.4.7
17:58:32:    Options: -std=gnu++98 -O3 -funroll-loops -mfpmath=sse -ffast-math
17:58:32:             -fno-unsafe-math-optimizations -msse2
17:58:32:   Platform: linux2 3.2.0-1-amd64
17:58:32:       Bits: 64
17:58:32:       Mode: Release
17:58:32:******************************* System ********************************
17:58:32:        CPU: AMD FX(tm)-4130 Quad-Core Processor
17:58:32:     CPU ID: AuthenticAMD Family 21 Model 1 Stepping 2
17:58:32:       CPUs: 4
17:58:32:     Memory: 15.66GiB
17:58:32:Free Memory: 14.73GiB
17:58:32:    Threads: POSIX_THREADS
17:58:32: OS Version: 3.13
17:58:32:Has Battery: false
17:58:32: On Battery: false
17:58:32: UTC Offset: -5
17:58:32:        PID: 1294
17:58:32:        CWD: /var/lib/fahclient
17:58:32:         OS: Linux 3.13.0-24-generic x86_64
17:58:32:    OS Arch: AMD64
17:58:32:       GPUs: 1
17:58:32:      GPU 0: ATI:5 Tahiti PRO [Radeon HD 7950]
17:58:32:       CUDA: Not detected
17:58:32:***********************************************************************
17:58:32:<config>
17:58:32:  <!-- Client Control -->
17:58:32:  <fold-anon v='true'/>
17:58:32:
17:58:32:  <!-- Folding Slot Configuration -->
17:58:32:  <gpu v='false'/>
17:58:32:
17:58:32:  <!-- Network -->
17:58:32:  <proxy v=':8080'/>
17:58:32:
17:58:32:  <!-- Slot Control -->
17:58:32:  <power v='full'/>
17:58:32:
17:58:32:  <!-- User Information -->
17:58:32:  <passkey v='********************************'/>
17:58:32:  <team v='32035'/>
17:58:32:  <user v='dickster'/>
17:58:32:
17:58:32:  <!-- Folding Slots -->
17:58:32:  <slot id='0' type='GPU'/>
17:58:32:</config>
17:58:32:Switching to user fahclient
17:58:32:Trying to access database...
17:58:32:Successfully acquired database lock
17:58:32:Enabled folding slot 00: READY gpu:0:Tahiti PRO [Radeon HD 7950]
17:58:32:WU00:FS00:Starting
17:58:32:WU00:FS00:Removing old file './work/00/logfile_01-20190720-013012.txt'
17:58:32:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21 -dir 00 -suffix 01 -version 704 -lifeline 1294 -checkpoint 15 -gpu 0 -gpu-vendor ati
17:58:32:WU00:FS00:Started FahCore on PID 1304
17:58:32:WU00:FS00:Core PID:1308
17:58:32:WU00:FS00:FahCore 0x21 started
17:58:35:WU00:FS00:0x21:*********************** Log Started 2019-07-22T17:58:34Z ***********************
17:58:35:WU00:FS00:0x21:Project: 11719 (Run 0, Clone 2035, Gen 133)
17:58:35:WU00:FS00:0x21:Unit: 0x000000ad8ca304e75bbce921111698f0
17:58:35:WU00:FS00:0x21:CPU: 0x00000000000000000000000000000000
17:58:35:WU00:FS00:0x21:Machine: 0
17:58:35:WU00:FS00:0x21:Digital signatures verified
17:58:35:WU00:FS00:0x21:Folding@home GPU Core21 Folding@home Core
17:58:35:WU00:FS00:0x21:Version 0.0.20
17:58:44:WU00:FS00:0x21:Completed 0 out of 5000000 steps (0%)
17:58:44:WU00:FS00:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Not sure where to put this

Post by bruce »

The protein is not running up to 99.9%; it is running up to 43.xx% and hanging. The code that predicts how much longer it will take to finish is clocking non-progress but it know just enough to stop at 99.9% in predicting further progress. (You cut off the part of the log which showed that. The previous log is saved and might be useful.)

Two other people have returned a partially completed Project: 11719 (Run 0, Clone 2035, Gen 133) as FAULTY so it makes sense for you to dump it.

It also would be helpful if you can look at the previous logs and find the one where the WU was assigned. Were there any error messages when it got past 40% the first time?
dickster
Posts: 41
Joined: Sun Mar 01, 2009 4:33 pm

Re: Not sure where to put this

Post by dickster »

How do I delete the work unit? Where is the folder for the GPU client. Was thinking that if I emptied the folder and rebooted, it would search for a new work unit.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Not sure where to put this

Post by bruce »

In Linux, open /var/lib/fahclient. In subdirectory work, you'll find this particular wu inside of another subdirectory called "00"
Note that the WU says it is :WU00:

You can also do it by running FAHClient with the parameter string "--dump 00"

Inasmuch as fahclient runs as a service, you'll probably need to restart.

(I'm not on Linux today, so I'm going from memory.)
Post Reply