Status alternates between "Running", "Ready" and "Finishing"

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Post Reply
arkadiyjs
Posts: 4
Joined: Thu Mar 14, 2019 1:59 pm

Status alternates between "Running", "Ready" and "Finishing"

Post by arkadiyjs »

The client seemed to have been folding quite efficiently for the first few days after installation, but a day or two ago it started to alternate between statuses "Running" and "Ready". I've already tried pausing, resuming and finishing, but instead of solving the problem, it has added "Finishing" to the set of statuses in alternation. PPD has also dramatically decreased from several thousand to around 20.

Code: Select all

*********************** Log Started 2019-03-16T21:26:04Z ***********************
21:26:04:************************* Folding@home Client *************************
21:26:04:    Website: https://foldingathome.org/
21:26:04:  Copyright: (c) 2009-2018 foldingathome.org
21:26:04:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:26:04:       Args: --child --lifeline 1797 /etc/fahclient/config.xml --run-as
21:26:04:             fahclient --pid-file=/var/run/fahclient.pid --daemon
21:26:04:     Config: /etc/fahclient/config.xml
21:26:04:******************************** Build ********************************
21:26:04:    Version: 7.5.1
21:26:04:       Date: May 11 2018
21:26:04:       Time: 19:59:04
21:26:04: Repository: Git
21:26:04:   Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
21:26:04:     Branch: master
21:26:04:   Compiler: GNU 6.3.0 20170516
21:26:04:    Options: -std=gnu++98 -O3 -funroll-loops
21:26:04:   Platform: linux2 4.14.0-3-amd64
21:26:04:       Bits: 64
21:26:04:       Mode: Release
21:26:04:******************************* System ********************************
21:26:04:        CPU: Intel(R) Core(TM) i3-6006U CPU @ 2.00GHz
21:26:04:     CPU ID: GenuineIntel Family 6 Model 78 Stepping 3
21:26:04:       CPUs: 4
21:26:04:     Memory: 7.58GiB
21:26:04:Free Memory: 6.05GiB
21:26:04:    Threads: POSIX_THREADS
21:26:04: OS Version: 4.19
21:26:04:Has Battery: true
21:26:04: On Battery: false
21:26:04: UTC Offset: 1
21:26:04:        PID: 1799
21:26:04:        CWD: /var/lib/fahclient
21:26:04:         OS: Linux 4.19.0-4-amd64 x86_64
21:26:04:    OS Arch: AMD64
21:26:04:       GPUs: 0
21:26:04:       CUDA: Not detected: Failed to open dynamic library 'libcuda.so':
21:26:04:             libcuda.so: cannot open shared object file: No such file or
21:26:04:             directory
21:26:04:     OpenCL: Not detected: Failed to open dynamic library 'libOpenCL.so':
21:26:04:             libOpenCL.so: cannot open shared object file: No such file or
21:26:04:             directory
21:26:04:***********************************************************************
21:26:04:<config>
21:26:04:  <!-- Client Control -->
21:26:04:  <fold-anon v='true'/>
21:26:04:
21:26:04:  <!-- Folding Core -->
21:26:04:  <checkpoint v='30'/>
21:26:04:
21:26:04:  <!-- Folding Slot Configuration -->
21:26:04:  <gpu v='false'/>
21:26:04:
21:26:04:  <!-- Network -->
21:26:04:  <proxy v=':8080'/>
21:26:04:
21:26:04:  <!-- Slot Control -->
21:26:04:  <pause-on-battery v='false'/>
21:26:04:  <power v='full'/>
21:26:04:
21:26:04:  <!-- User Information -->
21:26:04:  <passkey v='********************************'/>
21:26:04:  <team v='224497'/>
21:26:04:  <user v='Arkadiy_ALL_1KB8qeCYgq2418apTeMgScp21fbjrQfgxx'/>
21:26:04:
21:26:04:  <!-- Folding Slots -->
21:26:04:  <slot id='0' type='CPU'/>
21:26:04:</config>
21:26:04:Switching to user fahclient
21:26:04:Trying to access database...
21:26:04:Successfully acquired database lock
21:26:04:Enabled folding slot 00: READY cpu:4
21:26:04:WU01:FS00:Starting
21:26:04:WU01:FS00:Removing old file './work/01/logfile_01-20190316-202911.txt'
21:26:04:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 705 -lifeline 1799 -checkpoint 30 -np 4
21:26:04:WU01:FS00:Started FahCore on PID 1814
21:26:04:WU01:FS00:Core PID:1820
21:26:04:WU01:FS00:FahCore 0xa4 started
21:26:05:WU01:FS00:0xa4:
21:26:05:WU01:FS00:0xa4:*------------------------------*
21:26:05:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:26:05:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:26:05:WU01:FS00:0xa4:
21:26:05:WU01:FS00:0xa4:Preparing to commence simulation
21:26:05:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:26:14:WU01:FS00:0xa4:- Looking at optimizations...
21:26:14:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:26:14:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:26:14:WU01:FS00:0xa4:- Expanded 825893 -> 1403472 (decompressed 169.9 percent)
21:26:14:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825893 data_size=1403472, decompressed_data_size=1403472 diff=0
21:26:14:WU01:FS00:0xa4:- Digital signature verified
21:26:14:WU01:FS00:0xa4:
21:26:14:WU01:FS00:0xa4:Project: 9037 (Run 702, Clone 2, Gen 2301)
21:26:14:WU01:FS00:0xa4:
21:26:14:WU01:FS00:0xa4:Entering M.D.
21:26:20:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:26:20:WU01:FS00:Starting
21:26:20:WU01:FS00:Removing old file './work/01/logfile_01-20190316-203011.txt'
21:26:20:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 705 -lifeline 1799 -checkpoint 30 -np 4
21:26:20:WU01:FS00:Started FahCore on PID 2619
21:26:20:WU01:FS00:Core PID:2623
21:26:20:WU01:FS00:FahCore 0xa4 started
21:26:21:WU01:FS00:0xa4:
21:26:21:WU01:FS00:0xa4:*------------------------------*
21:26:21:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:26:21:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:26:21:WU01:FS00:0xa4:
21:26:21:WU01:FS00:0xa4:Preparing to commence simulation
21:26:21:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:26:30:WU01:FS00:0xa4:- Looking at optimizations...
21:26:30:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:26:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:26:30:WU01:FS00:0xa4:- Expanded 825893 -> 1403472 (decompressed 169.9 percent)
21:26:30:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825893 data_size=1403472, decompressed_data_size=1403472 diff=0
21:26:30:WU01:FS00:0xa4:- Digital signature verified
21:26:30:WU01:FS00:0xa4:
21:26:30:WU01:FS00:0xa4:Project: 9037 (Run 702, Clone 2, Gen 2301)
21:26:30:WU01:FS00:0xa4:
21:26:30:WU01:FS00:0xa4:Entering M.D.
21:26:37:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:27:20:WU01:FS00:Starting
21:27:20:WU01:FS00:Removing old file './work/01/logfile_01-20190316-203111.txt'
21:27:20:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 705 -lifeline 1799 -checkpoint 30 -np 4
21:27:20:WU01:FS00:Started FahCore on PID 3738
21:27:20:WU01:FS00:Core PID:3742
21:27:20:WU01:FS00:FahCore 0xa4 started
21:27:21:WU01:FS00:0xa4:
21:27:21:WU01:FS00:0xa4:*------------------------------*
21:27:21:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:27:21:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:27:21:WU01:FS00:0xa4:
21:27:21:WU01:FS00:0xa4:Preparing to commence simulation
21:27:21:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:27:30:WU01:FS00:0xa4:- Looking at optimizations...
21:27:30:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:27:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:27:30:WU01:FS00:0xa4:- Expanded 825893 -> 1403472 (decompressed 169.9 percent)
21:27:30:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825893 data_size=1403472, decompressed_data_size=1403472 diff=0
21:27:30:WU01:FS00:0xa4:- Digital signature verified
21:27:30:WU01:FS00:0xa4:
21:27:30:WU01:FS00:0xa4:Project: 9037 (Run 702, Clone 2, Gen 2301)
21:27:30:WU01:FS00:0xa4:
21:27:30:WU01:FS00:0xa4:Entering M.D.
21:27:37:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:28:21:WU01:FS00:Starting
21:28:21:WU01:FS00:Removing old file './work/01/logfile_01-20190316-203211.txt'
21:28:21:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 705 -lifeline 1799 -checkpoint 30 -np 4
21:28:21:WU01:FS00:Started FahCore on PID 3840
21:28:21:WU01:FS00:Core PID:3844
21:28:21:WU01:FS00:FahCore 0xa4 started
21:28:21:WU01:FS00:0xa4:
21:28:21:WU01:FS00:0xa4:*------------------------------*
21:28:21:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:28:21:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:28:21:WU01:FS00:0xa4:
21:28:21:WU01:FS00:0xa4:Preparing to commence simulation
21:28:21:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:28:30:WU01:FS00:0xa4:- Looking at optimizations...
21:28:30:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:28:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:28:30:WU01:FS00:0xa4:- Expanded 825893 -> 1403472 (decompressed 169.9 percent)
21:28:30:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825893 data_size=1403472, decompressed_data_size=1403472 diff=0
21:28:30:WU01:FS00:0xa4:- Digital signature verified
21:28:30:WU01:FS00:0xa4:
21:28:30:WU01:FS00:0xa4:Project: 9037 (Run 702, Clone 2, Gen 2301)
21:28:30:WU01:FS00:0xa4:
21:28:30:WU01:FS00:0xa4:Entering M.D.
21:28:37:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:29:21:WU01:FS00:Starting
21:29:21:WU01:FS00:Removing old file './work/01/logfile_01-20190316-203311.txt'
21:29:21:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 705 -lifeline 1799 -checkpoint 30 -np 4
21:29:21:WU01:FS00:Started FahCore on PID 4194
21:29:21:WU01:FS00:Core PID:4198
21:29:21:WU01:FS00:FahCore 0xa4 started
21:29:21:WU01:FS00:0xa4:
21:29:21:WU01:FS00:0xa4:*------------------------------*
21:29:21:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:29:21:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:29:21:WU01:FS00:0xa4:
21:29:21:WU01:FS00:0xa4:Preparing to commence simulation
21:29:21:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:29:30:WU01:FS00:0xa4:- Looking at optimizations...
21:29:30:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:29:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:29:30:WU01:FS00:0xa4:- Expanded 825893 -> 1403472 (decompressed 169.9 percent)
21:29:30:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825893 data_size=1403472, decompressed_data_size=1403472 diff=0
21:29:30:WU01:FS00:0xa4:- Digital signature verified
21:29:30:WU01:FS00:0xa4:
21:29:30:WU01:FS00:0xa4:Project: 9037 (Run 702, Clone 2, Gen 2301)
21:29:30:WU01:FS00:0xa4:
21:29:30:WU01:FS00:0xa4:Entering M.D.
21:29:37:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:30:21:WU01:FS00:Starting
21:30:21:WU01:FS00:Removing old file './work/01/logfile_01-20190316-203411.txt'
21:30:21:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 01 -suffix 01 -version 705 -lifeline 1799 -checkpoint 30 -np 4
21:30:21:WU01:FS00:Started FahCore on PID 4231
21:30:21:WU01:FS00:Core PID:4235
21:30:21:WU01:FS00:FahCore 0xa4 started
21:30:21:WU01:FS00:0xa4:
21:30:21:WU01:FS00:0xa4:*------------------------------*
21:30:21:WU01:FS00:0xa4:Folding@Home Gromacs GB Core
21:30:21:WU01:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:30:21:WU01:FS00:0xa4:
21:30:21:WU01:FS00:0xa4:Preparing to commence simulation
21:30:21:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:30:30:WU01:FS00:0xa4:- Looking at optimizations...
21:30:30:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:30:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:30:30:WU01:FS00:0xa4:- Expanded 825893 -> 1403472 (decompressed 169.9 percent)
21:30:30:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=825893 data_size=1403472, decompressed_data_size=1403472 diff=0
21:30:30:WU01:FS00:0xa4:- Digital signature verified
21:30:30:WU01:FS00:0xa4:
21:30:30:WU01:FS00:0xa4:Project: 9037 (Run 702, Clone 2, Gen 2301)
21:30:30:WU01:FS00:0xa4:
21:30:30:WU01:FS00:0xa4:Entering M.D.
21:30:37:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
Thanks in advance!
Last edited by arkadiyjs on Sat Mar 16, 2019 10:26 pm, edited 4 times in total.
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Status alternates between "Running", "Ready" and "Finish

Post by Joe_H »

Welcome to the folding support forum.

You do not need to post complete logs here, just post the beginning section that shows your hardware setup, client version, and the folding configuration, along with a section showing the problem.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Status alternates between "Running", "Ready" and "Finish

Post by bruce »

Generally speaking, you need to configure your power savings features so that your CPU runs continuously. Your log indicates that your CPU is probably sleeping/hibernating when your mouse/keyboard are inactive. FAH works best if it runs continuously.
arkadiyjs
Posts: 4
Joined: Thu Mar 14, 2019 1:59 pm

Re: Status alternates between "Running", "Ready" and "Finish

Post by arkadiyjs »

Joe_H wrote:Welcome to the folding support forum.

You do not need to post complete logs here, just post the beginning section that shows your hardware setup, client version, and the folding configuration, along with a section showing the problem.
Fixed!
arkadiyjs
Posts: 4
Joined: Thu Mar 14, 2019 1:59 pm

Re: Status alternates between "Running", "Ready" and "Finish

Post by arkadiyjs »

bruce wrote:Generally speaking, you need to configure your power savings features so that your CPU runs continuously. Your log indicates that your CPU is probably sleeping/hibernating when your mouse/keyboard are inactive. FAH works best if it runs continuously.
I checked my power settings and everything seems to be alright.
Image
Joe_H
Site Admin
Posts: 7870
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Status alternates between "Running", "Ready" and "Finish

Post by Joe_H »

Is that image from when your laptop is attached to a power adapter, or jus on battery? The default F@h settings will always suspend folding when on battery.

As for your log, you cut if off before the section that show the folding configuration when you first added it to your post. The second time does show additional information needed to figure out what is going on. It appears you have a bad WU that will not start folding because the error keeps recurring right after the "Entering M.D.' phase of startup.

There is a bug in the A4 core where it is supposed to handle multiple failures. It should have dumped the WU with an error report after trying to start it several times, those are the "21:29:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core." messages in the log. You can safely dump the WU and see about working on another.

To dump it, pause folding. Then use FAHControl to remove the CPU folding slot. After starting folding the client should dump the WU with a message to the effect that no suitable slot was available for processing it. Then pause again, and recreate the CPU folding slot.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
arkadiyjs
Posts: 4
Joined: Thu Mar 14, 2019 1:59 pm

Re: Status alternates between "Running", "Ready" and "Finish

Post by arkadiyjs »

Joe_H wrote:Is that image from when your laptop is attached to a power adapter, or jus on battery? The default F@h settings will always suspend folding when on battery.

As for your log, you cut if off before the section that show the folding configuration when you first added it to your post. The second time does show additional information needed to figure out what is going on. It appears you have a bad WU that will not start folding because the error keeps recurring right after the "Entering M.D.' phase of startup.

There is a bug in the A4 core where it is supposed to handle multiple failures. It should have dumped the WU with an error report after trying to start it several times, those are the "21:29:30:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core." messages in the log. You can safely dump the WU and see about working on another.

To dump it, pause folding. Then use FAHControl to remove the CPU folding slot. After starting folding the client should dump the WU with a message to the effect that no suitable slot was available for processing it. Then pause again, and recreate the CPU folding slot.
This worked. Thank you very much!
Post Reply