SOLVED: Only 1 CPU producing

Crunchtimer
Posts: 50
Joined: Tue May 05, 2020 5:34 am

SOLVED: Only 1 CPU producing

Post by Crunchtimer »

Hi all!

I just set up a second g4dn.xlarge AWS Linux server, identical to the one I already have running (4 CPUs and 1 GPU).
The problem is that FahCore_a7 uses only 100% CPU on the new server, whilst on the old server it uses about 300%.
The GPU is crunching data, but the new server seems slower, which I believe is due to some misconfiguration.
I have (I think) the same FAHClient config and the same hardware config, so what could be wrong?

Please advise.
Top on new server:

Code:

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND
 3042 fahclie+  39  19  342660 165540  12536 R 100.0  1.0  38:46.94 FahCore_a7
 3161 fahclie+  39  19 6401256   1.0g 144688 R  99.3  6.5  33:20.75 FahCore_22

Top on old server:

Code:

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND
 3033 fahclie+  39  19  351976  59208  12872 R 299.0  0.4 276:53.57 FahCore_a7
 3219 fahclie+  39  19 5764924 461424 137888 R  99.0  2.9  75:24.01 FahCore_22

The log on the new server shows what happened:

Code:

ESC[93m19:25:37:WARNING:WU00:FS00:Changed SMP threads from 4 to 3 this can cause some work units to failESC[0m
ESC[93m19:25:37:WARNING:WU00:FS00:AS lowered CPUs from 3 to 1ESC[0m

Why does this happen, and how can I correct it?
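
For anyone hunting for the same messages: a small sketch that pulls only the warning and error lines out of the client log and strips the colour codes (the ESC[93m and ESC[0m seen in the excerpt). It assumes the log file is /var/lib/fahclient/log.txt, which is a guess based on the working directory the client reports, and `fah_warnings` is a made-up helper name, not a FAHClient command.

```shell
#!/bin/sh
# Sketch only: list WARNING/ERROR lines from a FAHClient log without colour codes.
# Assumption: the log is /var/lib/fahclient/log.txt (adjust to your install).
# fah_warnings is a hypothetical helper name, not part of FAHClient.

fah_warnings() {
    # The raw escape byte that the forum paste renders as "ESC".
    esc=$(printf '\033')
    # First drop sequences such as ESC[93m and ESC[0m,
    # then keep only the warning and error lines.
    sed "s/${esc}\[[0-9;]*m//g" "$1" | grep -E ':(WARNING|ERROR):'
}

# Example: fah_warnings /var/lib/fahclient/log.txt
```

The function exits non-zero when the log contains no warnings or errors, because that is what grep returns when nothing matches.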

EDIT:
I "fixed" it by amending the config file like this:

Code:

  <!-- Folding Slots -->
  <slot id='0' type='CPU'/>
  <slot id='1' type='CPU'/>
  <slot id='2' type='CPU'/>
  <slot id='3' type='GPU'/>

This is not optimal, as I now get 3 FahCore_a7 processes at 100% each. The warning is now:

Code:

WARNING:WU00:FS00:Changed SMP threads from 3 to 1 this can cause some work units to fail

and the log:

Code:

20:23:22:****************************** FAHClient ******************************
20:23:22:        Version: 7.6.13
20:23:22:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:23:22:      Copyright: 2020 foldingathome.org
20:23:22:       Homepage: https://foldingathome.org/
20:23:22:           Date: Apr 28 2020
20:23:22:           Time: 04:20:27
20:23:22:       Revision: 5a652817f46116b6e135503af97f18e094414e3b
20:23:22:         Branch: master
20:23:22:       Compiler: GNU 4.9.4
20:23:22:        Options: -std=c++11 -ffunction-sections -fdata-sections -O3
20:23:22:                 -funroll-loops
20:23:22:       Platform: linux2 4.19.0-5-amd64
20:23:22:           Bits: 64
20:23:22:           Mode: Release
20:23:22:           Args: --child /etc/fahclient/config.xml --run-as fahclient
20:23:22:                 --pid-file=/var/run/fahclient.pid --daemon
20:23:22:         Config: /etc/fahclient/config.xml
20:23:22:******************************** CBang ********************************
20:23:22:           Date: Apr 25 2020
20:23:22:           Time: 00:07:55
20:23:22:       Revision: ea081a3b3b0f4a37c4d0440b4f1bc184197c7797
20:23:22:         Branch: master
20:23:22:       Compiler: GNU 4.9.4
20:23:22:        Options: -std=c++11 -ffunction-sections -fdata-sections -O3
20:23:22:                 -funroll-loops -fPIC
20:23:22:       Platform: linux2 4.19.0-5-amd64
20:23:22:           Bits: 64
20:23:22:           Mode: Release
20:23:22:******************************* System ********************************
20:23:22:            CPU: Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
20:23:22:         CPU ID: GenuineIntel Family 6 Model 85 Stepping 7
20:23:22:           CPUs: 4
20:23:22:         Memory: 15.38GiB
20:23:22:    Free Memory: 14.85GiB
20:23:22:        Threads: POSIX_THREADS
20:23:22:     OS Version: 4.14
20:23:22:    Has Battery: false
20:23:22:     On Battery: false
20:23:22:     UTC Offset: 0
20:23:22:            PID: 3229
20:23:22:            CWD: /var/lib/fahclient
20:23:22:             OS: Linux 4.14.173-137.229.amzn2.x86_64 x86_64
20:23:22:        OS Arch: AMD64
20:23:22:           GPUs: 1
20:23:22:          GPU 0: Bus:0 Slot:30 Func:0 NVIDIA:7 TU104GL [Tesla T4]
20:23:22:  CUDA Device 0: Platform:0 Device:0 Bus:0 Slot:30 Compute:7.5 Driver:10.2
20:23:22:OpenCL Device 0: Platform:0 Device:0 Bus:0 Slot:30 Compute:1.2 Driver:440.64
20:23:22:******************************* libFAH ********************************
20:23:22:           Date: Apr 15 2020
20:23:22:           Time: 21:43:27
20:23:22:       Revision: 216968bc7025029c841ed6e36e81a03a316890d3
20:23:22:         Branch: master
20:23:22:       Compiler: GNU 4.9.4
20:23:22:        Options: -std=c++11 -ffunction-sections -fdata-sections -O3
20:23:22:                 -funroll-loops
20:23:22:       Platform: linux2 4.19.0-5-amd64
20:23:22:           Bits: 64
20:23:22:           Mode: Release
20:23:22:***********************************************************************
20:23:22:<config>
20:23:22:  <!-- Folding Slot Configuration -->
20:23:22:  <cause v='COVID_19'/>
20:23:22:
20:23:22:  <!-- Slot Control -->
20:23:22:  <power v='full'/>
20:23:22:
20:23:22:  <!-- User Information -->
20:23:22:  <passkey v='*****'/>
20:23:22:  <team v='******'/>
20:23:22:  <user v='********'/>
20:23:22:
20:23:22:  <!-- Folding Slots -->
20:23:22:  <slot id='0' type='CPU'/>
20:23:22:  <slot id='1' type='CPU'/>
20:23:22:  <slot id='2' type='CPU'/>
20:23:22:  <slot id='3' type='GPU'/>
20:23:22:</config>
20:23:22:WU00:FS00:Starting
20:23:22:WARNING:WU00:FS00:Changed SMP threads from 3 to 1 this can cause some work units to fail
20:23:22:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 00 -suffix 01 -version 706 -lifeline 3229 -checkpoint 15 -np 1
20:23:22:WU00:FS00:Started FahCore on PID 3258
20:23:22:WU00:FS00:Core PID:3262
20:23:22:WU00:FS00:FahCore 0xa7 started
20:23:23:WU01:FS01:Connecting to assign1.foldingathome.org:80
20:23:23:WU02:FS02:Connecting to assign1.foldingathome.org:80
20:23:23:WU03:FS03:Connecting to assign1.foldingathome.org:80
20:23:23:WU00:FS00:0xa7:*********************** Log Started 2020-05-06T20:23:22Z ***********************
20:23:23:WU00:FS00:0xa7:************************** Gromacs Folding@home Core ***************************
20:23:23:WU00:FS00:0xa7:       Type: 0xa7
20:23:23:WU00:FS00:0xa7:       Core: Gromacs
20:23:23:WU00:FS00:0xa7:       Args: -dir 00 -suffix 01 -version 706 -lifeline 3258 -checkpoint 15 -np 1
20:23:23:WU00:FS00:0xa7:************************************ CBang *************************************
20:23:23:WU00:FS00:0xa7:       Date: Nov 5 2019
20:23:23:WU00:FS00:0xa7:       Time: 06:06:57
20:23:23:WU00:FS00:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
20:23:23:WU00:FS00:0xa7:     Branch: master
20:23:23:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
20:23:23:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
20:23:23:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
20:23:23:WU00:FS00:0xa7:       Bits: 64
20:23:23:WU00:FS00:0xa7:       Mode: Release
20:23:23:WU00:FS00:0xa7:************************************ System ************************************
20:23:23:WU00:FS00:0xa7:        CPU: Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
20:23:23:WU00:FS00:0xa7:     CPU ID: GenuineIntel Family 6 Model 85 Stepping 7
20:23:23:WU00:FS00:0xa7:       CPUs: 4
20:23:23:WU00:FS00:0xa7:     Memory: 15.38GiB
20:23:23:WU00:FS00:0xa7:Free Memory: 14.82GiB
20:23:23:WU00:FS00:0xa7:    Threads: POSIX_THREADS
20:23:23:WU00:FS00:0xa7: OS Version: 4.14
20:23:23:WU00:FS00:0xa7:Has Battery: false
20:23:23:WU00:FS00:0xa7: On Battery: false
20:23:23:WU00:FS00:0xa7: UTC Offset: 0
20:23:23:WU00:FS00:0xa7:        PID: 3262
20:23:23:WU00:FS00:0xa7:        CWD: /var/lib/fahclient/work
20:23:23:WU00:FS00:0xa7:******************************** Build - libFAH ********************************
20:23:23:WU00:FS00:0xa7:    Version: 0.0.18
20:23:23:WU00:FS00:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:23:23:WU00:FS00:0xa7:  Copyright: 2019 foldingathome.org
20:23:23:WU00:FS00:0xa7:   Homepage: https://foldingathome.org/
20:23:23:WU00:FS00:0xa7:       Date: Nov 5 2019
20:23:23:WU00:FS00:0xa7:       Time: 06:13:26
20:23:23:WU00:FS00:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
20:23:23:WU00:FS00:0xa7:     Branch: master
20:23:23:WU00:FS00:0xa7:   Compiler: GNU 8.3.0
20:23:23:WU00:FS00:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
20:23:23:WU00:FS00:0xa7:   Platform: linux2 4.19.0-5-amd64
20:23:23:WU00:FS00:0xa7:       Bits: 64
20:23:23:WU00:FS00:0xa7:       Mode: Release
20:23:23:WU00:FS00:0xa7:************************************ Build *************************************
20:23:23:WU00:FS00:0xa7:       SIMD: avx_256
20:23:23:WU00:FS00:0xa7:********************************************************************************
20:23:23:WU00:FS00:0xa7:Project: 16412 (Run 0, Clone 203, Gen 61)
20:23:23:WU00:FS00:0xa7:Unit: 0x00000044a8f5c67d5e82316022a7f369
20:23:23:WU00:FS00:0xa7:Digital signatures verified
20:23:23:WU00:FS00:0xa7:Calling: mdrun -s frame61.tpr -o frame61.trr -x frame61.xtc -cpi state.cpt -cpt 15 -nt 1
20:23:23:WU00:FS00:0xa7:Steps: first=15250000 total=250000
20:23:24:WU01:FS01:Assigned to work server 40.114.52.201
20:23:24:WU01:FS01:Requesting new work unit for slot 01: READY cpu:1 from 40.114.52.201
20:23:24:WU01:FS01:Connecting to 40.114.52.201:8080
20:23:24:WARNING:WU03:FS03:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
20:23:24:WU03:FS03:Connecting to assign2.foldingathome.org:80
20:23:24:WU02:FS02:Assigned to work server 155.247.166.220
20:23:24:WU02:FS02:Requesting new work unit for slot 02: READY cpu:1 from 155.247.166.220
20:23:24:WU02:FS02:Connecting to 155.247.166.220:8080
20:23:24:WARNING:WU03:FS03:Failed to get assignment from 'assign2.foldingathome.org:80': No WUs available for this configuration
20:23:24:WU03:FS03:Connecting to assign3.foldingathome.org:80
20:23:24:WU02:FS02:Downloading 2.85MiB
20:23:24:WU01:FS01:Downloading 2.14MiB
20:23:24:WU00:FS00:0xa7:Completed 12342 out of 250000 steps (4%)
20:23:25:WU03:FS03:Assigned to work server 13.82.98.119
20:23:25:WU03:FS03:Requesting new work unit for slot 03: READY gpu:0:TU104GL [Tesla T4] from 13.82.98.119
20:23:25:WU03:FS03:Connecting to 13.82.98.119:8080
20:23:25:WU01:FS01:Download complete
20:23:25:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:13850 run:0 clone:20489 gen:77 core:0xa7 unit:0x0000005d287234c95e78906db8c0bf99
20:23:25:WU01:FS01:Starting
20:23:25:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 01 -suffix 01 -version 706 -lifeline 3229 -checkpoint 15 -np 1
20:23:25:WU01:FS01:Started FahCore on PID 3266
20:23:25:WU01:FS01:Core PID:3270
20:23:25:WU01:FS01:FahCore 0xa7 started
20:23:26:WARNING:WU01:FS01:FahCore returned: BAD_FRAME_CHECKSUM (112 = 0x70)
20:23:26:WARNING:WU01:FS01:Fatal error, dumping
20:23:26:WU01:FS01:Sending unit results: id:01 state:SEND error:DUMPED project:13850 run:0 clone:20489 gen:77 core:0xa7 unit:0x0000005d287234c95e78906db8c0bf99
20:23:26:WU01:FS01:Connecting to 40.114.52.201:8080
20:23:26:WU04:FS01:Connecting to assign1.foldingathome.org:80
20:23:26:WU01:FS01:Server responded WORK_ACK (400)
20:23:26:WU01:FS01:Cleaning up
20:23:27:WU04:FS01:Assigned to work server 128.252.203.9
20:23:27:WU04:FS01:Requesting new work unit for slot 01: READY cpu:1 from 128.252.203.9
20:23:27:WU04:FS01:Connecting to 128.252.203.9:8080
20:23:27:WU02:FS02:Download complete
20:23:27:ERROR:WU04:FS01:Exception: Server did not assign work unit
20:23:27:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:14627 run:805 clone:0 gen:74 core:0xa7 unit:0x000000500002894c5e8789fc1644f2d4
20:23:27:WU02:FS02:Starting
20:23:27:WU02:FS02:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 02 -suffix 01 -version 706 -lifeline 3229 -checkpoint 15 -np 1
20:23:27:WU02:FS02:Started FahCore on PID 3274
20:23:27:WU02:FS02:Core PID:3278
20:23:27:WU02:FS02:FahCore 0xa7 started
20:23:27:WU04:FS01:Connecting to assign1.foldingathome.org:80
20:23:28:WU02:FS02:0xa7:*********************** Log Started 2020-05-06T20:23:27Z ***********************
20:23:28:WU02:FS02:0xa7:************************** Gromacs Folding@home Core ***************************
20:23:28:WU02:FS02:0xa7:       Type: 0xa7
20:23:28:WU02:FS02:0xa7:       Core: Gromacs
20:23:28:WU02:FS02:0xa7:       Args: -dir 02 -suffix 01 -version 706 -lifeline 3274 -checkpoint 15 -np 1
20:23:28:WU02:FS02:0xa7:************************************ CBang *************************************
20:23:28:WU02:FS02:0xa7:       Date: Nov 5 2019
20:23:28:WU02:FS02:0xa7:       Time: 06:06:57
20:23:28:WU02:FS02:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
20:23:28:WU02:FS02:0xa7:     Branch: master
20:23:28:WU02:FS02:0xa7:   Compiler: GNU 8.3.0
20:23:28:WU02:FS02:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
20:23:28:WU02:FS02:0xa7:   Platform: linux2 4.19.0-5-amd64
20:23:28:WU02:FS02:0xa7:       Bits: 64
20:23:28:WU02:FS02:0xa7:       Mode: Release
20:23:28:WU02:FS02:0xa7:************************************ System ************************************
20:23:28:WU02:FS02:0xa7:        CPU: Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
20:23:28:WU02:FS02:0xa7:     CPU ID: GenuineIntel Family 6 Model 85 Stepping 7
20:23:28:WU02:FS02:0xa7:       CPUs: 4
20:23:28:WU02:FS02:0xa7:     Memory: 15.38GiB
20:23:28:WU02:FS02:0xa7:Free Memory: 14.76GiB
20:23:28:WU02:FS02:0xa7:    Threads: POSIX_THREADS
20:23:28:WU02:FS02:0xa7: OS Version: 4.14
20:23:28:WU02:FS02:0xa7:Has Battery: false
20:23:28:WU02:FS02:0xa7: On Battery: false
20:23:28:WU02:FS02:0xa7: UTC Offset: 0
20:23:28:WU02:FS02:0xa7:        PID: 3278
20:23:28:WU02:FS02:0xa7:        CWD: /var/lib/fahclient/work
20:23:28:WU02:FS02:0xa7:******************************** Build - libFAH ********************************
20:23:28:WU02:FS02:0xa7:    Version: 0.0.18
20:23:28:WU02:FS02:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:23:28:WU02:FS02:0xa7:  Copyright: 2019 foldingathome.org
20:23:28:WU02:FS02:0xa7:   Homepage: https://foldingathome.org/
20:23:28:WU02:FS02:0xa7:       Date: Nov 5 2019
20:23:28:WU02:FS02:0xa7:       Time: 06:13:26
20:23:28:WU02:FS02:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
20:23:28:WU02:FS02:0xa7:     Branch: master
20:23:28:WU02:FS02:0xa7:   Compiler: GNU 8.3.0
20:23:28:WU02:FS02:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
20:23:28:WU02:FS02:0xa7:   Platform: linux2 4.19.0-5-amd64
20:23:28:WU02:FS02:0xa7:       Bits: 64
20:23:28:WU02:FS02:0xa7:       Mode: Release
20:23:28:WU02:FS02:0xa7:************************************ Build *************************************
20:23:28:WU02:FS02:0xa7:       SIMD: avx_256
20:23:28:WU02:FS02:0xa7:********************************************************************************
20:23:28:WU02:FS02:0xa7:Project: 14627 (Run 805, Clone 0, Gen 74)
20:23:28:WU02:FS02:0xa7:Unit: 0x000000500002894c5e8789fc1644f2d4
20:23:28:WU02:FS02:0xa7:Reading tar file core.xml
20:23:28:WU02:FS02:0xa7:Reading tar file frame74.tpr
20:23:28:WU02:FS02:0xa7:Digital signatures verified
20:23:28:WU02:FS02:0xa7:Calling: mdrun -s frame74.tpr -o frame74.trr -cpt 15 -nt 1
20:23:28:WU02:FS02:0xa7:Steps: first=0 total=250000
20:23:28:WU04:FS01:Assigned to work server 150.136.14.110
20:23:28:WU04:FS01:Requesting new work unit for slot 01: READY cpu:1 from 150.136.14.110
20:23:28:WU04:FS01:Connecting to 150.136.14.110:8080
20:23:28:WU04:FS01:Downloading 2.34MiB
20:23:29:WU02:FS02:0xa7:Completed 1 out of 250000 steps (0%)
20:23:29:WU03:FS03:Downloading 68.56MiB
20:23:30:WU04:FS01:Download complete
20:23:30:WU04:FS01:Received Unit: id:04 state:DOWNLOAD error:NO_ERROR project:16416 run:0 clone:1880 gen:101 core:0xa7 unit:0x0000007296880e6e5e82560ba32dc7b9
20:23:30:WU04:FS01:Starting
20:23:30:WU04:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/avx/Core_a7.fah/FahCore_a7 -dir 04 -suffix 01 -version 706 -lifeline 3229 -checkpoint 15 -np 1
20:23:30:WU04:FS01:Started FahCore on PID 3282
20:23:30:WU04:FS01:Core PID:3286
20:23:30:WU04:FS01:FahCore 0xa7 started
20:23:30:WU04:FS01:0xa7:*********************** Log Started 2020-05-06T20:23:30Z ***********************
20:23:30:WU04:FS01:0xa7:************************** Gromacs Folding@home Core ***************************
20:23:30:WU04:FS01:0xa7:       Type: 0xa7
20:23:30:WU04:FS01:0xa7:       Core: Gromacs
20:23:30:WU04:FS01:0xa7:       Args: -dir 04 -suffix 01 -version 706 -lifeline 3282 -checkpoint 15 -np 1
20:23:30:WU04:FS01:0xa7:************************************ CBang *************************************
20:23:30:WU04:FS01:0xa7:       Date: Nov 5 2019
20:23:30:WU04:FS01:0xa7:       Time: 06:06:57
20:23:30:WU04:FS01:0xa7:   Revision: 46c96f1aa8419571d83f3e63f9c99a0d602f6da9
20:23:30:WU04:FS01:0xa7:     Branch: master
20:23:30:WU04:FS01:0xa7:   Compiler: GNU 8.3.0
20:23:30:WU04:FS01:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie -fPIC
20:23:30:WU04:FS01:0xa7:   Platform: linux2 4.19.0-5-amd64
20:23:30:WU04:FS01:0xa7:       Bits: 64
20:23:30:WU04:FS01:0xa7:       Mode: Release
20:23:30:WU04:FS01:0xa7:************************************ System ************************************
20:23:30:WU04:FS01:0xa7:        CPU: Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
20:23:30:WU04:FS01:0xa7:     CPU ID: GenuineIntel Family 6 Model 85 Stepping 7
20:23:30:WU04:FS01:0xa7:       CPUs: 4
20:23:30:WU04:FS01:0xa7:     Memory: 15.38GiB
20:23:30:WU04:FS01:0xa7:Free Memory: 14.71GiB
20:23:30:WU04:FS01:0xa7:    Threads: POSIX_THREADS
20:23:30:WU04:FS01:0xa7: OS Version: 4.14
20:23:30:WU04:FS01:0xa7:Has Battery: false
20:23:30:WU04:FS01:0xa7: On Battery: false
20:23:30:WU04:FS01:0xa7: UTC Offset: 0
20:23:30:WU04:FS01:0xa7:        PID: 3286
20:23:30:WU04:FS01:0xa7:        CWD: /var/lib/fahclient/work
20:23:30:WU04:FS01:0xa7:******************************** Build - libFAH ********************************
20:23:30:WU04:FS01:0xa7:    Version: 0.0.18
20:23:30:WU04:FS01:0xa7:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:23:30:WU04:FS01:0xa7:  Copyright: 2019 foldingathome.org
20:23:30:WU04:FS01:0xa7:   Homepage: https://foldingathome.org/
20:23:30:WU04:FS01:0xa7:       Date: Nov 5 2019
20:23:30:WU04:FS01:0xa7:       Time: 06:13:26
20:23:30:WU04:FS01:0xa7:   Revision: 490c9aa2957b725af319379424d5c5cb36efb656
20:23:30:WU04:FS01:0xa7:     Branch: master
20:23:30:WU04:FS01:0xa7:   Compiler: GNU 8.3.0
20:23:30:WU04:FS01:0xa7:    Options: -std=c++11 -O3 -funroll-loops -fno-pie
20:23:30:WU04:FS01:0xa7:   Platform: linux2 4.19.0-5-amd64
20:23:30:WU04:FS01:0xa7:       Bits: 64
20:23:30:WU04:FS01:0xa7:       Mode: Release
20:23:30:WU04:FS01:0xa7:************************************ Build *************************************
20:23:30:WU04:FS01:0xa7:       SIMD: avx_256
20:23:30:WU04:FS01:0xa7:********************************************************************************
20:23:30:WU04:FS01:0xa7:Project: 16416 (Run 0, Clone 1880, Gen 101)
20:23:30:WU04:FS01:0xa7:Unit: 0x0000007296880e6e5e82560ba32dc7b9
20:23:30:WU04:FS01:0xa7:Reading tar file core.xml
20:23:30:WU04:FS01:0xa7:Reading tar file frame101.tpr
20:23:30:WU04:FS01:0xa7:Digital signatures verified
20:23:30:WU04:FS01:0xa7:Calling: mdrun -s frame101.tpr -o frame101.trr -x frame101.xtc -cpt 15 -nt 1
20:23:30:WU04:FS01:0xa7:Steps: first=25250000 total=250000
20:23:30:WU04:FS01:0xa7:Completed 1 out of 250000 steps (0%)
20:23:35:WU03:FS03:Download 24.25%
20:23:41:WU03:FS03:Download 72.47%
20:23:43:WU03:FS03:Download complete
20:23:43:WU03:FS03:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:14433 run:0 clone:1362 gen:71 core:0x22 unit:0x0000005a0d5262775e8b4d532c1382eb
20:23:43:WU03:FS03:Starting
20:23:43:WU03:FS03:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 03 -suffix 01 -version 706 -lifeline 3229 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
20:23:43:WU03:FS03:Started FahCore on PID 3290
20:23:43:WU03:FS03:Core PID:3294
20:23:43:WU03:FS03:FahCore 0x22 started
20:23:44:WU03:FS03:0x22:*********************** Log Started 2020-05-06T20:23:43Z ***********************
20:23:44:WU03:FS03:0x22:*************************** Core22 Folding@home Core ***************************
20:23:44:WU03:FS03:0x22:       Type: 0x22
20:23:44:WU03:FS03:0x22:       Core: Core22
20:23:44:WU03:FS03:0x22:    Website: https://foldingathome.org/
20:23:44:WU03:FS03:0x22:  Copyright: (c) 2009-2018 foldingathome.org
20:23:44:WU03:FS03:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
20:23:44:WU03:FS03:0x22:             <rafal.wiewiora@choderalab.org>
20:23:44:WU03:FS03:0x22:       Args: -dir 03 -suffix 01 -version 706 -lifeline 3290 -checkpoint 15
20:23:44:WU03:FS03:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
20:23:44:WU03:FS03:0x22:             0 -gpu 0
20:23:44:WU03:FS03:0x22:     Config: <none>
20:23:44:WU03:FS03:0x22:************************************ Build *************************************
20:23:44:WU03:FS03:0x22:    Version: 0.0.5
20:23:44:WU03:FS03:0x22:       Date: Apr 22 2020
20:23:44:WU03:FS03:0x22:       Time: 03:57:11
20:23:44:WU03:FS03:0x22: Repository: Git
20:23:44:WU03:FS03:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
20:23:44:WU03:FS03:0x22:     Branch: HEAD
20:23:44:WU03:FS03:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
20:23:44:WU03:FS03:0x22:    Options: -std=c++11 -O3 -funroll-loops
20:23:44:WU03:FS03:0x22:   Platform: linux2 4.19.76-linuxkit
20:23:44:WU03:FS03:0x22:       Bits: 64
20:23:44:WU03:FS03:0x22:       Mode: Release
20:23:44:WU03:FS03:0x22:************************************ System ************************************
20:23:44:WU03:FS03:0x22:        CPU: Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz
20:23:44:WU03:FS03:0x22:     CPU ID: GenuineIntel Family 6 Model 85 Stepping 7
20:23:44:WU03:FS03:0x22:       CPUs: 4
20:23:44:WU03:FS03:0x22:     Memory: 15.38GiB
20:23:44:WU03:FS03:0x22:Free Memory: 14.59GiB
20:23:44:WU03:FS03:0x22:    Threads: POSIX_THREADS
20:23:44:WU03:FS03:0x22: OS Version: 4.14
20:23:44:WU03:FS03:0x22:Has Battery: false
20:23:44:WU03:FS03:0x22: On Battery: false
20:23:44:WU03:FS03:0x22: UTC Offset: 0
20:23:44:WU03:FS03:0x22:        PID: 3294
20:23:44:WU03:FS03:0x22:        CWD: /var/lib/fahclient/work
20:23:44:WU03:FS03:0x22:         OS: Linux 4.14.173-137.229.amzn2.x86_64 x86_64
20:23:44:WU03:FS03:0x22:    OS Arch: AMD64
20:23:44:WU03:FS03:0x22:********************************************************************************
20:23:44:WU03:FS03:0x22:Project: 14433 (Run 0, Clone 1362, Gen 71)
20:23:44:WU03:FS03:0x22:Unit: 0x0000005a0d5262775e8b4d532c1382eb
20:23:44:WU03:FS03:0x22:Reading tar file core.xml
20:23:44:WU03:FS03:0x22:Reading tar file integrator.xml
20:23:44:WU03:FS03:0x22:Reading tar file state.xml
20:23:44:WU03:FS03:0x22:Reading tar file system.xml
20:23:44:WU03:FS03:0x22:Digital signatures verified
20:23:44:WU03:FS03:0x22:Folding@home GPU Core22 Folding@home Core
20:23:44:WU03:FS03:0x22:Version 0.0.5
20:24:00:WU03:FS03:0x22:Completed 0 out of 2000000 steps (0%)
20:24:00:WU03:FS03:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
20:24:42:WU00:FS00:0xa7:Completed 12500 out of 250000 steps (5%)
20:26:37:WU03:FS03:0x22:Completed 20000 out of 2000000 steps (1%)
20:28:10:WU04:FS01:0xa7:Completed 2500 out of 250000 steps (1%)
20:29:11:WU03:FS03:0x22:Completed 40000 out of 2000000 steps (2%)
20:31:46:WU03:FS03:0x22:Completed 60000 out of 2000000 steps (3%)

What am I doing wrong?

Thanks!
Last edited by Crunchtimer on Thu May 07, 2020 5:57 pm, edited 1 time in total.
JimboPalmer
Posts: 2573
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Only 1 CPU producing

Post by JimboPalmer »

Welcome to Folding@Home!

20:23:22: <slot id='0' type='CPU'/>
20:23:22: <slot id='1' type='CPU'/>
20:23:22: <slot id='2' type='CPU'/>
20:23:22: <slot id='3' type='GPU'/>

You want one CPU slot with its CPU count set to 3 (or -1, which lets the client pick the count), plus one GPU slot. That will work best and make the most points/science.

F@H rewards quick results, so one slot using 3 threads (F@H calls them CPUs) finishes its work unit sooner than 3 separate single-CPU slots and helps more.
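
On a command-line install, the same thing can be written straight into config.xml. A minimal sketch, assuming the v7 client reads the per-slot CPU count from a `cpus` option nested inside the slot element (check this against your own client version before relying on it):

```xml
<!-- Folding Slots: one CPU slot limited to 3 threads, one GPU slot -->
<slot id='0' type='CPU'>
  <!-- 3 keeps one of the 4 CPUs free for the GPU slot; -1 would let the client decide -->
  <cpus v='3'/>
</slot>
<slot id='1' type='GPU'/>
```

It is probably safest to edit the file while the client is stopped, since a running client may overwrite manual changes.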
Tsar of all the Rushers
I tried to remain childlike, all I achieved was childish.
A friend to those who want no friends
Crunchtimer
Posts: 50
Joined: Tue May 05, 2020 5:34 am

Re: Only 1 CPU producing

Post by Crunchtimer »

Thanks for the response!
I agree with you. My initial config (the same one I have on the other server, where it works fine) was:

<!-- Folding Slots -->
<slot id='0' type='CPU'/>
<slot id='1' type='GPU'/>

But then it said: "FS00:Changed SMP threads from 4 to 3 this can cause some work units to fail" and put only 1 CPU + 1 GPU to work, instead of 3 CPUs + 1 GPU.
Can I do anything now, or should I wait for it to correct itself?
JimboPalmer
Posts: 2573
Joined: Mon Feb 16, 2009 4:12 am
Location: Greenwood MS USA

Re: Only 1 CPU producing

Post by JimboPalmer »

I am not a Linux user, but if you can start FAHControl, the steps should be the same:

On the main screen there is a Configure button on the left; click it.
On the next screen there is a Slots tab; click it.

There, remove two of the CPU slots, then edit the remaining CPU slot and set it to 3 CPUs (or -1).
Type the number you want and click Save.
Crunchtimer
Posts: 50
Joined: Tue May 05, 2020 5:34 am

Re: Only 1 CPU producing

Post by Crunchtimer »

Thanks for the tip. I'm fine with the GUI versions,
but I run the Linux servers from the command line.
Crunchtimer
Posts: 50
Joined: Tue May 05, 2020 5:34 am

Re: Only 1 CPU producing

Post by Crunchtimer »

Solved (again)!

OK, so the way to get all 3 CPUs assigned (300% under top) to a single FahCore_a7 process was to kill the process while a correct config file was in place. Rebooting was insufficient.

Correct config for 3 CPUs and 1 GPU:

Code:

  <slot id='0' type='CPU'/>
  <slot id='1' type='GPU'/>

Command to kill all 3 instances of FahCore_a7:

Code:

sudo killall -9 FahCore_a7
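
To check the result without eyeballing top, one option is to read the -np value the client passes to the core; the log earlier in the thread shows it as `-np 1` on the FahCore_a7 command line. A rough sketch follows. `np_from_args` is a hypothetical helper, and the ps invocation in the comment assumes a procps-style ps.

```shell
#!/bin/sh
# Sketch only: report how many threads a FahCore was started with.
# np_from_args is a hypothetical helper; it reads command lines on stdin
# and prints the number that follows the -np option (nothing if -np is absent).

np_from_args() {
    sed -n 's/.* -np \([0-9][0-9]*\).*/\1/p'
}

# On a live system (assumes a procps-style ps):
#   ps -C FahCore_a7 -o args= | np_from_args
```

One line of output per running FahCore_a7 process makes it easy to tell a single multi-threaded core from several single-threaded ones.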
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Location: Land Of The Long White Cloud

Re: SOLVED: Only 1 CPU producing

Post by PantherX »

Generally speaking, killing the FahCore_a7 process can result in data loss. Pausing the client should send a signal that stops FahCore_a7 gracefully. It may take a minute or two, especially if it is writing a checkpoint.
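
What that could look like from the command line, as a sketch only: it assumes the v7 FAHClient binary is on the PATH and accepts `--send-pause` to signal the already running client. If your build lacks that option, pausing through FAHControl or the service script is the fallback. `wait_for_exit` and `graceful_pause` are made-up helper names, and the 120-second limit is arbitrary.

```shell
#!/bin/sh
# Sketch only: ask the client to pause, then wait for the core to exit by itself.
# Assumptions: FAHClient is on PATH and supports --send-pause; pgrep is installed.

# wait_for_exit NAME TIMEOUT_SECONDS
# Returns 0 once no process named exactly NAME is running,
# or 1 if it is still running after TIMEOUT_SECONDS.
wait_for_exit() {
    name=$1
    remaining=$2
    while pgrep -x "$name" >/dev/null 2>&1; do
        if [ "$remaining" -le 0 ]; then
            return 1
        fi
        sleep 1
        remaining=$((remaining - 1))
    done
    return 0
}

# graceful_pause: pause all slots and give FahCore_a7 time to write its checkpoint.
graceful_pause() {
    FAHClient --send-pause || return 1
    wait_for_exit FahCore_a7 120
}

# Example: graceful_pause || echo "core still running, check the log before killing it"
```

Only if the wait times out would a kill be worth considering, and even then a plain `killall FahCore_a7` (SIGTERM) is gentler than `-9`.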
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: SOLVED: Only 1 CPU producing

Post by bruce »

The message "Changed SMP threads from M to N this can cause some work units to fail" is safe to ignore. There are very few cases where a problem will be caused by this sort of change.
Crunchtimer
Posts: 50
Joined: Tue May 05, 2020 5:34 am

Re: SOLVED: Only 1 CPU producing

Post by Crunchtimer »

PantherX wrote: Generally speaking, killing the FahCore_a7 process can result in data loss. Pausing the client should send a signal to stop FahCore_a7 gracefully. It may take a minute or two especially if it is writing a checkpoint.

Yes, you are right that loss occurs, so one has to time it correctly, i.e. right after an upload.

I was previously on client 7.4.4, and then I was able to stop and start the service gracefully. With 7.6.13 the fahclient user holds the process in a firm grip, so I was left with the sledgehammer from the CLI :)
Rel25917
Posts: 303
Joined: Wed Aug 15, 2012 2:31 am

Re: SOLVED: Only 1 CPU producing

Post by Rel25917 »

ESC[93m19:25:37:WARNING:WU00:FS00:AS lowered CPUs from 3 to 1ESC[0m

I'm assuming this was the first unit downloaded. There's an old bug that causes the first unit to run on a single core. Future units will run properly.
Crunchtimer
Posts: 50
Joined: Tue May 05, 2020 5:34 am

Re: SOLVED: Only 1 CPU producing

Post by Crunchtimer »

Rel25917 wrote: ESC[93m19:25:37:WARNING:WU00:FS00:AS lowered CPUs from 3 to 1ESC[0m

I'm assuming this was the first unit downloaded, there's an old bug that causes the first unit to run on a single core. Future units will run properly.

Yes, you are right, and good that it will be fixed!