Page 1 of 1

ERROR:GPU with PCI bus 0 and slot 2 not found

Posted: Sat Dec 25, 2021 12:07 pm
by promeneur
today again when i start in the morning my PC I get this error:

...
08:59:45:Successfully acquired database lock
08:59:45:ERROR:GPU with PCI bus 0 and slot 2 not found.: Deleting folding slot.
08:59:45:FS00:Initialized folding slot 00: cpu:3
08:59:45:WARNING:FS02:Disabling beta GPU slot 02: gpu:1:0. Beta GPUs can be tested for no points by setting ``gpu-beta=true`` in the configuration.
08:59:45:WARNING:FS01:``opencl-index`` 0 did not match GPU
08:59:45:WARNING:FS01:No CUDA or OpenCL 1.2+ support detected for GPU slot 01: gpu:-1:-1. Disabling.
...

Here is the complete log

Code: Select all

*********************** Log Started 2021-12-25T08:59:45Z ***********************
08:59:45:******************************* libFAH ********************************
08:59:45:           Date: Oct 20 2020
08:59:45:           Time: 20:36:41
08:59:45:       Revision: 5ca109d295a6245e2a2f590b3d0085ad5e567aeb
08:59:45:         Branch: master
08:59:45:       Compiler: GNU 4.9.4
08:59:45:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
08:59:45:                 -O3 -funroll-loops
08:59:45:       Platform: linux2 5.8.0-1-amd64
08:59:45:           Bits: 64
08:59:45:           Mode: Release
08:59:45:****************************** FAHClient ******************************
08:59:45:        Version: 7.6.21
08:59:45:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
08:59:45:      Copyright: 2020 foldingathome.org
08:59:45:       Homepage: https://foldingathome.org/
08:59:45:           Date: Oct 20 2020
08:59:45:           Time: 20:38:59
08:59:45:       Revision: 6efbf0e138e22d3963e6a291f78dcb9c6422a278
08:59:45:         Branch: master
08:59:45:       Compiler: GNU 4.9.4
08:59:45:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
08:59:45:                 -O3 -funroll-loops
08:59:45:       Platform: linux2 5.8.0-1-amd64
08:59:45:           Bits: 64
08:59:45:           Mode: Release
08:59:45:           Args: /etc/fahclient/config.xml
08:59:45:                 --pid-file=/run/fahclient/fahclient.pid
08:59:45:         Config: /etc/fahclient/config.xml
08:59:45:******************************** CBang ********************************
08:59:45:           Date: Oct 20 2020
08:59:45:           Time: 18:38:01
08:59:45:       Revision: 7e4ce85225d7eaeb775e87c31740181ca603de60
08:59:45:         Branch: master
08:59:45:       Compiler: GNU 4.9.4
08:59:45:        Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections
08:59:45:                 -O3 -funroll-loops -fPIC
08:59:45:       Platform: linux2 5.8.0-1-amd64
08:59:45:           Bits: 64
08:59:45:           Mode: Release
08:59:45:******************************* System ********************************
08:59:45:            CPU: Intel(R) Core(TM) i5-7400 CPU @ 3.00GHz
08:59:45:         CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
08:59:45:           CPUs: 4
08:59:45:         Memory: 15.50GiB
08:59:45:    Free Memory: 14.66GiB
08:59:45:        Threads: POSIX_THREADS
08:59:45:     OS Version: 5.3
08:59:45:    Has Battery: false
08:59:45:     On Battery: false
08:59:45:     UTC Offset: 1
08:59:45:            PID: 1802
08:59:45:            CWD: /var/lib/fahclient
08:59:45:             OS: Linux 5.3.18-59.37-default x86_64
08:59:45:        OS Arch: AMD64
08:59:45:           GPUs: 1
08:59:45:          GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:1
08:59:45:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:3.5 Driver:11.4
08:59:45:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:3.0 Driver:470.86
08:59:45:OpenCL Device 1: Platform:1 Device:0 Bus:NA Slot:NA Compute:2.0 Driver:1.3
08:59:45:***********************************************************************
08:59:45:<config>
08:59:45:  <!-- Network -->
08:59:45:  <proxy v=':8080'/>
08:59:45:
08:59:45:  <!-- Slot Control -->
08:59:45:  <power v='FULL'/>
08:59:45:
08:59:45:  <!-- User Information -->
08:59:45:  <passkey v='*****'/>
08:59:45:  <team v='51'/>
08:59:45:  <user v='philippe_roubach'/>
08:59:45:
08:59:45:  <!-- Folding Slots -->
08:59:45:  <slot id='0' type='CPU'>
08:59:45:    <cpus v='3'/>
08:59:45:  </slot>
08:59:45:  <slot id='2' type='GPU'>
08:59:45:    <pci-bus v='1'/>
08:59:45:    <pci-slot v='0'/>
08:59:45:  </slot>
08:59:45:  <slot id='1' type='GPU'>
08:59:45:    <opencl-index v='0'/>
08:59:45:    <pci-bus v='0'/>
08:59:45:    <pci-slot v='2'/>
08:59:45:  </slot>
08:59:45:</config>
08:59:45:Trying to access database...
08:59:45:Successfully acquired database lock
08:59:45:ERROR:GPU with PCI bus 0 and slot 2 not found.: Deleting folding slot.
08:59:45:FS00:Initialized folding slot 00: cpu:3
08:59:45:WARNING:FS02:Disabling beta GPU slot 02: gpu:1:0.  Beta GPUs can be tested for no points by setting ``gpu-beta=true`` in the configuration.
08:59:45:WARNING:FS01:``opencl-index`` 0 did not match GPU
08:59:45:WARNING:FS01:No CUDA or OpenCL 1.2+ support detected for GPU slot 01: gpu:-1:-1.  Disabling.
08:59:45:WARNING:WU00:No longer matches Slot 2's configuration and there are no other matching slots, dumping
08:59:45:WU00:FS02:Sending unit results: id:00 state:SEND error:DUMPED project:16487 run:0 clone:5 gen:95 core:0x22 unit:0x000000050000005f0000406700000000
08:59:45:WU00:FS02:Connecting to 140.163.4.200:8080
08:59:45:WU01:FS00:Starting
08:59:45:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:45:WU00:FS02:Connecting to 140.163.4.200:80
08:59:45:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8 -dir 01 -suffix 01 -version 706 -lifeline 1802 -checkpoint 15 -np 3
08:59:45:WU01:FS00:Started FahCore on PID 1903
08:59:45:WU01:FS00:Core PID:1907
08:59:45:WU01:FS00:FahCore 0xa8 started
08:59:46:WU01:FS00:0xa8:*********************** Log Started 2021-12-25T08:59:45Z ***********************
08:59:46:WU01:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
08:59:46:WU01:FS00:0xa8:       Core: Gromacs
08:59:46:WU01:FS00:0xa8:       Type: 0xa8
08:59:46:WU01:FS00:0xa8:    Version: 0.0.12
08:59:46:WU01:FS00:0xa8:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
08:59:46:WU01:FS00:0xa8:  Copyright: 2020 foldingathome.org
08:59:46:WU01:FS00:0xa8:   Homepage: https://foldingathome.org/
08:59:46:WU01:FS00:0xa8:       Date: Jan 16 2021
08:59:46:WU01:FS00:0xa8:       Time: 19:24:44
08:59:46:WU01:FS00:0xa8:   Compiler: GNU 8.3.0
08:59:46:WU01:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
08:59:46:WU01:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
08:59:46:WU01:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
08:59:46:WU01:FS00:0xa8:       Bits: 64
08:59:46:WU01:FS00:0xa8:       Mode: Release
08:59:46:WU01:FS00:0xa8:       SIMD: avx2_256
08:59:46:WU01:FS00:0xa8:     OpenMP: ON
08:59:46:WU01:FS00:0xa8:       CUDA: OFF
08:59:46:WU01:FS00:0xa8:       Args: -dir 01 -suffix 01 -version 706 -lifeline 1903 -checkpoint 15 -np 3
08:59:46:WU01:FS00:0xa8:************************************ libFAH ************************************
08:59:46:WU01:FS00:0xa8:       Date: Jan 16 2021
08:59:46:WU01:FS00:0xa8:       Time: 19:21:38
08:59:46:WU01:FS00:0xa8:   Compiler: GNU 8.3.0
08:59:46:WU01:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
08:59:46:WU01:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie
08:59:46:WU01:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
08:59:46:WU01:FS00:0xa8:       Bits: 64
08:59:46:WU01:FS00:0xa8:       Mode: Release
08:59:46:WU01:FS00:0xa8:************************************ CBang *************************************
08:59:46:WU01:FS00:0xa8:       Date: Jan 16 2021
08:59:46:WU01:FS00:0xa8:       Time: 19:21:24
08:59:46:WU01:FS00:0xa8:   Compiler: GNU 8.3.0
08:59:46:WU01:FS00:0xa8:    Options: -faligned-new -std=c++14 -fsigned-char -ffunction-sections
08:59:46:WU01:FS00:0xa8:             -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
08:59:46:WU01:FS00:0xa8:   Platform: linux2 4.15.0-128-generic
08:59:46:WU01:FS00:0xa8:       Bits: 64
08:59:46:WU01:FS00:0xa8:       Mode: Release
08:59:46:WU01:FS00:0xa8:************************************ System ************************************
08:59:46:WU01:FS00:0xa8:        CPU: Intel(R) Core(TM) i5-7400 CPU @ 3.00GHz
08:59:46:WU01:FS00:0xa8:     CPU ID: GenuineIntel Family 6 Model 158 Stepping 9
08:59:46:WU01:FS00:0xa8:       CPUs: 4
08:59:46:WU01:FS00:0xa8:     Memory: 15.50GiB
08:59:46:WU01:FS00:0xa8:Free Memory: 14.42GiB
08:59:46:WU01:FS00:0xa8:    Threads: POSIX_THREADS
08:59:46:WU01:FS00:0xa8: OS Version: 5.3
08:59:46:WU01:FS00:0xa8:Has Battery: false
08:59:46:WU01:FS00:0xa8: On Battery: false
08:59:46:WU01:FS00:0xa8: UTC Offset: 1
08:59:46:WU01:FS00:0xa8:        PID: 1907
08:59:46:WU01:FS00:0xa8:        CWD: /var/lib/fahclient/work
08:59:46:WU01:FS00:0xa8:********************************************************************************
08:59:46:WU01:FS00:0xa8:Project: 18435 (Run 17, Clone 61, Gen 27)
08:59:46:WU01:FS00:0xa8:Unit: 0x00000000000000000000000000000000
08:59:46:WU01:FS00:0xa8:Digital signatures verified
08:59:46:WU01:FS00:0xa8:Calling: mdrun -c frame27.gro -s frame27.tpr -x frame27.xtc -cpi state.cpt -cpt 15 -nt 3 -ntmpi 1
08:59:46:WU01:FS00:0xa8:Steps: first=135000000 total=140000000
08:59:46:WU01:FS00:0xa8:Completed 3044902 out of 5000000 steps (60%)
08:59:46:WARNING:WU00:FS02:Exception: Failed to send results to work server: Failed to connect to 140.163.4.200:80: Network is unreachable
08:59:46:WU00:FS02:Trying to send results to collection server
08:59:46:WU00:FS02:Connecting to 140.163.4.210:8080
08:59:46:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:46:WU00:FS02:Connecting to 140.163.4.210:80
08:59:47:ERROR:WU00:FS02:Exception: Failed to connect to 140.163.4.210:80: Network is unreachable
08:59:47:WU00:FS02:Sending unit results: id:00 state:SEND error:DUMPED project:16487 run:0 clone:5 gen:95 core:0x22 unit:0x000000050000005f0000406700000000
08:59:47:WU00:FS02:Connecting to 140.163.4.200:8080
08:59:47:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:47:WU00:FS02:Connecting to 140.163.4.200:80
08:59:47:WARNING:WU00:FS02:Exception: Failed to send results to work server: Failed to connect to 140.163.4.200:80: Network is unreachable
08:59:47:WU00:FS02:Trying to send results to collection server
08:59:47:WU00:FS02:Connecting to 140.163.4.210:8080
08:59:47:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:47:WU00:FS02:Connecting to 140.163.4.210:80
08:59:47:ERROR:WU00:FS02:Exception: Failed to connect to 140.163.4.210:80: Network is unreachable
09:00:46:Saving configuration to /etc/fahclient/config.xml
09:00:46:<config>
09:00:46:  <!-- Network -->
09:00:46:  <proxy v=':8080'/>
09:00:46:
09:00:46:  <!-- Slot Control -->
09:00:46:  <power v='FULL'/>
09:00:46:
09:00:46:  <!-- User Information -->
09:00:46:  <passkey v='*****'/>
09:00:46:  <team v='51'/>
09:00:46:  <user v='philippe_roubach'/>
09:00:46:
09:00:46:  <!-- Folding Slots -->
09:00:46:  <slot id='0' type='CPU'>
09:00:46:    <cpus v='3'/>
09:00:46:  </slot>
09:00:46:  <slot id='2' type='GPU'>
09:00:46:    <pci-bus v='1'/>
09:00:46:    <pci-slot v='0'/>
09:00:46:  </slot>
09:00:46:</config>
09:00:47:WU00:FS02:Sending unit results: id:00 state:SEND error:DUMPED project:16487 run:0 clone:5 gen:95 core:0x22 unit:0x000000050000005f0000406700000000
09:00:47:WU00:FS02:Connecting to 140.163.4.200:8080
09:00:48:WU00:FS02:Server responded WORK_ACK (400)
09:00:48:WU00:FS02:Cleaning up
09:00:55:WU01:FS00:0xa8:Completed 3050000 out of 5000000 steps (61%)
09:13:30:WU01:FS00:0xa8:Completed 3100000 out of 5000000 steps (62%)

Re: ERROR:GPU with PCI bus 0 and slot 2 not found

Posted: Sat Dec 25, 2021 12:37 pm
by PaulTV
Hi,

Something like taht happened on my machine a couple times as well, and it's quite annoying.

1) Stop the client (not only folding, but also the background fahclient process)
2) Remove GPUs.txt (on Windows in C:\ProgramData\FAHClient, on Linux in /var/lib/fahclient)
3) Start the client

Hopefully that'll fix it for you as well.

I still don't know if the gpus.txt gets corrupted during normal operation, or if this is because the computer may not have internet access yet when the client is started. The next time it happens to me, I'm gonna save the GPUs.txt file and make a case out of it.

Re: ERROR:GPU with PCI bus 0 and slot 2 not found

Posted: Sat Dec 25, 2021 12:56 pm
by promeneur
I only stopped, then started faclient to fix the problem.

Thanks

Re: ERROR:GPU with PCI bus 0 and slot 2 not found

Posted: Sat Dec 25, 2021 1:48 pm
by Neil-B
Slightly confused ... your first post implies you had just started your pc this morning and got this error - which might indicate either no Internet gpus.txt issue or the drivers updating possibly as part of a system update as part of the shutdown/restart ... your next post implies you started fachlient to give the problem which I'm not sure I understand - dud the problem exists before you shut down?

Re: ERROR:GPU with PCI bus 0 and slot 2 not found

Posted: Sat Dec 25, 2021 1:56 pm
by promeneur
I don't understand what is confused.

Yesterday no problem.
I start my PC this morning then I saw the issue.
Then I fixed the problem by restarting fahclient.

Last update of nvidia driver : 2021 november

Re: ERROR:GPU with PCI bus 0 and slot 2 not found

Posted: Sat Dec 25, 2021 3:09 pm
by toTOW
08:59:46:WARNING:WU00:FS02:Exception: Failed to send results to work server: Failed to connect to 140.163.4.200:80: Network is unreachable
08:59:46:WU00:FS02:Trying to send results to collection server
08:59:46:WU00:FS02:Connecting to 140.163.4.210:8080
08:59:46:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:46:WU00:FS02:Connecting to 140.163.4.210:80
08:59:47:ERROR:WU00:FS02:Exception: Failed to connect to 140.163.4.210:80: Network is unreachable
08:59:47:WU00:FS02:Sending unit results: id:00 state:SEND error:DUMPED project:16487 run:0 clone:5 gen:95 core:0x22 unit:0x000000050000005f0000406700000000
08:59:47:WU00:FS02:Connecting to 140.163.4.200:8080
08:59:47:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:47:WU00:FS02:Connecting to 140.163.4.200:80
08:59:47:WARNING:WU00:FS02:Exception: Failed to send results to work server: Failed to connect to 140.163.4.200:80: Network is unreachable
08:59:47:WU00:FS02:Trying to send results to collection server
08:59:47:WU00:FS02:Connecting to 140.163.4.210:8080
08:59:47:WARNING:WU00:FS02:WorkServer connection failed on port 8080 trying 80
08:59:47:WU00:FS02:Connecting to 140.163.4.210:80
08:59:47:ERROR:WU00:FS02:Exception: Failed to connect to 140.163.4.210:80: Network is unreachable

I already told you multiple times that you need an active Internet access at client startup ...