GTX 1060, "No compute devices matched", Arch Linux

It seems that a lot of GPU problems revolve around specific versions of drivers. Though NVidia has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Wed Apr 22, 2020 2:46 pm

I'm on Arch Linux, running folding@home 7.5.1. It seems to be unable to use my GPU. This is consistent, though on one occasion it did manage to use the GPU for a short time before losing it again; I don't think I changed anything during that interval. In the log, I get "ERROR:No compute devices matched GPU #0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372. You may need to update your graphics drivers." In the System Info tab of FAHControl, I see

  • GPUs 1
  • GPU 0 Bus:1 Slot:0 Func:0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372
  • CUDA Not detected: cuInit() returned 999
  • OpenCL Not detected: clGetPlatformIDs() returned -1001

I have both cuda and opencl installed, along with nvidia drivers, all up to date as of 22 April 2020. I did check if there were multiple files in /etc/OpenCL/vendors, because someone else had that issue; there were not, just the one.

The full startup log at verbosity 5 is as follows:

Code: Select all
*********************** Log Started 2020-04-22T13:11:19Z ***********************
13:11:19:************************* Folding@home Client *************************
13:11:19:    Website: https://foldingathome.org/
13:11:19:  Copyright: (c) 2009-2018 foldingathome.org
13:11:19:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
13:11:19:       Args:
13:11:19:     Config: /var/lib/private/fah/config.xml
13:11:19:Started thread 1 on PID 2477
13:11:19:******************************** Build ********************************
13:11:19:    Version: 7.5.1
13:11:19:       Date: May 11 2018
13:11:19:       Time: 19:59:04
13:11:19: Repository: Git
13:11:19:   Revision: 4705bf53c635f88b8fe85af7675557e15d491ff0
13:11:19:     Branch: master
13:11:19:   Compiler: GNU 6.3.0 20170516
13:11:19:    Options: -std=gnu++98 -O3 -funroll-loops
13:11:19:   Platform: linux2 4.14.0-3-amd64
13:11:19:       Bits: 64
13:11:19:       Mode: Release
13:11:19:******************************* System ********************************
13:11:19:        CPU: Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz
13:11:19:     CPU ID: GenuineIntel Family 6 Model 94 Stepping 3
13:11:19:       CPUs: 4
13:11:19:     Memory: 15.57GiB
13:11:19:Free Memory: 11.51GiB
13:11:19:    Threads: POSIX_THREADS
13:11:19: OS Version: 5.6
13:11:19:Has Battery: false
13:11:19: On Battery: false
13:11:19: UTC Offset: -4
13:11:19:        PID: 2477
13:11:19:        CWD: /var/lib/private/fah
13:11:19:         OS: Linux 5.6.6-arch1-1 x86_64
13:11:19:    OS Arch: AMD64
13:11:19:       GPUs: 1
13:11:19:      GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372
13:11:19:       CUDA: Not detected: cuInit() returned 999
13:11:19:     OpenCL: Not detected: clGetPlatformIDs() returned -1001
13:11:19:***********************************************************************
13:11:19:<config>
13:11:19:  <!-- Client Control -->
13:11:19:  <client-threads v='6'/>
13:11:19:  <cycle-rate v='4'/>
13:11:19:  <cycles v='-1'/>
13:11:19:  <data-directory v='.'/>
13:11:19:  <disable-sleep-when-active v='true'/>
13:11:19:  <exec-directory v='/opt/fah'/>
13:11:19:  <exit-when-done v='false'/>
13:11:19:  <fold-anon v='false'/>
13:11:19:  <idle-seconds v='300'/>
13:11:19:  <open-web-control v='false'/>
13:11:19:
13:11:19:  <!-- Configuration -->
13:11:19:  <config-rotate v='true'/>
13:11:19:  <config-rotate-dir v='configs'/>
13:11:19:  <config-rotate-max v='16'/>
13:11:19:
13:11:19:  <!-- Debugging -->
13:11:19:  <assignment-servers>
13:11:19:    assign1.foldingathome.org:8080 assign2.foldingathome.org:80
13:11:19:  </assignment-servers>
13:11:19:  <auth-as v='true'/>
13:11:19:  <capture-directory v='capture'/>
13:11:19:  <capture-on-error v='false'/>
13:11:19:  <capture-packets v='false'/>
13:11:19:  <capture-requests v='false'/>
13:11:19:  <capture-responses v='false'/>
13:11:19:  <capture-sockets v='false'/>
13:11:19:  <core-exec v='FahCore_$type'/>
13:11:19:  <core-wrapper-exec v='FAHCoreWrapper'/>
13:11:19:  <debug-sockets v='false'/>
13:11:19:  <exception-locations v='true'/>
13:11:19:  <stack-traces v='false'/>
13:11:19:
13:11:19:  <!-- Error Handling -->
13:11:19:  <max-slot-errors v='10'/>
13:11:19:  <max-unit-errors v='5'/>
13:11:19:
13:11:19:  <!-- Folding Core -->
13:11:19:  <checkpoint v='15'/>
13:11:19:  <core-dir v='cores'/>
13:11:19:  <core-priority v='idle'/>
13:11:19:  <cpu-affinity v='false'/>
13:11:19:  <cpu-usage v='100'/>
13:11:19:  <gpu-usage v='100'/>
13:11:19:  <no-assembly v='false'/>
13:11:19:
13:11:19:  <!-- Folding Slot Configuration -->
13:11:19:  <cause v='ANY'/>
13:11:19:  <client-subtype v='LINUX'/>
13:11:19:  <client-type v='normal'/>
13:11:19:  <cpu-species v='X86_PENTIUM_II'/>
13:11:19:  <cpu-type v='AMD64'/>
13:11:19:  <cpus v='-1'/>
13:11:19:  <disable-viz v='false'/>
13:11:19:  <gpu v='true'/>
13:11:19:  <max-packet-size v='normal'/>
13:11:19:  <os-species v='UNKNOWN'/>
13:11:19:  <os-type v='LINUX'/>
13:11:19:  <project-key v='0'/>
13:11:19:  <smp v='true'/>
13:11:19:
13:11:19:  <!-- GUI -->
13:11:19:  <gui-enabled v='true'/>
13:11:19:
13:11:19:  <!-- HTTP Server -->
13:11:19:  <allow v='127.0.0.1'/>
13:11:19:  <connection-timeout v='60'/>
13:11:19:  <deny v='0/0'/>
13:11:19:  <http-addresses v='0:7396'/>
13:11:19:  <https-addresses v=''/>
13:11:19:  <max-connect-time v='900'/>
13:11:19:  <max-connections v='800'/>
13:11:19:  <max-request-length v='52428800'/>
13:11:19:  <min-connect-time v='300'/>
13:11:19:
13:11:19:  <!-- Logging -->
13:11:19:  <log v='log.txt'/>
13:11:19:  <log-color v='true'/>
13:11:19:  <log-crlf v='false'/>
13:11:19:  <log-date v='false'/>
13:11:19:  <log-date-periodically v='21600'/>
13:11:19:  <log-domain v='false'/>
13:11:19:  <log-header v='true'/>
13:11:19:  <log-level v='true'/>
13:11:19:  <log-no-info-header v='true'/>
13:11:19:  <log-redirect v='false'/>
13:11:19:  <log-rotate v='true'/>
13:11:19:  <log-rotate-dir v='logs'/>
13:11:19:  <log-rotate-max v='16'/>
13:11:19:  <log-short-level v='false'/>
13:11:19:  <log-simple-domains v='true'/>
13:11:19:  <log-thread-id v='false'/>
13:11:19:  <log-thread-prefix v='true'/>
13:11:19:  <log-time v='true'/>
13:11:19:  <log-to-screen v='true'/>
13:11:19:  <log-truncate v='false'/>
13:11:19:  <verbosity v='5'/>
13:11:19:
13:11:19:  <!-- Network -->
13:11:19:  <proxy v=':8080'/>
13:11:19:  <proxy-enable v='false'/>
13:11:19:  <proxy-pass v=''/>
13:11:19:  <proxy-user v=''/>
13:11:19:
13:11:19:  <!-- Process Control -->
13:11:19:  <child v='false'/>
13:11:19:  <daemon v='false'/>
13:11:19:  <fork v='false'/>
13:11:19:  <pid v='false'/>
13:11:19:  <pid-file v='Folding@home Client.pid'/>
13:11:19:  <respawn v='false'/>
13:11:19:  <service v='false'/>
13:11:19:
13:11:19:  <!-- Remote Command Server -->
13:11:19:  <command-address v='0.0.0.0'/>
13:11:19:  <command-allow-no-pass v='127.0.0.1'/>
13:11:19:  <command-deny-no-pass v='0/0'/>
13:11:19:  <command-enable v='true'/>
13:11:19:  <command-port v='36330'/>
13:11:19:
13:11:19:  <!-- Slot Control -->
13:11:19:  <idle v='false'/>
13:11:19:  <max-shutdown-wait v='60'/>
13:11:19:  <pause-on-battery v='true'/>
13:11:19:  <pause-on-start v='false'/>
13:11:19:  <paused v='false'/>
13:11:19:  <power v='full'/>
13:11:19:
13:11:19:  <!-- User Information -->
13:11:19:  <machine-id v='0'/>
13:11:19:  <passkey v='********************************'/>
13:11:19:  <team v='45032'/>
13:11:19:  <user v='Qwertystop'/>
13:11:19:
13:11:19:  <!-- Web Server -->
13:11:19:  <web-allow v='127.0.0.1'/>
13:11:19:  <web-deny v='0/0'/>
13:11:19:  <web-enable v='true'/>
13:11:19:
13:11:19:  <!-- Web Server Sessions -->
13:11:19:  <session-cookie v='sid'/>
13:11:19:  <session-lifetime v='86400'/>
13:11:19:  <session-timeout v='3600'/>
13:11:19:
13:11:19:  <!-- Work Unit Control -->
13:11:19:  <dump-after-deadline v='true'/>
13:11:19:  <max-queue v='16'/>
13:11:19:  <max-units v='0'/>
13:11:19:  <next-unit-percentage v='99'/>
13:11:19:  <stall-detection-enabled v='false'/>
13:11:19:  <stall-percent v='5'/>
13:11:19:  <stall-timeout v='1800'/>
13:11:19:
13:11:19:  <!-- Folding Slots -->
13:11:19:  <slot id='0' type='CPU'/>
13:11:19:  <slot id='1' type='GPU'>
13:11:19:    <paused v='true'/>
13:11:19:  </slot>
13:11:19:</config>
13:11:19:Trying to access database...
13:11:19:Successfully acquired database lock
13:11:19:Enabled folding slot 00: READY cpu:3
13:11:19:Enabled folding slot 01: PAUSED gpu:0:GP106 [GeForce GTX 1060 6GB] 4372 (by user)
13:11:19:ERROR:No compute devices matched GPU #0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372.  You may need to update your graphics drivers.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby ajm » Wed Apr 22, 2020 3:44 pm

13:11:19: CUDA: Not detected: cuInit() returned 999
13:11:19: OpenCL: Not detected: clGetPlatformIDs() returned -1001


If you are certain that Cuda and OpenCL are installed, I would try to uninstall/reinstall FAH. The detection may be better during installation.
If that doesn't work, I would uninstall FAH, and the drivers and OpenCL, then reinstall the whole thing, first drivers (from Nvidia), then OpenCL (sudo apt install ocl-icd-opencl-dev), then FAH.
ajm
 
Posts: 495
Joined: Sat Mar 21, 2020 6:22 am
Location: Lucerne, Switzerland

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Wed Apr 22, 2020 4:35 pm

ajm wrote:
13:11:19: CUDA: Not detected: cuInit() returned 999
13:11:19: OpenCL: Not detected: clGetPlatformIDs() returned -1001


If you are certain that Cuda and OpenCL are installed, I would try to uninstall/reinstall FAH. The detection may be better during installation.
If that doesn't work, I would uninstall FAH, and the drivers and OpenCL, then reinstall the whole thing, first drivers (from Nvidia), then OpenCL (sudo apt install ocl-icd-opencl-dev), then FAH.

Uninstalling and reinstalling FAH led me to realize that of the two Arch packages for folding@home, one (foldingathome-noroot, that I had installed) has apparently ceased to be supported, and the other (foldingathome) seems to have added an additional auxiliary service necessary for nvidia cards. So I copied that service in after doing the reinstall, and it seems to be working now. At the very least, it's now detecting both Cuda and OpenCL; I'll have to wait to get a work unit before I know for sure that it's working.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Sat Apr 25, 2020 12:39 am

Update on this: While it did successfully complete two work units, now it's back to the same "not detected" messages as before in the System Info tab. Attempting a reinstall of FAH didn't work this time. I'm seeing new error messages in the log:

Code: Select all
23:35:23:WU01:FS01:0x22:ERROR:exception: There is no registered Platform called "OpenCL"

and
Code: Select all
23:32:22:ERROR:WU02:FS01:Failed to start core: OpenCL device matching slot 1 not found, make sure the OpenCL driver is installed or try setting 'opencl-index' manually

I did set opencl-index (to 0); that message stopped appearing but the other didn't.

Also, I have seen the more-detailed variant message:
Code: Select all
23:39:13:ERROR:No compute devices matched GPU #0 {
23:39:13:ERROR:  "vendor": 4318,
23:39:13:ERROR:  "device": 7171,
23:39:13:ERROR:  "type": 2,
23:39:13:ERROR:  "species": 7,
23:39:13:ERROR:  "description": "GP106 [GeForce GTX 1060 6GB] 4372"
23:39:13:ERROR:}.  You may need to update your graphics drivers.
Last edited by Qwertystop on Sat Apr 25, 2020 12:41 am, edited 1 time in total.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby PantherX » Sat Apr 25, 2020 12:41 am

Welcome to the F@H Forum Qwertystop,

Out of curiosity, what changed that caused your system to change from functional to non-functional? If it was a software update, would be good to know to help our future Donors who might encounter this issue.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
User avatar
PantherX
Site Moderator
 
Posts: 6327
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Sat Apr 25, 2020 12:51 am

PantherX wrote:Welcome to the F@H Forum Qwertystop,

Out of curiosity, what changed that caused your system to change from functional to non-functional? If it was a software update, would be good to know to help our future Donors who might encounter this issue.

Uncertain. I attempted to set client-type "advanced" (as described at https://foldingathome.org/support/faq/i ... ion-guide/ ), but reverted that change when I realized that I wasn't sure what to do when a WU failed (it seemed to lock up, not proceeding to a new WU, which was not the expected result). I also restarted foldingathome.service in an attempt to get that failed WU to clear. I'm not sure which, if any, of these actions would have caused the problem.

EDIT: Now it seems to have, despite all errors listed in my prior post, attempted to run a WU, and failed. The error portion of the log is:
Code: Select all
00:12:27:WU03:FS01:0x22:ERROR:exception: There is no registered Platform called "OpenCL"
00:12:27:WU03:FS01:0x22:Saving result file ../logfile_01.txt
00:12:27:WU03:FS01:0x22:Saving result file science.log
00:12:27:WU03:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
00:12:27:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
00:12:27:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY project:13400 run:49 clone:17 gen:0 core:0x22 unit:0x0000000012bc7d9a5ea35ec1cba2cf7a

I looked for the two result files, but did not find them for this work unit. I assume they were cleared automatically at some point, presumably when I restarted foldingathome.service to try to clear the "failed" state before noticing those lines in the log. If it happens again I will retrieve the logs and hope they have more error details.
Last edited by Qwertystop on Sat Apr 25, 2020 2:39 am, edited 1 time in total.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby PantherX » Sat Apr 25, 2020 2:38 am

Changing the client-type would not cause changes for OpenCL. Have you tried to restart your system?

Also, how did you attempt to change the client-type; via FAHControl or manually editing the config.xml file?
If it was via FAHControl, then no idea what could cause that issue.
If it was manually, it could be that you incorrectly formatted the config.xml file so you can look at the previous config.xml file to restore your configuration and thus, get back to folding WUs.
User avatar
PantherX
Site Moderator
 
Posts: 6327
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Sat Apr 25, 2020 2:54 am

PantherX wrote:Changing the client-type would not cause changes for OpenCL. Have you tried to restart your system?

Also, how did you attempt to change the client-type; via FAHControl or manually editing the config.xml file?
If it was via FAHControl, then no idea what could cause that issue.
If it was manually, it could be that you incorrectly formatted the config.xml file so you can look at the previous config.xml file to restore your configuration and thus, get back to folding WUs.

Via FAHControl.

I got a new instance of the failed unit; it keeps happening repeatedly. Here's the full log of one instance from the FAHControl interface:
Code: Select all
23:49:45:WU02:FS01:Connecting to 65.254.110.245:80
23:49:45:WARNING:WU02:FS01:Failed to get assignment from '65.254.110.245:80': No WUs available for this configuration
23:49:45:WU02:FS01:Connecting to 18.218.241.186:80
23:49:45:WU02:FS01:Assigned to work server 52.224.109.74
23:49:45:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:GP106 [GeForce GTX 1060 6GB] 4372 from 52.224.109.74
23:49:45:WU02:FS01:Connecting to 52.224.109.74:8080
23:49:48:WU02:FS01:Downloading 161.47MiB
23:49:54:WU02:FS01:Download 18.04%
23:50:00:WU02:FS01:Download 34.10%
23:50:06:WU02:FS01:Download 57.13%
23:50:12:WU02:FS01:Download 78.73%
23:50:17:WU02:FS01:Download complete
23:50:17:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:13879 run:0 clone:1504 gen:47 core:0x22 unit:0x0000003a34e06d4a5e81129d11cab8fe
23:50:17:WU02:FS01:Starting
23:50:17:WU02:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/private/fah/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 02 -suffix 01 -version 706 -lifeline 27986 -checkpoint 15 -gpu-vendor nvidia -opencl-device 0 -cuda-device 0 -gpu 0 -big
23:50:17:WU02:FS01:Started FahCore on PID 28235
23:50:17:WU02:FS01:Core PID:28239
23:50:17:WU02:FS01:FahCore 0x22 started
23:50:18:WU02:FS01:0x22:*********************** Log Started 2020-04-24T23:50:17Z ***********************
23:50:18:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
23:50:18:WU02:FS01:0x22:       Type: 0x22
23:50:18:WU02:FS01:0x22:       Core: Core22
23:50:18:WU02:FS01:0x22:    Website: https://foldingathome.org/
23:50:18:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
23:50:18:WU02:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
23:50:18:WU02:FS01:0x22:             <rafal.wiewiora@choderalab.org>
23:50:18:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 706 -lifeline 28235 -checkpoint 15
23:50:18:WU02:FS01:0x22:             -gpu-vendor nvidia -opencl-device 0 -cuda-device 0 -gpu 0 -big
23:50:18:WU02:FS01:0x22:     Config: <none>
23:50:18:WU02:FS01:0x22:************************************ Build *************************************
23:50:18:WU02:FS01:0x22:    Version: 0.0.5
23:50:18:WU02:FS01:0x22:       Date: Apr 22 2020
23:50:18:WU02:FS01:0x22:       Time: 03:57:11
23:50:18:WU02:FS01:0x22: Repository: Git
23:50:18:WU02:FS01:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
23:50:18:WU02:FS01:0x22:     Branch: HEAD
23:50:18:WU02:FS01:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
23:50:18:WU02:FS01:0x22:    Options: -std=c++11 -O3 -funroll-loops
23:50:18:WU02:FS01:0x22:   Platform: linux2 4.19.76-linuxkit
23:50:18:WU02:FS01:0x22:       Bits: 64
23:50:18:WU02:FS01:0x22:       Mode: Release
23:50:18:WU02:FS01:0x22:************************************ System ************************************
23:50:18:WU02:FS01:0x22:        CPU: Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz
23:50:18:WU02:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 94 Stepping 3
23:50:18:WU02:FS01:0x22:       CPUs: 4
23:50:18:WU02:FS01:0x22:     Memory: 15.57GiB
23:50:18:WU02:FS01:0x22:Free Memory: 3.50GiB
23:50:18:WU02:FS01:0x22:    Threads: POSIX_THREADS
23:50:18:WU02:FS01:0x22: OS Version: 5.6
23:50:18:WU02:FS01:0x22:Has Battery: false
23:50:18:WU02:FS01:0x22: On Battery: false
23:50:18:WU02:FS01:0x22: UTC Offset: -4
23:50:18:WU02:FS01:0x22:        PID: 28239
23:50:18:WU02:FS01:0x22:        CWD: /var/lib/private/fah/work
23:50:18:WU02:FS01:0x22:         OS: Linux 5.6.6-arch1-1 x86_64
23:50:18:WU02:FS01:0x22:    OS Arch: AMD64
23:50:18:WU02:FS01:0x22:********************************************************************************
23:50:18:WU02:FS01:0x22:Project: 13879 (Run 0, Clone 1504, Gen 47)
23:50:18:WU02:FS01:0x22:Unit: 0x0000003a34e06d4a5e81129d11cab8fe
23:50:18:WU02:FS01:0x22:Reading tar file core.xml
23:50:18:WU02:FS01:0x22:Reading tar file integrator.xml
23:50:18:WU02:FS01:0x22:Reading tar file state.xml
23:50:18:WU02:FS01:0x22:Reading tar file system.xml
23:50:18:WU02:FS01:0x22:Digital signatures verified
23:50:18:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
23:50:18:WU02:FS01:0x22:Version 0.0.5
23:50:18:WU02:FS01:0x22:ERROR:exception: There is no registered Platform called "OpenCL"
23:50:18:WU02:FS01:0x22:Saving result file ../logfile_01.txt
23:50:18:WU02:FS01:0x22:Saving result file science.log
23:50:18:WU02:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
23:50:18:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
23:50:18:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:13879 run:0 clone:1504 gen:47 core:0x22 unit:0x0000003a34e06d4a5e81129d11cab8fe
23:50:18:WU02:FS01:Uploading 7.00KiB to 52.224.109.74
23:50:18:WU02:FS01:Connecting to 52.224.109.74:8080
23:50:18:WU02:FS01:Upload complete
23:50:19:WU02:FS01:Server responded WORK_ACK (400)
23:50:19:WU02:FS01:Cleaning up


I can't find the ../logfile_01.txt and science.log mentioned. That is, I found files by those names in those relative positions, but the last-modification times on them don't match the timestamps in the FAHControl log for that work unit. The ones I found are above/in to /var/lib/private/fah/work/02/01, though, which would seem to match assuming the first numbered directory is work-unit and the second is slot. But the actual contents of the logfile_01 there match the contents of the FAHControl log for work-unit 02, slot 00 (my CPU), not slot 01. I can find no logs in /var/lib/private/fah containing "WU02:FS01". So on the whole, digging deeper only got me more confused.

EDIT: Fully restarting computer seems to have made it work. Whether this actually fixed anything or is just coincidence, I don't know. Whether it'll break again, I don't know.
Last edited by Qwertystop on Sat Apr 25, 2020 4:12 am, edited 1 time in total.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby PantherX » Sat Apr 25, 2020 3:51 am

Since you're using FAHControl, please click the Refresh button and then copy the log from the top until after the configuration finishes... it should look something like this:
Code: Select all
*********************** Log Started 2020-04-23T06:36:18Z ***********************
06:36:18:****************************** FAHClient ******************************
06:36:18:        Version: 7.6.9
06:36:18:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
06:36:18:      Copyright: 2020 foldingathome.org
06:36:18:       Homepage: https://foldingathome.org/
06:36:18:           Date: Apr 17 2020
06:36:18:           Time: 11:13:06
06:36:18:       Revision: 398c2b17fa535e0cc6c9d10856b2154c32771646
06:36:18:         Branch: master
06:36:18:       Compiler: Visual C++ 2008
06:36:18:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
06:36:18:       Platform: win32 10
06:36:18:           Bits: 32
06:36:18:           Mode: Release
06:36:18:         Config: C:\Users\PantherX-H\AppData\Roaming\FAHClient\config.xml
06:36:18:******************************** CBang ********************************
06:36:18:           Date: Apr 17 2020
06:36:18:           Time: 11:10:09
06:36:18:       Revision: 2fb0be7809c5e45287a122ca5fbc15b5ae859a3b
06:36:18:         Branch: master
06:36:18:       Compiler: Visual C++ 2008
06:36:18:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
06:36:18:       Platform: win32 10
06:36:18:           Bits: 32
06:36:18:           Mode: Release
06:36:18:******************************* System ********************************
06:36:18:            CPU: Intel(R) Core(TM) i7-6700K CPU @ 4.00GHz
06:36:18:         CPU ID: GenuineIntel Family 6 Model 94 Stepping 3
06:36:18:           CPUs: 8
06:36:18:         Memory: 31.94GiB
06:36:18:    Free Memory: 27.63GiB
06:36:18:        Threads: WINDOWS_THREADS
06:36:18:     OS Version: 6.2
06:36:18:    Has Battery: false
06:36:18:     On Battery: false
06:36:18:     UTC Offset: 12
06:36:18:            PID: 11504
06:36:18:            CWD: C:\Users\PantherX-H\AppData\Roaming\FAHClient
06:36:18:             OS: Windows 10 Enterprise
06:36:18:        OS Arch: AMD64
06:36:18:           GPUs: 1
06:36:18:          GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:8 GP102 [GeForce GTX 1080 Ti] 11380
06:36:18:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:10.2
06:36:18:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:442.19
06:36:18:  Win32 Service: false
06:36:18:******************************* libFAH ********************************
06:36:18:           Date: Apr 15 2020
06:36:18:           Time: 14:53:14
06:36:18:       Revision: 216968bc7025029c841ed6e36e81a03a316890d3
06:36:18:         Branch: master
06:36:18:       Compiler: Visual C++ 2008
06:36:18:        Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
06:36:18:       Platform: win32 10
06:36:18:           Bits: 32
06:36:18:           Mode: Release
06:36:18:***********************************************************************
06:36:19:<config>
06:36:19:  <!-- Folding Core -->
06:36:19:  <checkpoint v='30'/>
06:36:19:  <core-priority v='low'/>
06:36:19:
06:36:19:  <!-- Folding Slot Configuration -->
06:36:19:  <client-type v='advanced'/>
06:36:19:
06:36:19:  <!-- Network -->
06:36:19:  <proxy v=':8080'/>
06:36:19:
06:36:19:  <!-- Slot Control -->
06:36:19:  <pause-on-start v='true'/>
06:36:19:  <power v='full'/>
06:36:19:
06:36:19:  <!-- User Information -->
06:36:19:  <passkey v='*****'/>
06:36:19:  <team v='69411'/>
06:36:19:  <user v='PantherX'/>
06:36:19:
06:36:19:  <!-- Work Unit Control -->
06:36:19:  <next-unit-percentage v='100'/>
06:36:19:
06:36:19:  <!-- Folding Slots -->
06:36:19:  <slot id='0' type='CPU'>
06:36:19:    <cpus v='4'/>
06:36:19:  </slot>
06:36:19:  <slot id='1' type='GPU'/>
06:36:19:</config>
06:36:19:Trying to access database...
06:36:19:Successfully acquired database lock
06:36:19:Enabled folding slot 00: PAUSED cpu:4 (by user)
06:36:19:Enabled folding slot 01: PAUSED gpu:0:GP102 [GeForce GTX 1080 Ti] 11380 (by user)
06:37:15:FS01:Unpaused
06:37:15:WU00:FS01:Starting

If you require guidance, please have a look here: viewtopic.php?f=24&t=26036
User avatar
PantherX
Site Moderator
 
Posts: 6327
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Sat Apr 25, 2020 4:14 am

PantherX wrote:Since you're using FAHControl, please click the Refresh button and then copy the log from the top until after the configuration finishes... it should look something like this:

...

If you require guidance, please have a look here: viewtopic.php?f=24&t=26036

Code: Select all
*********************** Log Started 2020-04-25T03:06:43Z ***********************
03:06:43:****************************** FAHClient ******************************
03:06:43:        Version: 7.6.9
03:06:43:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
03:06:43:      Copyright: 2020 foldingathome.org
03:06:43:       Homepage: https://foldingathome.org/
03:06:43:           Date: Apr 17 2020
03:06:43:           Time: 18:11:26
03:06:43:       Revision: 398c2b17fa535e0cc6c9d10856b2154c32771646
03:06:43:         Branch: master
03:06:43:       Compiler: GNU 8.3.0
03:06:43:        Options: -std=c++11 -ffunction-sections -fdata-sections -O3
03:06:43:                 -funroll-loops -fno-pie
03:06:43:       Platform: linux2 4.19.0-5-amd64
03:06:43:           Bits: 64
03:06:43:           Mode: Release
03:06:43:           Args: --config /etc/foldingathome/config.xml --log
03:06:43:                 /var/log/foldingathome/log.txt --log-rotate-dir
03:06:43:                 /var/log/foldingathome
03:06:43:         Config: /etc/foldingathome/config.xml
03:06:43:******************************** CBang ********************************
03:06:43:           Date: Apr 17 2020
03:06:43:           Time: 18:10:13
03:06:43:       Revision: 2fb0be7809c5e45287a122ca5fbc15b5ae859a3b
03:06:43:         Branch: master
03:06:43:       Compiler: GNU 8.3.0
03:06:43:        Options: -std=c++11 -ffunction-sections -fdata-sections -O3
03:06:43:                 -funroll-loops -fno-pie -fPIC
03:06:43:       Platform: linux2 4.19.0-5-amd64
03:06:43:           Bits: 64
03:06:43:           Mode: Release
03:06:43:******************************* System ********************************
03:06:43:            CPU: Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz
03:06:43:         CPU ID: GenuineIntel Family 6 Model 94 Stepping 3
03:06:43:           CPUs: 4
03:06:43:         Memory: 15.57GiB
03:06:43:    Free Memory: 15.11GiB
03:06:43:        Threads: POSIX_THREADS
03:06:43:     OS Version: 5.6
03:06:43:    Has Battery: false
03:06:43:     On Battery: false
03:06:43:     UTC Offset: -4
03:06:43:            PID: 560
03:06:43:            CWD: /var/lib/private/fah
03:06:43:             OS: Linux 5.6.6-arch1-1 x86_64
03:06:43:        OS Arch: AMD64
03:06:43:           GPUs: 1
03:06:43:          GPU 0: Bus:1 Slot:0 Func:0 NVIDIA:7 GP106 [GeForce GTX 1060 6GB] 4372
03:06:43:  CUDA Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:6.1 Driver:10.2
03:06:43:OpenCL Device 0: Platform:0 Device:0 Bus:1 Slot:0 Compute:1.2 Driver:440.82
03:06:43:******************************* libFAH ********************************
03:06:43:           Date: Apr 15 2020
03:06:43:           Time: 21:43:24
03:06:43:       Revision: 216968bc7025029c841ed6e36e81a03a316890d3
03:06:43:         Branch: master
03:06:43:       Compiler: GNU 8.3.0
03:06:43:        Options: -std=c++11 -ffunction-sections -fdata-sections -O3
03:06:43:                 -funroll-loops -fno-pie
03:06:43:       Platform: linux2 4.19.0-5-amd64
03:06:43:           Bits: 64
03:06:43:           Mode: Release
03:06:43:***********************************************************************
03:06:43:<config>
03:06:43:  <!-- Folding Slot Configuration -->
03:06:43:  <extra-core-args v='-big'/>
03:06:43:
03:06:43:  <!-- Network -->
03:06:43:  <proxy v=':8080'/>
03:06:43:
03:06:43:  <!-- Slot Control -->
03:06:43:  <power v='full'/>
03:06:43:
03:06:43:  <!-- User Information -->
03:06:43:  <passkey v='*****'/>
03:06:43:  <team v='45032'/>
03:06:43:  <user v='Qwertystop'/>
03:06:43:
03:06:43:  <!-- Folding Slots -->
03:06:43:  <slot id='0' type='CPU'/>
03:06:43:  <slot id='1' type='GPU'>
03:06:43:    <gpu-index v='0'/>
03:06:43:  </slot>
03:06:43:</config>
03:06:43:Trying to access database...
03:06:43:Successfully acquired database lock
03:06:43:Enabled folding slot 00: READY cpu:3
03:06:43:Enabled folding slot 01: READY gpu:0:GP106 [GeForce GTX 1060 6GB] 4372


It started working again after a full reboot. Who knows whether that'll hold.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby JohnChodera » Sat Apr 25, 2020 4:53 am

@Qwertystop: The key error is here:
Code: Select all
23:50:18:WU02:FS01:0x22:ERROR:exception: There is no registered Platform called "OpenCL"

This means that the OpenCL drivers could not be loaded, so the core can't use your GPU.
We'll add some instrumentation to the next core release to print and bring back more useful information about why the core is unable to contact the OpenCL drivers.
Thanks!
User avatar
JohnChodera
Pande Group Member
 
Posts: 309
Joined: Fri Feb 22, 2013 10:59 pm

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby AndreasH » Sat Apr 25, 2020 10:33 am

JohnChodera wrote:@Qwertystop: The key error is here:
Code: Select all
23:50:18:WU02:FS01:0x22:ERROR:exception: There is no registered Platform called "OpenCL"

This means that the OpenCL drivers could not be loaded, so the core can't use your GPU.
We'll add some instrumentation to the next core release to print and bring back more useful information about why the core is unable to contact the OpenCL drivers.
Thanks!


Are you running FAHClient with the "--run-as" switch? Then this could also be a simple permission problem!
See viewtopic.php?f=80&t=34689 for a detailled analysis.

I don't know how your system is set up, so to check please do the following:

* Check the permissions on the /dev/nvidiactl device file
If the device file has read/write permissions for all (mode 0666) like this (example from a standard Ubuntu installation)
Code: Select all
demouser@ubuntutest:~$ ll /dev/nvidiactl
crw-rw-rw- 1 root root 195, 255 Apr 23 18:56 /dev/nvidiactl


then it should not be a permission problem and presumably everything I say below does not apply.

If on the other hand the device file has read/write permissions for owner and group only, like this (example from a standard OpenSUSE installation)
Code: Select all
andreas@ws1:~> ll /dev/nvidiactl
crw-rw---- 1 root video 195, 255 25. Apr 10:56 /dev/nvidiactl


then it might well be a permission problem.

* If you think it is a permission problem, you could do one of the following:

0) Add the user "fahclient" (assuming this is the user the FAHClient process is switching to) to the group which has read/write permissions on the device file (in the example above this would be group "video") and restart FAHClient.
This would be the best option, but it seems FAHClient does not correctly initialize the process' group identifiers so I would think this will currently not solve your problem. (See my postings in viewtopic.php?f=80&t=34689 for a detailled explanation)

1) Add the root user to the group which has read/write permissions on the device file (in the example above this would be group "video") and restart FAHClient

2) Add a special access control list to the device file allowing the user fahclient to read and write to it and restart FAHClient

3) Change the permissions of the device file to mode 0666 (world read/writable) and restart FAHClient (not recommended)

4) Do not use the "--run-is" option, but run the FAHClient as root (not recommended)

5) Wait for a new version of FAHClient with a fix for this problem

HTH

- andreas
AndreasH
 
Posts: 5
Joined: Wed Apr 22, 2020 10:59 am

Re: GTX 1060, "No compute devices matched", Arch Linux

Postby Qwertystop » Sun Apr 26, 2020 6:25 pm

/dev/nvidiactl has 0666 permissions, so that's not it. The issue has not recurred yet, and I have completed a few GPU work units.
Qwertystop
 
Posts: 7
Joined: Wed Apr 22, 2020 2:26 pm


Return to Problems with NVidia drivers

Who is online

Users browsing this forum: No registered users and 1 guest

cron