RX5700XT not folding (Linux)

It seems that a lot of GPU problems revolve around specific versions of drivers. Though AMD has their own support structure, you can often learn from information reported by others who fold.

Moderators: Site Moderators, FAHC Science Team

gunnarre
Posts: 567
Joined: Sun May 24, 2020 7:23 pm
Location: Norway

Re: RX5700XT not folding (Linux)

Post by gunnarre »

If you run clinfo twice in a row, does it cause a Segmentation Fault crash? No need to post the output again, just checking if you're getting a weird bug that I've seen once before on an older AMD card on Linux.
Image
Online: GTX 1660 Super, GTX 1080, GTX 1050 Ti 4G OC, RX580 + occasional CPU folding in the cold.
Offline: Radeon HD 7770, GTX 960, GTX 950
tehowe
Posts: 3
Joined: Sat Nov 28, 2020 10:54 pm

Re: RX5700XT not folding (Linux)

Post by tehowe »

I'm having the same problem here. Ubuntu 20.04, AMD 5700 XT GPU, I figured out by consulting these forums I had to manually edit the config.xml file since fahcontrol is broken on the latest Ubuntu LTS release or it will just keep running on CPU. This is what I'm using now for config.xml, it's what gets generated after getting OpenCL installed and using the config template and forcing slot 0 to GPU

Code: Select all

<config>
  <!-- Client Control -->
  <fold-anon v='true'/>

  <!-- Folding Slot Configuration -->
  <cause v='COVID_19'/>

  <!-- Slot Control -->
  <power v='MEDIUM'/>

  <!-- User Information -->
  <user v='itmemyusername'/>

  <!-- Folding Slots -->
  <slot id='0' type='GPU'>
    <pci-bus v='3'/>
    <pci-slot v='0'/>
  </slot>
</config>
Checking the log, it has started a core instance on this apparently working OpenCL

Code: Select all

23:30:16:OpenCL Device 0: Platform:0 Device:0 Bus:3 Slot:0 Compute:2.0 Driver:3188.4
but just stops at OpenCL configuration

Code: Select all

22:49:26:WU00:FS00:0x22:There are 3 platforms available.
22:49:26:WU00:FS00:0x22:Platform 0: Reference
22:49:26:WU00:FS00:0x22:Platform 1: CPU
22:49:26:WU00:FS00:0x22:Platform 2: OpenCL
22:49:26:WU00:FS00:0x22:  opencl-device 0 specified
22:49:34:WU00:FS00:0x22:Attempting to create OpenCL context:
22:49:34:WU00:FS00:0x22:  Configuring platform OpenCL
I've tried installing only the proprietary headless legacy and rocr OpenCL drivers from AMD's website alongside the package ocl-icd-opencl-dev and the opensource AMD drivers the system comes with.

I've tried ripping everything out back to the open source drivers and then installing the full AMD proprietary driver stack with sudo ./amdgpu-pro-install --opencl=rocr

In top a FahCore_22 pid sits around 100% activity but the FAHClient web interface just sits there at 0.0% doing no work
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: RX5700XT not folding (Linux)

Post by bruce »

The purpose of the FAHClient is to manage downloads and uploads. It SHOULD sit there doing nothing most of the time. It's waiting for you to finish the current WU so it fetch a new assignment and return the completed one. When there is work to be done, a FAHCore_xx is invoked to do the actual work.
tehowe
Posts: 3
Joined: Sat Nov 28, 2020 10:54 pm

Re: RX5700XT not folding (Linux)

Post by tehowe »

bruce wrote:The purpose of the FAHClient is to manage downloads and uploads. It SHOULD sit there doing nothing most of the time. It's waiting for you to finish the current WU so it fetch a new assignment and return the completed one. When there is work to be done, a FAHCore_xx is invoked to do the actual work.
No work is being done though. That's the point. I wasn't saying I see 0.0% CPU activity in top for fahclient, it's that the bar in the web application for Folding at Home client sits at 0.0% progress (and yes there is a core running)
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: RX5700XT not folding (Linux)

Post by bruce »

What I do not see is a WU being downloaded and a FAHCore being invoked. Those steps require FAHClient to have access to the internet. Do you have a firewall that's blocking internet access? It should be given the same permission set that a browser has although the folks that distribute firewalls automatically enable known browsers but not FAHClient. You have to configure an exception for FAHClient.
tehowe
Posts: 3
Joined: Sat Nov 28, 2020 10:54 pm

Re: RX5700XT not folding (Linux)

Post by tehowe »

bruce wrote:What I do not see is a WU being downloaded and a FAHCore being invoked. Those steps require FAHClient to have access to the internet. Do you have a firewall that's blocking internet access? It should be given the same permission set that a browser has although the folks that distribute firewalls automatically enable known browsers but not FAHClient. You have to configure an exception for FAHClient.
Hello it is downloading, I just excluded that from the log snippets above since everything looked fine up until the point the logs stop and the web app progress bar fails to move. All the WU00 prefixes in the log seem to indicate it has fetched a work unit? Here's the full thing, along with me reloading the web app and exiting at the end

Code: Select all

*********************** Log Started 2020-11-28T23:30:16Z ***********************
23:30:16:******************************* libFAH ********************************
23:30:16:           Date: Oct 20 2020
23:30:16:           Time: 20:36:39
23:30:16:       Revision: 5ca109d295a6245e2a2f590b3d0085ad5e567aeb
23:30:16:         Branch: master
23:30:16:       Compiler: GNU 8.3.0
23:30:16:        Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
23:30:16:                 -fdata-sections -O3 -funroll-loops -fno-pie
23:30:16:       Platform: linux2 5.8.0-1-amd64
23:30:16:           Bits: 64
23:30:16:           Mode: Release
23:30:16:****************************** FAHClient ******************************
23:30:16:        Version: 7.6.21
23:30:16:         Author: Joseph Coffland <joseph@cauldrondevelopment.com>
23:30:16:      Copyright: 2020 foldingathome.org
23:30:16:       Homepage: https://foldingathome.org/
23:30:16:           Date: Oct 20 2020
23:30:16:           Time: 20:39:00
23:30:16:       Revision: 6efbf0e138e22d3963e6a291f78dcb9c6422a278
23:30:16:         Branch: master
23:30:16:       Compiler: GNU 8.3.0
23:30:16:        Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
23:30:16:                 -fdata-sections -O3 -funroll-loops -fno-pie
23:30:16:       Platform: linux2 5.8.0-1-amd64
23:30:16:           Bits: 64
23:30:16:           Mode: Release
23:30:16:           Args: --child /etc/fahclient/config.xml --run-as fahclient
23:30:16:                 --pid-file=/var/run/fahclient.pid --daemon
23:30:16:         Config: /etc/fahclient/config.xml
23:30:16:******************************** CBang ********************************
23:30:16:           Date: Oct 20 2020
23:30:16:           Time: 18:37:59
23:30:16:       Revision: 7e4ce85225d7eaeb775e87c31740181ca603de60
23:30:16:         Branch: master
23:30:16:       Compiler: GNU 8.3.0
23:30:16:        Options: -faligned-new -std=c++11 -fsigned-char -ffunction-sections
23:30:16:                 -fdata-sections -O3 -funroll-loops -fno-pie -fPIC
23:30:16:       Platform: linux2 5.8.0-1-amd64
23:30:16:           Bits: 64
23:30:16:           Mode: Release
23:30:16:******************************* System ********************************
23:30:16:            CPU: AMD FX(tm)-6300 Six-Core Processor
23:30:16:         CPU ID: AuthenticAMD Family 21 Model 2 Stepping 0
23:30:16:           CPUs: 6
23:30:16:         Memory: 31.27GiB
23:30:16:    Free Memory: 18.25GiB
23:30:16:        Threads: POSIX_THREADS
23:30:16:     OS Version: 5.4
23:30:16:    Has Battery: false
23:30:16:     On Battery: false
23:30:16:     UTC Offset: -5
23:30:16:            PID: 360329
23:30:16:            CWD: /var/lib/fahclient
23:30:16:             OS: Linux 5.4.0-54-generic x86_64
23:30:16:        OS Arch: AMD64
23:30:16:           GPUs: 1
23:30:16:          GPU 0: Bus:3 Slot:0 Func:0 AMD:6 Navi 10 [Radeon RX 5600 OEM/5600
23:30:16:                 XT/5700/5700 XT]
23:30:16:           CUDA: Not detected: Failed to open dynamic library 'libcuda.so':
23:30:16:                 libcuda.so: cannot open shared object file: No such file or
23:30:16:                 directory
23:30:16:OpenCL Device 0: Platform:0 Device:0 Bus:3 Slot:0 Compute:2.0 Driver:3188.4
23:30:16:***********************************************************************
23:30:16:<config>
23:30:16:  <!-- Client Control -->
23:30:16:  <fold-anon v='true'/>
23:30:16:
23:30:16:  <!-- Folding Slot Configuration -->
23:30:16:  <cause v='COVID_19'/>
23:30:16:
23:30:16:  <!-- Slot Control -->
23:30:16:  <power v='MEDIUM'/>
23:30:16:
23:30:16:  <!-- User Information -->
23:30:16:  <user v='z3r0fox'/>
23:30:16:
23:30:16:  <!-- Folding Slots -->
23:30:16:  <slot id='0' type='GPU'>
23:30:16:    <pci-bus v='3'/>
23:30:16:    <pci-slot v='0'/>
23:30:16:  </slot>
23:30:16:</config>
23:30:16:Trying to access database...
23:30:16:Successfully acquired database lock
23:30:16:FS00:Initialized folding slot 00: gpu:3:0 Navi 10 [Radeon RX 5600 OEM/5600 XT/5700/5700 XT]
23:30:16:WU00:FS00:Starting
23:30:16:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit/22-0.0.13/Core_22.fah/FahCore_22 -dir 00 -suffix 01 -version 706 -lifeline 360329 -checkpoint 15 -opencl-platform 0 -opencl-device 0 -gpu-vendor amd -gpu 0 -gpu-usage 100
23:30:16:WU00:FS00:Started FahCore on PID 360360
23:30:16:WU00:FS00:Core PID:360368
23:30:16:WU00:FS00:FahCore 0x22 started
23:30:16:WU00:FS00:0x22:*********************** Log Started 2020-11-28T23:30:16Z ***********************
23:30:16:WU00:FS00:0x22:*************************** Core22 Folding@home Core ***************************
23:30:16:WU00:FS00:0x22:       Core: Core22
23:30:16:WU00:FS00:0x22:       Type: 0x22
23:30:16:WU00:FS00:0x22:    Version: 0.0.13
23:30:16:WU00:FS00:0x22:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
23:30:16:WU00:FS00:0x22:  Copyright: 2020 foldingathome.org
23:30:16:WU00:FS00:0x22:   Homepage: https://foldingathome.org/
23:30:16:WU00:FS00:0x22:       Date: Sep 19 2020
23:30:16:WU00:FS00:0x22:       Time: 01:10:35
23:30:16:WU00:FS00:0x22:   Revision: 571cf95de6de2c592c7c3ed48fcfb2e33e9ea7d3
23:30:16:WU00:FS00:0x22:     Branch: core22-0.0.13
23:30:16:WU00:FS00:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
23:30:16:WU00:FS00:0x22:    Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections -O3
23:30:16:WU00:FS00:0x22:             -funroll-loops -DOPENMM_GIT_HASH="\"189320d0\""
23:30:16:WU00:FS00:0x22:   Platform: linux2 4.19.76-linuxkit
23:30:16:WU00:FS00:0x22:       Bits: 64
23:30:16:WU00:FS00:0x22:       Mode: Release
23:30:16:WU00:FS00:0x22:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
23:30:16:WU00:FS00:0x22:             <peastman@stanford.edu>
23:30:16:WU00:FS00:0x22:       Args: -dir 00 -suffix 01 -version 706 -lifeline 360360 -checkpoint 15
23:30:16:WU00:FS00:0x22:             -opencl-platform 0 -opencl-device 0 -gpu-vendor amd -gpu 0
23:30:16:WU00:FS00:0x22:             -gpu-usage 100
23:30:16:WU00:FS00:0x22:************************************ libFAH ************************************
23:30:16:WU00:FS00:0x22:       Date: Sep 15 2020
23:30:16:WU00:FS00:0x22:       Time: 05:14:43
23:30:16:WU00:FS00:0x22:   Revision: 44301ed97b996b63fe736bb8073f22209cb2b603
23:30:16:WU00:FS00:0x22:     Branch: HEAD
23:30:16:WU00:FS00:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
23:30:16:WU00:FS00:0x22:    Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections -O3
23:30:16:WU00:FS00:0x22:             -funroll-loops
23:30:16:WU00:FS00:0x22:   Platform: linux2 4.19.76-linuxkit
23:30:16:WU00:FS00:0x22:       Bits: 64
23:30:16:WU00:FS00:0x22:       Mode: Release
23:30:16:WU00:FS00:0x22:************************************ CBang *************************************
23:30:16:WU00:FS00:0x22:       Date: Sep 15 2020
23:30:16:WU00:FS00:0x22:       Time: 05:11:04
23:30:16:WU00:FS00:0x22:   Revision: 33fcfc2b3ed2195a423606a264718e31e6b3903f
23:30:16:WU00:FS00:0x22:     Branch: HEAD
23:30:16:WU00:FS00:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
23:30:16:WU00:FS00:0x22:    Options: -std=c++11 -fsigned-char -ffunction-sections -fdata-sections -O3
23:30:16:WU00:FS00:0x22:             -funroll-loops -fPIC
23:30:16:WU00:FS00:0x22:   Platform: linux2 4.19.76-linuxkit
23:30:16:WU00:FS00:0x22:       Bits: 64
23:30:16:WU00:FS00:0x22:       Mode: Release
23:30:16:WU00:FS00:0x22:************************************ System ************************************
23:30:16:WU00:FS00:0x22:        CPU: AMD FX(tm)-6300 Six-Core Processor
23:30:16:WU00:FS00:0x22:     CPU ID: AuthenticAMD Family 21 Model 2 Stepping 0
23:30:16:WU00:FS00:0x22:       CPUs: 6
23:30:16:WU00:FS00:0x22:     Memory: 31.27GiB
23:30:16:WU00:FS00:0x22:Free Memory: 18.24GiB
23:30:16:WU00:FS00:0x22:    Threads: POSIX_THREADS
23:30:16:WU00:FS00:0x22: OS Version: 5.4
23:30:16:WU00:FS00:0x22:Has Battery: false
23:30:16:WU00:FS00:0x22: On Battery: false
23:30:16:WU00:FS00:0x22: UTC Offset: -5
23:30:16:WU00:FS00:0x22:        PID: 360368
23:30:16:WU00:FS00:0x22:        CWD: /var/lib/fahclient/work
23:30:16:WU00:FS00:0x22:************************************ OpenMM ************************************
23:30:16:WU00:FS00:0x22:   Revision: 189320d0
23:30:16:WU00:FS00:0x22:********************************************************************************
23:30:16:WU00:FS00:0x22:Project: 13428 (Run 6909, Clone 2, Gen 2)
23:30:16:WU00:FS00:0x22:Unit: 0x0000000212bc7d9a000000001afd0002
23:30:16:WU00:FS00:0x22:Digital signatures verified
23:30:16:WU00:FS00:0x22:Folding@home GPU Core22 Folding@home Core
23:30:16:WU00:FS00:0x22:Version 0.0.13
23:30:16:WU00:FS00:0x22:  Checkpoint write interval: 50000 steps (5%) [20 total]
23:30:16:WU00:FS00:0x22:  JSON viewer frame write interval: 10000 steps (1%) [100 total]
23:30:16:WU00:FS00:0x22:  XTC frame write interval: 250000 steps (25%) [4 total]
23:30:16:WU00:FS00:0x22:  Global context and integrator variables write interval: 25000 steps (2.5%) [40 total]
23:30:16:WU00:FS00:0x22:There are 3 platforms available.
23:30:16:WU00:FS00:0x22:Platform 0: Reference
23:30:16:WU00:FS00:0x22:Platform 1: CPU
23:30:16:WU00:FS00:0x22:Platform 2: OpenCL
23:30:16:WU00:FS00:0x22:  opencl-device 0 specified
23:30:25:WU00:FS00:0x22:Attempting to create OpenCL context:
23:30:25:WU00:FS00:0x22:  Configuring platform OpenCL
23:30:34:WARNING:9:127.0.0.1:404 HTTP NOT FOUND /css/images/ui-bg_highlight-soft_75_cccccc_1x100.png
23:30:34:WARNING:13:127.0.0.1:404 HTTP NOT FOUND /css/images/ui-bg_glass_55_fbf9ee_1x400.png
23:30:34:WARNING:14:127.0.0.1:404 HTTP NOT FOUND /css/images/ui-bg_glass_95_fef1ec_1x400.png
23:30:34:WARNING:16:127.0.0.1:404 HTTP NOT FOUND /css/images/ui-icons_222222_256x240.png
23:30:34:WARNING:18:127.0.0.1:404 HTTP NOT FOUND /css/images/ui-icons_cd0a0a_256x240.png
23:30:34:32:127.0.0.1:New Web session
23:38:21:Caught signal SIGTERM(15) on PID 360329
23:38:21:Exiting, please wait. . .
23:38:21:Caught signal SIGINT(2) on PID 360329
23:38:21:WARNING:Next signal will force exit
23:38:22:FS00:Shutting core down
23:38:27:WU00:FS00:0x22:Caught signal SIGTERM(15) on PID 360368
23:38:27:WU00:FS00:0x22:Exiting, please wait. . .
23:38:28:WU00:FS00:0x22:Caught signal SIGTERM(15) on PID 360368
23:38:28:WU00:FS00:0x22:WARNING:Next signal will force exit
23:38:29:Clean exit
I believe this is a previously downloaded unit it is trying to work on

Code: Select all

23:30:16:WU00:FS00:0x22:Unit: 0x0000000212bc7d9a000000001afd0002
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: RX5700XT not folding (Linux)

Post by bruce »

tehowe wrote:
I believe this is a previously downloaded unit it is trying to work on

Code: Select all

23:30:16:WU00:FS00:0x22:Unit: 0x0000000212bc7d9a000000001afd0002
True, but I don't know how to interpret that code. We refer to the WU as

Code: Select all

WU00:FS00:0x22:Project: 13428 (Run 6909, Clone 2, Gen 2)
or with those four integers for PRCG.

I'm not sure about the WARNING messages about /css/images/* I've never seen them before.
Post Reply