
GTX 1080 - Temperature control disabled

Posted: Wed May 13, 2020 7:41 am
by KA1J
I'm trying to reduce the warnings & errors in my log; some of them I don't know what to do with. I'm not sure what to make of "Temperature control disabled".

When I run Precision XOC I see the temperature slider is set to a temperature target of 85C. There is a digital tab which attaches to a priority widget, and I have the tab attached to temperature; the other option is to switch the tab to the power target. I have set the dynamic clock adjustments to prioritize temperature. The temperature in the room is stable and the core temperature is 53C, which is not excessive.

I see no other options on the GTX 1080 control to address the temperature other than the fan, and I have moved that slider 4/5ths of the way up, so the fans are working hard & keeping the temp down. The last lines below say: "06:31:12:WU04:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900" and this shows under warnings and errors for the GPU. What can I do to remove the issue causing the warning?

I have an EVGA RTX 2080 Ti XC Ultra on order and will be running it in the same computer as this GTX 1080. I'm assuming it will be a dance to get both running well with each other and the computer, so I should probably get this 1080 running properly first & then go from there.

Any ideas on what I can do to enable the temperature control & stop this warning? Right now the sensor reads 52C.


06:30:51:WU04:FS01:0x22:*********************** Log Started 2020-05-13T06:30:50Z ***********************
06:30:51:WU04:FS01:0x22:*************************** Core22 Folding@home Core ***************************
06:30:51:WU04:FS01:0x22:       Type: 0x22
06:30:51:WU04:FS01:0x22:       Core: Core22
06:30:51:WU04:FS01:0x22:    Website: https://foldingathome.org/
06:30:51:WU04:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
06:30:51:WU04:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
06:30:51:WU04:FS01:0x22:             <rafal.wiewiora@choderalab.org>
06:30:51:WU04:FS01:0x22:       Args: -dir 04 -suffix 01 -version 706 -lifeline 8708 -checkpoint 3
06:30:51:WU04:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 1 -opencl-device 0 -cuda-device
06:30:51:WU04:FS01:0x22:             0 -gpu 0
06:30:51:WU04:FS01:0x22:     Config: <none>
06:30:51:WU04:FS01:0x22:************************************ Build *************************************
06:30:51:WU04:FS01:0x22:    Version: 0.0.5
06:30:51:WU04:FS01:0x22:       Date: Apr 22 2020
06:30:51:WU04:FS01:0x22:       Time: 04:42:59
06:30:51:WU04:FS01:0x22: Repository: Git
06:30:51:WU04:FS01:0x22:   Revision: 2d69202c898bd9bb3e093f51cd32bf411c2a0388
06:30:51:WU04:FS01:0x22:     Branch: HEAD
06:30:51:WU04:FS01:0x22:   Compiler: Visual C++ 2008
06:30:51:WU04:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
06:30:51:WU04:FS01:0x22:   Platform: win32 10
06:30:51:WU04:FS01:0x22:       Bits: 64
06:30:51:WU04:FS01:0x22:       Mode: Release
06:30:51:WU04:FS01:0x22:************************************ System ************************************
06:30:51:WU04:FS01:0x22:        CPU: Intel(R) Core(TM) i7-8700K CPU @ 3.70GHz
06:30:51:WU04:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 158 Stepping 10
06:30:51:WU04:FS01:0x22:       CPUs: 12
06:30:51:WU04:FS01:0x22:     Memory: 31.85GiB
06:30:51:WU04:FS01:0x22:Free Memory: 17.02GiB
06:30:51:WU04:FS01:0x22:    Threads: WINDOWS_THREADS
06:30:51:WU04:FS01:0x22: OS Version: 6.2
06:30:51:WU04:FS01:0x22:Has Battery: false
06:30:51:WU04:FS01:0x22: On Battery: false
06:30:51:WU04:FS01:0x22: UTC Offset: -4
06:30:51:WU04:FS01:0x22:        PID: 18736
06:30:51:WU04:FS01:0x22:        CWD: C:\Users\Zuul\AppData\Roaming\FAHClient\work
06:30:51:WU04:FS01:0x22:         OS: Windows 10 Enterprise
06:30:51:WU04:FS01:0x22:    OS Arch: AMD64
06:30:51:WU04:FS01:0x22:********************************************************************************
06:30:51:WU04:FS01:0x22:Project: 11745 (Run 0, Clone 7104, Gen 19)
06:30:51:WU04:FS01:0x22:Unit: 0x000000228ca304f15e6bc3b2bb37cedb
06:30:51:WU04:FS01:0x22:Reading tar file core.xml
06:30:51:WU04:FS01:0x22:Reading tar file integrator.xml
06:30:51:WU04:FS01:0x22:Reading tar file state.xml
06:30:53:WU04:FS01:0x22:Reading tar file system.xml
06:30:54:WU04:FS01:0x22:Digital signatures verified
06:30:55:WU04:FS01:0x22:Folding@home GPU Core22 Folding@home Core
06:30:55:WU04:FS01:0x22:Version 0.0.5
06:30:55:WU01:FS01:Upload complete
06:30:55:WU01:FS01:Server responded WORK_ACK (400)
06:30:55:WU01:FS01:Final credit estimate, 116549.00 points
06:30:55:WU01:FS01:Cleaning up
06:31:12:WU04:FS01:0x22:Completed 0 out of 1000000 steps (0%)
06:31:12:WU04:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
06:31:22:WU02:FS05:0xa7:Completed 212500 out of 250000 steps (85%)

Re: GTX 1080 - Temperature control disabled

Posted: Wed May 13, 2020 8:01 am
by HugoNotte
06:31:12:WU04:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
That line seems to be pretty much a standard line. It doesn't affect the temperature control of your hardware through your operating system, BIOS or separate programs. It might be a feature within GROMACS, the molecular dynamics package which the WUs are built on, so that on other platforms WUs could control the temperatures of CPUs and/or GPUs. In FAH WUs that is always disabled, hence that line in the log.

Re: GTX 1080 - Temperature control disabled

Posted: Wed May 13, 2020 8:04 am
by ajm
I've read here a few times already that this line was used during development and should actually have been removed. It is now useless and can safely be ignored.

Re: GTX 1080 - Temperature control disabled

Posted: Wed May 13, 2020 8:50 am
by PantherX
There's an issue logged for this to be removed: https://github.com/FoldingAtHome/fah-issues/issues/1431

It was an attempt at a new feature several years ago but it didn't work and now the cosmetic message still remains... hopefully, not for long.

Re: GTX 1080 - Temperature control disabled

Posted: Wed May 13, 2020 2:12 pm
by KA1J
Thank you all for the answer, it is an artifact then & of no concern.

That solved, I have another warning I see occasionally and would appreciate thoughts on: the unknown error. You can see it at the bottom of the log file. This is from last night & what I found after maybe 7 hours of running:

11:38:22:WARNING:WU03:FS00:FahCore returned an unknown error code which probably indicates that it crashed

Thoughts?


*********************** Log Started 2020-05-13T03:58:42Z ***********************
06:27:47:WARNING:WU04:FS01:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
06:28:09:WARNING:WU04:FS01:WorkServer connection failed on port 8080 trying 80
06:28:30:ERROR:WU04:FS01:Exception: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
06:28:52:WARNING:WU04:FS01:WorkServer connection failed on port 8080 trying 80
06:29:13:ERROR:WU04:FS01:Exception: Failed to connect to 128.252.203.10:80: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.
08:31:50:WARNING:WU01:FS01:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
08:31:50:WARNING:WU01:FS01:Failed to get assignment from 'assign2.foldingathome.org:80': No WUs available for this configuration
08:31:51:WARNING:WU01:FS01:Failed to get assignment from 'assign3.foldingathome.org:80': No WUs available for this configuration
08:31:51:WARNING:WU01:FS01:Failed to get assignment from 'assign4.foldingathome.org:80': No WUs available for this configuration
08:31:51:ERROR:WU01:FS01:Exception: Could not get an assignment
08:31:51:WARNING:WU01:FS01:Failed to get assignment from 'assign1.foldingathome.org:80': No WUs available for this configuration
08:31:52:WARNING:WU01:FS01:Failed to get assignment from 'assign2.foldingathome.org:80': No WUs available for this configuration
08:31:52:WARNING:WU01:FS01:Failed to get assignment from 'assign3.foldingathome.org:80': No WUs available for this configuration
08:43:50:WARNING:WU01:FS01:FahCore returned an unknown error code which probably indicates that it crashed
08:43:50:WARNING:WU01:FS01:FahCore returned: WU_STALLED (127 = 0x7f)
******************************* Date: 2020-05-13 *******************************
11:38:22:WARNING:WU03:FS00:FahCore returned an unknown error code which probably indicates that it crashed
11:38:22:WARNING:WU03:FS00:FahCore returned: WU_STALLED (127 = 0x7f)
11:38:22:WARNING:WU00:FS05:FahCore returned an unknown error code which probably indicates that it crashed
11:38:22:WARNING:WU00:FS05:FahCore returned: WU_STALLED (127 = 0x7f)
13:56:38:ERROR:WU02:FS00:Exception: 10002: Received short response, expected 512 bytes, got 0
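For anyone wanting to see which slot produces which warning, here is a minimal Python sketch that tallies WARNING/ERROR lines per folding slot. The line layout (time, level, WU, FS, message) is only inferred from the excerpt above, and `summarize` is a hypothetical helper name, not part of FAHClient.

```python
import re
from collections import Counter

# Assumed line shape, inferred from the log excerpt in this thread:
#   HH:MM:SS:LEVEL:WUnn:FSnn:message
LINE_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}):(?P<level>WARNING|ERROR):"
    r"(?P<wu>WU\d+):(?P<slot>FS\d+):(?P<msg>.*)$"
)

def summarize(lines):
    """Count WARNING/ERROR lines per (slot, level, message)."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m is None:
            continue  # banner lines and normal progress lines are skipped
        counts[(m["slot"], m["level"], m["msg"])] += 1
    return counts

# Sample lines copied from the log above.
sample = [
    "08:43:50:WARNING:WU01:FS01:FahCore returned: WU_STALLED (127 = 0x7f)",
    "11:38:22:WARNING:WU03:FS00:FahCore returned: WU_STALLED (127 = 0x7f)",
    "11:38:22:WARNING:WU00:FS05:FahCore returned: WU_STALLED (127 = 0x7f)",
    "13:56:38:ERROR:WU02:FS00:Exception: 10002: Received short response, expected 512 bytes, got 0",
]

for key, n in sorted(summarize(sample).items()):
    print(n, *key)
```

Grouping by slot makes it easier to tell whether the WU_STALLED returns follow one device or show up on every slot.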

Re: GTX 1080 - Temperature control disabled

Posted: Wed May 13, 2020 3:00 pm
by Rel25917
Random fact, if you have a single nvidia card and set the tmax and twait settings you can still turn it on, kinda useless except as a last resort though.

<extra-core-args v='-tmax=65 -twait=5000'/>

14:44:39:WU00:FS01:0x22:Completed 0 out of 1000000 steps (0%)
14:44:39:WU00:FS01:0x22:Core22 Folding@home Core: single GPU Temperature Control enabled, tmax: 65 twait: 5000
14:44:56:WU00:FS01:0x22:Pausing the core... cutoff reached!
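The condition in the "Temperature control disabled" message can be written out as a small Python check. This is only a sketch of what the log line states; the function names are hypothetical, "single Nvidia GPU" is assumed to mean exactly one, and the units of tmax and twait are not given anywhere in this thread.

```python
def parse_core_args(arg_string):
    """Parse '-name=value' tokens into a dict of ints (hypothetical helper)."""
    args = {}
    for token in arg_string.split():
        name, sep, value = token.lstrip("-").partition("=")
        if sep:  # ignore tokens that are not of the form name=value
            args[name] = int(value)
    return args

def meets_requirements(nvidia_gpu_count, tmax, twait):
    """Mirror the requirements quoted in the log message:
    single Nvidia GPU, tmax < 110 and twait >= 900."""
    return nvidia_gpu_count == 1 and tmax < 110 and twait >= 900

# The values from the extra-core-args line above.
args = parse_core_args("-tmax=65 -twait=5000")
print(args, meets_requirements(1, args["tmax"], args["twait"]))
```

With a second Nvidia card in the same machine, the first condition in the message would no longer hold, whatever tmax and twait are set to.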

Re: GTX 1080 - Temperature control disabled

Posted: Thu May 14, 2020 3:41 am
by PantherX
Rel25917 wrote:Random fact, if you have a single nvidia card and set the tmax and twait settings you can still turn it on, kinda useless except as a last resort though.

<extra-core-args v='-tmax=65 -twait=5000'/>

14:44:39:WU00:FS01:0x22:Completed 0 out of 1000000 steps (0%)
14:44:39:WU00:FS01:0x22:Core22 Folding@home Core: single GPU Temperature Control enabled, tmax: 65 twait: 5000
14:44:56:WU00:FS01:0x22:Pausing the core... cutoff reached!
That has the potential to induce thermal stress on the GPU, which was one of the reasons it was discontinued (the other being that it didn't work as effectively as originally planned).