GPU Cores repeatedly failing (even after reinstall)

If you think it might be a driver problem, see viewforum.php?f=79

Moderators: Site Moderators, FAHC Science Team

Post Reply
Kevincav
Posts: 23
Joined: Wed Sep 14, 2016 3:12 am

GPU Cores repeatedly failing (even after reinstall)

Post by Kevincav »

I have a dual GPU setup which keeps failing on me. I've uninstalled and reinstalled. Nothing is overclocked. I'm kind of at a loss. It's weird, occasionally I see a random temperature control disabled, I don't really know how that applies / how to fix that. Anyone able to help me with this?

Logs

Code: Select all

*********************** Log Started 2017-01-21T01:07:43Z ***********************
01:07:43:************************* Folding@home Client *************************
01:07:43:      Website: http://folding.stanford.edu/
01:07:43:    Copyright: (c) 2009-2014 Stanford University
01:07:43:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
01:07:43:         Args: --open-web-control
01:07:43:       Config: C:/Users/Kevin/AppData/Roaming/FAHClient/config.xml
01:07:43:******************************** Build ********************************
01:07:43:      Version: 7.4.4
01:07:43:         Date: Mar 4 2014
01:07:43:         Time: 20:26:54
01:07:43:      SVN Rev: 4130
01:07:43:       Branch: fah/trunk/client
01:07:43:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
01:07:43:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
01:07:43:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
01:07:43:     Platform: win32 XP
01:07:43:         Bits: 32
01:07:43:         Mode: Release
01:07:43:******************************* System ********************************
01:07:43:          CPU: Intel(R) Xeon(R) CPU E5-2696 v4 @ 2.20GHz
01:07:43:       CPU ID: GenuineIntel Family 6 Model 79 Stepping 1
01:07:43:         CPUs: 32
01:07:43:       Memory: 255.89GiB
01:07:43:  Free Memory: 245.93GiB
01:07:43:      Threads: WINDOWS_THREADS
01:07:43:   OS Version: 6.2
01:07:43:  Has Battery: false
01:07:43:   On Battery: false
01:07:43:   UTC Offset: -8
01:07:43:          PID: 14296
01:07:43:          CWD: C:/Users/Kevin/AppData/Roaming/FAHClient
01:07:43:           OS: Windows 10 Pro
01:07:43:      OS Arch: AMD64
01:07:43:         GPUs: 2
01:07:43:        GPU 0: NVIDIA:5 GP102 [GeForce Titan X]
01:07:43:        GPU 1: NVIDIA:5 GP102 [GeForce Titan X]
01:07:43:         CUDA: 6.1
01:07:43:  CUDA Driver: 8000
01:07:43:Win32 Service: false
01:07:43:***********************************************************************
01:07:43:<config>
01:07:43:  <!-- Network -->
01:07:43:  <proxy v=':8080'/>
01:07:43:
01:07:43:  <!-- Slot Control -->
01:07:43:  <power v='FULL'/>
01:07:43:
01:07:43:  <!-- User Information -->
01:07:43:  <passkey v='********************************'/>
01:07:43:  <team v='111065'/>
01:07:43:  <user v='Kevincav'/>
01:07:43:
01:07:43:  <!-- Folding Slots -->
01:07:43:</config>
01:07:43:Trying to access database...
01:07:43:Successfully acquired database lock
01:07:43:Enabled folding slot 00: READY cpu:30
01:07:43:Enabled folding slot 01: READY gpu:0:GP102 [GeForce Titan X]
01:07:43:Enabled folding slot 02: READY gpu:1:GP102 [GeForce Titan X]
01:07:43:WU00:FS00:Connecting to 171.67.108.45:8080
01:07:43:WU01:FS01:Connecting to 171.67.108.45:80
01:07:43:WU02:FS02:Connecting to 171.67.108.45:80
01:07:44:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.45:8080': Empty work server assignment
01:07:44:WU01:FS01:Assigned to work server 140.163.4.245
01:07:44:WU00:FS00:Connecting to 171.64.65.35:80
01:07:44:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 140.163.4.245
01:07:44:WU01:FS01:Connecting to 140.163.4.245:8080
01:07:44:WU02:FS02:Assigned to work server 140.163.4.243
01:07:44:WU02:FS02:Requesting new work unit for slot 02: READY gpu:1:GP102 [GeForce Titan X] from 140.163.4.243
01:07:44:WU02:FS02:Connecting to 140.163.4.243:8080
01:07:44:WARNING:WU00:FS00:Failed to get assignment from '171.64.65.35:80': Empty work server assignment
01:07:44:ERROR:WU00:FS00:Exception: Could not get an assignment
01:07:44:WU01:FS01:Downloading 5.13MiB
01:07:44:WU02:FS02:Downloading 2.67MiB
01:07:44:WU00:FS00:Connecting to 171.67.108.45:8080
01:07:44:WARNING:WU00:FS00:Failed to get assignment from '171.67.108.45:8080': Empty work server assignment
01:07:44:WU00:FS00:Connecting to 171.64.65.35:80
01:07:44:WARNING:WU00:FS00:Failed to get assignment from '171.64.65.35:80': Empty work server assignment
01:07:44:ERROR:WU00:FS00:Exception: Could not get an assignment
01:07:46:WU01:FS01:Download complete
01:07:46:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:10495 run:28 clone:65 gen:87 core:0x21 unit:0x000000948ca304f556ba64d78db26cb7
01:07:46:WU01:FS01:Starting
01:07:46:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 01 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:07:46:WU01:FS01:Started FahCore on PID 6092
01:07:46:WU01:FS01:Core PID:17276
01:07:46:WU01:FS01:FahCore 0x21 started
01:07:46:16:127.0.0.1:New Web connection
01:07:47:WU02:FS02:Download complete
01:07:47:WU02:FS02:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:11707 run:125 clone:22 gen:19 core:0x21 unit:0x0000001a8ca304f35876a50e4201c983
01:07:47:WU02:FS02:Starting
01:07:47:WU02:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 02 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
01:07:47:WU02:FS02:Started FahCore on PID 6156
01:07:47:WU02:FS02:Core PID:732
01:07:47:WU02:FS02:FahCore 0x21 started
01:07:48:WU01:FS01:0x21:*********************** Log Started 2017-01-21T01:07:47Z ***********************
01:07:48:WU01:FS01:0x21:Project: 10495 (Run 28, Clone 65, Gen 87)
01:07:48:WU01:FS01:0x21:Unit: 0x000000948ca304f556ba64d78db26cb7
01:07:48:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:07:48:WU01:FS01:0x21:Machine: 1
01:07:48:WU01:FS01:0x21:Reading tar file core.xml
01:07:48:WU01:FS01:0x21:Reading tar file system.xml
01:07:48:WU01:FS01:0x21:Reading tar file integrator.xml
01:07:48:WU01:FS01:0x21:Reading tar file state.xml
01:07:49:WU02:FS02:0x21:*********************** Log Started 2017-01-21T01:07:49Z ***********************
01:07:49:WU02:FS02:0x21:Project: 11707 (Run 125, Clone 22, Gen 19)
01:07:49:WU02:FS02:0x21:Unit: 0x0000001a8ca304f35876a50e4201c983
01:07:49:WU02:FS02:0x21:CPU: 0x00000000000000000000000000000000
01:07:49:WU02:FS02:0x21:Machine: 2
01:07:49:WU02:FS02:0x21:Reading tar file core.xml
01:07:49:WU02:FS02:0x21:Reading tar file system.xml
01:07:49:WU01:FS01:0x21:Digital signatures verified
01:07:49:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:07:49:WU01:FS01:0x21:Version 0.0.17
01:07:49:WU02:FS02:0x21:Reading tar file integrator.xml
01:07:49:WU02:FS02:0x21:Reading tar file state.xml
01:07:50:WU02:FS02:0x21:Digital signatures verified
01:07:50:WU02:FS02:0x21:Folding@home GPU Core21 Folding@home Core
01:07:50:WU02:FS02:0x21:Version 0.0.17
01:07:56:WU02:FS02:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:07:56:WU02:FS02:0x21:Saving result file logfile_01.txt
01:07:56:WU02:FS02:0x21:Saving result file log.txt
01:07:56:WU02:FS02:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:08:00:WARNING:WU02:FS02:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:08:00:WU02:FS02:Sending unit results: id:02 state:SEND error:FAULTY project:11707 run:125 clone:22 gen:19 core:0x21 unit:0x0000001a8ca304f35876a50e4201c983
01:08:00:WU02:FS02:Uploading 2.48KiB to 140.163.4.243
01:08:00:WU02:FS02:Connecting to 140.163.4.243:8080
01:08:00:WU02:FS02:Upload complete
01:08:01:WU02:FS02:Server responded WORK_ACK (400)
01:08:01:WU02:FS02:Cleaning up
01:08:01:WU03:FS02:Connecting to 171.67.108.45:80
01:08:01:WU03:FS02:Assigned to work server 140.163.4.242
01:08:01:WU03:FS02:Requesting new work unit for slot 02: READY gpu:1:GP102 [GeForce Titan X] from 140.163.4.242
01:08:01:WU03:FS02:Connecting to 140.163.4.242:8080
01:08:01:WU03:FS02:Downloading 3.47MiB
01:08:02:WU01:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:08:02:WU01:FS01:0x21:Saving result file logfile_01.txt
01:08:02:WU01:FS01:0x21:Saving result file log.txt
01:08:02:WU01:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:08:03:WU03:FS02:Download complete
01:08:03:WU03:FS02:Received Unit: id:03 state:DOWNLOAD error:NO_ERROR project:11407 run:8 clone:37 gen:70 core:0x21 unit:0x000000738ca304f25686b2930d93746a
01:08:03:WU03:FS02:Starting
01:08:03:WU03:FS02:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 03 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 1 -gpu-vendor nvidia
01:08:03:WU03:FS02:Started FahCore on PID 8540
01:08:03:WU03:FS02:Core PID:6256
01:08:03:WU03:FS02:FahCore 0x21 started
01:08:05:WU03:FS02:0x21:*********************** Log Started 2017-01-21T01:08:05Z ***********************
01:08:05:WU03:FS02:0x21:Project: 11407 (Run 8, Clone 37, Gen 70)
01:08:05:WU03:FS02:0x21:Unit: 0x000000738ca304f25686b2930d93746a
01:08:05:WU03:FS02:0x21:CPU: 0x00000000000000000000000000000000
01:08:05:WU03:FS02:0x21:Machine: 2
01:08:05:WU03:FS02:0x21:Reading tar file core.xml
01:08:05:WU03:FS02:0x21:Reading tar file system.xml
01:08:06:WU03:FS02:0x21:Reading tar file integrator.xml
01:08:06:WU03:FS02:0x21:Reading tar file state.xml
01:08:06:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:08:06:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:10495 run:28 clone:65 gen:87 core:0x21 unit:0x000000948ca304f556ba64d78db26cb7
01:08:06:WU01:FS01:Uploading 2.48KiB to 140.163.4.245
01:08:06:WU01:FS01:Connecting to 140.163.4.245:8080
01:08:06:WU01:FS01:Upload complete
01:08:06:WU01:FS01:Server responded WORK_ACK (400)
01:08:06:WU01:FS01:Cleaning up
01:08:06:WU02:FS01:Connecting to 171.67.108.45:80
01:08:07:WU02:FS01:Assigned to work server 171.64.65.84
01:08:07:WU02:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 171.64.65.84
01:08:07:WU02:FS01:Connecting to 171.64.65.84:8080
01:08:07:WU03:FS02:0x21:Digital signatures verified
01:08:07:WU03:FS02:0x21:Folding@home GPU Core21 Folding@home Core
01:08:07:WU03:FS02:0x21:Version 0.0.17
01:08:07:WU02:FS01:Downloading 2.53MiB
01:08:08:Saving configuration to config.xml
01:08:08:<config>
01:08:08:  <!-- Network -->
01:08:08:  <proxy v=':8080'/>
01:08:08:
01:08:08:  <!-- Slot Control -->
01:08:08:  <power v='FULL'/>
01:08:08:
01:08:08:  <!-- User Information -->
01:08:08:  <passkey v='********************************'/>
01:08:08:  <team v='111065'/>
01:08:08:  <user v='Kevincav'/>
01:08:08:
01:08:08:  <!-- Folding Slots -->
01:08:08:  <slot id='1' type='GPU'/>
01:08:08:</config>
01:08:08:FS02:Shutting core down
01:08:08:WU00:FS00:Slot ID 0 no longer exists and not yet downloaded, dumping
01:08:08:WU00:FS00:Cleaning up
01:08:08:WU03:FS02:0x21:WARNING:Console control signal 1 on PID 6256
01:08:08:WU03:FS02:0x21:Exiting, please wait. . .
01:08:08:WU02:FS01:Download complete
01:08:08:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:9188 run:2 clone:10 gen:113 core:0x21 unit:0x000000beab40415457cb2b724748f101
01:08:08:WU02:FS01:Starting
01:08:08:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 02 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:08:08:WU02:FS01:Started FahCore on PID 17752
01:08:08:WU02:FS01:Core PID:8272
01:08:08:WU02:FS01:FahCore 0x21 started
01:08:10:WU02:FS01:0x21:*********************** Log Started 2017-01-21T01:08:09Z ***********************
01:08:10:WU02:FS01:0x21:Project: 9188 (Run 2, Clone 10, Gen 113)
01:08:10:WU02:FS01:0x21:Unit: 0x000000beab40415457cb2b724748f101
01:08:10:WU02:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:08:10:WU02:FS01:0x21:Machine: 1
01:08:10:WU02:FS01:0x21:Reading tar file core.xml
01:08:10:WU02:FS01:0x21:Reading tar file system.xml
01:08:10:WU02:FS01:0x21:Reading tar file integrator.xml
01:08:10:WU02:FS01:0x21:Reading tar file state.xml
01:08:10:WU02:FS01:0x21:Digital signatures verified
01:08:10:WU02:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:08:10:WU02:FS01:0x21:Version 0.0.17
01:08:14:WU03:FS02:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:08:14:WU03:FS02:0x21:Saving result file logfile_01.txt
01:08:14:WU03:FS02:0x21:Saving result file log.txt
01:08:14:WU03:FS02:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:08:17:WU03:FS02:FahCore returned: INTERRUPTED (102 = 0x66)
01:08:17:WARNING:WU03:Slot ID 2 no longer exists, migrating to FS01
01:08:19:WU02:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:08:19:WU02:FS01:0x21:Saving result file logfile_01.txt
01:08:19:WU02:FS01:0x21:Saving result file log.txt
01:08:19:WU02:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:08:22:WU03:FS01:Starting
01:08:22:WU03:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 03 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:08:22:WU03:FS01:Started FahCore on PID 18340
01:08:22:WU03:FS01:Core PID:17340
01:08:22:WU03:FS01:FahCore 0x21 started
01:08:22:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:08:22:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:9188 run:2 clone:10 gen:113 core:0x21 unit:0x000000beab40415457cb2b724748f101
01:08:22:WU02:FS01:Uploading 2.49KiB to 171.64.65.84
01:08:22:WU02:FS01:Connecting to 171.64.65.84:8080
01:08:22:WU02:FS01:Upload complete
01:08:22:WU02:FS01:Server responded WORK_ACK (400)
01:08:22:WU02:FS01:Cleaning up
01:08:24:WU03:FS01:0x21:*********************** Log Started 2017-01-21T01:08:23Z ***********************
01:08:24:WU03:FS01:0x21:Project: 11407 (Run 8, Clone 37, Gen 70)
01:08:24:WU03:FS01:0x21:Unit: 0x000000738ca304f25686b2930d93746a
01:08:24:WU03:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:08:24:WU03:FS01:0x21:Machine: 2
01:08:24:WU03:FS01:0x21:Reading tar file core.xml
01:08:24:WU03:FS01:0x21:Reading tar file system.xml
01:08:24:WU03:FS01:0x21:Reading tar file integrator.xml
01:08:24:WU03:FS01:0x21:Reading tar file state.xml
01:08:25:WU03:FS01:0x21:Digital signatures verified
01:08:25:WU03:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:08:25:WU03:FS01:0x21:Version 0.0.17
01:08:31:WU03:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:08:31:WU03:FS01:0x21:Saving result file logfile_01.txt
01:08:31:WU03:FS01:0x21:Saving result file log.txt
01:08:31:WU03:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:08:34:WARNING:WU03:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:08:34:WU03:FS01:Sending unit results: id:03 state:SEND error:FAULTY project:11407 run:8 clone:37 gen:70 core:0x21 unit:0x000000738ca304f25686b2930d93746a
01:08:34:WU03:FS01:Uploading 2.49KiB to 140.163.4.242
01:08:34:WU03:FS01:Connecting to 140.163.4.242:8080
01:08:34:WU03:FS01:Upload complete
01:08:35:WU03:FS01:Server responded WORK_ACK (400)
01:08:35:WU03:FS01:Cleaning up
01:08:35:WU00:FS01:Connecting to 171.67.108.45:80
01:08:35:WU00:FS01:Assigned to work server 140.163.4.242
01:08:35:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 140.163.4.242
01:08:35:WU00:FS01:Connecting to 140.163.4.242:8080
01:08:35:WU00:FS01:Downloading 4.22MiB
01:08:37:WU00:FS01:Download complete
01:08:38:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:11403 run:7 clone:49 gen:109 core:0x21 unit:0x000000a68ca304f255ed4f7fd87d06e1
01:08:38:WU00:FS01:Starting
01:08:38:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:08:38:WU00:FS01:Started FahCore on PID 18776
01:08:38:WU00:FS01:Core PID:14040
01:08:38:WU00:FS01:FahCore 0x21 started
01:08:39:WU00:FS01:0x21:*********************** Log Started 2017-01-21T01:08:39Z ***********************
01:08:39:WU00:FS01:0x21:Project: 11403 (Run 7, Clone 49, Gen 109)
01:08:39:WU00:FS01:0x21:Unit: 0x000000a68ca304f255ed4f7fd87d06e1
01:08:39:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:08:39:WU00:FS01:0x21:Machine: 1
01:08:39:WU00:FS01:0x21:Reading tar file core.xml
01:08:39:WU00:FS01:0x21:Reading tar file system.xml
01:08:40:WU00:FS01:0x21:Reading tar file integrator.xml
01:08:40:WU00:FS01:0x21:Reading tar file state.xml
01:08:41:WU00:FS01:0x21:Digital signatures verified
01:08:41:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:08:41:WU00:FS01:0x21:Version 0.0.17
01:08:44:Saving configuration to config.xml
01:08:44:<config>
01:08:44:  <!-- Network -->
01:08:44:  <proxy v=':8080'/>
01:08:44:
01:08:44:  <!-- Slot Control -->
01:08:44:  <power v='FULL'/>
01:08:44:
01:08:44:  <!-- User Information -->
01:08:44:  <passkey v='********************************'/>
01:08:44:  <team v='111065'/>
01:08:44:  <user v='Kevincav'/>
01:08:44:
01:08:44:  <!-- Folding Slots -->
01:08:44:  <slot id='1' type='GPU'/>
01:08:44:</config>
01:08:49:WU00:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:08:49:WU00:FS01:0x21:Saving result file logfile_01.txt
01:08:49:WU00:FS01:0x21:Saving result file log.txt
01:08:49:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:08:52:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:08:52:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:11403 run:7 clone:49 gen:109 core:0x21 unit:0x000000a68ca304f255ed4f7fd87d06e1
01:08:52:WU00:FS01:Uploading 2.48KiB to 140.163.4.242
01:08:52:WU00:FS01:Connecting to 140.163.4.242:8080
01:08:52:WU00:FS01:Upload complete
01:08:52:WU00:FS01:Server responded WORK_ACK (400)
01:08:52:WU00:FS01:Cleaning up
01:08:52:WU01:FS01:Connecting to 171.67.108.45:80
01:08:53:WU01:FS01:Assigned to work server 171.64.65.84
01:08:53:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 171.64.65.84
01:08:53:WU01:FS01:Connecting to 171.64.65.84:8080
01:08:53:WU01:FS01:Downloading 2.58MiB
01:08:54:WU01:FS01:Download complete
01:08:54:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:9189 run:0 clone:45 gen:188 core:0x21 unit:0x0000011fab40415457cb2ba3c6cbf554
01:08:54:WU01:FS01:Starting
01:08:54:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 01 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:08:54:WU01:FS01:Started FahCore on PID 16700
01:08:54:WU01:FS01:Core PID:9164
01:08:54:WU01:FS01:FahCore 0x21 started
01:08:55:WU01:FS01:0x21:*********************** Log Started 2017-01-21T01:08:55Z ***********************
01:08:55:WU01:FS01:0x21:Project: 9189 (Run 0, Clone 45, Gen 188)
01:08:55:WU01:FS01:0x21:Unit: 0x0000011fab40415457cb2ba3c6cbf554
01:08:55:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:08:55:WU01:FS01:0x21:Machine: 1
01:08:55:WU01:FS01:0x21:Reading tar file core.xml
01:08:55:WU01:FS01:0x21:Reading tar file system.xml
01:08:56:WU01:FS01:0x21:Reading tar file integrator.xml
01:08:56:WU01:FS01:0x21:Reading tar file state.xml
01:08:56:WU01:FS01:0x21:Digital signatures verified
01:08:56:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:08:56:WU01:FS01:0x21:Version 0.0.17
01:09:02:WU01:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:09:02:WU01:FS01:0x21:Saving result file logfile_01.txt
01:09:02:WU01:FS01:0x21:Saving result file log.txt
01:09:02:WU01:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:09:09:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:09:09:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:9189 run:0 clone:45 gen:188 core:0x21 unit:0x0000011fab40415457cb2ba3c6cbf554
01:09:09:WU01:FS01:Uploading 2.49KiB to 171.64.65.84
01:09:09:WU01:FS01:Connecting to 171.64.65.84:8080
01:09:10:WU01:FS01:Upload complete
01:09:10:WU01:FS01:Server responded WORK_ACK (400)
01:09:10:WU01:FS01:Cleaning up
01:09:10:WU00:FS01:Connecting to 171.67.108.45:80
01:09:10:WU00:FS01:Assigned to work server 140.163.4.243
01:09:11:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 140.163.4.243
01:09:11:WU00:FS01:Connecting to 140.163.4.243:8080
01:09:11:WU00:FS01:Downloading 2.67MiB
01:09:14:WU00:FS01:Download complete
01:09:14:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:11707 run:82 clone:19 gen:26 core:0x21 unit:0x000000218ca304f358702f728590d648
01:09:14:WU00:FS01:Starting
01:09:14:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:09:14:WU00:FS01:Started FahCore on PID 19440
01:09:14:WU00:FS01:Core PID:9412
01:09:14:WU00:FS01:FahCore 0x21 started
01:09:15:WU00:FS01:0x21:*********************** Log Started 2017-01-21T01:09:15Z ***********************
01:09:15:WU00:FS01:0x21:Project: 11707 (Run 82, Clone 19, Gen 26)
01:09:15:WU00:FS01:0x21:Unit: 0x000000218ca304f358702f728590d648
01:09:15:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:09:15:WU00:FS01:0x21:Machine: 1
01:09:15:WU00:FS01:0x21:Reading tar file core.xml
01:09:15:WU00:FS01:0x21:Reading tar file system.xml
01:09:16:WU00:FS01:0x21:Reading tar file integrator.xml
01:09:16:WU00:FS01:0x21:Reading tar file state.xml
01:09:16:WU00:FS01:0x21:Digital signatures verified
01:09:16:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:09:16:WU00:FS01:0x21:Version 0.0.17
01:09:23:WU00:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:09:23:WU00:FS01:0x21:Saving result file logfile_01.txt
01:09:23:WU00:FS01:0x21:Saving result file log.txt
01:09:23:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:09:26:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:09:26:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:11707 run:82 clone:19 gen:26 core:0x21 unit:0x000000218ca304f358702f728590d648
01:09:26:WU00:FS01:Uploading 2.48KiB to 140.163.4.243
01:09:26:WU00:FS01:Connecting to 140.163.4.243:8080
01:09:26:WU00:FS01:Upload complete
01:09:26:WU00:FS01:Server responded WORK_ACK (400)
01:09:26:WU00:FS01:Cleaning up
01:09:26:WU01:FS01:Connecting to 171.67.108.45:80
01:09:26:WU01:FS01:Assigned to work server 140.163.4.243
01:09:26:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 140.163.4.243
01:09:26:WU01:FS01:Connecting to 140.163.4.243:8080
01:09:27:ERROR:WU01:FS01:Exception: Server did not assign work unit
01:09:27:WU01:FS01:Connecting to 171.67.108.45:80
01:09:27:WU01:FS01:Assigned to work server 140.163.4.242
01:09:27:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 140.163.4.242
01:09:27:WU01:FS01:Connecting to 140.163.4.242:8080
01:09:28:WU01:FS01:Downloading 4.22MiB
01:09:30:WU01:FS01:Download complete
01:09:30:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:11403 run:0 clone:46 gen:412 core:0x21 unit:0x000002278ca304f255ed4f25b550567d
01:09:30:WU01:FS01:Starting
01:09:30:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 01 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:09:30:WU01:FS01:Started FahCore on PID 12948
01:09:30:WU01:FS01:Core PID:5880
01:09:30:WU01:FS01:FahCore 0x21 started
01:09:32:WU01:FS01:0x21:*********************** Log Started 2017-01-21T01:09:31Z ***********************
01:09:32:WU01:FS01:0x21:Project: 11403 (Run 0, Clone 46, Gen 412)
01:09:32:WU01:FS01:0x21:Unit: 0x000002278ca304f255ed4f25b550567d
01:09:32:WU01:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:09:32:WU01:FS01:0x21:Machine: 1
01:09:32:WU01:FS01:0x21:Reading tar file core.xml
01:09:32:WU01:FS01:0x21:Reading tar file system.xml
01:09:32:WU01:FS01:0x21:Reading tar file integrator.xml
01:09:32:WU01:FS01:0x21:Reading tar file state.xml
01:09:33:WU01:FS01:0x21:Digital signatures verified
01:09:33:WU01:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:09:33:WU01:FS01:0x21:Version 0.0.17
01:09:42:WU01:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:09:42:WU01:FS01:0x21:Saving result file logfile_01.txt
01:09:42:WU01:FS01:0x21:Saving result file log.txt
01:09:42:WU01:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:09:45:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:09:45:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:11403 run:0 clone:46 gen:412 core:0x21 unit:0x000002278ca304f255ed4f25b550567d
01:09:45:WU01:FS01:Uploading 2.49KiB to 140.163.4.242
01:09:45:WU01:FS01:Connecting to 140.163.4.242:8080
01:09:45:WU01:FS01:Upload complete
01:09:45:WU01:FS01:Server responded WORK_ACK (400)
01:09:45:WU01:FS01:Cleaning up
01:09:45:WU00:FS01:Connecting to 171.67.108.45:80
01:09:45:WU00:FS01:Assigned to work server 171.67.108.105
01:09:45:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 171.67.108.105
01:09:45:WU00:FS01:Connecting to 171.67.108.105:8080
01:09:46:WU00:FS01:Downloading 21.34MiB
01:09:47:WU00:FS01:Download complete
01:09:48:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9178 run:8 clone:1 gen:110 core:0x21 unit:0x000000c3ab436c6957b24c29f1c1e261
01:09:48:WU00:FS01:Starting
01:09:48:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:09:48:WU00:FS01:Started FahCore on PID 14444
01:09:48:WU00:FS01:Core PID:7324
01:09:48:WU00:FS01:FahCore 0x21 started
01:09:49:WU00:FS01:0x21:*********************** Log Started 2017-01-21T01:09:49Z ***********************
01:09:49:WU00:FS01:0x21:Project: 9178 (Run 8, Clone 1, Gen 110)
01:09:49:WU00:FS01:0x21:Unit: 0x000000c3ab436c6957b24c29f1c1e261
01:09:49:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:09:49:WU00:FS01:0x21:Machine: 1
01:09:49:WU00:FS01:0x21:Reading tar file core.xml
01:09:49:WU00:FS01:0x21:Reading tar file integrator.xml
01:09:49:WU00:FS01:0x21:Reading tar file state.xml
01:09:49:WU00:FS01:0x21:Reading tar file system.xml
01:09:50:WU00:FS01:0x21:Digital signatures verified
01:09:50:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:09:50:WU00:FS01:0x21:Version 0.0.17
01:09:58:WU00:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:09:58:WU00:FS01:0x21:Saving result file logfile_01.txt
01:09:58:WU00:FS01:0x21:Saving result file log.txt
01:09:58:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:10:02:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:10:02:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9178 run:8 clone:1 gen:110 core:0x21 unit:0x000000c3ab436c6957b24c29f1c1e261
01:10:02:WU00:FS01:Uploading 7.50KiB to 171.67.108.105
01:10:02:WU00:FS01:Connecting to 171.67.108.105:8080
01:10:02:WU00:FS01:Upload complete
01:10:02:WU00:FS01:Server responded WORK_ACK (400)
01:10:02:WU00:FS01:Cleaning up
01:10:02:WU00:FS01:Connecting to 171.67.108.45:80
01:10:03:WU00:FS01:Assigned to work server 171.67.108.105
01:10:03:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 171.67.108.105
01:10:03:WU00:FS01:Connecting to 171.67.108.105:8080
01:10:03:WU00:FS01:Downloading 20.01MiB
01:10:05:WU00:FS01:Download complete
01:10:06:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:9178 run:16 clone:15 gen:104 core:0x21 unit:0x000000c9ab436c6957b24c2af19e7d3c
01:10:06:WU00:FS01:Starting
01:10:06:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:10:06:WU00:FS01:Started FahCore on PID 6668
01:10:06:WU00:FS01:Core PID:5236
01:10:06:WU00:FS01:FahCore 0x21 started
01:10:07:WU00:FS01:0x21:*********************** Log Started 2017-01-21T01:10:07Z ***********************
01:10:07:WU00:FS01:0x21:Project: 9178 (Run 16, Clone 15, Gen 104)
01:10:07:WU00:FS01:0x21:Unit: 0x000000c9ab436c6957b24c2af19e7d3c
01:10:07:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:10:07:WU00:FS01:0x21:Machine: 1
01:10:07:WU00:FS01:0x21:Reading tar file core.xml
01:10:07:WU00:FS01:0x21:Reading tar file integrator.xml
01:10:07:WU00:FS01:0x21:Reading tar file state.xml
01:10:07:WU00:FS01:0x21:Reading tar file system.xml
01:10:08:WU00:FS01:0x21:Digital signatures verified
01:10:08:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:10:08:WU00:FS01:0x21:Version 0.0.17
01:10:16:WU00:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:10:17:WU00:FS01:0x21:Saving result file logfile_01.txt
01:10:17:WU00:FS01:0x21:Saving result file log.txt
01:10:17:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:10:19:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:10:19:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:9178 run:16 clone:15 gen:104 core:0x21 unit:0x000000c9ab436c6957b24c2af19e7d3c
01:10:19:WU00:FS01:Uploading 7.50KiB to 171.67.108.105
01:10:19:WU00:FS01:Connecting to 171.67.108.105:8080
01:10:19:WU00:FS01:Upload complete
01:10:19:WU00:FS01:Server responded WORK_ACK (400)
01:10:19:WU00:FS01:Cleaning up
01:10:20:WU00:FS01:Connecting to 171.67.108.45:80
01:10:20:WU00:FS01:Assigned to work server 140.163.4.244
01:10:20:WU00:FS01:Requesting new work unit for slot 01: READY gpu:0:GP102 [GeForce Titan X] from 140.163.4.244
01:10:20:WU00:FS01:Connecting to 140.163.4.244:8080
01:10:20:WU00:FS01:Downloading 2.77MiB
01:10:22:WU00:FS01:Download complete
01:10:23:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:13500 run:0 clone:682 gen:80 core:0x21 unit:0x000000818ca304f457a358b5db96a2a1
01:10:23:WU00:FS01:Starting
01:10:23:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/Kevin/AppData/Roaming/FAHClient/cores/fahwebx.stanford.edu/cores/Win32/AMD64/NVIDIA/Fermi/Core_21.fah/FahCore_21.exe -dir 00 -suffix 01 -version 704 -lifeline 14296 -checkpoint 15 -gpu 0 -gpu-vendor nvidia
01:10:23:WU00:FS01:Started FahCore on PID 7504
01:10:23:WU00:FS01:Core PID:18344
01:10:23:WU00:FS01:FahCore 0x21 started
01:10:24:WU00:FS01:0x21:*********************** Log Started 2017-01-21T01:10:24Z ***********************
01:10:24:WU00:FS01:0x21:Project: 13500 (Run 0, Clone 682, Gen 80)
01:10:24:WU00:FS01:0x21:Unit: 0x000000818ca304f457a358b5db96a2a1
01:10:24:WU00:FS01:0x21:CPU: 0x00000000000000000000000000000000
01:10:24:WU00:FS01:0x21:Machine: 1
01:10:24:WU00:FS01:0x21:Reading tar file core.xml
01:10:24:WU00:FS01:0x21:Reading tar file system.xml
01:10:25:WU00:FS01:0x21:Reading tar file integrator.xml
01:10:25:WU00:FS01:0x21:Reading tar file state.xml
01:10:25:WU00:FS01:0x21:Digital signatures verified
01:10:25:WU00:FS01:0x21:Folding@home GPU Core21 Folding@home Core
01:10:25:WU00:FS01:0x21:Version 0.0.17
01:10:33:WU00:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
01:10:33:WU00:FS01:0x21:Saving result file logfile_01.txt
01:10:33:WU00:FS01:0x21:Saving result file log.txt
01:10:33:WU00:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
01:10:34:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
01:10:34:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:13500 run:0 clone:682 gen:80 core:0x21 unit:0x000000818ca304f457a358b5db96a2a1
01:10:34:WU00:FS01:Uploading 2.52KiB to 140.163.4.244
01:10:34:WU00:FS01:Connecting to 140.163.4.244:8080
01:10:34:WU00:FS01:Upload complete
01:10:35:WU00:FS01:Server responded WORK_ACK (400)
01:10:35:WU00:FS01:Cleaning up
SteveWillis
Posts: 409
Joined: Fri Apr 15, 2016 12:42 am
Hardware configuration: PC 1:
Linux Mint 17.3
three gtx 1080 GPUs One on a powered header
Motherboard = [MB-AM3-AS-SB-990FXR2] qty 1 Asus Sabertooth 990FX(+59.99)
CPU = [CPU-AM3-FX-8320BR] qty 1 AMD FX 8320 Eight Core 3.5GHz(+41.99)

PC2:
Linux Mint 18
Open air case
Motherboard: ASUS Crosshair V Formula-Z AM3+ AMD 990FX SATA 6Gb/s USB 3.0 ATX AMD
AMD FD6300WMHKBOX FX-6300 6-Core Processor Black Edition with Cooler Master Hyper 212 EVO - CPU Cooler with 120mm PWM Fan
three gtx 1080,
one gtx 1080 TI on a powered header

Re: GPU Cores repeatedly failing (even after reinstall)

Post by SteveWillis »

Which Nvidia driver version?
Image

1080 and 1080TI GPUs on Linux Mint
Joe_H
Site Admin
Posts: 7857
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: GPU Cores repeatedly failing (even after reinstall)

Post by Joe_H »

Code: Select all

01:08:02:WU01:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
This error indicates you are probably using one of the version 375 or 376 drivers for your nVidia cards. There is an extensive topic here already warning against that, it was also posted on the PG blog.

The fix is to either roll back to an earlier driver, 373.06 is the last reported to work, or to download the hot fix driver that nVidia released, version 376.48. The hot fix driver works most of the time, but some have reported lower performance.

Finally, a setting using 30 CPU cores will rarely receive work. There is work generally for up to 24 cores, some projects will go higher.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Kevincav
Posts: 23
Joined: Wed Sep 14, 2016 3:12 am

Re: GPU Cores repeatedly failing (even after reinstall)

Post by Kevincav »

SteveWillis wrote:Which Nvidia driver version?
I think it was 376'ish. I updated to the recent one to test.
Kevincav
Posts: 23
Joined: Wed Sep 14, 2016 3:12 am

Re: GPU Cores repeatedly failing (even after reinstall)

Post by Kevincav »

Joe_H wrote:

Code: Select all

01:08:02:WU01:FS01:0x21:ERROR:exception: Error downloading array interactionCount: clEnqueueReadBuffer (-5)
This error indicates you are probably using one of the version 375 or 376 drivers for your nVidia cards. There is an extensive topic here already warning against that, it was also posted on the PG blog.

The fix is to either roll back to an earlier driver, 373.06 is the last reported to work, or to download the hot fix driver that nVidia released, version 376.48. The hot fix driver works most of the time, but some have reported lower performance.

Finally, a setting using 30 CPU cores will rarely receive work. There is work generally for up to 24 cores, some projects will go higher.
I just hot fixed the driver and it appears to be working now (from the 5 min it's been running). Thanks for the info, I'll post an update tomorrow.

Oh, for the 30 cores thing, yeah totally understand. 30 was the default slot when reinstalling it. I appreciate it.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: GPU Cores repeatedly failing (even after reinstall)

Post by bruce »

Joe_H wrote:Finally, a setting using 30 CPU cores will rarely receive work. There is work generally for up to 24 cores, some projects will go higher.
This issue will be resolved one way or another in the next version of FAHClient. (It's already fixed in the beta version 7.4.16)

Let's say that with the new version, you leave the setting at 30 and there's no work that can be assigned to 25,26,27,28,29, or 30. You will be assigned work using 24 of your 30 cores, giving you SOMETHING to work on, even if it doesn't use the full capabilities of your system. When that WU is completed, it will again go through the process of finding an available WU that uses as much of your system as possible.

Note: That does mean that you might go for long periods of time using 24 cores with the other 6 free. You'll still have the option of manually re-configuring your system to have two or more CPU slots, say one with 24 CPUs and another 6 ... or some other combination that maximizes the value of the assignments that you will be processing. Unfortunately it's impossible to predict the characteristics of whatever projects will be suspended or added in the future, though.
Kevincav
Posts: 23
Joined: Wed Sep 14, 2016 3:12 am

Re: GPU Cores repeatedly failing (even after reinstall)

Post by Kevincav »

So on 373.06, same issues as before. On 376.48 it was weird. One gpu failed slower and the other seemed to not fully fail at least for a day.
Leonardo
Posts: 261
Joined: Tue Dec 04, 2007 5:09 am
Hardware configuration: GPU slots on home-built, purpose-built PCs.
Location: Eagle River, Alaska

Re: GPU Cores repeatedly failing (even after reinstall)

Post by Leonardo »

What are the GPU core temperatures? Use freeware GPU-Z to check.

The symptoms you describe could indicate power deliver problems or overheating video card components. What is the power rating of your power supply unit? Which Nvidia card models are you using?

Perhaps, if not a hardware (overheating, insufficient power) problem, it could indeed still be a driver problem. You might want to try re-installing the Nvidia drivers. Ensure that you select "Clean Install" in the custom installation option.
Image
foldy
Posts: 2061
Joined: Sat Dec 01, 2012 3:43 pm
Hardware configuration: Folding@Home Client 7.6.13 (1 GPU slots)
Windows 7 64bit
Intel Core i5 2500k@4Ghz
Nvidia gtx 1080ti driver 441

Re: GPU Cores repeatedly failing (even after reinstall)

Post by foldy »

If your reinstall use latest nvidia 378.49 whql driver.
Post Reply