FAULTY project: 11749

Moderators: Site Moderators, FAHC Science Team

Post Reply
Jertzuu
Posts: 31
Joined: Wed Mar 25, 2020 6:25 pm
Hardware configuration: Ryzen 5 2600 @ 4,1GHz
G.Skill Trident Z RGB 3200mHz CL14 32Gb
Asus Prime X470-PRO
Zotac AMP! Extreme 1080 Ti
Samsung Evo 970 250Gb NVMe M.2
Samsung Evo 860 500Gb SSD
WD Black 1Tb 7200RPM HDD
Location: Ulvila, FInland

FAULTY project: 11749

Post by Jertzuu »

This weird GPU project bug keeps harassing me, any fix?

Code: Select all

16:47:47:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:NO_ERROR project:11749 run:0 clone:7872 gen:19 core:0x22 unit:0x0000001c8ca304e75e6bb93a5175b6b5
16:47:47:WU02:FS01:Starting
16:47:47:WU02:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\Users\jeret\AppData\Roaming\FAHClient\cores/cores.foldingathome.org/v7/win/64bit/Core_22.fah/FahCore_22.exe -dir 02 -suffix 01 -version 705 -lifeline 12988 -checkpoint 10 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
16:47:47:WU02:FS01:Started FahCore on PID 10732
16:47:47:WU02:FS01:Core PID:14684
16:47:47:WU02:FS01:FahCore 0x22 started
16:47:48:WU02:FS01:0x22:*********************** Log Started 2020-03-26T16:47:47Z ***********************
16:47:48:WU02:FS01:0x22:*************************** Core22 Folding@home Core ***************************
16:47:48:WU02:FS01:0x22:       Type: 0x22
16:47:48:WU02:FS01:0x22:       Core: Core22
16:47:48:WU02:FS01:0x22:    Website: https://foldingathome.org/
16:47:48:WU02:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org
16:47:48:WU02:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
16:47:48:WU02:FS01:0x22:             <rafal.wiewiora@choderalab.org>
16:47:48:WU02:FS01:0x22:       Args: -dir 02 -suffix 01 -version 705 -lifeline 10732 -checkpoint 10
16:47:48:WU02:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
16:47:48:WU02:FS01:0x22:             0 -gpu 0
16:47:48:WU02:FS01:0x22:     Config: <none>
16:47:48:WU02:FS01:0x22:************************************ Build *************************************
16:47:48:WU02:FS01:0x22:    Version: 0.0.2
16:47:48:WU02:FS01:0x22:       Date: Dec 6 2019
16:47:48:WU02:FS01:0x22:       Time: 21:30:31
16:47:48:WU02:FS01:0x22: Repository: Git
16:47:48:WU02:FS01:0x22:   Revision: abeb39247cc72df5af0f63723edafadb23d5dfbe
16:47:48:WU02:FS01:0x22:     Branch: HEAD
16:47:48:WU02:FS01:0x22:   Compiler: Visual C++ 2008
16:47:48:WU02:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /Ox /MT
16:47:48:WU02:FS01:0x22:   Platform: win32 10
16:47:48:WU02:FS01:0x22:       Bits: 64
16:47:48:WU02:FS01:0x22:       Mode: Release
16:47:48:WU02:FS01:0x22:************************************ System ************************************
16:47:48:WU02:FS01:0x22:        CPU: AMD Ryzen 5 2600 Six-Core Processor
16:47:48:WU02:FS01:0x22:     CPU ID: AuthenticAMD Family 23 Model 8 Stepping 2
16:47:48:WU02:FS01:0x22:       CPUs: 12
16:47:48:WU02:FS01:0x22:     Memory: 31.92GiB
16:47:48:WU02:FS01:0x22:Free Memory: 26.97GiB
16:47:48:WU02:FS01:0x22:    Threads: WINDOWS_THREADS
16:47:48:WU02:FS01:0x22: OS Version: 6.2
16:47:48:WU02:FS01:0x22:Has Battery: false
16:47:48:WU02:FS01:0x22: On Battery: false
16:47:48:WU02:FS01:0x22: UTC Offset: 2
16:47:48:WU02:FS01:0x22:        PID: 14684
16:47:48:WU02:FS01:0x22:        CWD: C:\Users\jeret\AppData\Roaming\FAHClient\work
16:47:48:WU02:FS01:0x22:         OS: Windows 10 Pro
16:47:48:WU02:FS01:0x22:    OS Arch: AMD64
16:47:48:WU02:FS01:0x22:********************************************************************************
16:47:48:WU02:FS01:0x22:Project: 11749 (Run 0, Clone 7872, Gen 19)
16:47:48:WU02:FS01:0x22:Unit: 0x0000001c8ca304e75e6bb93a5175b6b5
16:47:48:WU02:FS01:0x22:Reading tar file core.xml
16:47:48:WU02:FS01:0x22:Reading tar file integrator.xml
16:47:48:WU02:FS01:0x22:Reading tar file state.xml
16:47:48:WU02:FS01:0x22:Reading tar file system.xml
16:47:49:WU02:FS01:0x22:Digital signatures verified
16:47:49:WU02:FS01:0x22:Folding@home GPU Core22 Folding@home Core
16:47:49:WU02:FS01:0x22:Version 0.0.2
16:47:57:WU02:FS01:0x22:Completed 0 out of 2000000 steps (0%)
16:47:57:WU02:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
16:48:03:WU02:FS01:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
16:48:03:WU02:FS01:0x22:Following exception occured: Particle coordinate is nan
16:48:09:WU02:FS01:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
16:48:09:WU02:FS01:0x22:Following exception occured: Particle coordinate is nan
16:48:15:WU02:FS01:0x22:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
16:48:15:WU02:FS01:0x22:Following exception occured: Particle coordinate is nan
16:48:15:WU02:FS01:0x22:ERROR:114: Max Retries Reached
16:48:15:WU02:FS01:0x22:Saving result file ..\logfile_01.txt
16:48:15:WU02:FS01:0x22:Saving result file badstate-0.xml
16:48:16:WU01:FS00:Connecting to 65.254.110.245:8080
16:48:16:WU02:FS01:0x22:Saving result file badstate-1.xml
16:48:17:WU02:FS01:0x22:Saving result file badstate-2.xml
16:48:19:WU02:FS01:0x22:Saving result file checkpt.crc
16:48:19:WU02:FS01:0x22:Saving result file science.log
16:48:19:WU02:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
16:48:19:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
16:48:19:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:11749 run:0 clone:7872 gen:19 core:0x22 unit:0x0000001c8ca304e75e6bb93a5175b6b5
16:48:19:WU02:FS01:Uploading 32.88KiB to 140.163.4.231
16:49:59:WU02:FS01:Upload complete
16:49:59:WU02:FS01:Server responded WORK_ACK (400)
16:49:59:WU02:FS01:Cleaning up
Image

Ryzen 5 2600 @ 4,1GHz
Zotac AMP! Extreme 1080 Ti
toTOW
Site Moderator
Posts: 6309
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France
Contact:

Re: FAULTY project: 11749

Post by toTOW »

No other report for this WU yet.
Image

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
MrFrizzy
Posts: 123
Joined: Fri Feb 14, 2020 4:48 am

Re: FAULTY project: 11749

Post by MrFrizzy »

I would try lowering your GPU/Memory clocks, increasing your fan speeds, or if you have enough thermal headroom, increasing the voltage on your GPU. Even if you are on stock, out of the box settings, I would still recommend tweaking things as I have seen this message come up even on stock settings in the past.
S1: AMD R5 3600 & Sapphire RX 5700 XT Reference @2.1GHz under water
S2: Intel Xeon E5-2620v3 & MSI GTX 1650

RX 5700 XT Project & PPD Tracking Spreadsheet

Image
Jertzuu
Posts: 31
Joined: Wed Mar 25, 2020 6:25 pm
Hardware configuration: Ryzen 5 2600 @ 4,1GHz
G.Skill Trident Z RGB 3200mHz CL14 32Gb
Asus Prime X470-PRO
Zotac AMP! Extreme 1080 Ti
Samsung Evo 970 250Gb NVMe M.2
Samsung Evo 860 500Gb SSD
WD Black 1Tb 7200RPM HDD
Location: Ulvila, FInland

Re: FAULTY project: 11749

Post by Jertzuu »

MrFrizzy wrote:I would try lowering your GPU/Memory clocks, increasing your fan speeds, or if you have enough thermal headroom, increasing the voltage on your GPU. Even if you are on stock, out of the box settings, I would still recommend tweaking things as I have seen this message come up even on stock settings in the past.
Thanks for the tip. Lowered clocks and now I'm just waiting on a project to try it out

*Edit* Reinstalled the client and made sure all previous data was deleted from my PC. So far so good, and seems to be working fine so far
Image

Ryzen 5 2600 @ 4,1GHz
Zotac AMP! Extreme 1080 Ti
Manfred.Knick
Posts: 36
Joined: Wed Mar 25, 2020 10:21 am
Hardware configuration: Multiple XEON + GTX
Location: Germany

Re: FAULTY project: 11749

Post by Manfred.Knick »

+1: again

06:00:40:WU02:FS01:0x22:Project: 11749 (Run 0, Clone 6603, Gen 5) <-------------------------------- P R C G
...
07:12:11:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:11749 run:0 clone:6603 gen:5 core:0x22 unit:0x000000128ca304e75e6bb7fd572b20ad
07:12:11:WU02:FS01:Uploading 12.57MiB to 140.163.4.231
07:12:11:WU02:FS01:Connecting to 140.163.4.231:8080
...
07:12:11:WU02:FS01:Sending unit results: id:02 state:SEND error:NO_ERROR project:11749 run:0 clone:6603 gen:5 core:0x22 unit:0x000000128ca304e75e6bb7fd572b20ad
...
07:16:11:WU02:FS01:Upload complete
07:16:11:WU02:FS01:Server responded WORK_QUIT (404) <----------------------------------------------- !
07:16:11:WARNING:WU02:FS01:Server did not like results, dumping <----------------------------------- !
07:16:11:WU02:FS01:Cleaning up
Last edited by Manfred.Knick on Fri Apr 03, 2020 8:06 am, edited 1 time in total.
anandhanju
Posts: 526
Joined: Mon Dec 03, 2007 4:33 am
Location: Australia

Re: FAULTY project: 11749

Post by anandhanju »

The previous WU was successfully completed by someone else. I think you might need to revisit those tweaks.
Manfred.Knick
Posts: 36
Joined: Wed Mar 25, 2020 10:21 am
Hardware configuration: Multiple XEON + GTX
Location: Germany

Re: FAULTY project: 11749

Post by Manfred.Knick »

anandhanju wrote: ... revisit those tweaks ...
? sorrry - which tweaks ?
Neil-B
Posts: 2027
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: FAULTY project: 11749

Post by Neil-B »

Believe mix up ... responder may have thought you were the original poster on this thread
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)
Manfred.Knick
Posts: 36
Joined: Wed Mar 25, 2020 10:21 am
Hardware configuration: Multiple XEON + GTX
Location: Germany

Re: FAULTY project: 11749

Post by Manfred.Knick »

Neil-B wrote:Believe mix up
Right, I see.
Question remains:
anandhanju wrote:The previous WU was successfully completed by someone else.
Why was this WU "double"-assigned ?
Neil-B
Posts: 2027
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: FAULTY project: 11749

Post by Neil-B »

A WU will usually be reissued to another folder under certain circumstances .. if not returned before timeout, if returned faulty (iirc), if it has been returned to a CS but not made its way back to the WS before timeout (possibly??), and I have a suspicion that under periods of high loads when assignments where overloaded there may have been some extra scenarios .. without looking into a specific case it is hard to be more precise .. under normal running a WU is given out once and when returned (hopefully well within timeout) it is then used to create the next gen of that WU … The way points are allocated in this circumstance is "logical" I just don't recall what it is, Sorry :(
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)
Manfred.Knick
Posts: 36
Joined: Wed Mar 25, 2020 10:21 am
Hardware configuration: Multiple XEON + GTX
Location: Germany

Re: FAULTY project: 11749

Post by Manfred.Knick »

@ Neil: Thanks for your hints!
treckin
Posts: 27
Joined: Mon Mar 23, 2020 7:51 am

Re: FAULTY project: 11749

Post by treckin »

I think it could be the server, it’s been failing to upload WUs for me and others if you poke the “issues with specific servers” and even another thread in this sub
tessa
Posts: 7
Joined: Sun Mar 15, 2020 8:36 am

Re: FAULTY project: 11749

Post by tessa »

This WU is also faulty for me:
viewtopic.php?f=19&t=32289
Post Reply