Bad GPU work units (114 = 0x72)

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Bad work units.

Post by bruce »

~shadowlegend~ wrote:btw i only have 1 gpu
Well, that depends on what you choose to call a GPU.

Your Intel Core i5-4670K CPU has an on-chip GPU which is known as an Intel HD Graphics 4600 device. FAH doesn't support it, but somehow you have installed the drivers for it, and that is confusing FAHClient.
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

bruce wrote:
~shadowlegend~ wrote:btw i only have 1 gpu
Well, that depends on what you choose to call a GPU.

Your Intel Core i5-4670K CPU has an on-chip GPU which is known as an Intel HD Graphics 4600 device. FAH doesn't support it, but somehow you have installed the drivers for it, and that is confusing FAHClient.
im really bad at this stuff sorry.

idk what else to do.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Bad work units.

Post by bruce »

Please send us a copy of a failing WU. The next time you see something like this:

17:54:44:WU02:FS01:0x21:Completed 105000 out of 250000 steps (42%)
17:54:53:WU02:FS01:0x21:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
17:56:06:WU02:FS01:0x21:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
17:56:11:WU02:FS01:0x21:Bad State detected... attempting to resume from last good checkpoint. Is your system overclocked?
17:56:11:WU02:FS01:0x21:ERROR:114: Max Retries Reached
17:56:11:WU02:FS01:0x21:Saving result file logfile_01.txt
17:56:11:WU02:FS01:0x21:Saving result file badstate-0.xml
17:56:11:WU02:FS01:0x21:Saving result file badstate-1.xml
17:56:11:WU02:FS01:0x21:Saving result file badstate-2.xml
17:56:11:WU02:FS01:0x21:Saving result file log.txt
17:56:11:WU02:FS01:0x21:Folding@home Core Shutdown: BAD_WORK_UNIT
17:56:12:WARNING:WU02:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)

17:56:12:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:11728 run:0 clone:1749 gen:255 core:0x21 unit:0x0000013c8ca304e75ba032663305647e
17:56:12:WU02:FS01:Uploading 9.93MiB to 140.163.4.231
17:56:24:WU02:FS01:Server responded WORK_ACK (400)
17:56:24:WU02:FS01:Cleaning up


Note that the part I quoted says WU02 repeatedly.. Note that number. (For the failing WU described by those messages, N=2. Your sequence of messages may contain a different number N, but you are going to capture WU N before your client deletes it.)

In preparing for the capture of the failure data, create a new compressed file somewhere that's easy to find. Open the FAHData directory and then the "work" subdirectory inside of it.
When there is a failing WU, find the subdirectory named 0 or 1 or 2 ... whatever number matches the N for your WU.

During that failure sequence of messages, copy subdirectory N into the compressed file.

If the copying process is completed BEFORE you get a message about "cleaning up" Good. If not, try again on the next failure. Send me the compressed file.
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

bruce wrote:Send me the compressed file.
How?
SteveWillis
Posts: 409
Joined: Fri Apr 15, 2016 12:42 am
Hardware configuration: PC 1:
Linux Mint 17.3
three gtx 1080 GPUs One on a powered header
Motherboard = [MB-AM3-AS-SB-990FXR2] qty 1 Asus Sabertooth 990FX(+59.99)
CPU = [CPU-AM3-FX-8320BR] qty 1 AMD FX 8320 Eight Core 3.5GHz(+41.99)

PC2:
Linux Mint 18
Open air case
Motherboard: ASUS Crosshair V Formula-Z AM3+ AMD 990FX SATA 6Gb/s USB 3.0 ATX AMD
AMD FD6300WMHKBOX FX-6300 6-Core Processor Black Edition with Cooler Master Hyper 212 EVO - CPU Cooler with 120mm PWM Fan
three gtx 1080,
one gtx 1080 TI on a powered header

Re: Bad work units.

Post by SteveWillis »

You can get 15GB of free storage on google drive. Save it there, create a link and send it to him.
Image

1080 and 1080TI GPUs on Linux Mint
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

SteveWillis wrote:You can get 15GB of free storage on google drive. Save it there, create a link and send it to him.

Thank you!
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Bad work units.

Post by bruce »

The file you uploaded isn't useful but that's probably because FAHClient uses hidden files. According to your earlier log, your FAH Data Directory is C:\Users\simon\AppData\Roaming\FAHClient.

The file you uploaded is associated with a WU running on your CPU and it contains no errors. We need a file associated with a WU running on your GPU containing a snapshot of the error.

Try again. Open either the full directory in the first paragraph or one called %APPDATA%\FAHClient (Either method should give you the same result.) It should contain \work and two folders with names like \0 or \1 or \2 plus a couple of other files. The file you uploaded was inside a directory called ...\temp. (That's not FAH's Data Directory.)
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Bad work units.

Post by bruce »

By the way, your system might be having trouble dissipating the heat generated inside the case. Download GPU-Z or a similar utility and note the GPU temperature readings.
SteveWillis
Posts: 409
Joined: Fri Apr 15, 2016 12:42 am
Hardware configuration: PC 1:
Linux Mint 17.3
three gtx 1080 GPUs One on a powered header
Motherboard = [MB-AM3-AS-SB-990FXR2] qty 1 Asus Sabertooth 990FX(+59.99)
CPU = [CPU-AM3-FX-8320BR] qty 1 AMD FX 8320 Eight Core 3.5GHz(+41.99)

PC2:
Linux Mint 18
Open air case
Motherboard: ASUS Crosshair V Formula-Z AM3+ AMD 990FX SATA 6Gb/s USB 3.0 ATX AMD
AMD FD6300WMHKBOX FX-6300 6-Core Processor Black Edition with Cooler Master Hyper 212 EVO - CPU Cooler with 120mm PWM Fan
three gtx 1080,
one gtx 1080 TI on a powered header

Re: Bad work units.

Post by SteveWillis »

It is also possible that you have a bad gpu. I'm successfully folding on 15 GPUs of different manufacturers but also have one that does nothing but create bad work units, over and over as you describe. Good luck if that's it. I sent my bad one in twice to Gigabyte and both times they said there was nothing wrong with it. It won't fold but works fine to mine ETH, which I do just because I have it. If it would just die I could probably get it replaced. I'll never buy another Gigabyte GPU.
Image

1080 and 1080TI GPUs on Linux Mint
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

bruce wrote:By the way, your system might be having trouble dissipating the heat generated inside the case. Download GPU-Z or a similar utility and note the GPU temperature readings.
ok
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

SteveWillis wrote:It is also possible that you have a bad gpu. I'm successfully folding on 15 GPUs of different manufacturers but also have one that does nothing but create bad work units, over and over as you describe. Good luck if that's it. I sent my bad one in twice to Gigabyte and both times they said there was nothing wrong with it. It won't fold but works fine to mine ETH, which I do just because I have it. If it would just die I could probably get it replaced. I'll never buy another Gigabyte GPU.
15 GPUs :shock: i will upgrade my pc next year. but i just want to be able to fold now.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Bad work units.

Post by bruce »

~shadowlegend~ wrote:15 GPUs :shock: i will upgrade my pc next year. but i just want to be able to fold now.
Have you tried interchanging that GPU with another NVIDIA GK10x? Does the problem move?

Have you tried underclocking?
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Bad work units.

Post by bruce »

@ ~shadowlegend~

Please describe your hardware including speed details of the hardware that FAH uses.

Are your GPUs using risers?
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

bruce wrote:
~shadowlegend~ wrote:15 GPUs :shock: i will upgrade my pc next year. but i just want to be able to fold now.
Have you tried interchanging that GPU with another NVIDIA GK10x? Does the problem move?

Have you tried underclocking?
What's NVIDIA GK10x?
How do i underclock safely?
~shadowlegend~
Posts: 14
Joined: Mon Oct 15, 2018 8:10 am

Re: Bad work units.

Post by ~shadowlegend~ »

bruce wrote:@ ~shadowlegend~

Please describe your hardware including speed details of the hardware that FAH uses.
What do you mean?
Post Reply