BAD_WORK_UNIT (114 = 0x72)

Driver issues associated with the Windows 10 roll-out

Moderators: Site Moderators, FAHC Science Team

Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

Just realised that there are a pair of nvlddmkm errors for each failure. The pair for a failure are below.

Code: Select all

Log Name:      System
Source:        nvlddmkm
Date:          18/05/2020 12:17:20
Event ID:      13
Task Category: None
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      DESKTOP-CDEFV44
Description:
The description for Event ID 13 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event: 

\Device\Video3
Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC

The message resource is present but the message was not found in the message table

Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="nvlddmkm" />
    <EventID Qualifiers="49322">13</EventID>
    <Level>2</Level>
    <Task>0</Task>
    <Keywords>0x80000000000000</Keywords>
    <TimeCreated SystemTime="2020-05-18T11:17:20.126326100Z" />
    <EventRecordID>6991</EventRecordID>
    <Channel>System</Channel>
    <Computer>DESKTOP-CDEFV44</Computer>
    <Security />
  </System>
  <EventData>
    <Data>\Device\Video3</Data>
    <Data>Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC</Data>
    <Binary>0000000002003000000000000D00AAC0000000000000000000000000000000000000000000000000</Binary>
  </EventData>
</Event>

Code: Select all

Log Name:      System
Source:        nvlddmkm
Date:          18/05/2020 12:17:20
Event ID:      13
Task Category: None
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      DESKTOP-CDEFV44
Description:
The description for Event ID 13 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event: 

\Device\Video3
Graphics Exception: ESR 0x51d7b0=0xa0005 0x51d7b4=0x20 0x51d7a8=0x4c1eb72 0x51d7ac=0x174

The message resource is present but the message was not found in the message table

Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="nvlddmkm" />
    <EventID Qualifiers="49322">13</EventID>
    <Level>2</Level>
    <Task>0</Task>
    <Keywords>0x80000000000000</Keywords>
    <TimeCreated SystemTime="2020-05-18T11:17:20.126326100Z" />
    <EventRecordID>6992</EventRecordID>
    <Channel>System</Channel>
    <Computer>DESKTOP-CDEFV44</Computer>
    <Security />
  </System>
  <EventData>
    <Data>\Device\Video3</Data>
    <Data>Graphics Exception: ESR 0x51d7b0=0xa0005 0x51d7b4=0x20 0x51d7a8=0x4c1eb72 0x51d7ac=0x174</Data>
    <Binary>0000000002003000000000000D00AAC0000000000000000000000000000000000000000000000000</Binary>
  </EventData>
</Event>
they appear in the order that they are listed here
_r2w_ben
Posts: 285
Joined: Wed Apr 23, 2008 3:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Post by _r2w_ben »

Code: Select all

\Device\Video3
Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC
PC probably means Program Counter in this context. NVIDIA has a CUDA tool to help developers diagnose this type of issue but FAH's GPU core is built using OpenCL. Can someone recommend a good GPU memory test?

What is the manufacturer and model number of your 2070 Super? Does it have any odd characteristics like different amounts of L1/L2 cache depending on which shader core is used? If you look back in Event Viewer for other instances, are they also "GPC 3, TPC 2, SM 1"?
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

This is the card https://www.zotac.com/pk/product/graphi ... -mini#spec
Not sure about characteristics but the link is to the spec page
As far as i can see the error pairing are identical each time
_r2w_ben
Posts: 285
Joined: Wed Apr 23, 2008 3:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Post by _r2w_ben »

Hoovoloo wrote:As far as i can see the error pairing are identical each time
In the details of the error, is the part I put in the code block the same each time?
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

yes it is in that error. On the otherone it is a long string, i couldn't see any difference but might have missed something
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

Well not sure if this is progress or just frustrating. Did a clean re-instal of windows and the OpenCl benchmark ran fine. tried FAH and it failed as before. Oddly if I delete the GPU slot, create a second CPU slot and then edit it to be a GPU slot, the first GPU job it tries runs ok but all subsequent ones fail. That has worked twice now. Go figure
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: BAD_WORK_UNIT (114 = 0x72)

Post by PantherX »

Please note that if the Slot is configured for CPU and has a WU, it can only run on the CPU. Changing it to the GPU Slot will cause it to work... the WU that it resumes can only run on the CPU. Maybe you can post the log file to see what's happening in the case where you manually changed the Slot type?
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

PantherX yes the job that it starts before the slot is edited to GPU runs on the CPU as you say but the next job it picks up runs on the GPU and you can see the load on the GPU is task manager confirming that is what is happening. Will have a look at the log and post in a few minutes if I can find that part. should be near the start of the log when filtered for Slot 1
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

Didn't work this time around so nothing useful in the go. I am struggling to think where to go next with this. Seems really odd that a relatively high spec machine can't get this to run, especially as it was running absolutely fine for a quite a while before these issues
_r2w_ben
Posts: 285
Joined: Wed Apr 23, 2008 3:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Post by _r2w_ben »

Code: Select all

<EventData>
    <Data>\Device\Video3</Data>
    <Data>Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC</Data>
    <Binary>0000000002003000000000000D00AAC0000000000000000000000000000000000000000000000000</Binary>
  </EventData>
You mentioned that the errors are logged in pairs. For the errors that have this section, is it always Warp Exception and Misaligned PC? Are the GPC, TPC, and SM numbers always the same?
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

Yes they are and as far as I can see the strings in the other error are also the same each time
_r2w_ben
Posts: 285
Joined: Wed Apr 23, 2008 3:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Post by _r2w_ben »

Have you run MemtestG80 or MemtestCL? I would suggest running a full test of both of them since there might be a small portion of your GPU that is faulty.
https://simtk.org/projects/memtest
https://www.majorgeeks.com/files/detail ... stg80.html
https://www.majorgeeks.com/files/details/memtestcl.html
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

both of them are throwing errors. Seems to vary between runs
Hoovoloo
Posts: 26
Joined: Mon May 11, 2020 9:01 am
Hardware configuration: CPU: AMD Ryzen 9 3900X 12-Core Processor, GPU:GeForce RTX 2070 Super
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Post by Hoovoloo »

Have raised it with the vendors support team
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: BAD_WORK_UNIT (114 = 0x72)

Post by bruce »

I would try underclocking.
Post Reply