BAD_WORK_UNIT (114 = 0x72)

Driver issues associated with the Windows 10 roll-out

Moderators: Site Moderators, FAHC Science Team

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Mon May 18, 2020 12:41 pm

Just realised that there are a pair of nvlddmkm errors for each failure. The pair for a failure are below.
Code: Select all
Log Name:      System
Source:        nvlddmkm
Date:          18/05/2020 12:17:20
Event ID:      13
Task Category: None
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      DESKTOP-CDEFV44
Description:
The description for Event ID 13 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

\Device\Video3
Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC

The message resource is present but the message was not found in the message table

Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="nvlddmkm" />
    <EventID Qualifiers="49322">13</EventID>
    <Level>2</Level>
    <Task>0</Task>
    <Keywords>0x80000000000000</Keywords>
    <TimeCreated SystemTime="2020-05-18T11:17:20.126326100Z" />
    <EventRecordID>6991</EventRecordID>
    <Channel>System</Channel>
    <Computer>DESKTOP-CDEFV44</Computer>
    <Security />
  </System>
  <EventData>
    <Data>\Device\Video3</Data>
    <Data>Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC</Data>
    <Binary>0000000002003000000000000D00AAC0000000000000000000000000000000000000000000000000</Binary>
  </EventData>
</Event>

Code: Select all
Log Name:      System
Source:        nvlddmkm
Date:          18/05/2020 12:17:20
Event ID:      13
Task Category: None
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      DESKTOP-CDEFV44
Description:
The description for Event ID 13 from source nvlddmkm cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

\Device\Video3
Graphics Exception: ESR 0x51d7b0=0xa0005 0x51d7b4=0x20 0x51d7a8=0x4c1eb72 0x51d7ac=0x174

The message resource is present but the message was not found in the message table

Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="nvlddmkm" />
    <EventID Qualifiers="49322">13</EventID>
    <Level>2</Level>
    <Task>0</Task>
    <Keywords>0x80000000000000</Keywords>
    <TimeCreated SystemTime="2020-05-18T11:17:20.126326100Z" />
    <EventRecordID>6992</EventRecordID>
    <Channel>System</Channel>
    <Computer>DESKTOP-CDEFV44</Computer>
    <Security />
  </System>
  <EventData>
    <Data>\Device\Video3</Data>
    <Data>Graphics Exception: ESR 0x51d7b0=0xa0005 0x51d7b4=0x20 0x51d7a8=0x4c1eb72 0x51d7ac=0x174</Data>
    <Binary>0000000002003000000000000D00AAC0000000000000000000000000000000000000000000000000</Binary>
  </EventData>
</Event>

they appear in the order that they are listed here
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby _r2w_ben » Mon May 18, 2020 2:08 pm

Code: Select all
\Device\Video3
Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC

PC probably means Program Counter in this context. NVIDIA has a CUDA tool to help developers diagnose this type of issue but FAH's GPU core is built using OpenCL. Can someone recommend a good GPU memory test?

What is the manufacturer and model number of your 2070 Super? Does it have any odd characteristics like different amounts of L1/L2 cache depending on which shader core is used? If you look back in Event Viewer for other instances, are they also "GPC 3, TPC 2, SM 1"?
_r2w_ben
 
Posts: 281
Joined: Wed Apr 23, 2008 4:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Mon May 18, 2020 3:04 pm

This is the card https://www.zotac.com/pk/product/graphi ... -mini#spec
Not sure about characteristics but the link is to the spec page
As far as i can see the error pairing are identical each time
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby _r2w_ben » Mon May 18, 2020 6:52 pm

Hoovoloo wrote:As far as i can see the error pairing are identical each time

In the details of the error, is the part I put in the code block the same each time?
_r2w_ben
 
Posts: 281
Joined: Wed Apr 23, 2008 4:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Mon May 18, 2020 7:46 pm

yes it is in that error. On the otherone it is a long string, i couldn't see any difference but might have missed something
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Fri May 22, 2020 10:53 am

Well not sure if this is progress or just frustrating. Did a clean re-instal of windows and the OpenCl benchmark ran fine. tried FAH and it failed as before. Oddly if I delete the GPU slot, create a second CPU slot and then edit it to be a GPU slot, the first GPU job it tries runs ok but all subsequent ones fail. That has worked twice now. Go figure
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby PantherX » Fri May 22, 2020 7:59 pm

Please note that if the Slot is configured for CPU and has a WU, it can only run on the CPU. Changing it to the GPU Slot will cause it to work... the WU that it resumes can only run on the CPU. Maybe you can post the log file to see what's happening in the case where you manually changed the Slot type?
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
User avatar
PantherX
Site Moderator
 
Posts: 6765
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Fri May 22, 2020 10:02 pm

PantherX yes the job that it starts before the slot is edited to GPU runs on the CPU as you say but the next job it picks up runs on the GPU and you can see the load on the GPU is task manager confirming that is what is happening. Will have a look at the log and post in a few minutes if I can find that part. should be near the start of the log when filtered for Slot 1
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Sat May 23, 2020 9:47 am

Didn't work this time around so nothing useful in the go. I am struggling to think where to go next with this. Seems really odd that a relatively high spec machine can't get this to run, especially as it was running absolutely fine for a quite a while before these issues
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby _r2w_ben » Sat May 23, 2020 1:37 pm

Code: Select all
<EventData>
    <Data>\Device\Video3</Data>
    <Data>Graphics SM Warp Exception on (GPC 3, TPC 2, SM 1): Misaligned PC</Data>
    <Binary>0000000002003000000000000D00AAC0000000000000000000000000000000000000000000000000</Binary>
  </EventData>

You mentioned that the errors are logged in pairs. For the errors that have this section, is it always Warp Exception and Misaligned PC? Are the GPC, TPC, and SM numbers always the same?
_r2w_ben
 
Posts: 281
Joined: Wed Apr 23, 2008 4:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Sat May 23, 2020 1:53 pm

Yes they are and as far as I can see the strings in the other error are also the same each time
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby _r2w_ben » Sat May 23, 2020 2:19 pm

Have you run MemtestG80 or MemtestCL? I would suggest running a full test of both of them since there might be a small portion of your GPU that is faulty.
https://simtk.org/projects/memtest
https://www.majorgeeks.com/files/detail ... stg80.html
https://www.majorgeeks.com/files/details/memtestcl.html
_r2w_ben
 
Posts: 281
Joined: Wed Apr 23, 2008 4:11 pm

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Sat May 23, 2020 2:47 pm

both of them are throwing errors. Seems to vary between runs
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby Hoovoloo » Sat May 23, 2020 4:03 pm

Have raised it with the vendors support team
User avatar
Hoovoloo
 
Posts: 26
Joined: Mon May 11, 2020 10:01 am
Location: England

Re: BAD_WORK_UNIT (114 = 0x72)

Postby bruce » Sat May 23, 2020 4:06 pm

I would try underclocking.
bruce
 
Posts: 20019
Joined: Thu Nov 29, 2007 11:13 pm
Location: So. Cal.

PreviousNext

Return to Windows 10 + NVidia

Who is online

Users browsing this forum: No registered users and 1 guest

cron