Grandpa_01 wrote:I can tell you how to fix this, but you are probably not going to be very happy with the solution.
This is not a core21 problem. It is an Nvidia problem that affects the upper-end Maxwells, mainly GTX 980s and some of the 970s. It may also affect 980 Tis, but I do not know, since I do not have a 980 Ti to test. I have a feeling Nvidia knows about it, since they took steps to reduce the problem: when you run compute software on the Maxwells, the card defaults to the P2 state, which has the same core clock but a reduced memory speed, lowered from 7000 MHz to 6000 MHz. No other generation of Nvidia GPUs does this that I know of.
Anyway, I started getting a lot of bad states on my GPUs, in both Windows and Linux, with this series of WUs and some others, and I started testing things out with the help of a few others here in the forum. One of the things I did was lower the P2-state memory speed a little using Nvidia Inspector. The first adjustment was 300 MHz; the errors stopped completely and I was able to achieve a higher stable OC. I ended up between 5750 MHz and 5755 MHz on all 3 of the cards.
The part you are not going to like is that there is currently no way to lower the P2-state memory speed in Linux. I have put in a request for help over at the Nvidia Developers forum, so hopefully they will enable that ability in X server soon. So far nobody has replied to the post, but it is Sunday. Until then I would recommend putting the 980 in a Windows box, which I know is slower, but in the long run it will pay off; I have moved 3 of mine from Linux to Windows. Or you can yell at Nvidia for selling us a GPU with what I think is either faulty memory or a faulty memory controller, most likely the latter.
https://devtalk.nvidia.com/default/topic/888789/linux/problem-with-memory-speed-on-gtx-9xx-when-running-compute-software/
nvidia-settings -q [gpu:0]/GPUCurrentPerfLevel -q [gpu:0]/GPUCurrentClockFreqs
Attribute 'GPUCurrentPerfLevel' (mslinuxmint:0[gpu:0]): 2.
'GPUCurrentPerfLevel' is an integer attribute.
'GPUCurrentPerfLevel' is a read-only attribute.
'GPUCurrentPerfLevel' can use the following target types: X Screen, GPU.
Attribute 'GPUCurrentClockFreqs' (mslinuxmint:0[gpu:0]): 1514,3004.
'GPUCurrentClockFreqs' is a packed integer attribute.
'GPUCurrentClockFreqs' is a read-only attribute.
'GPUCurrentClockFreqs' can use the following target types: X Screen, GPU.
nvidia-smi --query-gpu=pstate --format=csv
pstate
P2
nvidia-smi --query-gpu=clocks.applications.graphics --format=csv
clocks.applications.graphics [MHz]
1113 MHz
nvidia-smi --query-gpu=clocks.applications.memory --format=csv
clocks.applications.memory [MHz]
3505 MHz
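The separate queries above can be folded into a single call. This is a minimal sketch, assuming the `pstate`, `clocks.mem` and `clocks.applications.memory` query fields are available in your driver's nvidia-smi (`nvidia-smi --help-query-gpu` lists them). `check_mem_clock` is a hypothetical helper name, not part of nvidia-smi; it reads one CSV line per GPU and reports whether the memory is running below the application memory clock.

```shell
#!/bin/sh
# check_mem_clock: hypothetical helper, not part of nvidia-smi.
# Reads lines of the form "P2, 3004, 3505" on stdin:
#   performance state, current memory clock, application memory clock (MHz).
check_mem_clock() {
    awk -F', *' '{
        if ($2 + 0 < $3 + 0)
            printf "%s: memory at %d MHz, %d MHz below the application clock\n", $1, $2, $3 - $2
        else
            printf "%s: memory at %d MHz, at or above the application clock\n", $1, $2
    }'
}

# Usage on a machine with the NVIDIA driver installed:
#   nvidia-smi --query-gpu=pstate,clocks.mem,clocks.applications.memory \
#              --format=csv,noheader,nounits | check_mem_clock
```

With the values shown in this log (P2, 3004 MHz current, 3505 MHz application clock) the helper would report a 501 MHz gap.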
nvidia-smi -q -d SUPPORTED_CLOCKS
==============NVSMI LOG==============
Timestamp : Thu Oct 29 23:12:38 2015
Driver Version : 346.96
Attached GPUs : 1
GPU 0000:01:00.0
Supported Clocks
Memory : 3505 MHz
Graphics : 1641 MHz
Graphics : 1628 MHz
Graphics : 1616 MHz
Graphics : 1603 MHz
Graphics : 1590 MHz
Graphics : 1578 MHz
Graphics : 1565 MHz
Graphics : 1552 MHz
Graphics : 1540 MHz
Graphics : 1527 MHz
Graphics : 1514 MHz
Graphics : 1502 MHz
Graphics : 1489 MHz
Graphics : 1476 MHz
Graphics : 1464 MHz
Graphics : 1451 MHz
Graphics : 1438 MHz
Graphics : 1426 MHz
Graphics : 1413 MHz
Graphics : 1400 MHz
Graphics : 1388 MHz
Graphics : 1375 MHz
Graphics : 1362 MHz
Graphics : 1350 MHz
Graphics : 1337 MHz
Graphics : 1324 MHz
Graphics : 1312 MHz
Graphics : 1299 MHz
Graphics : 1286 MHz
Graphics : 1274 MHz
Graphics : 1261 MHz
Graphics : 1249 MHz
Graphics : 1236 MHz
Graphics : 1223 MHz
Graphics : 1211 MHz
Graphics : 1198 MHz
Graphics : 1185 MHz
Graphics : 1173 MHz
Graphics : 1160 MHz
Graphics : 1147 MHz
Graphics : 1135 MHz
Graphics : 1122 MHz
Graphics : 1109 MHz
Graphics : 1097 MHz
Graphics : 1085 MHz
Graphics : 1072 MHz
Graphics : 1071 MHz
Graphics : 1059 MHz
Graphics : 1046 MHz
Graphics : 1033 MHz
Graphics : 1021 MHz
Graphics : 1008 MHz
Graphics : 995 MHz
Graphics : 983 MHz
Graphics : 970 MHz
Graphics : 957 MHz
Graphics : 945 MHz
Graphics : 932 MHz
Graphics : 919 MHz
Graphics : 907 MHz
Graphics : 894 MHz
Graphics : 881 MHz
Graphics : 869 MHz
Graphics : 856 MHz
Graphics : 844 MHz
Graphics : 831 MHz
Graphics : 818 MHz
Graphics : 806 MHz
Graphics : 793 MHz
Graphics : 780 MHz
Graphics : 768 MHz
Graphics : 755 MHz
Graphics : 742 MHz
Graphics : 730 MHz
Graphics : 717 MHz
Graphics : 704 MHz
Graphics : 692 MHz
Graphics : 680 MHz
Graphics : 667 MHz
Graphics : 655 MHz
Graphics : 642 MHz
Graphics : 630 MHz
Graphics : 617 MHz
Graphics : 605 MHz
Graphics : 592 MHz
Graphics : 590 MHz
Graphics : 509 MHz
Graphics : 455 MHz
Graphics : 388 MHz
Graphics : 349 MHz
Graphics : 320 MHz
Memory : 3004 MHz
Graphics : 1641 MHz
Graphics : 1628 MHz
Graphics : 1616 MHz
Graphics : 1603 MHz
Graphics : 1590 MHz
Graphics : 1578 MHz
Graphics : 1565 MHz
Graphics : 1552 MHz
Graphics : 1540 MHz
Graphics : 1527 MHz
Graphics : 1514 MHz
Graphics : 1502 MHz
Graphics : 1489 MHz
Graphics : 1476 MHz
Graphics : 1464 MHz
Graphics : 1451 MHz
Graphics : 1438 MHz
Graphics : 1426 MHz
Graphics : 1413 MHz
Graphics : 1400 MHz
Graphics : 1388 MHz
Graphics : 1375 MHz
Graphics : 1362 MHz
Graphics : 1350 MHz
Graphics : 1337 MHz
Graphics : 1324 MHz
Graphics : 1312 MHz
Graphics : 1299 MHz
Graphics : 1286 MHz
Graphics : 1274 MHz
Graphics : 1261 MHz
Graphics : 1249 MHz
Graphics : 1236 MHz
Graphics : 1223 MHz
Graphics : 1211 MHz
Graphics : 1198 MHz
Graphics : 1185 MHz
Graphics : 1173 MHz
Graphics : 1160 MHz
Graphics : 1147 MHz
Graphics : 1135 MHz
Graphics : 1122 MHz
Graphics : 1109 MHz
Graphics : 1097 MHz
Graphics : 1085 MHz
Graphics : 1072 MHz
Graphics : 1071 MHz
Graphics : 1059 MHz
Graphics : 1046 MHz
Graphics : 1033 MHz
Graphics : 1021 MHz
Graphics : 1008 MHz
Graphics : 995 MHz
Graphics : 983 MHz
Graphics : 970 MHz
Graphics : 957 MHz
Graphics : 945 MHz
Graphics : 932 MHz
Graphics : 919 MHz
Graphics : 907 MHz
Graphics : 894 MHz
Graphics : 881 MHz
Graphics : 869 MHz
Graphics : 856 MHz
Graphics : 844 MHz
Graphics : 831 MHz
Graphics : 818 MHz
Graphics : 806 MHz
Graphics : 793 MHz
Graphics : 780 MHz
Graphics : 768 MHz
Graphics : 755 MHz
Graphics : 742 MHz
Graphics : 730 MHz
Graphics : 717 MHz
Graphics : 704 MHz
Graphics : 692 MHz
Graphics : 680 MHz
Graphics : 667 MHz
Graphics : 655 MHz
Graphics : 642 MHz
Graphics : 630 MHz
Graphics : 617 MHz
Graphics : 605 MHz
Graphics : 592 MHz
Graphics : 590 MHz
Graphics : 509 MHz
Graphics : 455 MHz
Graphics : 388 MHz
Graphics : 349 MHz
Graphics : 320 MHz
Memory : 810 MHz
Graphics : 1455 MHz
Graphics : 1442 MHz
Graphics : 1430 MHz
Graphics : 1417 MHz
Graphics : 1404 MHz
Graphics : 1392 MHz
Graphics : 1379 MHz
Graphics : 1366 MHz
Graphics : 1354 MHz
Graphics : 1341 MHz
Graphics : 1328 MHz
Graphics : 1316 MHz
Graphics : 1303 MHz
Graphics : 1290 MHz
Graphics : 1278 MHz
Graphics : 1265 MHz
Graphics : 1252 MHz
Graphics : 1240 MHz
Graphics : 1227 MHz
Graphics : 1215 MHz
Graphics : 1202 MHz
Graphics : 1189 MHz
Graphics : 1177 MHz
Graphics : 1164 MHz
Graphics : 1151 MHz
Graphics : 1139 MHz
Graphics : 1126 MHz
Graphics : 1113 MHz
Graphics : 1101 MHz
Graphics : 1088 MHz
Graphics : 1075 MHz
Graphics : 1063 MHz
Graphics : 1050 MHz
Graphics : 1037 MHz
Graphics : 1025 MHz
Graphics : 1012 MHz
Graphics : 999 MHz
Graphics : 987 MHz
Graphics : 974 MHz
Graphics : 961 MHz
Graphics : 949 MHz
Graphics : 936 MHz
Graphics : 923 MHz
Graphics : 911 MHz
Graphics : 899 MHz
Graphics : 886 MHz
Graphics : 885 MHz
Graphics : 873 MHz
Graphics : 860 MHz
Graphics : 847 MHz
Graphics : 835 MHz
Graphics : 822 MHz
Graphics : 810 MHz
Graphics : 797 MHz
Graphics : 784 MHz
Graphics : 772 MHz
Graphics : 759 MHz
Graphics : 746 MHz
Graphics : 734 MHz
Graphics : 721 MHz
Graphics : 708 MHz
Graphics : 696 MHz
Graphics : 683 MHz
Graphics : 670 MHz
Graphics : 658 MHz
Graphics : 645 MHz
Graphics : 632 MHz
Graphics : 620 MHz
Graphics : 607 MHz
Graphics : 594 MHz
Graphics : 582 MHz
Graphics : 569 MHz
Graphics : 556 MHz
Graphics : 544 MHz
Graphics : 531 MHz
Graphics : 519 MHz
Graphics : 506 MHz
Graphics : 494 MHz
Graphics : 481 MHz
Graphics : 469 MHz
Graphics : 456 MHz
Graphics : 444 MHz
Graphics : 431 MHz
Graphics : 419 MHz
Graphics : 406 MHz
Graphics : 405 MHz
Graphics : 324 MHz
Graphics : 270 MHz
Graphics : 202 MHz
Graphics : 162 MHz
Graphics : 135 MHz
Memory : 324 MHz
Graphics : 405 MHz
Graphics : 324 MHz
Graphics : 270 MHz
Graphics : 202 MHz
Graphics : 162 MHz
Graphics : 135 MHz
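The listing above is long, and the point of interest is only which memory clocks the driver offers. A small sketch that condenses it, assuming the `Memory : N MHz` / `Graphics : N MHz` line layout shown above; `list_memory_clocks` is a hypothetical helper name.

```shell
#!/bin/sh
# list_memory_clocks: hypothetical helper. Prints each supported memory clock
# (MHz) followed by the number of graphics clocks listed under it.
list_memory_clocks() {
    awk '
        $1 == "Memory"   { if (mem != "") print mem, n; mem = $3; n = 0 }
        $1 == "Graphics" { n++ }
        END              { if (mem != "") print mem, n }
    '
}

# Usage:
#   nvidia-smi -q -d SUPPORTED_CLOCKS | list_memory_clocks
```

For the listing above this gives four memory clocks: 3505 and 3004 MHz (91 graphics clocks each), 810 MHz (91) and 324 MHz (6). There is nothing between 3004 and 3505 MHz, and nothing just below 3004 MHz.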
nvidia-smi -ac 3004,1109
Applications clocks set to "(MEM 3004, SM 1109)" for GPU 0000:01:00.0
Warning: persistence mode is disabled on this device. This settings will go back to default as soon as driver unloads (e.g. last application like nvidia-smi or cuda application terminates). Run with [--help | -h] switch to get more information on how to enable persistence mode.
All done.
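The warning above says the application clocks revert once the driver unloads. A sketch of one way to address that, assuming root access: enable persistence mode with `nvidia-smi -pm 1` before setting the clocks. `set_app_clocks` is a hypothetical wrapper that only prints the commands unless `APPLY=1` is set. This addresses the warning only; it is not a way to lower the P2 memory clock.

```shell
#!/bin/sh
# set_app_clocks MEM_MHZ GRAPHICS_MHZ
# Hypothetical wrapper: prints the commands by default (dry run) and only
# executes them when APPLY=1 is set in the environment. Both need root.
set_app_clocks() {
    mem=$1
    gfx=$2
    for cmd in "nvidia-smi -pm 1" "nvidia-smi -ac ${mem},${gfx}"; do
        if [ "${APPLY:-0}" = "1" ]; then
            $cmd || return 1      # unquoted on purpose so the string splits into words
        else
            echo "$cmd"
        fi
    done
}

# Dry run with the values used in this log:
#   set_app_clocks 3004 1109
```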
nvidia-settings -q [gpu:0]/GPUCurrentPerfLevel -q [gpu:0]/GPUCurrentClockFreqs
Attribute 'GPUCurrentPerfLevel' (mslinuxmint:0[gpu:0]): 3.
'GPUCurrentPerfLevel' is an integer attribute.
'GPUCurrentPerfLevel' is a read-only attribute.
'GPUCurrentPerfLevel' can use the following target types: X Screen, GPU.
Attribute 'GPUCurrentClockFreqs' (mslinuxmint:0[gpu:0]): 1514,3505.
'GPUCurrentClockFreqs' is a packed integer attribute.
'GPUCurrentClockFreqs' is a read-only attribute.
'GPUCurrentClockFreqs' can use the following target types: X Screen, GPU.
nvidia-smi --query-gpu=pstate --format=csv
pstate
P0
nvidia-settings --assign [gpu:0]/GPUMemoryTransferRateOffset[3]=...
nvidia-smi -rac
All done.
nvidia-settings -q [gpu:0]/GPUCurrentPerfLevel -q [gpu:0]/GPUCurrentClockFreqs
Attribute 'GPUCurrentPerfLevel' (mslinuxmint:0[gpu:0]): 2.
'GPUCurrentPerfLevel' is an integer attribute.
'GPUCurrentPerfLevel' is a read-only attribute.
'GPUCurrentPerfLevel' can use the following target types: X Screen, GPU.
Attribute 'GPUCurrentClockFreqs' (mslinuxmint:0[gpu:0]): 1514,3004.
'GPUCurrentClockFreqs' is a packed integer attribute.
'GPUCurrentClockFreqs' is a read-only attribute.
'GPUCurrentClockFreqs' can use the following target types: X Screen, GPU.
nvidia-smi --query-gpu=clocks.applications.graphics --format=csv
clocks.applications.graphics [MHz]
1113 MHz
nvidia-smi --query-gpu=clocks.applications.memory --format=csv
clocks.applications.memory [MHz]
3505 MHz
nvidia-smi --query-gpu=pstate --format=csv
pstate
P2

toTOW wrote:Did you change the clock for the right P-state? 7010 MHz (1752 MHz real) is the clock for P0, but most cards fold in the P2 state, whose default clock is 6000 MHz (1500 MHz real).
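The figures in this thread are the same clocks quoted in different units: nvidia-smi prints 3505 and 3004 MHz, while the posts talk about 7010/6000 MHz and "real" clocks of 1752/1500 MHz. A small sketch of the conversion, with the factors of two taken from those figures; the function names are hypothetical.

```shell
#!/bin/sh
# Hypothetical helpers relating the ways the same memory clock is quoted.
# Input for both: the memory clock in MHz as printed by nvidia-smi.
to_transfer_rate() { echo $(( $1 * 2 )); }   # effective rate, e.g. 3505 -> 7010
to_real_clock()    { echo $(( $1 / 2 )); }   # integer division, rounds down

# Usage:
#   to_transfer_rate 3505
#   to_real_clock 3505
```

For the P2 value of 3004 MHz these give 6008 and 1502, which the posts round to 6000 and 1500.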

jimerickson wrote:This is on Linux with the 355.11 driver.