Page 1 of 1

Project: P13200, R0, C4, G1222

Posted: Mon Sep 11, 2017 10:01 pm
by v00d00
One of my GTX970's got one of these. It sat at 0.00 for 30 mins at which point I terminated it and restarted the client. The next time it did exactly the same things. No work done, but no crash. So im guessing its either the biggest protein ever or its not working for me.

Hate to say I dumped it, but I did. Its running P9841 now without issue.

Re: Project: P13200, R0, C4, G1222

Posted: Mon Sep 11, 2017 10:43 pm
by bruce
Post the segment of your log showing what it said up to the point that it hung.

Unfortunately there's not a lot I can offer since the stats database has been really having troubles.

Re: Project: P13200, R0, C4, G1222

Posted: Tue Sep 12, 2017 3:16 am
by v00d00
Ahh sorry totally forgot to back it up prior to rebuilding the directory.

It actually looked totally normal, it made it as far as completed 0%, but went no further. No error messages, no cryptic messages. Just a hung FahCore that had to be terminated the hard way (kill -9, but tried -15 and a normal ctrl+c from console).

If I get any more of them (that fail), i will repost.

Re: Project: P13200, R0, C4, G1222

Posted: Tue Sep 12, 2017 3:59 pm
by bruce
Here is a typical startup of Core_21

Code: Select all

23:26:21:WU02:FS02:0x21:Project: xxxx (Run x, Clone x, Gen x)
23:26:21:WU02:FS02:0x21:Unit: 0x000000xxxxxxxxxxxxxxxxxxxxxxxx
23:26:21:WU02:FS02:0x21:CPU: 0x00000000000000000000000000000000
23:26:21:WU02:FS02:0x21:Machine: x
23:26:21:WU02:FS02:0x21:Reading tar file core.xml
23:26:21:WU02:FS02:0x21:Reading tar file integrator.xml
23:26:21:WU02:FS02:0x21:Reading tar file state.xml
23:26:21:WU02:FS02:0x21:Reading tar file system.xml
23:26:21:WU02:FS02:0x21:Digital signatures verified
23:26:21:WU02:FS02:0x21:Folding@home GPU Core21 Folding@home Core
23:26:21:WU02:FS02:0x21:Version 0.0.18
23:26:26:WU02:FS02:0x21:Completed 0 out of 6250000 steps (0%)
23:26:26:WU02:FS02:0x21:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
Which was the LAST message you got?

Re: Project: P13200, R0, C4, G1222

Posted: Tue Sep 12, 2017 5:22 pm
by v00d00
The last message in the log was essentially this.

Code: Select all

WU01:FS02:0x21:Completed 0 out of 6250000 steps (0%)
It never made it any further. The web interface showed it stuck at 0.01% and it never went any further.

I left it for a while, then stopped it via web interface. The core didn't terminate. I then reattached that screen and ctrl+c to close everything, client terminated without issue, core still running at 100%, forcefully killed it. Restarted client. Same thing occurred again, no error messages in the log. Repeated previous steps after a bit. Deleted workunit, pulled it off beta, restarted. It grabbed a P9841 and has completed 6 or 7 workunits since without issue.

The card is running at stock settings, as is the machine.

Re: Project: P13200, R0, C4, G1222

Posted: Sat Sep 16, 2017 10:48 am
by toTOW
Did you see CPU and/or GPU loads ?

edit : this WU has been completed by someone else two days ago ...

Re: Project: P13200, R0, C4, G1222

Posted: Sat Sep 16, 2017 8:12 pm
by v00d00
Its fine. If someone else completed it, their is no crisis.

Folding continues. I havent had any of these workunits since so can't clarify on whether it was just a rogue unit that didn't like my system, or a whole project that wont run.

And for the record, I had 100% FahCore usage, but no gpu utilisation. The client terminated without issue, but the core was hung.