[*]Failing projects (please add a list of exact project numbers if you have them)
Project: 5757 (Run 10, Clone 89, Gen 141) 81%
Project: 5757 (Run 10, Clone 89, Gen 141) 55%
Project: 5757 (Run 10, Clone 89, Gen 141) 34%
Project: 5757 (Run 10, Clone 89, Gen 141) 100%
Project: 5753 (Run 0, Clone 263, Gen 1) 52%
Project: 5753 (Run 0, Clone 263, Gen 1) 100%
Project: 5762 (Run 1, Clone 15, Gen 159) 55%
Project: 5762 (Run 1, Clone 15, Gen 159) 90%
Project: 5771 (Run 1, Clone 256, Gen 1) 29%
Project: 5771 (Run 1, Clone 256, Gen 1) 100%
Project: 5759 (Run 14, Clone 74, Gen 156) 25%
Project: 5759 (Run 14, Clone 74, Gen 156) 61%
Project: 5759 (Run 14, Clone 74, Gen 156) 85%
Separate work Unit completed - Project: 5750 (Run 5, Clone 265, Gen 5)
Project: 5759 (Run 14, Clone 74, Gen 156) 9%
Project: 5759 (Run 14, Clone 74, Gen 156) 100%
Project: 5760 (Run 12, Clone 34, Gen 71) 65%
Project: 5760 (Run 12, Clone 34, Gen 71) 38%
Project: 5760 (Run 12, Clone 34, Gen 71) 100%
Project: 5770 (Run 9, Clone 250, Gen 3) 9%
Project: 5770 (Run 9, Clone 250, Gen 3) 18%
Project: 5760 (Run 8, Clone 9, Gen 118) 59%
Project: 5760 (Run 8, Clone 9, Gen 118) 44%
Project: 5760 (Run 8, Clone 9, Gen 118) 38%
Computer shuts down folding for 24hrs due to the last five mdrun errors listed above
[*]Failing hardware (please add the exact GPU designation if you know it. ie 9800GTX+)
* 8800GT series PNY 256MB
[*]Failing OS
* Windows XP 64 bits
[*]Failing drivers (enter here the version number of the driver you use)
180.60 - No successful units completed
181.20 - Has started to return units successfully after acting like the 180.60 drivers for some time
[*]Comments (add below any detail you might find useful to the report)
This card ran for several weeks without EUEing on its own in a WinXP 64bit machine with 180.60 drivers on Intel Core2 hardware.
After being transplanted into a new machine it near constantly EUEed with the 180.60 drivers and after trying the 181.20 drivers with the same results I turned off the client and left it for a day. The following day I started the client up again which resulted in the strange units that progressively failed and then suddenly completed. Since then I've had this occur again. AMD 790FX 64x2 hardware this time.
This machine also has three other identical cards folding, and this card is GPU0. This is the only card with issues.
In addition, temps are ~38degC at load (my attic is cold in the winter) and stock clocks on the problem card (lowering the clocks doesn't help)
After every error I get the following:
- Code: Select all
Error: Could not get length of results file work/wuresults_0*.dat
Error: Could not read unit 0* file. Removing from queue.
I take it that this is normal? The units are not getting deleted though as my work folders are full of old units. I just cleared out a folder and it had 70MB of WU files in it
Update: I just got the folowing errors when trying to restart the GPU
- Code: Select all
[10:19:11] Project: 5760 (Run 8, Clone 9, Gen 118)
[10:19:11]
[10:19:11] Assembly optimizations on if available.
[10:19:11] Entering M.D.
[10:19:20] Working on Protein
[10:19:20] mdrun_gpu returned
[10:19:20] Self-test failure
[10:19:20]
[10:19:20] Folding@home Core Shutdown: UNSTABLE_MACHINE
[10:19:24] CoreStatus = 7A (122)
[10:19:24] Sending work to server
[10:19:24] Project: 5760 (Run 8, Clone 9, Gen 118)
five times then shutdown.
I have deleted the work folder contents and the queue and will see what I get.