SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Moderators: Site Moderators, FAHC Science Team

SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby IvanV » Fri Sep 09, 2011 6:49 pm

I got a FahCore_a4.exe has stopped working as well as a Visual C++ error saying "Assertion failed|Program F:\...\FAHControl.exe|File: cairo-surface.c|Line:800|Expression status==CAIRO_STATUS_SUCCESS"

Faulting application name: FahCore_a4.exe, version: 0.0.0.0, time stamp: 0x4d23eafc
Faulting module name: FahCore_a4.exe, version: 0.0.0.0, time stamp: 0x4d23eafc
Exception code: 0xc0000005
Fault offset: 0x0026ed87
Faulting process id: 0x1294
Faulting application start time: 0x01cc6ee1f5e7b78e
Faulting application path: C:\Users\Ivan\AppData\Roaming\FAHClient\cores\www.stanford.edu\~pande\Win32\x86\Core_a4.fah\FahCore_a4.exe
Faulting module path: C:\Users\Ivan\AppData\Roaming\FAHClient\cores\www.stanford.edu\~pande\Win32\x86\Core_a4.fah\FahCore_a4.exe
Report Id: 9aa5178b-dad9-11e0-92d0-00248cd3c1a6

Event Viewer says:
Code: Select all
Faulting application name: FahCore_a4.exe, version: 0.0.0.0, time stamp: 0x4d23eafc
Faulting module name: FahCore_a4.exe, version: 0.0.0.0, time stamp: 0x4d23eafc
Exception code: 0xc0000005
Fault offset: 0x0026ed87
Faulting process id: 0x1294
Faulting application start time: 0x01cc6ee1f5e7b78e
Faulting application path: C:\Users\Ivan\AppData\Roaming\FAHClient\cores\www.stanford.edu\~pande\Win32\x86\Core_a4.fah\FahCore_a4.exe
Faulting module path: C:\Users\Ivan\AppData\Roaming\FAHClient\cores\www.stanford.edu\~pande\Win32\x86\Core_a4.fah\FahCore_a4.exe
Report Id: 9aa5178b-dad9-11e0-92d0-00248cd3c1a6


The crash happened at 6%, this was the ending of the wudata_01.log file
Code: Select all
DD  step 9799999  vol min/aver 0.813  load imb.: force  1.6%

           Step           Time         Lambda
        9800000    19600.00000        0.00000

   Energies (kJ/mol)
            U-B    Proper Dih.  Improper Dih.      CMAP Dih.          LJ-14
   4.34781e+003   2.55017e+003   2.44344e+002   -6.46274e+002   1.49158e+003
     Coulomb-14        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.
   2.22449e+004   4.25309e+004   -1.82556e+003   -3.40914e+005   -4.24428e+004
      Potential    Kinetic En.   Total Energy  Conserved En.    Temperature
   -3.12419e+005   5.90199e+004   -2.53399e+005   -2.51369e+005   2.95842e+002
 Pres. DC (bar) Pressure (bar)   Constr. rmsd
   -1.25530e+002   4.09961e+001   2.88361e-005

Writing checkpoint, step 9804690 at Fri Sep 09 19:47:25 2011


DD  step 9804999  vol min/aver 0.801  load imb.: force  3.1%

           Step           Time         Lambda
        9805000    19610.00000        0.00000

   Energies (kJ/mol)
            U-B    Proper Dih.  Improper Dih.      CMAP Dih.          LJ-14
   4.12241e+003   2.64038e+003   2.59464e+002   -6.70481e+002   1.52399e+003
     Coulomb-14        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.
   2.22626e+004   4.21001e+004   -1.82556e+003   -3.40302e+005   -4.24915e+004
      Potential    Kinetic En.   Total Energy  Conserved En.    Temperature
   -3.12381e+005   6.02367e+004   -2.52144e+005   -2.51234e+005   3.01941e+002
 Pres. DC (bar) Pressure (bar)   Constr. rmsd
   -1.25530e+002   -4.03105e+001   2.91864e-005
 
AMD Phenom II X4 940 BE @ 3.2GHz | ASUS M3A78-CM | 2 x 2GB DDR2 800MHz | AMD Radeon 6950 @ 800MHz / 1250MHz | SilverStone Strider 600W
Windows 7 x64 Professional SP1 | Folding@home Client Control 7.1.24
IvanV
 
Posts: 15
Joined: Wed May 04, 2011 12:10 pm

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby 7im » Fri Sep 09, 2011 8:55 pm

"Exception code: 0xc0000005"

You didn't post any computer specs, but it's likely you overclocked too much. The above code is common from unstable systems.


It might be a bad work unit, and a mod might be able to check that for you.
Project: 7200 (Run 65, Clone 8, Gen 13)
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
User avatar
7im
 
Posts: 10189
Joined: Thu Nov 29, 2007 5:30 pm
Location: Arizona

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby Grandpa_01 » Fri Sep 09, 2011 9:09 pm

This might be the same problem. viewtopic.php?f=66&t=19567
Image
2 - SM H8QGi-F AMD 6xxx=112 cores @ 3.2 & 3.9Ghz
5 - SM X9QRI-f+ Intel 4650 = 320 cores @ 3.15Ghz
2 - I7 980X 4.4Ghz 2-GTX680
1 - 2700k 4.4Ghz GTX680
Total = 464 cores folding
User avatar
Grandpa_01
 
Posts: 1122
Joined: Wed Mar 04, 2009 8:36 am

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby IvanV » Fri Sep 09, 2011 10:10 pm

I'm sorry I forgot the specs, I've added them to my signature. As for stability, the CPU has been folding for about 48h prior to the crash, five back-to-back WUs, almost non-stop except for a couple of brief gaming sessions (which isn't exactly resting either) and it passed IntelBurn Test at Maximum setting at high number of repetitions so, while I can't say "impossible", I'd say that it's unlikely that there is a stability issue with the system itself.
IvanV
 
Posts: 15
Joined: Wed May 04, 2011 12:10 pm

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby Leonardo » Sat Sep 10, 2011 1:28 am

You left out the most important component for high performance desktop computing - the power supply unit.
Image
User avatar
Leonardo
 
Posts: 260
Joined: Tue Dec 04, 2007 6:09 am
Location: Eagle River, Alaska

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby IvanV » Sat Sep 10, 2011 1:58 am

:D You're absolutely right and the sig has been updated, though I doubt that it's relevant to this matter. Even the GPU slot wasn't active at the time, so the PSU really wasn't breaking much sweat.
IvanV
 
Posts: 15
Joined: Wed May 04, 2011 12:10 pm

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby Leonardo » Sat Sep 10, 2011 5:17 am

That's the strangest looking post-crash log I've seen for Folding, but then, I'm not using V7 yet.
I don't think it's indicative of a hardware problem. Wish I could be more helpful.
User avatar
Leonardo
 
Posts: 260
Joined: Tue Dec 04, 2007 6:09 am
Location: Eagle River, Alaska

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby Nathan_P » Sat Sep 10, 2011 5:15 pm

Exception code: 0xc0000005

This usually indicates either a bad overclock on the memory or a failing memory module. Grab memtest86+ and let run a full test of your ram
Image
Nathan_P
 
Posts: 1175
Joined: Wed Apr 01, 2009 10:22 pm
Location: Jersey, Channel islands

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby IvanV » Sat Sep 10, 2011 10:29 pm

Thanks for the input guys, RAM is not OCed and the latest memtest86+ has just given it a clean bill of health.

Also, if it's worth anything, the WU finished this morning without any other hiccoughs (and so did the one after that as well as a GPU WU).
IvanV
 
Posts: 15
Joined: Wed May 04, 2011 12:10 pm

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby Athlonite » Sun Sep 25, 2011 1:37 pm

I had the very same thing happen to me after installing 7.1.33 where previously I had 7.1.24 same OC same temps as before but got a crash 3 times in a row

same CPU as IvanV except my OC is 3.41GHz previously stable for many WU's

WU project:7610 run:268 clone:0 gen:16 core:0xa4
Asus Strix X470F Gaming
AMD R7 3700
16GB GSkill trident Z DDR4-3200
Samsung 860 Evo 500GB SATA SSD
Gigabyte RX580 8GB
Athlonite
 
Posts: 30
Joined: Fri Dec 18, 2009 1:52 am
Location: Napier, NZ

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby sortofageek » Sun Sep 25, 2011 4:06 pm

Athlonite, I checked the database. There are no results showing at this point on Project: 7610 (Run 268, Clone 0, Gen 16), just FYI.
User avatar
sortofageek
Site Admin
 
Posts: 3111
Joined: Fri Nov 30, 2007 9:06 pm
Location: Team Helix

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby Athlonite » Mon Sep 26, 2011 7:29 am

No there wont be any results from me as after the third crash I dumped the SMP slot, waited a couple of days and reinstated it so far not a problem with project:7903 run:3 clone:1 gen:9 core:0xa4 unit:0x0000000b00ac9c214e4d2c04355b2daa currently sitting at 42% and not one single crash
Athlonite
 
Posts: 30
Joined: Fri Dec 18, 2009 1:52 am
Location: Napier, NZ

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby 7im » Mon Sep 26, 2011 8:36 pm

She is saying there are no results from you, OR from anyone else.
User avatar
7im
 
Posts: 10189
Joined: Thu Nov 29, 2007 5:30 pm
Location: Arizona

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby sortofageek » Thu Sep 29, 2011 4:53 am

Thank you, 7im, for clarifying.

I just checked again and there are still no results from anyone. The more time that passes with no results, the more likelihood it was a bad WU.
User avatar
sortofageek
Site Admin
 
Posts: 3111
Joined: Fri Nov 30, 2007 9:06 pm
Location: Team Helix

Re: SMP Crash - Core A4 and PRCG 7200 (65, 8, 13)

Postby sortofageek » Mon Oct 24, 2011 12:32 am

Athlonite wrote: I had the very same thing happen to me after installing 7.1.33 where previously I had 7.1.24 same OC same temps as before but got a crash 3 times in a row

same CPU as IvanV except my OC is 3.41GHz previously stable for many WU's

WU project:7610 run:268 clone:0 gen:16 core:0xa4


Two different folders have completed Project: 7610 (Run 268, Clone 0, Gen 16) successfully.
User avatar
sortofageek
Site Admin
 
Posts: 3111
Joined: Fri Nov 30, 2007 9:06 pm
Location: Team Helix

Next

Return to V7.1.52 Windows/Linux

Who is online

Users browsing this forum: No registered users and 1 guest

cron