Two 85xx crashes UNSTABLE_MACHINE

Moderators: Site Moderators, FAHC Science Team

Two 85xx crashes UNSTABLE_MACHINE

Postby BlackSun59 » Thu Jul 18, 2013 3:31 pm

LTNS, everyone. Been having no F@H problems in over a year until now.

Specs:
MOBO: Gigabyte GA-78LMT-USB3
CPU: AMD Athlon II X4 640 no OC
RAM: 2x2GB Kingston DDR3 1333
HDD: 2xWD 160GB, WD 80GB, Seagate 1TB
PSU: Antec EarthWatts 430D
OS: Win 7 Pro 32-bit SP1
F@H client: 7.2.9, SMP folding
CPU temp@100% load: 53°c
CPU temp@idle: 38°c
Ambient temp: 31°c

In 3 hours, I got an 8585 that crashed after 23%, and an 8572 that crashed after just 2%.
Log says UNSTABLE_MACHINE, but also "Could not write compressed data to results file" and "Missing original Unit data, cannot send dump report"
I copied the entire log before I shut F@H down after getting a 7809.

Code: Select all
05:11:05:  <exception-locations v='true'/>
05:11:05:  <gpu-assignment-servers>
05:11:05:    assign-GPU.stanford.edu:80 assign-GPU.stanford.edu:8080
05:11:05:  </gpu-assignment-servers>
05:11:05:  <stack-traces v='false'/>
05:11:05:
05:11:05:  <!-- Error Handling -->
05:11:05:  <max-slot-errors v='5'/>
05:11:05:  <max-unit-errors v='5'/>
05:11:05:
05:11:05:  <!-- FahCore Control -->
05:11:05:  <checkpoint v='30'/>
05:11:05:  <core-dir v='cores'/>
05:11:05:  <core-priority v='idle'/>
05:11:05:  <cpu-affinity v='false'/>
05:11:05:  <cpu-usage v='100'/>
05:11:05:  <no-assembly v='false'/>
05:11:05:
05:11:05:  <!-- Folding Slot Configuration -->
05:11:05:  <cause-pref v='ANY'/>
05:11:05:  <client-subtype v='STDCLI'/>
05:11:05:  <client-type v='normal'/>
05:11:05:  <cpu-species v='X86_AMD'/>
05:11:05:  <cpu-type v='X86'/>
05:11:05:  <cpus v='-1'/>
05:11:05:  <cuda-index v='0'/>
05:11:05:  <extra-core-args v='extra-core-args advanced'/>
05:11:05:  <gpu v='false'/>
05:11:05:  <gpu-usage v='100'/>
05:11:05:  <max-packet-size v='normal'/>
05:11:05:  <opencl-index v='0'/>
05:11:05:  <os-species v='UNKNOWN'/>
05:11:05:  <os-type v='WIN32'/>
05:11:05:  <project-key v='0'/>
05:11:05:  <smp v='true'/>
05:11:05:
05:11:05:  <!-- Logging -->
05:11:05:  <log v='log.txt'/>
05:11:05:  <log-color v='false'/>
05:11:05:  <log-crlf v='true'/>
05:11:05:  <log-date v='false'/>
05:11:05:  <log-date-periodically v='21600'/>
05:11:05:  <log-debug v='true'/>
05:11:05:  <log-domain v='false'/>
05:11:05:  <log-header v='true'/>
05:11:05:  <log-level v='true'/>
05:11:05:  <log-no-info-header v='true'/>
05:11:05:  <log-redirect v='false'/>
05:11:05:  <log-rotate v='true'/>
05:11:05:  <log-rotate-dir v='logs'/>
05:11:05:  <log-rotate-max v='16'/>
05:11:05:  <log-short-level v='false'/>
05:11:05:  <log-simple-domains v='true'/>
05:11:05:  <log-thread-id v='false'/>
05:11:05:  <log-thread-prefix v='true'/>
05:11:05:  <log-time v='true'/>
05:11:05:  <log-to-screen v='true'/>
05:11:05:  <log-truncate v='false'/>
05:11:05:  <verbosity v='5'/>
05:11:05:
05:11:05:  <!-- Network -->
05:11:05:  <proxy v=':8080'/>
05:11:05:  <proxy-enable v='false'/>
05:11:05:  <proxy-pass v=''/>
05:11:05:  <proxy-user v=''/>
05:11:05:
05:11:05:  <!-- Process Control -->
05:11:05:  <child v='false'/>
05:11:05:  <daemon v='false'/>
05:11:05:  <pid v='false'/>
05:11:05:  <pid-file v='Folding@home Client.pid'/>
05:11:05:  <respawn v='false'/>
05:11:05:  <service v='false'/>
05:11:05:
05:11:05:  <!-- Remote Command Server -->
05:11:05:  <command-address v='0.0.0.0'/>
05:11:05:  <command-allow v='127.0.0.1'/>
05:11:05:  <command-allow-no-pass v='127.0.0.1'/>
05:11:05:  <command-deny v='0.0.0.0/0'/>
05:11:05:  <command-deny-no-pass v='0.0.0.0/0'/>
05:11:05:  <command-port v='36330'/>
05:11:05:
05:11:05:  <!-- Slot Control -->
05:11:05:  <max-shutdown-wait v='60'/>
05:11:05:  <pause-on-battery v='false'/>
05:11:05:  <pause-on-start v='false'/>
05:11:05:
05:11:05:  <!-- User Information -->
05:11:05:  <machine-id v='0'/>
05:11:05:  <passkey v='********************************'/>
05:11:05:  <team v='11108'/>
05:11:05:  <user v='BlackSun59'/>
05:11:05:
05:11:05:  <!-- Work Unit Control -->
05:11:05:  <dump-after-deadline v='true'/>
05:11:05:  <max-queue v='16'/>
05:11:05:  <max-units v='0'/>
05:11:05:  <next-unit-percentage v='99'/>
05:11:05:
05:11:05:  <!-- Folding Slots -->
05:11:05:  <slot id='0' type='SMP'/>
05:11:05:</config>
05:11:05:Trying to access database...
05:11:05:Successfully acquired database lock
05:11:05:Enabled folding slot 00: READY smp:4
05:11:05:Started thread 1 on PID 5320
05:11:05:Started thread 3 on PID 5320
05:11:05:WU00:FS00:Starting
05:11:05:Started thread 5 on PID 5320
05:11:05:Started thread 4 on PID 5320
05:11:05:Started thread 6 on PID 5320
05:11:05:WU00:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.e
xe" C:/Users/user/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/
x86/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 702 -lifeline 5320 -c
heckpoint 30 -np 4 extra-core-args advanced
05:11:05:WU00:FS00:Started FahCore on PID 4556
05:11:05:Started thread 7 on PID 5320
05:11:05:WU00:FS00:Core PID:6104
05:11:05:WU00:FS00:FahCore 0xa3 started
05:11:06:WU00:FS00:0xa3:
05:11:06:WU00:FS00:0xa3:*------------------------------*
05:11:06:WU00:FS00:0xa3:Folding@Home Gromacs SMP Core
05:11:06:WU00:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
05:11:06:WU00:FS00:0xa3:
05:11:06:WU00:FS00:0xa3:Preparing to commence simulation
05:11:06:WU00:FS00:0xa3:- Looking at optimizations...
05:11:06:WU00:FS00:0xa3:- Files status OK
05:11:06:WU00:FS00:0xa3:- Expanded 3853723 -> 4394668 (decompressed 114.0 percen
t)
05:11:06:WU00:FS00:0xa3:Called DecompressByteArray: compressed_data_size=3853723
 data_size=4394668, decompressed_data_size=4394668 diff=0
05:11:06:WU00:FS00:0xa3:- Digital signature verified
05:11:06:WU00:FS00:0xa3:
05:11:06:WU00:FS00:0xa3:Project: 8585 (Run 0, Clone 0, Gen 147)
05:11:06:WU00:FS00:0xa3:
05:11:06:WU00:FS00:0xa3:Assembly optimizations on if available.
05:11:06:WU00:FS00:0xa3:Entering M.D.
05:11:12:WU00:FS00:0xa3:Using Gromacs checkpoints
05:11:13:WU00:FS00:0xa3:Mapping NT from 4 to 4
05:11:14:WU00:FS00:0xa3:Resuming from checkpoint
05:11:14:WU00:FS00:0xa3:Verified 00/wudata_01.log
05:11:14:WU00:FS00:0xa3:Verified 00/wudata_01.trr
05:11:14:WU00:FS00:0xa3:Verified 00/wudata_01.edr
05:11:15:WU00:FS00:0xa3:Completed 46255 out of 500000 steps  (9%)
Reading file ./work/00/wudata_01.tpr, VERSION 4.5.1-dev-20100930-afd66-dirty (si
ngle precision)
Reading file ./work/00/wudata_01.tpr, VERSION 4.5.1-dev-20100930-afd66-dirty (si
ngle precision)
05:29:28:WU00:FS00:0xa3:Completed 50000 out of 500000 steps  (10%)
05:53:41:WU00:FS00:0xa3:Completed 55000 out of 500000 steps  (11%)
06:16:43:WU00:FS00:0xa3:Completed 60000 out of 500000 steps  (12%)
06:44:08:WU00:FS00:0xa3:Completed 65000 out of 500000 steps  (13%)
07:05:17:WU00:FS00:0xa3:Completed 70000 out of 500000 steps  (14%)
07:29:51:WU00:FS00:0xa3:Completed 75000 out of 500000 steps  (15%)
07:51:00:WU00:FS00:0xa3:Completed 80000 out of 500000 steps  (16%)
08:12:07:WU00:FS00:0xa3:Completed 85000 out of 500000 steps  (17%)
08:33:14:WU00:FS00:0xa3:Completed 90000 out of 500000 steps  (18%)
08:54:20:WU00:FS00:0xa3:Completed 95000 out of 500000 steps  (19%)
09:15:25:WU00:FS00:0xa3:Completed 100000 out of 500000 steps  (20%)
09:36:31:WU00:FS00:0xa3:Completed 105000 out of 500000 steps  (21%)
09:57:36:WU00:FS00:0xa3:Completed 110000 out of 500000 steps  (22%)
10:25:37:WU00:FS00:0xa3:Completed 115000 out of 500000 steps  (23%)
10:41:15:WU00:FS00:0xa3:mdrun returned 255
10:41:15:WU00:FS00:0xa3:Going to send back what have done -- stepsTotalG=500000
10:41:15:WU00:FS00:0xa3:Work fraction=0.2363 steps=500000.
10:41:19:WU00:FS00:0xa3:logfile size=22682 infoLength=22682 edr=0 trr=25
10:41:19:WU00:FS00:0xa3:logfile size: 22682 info=22682 bed=0 hdr=25
10:41:19:WU00:FS00:0xa3:- Writing 23220 bytes of core data to disk...
10:41:19:WU00:FS00:0xa3:Done: 22708 -> 5961 (compressed to 26.2 percent)
10:41:19:WU00:FS00:0xa3:- Could not write compressed data to results file.
10:41:21:WU00:FS00:0xa3:
10:41:21:WU00:FS00:0xa3:Folding@home Core Shutdown: UNSTABLE_MACHINE
10:41:21:WARNING:WU00:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
10:41:21:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:8
585 run:0 clone:0 gen:147 core:0xa3 unit:0x000006160a3b1e595122612b8bf00e32
10:41:21:WARNING:WU00:FS00:Missing original Unit data, cannot send dump report
10:41:21:WU00:FS00:Cleaning up
10:41:21:WU00:FS00:Connecting to assign3.stanford.edu:8080
10:41:22:WU00:FS00:News: Welcome to Folding@Home
10:41:22:WU00:FS00:Assigned to work server 128.143.231.202
10:41:22:WU00:FS00:Requesting new work unit for slot 00: READY smp:4 from 128.14
3.231.202
10:41:22:WU00:FS00:Connecting to 128.143.231.202:8080
10:41:23:WU00:FS00:Downloading 3.67MiB
10:41:29:WU00:FS00:Download 49.39%
10:41:34:WU00:FS00:Download complete
10:41:34:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:85
72 run:0 clone:4 gen:125 core:0xa3 unit:0x0000061c0a3b1e5951225b34ca4b30fa
10:41:34:WU00:FS00:Starting
10:41:34:WU00:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.e
xe" C:/Users/user/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/
x86/Core_a3.fah/FahCore_a3.exe -dir 00 -suffix 01 -version 702 -lifeline 5320 -c
heckpoint 30 -np 4 extra-core-args advanced
10:41:34:WU00:FS00:Started FahCore on PID 4312
10:41:34:Started thread 8 on PID 5320
10:41:34:WU00:FS00:Core PID:820
10:41:34:WU00:FS00:FahCore 0xa3 started
10:41:34:WU00:FS00:Downloading project 8572 description
10:41:34:WU00:FS00:Connecting to fah-web.stanford.edu:80
10:41:35:WU00:FS00:0xa3:
10:41:35:WU00:FS00:0xa3:*------------------------------*
10:41:35:WU00:FS00:0xa3:Folding@Home Gromacs SMP Core
10:41:35:WU00:FS00:0xa3:Version 2.27 (Dec. 15, 2010)
10:41:35:WU00:FS00:0xa3:
10:41:35:WU00:FS00:0xa3:Preparing to commence simulation
10:41:35:WU00:FS00:0xa3:- Looking at optimizations...
10:41:35:WU00:FS00:0xa3:- Created dyn
10:41:35:WU00:FS00:0xa3:- Files status OK
10:41:35:WU00:FS00:Project 8572 description downloaded successfully
10:41:35:WU00:FS00:0xa3:- Expanded 3847582 -> 4388700 (decompressed 114.0 percen
t)
10:41:35:WU00:FS00:0xa3:Called DecompressByteArray: compressed_data_size=3847582
 data_size=4388700, decompressed_data_size=4388700 diff=0
10:41:35:WU00:FS00:0xa3:- Digital signature verified
10:41:35:WU00:FS00:0xa3:
10:41:35:WU00:FS00:0xa3:Project: 8572 (Run 0, Clone 4, Gen 125)
10:41:35:WU00:FS00:0xa3:
10:41:35:WU00:FS00:0xa3:Assembly optimizations on if available.
10:41:35:WU00:FS00:0xa3:Entering M.D.
10:41:41:WU00:FS00:0xa3:Mapping NT from 4 to 4
10:41:42:WU00:FS00:0xa3:Completed 0 out of 500000 steps  (0%)
Reading file ./work/00/wudata_01.tpr, VERSION 4.5.1-dev-20100930-afd66-dirty (si
ngle precision)
Reading file ./work/00/wudata_01.tpr, VERSION 4.5.1-dev-20100930-afd66-dirty (si
ngle precision)
11:07:05:WU00:FS00:0xa3:Completed 5000 out of 500000 steps  (1%)
******************************** Date: 18/07/13 ********************************

11:31:58:WU00:FS00:0xa3:Completed 10000 out of 500000 steps  (2%)
11:41:43:WU00:FS00:0xa3:mdrun returned 255
11:41:43:WU00:FS00:0xa3:Going to send back what have done -- stepsTotalG=500000
11:41:43:WU00:FS00:0xa3:Work fraction=0.0239 steps=500000.
11:41:47:WU00:FS00:0xa3:logfile size=13228 infoLength=13228 edr=0 trr=25
11:41:47:WU00:FS00:0xa3:logfile size: 13228 info=13228 bed=0 hdr=25
11:41:47:WU00:FS00:0xa3:- Writing 13766 bytes of core data to disk...
11:41:47:WU00:FS00:0xa3:Done: 13254 -> 4704 (compressed to 35.4 percent)
11:41:47:WU00:FS00:0xa3:- Could not write compressed data to results file.
11:41:49:WU00:FS00:0xa3:
11:41:49:WU00:FS00:0xa3:Folding@home Core Shutdown: UNSTABLE_MACHINE
11:41:49:WARNING:WU00:FS00:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
11:41:49:WU00:FS00:Sending unit results: id:00 state:SEND error:FAULTY project:8
572 run:0 clone:4 gen:125 core:0xa3 unit:0x0000061c0a3b1e5951225b34ca4b30fa
11:41:49:WARNING:WU00:FS00:Missing original Unit data, cannot send dump report
11:41:49:WU00:FS00:Cleaning up
11:41:49:WU00:FS00:Connecting to assign3.stanford.edu:8080
11:41:50:WU00:FS00:News: Welcome to Folding@Home
11:41:50:WU00:FS00:Assigned to work server 171.64.65.99
11:41:50:WU00:FS00:Requesting new work unit for slot 00: READY smp:4 from 171.64
.65.99
11:41:50:WU00:FS00:Connecting to 171.64.65.99:8080
11:41:51:WU00:FS00:Downloading 1.98MiB
11:41:57:WU00:FS00:Download 85.08%
11:41:57:WU00:FS00:Download complete
11:41:57:WU00:FS00:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:78
09 run:3 clone:441 gen:153 core:0xa4 unit:0x000000ec0a3b1e874e310cec89bdd50d
11:41:58:WU00:FS00:Starting
11:41:58:WU00:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.e
xe" C:/Users/user/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/
x86/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 5320 -c
heckpoint 30 -np 4 extra-core-args advanced
11:41:58:WU00:FS00:Started FahCore on PID 4908
11:41:58:Started thread 9 on PID 5320
11:41:58:WU00:FS00:Core PID:3152
11:41:58:WU00:FS00:FahCore 0xa4 started
11:41:58:WU00:FS00:Downloading project 7809 description
11:41:58:WU00:FS00:Connecting to fah-web.stanford.edu:80
11:41:58:WU00:FS00:0xa4:
11:41:58:WU00:FS00:0xa4:*------------------------------*
11:41:58:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
11:41:58:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
11:41:58:WU00:FS00:0xa4:
11:41:58:WU00:FS00:0xa4:Preparing to commence simulation
11:41:58:WU00:FS00:0xa4:- Looking at optimizations...
11:41:58:WU00:FS00:0xa4:- Created dyn
11:41:58:WU00:FS00:0xa4:- Files status OK
11:41:58:WU00:FS00:Project 7809 description downloaded successfully
11:41:58:WU00:FS00:0xa4:- Expanded 2079221 -> 5386224 (decompressed 259.0 percen
t)
11:41:58:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=2079221
 data_size=5386224, decompressed_data_size=5386224 diff=0
11:41:58:WU00:FS00:0xa4:- Digital signature verified
11:41:58:WU00:FS00:0xa4:
11:41:58:WU00:FS00:0xa4:Project: 7809 (Run 3, Clone 441, Gen 153)
11:41:58:WU00:FS00:0xa4:
11:41:58:WU00:FS00:0xa4:Assembly optimizations on if available.
11:41:58:WU00:FS00:0xa4:Entering M.D.
11:42:05:WU00:FS00:0xa4:Mapping NT from 4 to 4
11:42:05:WU00:FS00:0xa4:Completed 0 out of 1500000 steps  (0%)
Reading file ./work/00/wudata_01.tpr, VERSION 4.5.3 (single precision)
Reading file ./work/00/wudata_01.tpr, VERSION 4.5.3 (single precision)
12:05:06:WU00:FS00:0xa4:Completed 15000 out of 1500000 steps  (1%)
12:28:14:WU00:FS00:0xa4:Completed 30000 out of 1500000 steps  (2%)
12:51:13:WU00:FS00:0xa4:Completed 45000 out of 1500000 steps  (3%)
13:14:11:WU00:FS00:0xa4:Completed 60000 out of 1500000 steps  (4%)
13:32:13:WARNING:Console control signal 0 on PID 5320
13:32:13:Exiting, please wait. . .
13:32:14:FS00:Shutting core down
13:32:18:Clean exit

C:\Users\user\AppData\Roaming\FAHClient>


What am I looking at here? Bad WU? Overheating? Hard disk/file system corruption? Any insight's more than welcome.
BlackSun59
 
Posts: 46
Joined: Wed Apr 09, 2008 8:24 pm

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby Joe_H » Thu Jul 18, 2013 5:21 pm

Could you post the full log including the beginning that shows the version and configuration information about your system? If that does not show in the log window, the Refresh button can reload the log. Or you can copy from the actual log file in the data directory. Please also return the verbosity setting to its default of 3, higher settings give no useful information in most cases and hinder troubleshooting.

As for what is shown in the log portion that you posted, the fact the client could not write to the disk is troubling. It could be disk corruption, or lack of free space. Another possibility is that the ownerships and permissions on the directories where F@H is writing its files is wrong. There have been reports from other Windows users of working F@H installations that started failing when something changed those ownerships and permissions. I don't recall a cause of that being determined, uninstalling including the data files and doing a clean reinstall should correct that. If the current WU does process okay, finishing that first would be the best approach.
Image

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
Joe_H
Site Admin
 
Posts: 6696
Joined: Tue Apr 21, 2009 5:41 pm
Location: W. MA

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby BlackSun59 » Thu Jul 18, 2013 5:51 pm

Here's the log after restarting with verbosity set to 3.
Code: Select all
*********************** Log Started 2013-07-18T16:46:22Z ***********************
16:46:22:************************* Folding@home Client *************************
16:46:22:      Website: http://folding.stanford.edu/
16:46:22:    Copyright: (c) 2009-2012 Stanford University
16:46:22:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
16:46:22:         Args:
16:46:22:       Config: C:/Users/user/AppData/Roaming/FAHClient/config.xml
16:46:22:******************************** Build ********************************
16:46:22:      Version: 7.2.9
16:46:22:         Date: Oct 3 2012
16:46:22:         Time: 18:05:48
16:46:22:      SVN Rev: 3578
16:46:22:       Branch: fah/trunk/client
16:46:22:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
16:46:22:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
16:46:22:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
16:46:22:     Platform: win32 XP
16:46:22:         Bits: 32
16:46:22:         Mode: Release
16:46:22:******************************* System ********************************
16:46:22:          CPU: AMD Athlon(tm) II X4 640 Processor
16:46:22:       CPU ID: AuthenticAMD Family 16 Model 5 Stepping 3
16:46:22:         CPUs: 4
16:46:22:       Memory: 3.37GiB
16:46:22:  Free Memory: 1.31GiB
16:46:22:      Threads: WINDOWS_THREADS
16:46:22:   On Battery: false
16:46:22:   UTC offset: -4
16:46:22:          PID: 4508
16:46:22:          CWD: C:/Users/user/AppData/Roaming/FAHClient
16:46:22:           OS: Windows 7 Professional Service Pack 1
16:46:22:      OS Arch: X86
16:46:22:         GPUs: 1
16:46:22:        GPU 0: UNSUPPORTED: 760G [Radeon 3000]
16:46:22:         CUDA: Not detected
16:46:22:Win32 Service: false
16:46:22:***********************************************************************
16:46:22:<config>
16:46:22:  <service-description v='Folding@home Client'/>
16:46:22:  <service-restart v='true'/>
16:46:22:  <service-restart-delay v='5000'/>
16:46:22:
16:46:22:  <!-- Client Control -->
16:46:22:  <cycle-rate v='4'/>
16:46:22:  <cycles v='-1'/>
16:46:22:  <data-directory v='.'/>
16:46:22:  <disable-project-lookup v='false'/>
16:46:22:  <exec-directory v='C:\Program Files\FAHClient'/>
16:46:22:  <exit-when-done v='false'/>
16:46:22:  <threads v='4'/>
16:46:22:
16:46:22:  <!-- Configuration -->
16:46:22:  <config-rotate v='true'/>
16:46:22:  <config-rotate-dir v='configs'/>
16:46:22:  <config-rotate-max v='16'/>
16:46:22:
16:46:22:  <!-- Debugging -->
16:46:22:  <assignment-servers>
16:46:22:    assign3.stanford.edu:8080 assign4.stanford.edu:80
16:46:22:  </assignment-servers>
16:46:22:  <capture-directory v='capture'/>
16:46:22:  <capture-sockets v='false'/>
16:46:22:  <debug-sockets v='false'/>
16:46:22:  <exception-locations v='true'/>
16:46:22:  <gpu-assignment-servers>
16:46:22:    assign-GPU.stanford.edu:80 assign-GPU.stanford.edu:8080
16:46:22:  </gpu-assignment-servers>
16:46:22:  <stack-traces v='false'/>
16:46:22:
16:46:22:  <!-- Error Handling -->
16:46:22:  <max-slot-errors v='5'/>
16:46:22:  <max-unit-errors v='5'/>
16:46:22:
16:46:22:  <!-- FahCore Control -->
16:46:22:  <checkpoint v='30'/>
16:46:22:  <core-dir v='cores'/>
16:46:22:  <core-priority v='idle'/>
16:46:22:  <cpu-affinity v='false'/>
16:46:22:  <cpu-usage v='100'/>
16:46:22:  <no-assembly v='false'/>
16:46:22:
16:46:22:  <!-- Folding Slot Configuration -->
16:46:22:  <cause-pref v='ANY'/>
16:46:22:  <client-subtype v='STDCLI'/>
16:46:22:  <client-type v='normal'/>
16:46:22:  <cpu-species v='X86_AMD'/>
16:46:22:  <cpu-type v='X86'/>
16:46:22:  <cpus v='-1'/>
16:46:22:  <cuda-index v='0'/>
16:46:22:  <extra-core-args v='extra-core-args advanced'/>
16:46:22:  <gpu v='false'/>
16:46:22:  <gpu-usage v='100'/>
16:46:22:  <max-packet-size v='normal'/>
16:46:22:  <opencl-index v='0'/>
16:46:22:  <os-species v='UNKNOWN'/>
16:46:22:  <os-type v='WIN32'/>
16:46:22:  <project-key v='0'/>
16:46:22:  <smp v='true'/>
16:46:22:
16:46:22:  <!-- Logging -->
16:46:22:  <log v='log.txt'/>
16:46:22:  <log-color v='false'/>
16:46:22:  <log-crlf v='true'/>
16:46:22:  <log-date v='false'/>
16:46:22:  <log-date-periodically v='21600'/>
16:46:22:  <log-debug v='true'/>
16:46:22:  <log-domain v='false'/>
16:46:22:  <log-header v='true'/>
16:46:22:  <log-level v='true'/>
16:46:22:  <log-no-info-header v='true'/>
16:46:22:  <log-redirect v='false'/>
16:46:22:  <log-rotate v='true'/>
16:46:22:  <log-rotate-dir v='logs'/>
16:46:22:  <log-rotate-max v='16'/>
16:46:22:  <log-short-level v='false'/>
16:46:22:  <log-simple-domains v='true'/>
16:46:22:  <log-thread-id v='false'/>
16:46:22:  <log-thread-prefix v='true'/>
16:46:22:  <log-time v='true'/>
16:46:22:  <log-to-screen v='true'/>
16:46:22:  <log-truncate v='false'/>
16:46:22:  <verbosity v='5'/>
16:46:22:
16:46:22:  <!-- Network -->
16:46:22:  <proxy v=':8080'/>
16:46:22:  <proxy-enable v='false'/>
16:46:22:  <proxy-pass v=''/>
16:46:22:  <proxy-user v=''/>
16:46:22:
16:46:22:  <!-- Process Control -->
16:46:22:  <child v='false'/>
16:46:22:  <daemon v='false'/>
16:46:22:  <pid v='false'/>
16:46:22:  <pid-file v='Folding@home Client.pid'/>
16:46:22:  <respawn v='false'/>
16:46:22:  <service v='false'/>
16:46:22:
16:46:22:  <!-- Remote Command Server -->
16:46:22:  <command-address v='0.0.0.0'/>
16:46:22:  <command-allow v='127.0.0.1'/>
16:46:22:  <command-allow-no-pass v='127.0.0.1'/>
16:46:22:  <command-deny v='0.0.0.0/0'/>
16:46:22:  <command-deny-no-pass v='0.0.0.0/0'/>
16:46:22:  <command-port v='36330'/>
16:46:22:
16:46:22:  <!-- Slot Control -->
16:46:22:  <max-shutdown-wait v='60'/>
16:46:22:  <pause-on-battery v='false'/>
16:46:22:  <pause-on-start v='false'/>
16:46:22:
16:46:22:  <!-- User Information -->
16:46:22:  <machine-id v='0'/>
16:46:22:  <passkey v='********************************'/>
16:46:22:  <team v='11108'/>
16:46:22:  <user v='BlackSun59'/>
16:46:22:
16:46:22:  <!-- Work Unit Control -->
16:46:22:  <dump-after-deadline v='true'/>
16:46:22:  <max-queue v='16'/>
16:46:22:  <max-units v='0'/>
16:46:22:  <next-unit-percentage v='99'/>
16:46:22:
16:46:22:  <!-- Folding Slots -->
16:46:22:  <slot id='0' type='SMP'/>
16:46:22:</config>
16:46:22:Trying to access database...
16:46:22:Successfully acquired database lock
16:46:22:Enabled folding slot 00: READY smp:4
16:46:22:Started thread 5 on PID 4508
16:46:22:Started thread 6 on PID 4508
16:46:22:WU00:FS00:Starting
16:46:22:Started thread 1 on PID 4508
16:46:22:Started thread 3 on PID 4508
16:46:22:Started thread 4 on PID 4508
16:46:22:WU00:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:/Users/user/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 4508 -checkpoint 30 -np 4 extra-core-args advanced
16:46:22:WU00:FS00:Started FahCore on PID 756
16:46:22:Started thread 7 on PID 4508
16:46:22:WU00:FS00:Core PID:5748
16:46:22:WU00:FS00:FahCore 0xa4 started
16:46:22:Started thread 8 on PID 4508
16:46:22:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
16:46:23:WU00:FS00:0xa4:
16:46:23:WU00:FS00:0xa4:*------------------------------*
16:46:23:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
16:46:23:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
16:46:23:WU00:FS00:0xa4:
16:46:23:WU00:FS00:0xa4:Preparing to commence simulation
16:46:23:WU00:FS00:0xa4:- Looking at optimizations...
16:46:23:WU00:FS00:0xa4:- Files status OK
16:46:23:WU00:FS00:0xa4:- Expanded 2079221 -> 5386224 (decompressed 259.0 percent)
16:46:23:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=2079221 data_size=5386224, decompressed_data_size=5386224 diff=0
16:46:23:WU00:FS00:0xa4:- Digital signature verified
16:46:23:WU00:FS00:0xa4:
16:46:23:WU00:FS00:0xa4:Project: 7809 (Run 3, Clone 441, Gen 153)
16:46:23:WU00:FS00:0xa4:
16:46:23:WU00:FS00:0xa4:Assembly optimizations on if available.
16:46:23:WU00:FS00:0xa4:Entering M.D.
16:46:29:WU00:FS00:0xa4:Using Gromacs checkpoints
16:46:29:WU00:FS00:0xa4:Mapping NT from 4 to 4
16:46:30:WU00:FS00:0xa4:Resuming from checkpoint
16:46:30:WU00:FS00:0xa4:Verified 00/wudata_01.log
16:46:30:WU00:FS00:0xa4:Verified 00/wudata_01.trr
16:46:30:WU00:FS00:0xa4:Verified 00/wudata_01.xtc
16:46:30:WU00:FS00:0xa4:Verified 00/wudata_01.edr
16:46:30:WU00:FS00:0xa4:Completed 58630 out of 1500000 steps  (3%)


This is all I have, so...

I'll run CHKDSK later today. There is a possibility of lack of free space, as I recall that Eraser was doing its nightly unused disk space routine, but I've never had a WU fail. My C: partition is 50GB but only 22GB is used.

Here are the error messages from Eraser:

Code: Select all
Session: Thursday, July 18, 2013 6:01:04 AM
Thursday, July 18, 2013 6:01:04 AM Error C:\pagefile.sys did not have its cluster tips erased because of the following error: The process cannot access the file because it is being used by another process. (Exception from HRESULT: 0x80070020)
Thursday, July 18, 2013 6:01:08 AM Error C:\Boot\BCD did not have its cluster tips erased because of the following error: The process cannot access the file because it is being used by another process. (Exception from HRESULT: 0x80070020)
Thursday, July 18, 2013 6:01:08 AM Error C:\Boot\BCD.LOG did not have its cluster tips erased because of the following error: The process cannot access the file because it is being used by another process. (Exception from HRESULT: 0x80070020)
BlackSun59
 
Posts: 46
Joined: Wed Apr 09, 2008 8:24 pm

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby Napoleon » Thu Jul 18, 2013 6:31 pm

16:46:22:WU00:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:/Users/user/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 4508 -checkpoint 30 -np 4 extra-core-args advanced
This is wrong...
May not solve the stability problem, but check how you've configured your client. It's supposed to be client-type=advanced. Remove any extra core args. EDIT: also restore verbosity to the default value 3. Verbosity 5 tends clutter the log and provides hardly any useful additional information.
Last edited by Napoleon on Thu Jul 18, 2013 7:42 pm, edited 1 time in total.
Win7 64bit, FAH v7, OC'd
2C/4T Atom330 3x667MHz - GT430 2x832.5MHz - ION iGPU 3x466.7MHz
NaCl - Core_15 - display
User avatar
Napoleon
 
Posts: 887
Joined: Wed May 26, 2010 3:31 pm
Location: Finland

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby 7im » Thu Jul 18, 2013 7:14 pm

Also, the log example you just posted still shows verbosity 5...

16:46:22: <verbosity v='5'/>
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
User avatar
7im
 
Posts: 10189
Joined: Thu Nov 29, 2007 5:30 pm
Location: Arizona

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby BlackSun59 » Thu Jul 18, 2013 10:53 pm

Okay, CHKDSK came back clean.
Here's the latest log (thanks, guys, for pointing out the config errors)

Code: Select all
*********************** Log Started 2013-07-18T21:45:59Z ***********************
21:45:59:************************* Folding@home Client *************************
21:45:59:      Website: http://folding.stanford.edu/
21:45:59:    Copyright: (c) 2009-2012 Stanford University
21:45:59:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
21:45:59:         Args: --lifeline 4204 --command-port=36330
21:45:59:       Config: C:/Users/user/AppData/Roaming/FAHClient/config.xml
21:45:59:******************************** Build ********************************
21:45:59:      Version: 7.2.9
21:45:59:         Date: Oct 3 2012
21:45:59:         Time: 18:05:48
21:45:59:      SVN Rev: 3578
21:45:59:       Branch: fah/trunk/client
21:45:59:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
21:45:59:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
21:45:59:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
21:45:59:     Platform: win32 XP
21:45:59:         Bits: 32
21:45:59:         Mode: Release
21:45:59:******************************* System ********************************
21:45:59:          CPU: AMD Athlon(tm) II X4 640 Processor
21:45:59:       CPU ID: AuthenticAMD Family 16 Model 5 Stepping 3
21:45:59:         CPUs: 4
21:45:59:       Memory: 3.37GiB
21:45:59:  Free Memory: 2.14GiB
21:45:59:      Threads: WINDOWS_THREADS
21:45:59:   On Battery: false
21:45:59:   UTC offset: -4
21:45:59:          PID: 4588
21:45:59:          CWD: C:/Users/user/AppData/Roaming/FAHClient
21:45:59:           OS: Windows 7 Professional Service Pack 1
21:45:59:      OS Arch: X86
21:45:59:         GPUs: 1
21:45:59:        GPU 0: UNSUPPORTED: 760G [Radeon 3000]
21:45:59:         CUDA: Not detected
21:45:59:Win32 Service: false
21:45:59:***********************************************************************
21:45:59:<config>
21:45:59:  <!-- FahCore Control -->
21:45:59:  <checkpoint v='30'/>
21:45:59:
21:45:59:  <!-- Folding Slot Configuration -->
21:45:59:  <extra-core-args v='advanced'/>
21:45:59:
21:45:59:  <!-- Network -->
21:45:59:  <proxy v=':8080'/>
21:45:59:
21:45:59:  <!-- User Information -->
21:45:59:  <passkey v='********************************'/>
21:45:59:  <team v='11108'/>
21:45:59:  <user v='BlackSun59'/>
21:45:59:
21:45:59:  <!-- Folding Slots -->
21:45:59:  <slot id='0' type='SMP'/>
21:45:59:</config>
21:45:59:Trying to access database...
21:45:59:Successfully acquired database lock
21:45:59:Enabled folding slot 00: READY smp:4
21:45:59:WU00:FS00:Starting
21:45:59:WU00:FS00:Running FahCore: "C:\Program Files\FAHClient/FAHCoreWrapper.exe" C:/Users/user/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/x86/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 4588 -checkpoint 30 -np 4 advanced
21:45:59:WU00:FS00:Started FahCore on PID 4604
21:45:59:WU00:FS00:Core PID:4428
21:45:59:WU00:FS00:FahCore 0xa4 started
21:45:59:WU00:FS00:0xa4:
21:45:59:WU00:FS00:0xa4:*------------------------------*
21:45:59:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
21:45:59:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
21:45:59:WU00:FS00:0xa4:
21:45:59:WU00:FS00:0xa4:Preparing to commence simulation
21:45:59:WU00:FS00:0xa4:- Looking at optimizations...
21:45:59:WU00:FS00:0xa4:- Files status OK
21:45:59:WU00:FS00:0xa4:- Expanded 2079221 -> 5386224 (decompressed 259.0 percent)
21:45:59:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=2079221 data_size=5386224, decompressed_data_size=5386224 diff=0
21:45:59:WU00:FS00:0xa4:- Digital signature verified
21:45:59:WU00:FS00:0xa4:
21:45:59:WU00:FS00:0xa4:Project: 7809 (Run 3, Clone 441, Gen 153)
21:45:59:WU00:FS00:0xa4:
21:45:59:WU00:FS00:0xa4:Assembly optimizations on if available.
21:45:59:WU00:FS00:0xa4:Entering M.D.
21:46:00:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
21:46:06:WU00:FS00:0xa4:Using Gromacs checkpoints
21:46:06:WU00:FS00:0xa4:Mapping NT from 4 to 4
21:46:06:WU00:FS00:0xa4:Resuming from checkpoint
21:46:06:WU00:FS00:0xa4:Verified 00/wudata_01.log
21:46:06:WU00:FS00:0xa4:Verified 00/wudata_01.trr
21:46:06:WU00:FS00:0xa4:Verified 00/wudata_01.xtc
21:46:06:WU00:FS00:0xa4:Verified 00/wudata_01.edr
21:46:06:WU00:FS00:0xa4:Completed 58630 out of 1500000 steps  (3%)
21:47:59:WU00:FS00:0xa4:Completed 60000 out of 1500000 steps  (4%)
BlackSun59
 
Posts: 46
Joined: Wed Apr 09, 2008 8:24 pm

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby PantherX » Thu Jul 18, 2013 11:30 pm

Please note that the below is the wrong configuration that is still present:
21:45:59: <extra-core-args v='advanced'/>

I would suggest (as above) to remove it since it isn't a valid argument. Most likely, it will be ignored by the FahCore but better safe than sorry.
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
User avatar
PantherX
Site Moderator
 
Posts: 6850
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby Napoleon » Fri Jul 19, 2013 12:09 am

A client properly configured for advanced (== late stage beta) WUs should have the line
<client-type v='advanced'/>
User avatar
Napoleon
 
Posts: 887
Joined: Wed May 26, 2010 3:31 pm
Location: Finland

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby BlackSun59 » Fri Jul 19, 2013 2:26 am

Okay, here is what I have. Remember, this is client v7.2.9.
Image
Is this right or is it wrong?
BlackSun59
 
Posts: 46
Joined: Wed Apr 09, 2008 8:24 pm

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby P5-133XL » Fri Jul 19, 2013 2:39 am

Wrong spot. You have it in extra core options (needs to be removed) and client-type = advanced should be in extra client options.
Image
P5-133XL
 
Posts: 2948
Joined: Sun Dec 02, 2007 5:36 am
Location: Salem. OR USA

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby PantherX » Fri Jul 19, 2013 2:48 am

Is there any reason to use client-type=advanced in the first place? If it is for additional points, please note that it isn't necessarily true since projects often move from advanced to full so eventually, everyone gets them. In some rare cases, that doesn't happen like when a project has a higher than normal failure rate or some other technical issues.
User avatar
PantherX
Site Moderator
 
Posts: 6850
Joined: Wed Dec 23, 2009 10:33 am
Location: Land Of The Long White Cloud

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby 7im » Fri Jul 19, 2013 4:18 am

Left side, not right side, better yet, neither.
User avatar
7im
 
Posts: 10189
Joined: Thu Nov 29, 2007 5:30 pm
Location: Arizona

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby BlackSun59 » Fri Jul 19, 2013 4:19 am

Well, I did use it with the v6.x client, and according to
https://fah-web.stanford.edu/projects/F ... ncesV6ToV7
v6's "-advmethods" became v7's "client-type=advanced", and v6's "forceasm" became v7's "--extra-core-args=-forceasm"
I put them in when I started using the v7.x client.
:?
I'll take it out completely and see what happens. If I weren't getting daytime ambient room temps of 32-35°c I'd be able to see if there's much difference. So for now, I'm restricted to nighttime only.
Thanks guys, very much.
Last edited by BlackSun59 on Fri Jul 19, 2013 4:32 am, edited 1 time in total.
BlackSun59
 
Posts: 46
Joined: Wed Apr 09, 2008 8:24 pm

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby Joe_H » Fri Jul 19, 2013 4:31 am

OvenMaster wrote:Well, I did use it with the v6.x client, and according to
https://fah-web.stanford.edu/projects/F ... ncesV6ToV7
v6's "-advmethods" became v7's "client-type=advanced", and v6's "forceasm" became v7's "--extra-core-args=-forceasm"
I put them in when I started using the v7.x client.

If you want client-type advanced to apply to all folding slots, then you would use the Expert tab Extra Client options. Better would be to add the option to the slot. Then if you have more than one slot, you can manage use of advanced individually.

As for forceasm, unless you are running the older Core 78 it is no longer needed. The newer A3 and A4 cores do not use it as they do not have the alternate non-SSE code paths that forceasm turned off.
Joe_H
Site Admin
 
Posts: 6696
Joined: Tue Apr 21, 2009 5:41 pm
Location: W. MA

Re: Two 85xx crashes UNSTABLE_MACHINE

Postby BlackSun59 » Fri Jul 19, 2013 4:33 am

Well, for now, to use the KISS formula, I'll do without it totally for a while to see what happens. Thank you all again.
BlackSun59
 
Posts: 46
Joined: Wed Apr 09, 2008 8:24 pm

Next

Return to V7.2.x -- Windows/Linux Release & OSX Beta

Who is online

Users browsing this forum: No registered users and 1 guest

cron