Project: 7809 (Run 0, Clone 390, Gen 24)

Moderators: Site Moderators, FAHC Science Team

Post Reply
Qinsp
Posts: 216
Joined: Sun Oct 17, 2010 2:34 pm

Project: 7809 (Run 0, Clone 390, Gen 24)

Post by Qinsp »

Machine:
Win7-64
2x Xeon X5650 @ 2.66ghz (factory speed and settings)
Have never witnessed a crash while using it.

First one I saw in the log said "Time clock off by 2 hours" or something like that. Neither the BIOS or Windows has the wrong time.

This time I got this although the screen had said over 90% complete:

Code: Select all

*********************** Log Started 2013-01-29T15:50:49Z ***********************
15:50:49:************************* Folding@home Client *************************
15:50:49:      Website: http://folding.stanford.edu/
15:50:49:    Copyright: (c) 2009-2012 Stanford University
15:50:49:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
15:50:49:         Args: --lifeline 3304 --command-port=36330
15:50:49:       Config: C:/Users/CrayZ3/AppData/Roaming/FAHClient/config.xml
15:50:49:******************************** Build ********************************
15:50:49:      Version: 7.2.9
15:50:49:         Date: Oct 3 2012
15:50:49:         Time: 18:05:48
15:50:49:      SVN Rev: 3578
15:50:49:       Branch: fah/trunk/client
15:50:49:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
15:50:49:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
15:50:49:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
15:50:49:     Platform: win32 XP
15:50:49:         Bits: 32
15:50:49:         Mode: Release
15:50:49:******************************* System ********************************
15:50:49:          CPU: Intel(R) Xeon(R) CPU X5650 @ 2.67GHz
15:50:49:       CPU ID: GenuineIntel Family 6 Model 44 Stepping 2
15:50:49:         CPUs: 24
15:50:49:       Memory: 11.99GiB
15:50:49:  Free Memory: 10.71GiB
15:50:49:      Threads: WINDOWS_THREADS
15:50:49:   On Battery: false
15:50:49:   UTC offset: -8
15:50:49:          PID: 3748
15:50:49:          CWD: C:/Users/CrayZ3/AppData/Roaming/FAHClient
15:50:49:           OS: Windows 7 Professional
15:50:49:      OS Arch: AMD64
15:50:49:         GPUs: 1
15:50:49:        GPU 0: NVIDIA:2 GF114 [GeForce GTX 560 Ti]
15:50:49:         CUDA: 2.1
15:50:49:  CUDA Driver: 5000
15:50:49:Win32 Service: false
15:50:49:***********************************************************************
15:50:49:<config>
15:50:49:  <!-- Folding Slot Configuration -->
15:50:49:  <gpu v='true'/>
15:50:49:
15:50:49:  <!-- Network -->
15:50:49:  <proxy v=':8080'/>
15:50:49:
15:50:49:  <!-- User Information -->
15:50:49:  <passkey v='********************************'/>
15:50:49:  <team v='195247'/>
15:50:49:  <user v='McSwain'/>
15:50:49:
15:50:49:  <!-- Folding Slots -->
15:50:49:  <slot id='1' type='SMP'/>
15:50:49:</config>
15:50:49:Trying to access database...
15:50:49:Successfully acquired database lock
15:50:49:Enabled folding slot 01: READY smp:24
15:50:49:WU00:FS01:Starting
15:50:49:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/CrayZ3/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 00 -suffix 01 -version 702 -lifeline 3748 -checkpoint 15 -np 24
15:50:49:WU00:FS01:Started FahCore on PID 3864
15:50:49:WU00:FS01:Core PID:3876
15:50:49:WU00:FS01:FahCore 0xa4 started
15:50:50:WU00:FS01:0xa4:
15:50:50:WU00:FS01:0xa4:*------------------------------*
15:50:50:WU00:FS01:0xa4:Folding@Home Gromacs GB Core
15:50:50:WU00:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
15:50:50:WU00:FS01:0xa4:
15:50:50:WU00:FS01:0xa4:Preparing to commence simulation
15:50:50:WU00:FS01:0xa4:- Ensuring status. Please wait.
15:50:52:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
15:50:59:WU00:FS01:0xa4:- Looking at optimizations...
15:50:59:WU00:FS01:0xa4:- Working with standard loops on this execution.
15:50:59:WU00:FS01:0xa4:- Previous termination of core was improper.
15:50:59:WU00:FS01:0xa4:- Going to use standard loops.
15:50:59:WU00:FS01:0xa4:- Files status OK
15:50:59:WU00:FS01:0xa4:- Expanded 2079318 -> 5386224 (decompressed 259.0 percent)
15:50:59:WU00:FS01:0xa4:Called DecompressByteArray: compressed_data_size=2079318 data_size=5386224, decompressed_data_size=5386224 diff=0
15:50:59:WU00:FS01:0xa4:- Digital signature verified
15:50:59:WU00:FS01:0xa4:
15:50:59:WU00:FS01:0xa4:Project: 7809 (Run 0, Clone 390, Gen 24)
15:50:59:WU00:FS01:0xa4:
15:50:59:WU00:FS01:0xa4:Entering M.D.
15:51:05:WU00:FS01:0xa4:Using Gromacs checkpoints
15:51:06:WU00:FS01:0xa4:Mapping NT from 24 to 24 
15:51:06:WU00:FS01:0xa4:mdrun returned 255
15:51:06:WU00:FS01:0xa4:Going to send back what have done -- stepsTotalG=0
15:51:06:WU00:FS01:0xa4:Work fraction=0.0000 steps=0.
15:51:10:WU00:FS01:0xa4:logfile size=37788 infoLength=37788 edr=0 trr=25
15:51:10:WU00:FS01:0xa4:logfile size: 37788 info=37788 bed=0 hdr=25
15:51:10:WU00:FS01:0xa4:- Writing 38326 bytes of core data to disk...
15:51:10:WU00:FS01:0xa4:Done: 37814 -> 6856 (compressed to 18.1 percent)
15:51:10:WU00:FS01:0xa4:  ... Done.
15:51:10:ERROR:unknown exception
15:51:10:WU00:FS01:0xa4:
15:51:10:WU00:FS01:0xa4:Folding@home Core Shutdown: UNSTABLE_MACHINE
15:51:10:WARNING:WU00:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
15:51:10:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:7809 run:0 clone:390 gen:24 core:0xa4 unit:0x0000001d0a3b1e874e31062d2747a32d
15:51:10:WU00:FS01:Uploading 7.20KiB to 171.64.65.99
15:51:10:WU00:FS01:Connecting to 171.64.65.99:8080
15:51:10:WU00:FS01:Upload complete
15:51:10:WU00:FS01:Server responded WORK_ACK (400)
15:51:10:WU00:FS01:Cleaning up
15:51:10:WU01:FS01:Connecting to assign3.stanford.edu:8080
15:51:11:WU01:FS01:News: Welcome to Folding@Home
15:51:11:WU01:FS01:Assigned to work server 171.64.65.99
15:51:11:WU01:FS01:Requesting new work unit for slot 01: READY smp:24 from 171.64.65.99
15:51:11:WU01:FS01:Connecting to 171.64.65.99:8080
15:51:12:WU01:FS01:Downloading 1.98MiB
15:51:13:WU01:FS01:Download complete
15:51:13:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:7809 run:7 clone:262 gen:58 core:0xa4 unit:0x0000004c0a3b1e874e3114dc86e4460c
15:51:13:WU01:FS01:Starting
15:51:13:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:/Users/CrayZ3/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 702 -lifeline 3748 -checkpoint 15 -np 24
15:51:13:WU01:FS01:Started FahCore on PID 3228
15:51:13:WU01:FS01:Core PID:3180
15:51:13:WU01:FS01:FahCore 0xa4 started
15:51:14:WU01:FS01:0xa4:
15:51:14:WU01:FS01:0xa4:*------------------------------*
15:51:14:WU01:FS01:0xa4:Folding@Home Gromacs GB Core
15:51:14:WU01:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
15:51:14:WU01:FS01:0xa4:
15:51:14:WU01:FS01:0xa4:Preparing to commence simulation
15:51:14:WU01:FS01:0xa4:- Looking at optimizations...
15:51:14:WU01:FS01:0xa4:- Created dyn
15:51:14:WU01:FS01:0xa4:- Files status OK
15:51:14:WU01:FS01:0xa4:- Expanded 2079361 -> 5386224 (decompressed 259.0 percent)
15:51:14:WU01:FS01:0xa4:Called DecompressByteArray: compressed_data_size=2079361 data_size=5386224, decompressed_data_size=5386224 diff=0
15:51:14:WU01:FS01:0xa4:- Digital signature verified
15:51:14:WU01:FS01:0xa4:
15:51:14:WU01:FS01:0xa4:Project: 7809 (Run 7, Clone 262, Gen 58)
15:51:14:WU01:FS01:0xa4:
15:51:14:WU01:FS01:0xa4:Assembly optimizations on if available.
15:51:14:WU01:FS01:0xa4:Entering M.D.
15:51:20:WU01:FS01:0xa4:Mapping NT from 24 to 24 
15:51:20:WU01:FS01:0xa4:Completed 0 out of 1500000 steps  (0%)
Is it the machine or something else? Standard install on 7.2.9. SMP Only.
Quality Inspection - Corona, CA, USA
Dimensional Inspection Laboratory
Pat McSwain, President
bollix47
Posts: 2942
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 7809 (Run 0, Clone 390, Gen 24)

Post by bollix47 »

Moved to Issues with a specific WU. There are no current returns in the database for this WU but I will mark it for follow-up.
First one I saw in the log said "Time clock off by 2 hours" or something like that. Neither the BIOS or Windows has the wrong time.
If you stop the client to do an update or for some other reason when you restart the client it detects clock skew and has to adjust time estimates etc. This is normal.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 7809 (Run 0, Clone 390, Gen 24)

Post by bruce »

If you can find the log that said "Time clock off by 2 hours" I'd appreciate seeing it. Every time you restart FAHClient, it creates a new log and by default, the previous 16 are kept in a sub-directory of the data directory called logs. Names contain the date and time.
bollix47
Posts: 2942
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 7809 (Run 0, Clone 390, Gen 24)

Post by bollix47 »

The database has been updated to show your return for zero points:

Hi McSwain (team 195247),
Your WU (P7809 R0 C390 G24) was added to the stats database on 2013-01-29 08:10:08 for 0 points of credit.

We will continue to monitor returns to see if anyone else has a problem with this WU.
bollix47
Posts: 2942
Joined: Sun Dec 02, 2007 5:04 am
Location: Canada

Re: Project: 7809 (Run 0, Clone 390, Gen 24)

Post by bollix47 »

The work unit was completed by another folder:

Hi xxxxxxxxxx (team 0),
Your WU (P7809 R0 C390 G24) was added to the stats database on 2013-01-30 03:10:15 for 12403 points of credit.

Report closed.
Post Reply