Project:7809 & Project:7904 - UNSTABLE MACHINE

Moderators: Site Moderators, FAHC Science Team

josgba2002
Posts: 3
Joined: Sat Sep 08, 2012 12:16 am

Post by josgba2002 »

Hello. Here is my log for today:

*********************** Log Started 2012-09-07T20:07:02Z ***********************
20:07:02:************************* Folding@home Client *************************
20:07:02:      Website: http://folding.stanford.edu/
20:07:02:    Copyright: (c) 2009-2012 Stanford University
20:07:02:       Author: Joseph Coffland <joseph@cauldrondevelopment.com>
20:07:02:         Args: --lifeline 8028 --command-port=36330
20:07:02:       Config: E:/Users/Jose/AppData/Roaming/FAHClient/config.xml
20:07:02:******************************** Build ********************************
20:07:02:      Version: 7.1.52
20:07:02:         Date: Mar 20 2012
20:07:02:         Time: 19:37:42
20:07:02:      SVN Rev: 3515
20:07:02:       Branch: fah/trunk/client
20:07:02:     Compiler: Intel(R) C++ MSVC 1500 mode 1200
20:07:02:      Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
20:07:02:               /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT
20:07:02:     Platform: win32 XP
20:07:02:         Bits: 32
20:07:02:         Mode: Release
20:07:02:******************************* System ********************************
20:07:02:          CPU: Intel(R) Core(TM) i7-2600K CPU @ 3.40GHz
20:07:02:       CPU ID: GenuineIntel Family 6 Model 42 Stepping 7
20:07:02:         CPUs: 8
20:07:02:       Memory: 7.90GiB
20:07:02:  Free Memory: 3.89GiB
20:07:02:      Threads: WINDOWS_THREADS
20:07:02:   On Battery: false
20:07:02:   UTC offset: -4
20:07:02:          PID: 7720
20:07:02:          CWD: E:/Users/Jose/AppData/Roaming/FAHClient
20:07:02:           OS: Windows 7 Ultimate
20:07:02:      OS Arch: AMD64
20:07:02:         GPUs: 1
20:07:02:        GPU 0: FERMI:1 GK107 [GeForce GTX 670]
20:07:02:         CUDA: 3.0
20:07:02:  CUDA Driver: 5000
20:07:02:Win32 Service: false
20:07:02:***********************************************************************
20:07:02:<config>
20:07:02:  <!-- Folding Slot Configuration -->
20:07:02:  <gpu v='true'/>
20:07:02:
20:07:02:  <!-- Network -->
20:07:02:  <proxy v=':8080'/>
20:07:02:
20:07:02:  <!-- User Information -->
20:07:02:  <passkey v='********************************'/>
20:07:02:  <team v='111065'/>
20:07:02:  <user v='josgba2002'/>
20:07:02:
20:07:02:  <!-- Folding Slots -->
20:07:02:  <slot id='0' type='GPU'>
20:07:02:    <client-type v='beta'/>
20:07:02:    <cuda-index v='0'/>
20:07:02:    <opencl-index v='0'/>
20:07:02:    <pause-on-start v='true'/>
20:07:02:  </slot>
20:07:02:  <slot id='1' type='SMP'>
20:07:02:    <cpus v='-1'/>
20:07:02:    <max-packet-size v='small'/>
20:07:02:    <pause-on-start v='true'/>
20:07:02:  </slot>
20:07:02:</config>
20:07:02:Trying to access database...
20:07:02:Successfully acquired database lock
20:07:02:Enabled folding slot 00: PAUSED gpu:0:"GK107 [GeForce GTX 670]"
20:07:02:Enabled folding slot 01: PAUSED smp:8
20:07:02:WARNING:WU01:Missing data files, dumping
20:07:03:WU01:FS01:Cleaning up
20:07:06:Server connection id=1 on 0.0.0.0:36330 from 127.0.0.1
20:07:12:FS01:Unpaused
20:07:12:WU01:FS01:Connecting to assign3.stanford.edu:8080
20:07:13:WU01:FS01:News: Welcome to Folding@Home
20:07:13:WU01:FS01:Assigned to work server 171.64.65.99
20:07:13:WU01:FS01:Requesting new work unit for slot 01: READY smp:8 from 171.64.65.99
20:07:13:WU01:FS01:Connecting to 171.64.65.99:8080
20:07:17:WU01:FS01:Downloading 1.98MiB
20:07:23:WU01:FS01:Download 40.96%
20:07:29:WU01:FS01:Download 72.47%
20:07:33:WU01:FS01:Download complete
20:07:33:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:OK project:7809 run:7 clone:51 gen:42 core:0xa4 unit:0x000000340a3b1e874e3113f085bc3637
20:07:34:WU01:FS01:Starting
20:07:34:WU01:FS01:Running FahCore: "E:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" E:/Users/Jose/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 701 -lifeline 7720 -checkpoint 15 -np 8
20:07:34:WU01:FS01:Started FahCore on PID 9012
20:07:34:WU01:FS01:Core PID:7568
20:07:34:WU01:FS01:FahCore 0xa4 started
20:07:34:WU01:FS01:0xa4:
20:07:34:WU01:FS01:0xa4:*------------------------------*
20:07:34:WU01:FS01:0xa4:Folding@Home Gromacs GB Core
20:07:34:WU01:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
20:07:34:WU01:FS01:0xa4:
20:07:34:WU01:FS01:0xa4:Preparing to commence simulation
20:07:34:WU01:FS01:0xa4:- Looking at optimizations...
20:07:34:WU01:FS01:0xa4:- Created dyn
20:07:34:WU01:FS01:0xa4:- Files status OK
20:07:34:WU01:FS01:0xa4:- Expanded 2079412 -> 5386224 (decompressed 259.0 percent)
20:07:34:WU01:FS01:0xa4:Called DecompressByteArray: compressed_data_size=2079412 data_size=5386224, decompressed_data_size=5386224 diff=0
20:07:34:WU01:FS01:0xa4:- Digital signature verified
20:07:34:WU01:FS01:0xa4:
20:07:34:WU01:FS01:0xa4:Project: 7809 (Run 7, Clone 51, Gen 42)
20:07:34:WU01:FS01:0xa4:
20:07:34:WU01:FS01:0xa4:Assembly optimizations on if available.
20:07:34:WU01:FS01:0xa4:Entering M.D.
20:07:40:WU01:FS01:0xa4:Mapping NT from 8 to 8 
20:07:40:WU01:FS01:0xa4:Completed 0 out of 1500000 steps  (0%)
20:15:37:WU01:FS01:0xa4:Completed 15000 out of 1500000 steps  (1%)
20:23:25:WU01:FS01:0xa4:Completed 30000 out of 1500000 steps  (2%)
20:31:12:WU01:FS01:0xa4:Completed 45000 out of 1500000 steps  (3%)
20:39:17:WU01:FS01:0xa4:Completed 60000 out of 1500000 steps  (4%)
20:47:52:WU01:FS01:0xa4:Completed 75000 out of 1500000 steps  (5%)
20:55:41:WU01:FS01:0xa4:Completed 90000 out of 1500000 steps  (6%)
21:03:28:WU01:FS01:0xa4:Completed 105000 out of 1500000 steps  (7%)
21:11:17:WU01:FS01:0xa4:Completed 120000 out of 1500000 steps  (8%)
21:19:07:WU01:FS01:0xa4:Completed 135000 out of 1500000 steps  (9%)
21:26:59:WU01:FS01:0xa4:Completed 150000 out of 1500000 steps  (10%)
21:35:15:WU01:FS01:0xa4:Completed 165000 out of 1500000 steps  (11%)
21:43:06:WU01:FS01:0xa4:Completed 180000 out of 1500000 steps  (12%)
21:51:28:WU01:FS01:0xa4:Completed 195000 out of 1500000 steps  (13%)
21:59:32:WU01:FS01:0xa4:Completed 210000 out of 1500000 steps  (14%)
22:07:37:WU01:FS01:0xa4:Completed 225000 out of 1500000 steps  (15%)
22:15:43:WU01:FS01:0xa4:Completed 240000 out of 1500000 steps  (16%)
22:23:38:WU01:FS01:0xa4:Completed 255000 out of 1500000 steps  (17%)
22:31:30:WU01:FS01:0xa4:Completed 270000 out of 1500000 steps  (18%)
22:40:14:WU01:FS01:0xa4:Completed 285000 out of 1500000 steps  (19%)
22:48:38:WU01:FS01:0xa4:Completed 300000 out of 1500000 steps  (20%)
22:51:17:WU01:FS01:0xa4:mdrun returned 255
22:51:17:WU01:FS01:0xa4:Going to send back what have done -- stepsTotalG=1500000
22:51:17:WU01:FS01:0xa4:Work fraction=0.2031 steps=1500000.
22:51:21:WU01:FS01:0xa4:logfile size=13869 infoLength=13869 edr=0 trr=25
22:51:21:WU01:FS01:0xa4:logfile size: 13869 info=13869 bed=0 hdr=25
22:51:21:WU01:FS01:0xa4:- Writing 14407 bytes of core data to disk...
22:51:21:WU01:FS01:0xa4:Done: 13895 -> 4691 (compressed to 33.7 percent)
22:51:21:WU01:FS01:0xa4:  ... Done.
22:51:22:WU01:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
22:51:22:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:7809 run:7 clone:51 gen:42 core:0xa4 unit:0x000000340a3b1e874e3113f085bc3637
22:51:22:WU01:FS01:Uploading 5.08KiB to 171.64.65.99
22:51:22:WU01:FS01:Connecting to 171.64.65.99:8080
22:51:22:WU02:FS01:Connecting to assign3.stanford.edu:8080
22:51:23:WU02:FS01:News: Welcome to Folding@Home
22:51:23:WU02:FS01:Assigned to work server 128.113.12.162
22:51:23:WU02:FS01:Requesting new work unit for slot 01: READY smp:8 from 128.113.12.162
22:51:23:WU02:FS01:Connecting to 128.113.12.162:8080
22:51:23:WU01:FS01:Upload complete
22:51:23:WU01:FS01:Server responded WORK_ACK (400)
22:51:23:WU01:FS01:Cleaning up
22:51:27:WU02:FS01:Downloading 1.22MiB
22:51:33:WU02:FS01:Download 66.71%
22:51:36:WU02:FS01:Download complete
22:51:36:WU02:FS01:Received Unit: id:02 state:DOWNLOAD error:OK project:7904 run:20 clone:1 gen:26 core:0xa4 unit:0x0000002000ac9c224e4d32f04a3cb715
22:51:36:WU02:FS01:Starting
22:51:36:WU02:FS01:Running FahCore: "E:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" E:/Users/Jose/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 02 -suffix 01 -version 701 -lifeline 7720 -checkpoint 15 -np 8
22:51:36:WU02:FS01:Started FahCore on PID 8984
22:51:37:WU02:FS01:Core PID:8528
22:51:37:WU02:FS01:FahCore 0xa4 started
22:51:37:WU02:FS01:Downloading project 7904 description
22:51:37:WU02:FS01:Connecting to fah-web.stanford.edu:80
22:51:37:WU02:FS01:0xa4:
22:51:37:WU02:FS01:0xa4:*------------------------------*
22:51:37:WU02:FS01:0xa4:Folding@Home Gromacs GB Core
22:51:37:WU02:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
22:51:37:WU02:FS01:0xa4:
22:51:37:WU02:FS01:0xa4:Preparing to commence simulation
22:51:37:WU02:FS01:0xa4:- Looking at optimizations...
22:51:37:WU02:FS01:0xa4:- Created dyn
22:51:37:WU02:FS01:0xa4:- Files status OK
22:51:37:WU02:FS01:0xa4:- Expanded 1276653 -> 1751180 (decompressed 137.1 percent)
22:51:37:WU02:FS01:0xa4:Called DecompressByteArray: compressed_data_size=1276653 data_size=1751180, decompressed_data_size=1751180 diff=0
22:51:37:WU02:FS01:0xa4:- Digital signature verified
22:51:37:WU02:FS01:0xa4:
22:51:37:WU02:FS01:0xa4:Project: 7904 (Run 20, Clone 1, Gen 26)
22:51:37:WU02:FS01:0xa4:
22:51:37:WU02:FS01:0xa4:Assembly optimizations on if available.
22:51:37:WU02:FS01:0xa4:Entering M.D.
22:51:38:WU02:FS01:Project 7904 description downloaded successfully
22:51:43:WU02:FS01:0xa4:Mapping NT from 8 to 8 
22:51:43:WU02:FS01:0xa4:Completed 0 out of 1000000 steps  (0%)
22:55:39:WU02:FS01:0xa4:Completed 10000 out of 1000000 steps  (1%)
22:59:35:WU02:FS01:0xa4:Completed 20000 out of 1000000 steps  (2%)
23:03:41:WU02:FS01:0xa4:Completed 30000 out of 1000000 steps  (3%)
23:07:47:WU02:FS01:0xa4:Gromacs cannot continue further.
23:07:47:WU02:FS01:0xa4:Going to send back what have done -- stepsTotalG=1000000
23:07:47:WU02:FS01:0xa4:Work fraction=0.0400 steps=1000000.
23:07:51:WU02:FS01:0xa4:logfile size=12285 infoLength=12285 edr=0 trr=23
23:07:51:WU02:FS01:0xa4:logfile size: 12285 info=12285 bed=0 hdr=23
23:07:51:WU02:FS01:0xa4:- Writing 12821 bytes of core data to disk...
23:07:51:WU02:FS01:0xa4:Done: 12309 -> 4339 (compressed to 35.2 percent)
23:07:51:WU02:FS01:0xa4:  ... Done.
23:07:51:WU02:FS01:0xa4:
23:07:51:WU02:FS01:0xa4:Folding@home Core Shutdown: UNSTABLE_MACHINE
23:07:51:WU02:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
23:07:51:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:7904 run:20 clone:1 gen:26 core:0xa4 unit:0x0000002000ac9c224e4d32f04a3cb715
23:07:51:WU02:FS01:Uploading 4.74KiB to 128.113.12.162
23:07:51:WU02:FS01:Connecting to 128.113.12.162:8080
23:07:51:WU01:FS01:Connecting to assign3.stanford.edu:8080
23:07:52:WU01:FS01:News: Welcome to Folding@Home
23:07:52:WU01:FS01:Assigned to work server 128.113.12.162
23:07:52:WU01:FS01:Requesting new work unit for slot 01: READY smp:8 from 128.113.12.162
23:07:52:WU01:FS01:Connecting to 128.113.12.162:8080
23:07:52:WU02:FS01:Upload complete
23:07:52:WU02:FS01:Server responded WORK_ACK (400)
23:07:52:WU02:FS01:Cleaning up
23:07:56:WU01:FS01:Downloading 1.22MiB
23:08:02:WU01:FS01:Download 66.69%
23:08:08:WU01:FS01:Download 100.00%
23:08:08:WU01:FS01:Download complete
23:08:08:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:OK project:7904 run:14 clone:2 gen:17 core:0xa4 unit:0x0000001400ac9c224ebaaab437a08820
23:08:08:WU01:FS01:Starting
23:08:08:WU01:FS01:Running FahCore: "E:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" E:/Users/Jose/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a4.fah/FahCore_a4.exe -dir 01 -suffix 01 -version 701 -lifeline 7720 -checkpoint 15 -np 8
23:08:08:WU01:FS01:Started FahCore on PID 7524
23:08:08:WU01:FS01:Core PID:2268
23:08:08:WU01:FS01:FahCore 0xa4 started
23:08:08:WU01:FS01:0xa4:
23:08:08:WU01:FS01:0xa4:*------------------------------*
23:08:08:WU01:FS01:0xa4:Folding@Home Gromacs GB Core
23:08:08:WU01:FS01:0xa4:Version 2.27 (Dec. 15, 2010)
23:08:08:WU01:FS01:0xa4:
23:08:08:WU01:FS01:0xa4:Preparing to commence simulation
23:08:08:WU01:FS01:0xa4:- Looking at optimizations...
23:08:08:WU01:FS01:0xa4:- Created dyn
23:08:08:WU01:FS01:0xa4:- Files status OK
23:08:08:WU01:FS01:0xa4:- Expanded 1276947 -> 1751180 (decompressed 137.1 percent)
23:08:08:WU01:FS01:0xa4:Called DecompressByteArray: compressed_data_size=1276947 data_size=1751180, decompressed_data_size=1751180 diff=0
23:08:08:WU01:FS01:0xa4:- Digital signature verified
23:08:08:WU01:FS01:0xa4:
23:08:08:WU01:FS01:0xa4:Project: 7904 (Run 14, Clone 2, Gen 17)
23:08:08:WU01:FS01:0xa4:
23:08:08:WU01:FS01:0xa4:Assembly optimizations on if available.
23:08:08:WU01:FS01:0xa4:Entering M.D.
23:08:14:WU01:FS01:0xa4:Mapping NT from 8 to 8 
23:08:14:WU01:FS01:0xa4:Completed 0 out of 1000000 steps  (0%)
23:12:13:WU01:FS01:0xa4:Completed 10000 out of 1000000 steps  (1%)
23:16:09:WU01:FS01:0xa4:Completed 20000 out of 1000000 steps  (2%)
23:20:03:WU01:FS01:0xa4:Completed 30000 out of 1000000 steps  (3%)
23:23:53:WU01:FS01:0xa4:Completed 40000 out of 1000000 steps  (4%)
23:27:43:WU01:FS01:0xa4:Completed 50000 out of 1000000 steps  (5%)
23:31:36:WU01:FS01:0xa4:Completed 60000 out of 1000000 steps  (6%)
23:35:28:WU01:FS01:0xa4:Completed 70000 out of 1000000 steps  (7%)
23:39:25:WU01:FS01:0xa4:Completed 80000 out of 1000000 steps  (8%)
23:43:17:WU01:FS01:0xa4:Completed 90000 out of 1000000 steps  (9%)
23:47:29:WU01:FS01:0xa4:Completed 100000 out of 1000000 steps  (10%)
23:51:35:WU01:FS01:0xa4:Completed 110000 out of 1000000 steps  (11%)
23:55:43:WU01:FS01:0xa4:Completed 120000 out of 1000000 steps  (12%)
23:59:42:WU01:FS01:0xa4:Completed 130000 out of 1000000 steps  (13%)
00:03:43:WU01:FS01:0xa4:Completed 140000 out of 1000000 steps  (14%)
00:07:39:WU01:FS01:0xa4:Completed 150000 out of 1000000 steps  (15%)
The client first started with project 7809, but at 20%, for some unknown reason, an UNSTABLE_MACHINE error showed up. The client stopped the unit, sent the completed part to the server, and requested a new unit: project 7904. A little later there was another UNSTABLE_MACHINE error; the client stopped that unit, sent what was already done, and requested a new unit (which is working right now).

1. Why does this error happen?
2. Will I receive points for these partial WUs?
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Hardware configuration: Intel i7-4770K @ 4.5 GHz, 16 GB DDR3-2133 Corsair Vengence (black/red), EVGA GTX 760 @ 1200 MHz, on an Asus Maximus VI Hero MB (black/red), in a blacked out Antec P280 Tower, with a Xigmatek Night Hawk (black) HSF, Seasonic 760w Platinum (black case, sleeves, wires), 4 SilenX 120mm Case fans with silicon fan gaskets and silicon mounts (all black), a 512GB Samsung SSD (black), and a 2TB Black Western Digital HD (silver/black).
Location: Arizona

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by 7im »

Hello josgba2002, welcome to the folding forum.

It looks like you are folding on Kepler hardware using a beta client setting. Hopefully whoever suggested this configuration also warned you that you must be a member of the beta team to get support for these beta work units.
How to provide enough information to get helpful support
Tell me and I forget. Teach me and I remember. Involve me and I learn.
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Hardware configuration: Machine #1:

Intel Q9450; 2x2GB=8GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460; Windows Server 2008 X64 (SP1).

Machine #2:

Intel Q6600; 2x2GB=4GB Ram; Gigabyte GA-X48-DS4 Motherboard; PC Power and Cooling Q750 PS; 2x GTX 460 video card; Windows 7 X64.

Machine 3:

Dell Dimension 8400, 3.2GHz P4, 4x512MB RAM, Video card GTX 460, Windows 7 X32

I am currently folding just on the 5x GTX 460's for approx. 70K PPD
Location: Salem. OR USA

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by P5-133XL »

Just a side note: p7809 and p7904 are SMP projects running on the A4 core and thus have nothing to do with Kepler. That being said, p7904 is still beta and needs to be dealt with in the beta forums. p7809 has been released to the general public.

Typically these types of errors occur because of either OC'ing, excessive heat, RAM issues, or bad hardware like your MB. If you are OC'ing, even if you are absolutely positive that it is stable, stop and fold for a while to see if it goes away. Even underclocking is good to test. If it does stop then you have a likely cause and can try OC'ing at a lower level that isn't causing problems.

Download a temp monitoring program like RealTemp and monitor your core temps in the system tray to check for heat issues. Typically, on a modern CPU, 90C and lower is totally fine as long as you are not OC'ing, but I would much prefer it to be 70C or lower.

If it isn't OC'ing or temperatures, I would next run Memtest86+ for an extended time to see if you have an issue with RAM. You can still have issues with RAM and folding that memtest86+ doesn't detect but that is much less likely.
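When testing with and without the overclock, it can help to count how often the core bails out rather than eyeballing the log. Below is a minimal Python sketch, assuming the log lines look like the ones posted at the top of this thread. The function name `summarize` and the regular expressions are illustrative only and are not part of FAHClient.

```python
import re
from collections import Counter

# Line shapes copied from the log in the first post; other client
# versions may print these lines differently (assumption).
RETURN_RE = re.compile(r"FahCore returned: (\w+) \((\d+) = 0x[0-9a-fA-F]+\)")
RESULT_RE = re.compile(
    r"Sending unit results: .*?error:(\w+) "
    r"project:(\d+) run:(\d+) clone:(\d+) gen:(\d+)"
)

def summarize(log_text):
    """Return (Counter of FahCore return names, list of FAULTY PRCG tuples)."""
    codes = Counter()
    faulty = []
    for line in log_text.splitlines():
        m = RETURN_RE.search(line)
        if m:
            codes[m.group(1)] += 1
        m = RESULT_RE.search(line)
        if m and m.group(1) == "FAULTY":
            # Keep (project, run, clone, gen) as integers.
            faulty.append(tuple(int(g) for g in m.groups()[1:]))
    return codes, faulty

# Four lines taken verbatim from the log in the first post.
sample = """\
22:51:22:WU01:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
22:51:22:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:7809 run:7 clone:51 gen:42 core:0xa4 unit:0x000000340a3b1e874e3113f085bc3637
23:07:51:WU02:FS01:FahCore returned: UNSTABLE_MACHINE (122 = 0x7a)
23:07:51:WU02:FS01:Sending unit results: id:02 state:SEND error:FAULTY project:7904 run:20 clone:1 gen:26 core:0xa4 unit:0x0000002000ac9c224e4d32f04a3cb715
"""

codes, faulty = summarize(sample)
print(codes)
print(faulty)
```

Run it on the full log text before and after changing the OC settings and compare the counts. A drop in UNSTABLE_MACHINE returns after removing the overclock would point at the OC, though a few hours of folding is a small sample.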
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Location: Salem. OR USA

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by P5-133XL »

Hi josgba2002 (team 111065),
Your WU (P7809 R7 C51 G42) was added to the stats database on 2012-09-07 16:07:50 for 0 points of credit.

Hi josgba2002 (team 111065),
Your WU (P7904 R20 C1 G26) was added to the stats database on 2012-09-07 17:08:19 for 0 points of credit.
josgba2002
Posts: 3
Joined: Sat Sep 08, 2012 12:16 am

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by josgba2002 »

Hello. Thank you for your reply. I'm always monitoring my temps (around 56-60C while folding intensively), so there isn't any problem there. BTW, my PC has been folding for long periods without errors, so this is really weird. As for points, the first WU (7809) was folding for 2 hours and completed 20% (this was sent to the servers so someone else will continue). Will I not receive anything for that?
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by bruce »

josgba2002 wrote: Hello. Thank you for your reply. I'm always monitoring my temps (around 56-60C while folding intensively), so there isn't any problem there. BTW, my PC has been folding for long periods without errors, so this is really weird. As for points, the first WU (7809) was folding for 2 hours and completed 20% (this was sent to the servers so someone else will continue). Will I not receive anything for that?
If you're monitoring temperatures, that probably means that you're overclocking and that also means that you cannot rightfully say that "there isn't any problem" unless you remove all overclocking and try again. The FahCores are known to put more stress on the SSE components of your hardware than almost any of the "normal" overclocking benchmarks. You'll find many reports on this from folks who started out saying they were certain their overclock was stable who later found out it was not stable when running other benchmarks or when they allowed greater margins. Aside from temperature, how did you determine your system was stable when FAH is telling you it is not?
josgba2002
Posts: 3
Joined: Sat Sep 08, 2012 12:16 am

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by josgba2002 »

Previous WUs folded without any problem. This is the first time. If the system were unstable, FAH would have been giving errors since the first day.
Joe_H
Site Admin
Posts: 7856
Joined: Tue Apr 21, 2009 4:41 pm
Hardware configuration: Mac Pro 2.8 quad 12 GB smp4
MacBook Pro 2.9 i7 8 GB smp2
Location: W. MA

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by Joe_H »

josgba2002 wrote: As for points, the first WU (7809) was folding for 2 hours and completed 20% (this was sent to the servers so someone else will continue). Will I not receive anything for that?
The WU will be reassigned to someone else to start over from the beginning. As was already reported, you were given 0 points. This is the other side of the Quick Return Bonus program: successfully completing a WU before the preferred deadline gives you more than the base points with the QRB, but WUs that fail usually give no points.
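For anyone wondering how the two sides of the QRB fit together, here is a rough Python sketch. The square-root form is the bonus formula as I recall it from the Folding@home points FAQ, so treat it as an assumption and check the FAQ before relying on it. The function name, the parameter names, and every number below are made up for illustration; none of them come from this thread or from projects 7809/7904.

```python
import math

def estimated_points(base_points, k, deadline_days, elapsed_days,
                     completed, before_preferred_deadline=True):
    """Rough points estimate under the assumptions stated above.

    base_points, k and deadline_days are per-project values (hypothetical
    here). A failed WU is modeled as 0 points, matching the two 0-point
    results reported in this thread; the real policy may have exceptions.
    """
    if not completed:
        return 0.0
    if not before_preferred_deadline:
        # No bonus once the preferred deadline has passed (assumption).
        return float(base_points)
    bonus = math.sqrt(k * deadline_days / elapsed_days)
    # The bonus never reduces credit below the base points.
    return base_points * max(1.0, bonus)

# Hypothetical project: 1000 base points, k = 2.0, 10-day deadline.
print(estimated_points(1000, 2.0, 10.0, 5.0, completed=True))
print(estimated_points(1000, 2.0, 10.0, 5.0, completed=False))
```

The point of the sketch is only the shape: faster successful returns earn more, and a unit that ends in UNSTABLE_MACHINE earns nothing no matter how far it got.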

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3
PinHead
Posts: 285
Joined: Tue Jan 24, 2012 3:43 am
Hardware configuration: Quad Q9550 2.83 contains the GPU 57xx - running SMP and GPU
Quad Q6700 2.66 running just SMP
2P 32core Interlagos SMP on linux

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by PinHead »

josgba2002 wrote: Previous WUs folded without any problem. This is the first time. If the system were unstable, FAH would have been giving errors since the first day.
Well, actually, no two projects and no two WUs (project, run, clone, gen) are the same. Some might generate more heat than others.
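Since a WU is identified by its (project, run, clone, gen) tuple, it is worth comparing those four numbers exactly when matching a log entry against a stats message. A small Python sketch follows, assuming the two text shapes that appear in this thread; the `PRCG` type and `parse_prcg` helper are hypothetical names, not anything from the client.

```python
import re
from typing import NamedTuple, Optional

class PRCG(NamedTuple):
    project: int
    run: int
    clone: int
    gen: int

# Long form as printed by the FahCore in the log, short form as printed
# in the stats messages quoted in this thread.
LONG_RE = re.compile(r"Project:\s*(\d+)\s*\(Run (\d+), Clone (\d+), Gen (\d+)\)")
SHORT_RE = re.compile(r"P(\d+) R(\d+) C(\d+) G(\d+)")

def parse_prcg(text: str) -> Optional[PRCG]:
    """Extract the first PRCG found in text, or None if there is none."""
    for pattern in (LONG_RE, SHORT_RE):
        m = pattern.search(text)
        if m:
            return PRCG(*(int(g) for g in m.groups()))
    return None

core_line = "Project: 7904 (Run 20, Clone 1, Gen 26)"
stats_line = "Your WU (P7904 R20 C1 G26) was added to the stats database"
next_wu = "Project: 7904 (Run 14, Clone 2, Gen 17)"

# Same WU written two ways, then a different WU from the same project.
print(parse_prcg(core_line) == parse_prcg(stats_line))
print(parse_prcg(core_line) == parse_prcg(next_wu))
```

The third line is the unit that was still folding at the end of the posted log: same project, different WU, so its behavior says little about the one that failed.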
P5-133XL
Posts: 2948
Joined: Sun Dec 02, 2007 4:36 am
Location: Salem. OR USA

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by P5-133XL »

I will agree that those temps are fine, but that is not the only possible problem. Just because your machine has folded successfully in its current state does not mean it is currently stable. Some components, especially things like capacitors on the motherboard, deteriorate over time. Those capacitors keep the timings on your motherboard buses stable. You may merely be experiencing the start of new problems.

Again, I suggest that you turn off the OC to test whether it is a factor. I'm not saying that it needs to be permanently turned off, merely that you should check. If the folding problem goes away, then the OC is likely a factor; if it doesn't, then perhaps something else, like RAM, is the primary issue.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by bruce »

The real test is whether the WUs are finished successfully after being reassigned. There are bad WUs, particularly if you set yourself up for beta testing but if it's the WU that's unstable, it should produce the same sort of failure on anybody's computer.

We have to give the WU time enough to be reassigned and to be returned. I'll flag this topic for a Moderator to recheck your WUs (P7809 R7 C51 G42) and (P7904 R20 C1 G26) in a few weeks but you may have already figured out something by then. (Preferred deadlines of 25.6 days and 12.0 days.)
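To get a rough idea of when a recheck makes sense, the preferred deadlines can be added to the download times from the posted log. This is only a sketch in Python: the clock for a reassigned copy starts when someone else downloads it, which this thread does not tell us, so the original download times are used as a stand-in (assumption). The helper name `recheck_after` is made up.

```python
from datetime import datetime, timedelta, timezone

def recheck_after(downloaded_utc, preferred_days):
    """Earliest sensible recheck time: download time plus preferred deadline."""
    return downloaded_utc + timedelta(days=preferred_days)

# "Download complete" times from the log in the first post; the date comes
# from the "Log Started 2012-09-07" header, and log times are UTC.
wu_7809 = datetime(2012, 9, 7, 20, 7, 33, tzinfo=timezone.utc)   # P7809 R7 C51 G42
wu_7904 = datetime(2012, 9, 7, 22, 51, 36, tzinfo=timezone.utc)  # P7904 R20 C1 G26

print("P7809 recheck after:", recheck_after(wu_7809, 25.6).isoformat())
print("P7904 recheck after:", recheck_after(wu_7904, 12.0).isoformat())
```

Because the reassigned copies start later than these timestamps, the real dates will be somewhat later than what this prints.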
7im
Posts: 10189
Joined: Thu Nov 29, 2007 4:30 pm
Location: Arizona

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by 7im »

Past stability is no guarantee of future stability. FAH continues to release larger and more complex work units, as well as more demanding FahCores. OC settings that once worked well may no longer work so well, even without the bit rot.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by bruce »

Successfully completed by someone else:
Hi xxxx (team xxxxx),
Your WU (P7809 R7 C51 G42) was added to the stats database on 2012-09-12 02:08:32 for 5129.09 points of credit.

At this point, we have no more data on Project: 7904 (Run 20, Clone 1, Gen 26).

07-24-2013 Mod Note: Project: 7904 (Run 20, Clone 1, Gen 26) was never completed successfully.
DoctorsSon
Posts: 56
Joined: Thu Apr 24, 2008 5:04 pm

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by DoctorsSon »

I have completed 7809's on a couple of my folders with no issues.
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project:7809 & Project:7904 - UNSTABLE MACHINE

Post by bruce »

I have no doubt that maybe 100,000 WUs from Project 7809 have been completed successfully, and somewhat fewer from Project 7904 -- maybe as few as 50,000. I interpreted the original question to be about what happened to the two specific WUs that josgba2002 failed to complete. (Yes, his title didn't exactly say that.)