UNSTABLE_MACHINE stock 275/6.23


Post by cheechi » Thu May 03, 2012 11:27 pm

Windows 7 64-bit
GTX 275 at stock speed (whatever came out of the box).
CPU SMP client is also running with no issues.
Drivers: 285.62

Several WUs have completed between the errors attached below. Temperatures are nothing out of the ordinary, and the card still plays games fine when not folding. A reboot did not fix it, and no updates or drivers were installed recently. This is generally a stable PC.

I hope this helps. It does not appear to be an issue with the machine: it is working on Project 5769 (Run 6, Clone 204, Gen 2471) now with no issues and at normal speed. The CPU client is also still going and has had no interruption. The issue also seems independent of how long I leave the GPU idle: one failure happened on boot, and another had 12 units between it and the next.

Code:
[01:09:52] Folding@home Core Shutdown: FINISHED_UNIT
[01:09:54] CoreStatus = 64 (100)
[01:09:54] Sending work to server
[01:09:54] Project: 5768 (Run 1, Clone 215, Gen 3239)
[01:09:54] - Read packet limit of 540015616... Set to 524286976.


[01:09:54] + Attempting to send results [May 3 01:09:54 UTC]
[01:09:55] + Results successfully sent
[01:09:55] Thank you for your contribution to Folding@Home.
[01:09:55] + Number of Units Completed: 10018

[01:09:59] - Preparing to get new work unit...
[01:09:59] + Attempting to get work packet
[01:09:59] - Connecting to assignment server
[01:10:00] - Successful: assigned to (171.67.108.11).
[01:10:00] + News From Folding@Home: Welcome to Folding@Home
[01:10:00] Loaded queue successfully.
[01:10:01] + Closed connections
[01:10:01]
[01:10:01] + Processing work unit
[01:10:01] Core required: FahCore_11.exe
[01:10:01] Core found.
[01:10:01] Working on queue slot 03 [May 3 01:10:01 UTC]
[01:10:01] + Working ...
[01:10:01]
[01:10:01] *------------------------------*
[01:10:01] Folding@Home GPU Core
[01:10:01] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[01:10:01]
[01:10:01] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:10:01] Build host: amoeba
[01:10:01] Board Type: Nvidia
[01:10:01] Core      :
[01:10:01] Preparing to commence simulation
[01:10:01] - Looking at optimizations...
[01:10:01] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[01:10:01] - Created dyn
[01:10:01] - Files status OK
[01:10:01] - Expanded 45505 -> 251112 (decompressed 551.8 percent)
[01:10:01] Called DecompressByteArray: compressed_data_size=45505 data_size=251112, decompressed_data_size=251112 diff=0
[01:10:01] - Digital signature verified
[01:10:01]
[01:10:01] Project: 5770 (Run 0, Clone 282, Gen 5214)
[01:10:01]
[01:10:01] Assembly optimizations on if available.
[01:10:01] Entering M.D.
[01:10:07] Tpr hash work/wudata_03.tpr:  2889382016 3358494243 4002536681 4042937175 3274701607
[01:10:07]
[01:10:07] Calling fah_main args: 14 usage=100
[01:10:07]
[01:10:08] Working on Protein
[01:10:08] Run: exception thrown during GuardedRun
[01:10:08] Run: exception thrown in GuardedRun -- Gromacs cannot continue further.
[01:10:08] Going to send back what have done -- stepsTotalG=0
[01:10:08] Work fraction=0.0000 steps=0.
[01:10:12] logfile size=0 infoLength=0 edr=0 trr=23
[01:10:12] + Opened results file
[01:10:12] - Writing 635 bytes of core data to disk...
[01:10:12] Done: 123 -> 124 (compressed to 100.8 percent)
[01:10:12]   ... Done.
[01:10:12] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[01:10:12]
[01:10:12] Folding@home Core Shutdown: UNSTABLE_MACHINE


Code:
[07:08:59] Folding@home Core Shutdown: FINISHED_UNIT
[07:09:02] CoreStatus = 64 (100)
[07:09:02] Sending work to server
[07:09:02] Project: 5765 (Run 2, Clone 201, Gen 3906)
[07:09:02] - Read packet limit of 540015616... Set to 524286976.


[07:09:02] + Attempting to send results [May 3 07:09:02 UTC]
[07:09:03] + Results successfully sent
[07:09:03] Thank you for your contribution to Folding@Home.
[07:09:03] + Number of Units Completed: 10021

[07:09:07] - Preparing to get new work unit...
[07:09:07] + Attempting to get work packet
[07:09:07] - Connecting to assignment server
[07:09:07] - Successful: assigned to (171.67.108.11).
[07:09:07] + News From Folding@Home: Welcome to Folding@Home
[07:09:08] Loaded queue successfully.
[07:09:08] + Closed connections
[07:09:08]
[07:09:08] + Processing work unit
[07:09:08] Core required: FahCore_11.exe
[07:09:08] Core found.
[07:09:08] Working on queue slot 07 [May 3 07:09:08 UTC]
[07:09:08] + Working ...
[07:09:09]
[07:09:09] *------------------------------*
[07:09:09] Folding@Home GPU Core
[07:09:09] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[07:09:09]
[07:09:09] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[07:09:09] Build host: amoeba
[07:09:09] Board Type: Nvidia
[07:09:09] Core      :
[07:09:09] Preparing to commence simulation
[07:09:09] - Looking at optimizations...
[07:09:09] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[07:09:09] - Created dyn
[07:09:09] - Files status OK
[07:09:09] - Expanded 46692 -> 252912 (decompressed 541.6 percent)
[07:09:09] Called DecompressByteArray: compressed_data_size=46692 data_size=252912, decompressed_data_size=252912 diff=0
[07:09:09] - Digital signature verified
[07:09:09]
[07:09:09] Project: 5765 (Run 9, Clone 341, Gen 3342)
[07:09:09]
[07:09:09] Assembly optimizations on if available.
[07:09:09] Entering M.D.
[07:09:15] Tpr hash work/wudata_07.tpr:  2026299323 2286864549 2955327270 3042569252 424047811
[07:09:15]
[07:09:15] Calling fah_main args: 14 usage=100
[07:09:15]
[07:09:15] Working on Protein
[07:09:16] Run: exception thrown during GuardedRun
[07:09:16] Run: exception thrown in GuardedRun -- Gromacs cannot continue further.
[07:09:16] Going to send back what have done -- stepsTotalG=0
[07:09:16] Work fraction=0.0000 steps=0.
[07:09:20] logfile size=0 infoLength=0 edr=0 trr=23
[07:09:20] + Opened results file
[07:09:20] - Writing 635 bytes of core data to disk...
[07:09:20] Done: 123 -> 124 (compressed to 100.8 percent)
[07:09:20]   ... Done.
[07:09:20] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[07:09:20]
[07:09:20] Folding@home Core Shutdown: UNSTABLE_MACHINE

Folding@Home Client Shutdown.


--- Opening Log file [May 3 23:07:42 UTC]


# Windows GPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: D:\fah\gpu
Executable: D:\fah\gpu\Folding@home-Win32-GPU.exe


[23:07:42] - Ask before connecting: No
[23:07:42] - User name: cheechi (Team 186785)
[23:07:42] - User ID: 250042D4497857D7
[23:07:42] - Machine ID: 2
[23:07:42]
[23:07:42] Loaded queue successfully.
[23:07:42]
[23:07:42] + Processing work unit
[23:07:42] Core required: FahCore_11.exe
[23:07:42] Core found.
[23:07:42] Working on queue slot 07 [May 3 23:07:42 UTC]
[23:07:42] + Working ...
[23:07:42]
[23:07:42] *------------------------------*
[23:07:42] Folding@Home GPU Core
[23:07:42] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[23:07:42]
[23:07:42] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[23:07:42] Build host: amoeba
[23:07:42] Board Type: Nvidia
[23:07:42] Core      :
[23:07:42] Preparing to commence simulation
[23:07:42] - Looking at optimizations...
[23:07:42] DeleteFrameFiles: successfully deleted file=work/wudata_07.ckp
[23:07:42] - Created dyn
[23:07:42] - Files status OK
[23:07:42] Error: Missing work file=<>
[23:07:42]
[23:07:42] Folding@home Core Shutdown: MISSING_WORK_FILES
[23:07:47] CoreStatus = 74 (116)
[23:07:47] The core could not find the work files specified. Removing from queue
[23:07:47] Deleting current work unit & continuing...
[23:07:51] - Preparing to get new work unit...
[23:07:51] + Attempting to get work packet
[23:07:51] - Connecting to assignment server
[23:07:51] - Successful: assigned to (171.67.108.11).
[23:07:51] + News From Folding@Home: Welcome to Folding@Home
[23:07:51] Loaded queue successfully.
[23:07:52] + Could not connect to Work Server
[23:07:52] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[23:08:08] + Attempting to get work packet
[23:08:08] - Connecting to assignment server
[23:08:09] - Successful: assigned to (171.67.108.11).
[23:08:09] + News From Folding@Home: Welcome to Folding@Home
[23:08:09] Loaded queue successfully.
[23:08:10] + Closed connections
[23:08:15]
[23:08:15] + Processing work unit
[23:08:15] Core required: FahCore_11.exe
[23:08:15] Core found.
[23:08:15] Working on queue slot 08 [May 3 23:08:15 UTC]
[23:08:15] + Working ...
[23:08:15]
[23:08:15] *------------------------------*
[23:08:15] Folding@Home GPU Core
[23:08:15] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[23:08:15]
[23:08:15] Compiler  : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[23:08:15] Build host: amoeba
[23:08:15] Board Type: Nvidia
[23:08:15] Core      :
[23:08:15] Preparing to commence simulation
[23:08:15] - Looking at optimizations...
[23:08:15] DeleteFrameFiles: successfully deleted file=work/wudata_08.ckp
[23:08:15] - Created dyn
[23:08:15] - Files status OK
[23:08:15] - Expanded 45328 -> 251112 (decompressed 553.9 percent)
[23:08:15] Called DecompressByteArray: compressed_data_size=45328 data_size=251112, decompressed_data_size=251112 diff=0
[23:08:15] - Digital signature verified
[23:08:15]
[23:08:15] Project: 5769 (Run 6, Clone 204, Gen 2471)
[23:08:15]
[23:08:15] Assembly optimizations on if available.
[23:08:15] Entering M.D.
[23:08:21] Tpr hash work/wudata_08.tpr:  763721719 2782550799 1611530342 4196164560 3310935512
[23:08:21]
[23:08:21] Calling fah_main args: 14 usage=100
[23:08:21]
[23:08:21] Working on Protein
[23:08:22] Run: exception thrown during GuardedRun
[23:08:22] Run: exception thrown in GuardedRun -- Gromacs cannot continue further.
[23:08:22] Going to send back what have done -- stepsTotalG=0
[23:08:22] Work fraction=0.0000 steps=0.
[23:08:26] logfile size=0 infoLength=0 edr=0 trr=23
[23:08:26] + Opened results file
[23:08:26] - Writing 635 bytes of core data to disk...
[23:08:26] Done: 123 -> 124 (compressed to 100.8 percent)
[23:08:26]   ... Done.
[23:08:26] DeleteFrameFiles: successfully deleted file=work/wudata_08.ckp
[23:08:26]
[23:08:26] Folding@home Core Shutdown: UNSTABLE_MACHINE

Folding@Home Client Shutdown.
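
For anyone wanting to check how the failures are spaced in a longer log, here is a minimal sketch in Python. It is not part of the Folding@home client; the names `CoreShutdown` and `summarize_shutdowns` are made up for illustration, and the patterns assume the v6.23 line formats shown in the excerpts above. It pairs each core shutdown with the project that was running and the last "Number of Units Completed" value seen before it.

```python
import re
from typing import Iterable, List, NamedTuple, Optional


class CoreShutdown(NamedTuple):
    time: str                        # client timestamp, e.g. "01:10:12"
    status: str                      # e.g. "UNSTABLE_MACHINE"
    project: Optional[str]           # project of the unit that was running, if seen
    units_completed: Optional[int]   # last counter value seen before the shutdown


# Patterns assume the line formats in the pasted v6.23 log excerpts.
SHUTDOWN = re.compile(r"\[(\d\d:\d\d:\d\d)\] Folding@home Core Shutdown: (\w+)")
PROJECT = re.compile(r"Project: (\d+ \(Run \d+, Clone \d+, Gen \d+\))")
COMPLETED = re.compile(r"Number of Units Completed: (\d+)")


def summarize_shutdowns(lines: Iterable[str]) -> List[CoreShutdown]:
    """Collect every core shutdown with the project and unit counter in effect."""
    project: Optional[str] = None
    completed: Optional[int] = None
    found: List[CoreShutdown] = []
    for line in lines:
        if "+ Processing work unit" in line:
            project = None  # a new unit is starting; forget the previous project
        m = PROJECT.search(line)
        if m:
            project = m.group(1)
        m = COMPLETED.search(line)
        if m:
            completed = int(m.group(1))
        m = SHUTDOWN.search(line)
        if m:
            found.append(CoreShutdown(m.group(1), m.group(2), project, completed))
    return found


# A few lines copied from the two failures in the log excerpts above.
SAMPLE = [
    "[01:09:55] + Number of Units Completed: 10018",
    "[01:10:01] + Processing work unit",
    "[01:10:01] Project: 5770 (Run 0, Clone 282, Gen 5214)",
    "[01:10:12] Folding@home Core Shutdown: UNSTABLE_MACHINE",
    "[07:09:03] + Number of Units Completed: 10021",
    "[07:09:08] + Processing work unit",
    "[07:09:09] Project: 5765 (Run 9, Clone 341, Gen 3342)",
    "[07:09:20] Folding@home Core Shutdown: UNSTABLE_MACHINE",
]

for entry in summarize_shutdowns(SAMPLE):
    print(entry.time, entry.status, entry.project, entry.units_completed)
```

On these sample lines the counter reads 10018 before the first failure and 10021 before the second. Whether failed units also bump that counter is not something the excerpts show, so treat the difference as a rough spacing only.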

Re: UNSTABLE_MACHINE stock 275/6.23

Post by Leonardo » Fri May 04, 2012 1:28 am

When you say there is no problem with temperatures, do you mean that the GPU core temperatures have not risen, or that the room temperature has not changed? Both?

Re: UNSTABLE_MACHINE stock 275/6.23

Post by cheechi » Fri May 04, 2012 3:39 am

System and GPU temps have both been consistent over the past several days. Neither is inordinately high or low.

I know this is not 100% guaranteed, but after a warm boot, or after not folding for at least several minutes, a temperature problem would probably let the unit run at least a few steps before a NaN/UNSTABLE_MACHINE (UM) error. Since I posted this thread, the same GPU has completed at least 3 units and is still going with no issues.
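
The "a few steps" argument can be checked against the client log directly. The following is a rough Python sketch, assuming the `Work fraction=... steps=N` and `Core Shutdown` line formats seen in the log excerpts earlier in the thread; the function name `steps_before_unstable` is hypothetical and not part of any Folding@home tool. It reports how many steps each unit reached before an UNSTABLE_MACHINE shutdown.

```python
import re
from typing import Iterable, List, Optional

# Patterns assume the v6.23 log line formats quoted earlier in the thread.
STEPS = re.compile(r"Work fraction=[\d.]+ steps=(\d+)")
SHUTDOWN = re.compile(r"Folding@home Core Shutdown: (\w+)")


def steps_before_unstable(lines: Iterable[str]) -> List[Optional[int]]:
    """Return the reported step count for each UNSTABLE_MACHINE shutdown.

    An entry is None when no "Work fraction" line was logged for that unit.
    """
    steps: Optional[int] = None
    results: List[Optional[int]] = []
    for line in lines:
        if "+ Processing work unit" in line:
            steps = None  # new unit: discard the previous unit's step count
        m = STEPS.search(line)
        if m:
            steps = int(m.group(1))
        m = SHUTDOWN.search(line)
        if m and m.group(1) == "UNSTABLE_MACHINE":
            results.append(steps)
    return results


# Lines copied from the three failures in the first post.
SAMPLE = [
    "[01:10:01] + Processing work unit",
    "[01:10:08] Work fraction=0.0000 steps=0.",
    "[01:10:12] Folding@home Core Shutdown: UNSTABLE_MACHINE",
    "[07:09:08] + Processing work unit",
    "[07:09:16] Work fraction=0.0000 steps=0.",
    "[07:09:20] Folding@home Core Shutdown: UNSTABLE_MACHINE",
    "[23:08:15] + Processing work unit",
    "[23:08:22] Work fraction=0.0000 steps=0.",
    "[23:08:26] Folding@home Core Shutdown: UNSTABLE_MACHINE",
]

print(steps_before_unstable(SAMPLE))
```

All three failures in the posted logs report `steps=0`, meaning the core stopped before completing a single step.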