Project: 10041 (Run 903, Clone 0, Gen 18)


Postby artoar_11 » Mon Jul 26, 2010 9:17 am

[07:07:34] Folding@home Core Shutdown: BAD_WORK_UNIT

This PC has been dedicated to FAH for a week. This is the first incomplete WU.

Code:
--- Opening Log file [July 26 06:50:45 UTC]


# Windows CPU Console Edition #################################################
###############################################################################

                       Folding@Home Client Version 6.23

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: F:\Tempo\FAH\2_FAH
Executable: F:\Tempo\FAH\2_FAH\FAH_console2.exe
Arguments: -verbosity 9 -forceasm

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[06:50:45] - Ask before connecting: No
[06:50:45] - User name: artoar_home (Team 32435)
[06:50:45] - User ID: ***********
[06:50:45] - Machine ID: 2
[06:50:45]
[06:50:45] Loaded queue successfully.
[06:50:45]
[06:50:45] + Processing work unit
[06:50:45] Core required: FahCore_b4.exe
[06:50:45] Core found.
[06:50:45] - Autosending finished units... [July 26 06:50:45 UTC]
[06:50:45] Trying to send all finished work units
[06:50:45] + No unsent completed units remaining.
[06:50:45] Working on queue slot 06 [July 26 06:50:45 UTC]
[06:50:45] - Autosend completed
[06:50:45] + Working ...
[06:50:45] - Calling '.\FahCore_b4.exe -dir work/ -suffix 06 -nocpulock -checkpoint 6 -forceasm -verbose -lifeline 3572 -version 623'

[06:50:50] *********************** Log Started 26/Jul/2010 06:50:50 ***********************
[06:50:50] ************************** ProtoMol Folding@Home Core **************************
[06:50:50]   Version: 25
[06:50:50]      Type: 180
[06:50:50]      Core: ProtoMol
[06:50:50]   Website: http://folding.stanford.edu/
[06:50:50] Copyright: (c) 2009 Stanford University
[06:50:50]    Author: Joseph Coffland <joseph@cauldrondevelopment.com>
[06:50:50]      Args: -dir work/ -suffix 06 -nocpulock -checkpoint 6 -forceasm -verbose
[06:50:50]            -lifeline 3572 -version 623
[06:50:50] ************************************ Build *************************************
[06:50:50]      Date: May 18 2010
[06:50:50]      Time: 23:43:52
[06:50:50]  Revision: 1819
[06:50:50]  Compiler: Intel(R) C++ MSVC 1500 mode 1110
[06:50:50]   Options: /TP /nologo /EHsc /wd4297 /wd4103 /wd1786 /arch:IA32 /Ox
[06:50:50]            /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qrestrict /MT
[06:50:50]   Defines: _CRT_SECURE_NO_WARNINGS NDEBUG HAVE_GEEKINFO BOOST_ALL_NO_LIB
[06:50:50]            XML_STATIC HAVE_EXPAT HAVE_OPENSSL HAVE_LIBFAH HAVE_SIMTK_LAPACK
[06:50:50]  Platform: Windows XP
[06:50:50]      Bits: 32
[06:50:50]      Mode: Release
[06:50:50] ************************************ System ************************************
[06:50:50]        OS: Microsoft Windows 7 Professional
[06:50:50]       CPU: Intel(R) Core(TM)2 CPU 6400 @ 2.13GHz
[06:50:50]    CPU ID: GenuineIntel Family 6 Model 15 Stepping 6
[06:50:50]      CPUs: 2 Logical, 1 Physical
[06:50:50]    Memory: 2.00 GB
[06:50:50]   Threads: Windows
[06:50:50] ********************************************************************************
[06:50:50] Project: 10041 (Run 903, Clone 0, Gen 18)
[06:50:50] Unit: 0x0000001b0001329c4be2f6fb0000076f
[06:50:50] User: 0x00000000000000000000000000000000
[06:50:50] Machine: 2
[06:50:50] Digital signatures verified
[06:50:52] GUI Server started
[06:50:52] Completed 1067700 out of 2000000 steps (53%)
[06:59:09] Completed 1080000 out of 2000000 steps (54%)
[07:07:34] ERROR: ProtoMol ERROR: Corrupt DCD file. Size is 278988, should be >= 285624.
[07:07:34] Saving result file logfile_06.txt
[07:07:34] Saving result file checkpt
[07:07:34] Saving result file checkpt.crc
[07:07:34] Saving result file log.txt
[07:07:34] Saving result file protomol.conf
[07:07:34] Saving result file ww.dcd
[07:07:34] Saving result file ww_structure_14_charm.2204.pos
[07:07:34] Saving result file ww_structure_14_charm.2204.vel
[07:07:34] WARNING: While cleaning up: 0: Failed to remove directory '06': boost::filesystem::remove: The process cannot access the file because it is being used by another process: "06\ww.dcd"
[07:07:34] Folding@home Core Shutdown: BAD_WORK_UNIT
[07:07:37] CoreStatus = 72 (114)
[07:07:37] Sending work to server
[07:07:37] Project: 10041 (Run 903, Clone 0, Gen 18)
[07:07:37] - Read packet limit of 540015616... Set to 524286976.


[07:07:37] + Attempting to send results [July 26 07:07:37 UTC]
[07:07:37] - Reading file work/wuresults_06.dat from core
[07:07:37]   (Read 285279 bytes from disk)
[07:07:37] Connecting to http://129.74.85.15:8080/
[07:08:09] - Couldn't send HTTP request to server
[07:08:09] + Could not connect to Work Server (results)
[07:08:09]     (129.74.85.15:8080)
[07:08:09] + Retrying using alternative port
[07:08:09] Connecting to http://129.74.85.15:80/
[07:08:11] - Couldn't send HTTP request to server
[07:08:11] + Could not connect to Work Server (results)
[07:08:11]     (129.74.85.15:80)
[07:08:11] - Error: Could not transmit unit 06 (completed July 26) to work server.
[07:08:11] - 1 failed uploads of this unit.
[07:08:11]   Keeping unit 06 in queue.
[07:08:11] Trying to send all finished work units
[07:08:11] Project: 10041 (Run 903, Clone 0, Gen 18)
[07:08:11] - Read packet limit of 540015616... Set to 524286976.


[07:08:11] + Attempting to send results [July 26 07:08:11 UTC]
[07:08:11] - Reading file work/wuresults_06.dat from core
[07:08:11]   (Read 285279 bytes from disk)
[07:08:11] Connecting to http://129.74.85.15:8080/
[07:09:02] - Couldn't send HTTP request to server
[07:09:02] + Could not connect to Work Server (results)
[07:09:02]     (129.74.85.15:8080)
[07:09:02] + Retrying using alternative port
[07:09:02] Connecting to http://129.74.85.15:80/
[07:09:03] - Couldn't send HTTP request to server
[07:09:03] + Could not connect to Work Server (results)
[07:09:03]     (129.74.85.15:80)
[07:09:03] - Error: Could not transmit unit 06 (completed July 26) to work server.
[07:09:03] - 2 failed uploads of this unit.
[07:09:03] - Read packet limit of 540015616... Set to 524286976.


[07:09:03] + Attempting to send results [July 26 07:09:03 UTC]
[07:09:03] - Reading file work/wuresults_06.dat from core
[07:09:03]   (Read 285279 bytes from disk)
[07:09:03] Connecting to http://129.74.85.16:8080/
[07:09:06] Posted data.
[07:09:06] Initial: 0000; - Uploaded at ~93 kB/s
[07:09:06] - Averaged speed for that direction ~75 kB/s
[07:09:06] + Results successfully sent
[07:09:06] Thank you for your contribution to Folding@Home.
[07:09:06]   Successfully sent unit 06 to Collection server.
[07:09:06] + Sent 1 of 1 completed units to the server
artoar_11
 
Posts: 26
Joined: Sun Nov 22, 2009 9:42 pm
Location: Bulgaria /Team #32435

Re: Project: 10041 (Run 903, Clone 0, Gen 18)

Postby bruce » Mon Jul 26, 2010 9:27 am

You did get partial credit for your efforts and somebody else completed it successfully, so sorry, this is not a BAD_WORK_UNIT.

Hi xxxx (team xxxx),
Your WU (P10041 R903 C0 G18) was added to the stats database on 2010-07-09 08:06:49 for 168.96 points of credit.
Hi artoar_home (team 32435),
Your WU (P10041 R903 C0 G18) was added to the stats database on 2010-07-26 01:07:02 for 92.3 points of credit.

I can't be certain, but the message "ERROR: ProtoMol ERROR: Corrupt DCD file. Size is 278988, should be >= 285624." is the sort of thing we sometimes see when an antivirus program decides it wants to throw away some essential portions of FAH's data files.
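
To put a number on how much of the DCD file went missing, here is a minimal Python sketch that pulls the two sizes out of that error line. The regular expression and the helper name `dcd_shortfall` are assumptions written for this thread; they are not part of the FAH client or of ProtoMol.

```python
import re

# Matches the ProtoMol message quoted in this thread.
# The pattern is an assumption based on that single log line.
DCD_ERROR = re.compile(r"Corrupt DCD file\. Size is (\d+), should be >= (\d+)")


def dcd_shortfall(log_line):
    """Return (actual, expected_min, missing_bytes), or None if the line does not match."""
    match = DCD_ERROR.search(log_line)
    if match is None:
        return None
    actual = int(match.group(1))
    expected_min = int(match.group(2))
    return actual, expected_min, expected_min - actual


line = ("[07:07:34] ERROR: ProtoMol ERROR: Corrupt DCD file. "
        "Size is 278988, should be >= 285624.")
print(dcd_shortfall(line))  # (278988, 285624, 6636)
```

For the log above, the file is 6636 bytes shorter than the core expected, which fits a write that was cut off or partly discarded rather than a file that was never written.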
bruce
Site Admin
 
Posts: 8996
Joined: Thu Nov 29, 2007 11:13 pm
Location: So. Cal.

Re: Project: 10041 (Run 903, Clone 0, Gen 18)

Postby John_Weatherman » Mon Jul 26, 2010 9:28 am

This is a known error that occurs with the b4 core after a shutdown: Windows closes the client before it has finished saving. See this thread: http://foldingforum.org/viewtopic.php?f=47&t=14793&start=15
The only way to be sure is to manually close the client before closing Windows.
EDIT - Bruce posted while I was writing. It might be the AV, but it's not your machine :D
John_Weatherman
 
Posts: 325
Joined: Sun Dec 02, 2007 5:31 am
Location: The back of beyond, in the middle of nowhere.

Re: Project: 10041 (Run 903, Clone 0, Gen 18)

Postby artoar_11 » Mon Jul 26, 2010 10:07 am

So after Ctrl+C I must wait longer before shutting down Windows. Did I understand you correctly?
A similar thing happens with b4 after a power failure. The WU usually restarts from the beginning, or the client downloads a new WU.
My AV is Microsoft Security Essentials. So far I have not noticed any problems.

Thanks for the explanations.
artoar_11
 
Posts: 26
Joined: Sun Nov 22, 2009 9:42 pm
Location: Bulgaria /Team #32435

Re: Project: 10041 (Run 903, Clone 0, Gen 18)

Postby PantherX » Mon Jul 26, 2010 10:37 am

artoar_11 wrote:So after Ctrl+C I must wait longer before shutting down Windows. Did I understand you correctly?...
If I press Ctrl+C, I wait a couple of minutes before I restart the client or the system. I prefer to play it safe :D

artoar_11 wrote:...My AV is Microsoft Security Essentials. So far I have not noticed any problems...
I am using it too (64-bit), and it has never given me any problems with SMP2 WUs, either normal or bigadv.
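
The "wait a couple of minutes" rule could also be checked instead of guessed. The Python sketch below polls a file until its size and modification time have stopped changing. The helper name `wait_until_stable`, the default timings, and the idea of pointing it at a file such as `work/06/ww.dcd` are all assumptions for illustration; nothing here is a feature of the FAH client, and a quiet file is only a hint that the core has finished saving, not a guarantee.

```python
import os
import time


def wait_until_stable(path, quiet_seconds=5.0, poll=0.5, timeout=120.0):
    """Return True once (size, mtime) of `path` has stayed unchanged for
    `quiet_seconds`; return False if `timeout` seconds pass first.

    Raises FileNotFoundError if the file does not exist.
    """
    deadline = time.monotonic() + timeout
    last_seen = None
    stable_since = time.monotonic()
    while time.monotonic() < deadline:
        info = os.stat(path)
        current = (info.st_size, info.st_mtime_ns)
        now = time.monotonic()
        if current != last_seen:
            # The file changed (or this is the first look): restart the quiet timer.
            last_seen = current
            stable_since = now
        elif now - stable_since >= quiet_seconds:
            return True
        time.sleep(poll)
    return False
```

A possible use, after pressing Ctrl+C in the client window, would be to call `wait_until_stable("work/06/ww.dcd")` (path assumed from the log above) and only shut Windows down once it returns True.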
PantherX
 
Posts: 1397
Joined: Wed Dec 23, 2009 10:33 am
Location: Jeddah, Kingdom of Saudi Arabia

Re: Project: 10041 (Run 903, Clone 0, Gen 18)

Postby John_Weatherman » Mon Jul 26, 2010 3:31 pm

artoar_11 wrote:So after Ctrl+C I must wait longer before shutting down Windows. Did I understand you correctly?
A similar thing happens with b4 after a power failure. The WU usually restarts from the beginning, or the client downloads a new WU.


Yes, that's correct. The client will dump the WU and get a new one. Hopefully the new client will solve this problem.
John_Weatherman
 
Posts: 325
Joined: Sun Dec 02, 2007 5:31 am
Location: The back of beyond, in the middle of nowhere.

Re: Project: 10041 (Run 903, Clone 0, Gen 18)

Postby codysluder » Mon Jul 26, 2010 5:48 pm

John_Weatherman wrote:Yes, that's correct. The client will dump the WU and get a new one. Hopefully the new client will solve this problem.


That error will require a new version of FahCore b4, not a new client. I'm not sure what's taking them so long. With this type of error, I would have expected an updated core by now.

The new client is a major rewrite and that can be expected to take a long time with a lot of unpredictability thrown in there, too.
codysluder
 
Posts: 1664
Joined: Sun Dec 02, 2007 1:43 pm

