Project: 13734 (Run 2, Clone 0, Gen 213)

Moderators: Site Moderators, FAHC Science Team

Post Reply
parkut
Posts: 364
Joined: Tue Feb 12, 2008 7:33 am
Hardware configuration: Running exclusively Linux headless blades. All are dedicated crunching machines.
Location: SE Michigan, USA

Project: 13734 (Run 2, Clone 0, Gen 213)

Post by parkut »

This particular work unit was found on one of my Linux machines stuck in a loop of start/immediate fail for the last 12 hours.
"cured" the problem by deleting the 'work' directory to purge the 'stuck' work unit. Picked up a new project and back to folding.

Code: Select all

12:57:20:WU00:FS00:Starting
12:57:20:WU00:FS00:Removing old file './work/00/logfile_01-20171212-122620.txt'
-checkpoint 15 -np 8
12:57:20:WU00:FS00:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/fahwebx.stanford.edu/cores/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 00 -suffix 01 -version 704 -lifeline 17611 
12:57:20:WU00:FS00:Started FahCore on PID 19148
12:57:20:WU00:FS00:Core PID:19152
12:57:20:WU00:FS00:FahCore 0xa4 started
12:57:21:WU00:FS00:0xa4:
12:57:21:WU00:FS00:0xa4:*------------------------------*
12:57:21:WU00:FS00:0xa4:Folding@Home Gromacs GB Core
12:57:21:WU00:FS00:0xa4:Version 2.27 (Dec. 15, 2010)
12:57:21:WU00:FS00:0xa4:
12:57:21:WU00:FS00:0xa4:Preparing to commence simulation
12:57:21:WU00:FS00:0xa4:- Ensuring status. Please wait.
12:57:30:WU00:FS00:0xa4:- Looking at optimizations...
12:57:30:WU00:FS00:0xa4:- Working with standard loops on this execution.
12:57:30:WU00:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
12:57:30:WU00:FS00:0xa4:- Expanded 11252 -> 414076 (decompressed 3680.0 percent)
12:57:30:WU00:FS00:0xa4:Called DecompressByteArray: compressed_data_size=11252 data_size=414076, decompressed_data_size=414076 diff=0
12:57:30:WU00:FS00:0xa4:- Digital signature verified
12:57:30:WU00:FS00:0xa4:
12:57:30:WU00:FS00:0xa4:Project: 13734 (Run 2, Clone 0, Gen 213)
12:57:30:WU00:FS00:0xa4:
12:57:30:WU00:FS00:0xa4:Entering M.D.
12:57:36:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 13734 (Run 2, Clone 0, Gen 213)

Post by bruce »

I looked up Project: 13734 and didn't find any results submitted with error reports. That's not surprising, though, since the project is from Temple University and due to a reported problem with the campus firewall, Temple U is not reporting their results back to the master database. I hesitate to take any action until the campus firewall is functioning normally and the stats are collected from there. (See another topic on those servers.)

ASSUMING it's a bad WU, it makes sense to suspend it so others don't waste their time the way you did, but again, that action would require a working connection to the server hosting that project. In the next week or two, if you notice that the points from that server have been credited, send a PM to a Mod or Admin reminding them of this issue.
Post Reply