Project: 6381 (Run 9, Clone 37, Gen 324) failing.

Moderators: Site Moderators, FAHC Science Team

Post Reply
iBozz
Posts: 89
Joined: Wed Nov 26, 2008 7:01 pm
Hardware configuration: iMac (Retina 5K, 27-inch, 2017), 3.8 GHz Quad-Core Intel Core i5, 64 GB 2400 MHz DDR4, 2TB HD running under macOS Catalina v10.15.7 (19G2021)
Location: NW England, UK

Project: 6381 (Run 9, Clone 37, Gen 324) failing.

Post by iBozz »

I have just deleted the work unit Project: 6381 (Run 9, Clone 37, Gen 324) which has been failing for 20 hours. This is the first unit to fail (as far as I am aware) for six months.

A new unit Project: 9032 (Run 186, Clone 3, Gen 40) has been downloaded which seems to be working just fine.

A sample of the log is ...

Code: Select all

21:45:46:WU01:FS00:Started FahCore on PID 28765
21:45:46:WU01:FS00:Core PID:28766
21:45:46:WU01:FS00:FahCore 0xa4 started
21:45:47:WU01:FS00:0xa4:
21:45:47:WU01:FS00:0xa4:*------------------------------*
21:45:47:WU01:FS00:0xa4:Folding@Home Gromacs Core
21:45:47:WU01:FS00:0xa4:Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
21:45:47:WU01:FS00:0xa4:
21:45:47:WU01:FS00:0xa4:Preparing to commence simulation
21:45:47:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:45:56:WU01:FS00:0xa4:- Looking at optimizations...
21:45:56:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:45:56:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:45:56:WU01:FS00:0xa4:- Expanded 565225 -> 1381464 (decompressed 244.4 percent)
21:45:56:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=565225 data_size=1381464, decompressed_data_size=1381464 diff=0
21:45:56:WU01:FS00:0xa4:- Digital signature verified
21:45:56:WU01:FS00:0xa4:
21:45:56:WU01:FS00:0xa4:Project: 6381 (Run 9, Clone 37, Gen 324)
21:45:56:WU01:FS00:0xa4:
21:45:57:WU01:FS00:0xa4:Entering M.D.
21:46:03:WU01:FS00:0xa4:Mapping NT from 8 to 8 
21:46:03:WU01:FS00:0xa4:Completed 0 out of 2500000 steps  (0%)
21:46:04:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:46:46:WU01:FS00:Starting
21:46:46:WU01:FS00:Removing old file './work/01/logfile_01-20160223-211443.txt'
21:46:46:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/web.stanford.edu/~pande/OSX/AMD64/Core_a4.fah/FahCore_a4" -dir 01 -suffix 01 -version 704 -lifeline 363 -checkpoint 15 -np 8
21:46:46:WU01:FS00:Started FahCore on PID 28768
21:46:47:WU01:FS00:Core PID:28769
21:46:47:WU01:FS00:FahCore 0xa4 started
21:46:47:WU01:FS00:0xa4:
21:46:47:WU01:FS00:0xa4:*------------------------------*
21:46:47:WU01:FS00:0xa4:Folding@Home Gromacs Core
21:46:47:WU01:FS00:0xa4:Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
21:46:47:WU01:FS00:0xa4:
21:46:47:WU01:FS00:0xa4:Preparing to commence simulation
21:46:47:WU01:FS00:0xa4:- Ensuring status. Please wait.
21:46:56:WU01:FS00:0xa4:- Looking at optimizations...
21:46:56:WU01:FS00:0xa4:- Working with standard loops on this execution.
21:46:56:WU01:FS00:0xa4:Examination of work files indicates 8 consecutive improper terminations of core.
21:46:56:WU01:FS00:0xa4:- Expanded 565225 -> 1381464 (decompressed 244.4 percent)
21:46:56:WU01:FS00:0xa4:Called DecompressByteArray: compressed_data_size=565225 data_size=1381464, decompressed_data_size=1381464 diff=0
21:46:56:WU01:FS00:0xa4:- Digital signature verified
21:46:56:WU01:FS00:0xa4:
21:46:56:WU01:FS00:0xa4:Project: 6381 (Run 9, Clone 37, Gen 324)
21:46:56:WU01:FS00:0xa4:
21:46:57:WU01:FS00:0xa4:Entering M.D.
21:47:03:WU01:FS00:0xa4:Mapping NT from 8 to 8 
21:47:03:WU01:FS00:0xa4:Completed 0 out of 2500000 steps  (0%)
21:47:04:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
21:47:47:WU01:FS00:Starting
21:47:47:WU01:FS00:Removing old file './work/01/logfile_01-20160223-211543.txt'
21:47:47:WU01:FS00:Running FahCore: /usr/bin/FAHCoreWrapper "/Library/Application Support/FAHClient/cores/web.stanford.edu/~pande/OSX/AMD64/Core_a4.fah/FahCore_a4" -dir 01 -suffix 01 -version 704 -lifeline 363 -checkpoint 15 -np 8
iMac (Retina 5K, 27-inch, 2017), 3.8 GHz Quad-Core Intel Core i5, 64 GB 2400 MHz DDR4, 2TB HD, macOS Catalina v10.15.7
bruce
Posts: 20910
Joined: Thu Nov 29, 2007 10:13 pm
Location: So. Cal.

Re: Project: 6381 (Run 9, Clone 37, Gen 324) failing.

Post by bruce »

No one has returned that WU yet, either successfully or with an error report.

Gen 323 was added to the stats database on 2016-02-10 so gen 324 has been kicking around for almost two weeks now. That's consistent with a timeout of 12 days so apparently it has only been processed by one other person. Nevertheless, I think it's expedient for me to mark it as a bad WU so it won't get reassigned again when your copy times out.
iBozz
Posts: 89
Joined: Wed Nov 26, 2008 7:01 pm
Hardware configuration: iMac (Retina 5K, 27-inch, 2017), 3.8 GHz Quad-Core Intel Core i5, 64 GB 2400 MHz DDR4, 2TB HD running under macOS Catalina v10.15.7 (19G2021)
Location: NW England, UK

Re: Project: 6381 (Run 9, Clone 37, Gen 324) failing.

Post by iBozz »

Thanks, bruce, I was worried it might just be me! If you want more log I'll be happy to provide, but I think (hope) that I had clipped enough before it repeated too often. If you have enough information from my log then don't bother to reply on that specific point.

Whether it is relevant or not, I note that my kit description at the foot of my original post was out of date and I've now updated it. :oops:
iMac (Retina 5K, 27-inch, 2017), 3.8 GHz Quad-Core Intel Core i5, 64 GB 2400 MHz DDR4, 2TB HD, macOS Catalina v10.15.7
Post Reply