Project: 2665 (Run 3, Clone 446, Gen 42)

Moderators: Site Moderators, FAHC Science Team

spike09
Posts: 29
Joined: Mon Dec 03, 2007 12:04 pm
Hardware configuration: 2 x Q6600s with 2 GB PC6400, E6400 2 GB PC6400, X2 3800 1 GB PC3200, Athlon 3700 1 GB PC3200, Athlon 2200 512 MB PC2700
Location: location location

Project: 2665 (Run 3, Clone 446, Gen 42)

Post by spike09 »

I have this WU all over me like a rash, on one machine.
System spec: Q6600, ASRock dual4 core SATA2 board, 2 GB of OCZ RAM, running Win XP and the deprecated client. No cooling problems and no overclocking.

[02:05:39] + Processing work unit
[02:05:39] Core required: FahCore_a1.exe
[02:05:39] Core found.
[02:05:39] Working on Unit 08 [October 9 02:05:39]
[02:05:39] + Working ...
[02:05:39] - Calling 'mpiexec -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -np 4 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 504 -version 592'

[02:05:40] 
[02:05:40] *------------------------------*
[02:05:40] Folding@Home Gromacs SMP Core
[02:05:40] Version 1.76 (February 23, 2008)
[02:05:40] 
[02:05:40] Preparing to commence simulation
[02:05:40] - Ensuring status. Please wait.
[02:05:57] - Assembly optimizations manually forced on.
[02:05:57] - Not checking prior termination.
[02:06:24] - Expanded 4764646 -> 24426905 (decompressed 512.6 percent)
[02:06:24] - Failed to delete work/wudata_08.sas
[02:06:24] - Failed to delete work/wudata_08.pdo
[02:06:24] Warning:  check for stray files
[02:06:24] - Starting from initial work packet
[02:06:24] 
[02:06:24] Project: 2665 (Run 3, Clone 446, Gen 42)
[02:06:24] 
[02:06:24] ect: 2665 (Run 3, Clone 446, Gen 42)
[02:06:24] 
[02:06:26] .
[02:06:26] Entering M.D.
[02:06:26] ations on if available.
[02:06:26] Entering M.D.
[02:06:35] ater
[02:06:35] Writing local files
[02:06:35] ting local files
[02:06:36] Extra SSE boost OK.
[02:06:47] es
[02:06:48] Completed 0 out of 250000 steps  (0 percent)
[02:06:56] Warning:  long 1-4 interactions
[05:06:48] At least 3 hours since checkpoint written...
[05:06:48] 
[05:06:48] Folding@home Core Shutdown: EARLY_UNIT_END
[05:06:48] 
[05:06:48] Folding@home Core Shutdown: EARLY_UNIT_END
[05:06:59] CoreStatus = 63 (99)
[05:06:59] + Error starting Folding@Home core or unexpected system termination of core.
[05:07:04] 
[05:07:04] + Processing work unit
[05:07:04] Core required: FahCore_a1.exe
[05:07:04] Core found.
[05:07:04] Working on Unit 08 [October 9 05:07:04]
[05:07:04] + Working ...
[05:07:04] - Calling 'mpiexec -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -np 4 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 504 -version 592'

[05:07:16] 
[05:07:16] *------------------------------*
[05:07:16] Folding@Home Gromacs SMP Core
[05:07:16] Version 1.76 (February 23, 2008)
[05:07:16] 
[05:07:16] Preparing to commence simulation
[05:07:16] - Ensuring status. Please wait.
[05:07:27] 
[05:07:33] Project: 2665 (Run 3, Clone 446, Gen 42)
[05:07:33] 
[05:07:47] Assembly optimizations on if available.
[05:07:47] Entering M.D.
[05:07:58]  on if available.
[05:07:58] Entering M.D.
[05:08:23] Calling FAH init
[05:08:25] Read topology
[05:08:26] ocal files
[05:08:26] rom checkpoint)
[05:08:26] Read checkpoint
[05:08:26] Protein: HGG in water
[05:08:26] Writing local files
[05:08:39] Extra SSE boost OK.
[05:08:40] Writing local files
[05:08:40] Completed 0 out of 250000 steps  (0 percent)
[05:08:50] Warning:  long 1-4 interactions
[06:03:22] - Autosending finished units...
[06:03:22] Trying to send all finished work units
[06:03:22] + No unsent completed units remaining.
[06:03:22] - Autosend completed
[08:08:40] At least 3 hours since checkpoint written...
[08:08:40] 
[08:08:40] Folding@home Core Shutdown: EARLY_UNIT_END
[08:08:40] 
[08:08:40] Folding@home Core Shutdown: EARLY_UNIT_END
[08:08:45] CoreStatus = 63 (99)
[08:08:45] + Error starting Folding@Home core or unexpected system termination of core.
[08:08:50] 
[08:08:50] + Processing work unit
[08:08:50] Core required: FahCore_a1.exe
[08:08:50] Core found.
[08:08:50] Working on Unit 08 [October 9 08:08:50]
[08:08:50] + Working ...
[08:08:50] - Calling 'mpiexec -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -np 4 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 504 -version 592'

[08:08:52] 
[08:08:52] *------------------------------*
[08:08:52] Folding@Home Gromacs SMP Core
[08:08:52] Version 1.76 (February 23, 2008)
[08:08:52] 
[08:08:52] Preparing to commence simulation
[08:08:52] - Ensuring status. Please wait.
[08:08:59] 
[08:09:00] Project: 2665 (Run 3, Clone 446, Gen 42)
[08:09:00] 
[08:09:01] Assembly optimizations on if available.
[08:09:01] Entering M.D.
[08:09:30]  on if available.
[08:09:30] Entering M.D.
[08:09:37] Calling FAH init
[08:09:39] Read topology
[08:09:39] ocal files
[08:09:39] rom checkpoint)
[08:09:39] Read checkpoint
[08:09:39] Protein: HGG in water
[08:09:39] Writing local files
[08:09:51] Extra SSE boost OK.
[08:09:52] Writing local files
[08:09:52] Completed 0 out of 250000 steps  (0 percent)
[08:10:00] Warning:  long 1-4 interactions
[11:09:53] At least 3 hours since checkpoint written...
[11:09:53] 
[11:09:53] Folding@home Core Shutdown: EARLY_UNIT_END
[11:09:53] 
[11:09:53] Folding@home Core Shutdown: EARLY_UNIT_END
[11:09:56] CoreStatus = 63 (99)
[11:09:56] + Error starting Folding@Home core or unexpected system termination of core.
[11:09:56] - Attempting to download new core...
[11:09:56] + Downloading new core: FahCore_a1.exe
[11:09:56] Downloading core (/~pande/Win32/x86_Deino/Core_a1.fah from www.stanford.edu)
[11:09:57] Initial: AFDE; + 10240 bytes downloaded
[11:09:57] Initial: 10F0; + 20480 bytes downloaded
[11:09:57] Initial: DB70; + 30720 bytes downloaded
[11:09:57] Initial: 865E; + 40960 bytes downloaded
[11:09:57] Initial: 8F87; + 51200 bytes downloaded
[11:09:57] Initial: C48B; + 61440 bytes downloaded
[11:09:57] Initial: 92B3; + 71680 bytes downloaded
[11:09:57] Initial: C102; + 81920 bytes downloaded
[11:09:58] Initial: 1996; + 92160 bytes downloaded
[11:09:58] Initial: BFFE; + 102400 bytes downloaded
[11:09:58] Initial: 1810; + 112640 bytes downloaded
[11:09:58] Initial: 0626; + 122880 bytes downloaded
[11:09:58] Initial: 7B53; + 133120 bytes downloaded
[11:09:58] Initial: 0441; + 143360 bytes downloaded
[11:09:58] Initial: FECE; + 153600 bytes downloaded
[11:09:58] Initial: D346; + 163840 bytes downloaded
[11:09:58] Initial: 2DE8; + 174080 bytes downloaded
[11:09:58] Initial: B3F0; + 184320 bytes downloaded
[11:09:58] Initial: 2881; + 194560 bytes downloaded
[11:09:58] Initial: 9507; + 204800 bytes downloaded
[11:09:58] Initial: 1BAF; + 215040 bytes downloaded
[11:09:58] Initial: 717C; + 225280 bytes downloaded
[11:09:58] Initial: 23FD; + 235520 bytes downloaded
[11:09:59] Initial: 915F; + 245760 bytes downloaded
[11:09:59] Initial: CE52; + 256000 bytes downloaded
[11:09:59] Initial: ED88; + 266240 bytes downloaded
[11:09:59] Initial: 2579; + 276480 bytes downloaded
[11:09:59] Initial: 3396; + 286720 bytes downloaded
[11:09:59] Initial: 410C; + 296960 bytes downloaded
[11:09:59] Initial: 56D1; + 307200 bytes downloaded
[11:09:59] Initial: 1EBD; + 317440 bytes downloaded
[11:09:59] Initial: 6AD9; + 327680 bytes downloaded
[11:09:59] Initial: F931; + 337920 bytes downloaded
[11:09:59] Initial: 1C40; + 348160 bytes downloaded
[11:10:00] Initial: C4AE; + 358400 bytes downloaded
[11:10:00] Initial: 57E4; + 368640 bytes downloaded
[11:10:00] Initial: 1843; + 378880 bytes downloaded
[11:10:00] Initial: B0C0; + 389120 bytes downloaded
[11:10:00] Initial: AAAA; + 399360 bytes downloaded
[11:10:00] Initial: D737; + 409600 bytes downloaded
[11:10:00] Initial: 762A; + 419840 bytes downloaded
[11:10:00] Initial: 8685; + 430080 bytes downloaded
[11:10:00] Initial: 25B1; + 440320 bytes downloaded
[11:10:00] Initial: 44F1; + 450560 bytes downloaded
[11:10:00] Initial: EF81; + 460800 bytes downloaded
[11:10:00] Initial: 900E; + 471040 bytes downloaded
[11:10:00] Initial: 906E; + 481280 bytes downloaded
[11:10:01] Initial: D59F; + 491520 bytes downloaded
[11:10:01] Initial: 2406; + 501760 bytes downloaded
[11:10:01] Initial: 9777; + 512000 bytes downloaded
[11:10:01] Initial: 7783; + 522240 bytes downloaded
[11:10:01] Initial: AEC5; + 532480 bytes downloaded
[11:10:01] Initial: B8A1; + 542720 bytes downloaded
[11:10:01] Initial: D50E; + 552960 bytes downloaded
[11:10:01] Initial: BDEE; + 563200 bytes downloaded
[11:10:02] Initial: E433; + 573440 bytes downloaded
[11:10:02] Initial: 667A; + 583680 bytes downloaded
[11:10:02] Initial: C413; + 593920 bytes downloaded
[11:10:02] Initial: DB64; + 604160 bytes downloaded
[11:10:02] Initial: 313C; + 614400 bytes downloaded
[11:10:02] Initial: 4B8A; + 624640 bytes downloaded
[11:10:02] Initial: 1B3A; + 634880 bytes downloaded
[11:10:02] Initial: E39B; + 645120 bytes downloaded
[11:10:02] Initial: F9FD; + 655360 bytes downloaded
[11:10:02] Initial: BFF6; + 665600 bytes downloaded
[11:10:02] Initial: 0552; + 675840 bytes downloaded
[11:10:02] Initial: 14A7; + 686080 bytes downloaded
[11:10:02] Initial: 99A6; + 696320 bytes downloaded
[11:10:02] Initial: 06B2; + 706560 bytes downloaded
[11:10:02] Initial: 445D; + 716800 bytes downloaded
[11:10:02] Initial: 62C1; + 727040 bytes downloaded
[11:10:02] Initial: 0E27; + 737280 bytes downloaded
[11:10:02] Initial: EF9A; + 747520 bytes downloaded
[11:10:02] Initial: C105; + 757760 bytes downloaded
[11:10:02] Initial: 46D3; + 768000 bytes downloaded
[11:10:03] Initial: 33C7; + 778240 bytes downloaded
[11:10:03] Initial: 7E92; + 788480 bytes downloaded
[11:10:03] Initial: 24B3; + 795847 bytes downloaded
[11:10:03] Verifying core Core_a1.fah...
[11:10:03] Signature is VALID
[11:10:03] 
[11:10:03] Trying to unzip core FahCore_a1.exe
[11:10:03] Decompressed FahCore_a1.exe (2117632 bytes) successfully
[11:10:03] + Core successfully engaged
[11:10:08] 
[11:10:08] + Processing work unit
[11:10:08] Core required: FahCore_a1.exe
[11:10:08] Core found.
[11:10:08] Working on Unit 08 [October 9 11:10:08]
[11:10:08] + Working ...
[11:10:08] - Calling 'mpiexec -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -np 4 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 504 -version 592'

[11:10:09] 
[11:10:09] *------------------------------*
[11:10:09] Folding@Home Gromacs SMP Core
[11:10:09] Version 1.76 (February 23, 2008)
[11:10:09] 
[11:10:09] Preparing to commence simulation
[11:10:09] - Ensuring status. Please wait.
[11:10:16] 
[11:10:16] Project: 2665 (Run 3, Clone 446, Gen 42)
[11:10:16] 
[11:10:17] Assembly optimizations on if available.
[11:10:17] Entering M.D.
[11:10:48]  on if available.
[11:10:48] Entering M.D.
[11:10:55] Calling FAH init
[11:10:57] ater
[11:10:57] Writing local files
[11:10:57] rom checkpoint)
[11:10:57] Read checkpoint
[11:10:57] Protein: HGG in water
[11:10:57] Writing local files
[11:11:09] Extra SSE boost OK.
[11:11:10] Writing local files
[11:11:10] Completed 0 out of 250000 steps  (0 percent)
[11:11:18] Warning:  long 1-4 interactions
[12:03:22] - Autosending finished units...
[12:03:22] Trying to send all finished work units
[12:03:22] + No unsent completed units remaining.
[12:03:22] - Autosend completed
[12:55:07] Killing all core threads
[12:55:07] Killing SMP core threads
[12:55:07] Killing 4 cores
[12:55:07] Killing core 0
[12:55:07] Killing core 1
[12:55:07] Killing core 2
[12:55:07] Killing core 3

Folding@Home Client Shutdown at user request.
[12:55:07] ***** Got a SIGTERM signal (2)
[12:55:07] Killing all core threads
[12:55:07] Killing SMP core threads
[12:55:07] Killing 4 cores
[12:55:07] Killing core 0
[12:55:07] Killing core 1
[12:55:07] Killing core 2
[12:55:07] Killing core 3

Folding@Home Client Shutdown.


--- Opening Log file [October 9 12:55:37] 


# SMP Client ##################################################################
###############################################################################

                       Folding@Home Client Version 5.92beta

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Documents and Settings\spike\Desktop\Folding@Home Windows SMP Client V1.01
Executable: C:\Documents and Settings\spike\Desktop\Folding@Home Windows SMP Client V1.01\fah-SMP-592.exe
Arguments: -verbosity 9 -advmethods -forceasm 

Warning:
 By using the -forceasm flag, you are overriding
 safeguards in the program. If you did not intend to
 do this, please restart the program without -forceasm.
 If work units are not completing fully (and particularly
 if your machine is overclocked), then please discontinue
 use of the flag.

[12:55:37] - Ask before connecting: No
[12:55:37] - User name: SPIKE09 (Team 46590)
[12:55:37] - User ID: 42CC37B56154D853
[12:55:37] - Machine ID: 3
[12:55:37] 
[12:55:38] Loaded queue successfully.
[12:55:38] 
[12:55:38] - Autosending finished units...
[12:55:38] + Processing work unit
[12:55:38] Trying to send all finished work units
[12:55:38] Core required: FahCore_a1.exe
[12:55:38] + No unsent completed units remaining.
[12:55:38] - Autosend completed
[12:55:38] Core found.
[12:55:38] Working on Unit 08 [October 9 12:55:38]
[12:55:38] + Working ...
[12:55:38] - Calling 'mpiexec -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -np 4 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 2296 -version 592'

[12:55:39] 
[12:55:39] *------------------------------*
[12:55:39] Folding@Home Gromacs SMP Core
[12:55:39] Version 1.76 (February 23, 2008)
[12:55:39] 
[12:55:39] Preparing to commence simulation
[12:55:39] - Ensuring status. Please wait.
[12:55:56] - Assembly optimizations manually forced on.
[12:55:56] - Not checking prior termination.
[12:56:23] - Expanded 4764646 -> 24426905 (decompressed 512.6 percent)
[12:56:23] 
[12:56:23] Project: 2665 (Run 3, Clone 446, Gen 42)
[12:56:23] 
[12:56:23] Assembly optimizations on if available.
[12:56:23] Entering M.D.
[12:56:31] Calling FAH init
[12:56:33] Read topology
[12:56:34] ocal files
[12:56:34] rom checkpoint)
[12:56:34] Read checkpoint
[12:56:34] Protein: HGG in water
[12:56:34] Writing local files
[12:56:46] Extra SSE boost OK.
[12:56:46] Writing local files
[12:56:47] Completed 0 out of 250000 steps  (0 percent)
[12:56:55] Warning:  long 1-4 interactions
[15:56:46] At least 3 hours since checkpoint written...
[15:56:46] 
[15:56:46] Folding@home Core Shutdown: EARLY_UNIT_END
[15:56:46] 
[15:56:46] Folding@home Core Shutdown: EARLY_UNIT_END
[15:56:50] CoreStatus = 63 (99)
[15:56:50] + Error starting Folding@Home core or unexpected system termination of core.
[15:56:55] 
[15:56:55] + Processing work unit
[15:56:55] Core required: FahCore_a1.exe
[15:56:55] Core found.
[15:56:55] Working on Unit 08 [October 9 15:56:55]
[15:56:55] + Working ...
[15:56:55] - Calling 'mpiexec -channel shm -env MPICH_USE_SMP_OPTIMIZATIONS 1 -np 4 FahCore_a1.exe -dir work/ -suffix 08 -checkpoint 15 -forceasm -verbose -lifeline 2296 -version 592'

[15:56:56] 
[15:56:56] *------------------------------*
[15:56:56] Folding@Home Gromacs SMP Core
[15:56:56] Version 1.76 (February 23, 2008)
[15:56:56] 
[15:56:56] Preparing to commence simulation
[15:56:56] - Ensuring status. Please wait.
[15:57:03] 
[15:57:03] Project: 2665 (Run 3, Clone 446, Gen 42)
[15:57:03] 
[15:57:04] Assembly optimizations on if available.
[15:57:04] Entering M.D.
[15:57:34]  on if available.
[15:57:34] Entering M.D.
[15:57:42] Calling FAH init
[15:57:44] Read topology
[15:57:44] ocal files
[15:57:44] rom checkpoint)
[15:57:44] Read checkpoint
[15:57:44] Protein: HGG in water
[15:57:44] Writing local files
[15:57:56] Extra SSE boost OK.
[15:57:57] Writing local files
[15:57:57] Completed 0 out of 250000 steps  (0 percent)
[15:58:05] Warning:  long 1-4 interactions
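
The log above shows the same pattern each time: the client prints the Project/Run/Clone/Gen line, sits at 0 percent, and about three hours later the core exits with `CoreStatus = 63 (99)`. A minimal sketch for counting those exits per work unit is below. It assumes the log line formats seen in this thread; the function name and the sample lines are illustrative only and are not part of any Folding@Home tool.

```python
import re
from collections import Counter

# Patterns copied from the log lines in this thread (assumed stable for this client).
PRCG = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
STATUS_63 = re.compile(r"CoreStatus = 63 \(99\)")


def count_status_63(log_lines):
    """Count 'CoreStatus = 63 (99)' lines per (project, run, clone, gen)."""
    counts = Counter()
    current = None  # the work unit most recently announced in the log
    for line in log_lines:
        match = PRCG.search(line)
        if match:
            current = tuple(int(g) for g in match.groups())
        elif current is not None and STATUS_63.search(line):
            counts[current] += 1
    return counts


sample = [
    "[02:06:24] Project: 2665 (Run 3, Clone 446, Gen 42)",
    "[05:06:48] Folding@home Core Shutdown: EARLY_UNIT_END",
    "[05:06:48] Folding@home Core Shutdown: EARLY_UNIT_END",
    "[05:06:59] CoreStatus = 63 (99)",
    "[05:07:33] Project: 2665 (Run 3, Clone 446, Gen 42)",
    "[08:08:45] CoreStatus = 63 (99)",
]
result = count_status_63(sample)
print(result)
```

Counting the `CoreStatus` line rather than `EARLY_UNIT_END` avoids double counting, since the shutdown message is printed twice per failure in this log.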
team 46590 :)
toTOW
Site Moderator
Posts: 6312
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by toTOW »

This one is a bad WU :(

Run qfix, and move on.

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
spike09
Posts: 29
Joined: Mon Dec 03, 2007 12:04 pm
Hardware configuration: 2 x Q6600s with 2 GB PC6400, E6400 2 GB PC6400, X2 3800 1 GB PC3200, Athlon 3700 1 GB PC3200, Athlon 2200 512 MB PC2700
Location: location location

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by spike09 »

Ran qfix and got the same WU; deleted the queue and work folder and still got the same WU :(
team 46590 :)
spike09
Posts: 29
Joined: Mon Dec 03, 2007 12:04 pm
Hardware configuration: 2 x Q6600s with 2 GB PC6400, E6400 2 GB PC6400, X2 3800 1 GB PC3200, Athlon 3700 1 GB PC3200, Athlon 2200 512 MB PC2700
Location: location location

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by spike09 »

I have updated to the latest client, followed Xililxons guides, and it is borked. I have been a beta tester here for a few years and have never encountered a worse client release. The project folks post less here than on the old forums, and the poor admins and mods are left to cope. Step up to the plate, Stanford: this is your baby, and my money going down the drain. :evil:
team 46590 :)
muttonhunter
Posts: 14
Joined: Sun Mar 02, 2008 8:42 am
Hardware configuration: .

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by muttonhunter »

I have had the same long 1-4 interactions warning on this WU 13 times today. I have tried to run qfix and sendall, but they just open and close immediately. I have deleted the queue, work folder and unitinfo and still get 2665 r3 c446 g42. What's up with that?

This is using 6.22 beta2r3. I haven't had time to install 6.23 yet as I am busy remodeling our bathroom.
Can I delete both of the FahCore a1 folders? Or do I need to install 6.23? Or?
:e(
kasson
Pande Group Member
Posts: 1459
Joined: Thu Nov 29, 2007 9:37 pm

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by kasson »

We would recommend upgrading to 6.23 as this is most likely to fix the problem.
spike09
Posts: 29
Joined: Mon Dec 03, 2007 12:04 pm
Hardware configuration: 2 x Q6600s with 2 GB PC6400, E6400 2 GB PC6400, X2 3800 1 GB PC3200, Athlon 3700 1 GB PC3200, Athlon 2200 512 MB PC2700
Location: location location

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by spike09 »

kasson wrote: We would recommend upgrading to 6.23 as this is most likely to fix the problem.
6.22 is now working, but will do, kasson :mrgreen:
team 46590 :)
muttonhunter
Posts: 14
Joined: Sun Mar 02, 2008 8:42 am
Hardware configuration: .

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by muttonhunter »

I upgraded to 6.23, rebooted and started the SMP client again. AGAIN I got p2665 r3 c446 g42, killed it and started again: same WU!

I killed it again and deleted work, queue and unitinfo, restarted and finally got a different WU, a p4100 this time. I have only gotten 2665 WUs for a long time.
Restarted SMP successfully. It has completed 2% so far, but...

Looking at Task Manager I see one FahCore_81 instead of the four a1 processes that I am used to seeing.

This core 81 is using 50% of CPU cycles, and my GPU folding (core 11), which used to show only a few percent CPU usage occasionally, is now using 25 to 35%. System Idle Process is now showing between 15 and 25% usage, whereas it used to be mostly zero. Total CPU usage is listed as between 72 and 84%, even when Task Manager shows core 81, core 11 and System Idle adding up to 100%.
I have a feeling something is not right?

Edit to add: I used to have a Task Manager process entry called something like "mipich". I had trouble with duplicates of that a long time ago when I first started SMP folding; now it is not listed even once.
Also, this is a dual-core rig with an 8800 GT GPU running XP Pro.
Last edited by muttonhunter on Sun Oct 12, 2008 9:26 pm, edited 1 time in total.
toTOW
Site Moderator
Posts: 6312
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by toTOW »

I think you didn't start with the -smp flag, and you got a uniprocessor WU.
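
As a rough check on the 50% figure reported above: Task Manager shows each process as a share of all cores combined, so a core that runs a single busy thread cannot exceed 100/N percent on an N-core machine. The sketch below only illustrates that arithmetic. It assumes FahCore_81 runs one compute thread, which is what "uniprocessor" suggests but is not confirmed in this thread, and the function name is made up for the example.

```python
def max_total_cpu_percent(busy_threads, logical_cpus):
    """Upper bound on the share of total CPU that busy_threads can occupy."""
    if busy_threads < 1 or logical_cpus < 1:
        raise ValueError("thread and CPU counts must be positive")
    # Each thread can saturate at most one logical CPU at a time.
    return 100.0 * min(busy_threads, logical_cpus) / logical_cpus


# One single-threaded core on the dual-core rig described above.
uniprocessor_share = max_total_cpu_percent(1, 2)
# Four SMP core processes on a quad core such as a Q6600.
smp_share = max_total_cpu_percent(4, 4)
print(uniprocessor_share, smp_share)
```

A single process pinned near 50% on a dual core is therefore consistent with a uniprocessor WU rather than an SMP one.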

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.
muttonhunter
Posts: 14
Joined: Sun Mar 02, 2008 8:42 am
Hardware configuration: .

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by muttonhunter »

I did make the -smp flag shortcut, and it shows as such when I right-click the shortcut icon and open Properties. I guess I will kill this and start again, and maybe redo the flag.
The flag is: (space)-smp, correct?
Thanks toTOW, I'll let you know how it goes.
muttonhunter
Posts: 14
Joined: Sun Mar 02, 2008 8:42 am
Hardware configuration: .

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by muttonhunter »

I got it!!
I restarted the SMP client and it went right back to the 4100 WU where it left off.
The F@H window showed the -smp argument, but in the folding program file folder the core 81 file was still there. I deleted that along with work, queue and unitinfo and restarted correctly. This time I got a 2653 WU. If I never see a 2665 again I will be happy; I have completed 30-something of them and lost just about as many.
Thanks again toTOW. :D
Gary480six
Posts: 91
Joined: Mon Jan 21, 2008 6:42 pm

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by Gary480six »

This work unit is still out there and still BORKED.

It still does this when started:
[18:28:27] Project: 2665 (Run 3, Clone 446, Gen 42)
[18:28:27]
[18:28:29] Entering M.D.
[18:28:35] Calling FAH init
[18:28:37] Read topology
[18:28:37] ocal files
[18:28:37] rom checkpoint)
[18:28:37] Read checkpoint
[18:28:37] Protein: HGG in water
[18:28:37] Writing local files
[18:28:45] Extra SSE boost OK.
[18:28:46] Writing local files
[18:28:46] Completed 0 out of 250000 steps (0 percent)
[18:28:53] Warning: long 1-4 interactions

And then just sits there. Qfix does nothing because at 0% there is nothing to fix. This is XP Pro on a stock-clocked Q9300 system, running the new 6.23 beta client. So much for that fix.

If it helps anyone: the way I got my system to move on and stop getting the same WU was to go into Task Manager while this faulty WU sat doing nothing and end the process of one of the a1 cores. That crashed the client and got me a different WU.
Hope this helps someone.
al2
Posts: 10
Joined: Tue Jan 01, 2008 3:48 pm
Location: U.K.

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by al2 »

Not sure if the following (as I understand it) is any use for your qfix problems:

1) You might need the latest version of qfix to work with V6 clients. See halfway down the following linked page:

http://foldingforum.org/viewtopic.php?f ... a&start=60


2) The very least qfix should do (e.g. if the WU fails at the start/0%) is stop you getting the same problem WU repeatedly. I couldn't get it to work until I realized you have to DELETE just the queue slot (number xx, which you can see in the log just after a start) that the problem WU is in BEFORE you use qfix, unless the client has already deleted it itself. NB: another quirk is that the client will say it has failed to carry out the "-delete xx" flag (as before, xx is the queue slot number of the WU being folded), but in reality the slot HAS been deleted.

If you run qfix AFTER this deletion it should detect an ERROR (unlike without the previous deletion), rebuild (fix) the queue that you just deleted and send whatever partial WU there is to the server as finished, so the server will give you a totally new unit to fold.
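
To find the slot number mentioned in point 2, one option is to read it from the most recent "Working on Unit NN" line in the client log. The sketch below is based only on the 5.92 log posted earlier in this thread; newer clients may word that line differently, so the pattern is an assumption, and the function name is invented for illustration.

```python
import re

# Assumed format, taken from the 5.92 log in this thread:
# "[15:56:55] Working on Unit 08 [October 9 15:56:55]"
UNIT_LINE = re.compile(r"Working on Unit (\d+)")


def latest_unit_number(log_lines):
    """Return the NN from the last 'Working on Unit NN' line, or None."""
    latest = None
    for line in log_lines:
        match = UNIT_LINE.search(line)
        if match:
            latest = match.group(1)  # kept as text to preserve the leading zero
    return latest


sample = [
    "[15:56:55] + Processing work unit",
    "[15:56:55] Working on Unit 08 [October 9 15:56:55]",
    "[15:56:55] + Working ...",
]
slot = latest_unit_number(sample)
print(slot)
```

In the posted log the same number also appears in the `-suffix 08` argument and in the `work/wudata_08` file names, which gives a second place to cross-check it before deleting anything.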

The following link concerns Linux (so ignore the Linux-specific commands etc.) and a hanging client, but I found the info about qfix helped clarify things with EUE and using qfix.

http://foldingforum.org/viewtopic.php?f=44&t=3889
Folding on XPMce 32-bit in Dell9200 machine (stock) with;

C2D E6600

2GB DDR 533Mhz (Kingston)
toTOW
Site Moderator
Posts: 6312
Joined: Sun Dec 02, 2007 10:38 am
Location: Bordeaux, France

Re: Project: 2665 (Run 3, Clone 446, Gen 42)

Post by toTOW »

There is a qfix how-to posted at the top of this forum: viewtopic.php?f=19&t=6042 ;)

Folding@Home beta tester since 2002. Folding Forum moderator since July 2008.