Page 1 of 1

Projects Too Big To Handle?

Posted: Thu Jun 09, 2022 7:30 pm
by geokilla
I just got assigned Project 18414 and it's saying it has an ETA of 4 days to complete with a TPF of 1 hour. I just completed Project 16999 where it took about 2 minutes to complete 1% according to the logs. Both use the same A8 core. There's no way I'm going to complete this project in time as I only run Folding when I'm on the computer.

How the hell did I get such a large project? My core temps are under 80C so I'm not throttling either.

Edit: Paused it and resumed. Now saying ETA of 1.15 days and TPF of 17 minutes.

Code: Select all

19:19:37:WU01:FS00:0xa8:*********************** Log Started 2022-06-09T19:19:37Z ***********************
19:19:37:WU01:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
19:19:37:WU01:FS00:0xa8:       Core: Gromacs
19:19:37:WU01:FS00:0xa8:       Type: 0xa8
19:19:37:WU01:FS00:0xa8:    Version: 0.0.12
19:19:37:WU01:FS00:0xa8:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
19:19:37:WU01:FS00:0xa8:  Copyright: 2020 foldingathome.org
19:19:37:WU01:FS00:0xa8:   Homepage: https://foldingathome.org/
19:19:37:WU01:FS00:0xa8:       Date: Jan 16 2021
19:19:37:WU01:FS00:0xa8:       Time: 12:29:40
19:19:37:WU01:FS00:0xa8:   Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
19:19:37:WU01:FS00:0xa8:     Branch: master
19:19:37:WU01:FS00:0xa8:   Compiler: Visual C++ 2019 16.7
19:19:37:WU01:FS00:0xa8:    Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:19:37:WU01:FS00:0xa8:   Platform: win32 10
19:19:37:WU01:FS00:0xa8:       Bits: 64
19:19:37:WU01:FS00:0xa8:       Mode: Release
19:19:37:WU01:FS00:0xa8:       SIMD: avx2_256
19:19:37:WU01:FS00:0xa8:     OpenMP: ON
19:19:37:WU01:FS00:0xa8:       CUDA: OFF
19:19:37:WU01:FS00:0xa8:       Args: -dir 01 -suffix 01 -version 706 -lifeline 7324 -checkpoint 15 -np
19:19:37:WU01:FS00:0xa8:             11
19:19:37:WU01:FS00:0xa8:************************************ libFAH ************************************
19:19:37:WU01:FS00:0xa8:       Date: Jan 16 2021
19:19:37:WU01:FS00:0xa8:       Time: 11:24:13
19:19:37:WU01:FS00:0xa8:   Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
19:19:37:WU01:FS00:0xa8:     Branch: master
19:19:37:WU01:FS00:0xa8:   Compiler: Visual C++ 2019 16.7
19:19:37:WU01:FS00:0xa8:    Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:19:37:WU01:FS00:0xa8:   Platform: win32 10
19:19:37:WU01:FS00:0xa8:       Bits: 64
19:19:37:WU01:FS00:0xa8:       Mode: Release
19:19:37:WU01:FS00:0xa8:************************************ CBang *************************************
19:19:37:WU01:FS00:0xa8:       Date: Jan 16 2021
19:19:37:WU01:FS00:0xa8:       Time: 11:23:53
19:19:37:WU01:FS00:0xa8:   Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
19:19:37:WU01:FS00:0xa8:     Branch: master
19:19:37:WU01:FS00:0xa8:   Compiler: Visual C++ 2019 16.7
19:19:37:WU01:FS00:0xa8:    Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:19:37:WU01:FS00:0xa8:   Platform: win32 10
19:19:37:WU01:FS00:0xa8:       Bits: 64
19:19:37:WU01:FS00:0xa8:       Mode: Release
19:19:37:WU01:FS00:0xa8:************************************ System ************************************
19:19:37:WU01:FS00:0xa8:        CPU: Intel(R) Core(TM) i5-10600KF CPU @ 4.10GHz
19:19:37:WU01:FS00:0xa8:     CPU ID: GenuineIntel Family 6 Model 165 Stepping 5
19:19:37:WU01:FS00:0xa8:       CPUs: 12
19:19:37:WU01:FS00:0xa8:     Memory: 31.92GiB
19:19:37:WU01:FS00:0xa8:Free Memory: 24.74GiB
19:19:37:WU01:FS00:0xa8:    Threads: WINDOWS_THREADS
19:19:37:WU01:FS00:0xa8: OS Version: 6.2
19:19:37:WU01:FS00:0xa8:Has Battery: false
19:19:37:WU01:FS00:0xa8: On Battery: false
19:19:37:WU01:FS00:0xa8: UTC Offset: -4
19:19:37:WU01:FS00:0xa8:        PID: 2040
19:19:37:WU01:FS00:0xa8:        CWD: C:\ProgramData\FAHClient\work
19:19:37:WU01:FS00:0xa8:********************************************************************************
19:19:37:WU01:FS00:0xa8:Project: 18414 (Run 143, Clone 5, Gen 63)
19:19:37:WU01:FS00:0xa8:Unit: 0x00000000000000000000000000000000
19:19:37:WU01:FS00:0xa8:Reading tar file core.xml
19:19:37:WU01:FS00:0xa8:Reading tar file frame63.tpr
19:19:37:WU01:FS00:0xa8:Digital signatures verified
19:19:37:WU01:FS00:0xa8:Calling: mdrun -c frame63.gro -s frame63.tpr -x frame63.xtc -cpt 15 -nt 11 -ntmpi 1
19:19:37:WU01:FS00:0xa8:Steps: first=630000000 total=640000000
19:19:39:WU00:FS00:Upload complete
19:19:39:WU00:FS00:Server responded WORK_ACK (400)
19:19:39:WU00:FS00:Final credit estimate, 19222.00 points
19:19:39:WU00:FS00:Cleaning up
19:19:39:WU01:FS00:0xa8:Completed 1 out of 10000000 steps (0%)
19:32:26:FS00:Paused
19:32:26:FS00:Shutting core down
19:32:27:WU01:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
19:32:42:FS00:Unpaused
19:32:42:WU01:FS00:Starting
19:32:42:WU01:FS00:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/win/64bit-avx2-256/a8-0.0.12/Core_a8.fah/FahCore_a8.exe -dir 01 -suffix 01 -version 706 -lifeline 11960 -checkpoint 15 -np 11
19:32:42:WU01:FS00:Started FahCore on PID 12192
19:32:42:WU01:FS00:Core PID:11692
19:32:42:WU01:FS00:FahCore 0xa8 started
19:32:43:WU01:FS00:0xa8:*********************** Log Started 2022-06-09T19:32:42Z ***********************
19:32:43:WU01:FS00:0xa8:************************** Gromacs Folding@home Core ***************************
19:32:43:WU01:FS00:0xa8:       Core: Gromacs
19:32:43:WU01:FS00:0xa8:       Type: 0xa8
19:32:43:WU01:FS00:0xa8:    Version: 0.0.12
19:32:43:WU01:FS00:0xa8:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
19:32:43:WU01:FS00:0xa8:  Copyright: 2020 foldingathome.org
19:32:43:WU01:FS00:0xa8:   Homepage: https://foldingathome.org/
19:32:43:WU01:FS00:0xa8:       Date: Jan 16 2021
19:32:43:WU01:FS00:0xa8:       Time: 12:29:40
19:32:43:WU01:FS00:0xa8:   Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
19:32:43:WU01:FS00:0xa8:     Branch: master
19:32:43:WU01:FS00:0xa8:   Compiler: Visual C++ 2019 16.7
19:32:43:WU01:FS00:0xa8:    Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:32:43:WU01:FS00:0xa8:   Platform: win32 10
19:32:43:WU01:FS00:0xa8:       Bits: 64
19:32:43:WU01:FS00:0xa8:       Mode: Release
19:32:43:WU01:FS00:0xa8:       SIMD: avx2_256
19:32:43:WU01:FS00:0xa8:     OpenMP: ON
19:32:43:WU01:FS00:0xa8:       CUDA: OFF
19:32:43:WU01:FS00:0xa8:       Args: -dir 01 -suffix 01 -version 706 -lifeline 12192 -checkpoint 15 -np
19:32:43:WU01:FS00:0xa8:             11
19:32:43:WU01:FS00:0xa8:************************************ libFAH ************************************
19:32:43:WU01:FS00:0xa8:       Date: Jan 16 2021
19:32:43:WU01:FS00:0xa8:       Time: 11:24:13
19:32:43:WU01:FS00:0xa8:   Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
19:32:43:WU01:FS00:0xa8:     Branch: master
19:32:43:WU01:FS00:0xa8:   Compiler: Visual C++ 2019 16.7
19:32:43:WU01:FS00:0xa8:    Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:32:43:WU01:FS00:0xa8:   Platform: win32 10
19:32:43:WU01:FS00:0xa8:       Bits: 64
19:32:43:WU01:FS00:0xa8:       Mode: Release
19:32:43:WU01:FS00:0xa8:************************************ CBang *************************************
19:32:43:WU01:FS00:0xa8:       Date: Jan 16 2021
19:32:43:WU01:FS00:0xa8:       Time: 11:23:53
19:32:43:WU01:FS00:0xa8:   Revision: c5816759c404e4b65f9f364c3d1ef554a67c4225
19:32:43:WU01:FS00:0xa8:     Branch: master
19:32:43:WU01:FS00:0xa8:   Compiler: Visual C++ 2019 16.7
19:32:43:WU01:FS00:0xa8:    Options: /TP /std:c++14 /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:32:43:WU01:FS00:0xa8:   Platform: win32 10
19:32:43:WU01:FS00:0xa8:       Bits: 64
19:32:43:WU01:FS00:0xa8:       Mode: Release
19:32:43:WU01:FS00:0xa8:************************************ System ************************************
19:32:43:WU01:FS00:0xa8:        CPU: Intel(R) Core(TM) i5-10600KF CPU @ 4.10GHz
19:32:43:WU01:FS00:0xa8:     CPU ID: GenuineIntel Family 6 Model 165 Stepping 5
19:32:43:WU01:FS00:0xa8:       CPUs: 12
19:32:43:WU01:FS00:0xa8:     Memory: 31.92GiB
19:32:43:WU01:FS00:0xa8:Free Memory: 24.74GiB
19:32:43:WU01:FS00:0xa8:    Threads: WINDOWS_THREADS
19:32:43:WU01:FS00:0xa8: OS Version: 6.2
19:32:43:WU01:FS00:0xa8:Has Battery: false
19:32:43:WU01:FS00:0xa8: On Battery: false
19:32:43:WU01:FS00:0xa8: UTC Offset: -4
19:32:43:WU01:FS00:0xa8:        PID: 11692
19:32:43:WU01:FS00:0xa8:        CWD: C:\ProgramData\FAHClient\work
19:32:43:WU01:FS00:0xa8:********************************************************************************
19:32:43:WU01:FS00:0xa8:Project: 18414 (Run 143, Clone 5, Gen 63)
19:32:43:WU01:FS00:0xa8:Unit: 0x00000000000000000000000000000000
19:32:43:WU01:FS00:0xa8:Digital signatures verified
19:32:43:WU01:FS00:0xa8:Calling: mdrun -c frame63.gro -s frame63.tpr -x frame63.xtc -cpi state.cpt -cpt 15 -nt 11 -ntmpi 1
19:32:43:WU01:FS00:0xa8:Steps: first=630000000 total=640000000
19:32:45:WU01:FS00:0xa8:Completed 76512 out of 10000000 steps (0%)
19:36:34:WU01:FS00:0xa8:Completed 100000 out of 10000000 steps (1%)

Re: Projects Too Big To Handle?

Posted: Thu Jun 09, 2022 7:43 pm
by Joe_H
If you client has never processed a WU from a particular project before, its initial estimates are based on the WU timeout. Once 2-3% has been completed the estimates will be more accurate. The estimates can also be off a bit right after restarting a WU after a pause.

Re: Projects Too Big To Handle?

Posted: Thu Jun 09, 2022 7:46 pm
by aetch
a few things:-
1). wait until it has processed 2-3%, when you receive a work unit for a new project (one your system has not seen before) it defaults the ETA to the timeout, for project 18414 that is 4 days
2). these work unit normally take about 12 hours on my system. I'm running a 20 thread CPU slot.
3). the researchers have controls to set a minimum slot size (core/thread count) for the project to run on, I guess you must have met or exceeded that thread/core count
4). the two main factors in determining the size of a project are the protein atom count and the length of time being simulated
5). 18414 is far from the biggest work units available