Can't choose large number of CPU cores


bambihunter
Posts: 22
Joined: Fri Apr 03, 2009 4:09 pm
Hardware configuration: Fractal Design Define R7 XL USB-C Blackout case
Intel i9-9980xe @ 4.6ghz for 24/7 use (water cooled)
128gb GSkill TridentZ DDR4 3866 RAM
4TB Corsair MP600 PRO XT M.2 NVMe PCIe Gen. 4 x4 SSD
Asus WS x299 Sage motherboard
Asus RTX4090 TUF, EVGA RTX3090
Corsair AXi1600i 1600 watt digital power supply
Logitech G910 Spark Keyboard
Logitech G502 Hero Mouse
Location: Central Oklahoma, USA

Can't choose large number of CPU cores

Post by bambihunter »

I have a couple of servers that I am no longer going to use in production. Last night, I was working with the smaller one. It has dual 12-core Xeons, for 24 cores and 48 threads. It will not let me set it higher than 30 cores. While I realize that the HT "cores" won't double the yield, I was surprised it wouldn't go higher, especially because my other system has quad E5-4650 v3s and I was going to use most of its CPU power for this once I migrate the last few smaller VMs off it. Of course I can set it up to run more but smaller WUs, but is there a top limit on the number of CPUs?

Unrelated, but one of my old gaming systems has 3 x GTX 980s in it and it seems there is constantly one GPU failing. It is not the same GPU, nor the same slot, etc. I have moved them from slot to slot and PC to PC. There is no rhyme or reason that I have found as to what is causing it. One card may run 25 units in a row with no failures, then fail repeatedly, then return to working fine again the next day. I have also tried opening up the side and adding cooling, but no change. Has there been a rash of GPU WUs over the past 6 weeks or so that could have caused this?

https://stats.foldingathome.org/donor/2261
Neil-B
Posts: 2027
Joined: Sun Mar 22, 2020 5:52 pm
Hardware configuration: 1: 2x Xeon E5-2697v3@2.60GHz, 512GB DDR4 LRDIMM, SSD Raid, Win10 Ent 20H2, Quadro K420 1GB, FAH 7.6.21
2: Xeon E3-1505Mv5@2.80GHz, 32GB DDR4, NVME, Win10 Pro 20H2, Quadro M1000M 2GB, FAH 7.6.21 (actually have two of these)
3: i7-960@3.20GHz, 12GB DDR3, SSD, Win10 Pro 20H2, GTX 750Ti 2GB, GTX 1080Ti 11GB, FAH 7.6.21
Location: UK

Re: Can't choose large number of CPU cores

Post by Neil-B »

Guess you are using Windows ... 32 threads is the limit for a CPU slot and tends to work nicely, and if possible you want one slot this large ... a second 12-thread CPU slot would leave 4 threads free - you may need no more than that as FAH runs at low priority and your other stuff should take precedence ... if the servers end up pure FAH then I'd put the 16 remaining threads in the 2nd slot ... some people will argue the need to leave some threads free to avoid contention issues, but my experience is that server-grade systems running Xeons actually cope OK even if you "max" the thread count - I get the best from my twin 14-core, 56-thread system when running 32/56 and 24/56 slots.
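For anyone wanting to work out the slot sizes for their own thread count, the arithmetic above can be sketched roughly as follows. This is only an illustration, assuming the 32-thread-per-slot cap described in this post; the function name `split_into_slots` and its parameters are made up for the example and are not part of the FAH client.

```python
def split_into_slots(total_threads, reserve=0, max_per_slot=32):
    """Split usable threads into CPU slots of at most max_per_slot each.

    max_per_slot=32 is an assumption taken from the limit described in
    the post above, not a value read from the FAH client itself.
    """
    if total_threads < 0 or reserve < 0 or max_per_slot < 1:
        raise ValueError("thread counts must be non-negative, cap at least 1")

    # Threads left over for folding after holding some back for other work.
    usable = max(total_threads - reserve, 0)

    slots = []
    while usable > 0:
        size = min(usable, max_per_slot)  # fill the largest slot first
        slots.append(size)
        usable -= size
    return slots


if __name__ == "__main__":
    print(split_into_slots(48, reserve=4))  # 48 threads, 4 kept free
    print(split_into_slots(48))             # 48 threads, all folding
    print(split_into_slots(56))             # twin 14-core, 56 threads
```

Filling the largest slot first matches the advice to have one slot at the cap if possible; whether a particular slot size behaves well in practice is a separate question.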

As to GPU issues I won't try to offer advice - they confuse me !!
2x Xeon E5-2697v3, 512GB DDR4 LRDIMM, SSD Raid, W10-Ent, Quadro K420
Xeon E3-1505Mv5, 32GB DDR4, NVME, W10-Pro, Quadro M1000M
i7-960, 12GB DDR3, SSD, W10-Pro, GTX1080Ti
i9-10850K, 64GB DDR4, NVME, W11-Pro, RTX3070

(Green/Bold = Active)

Re: Can't choose large number of CPU cores

Post by bambihunter »

Thanks, Neil. I appreciate the response. So it is a Windows limitation then? Or a limitation of the Windows FAH client?

Yes, it is running Windows Server 2012 R2 at the moment with VMs in Hyper-V, though if I keep it as a partial work server it will be moved to VMware. It looks like your top system is similar to this server; it is a Dell T630. I have been reading up on https://flings.vmware.com/vmware-applia ... lding-home as it sounds like it could be viable. I still need to have the server available so that I CAN turn folding down if needed. I am the local SysAdmin for our company and sometimes I have to take sandbox data and manipulate/test it. Once I finish getting the rest of the stuff off the blade server, THEN I can crank up the CPU folding to join up with the decent output of my half-dozen GPUs.

Re: Can't choose large number of CPU cores

Post by Neil-B »

I have seen posts that imply enterprise versions can do more than 32-thread slots ... Mine didn't and I never felt the desire to work out why ... I could run the latest server product as I have a license, but I'm happy just running it in Win10 Ent with a base system installation of FAH ... For work-related use I'll tend to rebuild to the latest drop of Server and run everything in Hyper-V, as all I do is experimental/sandbox-style proof-of-concept work - and when doing that I am thrashing the kit and FAH has to take a break.

There can be some issues with thread counts higher than 32 (people do run them under Linux) as those slot sizes are tested less (maybe never) and some odder things can happen with the way GROMACS in the FAHCore splits up the thread usage ... Another reason I have been relaxed about just using 32 threads as the max.
Rel25917
Posts: 303
Joined: Wed Aug 15, 2012 2:31 am

Re: Can't choose large number of CPU cores

Post by Rel25917 »

If the failed GPU units are all from projects 13404 or 13405 I wouldn't worry too much; they have a higher than normal failure rate. If other projects are failing there could be a problem, and we would need to see the logs from a failure to have any idea why.
MeeLee
Posts: 1375
Joined: Tue Feb 19, 2019 10:16 pm

Re: Can't choose large number of CPU cores

Post by MeeLee »

Just make sure you have enough headroom on the PSU. If you use a dual PSU system, try to see if some sort of logger notices electrical spikes.
Nvidia GPUs are very susceptible to voltage drops or spikes.
Sometimes setting the fan curve to high helps reduce voltage spikes, but it works against voltage droop...

Under load, my GPUs get somewhere between 11.5 and 11.8 V.
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud

Re: Can't choose large number of CPU cores

Post by PantherX »

Assuming that you're using work hardware, please make sure that you have permission (generally written) from people authorized to make those decisions (Internal IT, TL, Manager, CISO, CTO, GM, EGM, etc.) as per the EULA.

If you have to run F@H in a VM, consider using a Linux-based one. Generally speaking, the current version of FahCore_a7 is more efficient when running in Linux than in Windows.

Re: Can't choose large number of CPU cores

Post by bambihunter »

These are my own systems, PantherX. I buy some of the equipment as we retire it, or from other places. For about 10 years I had my own I.T. consulting business, which was why I bought these initially. That's interesting that the core performs better in Linux. I used to use Linux for everything at home but haven't used it much in at least 10 years, maybe 15, except for a bootable ISO to fix issues on Windows servers/PCs.

MeeLee, very good tips. One should never underestimate the value of a good PSU. I retired my old gaming system a year ago and built a new HEDT with everything new, except I reused the 1200 W PSU. When I fired it up, the same game was crashing to desktop at the same point. On a hunch, I put a NIB warranty replacement 1k power supply in. That problem went away, but occasionally it would just click and shut clear off. This led me to believe 1k wasn't enough and it was shutting off in protect mode. So, I bought a 1600 watt Corsair and it has run flawlessly since. The old system with the 980s now has a known good 1200 watt power supply (same model as the flaky one). It actually has adjustable voltage, so one can step up the 12 V rail a bit if there's too much voltage droop.

This newer gaming system (with a little work here and there) is a great folder. A pair of 2080 Tis running on an 18-core i9-9980XE. Stock it is 3.0 GHz but I fold 24/7 with it at 4.7 GHz on all cores when not using it for gaming or work. The CPUs on this and on my servers really don't do much PPD comparatively. Maybe 80k per day (I can't remember), compared to my 2 mil+ from each of the 2080s. If I were smart, I'd sell some of the servers and buy another video card.
This isn't a bad OC from a workstation board with 128gb of RAM:
https://valid.x86.fr/y1srm1