7.6.9 Software issue in Linux

If you're new to FAH and need help getting started or you have very basic questions, start here.

Moderators: Site Moderators, FAHC Science Team

Post Reply
Ahnilated
Posts: 3
Joined: Sun Apr 19, 2020 10:42 am

7.6.9 Software issue in Linux

Post by Ahnilated »

Hello everyone,

System: AMD 2950x (16 cores/32 threads), 128GB ram, Nvidia GTX 1080Ti, Ubuntu 5.3.0-46-generic kernel.

I seem to be running into a bug in the 7.6.9 software, output pasted below. I just keep getting this error over and over when it tries to start up the CPU thread now. I didn't get this error in the previous version, don't remember exactly what it was. Folding power set to full. Let me know if you need any other information.

10:33:30:WU00:FS00:0xa7:ERROR:Fatal error:
10:33:30:WU00:FS00:0xa7:ERROR:There is no domain decomposition for 25 ranks that is compatible with the given box and a minimum cell size of 1.37225 nm
10:33:30:WU00:FS00:0xa7:ERROR:Change the number of ranks or mdrun option -rcon or -dds or your LINCS settings
10:33:30:WU00:FS00:0xa7:ERROR:Look in the log file for details on the domain decomposition
10:33:30:WU00:FS00:0xa7:ERROR:For more information and tips for troubleshooting, please check the GROMACS
10:33:30:WU00:FS00:0xa7:ERROR:website at (removed to be able to post)
10:33:30:WU00:FS00:0xa7:ERROR:-------------------------------------------------------
10:33:34:WU00:FS00:0xa7:WARNING:Unexpected exit() call
10:33:34:WU00:FS00:0xa7:WARNING:Unexpected exit from science code
10:33:34:WU00:FS00:0xa7:Saving result file ../logfile_01.txt
10:33:34:WU00:FS00:0xa7:Saving result file md.log
10:33:34:WU00:FS00:0xa7:Saving result file science.log
10:33:35:WU00:FS00:FahCore returned: INTERRUPTED (102 = 0x66)
ajm
Posts: 754
Joined: Sat Mar 21, 2020 5:22 am
Location: Lucerne, Switzerland

Re: 7.6.9 Software issue in Linux

Post by ajm »

This is a problem with the number of threads allocated to FAH. It looks like you have allocated 25 threads, which is a bad choice for domain decomposition. The number should be divisible by 2 and/or 3. You can edit the CPU slot in Fahcontrol and enter there the number of threads you want to allocate. 30 for example. Or 24 and another slot at 6. Whatever, but always multiples of 2 or, and possibly and 3.
PantherX
Site Moderator
Posts: 7020
Joined: Wed Dec 23, 2009 9:33 am
Hardware configuration: V7.6.21 -> Multi-purpose 24/7
Windows 10 64-bit
CPU:2/3/4/6 -> Intel i7-6700K
GPU:1 -> Nvidia GTX 1080 Ti
§
Retired:
2x Nvidia GTX 1070
Nvidia GTX 675M
Nvidia GTX 660 Ti
Nvidia GTX 650 SC
Nvidia GTX 260 896 MB SOC
Nvidia 9600GT 1 GB OC
Nvidia 9500M GS
Nvidia 8800GTS 320 MB

Intel Core i7-860
Intel Core i7-3840QM
Intel i3-3240
Intel Core 2 Duo E8200
Intel Core 2 Duo E6550
Intel Core 2 Duo T8300
Intel Pentium E5500
Intel Pentium E5400
Location: Land Of The Long White Cloud
Contact:

Re: 7.6.9 Software issue in Linux

Post by PantherX »

Welcome to the F@H Forum Ahnilated,

Please look up the troubleshooting steps for "There is no domain decomposition" in this thread: viewtopic.php?f=19&t=16526 :)
ETA:
Now ↞ Very Soon ↔ Soon ↔ Soon-ish ↔ Not Soon ↠ End Of Time

Welcome To The F@H Support Forum Ӂ Troubleshooting Bad WUs Ӂ Troubleshooting Server Connectivity Issues
Ahnilated
Posts: 3
Joined: Sun Apr 19, 2020 10:42 am

Re: 7.6.9 Software issue in Linux

Post by Ahnilated »

Hello AJM,

Thank you, I have not allocated anything. The CPU slot is set at the default -1. That is why I am saying, this is a bug in the program.

I have removed the bad packet, as per the instructions thanks PantherX, and it is back to crunching fine with the same settings. Until it got a new packet and got the exact same bad one back, PRCG 14576 (0,4259,74). I have deleted it once again and posted about it in the forums.
Post Reply