Description: Optimized parallel simulations of analytic bond-order potentials on hybrid shared/distributed memory with MPI and OpenMP

This title appears in the Scientific Report : 2019

Optimized parallel simulations of analytic bond-order potentials on hybrid shared/distributed memory with MPI and OpenMP

Analytic bond-order potentials (BOPs) allow to obtain a highly accurate description of interatomic interactions at a reasonablecomputational cost. However, for simulations with very large systems, the high memory demands require theuse of a parallel implementation, which at the same time also optimi...

Personal Name(s):	Teijeiro, Carlos (Corresponding author)
	Hammerschmidt, Thomas / Drautz, Ralf / Sutmann, Godehard
Contributing Institute:	Jülich Supercomputing Center; JSC
Published in:	The international journal of high performance computing applications, 32 (2019) 2, S. 227 - 241
Imprint:	Thousand Oaks, Calif. Sage Science Press 2019
DOI:	10.1177/1094342017727060
Document Type:	Journal Article
Research Program:	Computational Science and Mathematical Methods
	Publikationsportal JuSER

Please use the identifier: http://dx.doi.org/10.1177/1094342017727060 in citations.

Analytic bond-order potentials (BOPs) allow to obtain a highly accurate description of interatomic interactions at a reasonablecomputational cost. However, for simulations with very large systems, the high memory demands require theuse of a parallel implementation, which at the same time also optimizes the use of computational resources. The calculationsof analytic BOPs are performed for a restricted volume around every atom and therefore have shown to be wellsuited for a message passing interface (MPI)-based parallelization based on a domain decomposition scheme, in whichone process manages one big domain using the entire memory of a compute node. On the basis of this approach, thepresent work focuses on the analysis and enhancement of its performance on shared memory by using OpenMP threadson each MPI process, in order to use many cores per node to speed up computations and minimize memory bottlenecks.Different algorithms are described and their corresponding performance results are presented, showing significantperformance gains for highly parallel systems with hybrid MPI/OpenMP simulations up to several thousands of threads.