This title appears in the Scientific Report :
2019
Please use the identifier:
http://dx.doi.org/10.1007/978-3-030-11987-4_6 in citations.
Please use the identifier: http://hdl.handle.net/2128/21896 in citations.
SCIPHI Score-P and Cube Extensions for Intel Phi
SCIPHI Score-P and Cube Extensions for Intel Phi
The Knights Landing processors offers unique features with regards to memory hierarchy and vectorization capabilities. To improve tool support within these two areas, we present extensions to the Score-P measurement infrastructure and the Cube report explorer. With the Knights Landing edition, Intel...
Saved in:
Personal Name(s): | Schlütter, Marc (Corresponding author) |
---|---|
Feld, Christian / Saviankou, Pavel / Knobloch, Michael / Hermanns, Marc-André / Mohr, Bernd | |
Contributing Institute: |
JARA - HPC; JARA-HPC Jülich Supercomputing Center; JSC |
Published in: |
Tools for High Performance Computing 2017 |
Imprint: |
Cham
Springer International Publishing
2019
|
Physical Description: |
85-104 |
ISBN: |
978-3-030-11987-4 |
DOI: |
10.1007/978-3-030-11987-4_6 |
Conference: | 11th International Workshop on Parallel Tools for High Performance Computing, Dresden (Germany), 2017-09-11 - 2017-09-12 |
Document Type: |
Contribution to a book Contribution to a conference proceedings |
Research Program: |
Computational Science and Mathematical Methods |
Link: |
Get full text OpenAccess OpenAccess |
Publikationsportal JuSER |
Please use the identifier: http://hdl.handle.net/2128/21896 in citations.
The Knights Landing processors offers unique features with regards to memory hierarchy and vectorization capabilities. To improve tool support within these two areas, we present extensions to the Score-P measurement infrastructure and the Cube report explorer. With the Knights Landing edition, Intel introduced a new memory architecture, utilizing two types of memory, MCDRAM and DDR4 SDRAM. To assist the user in the decision where to place data structures, we introduce a MCDRAM candidate metric to the Cube report explorer. In addition we track all MCDRAM allocations through the hbwmalloc interface, providing memory metrics like leaked memory or the high-water mark on a per-region basis, as already known for the ubiquitous malloc/free. A Score-P metric plugin that records memory statistics via numastat on a per process level enables a timeline analysis using the Vampir toolset. To get the best performance out of , the large vector processing units need to be utilized effectively. The ratio between computation and data access and the vector processing unit (VPU) intensity are introduced as metrics to identify vectorization candidates on a per-region basis. The Portable Hardware Locality (hwloc) Broquedis et al. (hwloc: a generic framework for managing hardware affinities in hpc applications, 2010 [2]) library allows us to visualize the distribution of the KNL-specific performance metrics within the Cube report explorer, taking the hardware topology consisting of processor tiles and cores into account. |