This title appears in the Scientific Report :
2023
Please use the identifier:
http://dx.doi.org/10.34734/FZJ-2023-04994 in citations.
Please use the identifier: http://dx.doi.org/10.1145/3624062.3624205 in citations.
An Event Model for Trace-Based Performance Analysis of MPI Partitioned Point-to-Point Communication
An Event Model for Trace-Based Performance Analysis of MPI Partitioned Point-to-Point Communication
The MPI 4.0 standard introduced the concept of partitioned point-to-point communication. One facet that may help in encouraging application developers to use this new concept in their programs is the availability of proper tool support in a timely manner. In this paper, we therefore propose nine new...
Saved in:
Personal Name(s): | Thärigen, Isabel (Corresponding author) |
---|---|
Hermanns, Marc-André / Geimer, Markus | |
Contributing Institute: |
Jülich Supercomputing Center; JSC |
Published in: |
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis |
Imprint: |
New York, NY, USA
ACM
2023
|
Physical Description: |
1357–1367 |
DOI: |
10.34734/FZJ-2023-04994 |
DOI: |
10.1145/3624062.3624205 |
Conference: | Workshop on Programming and Performance Visualization Tools, Denver, CO (USA), 2023-11-12 - 2023-11-12 |
Document Type: |
Contribution to a book Contribution to a conference proceedings |
Research Program: |
Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups |
Link: |
OpenAccess |
Publikationsportal JuSER |
Please use the identifier: http://dx.doi.org/10.1145/3624062.3624205 in citations.
The MPI 4.0 standard introduced the concept of partitioned point-to-point communication. One facet that may help in encouraging application developers to use this new concept in their programs is the availability of proper tool support in a timely manner. In this paper, we therefore propose nine new events extending the OTF2 event model to accurately represent the runtime behavior of partitioned point-to-point communication. We then demonstrate the suitability of these extensions with three different use cases in the context of performance analysis. In particular, we showcase a prototype implementation of an extended waitstate analysis in the Scalasca trace analyzer, and discuss further potential use cases in the realm of trace visualization and simulation. |