Skip to content
VuFind
  • 0 Items in e-Shelf (Full)
  • History
  • User Account
  • Logout
  • User Account
  • Help
    • English
    • Deutsch
  • Books & more
  • Articles & more
  • JuSER
Advanced
 
  • Literature Request
  • Cite this
  • Email this
  • Export
    • Export to RefWorks
    • Export to EndNoteWeb
    • Export to EndNote
    • Export to MARC
    • Export to MARCXML
    • Export to BibTeX
  • Favorites
  • Add to e-Shelf Remove from e-Shelf


QR Code
This title appears in the Scientific Report : 2016 

High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods

High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods

One of the greatest efforts of computational scientists is to translate the mathematical model describing a class of physical phenomena into large and complex codes. Many of these codes face the difficulty of implementing the mathematical operations in the model in terms of low level optimized kerne...

More

Saved in:
Personal Name(s): Di Napoli, Edoardo (Corresponding author)
Peise, Elmar (Corresponding author) / Hrywniak, Markus (Corresponding author) / Bientinesi, Paolo (Corresponding author)
Contributing Institute: Jülich Supercomputing Center; JSC
JARA - HPC; JARA-HPC
Published in: Computer physics communications, 211 (2017) S. 61-72
Imprint: Amsterdam North Holland Publ. Co. 2017
DOI: 10.1016/j.cpc.2016.10.003
Document Type: Journal Article
Research Program: Computational Science and Mathematical Methods
Publikationsportal JuSER
Please use the identifier: http://dx.doi.org/10.1016/j.cpc.2016.10.003 in citations.

  • Description
  • Staff View

One of the greatest efforts of computational scientists is to translate the mathematical model describing a class of physical phenomena into large and complex codes. Many of these codes face the difficulty of implementing the mathematical operations in the model in terms of low level optimized kernels offering both performance and portability. Legacy codes suffer from the additional curse of rigid design choices based on outdated performance metrics (e.g. minimization of memory footprint). Using a representative code from the Materials Science community, we propose a methodology to restructure the most expensive operations in terms of an optimized combination of dense linear algebra (BLAS3) kernels. The resulting algorithm guarantees an increased performance and an extended life span of this code, enabling larger scale simulations.

  • Forschungszentrum Jülich
  • Central Library (ZB)
  • Powered by VuFind 6.1.1
Loading...