Description: Routing brain traffic through the von Neumann bottleneck: Efficient cache usage in spiking neural network simulation code on general purpose computers

This title appears in the Scientific Report: 2022

Personal Name(s):	Pronold, J. (Corresponding author)
	Jordan, J. / Wylie, B. J. N. / Kitayama, Itaru / Diesmann, M. / Kunkel, Susanne (Corresponding author)
Contributing Institute:	Jara-Institut Brain structure-function relationships; INM-10 Computational and Systems Neuroscience; IAS-6 Computational and Systems Neuroscience; INM-6
Published in:	Parallel Computing, 113 (2022), p. 102952
Imprint:	Amsterdam [et al.]: North-Holland, Elsevier Science, 2022
DOI:	10.1016/j.parco.2022.102952
Document Type:	Journal Article
Research Program:	The Next-Generation Integrated Simulation of Living Matter; Doctoral candidate without specific funding; Open-access publication costs / 2022 - 2024 / Forschungszentrum Jülich (OAPKFZJ); GRK 2416: MultiSenses-MultiScales: New approaches to elucidating neuronal multisensory integration; Advanced Computing Architectures; Brain-Scale Simulations; DEEP - Extreme Scale Technologies; Human Brain Project Specific Grant Agreement 3; Human Brain Project Specific Grant Agreement 2; Emerging NC Architectures
Link:	OpenAccess
	JuSER publication portal

Please use the identifier: http://dx.doi.org/10.1016/j.parco.2022.102952 in citations.
Please use the identifier: http://hdl.handle.net/2128/31621 in citations.

Simulation is a third pillar next to experiment and theory in the study of complex dynamic systems such as biological neural networks. Contemporary brain-scale networks correspond to directed random graphs of a few million nodes, each with an in-degree and out-degree of several thousand edges, where nodes and edges correspond to the fundamental biological units, neurons and synapses, respectively. The activity in neuronal networks is also sparse. Each neuron occasionally transmits a brief signal, called a spike, via its outgoing synapses to the corresponding target neurons. In distributed computing these targets are scattered across thousands of parallel processes. The spatial and temporal sparsity represents an inherent bottleneck for simulations on conventional computers: irregular memory-access patterns cause poor cache utilization. Using an established neuronal network simulation code as a reference implementation, we investigate how common techniques to recover cache performance, such as software-induced prefetching and software pipelining, can benefit a real-world application. The algorithmic changes reduce simulation time by up to 50%. The study exemplifies that many-core systems assigned with an intrinsically parallel computational problem can alleviate the von Neumann bottleneck of conventional computer architectures.
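
The abstract names software-induced prefetching as one way to recover cache performance under irregular memory access. The following is a minimal sketch of that general idea, not the code of the reference implementation: the `Spike` struct, the function names, and the prefetch distance of 8 are all hypothetical, and `__builtin_prefetch` is a GCC/Clang builtin that is simply skipped on other compilers. While one spike is delivered, the loop requests the cache line that a later spike will touch, so the memory latency can overlap with useful work.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical spike event: index of the target neuron and a synaptic weight.
struct Spike {
    std::size_t target;
    double weight;
};

// Baseline delivery: every iteration touches an unpredictable element of
// `input`, so the hardware prefetcher cannot anticipate the access pattern.
void deliver_naive(const std::vector<Spike>& spikes, std::vector<double>& input) {
    for (const Spike& s : spikes) {
        input[s.target] += s.weight;
    }
}

// Delivery with software prefetching: the spike list itself is read
// sequentially, so the target of a spike `distance` iterations ahead is
// cheap to look up and can be requested early.
// The default distance of 8 is an assumption, not a tuned value.
void deliver_prefetch(const std::vector<Spike>& spikes, std::vector<double>& input,
                      std::size_t distance = 8) {
    const std::size_t n = spikes.size();
    for (std::size_t i = 0; i < n; ++i) {
#if defined(__GNUC__) || defined(__clang__)
        if (i + distance < n) {
            // Second argument 1: prefetch for writing.
            // Third argument 1: low expected temporal locality.
            __builtin_prefetch(&input[spikes[i + distance].target], 1, 1);
        }
#endif
        input[spikes[i].target] += spikes[i].weight;
    }
}
```

Both functions produce the same result; the prefetch is only a hint to the memory system. Whether it pays off depends on the hardware, the size of `input` relative to the cache, and the chosen distance, so any speedup would need to be measured.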