This title appears in the Scientific Report 2022.
Decomposing neural networks as mappings of correlation functions
Personal Name(s): Fischer, Kirsten (Corresponding author); Rene, Alexandre; Keup, Christian; Layer, Moritz; Dahmen, David; Helias, Moritz
Contributing Institute: Jara-Institut Brain structure-function relationships (INM-10); Computational and Systems Neuroscience (IAS-6); Computational and Systems Neuroscience (INM-6)
Imprint: 2022
Conference: Quantum Information Seminar, Aachen (Germany), 2022-12-12
Document Type: Talk (non-conference)
Research Program: Advanced Computing Architectures; Theory of multi-scale neuronal networks; Transparent Deep Learning with Renormalized Flows; Emerging NC Architectures; Computational Principles; Recurrence and stochasticity for neuro-inspired computation
Publication portal: JuSER
Abstract: Understanding the functional principles of information processing in deep neural networks continues to be a challenge, in particular for networks with trained and thus non-random weights. To address this issue, we study the mapping between probability distributions implemented by a deep feed-forward network. We characterize this mapping as an iterated transformation of distributions, where the non-linearity in each layer transfers information between different orders of correlation functions. This allows us to identify essential statistics in the data, as well as different information representations that can be used by neural networks. Applied to an XOR task and to MNIST, we show that correlations up to second order predominantly capture the information processing in the internal layers, while the input layer also extracts higher-order correlations from the data. This analysis provides a quantitative and explainable perspective on classification.
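The picture sketched in the abstract can be made concrete on the XOR task. In a ±1 coding, the XOR label carries no information in first- or second-order correlations with the inputs; it lives entirely in a third-order correlation. A non-linear layer can then move this higher-order information into lower-order statistics of its activations. The following is a minimal illustrative sketch, not the authors' method: the weights are random (hypothetical), and the empirical cumulants are computed directly from the four XOR patterns.

```python
import numpy as np

rng = np.random.default_rng(0)

# XOR task in +/-1 coding: four input patterns, label y = x1 * x2
X = np.array([[-1, -1], [-1, 1], [1, -1], [1, 1]], dtype=float)
y = X[:, 0] * X[:, 1]

# First- and second-order correlation functions of the input
mean_in = X.mean(axis=0)                       # first order: zero
cov_in = np.cov(X, rowvar=False, bias=True)    # second order: identity

# The label is uncorrelated with each input at second order ...
second_xy = np.array([np.mean(X[:, 0] * y), np.mean(X[:, 1] * y)])
# ... but fully determined by the third-order correlation <x1 x2 y>
third_xy = np.mean(X[:, 0] * X[:, 1] * y)

# An illustrative (untrained, random) hidden layer with a tanh
# non-linearity: the non-linearity mixes information across orders.
W = rng.normal(size=(2, 3))
H = np.tanh(X @ W)

# Second-order statistics of the hidden activations now differ from
# the input's, and third-order correlations appear among them.
mean_h = H.mean(axis=0)
cov_h = np.cov(H, rowvar=False, bias=True)
Hc = H - mean_h
third_h = np.einsum('ni,nj,nk->ijk', Hc, Hc, Hc) / H.shape[0]

print(second_xy, third_xy)
```

Here `second_xy` vanishes while `third_xy` equals 1, so a purely linear readout of input means and covariances cannot solve XOR; this is the sense in which the input layer must extract higher-order correlations, while subsequent layers can work with statistics of lower order.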