Talks | Ravid Shwartz Ziv

2021

Compression in deep learning - an information theory perspective

While DNNs have achieved many breakthroughs, our understanding of their internal structure, optimization process, and generalization is poor, and we often treat them as black boxes. We attempt to resolve these issues by suggesting that DNNs learn to optimize the Information Bottleneck (IB) principle - the tradeoff between information compression and prediction quality. In the first part of the talk, I presented this approach, showing an analytical and numerical study of DNNs in the information plane. This analysis reveals how the training process compresses the input to an optimal, efficient representation. I discussed recent works inspired by this analysis and show how we can apply them to real-world problems. In the second part of the talk, I will discuss information in infinitely-wide neural networks using recent results in Neural Tangent Kernels (NTK) networks. The NTK allows us to derive many tractable information-theoretic quantities. By utilizing these derivations, we can do an empirical search to find the important information-theoretic quantities that affect generalization in DNNs. I aslo presented the Dual Information Bottleneck (dualIB) framework, to find an optimal representation that resolves some of the drawbacks of the original IB. A theoretical analysis of the dualIB shows the structure of its solution and its ability to preserve the original distribution’s statistics. Within this, we focused on the variational form of the dualIB, allowing its application to DNNs.

Sep 22, 2021 12:00 AM NYU Center for Data Science

Video

Compression in deep learning - an information theory perspective

2019

Information in Infinite Ensembles of Infinitely-Wide Neural Networks

Finding generalization signals using information for infinitely-wide neural networks.

Dec 8, 2019 12:00 AM 2nd Symposium on Advances in Approximate Bayesian Inference, Vancouver, Canada

PDF

Information in Infinite Ensembles of Infinitely-Wide Neural Networks

2018

Representation Compression in Deep Neural Network

An information theoretic viewpoint on the behavior of deep networks optimization processes and their generalization abilities by the information plane and how compression can help.

Aug 29, 2018 12:00 AM Google PhD Fellowship Summit, Mountain View, CA, USA

Representation Compression in Deep Neural Network

On the Information Theory of Deep Neural Networks

Understanding Deep Neural Networks with the information bottleneck principle.

May 28, 2018 1:00 PM Data Science Summit, Tel Aviv, Israel

Video

On the Information Theory of Deep Neural Networks

2017

Open the Black Box of Deep Neural Networks

Where is the information in deep neural networks? trying to find it by looking on the information plane.

Dec 28, 2017 12:00 AM Future Leaders of AI Retreat (FLAIR), New York University Shanghai, China

Slides

Open the Black Box of Deep Neural Networks

0001

Analysis and Theory of Perceptual Learning in Auditory Cortex

Analysing perceptual learning of pure tones in the auditory cortex. Using a novel computational model, we show that overrepresentation of the learned tones does not necessarily improve along the training

Jan 1, 0001 12:00 AM MMax Planck - Hebrew University Center for Sensory Processing of the Brain in Action meeting, Munich, Germany

Slides

Analysis and Theory of Perceptual Learning in Auditory Cortex