On the Expressive Power of Neural Networks

AI-generated keywords: Neural Networks Universal Approximation Theorem ReLU-Networks Expressive Power Linear Regions

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper by Jan Holstermann focuses on the expressive power of neural networks.
  • The universal approximation theorem states that wide shallow neural networks can approximate any continuous function on a compact set.
  • Research has been done to determine optimal approximation rates for ReLU-networks in $L^p$-norms with $p \in [1,\infty)$ and to prove a universal approximation theorem for deep narrow ReLU-networks.
  • Holstermann introduces a framework of two expressive powers to answer open questions regarding the expressive power of neural networks.
  • The first expressive power counts the maximal number of linear regions of a function calculated by a ReLU-network, and Holstermann improves upon the best known bounds for this.
  • The second expressive power measures how many times an input has to be transformed before it can be classified correctly by a neural network, and is entirely new.
  • By improving existing bounds on expressive powers and introducing new ones, this work contributes to our understanding of how neural networks work and what they can do.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jan Holstermann

arXiv: 2306.0145v1 - DOI (math.CA)
54 pages

Abstract: In 1989 George Cybenko proved in a landmark paper that wide shallow neural networks can approximate arbitrary continuous functions on a compact set. This universal approximation theorem sparked a lot of follow-up research. Shen, Yang and Zhang determined optimal approximation rates for ReLU-networks in $L^p$-norms with $p \in [1,\infty)$. Kidger and Lyons proved a universal approximation theorem for deep narrow ReLU-networks. Telgarsky gave an example of a deep narrow ReLU-network that cannot be approximated by a wide shallow ReLU-network unless it has exponentially many neurons. However, there are even more questions that still remain unresolved. Are there any wide shallow ReLU-networks that cannot be approximated well by deep narrow ReLU-networks? Is the universal approximation theorem still true for other norms like the Sobolev norm $W^{1,1}$? Do these results hold for activation functions other than ReLU? We will answer all of those questions and more with a framework of two expressive powers. The first one is well-known and counts the maximal number of linear regions of a function calculated by a ReLU-network. We will improve the best known bounds for this expressive power. The second one is entirely new.

Submitted to arXiv on 31 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2306.0145v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper "On the Expressive Power of Neural Networks" by Jan Holstermann delves into the expressive power of neural networks, specifically focusing on the universal approximation theorem and its implications. The universal approximation theorem, first proven by George Cybenko in 1989, states that wide shallow neural networks can approximate any continuous function on a compact set. This sparked a lot of research, including determining optimal approximation rates for ReLU-networks in $L^p$-norms with $p \in [1,\infty)$ by Shen, Yang, and Zhang and proving a universal approximation theorem for deep narrow ReLU-networks by Kidger and Lyons. To answer open questions regarding the expressive power of neural networks such as whether there are any wide shallow ReLU-networks that cannot be well approximated by deep narrow ReLU-networks or if the universal approximation theorem still holds true for other norms like the Sobolev norm $W^{1,1}$, Holstermann introduces a framework of two expressive powers. The first one is well-known and counts the maximal number of linear regions of a function calculated by a ReLU-network; Holstermann improves upon the best known bounds for this expressive power. The second one is entirely new and measures how many times an input has to be transformed before it can be classified correctly by a neural network. By improving existing bounds on expressive powers and introducing new ones, this work contributes to our understanding of how neural networks work and what they can do. Holstermann's paper provides insights into both the limitations and capabilities of neural networks in approximating functions.
Created on 16 Jun. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.