KAN: Kolmogorov-Arnold Networks

AI-generated keywords: Kolmogorov-Arnold Networks Multi-Layer Perceptrons learnable activation functions interpretability AI + Science

AI-generated Key Points

  • Kolmogorov-Arnold Networks (KANs) developed as an alternative to Multi-Layer Perceptrons (MLPs)
  • KANs have learnable activation functions on edges, outperforming MLPs in accuracy and interpretability
  • Faster neural scaling laws and easier visualization for human users
  • Assist scientists in discovering mathematical and physical laws through examples in mathematics and physics
  • Symbolic formulas can be automatically discovered by training KANs with different shapes
  • Unsupervised learning mode of KANs uncovers additional relations in knot invariants
  • Achieve better accuracy with fewer parameters compared to Deepmind's MLP architecture
  • Potential for AI + Science tasks to be less computationally demanding, enabling new scientific discoveries even on personal laptops
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Ziming Liu, Yixuan Wang, Sachin Vaidya, Fabian Ruehle, James Halverson, Marin Soljačić, Thomas Y. Hou, Max Tegmark

Accepted by International Conference on Learning Representations (ICLR) 2025 (conference version: https://openreview.net/forum?id=Ozo7qJ5vZi). Codes are available at https://github.com/KindXiaoming/pykan
License: CC BY 4.0

Abstract: Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs). While MLPs have fixed activation functions on nodes ("neurons"), KANs have learnable activation functions on edges ("weights"). KANs have no linear weights at all -- every weight parameter is replaced by a univariate function parametrized as a spline. We show that this seemingly simple change makes KANs outperform MLPs in terms of accuracy and interpretability. For accuracy, much smaller KANs can achieve comparable or better accuracy than much larger MLPs in data fitting and PDE solving. Theoretically and empirically, KANs possess faster neural scaling laws than MLPs. For interpretability, KANs can be intuitively visualized and can easily interact with human users. Through two examples in mathematics and physics, KANs are shown to be useful collaborators helping scientists (re)discover mathematical and physical laws. In summary, KANs are promising alternatives for MLPs, opening opportunities for further improving today's deep learning models which rely heavily on MLPs.

Submitted to arXiv on 30 Apr. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2404.19756v5

Researchers have developed Kolmogorov-Arnold Networks (KANs) as a promising alternative to Multi-Layer Perceptrons (MLPs), inspired by the Kolmogorov-Arnold representation theorem. Unlike MLPs, KANs have learnable activation functions on edges instead of fixed ones on nodes. This simple change allows KANs to outperform MLPs in terms of accuracy and interpretability. Additionally, KANs possess faster neural scaling laws and are more easily visualized and interacted with by human users. Through examples in mathematics and physics, researchers demonstrate how KANs can assist scientists in discovering or rediscovering mathematical and physical laws. By training KANs with different shapes, symbolic formulas can be automatically discovered, providing a balance between simplicity and accuracy. In the field of "AI for Math," KANs' unsupervised learning mode can also uncover additional relations in knot invariants. In comparison to Deepmind's MLP architecture, researchers find that KANs achieve better accuracy with fewer parameters in signature classification problems. This discovery highlights the potential for AI + Science tasks to be less computationally demanding than previously thought, opening up possibilities for new scientific discoveries even on personal laptops. Overall, this study showcases the capabilities of Kolmogorov-Arnold Networks as promising alternatives to Multi-Layer Perceptrons. They offer improved performance in accuracy and interpretability across various scientific domains.
Created on 03 Apr. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.