Influence-Directed Explanations for Deep Convolutional Networks

AI-generated keywords: Influence-Directed Explanations Deep Convolutional Networks Inner Workings Interpretability Neural Networks

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors: Klas Leino, Shayak Sen, Anupam Datta, Matt Fredrikson, Linyi Li
  • Novel approach: Utilizing influence-directed explanations to understand deep neural networks
  • Methodology: Using influence measure grounded in axioms to interpret concepts represented by influential neurons
  • Validation: Thorough evaluation on convolutional neural networks trained on ImageNet
  • Key strengths of influence-directed explanations:
  • Identify influential concepts with generalizability across instances
  • Distill the core essence of what the network has learned about a class
  • Isolate individual features crucial for decision-making and differentiation between classes
  • Findings: Shed light on deep neural network operations and enhance understanding of complex machine learning models
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Klas Leino, Shayak Sen, Anupam Datta, Matt Fredrikson, Linyi Li

To appear in International Test Conference 2018

Abstract: We study the problem of explaining a rich class of behavioral properties of deep neural networks. Distinctively, our influence-directed explanations approach this problem by peering inside the network to identify neurons with high influence on a quantity and distribution of interest, using an axiomatically-justified influence measure, and then providing an interpretation for the concepts these neurons represent. We evaluate our approach by demonstrating a number of its unique capabilities on convolutional neural networks trained on ImageNet. Our evaluation demonstrates that influence-directed explanations (1) identify influential concepts that generalize across instances, (2) can be used to extract the "essence" of what the network learned about a class, and (3) isolate individual features the network uses to make decisions and distinguish related classes.

Submitted to arXiv on 11 Feb. 2018

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1802.03788v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Influence-Directed Explanations for Deep Convolutional Networks," authors Klas Leino, Shayak Sen, Anupam Datta, Matt Fredrikson, and Linyi Li delve into the intricate problem of explaining a wide array of behavioral properties exhibited by deep neural networks. Their novel approach involves utilizing influence-directed explanations to peer into the network's inner workings and pinpoint neurons that hold significant sway over a specific quantity and distribution of interest. By employing an influence measure grounded in axioms, the researchers are able to provide interpretations for the concepts represented by these influential neurons. To validate the effectiveness of their methodology, the team conducts a thorough evaluation on convolutional neural networks trained on ImageNet. Through this evaluation, they showcase several key strengths of influence-directed explanations. Firstly, these explanations successfully identify influential concepts that exhibit generalizability across instances. Secondly, they demonstrate the capability to distill the core "essence" of what the network has learned about a particular class. Lastly, the approach excels in isolating individual features that play a crucial role in the network's decision-making process and its ability to differentiate between closely related classes. The findings presented in this study not only shed light on how deep neural networks operate but also highlight the potential for influence-directed explanations to enhance our understanding of complex machine learning models. With their innovative approach and compelling results, Leino et al. 's research contributes significantly to advancing interpretability in deep convolutional networks and lays a solid foundation for future investigations in this domain.
Created on 28 Jul. 2025

Assess the quality of the AI-generated content by voting

Score: -1

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.