Enlarging Instance-specific and Class-specific Information for Open-set Action Recognition

AI-generated keywords: Open-set Action Recognition, Prototypical Similarity Learning, Information Bottleneck Theory, Instance-specific Information, Class-specific Information

AI-generated Key Points

Open-set action recognition (OSAR) involves rejecting unknown human action cases that fall outside the distribution of the training set.
Existing methods for OSAR focus on learning better uncertainty scores, but they often overlook the importance of feature representations.
The authors propose a novel Prototypical Similarity Learning (PSL) framework to enlarge instance-specific (IS) and class-specific (CS) information in feature representations for better OSAR performance.
CS information is used for inter-class recognition, while IS information is unique to each sample within a class. Both types of information are crucial for OSAR performance.
To enlarge IS information, PSL encourages each instance to have a similarity below 1 with its class prototype, so that more IS information is retained in the learned feature representations.
Video shuffling is introduced into PSL to reduce misclassification of out-of-distribution (OoD) videos that share similar appearances with in-distribution (InD) videos.
Shuffled videos are encouraged to have a similarity below 1 with the original samples, which pushes the network to extract the temporal information that distinguishes them and enlarges CS information.
Experiments demonstrate that PSL significantly boosts both open-set and closed-set performance on multiple benchmarks, achieving state-of-the-art results.
The proposed framework provides a novel perspective on analyzing OSAR tasks based on the information bottleneck theory and highlights the importance of retaining both IS and CS information for optimal performance.


Authors: Jun Cen, Shiwei Zhang, Xiang Wang, Yixuan Pei, Zhiwu Qing, Yingya Zhang, Qifeng Chen

arXiv: 2303.15467v1 - DOI (cs.CV)

To appear at CVPR2023

License: CC BY-NC-SA 4.0

Abstract: Open-set action recognition is to reject unknown human action cases which are out of the distribution of the training set. Existing methods mainly focus on learning better uncertainty scores but dismiss the importance of feature representations. We find that features with richer semantic diversity can significantly improve the open-set performance under the same uncertainty scores. In this paper, we begin with analyzing the feature representation behavior in the open-set action recognition (OSAR) problem based on the information bottleneck (IB) theory, and propose to enlarge the instance-specific (IS) and class-specific (CS) information contained in the feature for better performance. To this end, a novel Prototypical Similarity Learning (PSL) framework is proposed to keep the instance variance within the same class to retain more IS information. Besides, we notice that unknown samples sharing similar appearances to known samples are easily misclassified as known classes. To alleviate this issue, video shuffling is further introduced in our PSL to learn distinct temporal information between original and shuffled samples, which we find enlarges the CS information. Extensive experiments demonstrate that the proposed PSL can significantly boost both the open-set and closed-set performance and achieves state-of-the-art results on multiple benchmarks. Code is available at https://github.com/Jun-CEN/PSL.

Submitted to arXiv on 25 Mar. 2023


Results of the summarizing process for the arXiv paper: 2303.15467v1


Open-set action recognition (OSAR) requires a model to reject unknown human actions that fall outside the distribution of the training set. Existing OSAR methods focus on learning better uncertainty scores but often overlook the importance of feature representations. This paper proposes a Prototypical Similarity Learning (PSL) framework that enlarges the instance-specific (IS) and class-specific (CS) information in feature representations to improve OSAR performance.

The authors begin by analyzing the behavior of feature representations in the open-set problem using information bottleneck theory. They divide the information contained in features into IS and CS information. CS information is used for inter-class recognition, while IS information is unique to each sample within a class. Both types are crucial for OSAR performance. To enlarge IS information, PSL encourages each instance to have a similarity below 1 with its class prototype, which keeps the variance among instances of the same class and retains more IS information in the learned features. The authors also introduce video shuffling into PSL to reduce misclassification of out-of-distribution (OoD) videos that share similar appearances with in-distribution (InD) videos. Shuffled videos are encouraged to have a similarity below 1 with the original samples, which pushes the network to extract the temporal information that distinguishes them and enlarges CS information.

Experiments demonstrate that PSL significantly boosts both open-set and closed-set performance on multiple benchmarks, achieving state-of-the-art results. The framework offers a new perspective on OSAR based on information bottleneck theory and highlights the importance of retaining both IS and CS information.


1. Open-set action recognition (OSAR) is about recognizing human actions and rejecting unknown cases that are not in the training set.
2. Current methods for OSAR focus on uncertainty scores but ignore the importance of feature representations.
3. The authors propose a new framework called Prototypical Similarity Learning (PSL) to improve OSAR performance by enhancing instance-specific and class-specific information in feature representations.
4. Instance-specific information is unique to each sample within a class, while class-specific information is used for inter-class recognition.
5. PSL encourages instances to have a similarity below 1 with their corresponding prototypes to retain more instance-specific information, and it introduces video shuffling to extract distinct temporal information and enlarge class-specific information.

Definitions
- Open-set action recognition: recognizing human actions while rejecting unknown cases outside the training set
- Feature representations: mathematical descriptions of features that help recognize actions
- Prototypical Similarity Learning: a framework that enhances instance-specific and class-specific information in feature representations
- Instance-specific information: unique characteristics of each sample within a class
- Class-specific information: characteristics shared by all samples in a class

The Problem of Open-Set Action Recognition (OSAR)

Open-set action recognition (OSAR) is a challenging problem in computer vision that involves recognizing human actions from videos while also rejecting unknown cases that fall outside the distribution of the training set. Existing methods for OSAR focus on learning better uncertainty scores, but they often overlook the importance of feature representations. This paper proposes a novel Prototypical Similarity Learning (PSL) framework to enlarge instance-specific (IS) and class-specific (CS) information in feature representations for better OSAR performance.
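To make the rejection step concrete, here is a minimal sketch of open-set prediction by thresholding an uncertainty score. It is not the paper's method: the uncertainty used here (one minus the maximum softmax probability), the function names, and the threshold value are all illustrative assumptions.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_open_set(logits, threshold=0.5):
    """Return the predicted class index, or -1 for "unknown".

    Assumption: uncertainty = 1 - max softmax probability. This is only
    one simple choice of uncertainty score, used here for illustration.
    """
    probs = softmax(logits)
    confidence = max(probs)
    uncertainty = 1.0 - confidence
    if uncertainty > threshold:
        return -1  # reject: treated as outside the training distribution
    return probs.index(confidence)


# A peaked prediction is accepted; a nearly uniform one is rejected.
confident = predict_open_set([5.0, 0.0, 0.0])
ambiguous = predict_open_set([0.1, 0.0, 0.05])
```

The summary's point is that, for a fixed score like this one, the quality of the features that produce the logits still determines how well known and unknown samples separate.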

Analyzing Feature Representations with Information Bottleneck Theory

The authors begin by analyzing the behavior of feature representations in the open-set problem using the information bottleneck theory. They divide the information contained in features into IS and CS categories. CS information is used for inter-class recognition, while IS information is unique to each sample within a class. Both types of information are crucial for OSAR performance.
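For reference, the standard information bottleneck objective trades compression of the input $X$ against prediction of the label $Y$ through a representation $Z$. This is the general textbook formulation, not an equation reproduced from this paper; the symbols $Z$ and $\beta$ are the conventional ones and are assumptions with respect to this summary.

```latex
\min_{p(z \mid x)} \; I(X; Z) \;-\; \beta \, I(Z; Y), \qquad \beta > 0
```

One way to read the summary against this objective, offered as an interpretation only: the term $I(Z; Y)$ rewards label-relevant (CS-like) information, while minimizing $I(X; Z)$ tends to discard what is unique to each sample (IS-like information), which is the information PSL aims to retain.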

Enlarging Instance-Specific Information

To enlarge IS information, PSL encourages each instance to have a similarity below 1 with its class prototype, which keeps the variance among instances of the same class and retains more IS information in the learned feature representations.
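The idea of a similarity target below 1 can be sketched as follows. This is a hypothetical illustration, not the loss defined in the paper: the squared-gap form, the function names, and the target value 0.8 are all assumptions chosen for clarity.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_u * norm_v)


def prototype_similarity_loss(feature, prototype, target=0.8):
    """Squared gap between the feature-prototype similarity and `target`.

    Assumption: with target < 1, a feature that collapses exactly onto its
    prototype is penalized, so some per-instance variation is kept.
    """
    sim = cosine_similarity(feature, prototype)
    return (sim - target) ** 2


prototype = [1.0, 0.0]
collapsed = prototype_similarity_loss([1.0, 0.0], prototype)  # similarity 1
spread = prototype_similarity_loss([0.8, 0.6], prototype)     # similarity 0.8
```

Under this sketch the collapsed feature incurs a higher loss than the one that sits near the prototype without coinciding with it, which is the behavior the summary describes.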

Video Shuffling

The authors also introduce video shuffling into PSL to reduce misclassification of out-of-distribution (OoD) videos that share similar appearances with in-distribution (InD) videos. Shuffled videos are encouraged to have a similarity below 1 with the original samples, which pushes the network to extract the temporal information that distinguishes them and enlarges CS information.
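A minimal sketch of the shuffling step, assuming a video is represented as a list of frames: the shuffled clip keeps exactly the same frames (the same appearance) but in a different temporal order. The function name and the retry-until-different behavior are assumptions for illustration, not details taken from the paper.

```python
import random


def shuffle_frames(frames, seed=None):
    """Return a copy of `frames` in a different temporal order.

    The input list is not modified. Clips with fewer than two frames
    cannot be reordered and are returned as a plain copy.
    """
    indices = list(range(len(frames)))
    if len(indices) < 2:
        return list(frames)
    rng = random.Random(seed)
    order = indices[:]
    # Reshuffle until the order actually differs from the original.
    while order == indices:
        rng.shuffle(order)
    return [frames[i] for i in order]


clip = ["frame0", "frame1", "frame2", "frame3"]
shuffled_clip = shuffle_frames(clip, seed=0)
```

Because the original and shuffled clips differ only in frame order, a network that must assign them a similarity below 1 cannot rely on appearance alone and has to use temporal cues.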

Experimental Results

Experiments demonstrate that PSL significantly boosts both open-set and closed-set performance on multiple benchmarks, achieving state-of-the-art results. The proposed framework provides a novel perspective on analyzing OSAR tasks based on the information bottleneck theory and highlights the importance of retaining both IS and CS information for optimal performance.

Created on 09 Apr. 2023


