Convolutional Visual Prompt for Robust Visual Perception

AI-generated keywords: Test-time adaptation Convolutional Visual Prompts Out-of-distribution Domain Generalization Robustness

AI-generated Key Points

  • Vision models are vulnerable to out-of-distribution (OOD) samples and existing methods for adapting these models have limitations.
  • Convolutional visual prompts (CVP) is introduced as a new approach for label-free test-time adaptation in visual perception tasks.
  • Visual prompts offer lightweight input-space adaptation but are prone to overfitting without labels.
  • CVP has a structured nature that requires fewer trainable parameters, reducing the risk of overfitting.
  • Extensive experiments show that CVP significantly improves robustness by up to 5.87% compared to large-scale models.
  • The paper also provides a comprehensive review of related work in domain generalization and test-time adaptation.
  • CVP differs from previous approaches by focusing on adapting models with OOD data without updating weights.
  • CVP is presented as an effective solution for label-free test-time adaptation in robust visual perception tasks, with superior performance over existing large-scale models.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yun-Yun Tsai, Chengzhi Mao, Junfeng Yang

License: CC BY 4.0

Abstract: Vision models are often vulnerable to out-of-distribution (OOD) samples without adapting. While visual prompts offer a lightweight method of input-space adaptation for large-scale vision models, they rely on a high-dimensional additive vector and labeled data. This leads to overfitting when adapting models in a self-supervised test-time setting without labels. We introduce convolutional visual prompts (CVP) for label-free test-time adaptation for robust visual perception. The structured nature of CVP demands fewer trainable parameters, less than 1\% compared to standard visual prompts, combating overfitting. Extensive experiments and analysis on a wide variety of OOD visual perception tasks show that our approach is effective, improving robustness by up to 5.87% over several large-scale models.

Submitted to arXiv on 01 Mar. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2303.00198v2

The paper discusses the vulnerability of vision models to out-of-distribution (OOD) samples and the limitations of existing methods for adapting these models. It introduces a new approach called convolutional visual prompts (CVP) for label-free test-time adaptation, which aims to improve robustness in visual perception tasks. The authors highlight that visual prompts offer a lightweight method of input-space adaptation for large-scale vision models but are prone to overfitting when used in a self-supervised test-time setting without labels. To address this issue, they propose CVP, which has a structured nature that requires fewer trainable parameters compared to standard visual prompts, reducing the risk of overfitting. To evaluate the effectiveness of their approach, the authors conduct extensive experiments and analysis on various OOD visual perception tasks. The results show that CVP significantly improves robustness by up to 5.87% compared to several large-scale models. In addition to introducing CVP, the paper also provides a comprehensive review of related work in domain generalization and test-time adaptation. It discusses previous approaches such as domain generalization techniques and test-time adaptation methods that update model weights or utilize auxiliary self-supervision models. The authors emphasize that their work differs from these approaches as it focuses on adapting models with OOD data without updating the weights. Overall, the paper presents convolutional visual prompts as an effective solution for label-free test-time adaptation in robust visual perception tasks. The structured nature of CVP reduces overfitting and improves model performance on OOD samples. The experimental results demonstrate its superiority over existing large-scale models, highlighting its potential for practical applications in real world scenarios.
Created on 17 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.