Metric Learning for User-defined Keyword Spotting

AI-generated keywords: Metric Learning

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors aim to improve keyword spotting tasks by allowing users to define custom keywords
  • Focus on metric learning techniques for training models for user-defined keywords
  • Construct a large-scale keyword dataset and introduce a filtering method
  • Propose a novel two-stage training strategy based on metric learning techniques
  • Demonstrated significant improvements in representations of user-defined keywords and overall performance
  • Proposed unified evaluation protocol and metrics for fair comparisons in user-defined KWS field
  • System eliminates need for incremental training on new keywords and outperforms previous works on Google Speech Commands dataset
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Jaemin Jung, Youkyum Kim, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Youngjoon Jang, Joon Son Chung

Abstract: The goal of this work is to detect new spoken terms defined by users. While most previous works address Keyword Spotting (KWS) as a closed-set classification problem, this limits their transferability to unseen terms. The ability to define custom keywords has advantages in terms of user experience. In this paper, we propose a metric learning-based training strategy for user-defined keyword spotting. In particular, we make the following contributions: (1) we construct a large-scale keyword dataset with an existing speech corpus and propose a filtering method to remove data that degrade model training; (2) we propose a metric learning-based two-stage training strategy, and demonstrate that the proposed method improves the performance on the user-defined keyword spotting task by enriching their representations; (3) to facilitate the fair comparison in the user-defined KWS field, we propose unified evaluation protocol and metrics. Our proposed system does not require an incremental training on the user-defined keywords, and outperforms previous works by a significant margin on the Google Speech Commands dataset using the proposed as well as the existing metrics.

Submitted to arXiv on 01 Nov. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2211.00439v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , In their paper titled "Metric Learning for User-defined Keyword Spotting," authors Jaemin Jung, Youkyum Kim, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Youngjoon Jang, and Joon Son Chung aim to improve the performance of keyword spotting tasks by allowing users to define custom keywords. This approach not only enriches user experience but also enhances the transferability to unseen terms. Unlike previous works that treat Keyword Spotting (KWS) as a closed-set classification problem, this study focuses on using metric learning techniques to train models for user-defined keywords. The authors first construct a large-scale keyword dataset using an existing speech corpus and introduce a filtering method to eliminate data that may hinder model training. Then, they propose a novel two-stage training strategy based on metric learning techniques. Through experiments, they demonstrate that this approach significantly improves the representations of user-defined keywords and boosts overall performance. To ensure fair comparisons in the field of user-defined KWS, the authors also propose a unified evaluation protocol and metrics. Their system eliminates the need for incremental training on new keywords and outperforms previous works by a significant margin on the Google Speech Commands dataset using both proposed and existing metrics. Overall, this study provides valuable insights into improving user-defined keyword spotting through metric learning techniques and sets a benchmark for future research in this domain.
Created on 19 Aug. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.