Current Time Series Anomaly Detection Benchmarks are Flawed and are Creating the Illusion of Progress

AI-generated keywords: Time Series Anomaly Detection Benchmarks Flaws UCR Archive Progress

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Time series anomaly detection is an important topic in data science
  • Recent interest has increased due to the success of deep learning
  • Most papers in this area rely on popular benchmark datasets created by organizations such as Yahoo, Numenta, and NASA
  • Majority of individual exemplars within these datasets suffer from four flaws
  • These flaws raise concerns about the reliability of published comparisons between anomaly detection algorithms and question the true progress made in recent years
  • Wu and Keogh introduce the UCR Time Series Anomaly Archive as a resource to overcome these limitations
  • The archive will serve a similar role to the UCR Time Series Classification Archive by providing a benchmark for meaningful comparisons between different approaches and offering an accurate measure of overall progress in time series anomaly detection
  • This paper sheds light on existing flaws present in current time series anomaly detection benchmarks and emphasizes the need for more reliable evaluation methods
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Renjie Wu, Eamonn J. Keogh

38th IEEE International Conference on Data Engineering (ICDE), 2022, pp. 1479-1480
Full paper accepted by IEEE TKDE, extended abstract accepted by IEEE ICDE 2022

Abstract: Time series anomaly detection has been a perennially important topic in data science, with papers dating back to the 1950s. However, in recent years there has been an explosion of interest in this topic, much of it driven by the success of deep learning in other domains and for other time series tasks. Most of these papers test on one or more of a handful of popular benchmark datasets, created by Yahoo, Numenta, NASA, etc. In this work we make a surprising claim. The majority of the individual exemplars in these datasets suffer from one or more of four flaws. Because of these four flaws, we believe that many published comparisons of anomaly detection algorithms may be unreliable, and more importantly, much of the apparent progress in recent years may be illusionary. In addition to demonstrating these claims, with this paper we introduce the UCR Time Series Anomaly Archive. We believe that this resource will perform a similar role as the UCR Time Series Classification Archive, by providing the community with a benchmark that allows meaningful comparisons between approaches and a meaningful gauge of overall progress.

Submitted to arXiv on 29 Sep. 2020

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2009.13807v5

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Current Time Series Anomaly Detection Benchmarks are Flawed and are Creating the Illusion of Progress," authors Renjie Wu and Eamonn J. Keogh discuss the importance of time series anomaly detection in data science. They highlight that this field has been a significant topic since the 1950s, but recent interest has surged due to the success of deep learning in various domains and time series tasks. The authors point out that most papers in this area rely on a few popular benchmark datasets created by organizations such as Yahoo, Numenta, and NASA. However, they make a surprising claim that the majority of individual exemplars within these datasets suffer from four flaws which raise concerns about the reliability of published comparisons between anomaly detection algorithms and question the true progress made in recent years. To address these issues, Wu and Keogh introduce the UCR Time Series Anomaly Archive as a resource to overcome these limitations. This archive will serve a similar role to the UCR Time Series Classification Archive by providing a benchmark for meaningful comparisons between different approaches and offering an accurate measure of overall progress in time series anomaly detection. Overall, this paper sheds light on existing flaws present in current time series anomaly detection benchmarks and emphasizes the need for more reliable evaluation methods while introducing the UCR Time Series Anomaly Archive to facilitate advancements in this field.
Created on 10 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.