Multivariate outlier detection based on a robust Mahalanobis distance with shrinkage estimators

AI-generated keywords: Robust Mahalanobis distances Shrinkage Outlier Detection Heavy-tailed Distributions F-score

AI-generated Key Points

  • Proposal of robust Mahalanobis distances for multivariate outlier detection based on shrinkage
  • Optimal estimation of robust intensity and scaling factors to define the shrinkage
  • Investigation of properties such as affine equivariance and breakdown value
  • Comparison to other techniques through simulation studies and a real dataset, demonstrating high correct detection rates and low false detection rates in the vast majority of cases
  • Significantly smaller computation time compared to other methods
  • Appropriateness when deviating from normality assumptions, including heavy-tailed or skewed distributions
  • Introduction of a new evaluation metric, F-score, which combines precision and recall measures
  • Comprehensive approach to multivariate outlier detection using robust Mahalanobis distances with shrinkage estimators
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Elisa Cabana, Rosa E. Lillo, Henry Laniado

Stat Papers (2019)
License: CC BY-NC-SA 4.0

Abstract: A collection of robust Mahalanobis distances for multivariate outlier detection is proposed, based on the notion of shrinkage. Robust intensity and scaling factors are optimally estimated to define the shrinkage. Some properties are investigated, such as affine equivariance and breakdown value. The performance of the proposal is illustrated through the comparison to other techniques from the literature, in a simulation study and with a real dataset. The behavior when the underlying distribution is heavy-tailed or skewed, shows the appropriateness of the method when we deviate from the common assumption of normality. The resulting high correct detection rates and low false detection rates in the vast majority of cases, as well as the significantly smaller computation time shows the advantages of our proposal.

Submitted to arXiv on 04 Apr. 2019

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 1904.02596v1

This paper proposes a collection of robust Mahalanobis distances for multivariate outlier detection based on the concept of shrinkage. The authors optimally estimate robust intensity and scaling factors to define the shrinkage and investigate properties such as affine equivariance and breakdown value. The proposed method is compared to other techniques from the literature through simulation studies and a real dataset, demonstrating its high correct detection rates and low false detection rates in the vast majority of cases. Additionally, the significantly smaller computation time highlights the advantages of this proposal. The study also explores how the proposed method behaves when underlying distributions are heavy-tailed or skewed, showing its appropriateness when deviating from normality assumptions. Furthermore, the authors introduce a new evaluation metric, F-score which combines precision and recall measures. Overall, this paper presents a comprehensive approach to multivariate outlier detection using robust Mahalanobis distances with shrinkage estimators. The results demonstrate its effectiveness in detecting outliers while being computationally efficient and appropriate for non-normal distributions.
Created on 02 Jun. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.