Trustworthy Social Bias Measurement

AI-generated keywords: Trustworthy Social Bias Measurement DivDist Validation

AI-generated Key Points

  • The paper addresses the challenge of designing reliable measures of social bias
  • Previous measures of social bias have not gained widespread trust
  • The authors propose a new approach to bias measurement based on measurement modeling theory
  • They introduce a general bias measurement framework called DivDist
  • The framework includes five concrete bias measures and a rigorous testing protocol with eight criteria
  • The authors provide evidence supporting the trustworthiness of their measures and address deficiencies in previous measures
  • The paper discusses related work in the field, including qualitative characterization of social bias and lack of adoption of quantitative measures in NLP datasets
  • Overall, the paper presents a comprehensive approach to designing trustworthy measures of social bias in NLP datasets and models.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Rishi Bommasani, Percy Liang

License: CC BY 4.0

Abstract: How do we design measures of social bias that we trust? While prior work has introduced several measures, no measure has gained widespread trust: instead, mounting evidence argues we should distrust these measures. In this work, we design bias measures that warrant trust based on the cross-disciplinary theory of measurement modeling. To combat the frequently fuzzy treatment of social bias in NLP, we explicitly define social bias, grounded in principles drawn from social science research. We operationalize our definition by proposing a general bias measurement framework DivDist, which we use to instantiate 5 concrete bias measures. To validate our measures, we propose a rigorous testing protocol with 8 testing criteria (e.g. predictive validity: do measures predict biases in US employment?). Through our testing, we demonstrate considerable evidence to trust our measures, showing they overcome conceptual, technical, and empirical deficiencies present in prior measures.

Submitted to arXiv on 20 Dec. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2212.11672v1

The paper titled "Trustworthy Social Bias Measurement" addresses the challenge of designing reliable measures of social bias. While previous research has introduced various measures, none have gained widespread trust, leading to doubts about their effectiveness. In this work, the authors propose a new approach to bias measurement based on the cross-disciplinary theory of measurement modeling. They aim to overcome the vague treatment of social bias in natural language processing (NLP) by providing an explicit definition grounded in principles from social science research. To operationalize their definition, the authors introduce a general bias measurement framework called DivDist. This framework serves as a basis for implementing five concrete bias measures and a rigorous testing protocol consisting of eight criteria including predictive validity. The authors demonstrate considerable evidence supporting the trustworthiness of their measures and show that they address conceptual, technical and empirical deficiencies present in previous measures. In addition to addressing existing gaps in social bias measurement, this paper also discusses related work in the field. It highlights the qualitative characterization of social bias across various disciplines in the social sciences and notes that quantitative measures proposed thus far have not been widely adopted for measuring bias in NLP datasets despite their significant role text corpora play in language models and growing interest in dataset documentation and governance. Overall, this paper presents a comprehensive approach to designing trustworthy measures of social bias. By explicitly defining social bias and developing a robust measurement framework, the authors contribute to advancing research on mitigating biases in NLP datasets and models.
Created on 25 Sep. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.