This study explores the gendered aspects of mental health stigma in masked language models (MLMs) and how these models generate words related to mental health based on gender. The researchers develop a framework using clinical psychology literature to measure mental health stigma and curate prompts for MLMs. Their findings reveal that the models consistently predict female subjects more than male subjects when discussing mental health conditions and seeking treatment. Additionally, the models associate stereotypes such as anger, blame, and pity more with women than men who have mental health conditions. This study highlights the complex nuances of gendered mental health stigma captured by MLMs and emphasizes the importance of considering context and overlapping dimensions of identity when assessing social biases in computational models.
The authors acknowledge some limitations of their work, including relying on interpretations of black-box models rather than using modern interpretability methods to identify specific aspects responsible for generating gendered words. They also caution against using their framework as an off-the-shelf metric to evaluate models in practice since it is a preliminary exploration rather than a benchmarking tool. Furthermore, this study does not examine the concrete impacts of model behaviors in real-world applications or measure their harmfulness in the lived experiences of affected individuals. Moreover, there are limitations related to nonbinary and genderqueer identities as well as potential bias from English Wikipedia data used to derive gender associations. The set of manually curated prompts used in this study is also limited in size and may contain artifacts from the curation process or psychology literature it was based on. Additionally, these prompts were derived from a survey conducted in standard American English which may not accurately represent stigma in other languages or cultures.
In conclusion, while this study provides valuable insights into how gender influences mental health stigma captured by MLMs, further research is needed to fully understand its impact in real-world applications and consider diverse gender identities and cultural contexts. The findings of this study highlight the importance of addressing social biases in computational models and considering the broader implications of their outputs in specific social domains, such as mental health.
- Gendered aspects of mental health stigma in masked language models (MLMs) are explored
- MLMs consistently predict female subjects more than male subjects when discussing mental health conditions and seeking treatment
- Stereotypes such as anger, blame, and pity are associated more with women than men who have mental health conditions
- Context and overlapping dimensions of identity should be considered when assessing social biases in computational models
- Limitations include reliance on interpretations of black-box models instead of modern interpretability methods, caution against using the framework as an off-the-shelf metric, lack of examination of concrete impacts in real-world applications, potential bias from English Wikipedia data used to derive gender associations, limitations related to nonbinary and genderqueer identities, limited size and artifacts in the set of curated prompts used in the study, and potential inaccuracies in representing stigma in other languages or cultures.
- Further research is needed to fully understand the impact of gender on mental health stigma captured by MLMs, consider diverse gender identities and cultural contexts, address social biases in computational models, and evaluate the broader implications of their outputs.
Key points
1. Some computer programs called masked language models (MLMs) can predict words in sentences about mental health conditions.
2. These MLMs often talk more about women than men when discussing mental health conditions and seeking treatment.
3. The models tend to associate negative stereotypes like anger, blame, and pity more with women who have mental health conditions than with men.
4. When studying social biases in these computer models, it's important to consider the context and other aspects of a person's identity.
5. There are some limitations to this study, such as relying on interpretations of the computer models instead of using modern interpretability methods, not examining real-world impacts, and potential bias from using English Wikipedia data.
Definitions
- Gendered: Related to differences between males and females.
- Mental health stigma: Negative attitudes or beliefs towards people with mental health conditions.
- Masked language models (MLMs): Computer programs that predict hidden (masked) words in a piece of text based on patterns they learn from large amounts of data.
- Stereotypes: Fixed ideas or beliefs about a particular group of people that may not be true for everyone in that group.
- Social biases: Unfair preferences or prejudices towards certain groups of people based on their social characteristics or identities.
- Interpretations: Understanding or explaining something based on personal judgment or analysis.
- Real-world applications: Using something in practical situations outside of a controlled environment like a laboratory or experiment.
Introduction
Mental health stigma is a pervasive issue that affects individuals of all genders, races, and backgrounds. It refers to the negative attitudes and beliefs surrounding mental health conditions, which can lead to discrimination, social exclusion, and barriers to seeking treatment. In recent years, there has been a growing concern that technology perpetuates or reinforces societal biases and stigmas. One area of focus has been masked language models (MLMs), which are artificial intelligence systems trained on large datasets to predict masked (hidden) words in text.
In this study, researchers explore the gendered aspects of mental health stigma captured by MLMs and how these models generate words related to mental health based on gender. By developing a framework using clinical psychology literature as a guide for measuring mental health stigma and curating prompts for MLMs, they aim to shed light on the complex nuances of gendered mental health stigma in computational models.
The Study
The study begins by discussing previous research that has examined social biases in natural language processing (NLP) models. While some studies have focused on racial or ethnic biases in NLP systems, few have explored gender-based biases specifically related to mental health stigma.
To address this gap, the researchers developed a framework based on clinical psychology literature that measures three dimensions of mental health stigma: stereotypes (e.g., anger or pity), blame (e.g., attributing responsibility for their condition), and help-seeking attitudes (e.g., willingness to seek treatment). They then curated 50 prompts for MLMs using these dimensions as guidelines.
The researchers prompted two popular models, GPT-3 and CTRL, with these prompts and derived the gender associations of the generated words from English Wikipedia data. They also conducted an online survey with 200 participants from diverse backgrounds who were asked to rate each prompt's perceived level of stigmatization towards men or women with different mental health conditions.
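As a minimal sketch of how such probing could be scored, assume a model returns a probability for each candidate word that fills the subject slot of a prompt. The word lists `FEMALE_WORDS` and `MALE_WORDS`, the function `gender_mass`, the prompt wording, and the example probabilities are all hypothetical illustrations, not the study's lexicon, code, or results.

```python
from typing import Dict, Tuple

# Hypothetical gendered word lists. The study derives gender associations
# from English Wikipedia data, which this sketch does not reproduce.
FEMALE_WORDS = {"she", "woman", "mother", "sister", "girl"}
MALE_WORDS = {"he", "man", "father", "brother", "boy"}


def gender_mass(predictions: Dict[str, float]) -> Tuple[float, float]:
    """Sum the predicted probability assigned to female and to male words."""
    female = sum(p for w, p in predictions.items() if w.lower() in FEMALE_WORDS)
    male = sum(p for w, p in predictions.items() if w.lower() in MALE_WORDS)
    return female, male


# Made-up probabilities for the masked subject of a prompt such as
# "<mask> has depression." (illustrative numbers only).
example = {"she": 0.30, "he": 0.20, "woman": 0.05, "everyone": 0.10}
female, male = gender_mass(example)
print(f"female={female:.2f} male={male:.2f}")
```

Words outside both lists (here "everyone") are ignored, so the two sums need not add up to 1. A prompt counts as female-leaning in this sketch when the first value exceeds the second.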
Findings
The results of the study revealed that both GPT-3 and CTRL consistently predicted female subjects more than male subjects when discussing mental health conditions and seeking treatment. This finding aligns with previous research that has shown women are more likely to seek help for mental health issues, while men tend to underreport or avoid seeking treatment due to societal expectations of masculinity.
Additionally, the models associated stereotypes such as anger, blame, and pity more with women than men who have mental health conditions. This suggests that MLMs may perpetuate harmful gender stereotypes surrounding mental health.
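One way to summarize a pattern like this, sketched under assumptions, is to average the female-minus-male probability gap over all prompts that target the same stereotype. The function `mean_gap_by_dimension`, the row layout, and the numbers below are hypothetical and are not taken from the study.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, Iterable, List, Tuple


def mean_gap_by_dimension(
    rows: Iterable[Tuple[str, float, float]]
) -> Dict[str, float]:
    """Average (female - male) probability gap for each stigma dimension.

    Each row is (dimension, female_probability, male_probability) for one
    prompt. A positive mean gap means the dimension leans female.
    """
    gaps: Dict[str, List[float]] = defaultdict(list)
    for dimension, female, male in rows:
        gaps[dimension].append(female - male)
    return {dimension: mean(values) for dimension, values in gaps.items()}


# Made-up per-prompt scores (illustrative numbers only).
rows = [
    ("anger", 0.40, 0.30),
    ("anger", 0.35, 0.25),
    ("pity", 0.30, 0.30),
]
summary = mean_gap_by_dimension(rows)
for dimension, gap in summary.items():
    print(f"{dimension}: {gap:+.2f}")
```

A mean gap alone does not show that a difference is reliable. With a small prompt set, a significance test or confidence interval over the per-prompt gaps would be needed before drawing conclusions.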
Limitations
While this study provides valuable insights into how gender influences mental health stigma captured by MLMs, it also has some limitations. One major limitation is its reliance on interpreting the outputs of black-box models rather than using modern interpretability methods to identify the specific aspects responsible for generating gendered words. This approach limits our understanding of why these biases exist in MLMs and how they can be addressed.
Furthermore, the study's framework should not be used as an off-the-shelf metric for evaluating models in practice since it is a preliminary exploration rather than a benchmarking tool. Additionally, the study does not examine the concrete impacts of model behaviors in real-world applications or measure their harmfulness in the lived experiences of affected individuals.
Moreover, there are limitations related to nonbinary and genderqueer identities as well as potential bias from English Wikipedia data used to derive gender associations. The set of manually curated prompts used in this study is also limited in size and may contain artifacts from the curation process or the psychology literature it was based on. Finally, these prompts were derived from a survey conducted in standard American English, which may not accurately represent stigma in other languages or cultures.
Conclusion
In conclusion, this study highlights important considerations regarding social biases present in computational models and their outputs related to mental health stigma. While the findings provide valuable insights into how gender influences mental health stigma captured by MLMs, further research is needed to fully understand its impact in real-world applications and consider diverse gender identities and cultural contexts.
The study also emphasizes the importance of addressing social biases in computational models and considering the broader implications of their outputs in specific social domains, such as mental health. As technology continues to advance, it is crucial to actively work towards creating more inclusive and unbiased systems that do not perpetuate harmful stereotypes or stigmas.