Finding Good Representations of Emotions for Text Classification

AI-generated keywords: Emotion Human-Machine Communication Representations Bias Classification

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Study focuses on the importance of accurately interpreting human emotions for effective human-machine communication
  • Emotion plays a crucial role in human-to-human interactions
  • Representing emotions in text poses a challenge in natural language processing (NLP)
  • Continuous vector representations like word2vec fail to consider emotions and may exhibit bias towards certain identities
  • Proposed approach includes emotional word vectors (EVEC) learned from a convolutional neural network model using an emotion-labeled corpus constructed with hashtags
  • Sentence-level representations are learned by leveraging a large corpus of texts through the pseudo task of recognizing emojis
  • Improved representations, trained on millions of tweets with weak labels such as hashtags and emojis, make sentiment/emotion analysis more effective
  • Considering emotions within text helps build more robust machine learning models for affect-related text classification tasks like sentiment/emotion analysis and abusive language detection
  • Study explores model bias in existing approaches, specifically addressing gender bias in various neural network models through experiments aimed at measuring and reducing biases in representations
  • Research contributes to building more inclusive and robust classification models
  • Emphasizes the need for accurate representation of emotions in NLP tasks to facilitate better human-machine communication
  • Proposed methods offer promising avenues for improving affect-related text classification tasks while addressing biases present in current approaches

Authors: Ji Ho Park

HKUST MPhil Thesis, 2018, 87 pages

Abstract: It is important for machines to interpret human emotions properly for better human-machine communication, as emotion is an essential part of human-to-human communication. One aspect of emotion is reflected in the language we use. How to represent emotions in text is a challenge in natural language processing (NLP). Although continuous vector representations like word2vec have become the new norm for NLP problems, they have two limitations: they do not take emotions into consideration, and they can unintentionally contain bias toward certain identities, such as different genders. This thesis focuses on improving existing representations at both the word and sentence levels by explicitly taking emotions inside text and model bias into account in their training process. Our improved representations can help to build more robust machine learning models for affect-related text classification like sentiment/emotion analysis and abusive language detection. We first propose representations called emotional word vectors (EVEC), which are learned from a convolutional neural network model with an emotion-labeled corpus constructed using hashtags. Second, we extend to learning sentence-level representations from a huge corpus of texts with the pseudo task of recognizing emojis. Our results show that, with the representations trained from millions of tweets with weakly supervised labels such as hashtags and emojis, we can solve sentiment/emotion analysis tasks more effectively. Lastly, as an example of model bias in the representations of existing approaches, we explore the specific problem of automatic detection of abusive language. We address the issue of gender bias in various neural network models by conducting experiments to measure and reduce those biases in the representations in order to build more robust classification models.
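The abstract mentions weakly supervised labels derived from hashtags and emojis. As a rough illustration of that idea, the Python sketch below assigns an emotion label to a tweet from a trailing marker and strips the marker from the text. The marker-to-label maps, the function name `weak_label`, and the rule of discarding ambiguous tweets are all assumptions made for this example; the metadata does not describe how the thesis actually constructs its corpus.

```python
import re
from typing import Optional, Tuple

# Hypothetical marker-to-label maps; the thesis's real label sets are not given here.
HASHTAG_LABELS = {"#joy": "joy", "#happy": "joy", "#sad": "sadness", "#angry": "anger"}
EMOJI_LABELS = {"😂": "joy", "😢": "sadness", "😡": "anger"}

HASHTAG_RE = re.compile(r"#\w+")


def weak_label(text: str) -> Optional[Tuple[str, str]]:
    """Return (cleaned_text, label), or None when no single label can be inferred."""
    labels = set()
    for tag in HASHTAG_RE.findall(text):
        if tag.lower() in HASHTAG_LABELS:
            labels.add(HASHTAG_LABELS[tag.lower()])
    for emoji, label in EMOJI_LABELS.items():
        if emoji in text:
            labels.add(label)

    # Assumption: skip tweets with no marker or with conflicting markers.
    if len(labels) != 1:
        return None

    # Remove the markers so a model cannot read the label directly off the input.
    def drop_known(match: "re.Match[str]") -> str:
        return "" if match.group(0).lower() in HASHTAG_LABELS else match.group(0)

    cleaned = HASHTAG_RE.sub(drop_known, text)
    for emoji in EMOJI_LABELS:
        cleaned = cleaned.replace(emoji, "")
    cleaned = " ".join(cleaned.split())  # collapse leftover whitespace
    return cleaned, labels.pop()
```

Pairs produced this way could serve as noisy training data for a classifier, whose learned embeddings are then reused as representations; that downstream step is not shown here.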

Submitted to arXiv on 22 Aug. 2018

Results of the summarizing process for the arXiv paper: 1808.07235v1

The study focuses on the importance of accurately interpreting human emotions for effective human-machine communication. Emotion plays a crucial role in human-to-human interactions, and representing emotions in text remains a challenge in natural language processing (NLP). While continuous vector representations like word2vec have become the norm in NLP, they do not take emotions into account and may unintentionally encode bias toward certain identities, such as different genders. To address this, the thesis improves existing representations at both the word and sentence levels by explicitly accounting for emotions in text and for model bias during training. The proposed approach includes emotional word vectors (EVEC), which are learned from a convolutional neural network model using an emotion-labeled corpus constructed with hashtags. Sentence-level representations are then learned from a large corpus of texts through the pseudo task of recognizing emojis. The results show that these representations, trained on millions of tweets with weakly supervised labels such as hashtags and emojis, solve sentiment/emotion analysis tasks more effectively. By considering emotions within text, more robust machine learning models can be built for affect-related text classification tasks like sentiment/emotion analysis and abusive language detection.

Furthermore, the study explores model bias in existing approaches through the specific problem of automatic abusive language detection. It addresses gender bias in various neural network models through experiments that measure and reduce biases in the representations, with the aim of building more inclusive and robust classification models. Overall, the thesis emphasizes the need for accurate representation of emotions in NLP tasks to facilitate better human-machine communication, and its methods offer promising avenues for improving affect-related text classification while addressing biases present in current approaches.
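The summary says gender bias is measured in abusive language classifiers but does not say how. One common way to probe such bias, shown here only as an assumed illustration and not as the thesis's method, is to fill non-abusive sentence templates with different identity terms and compare the false positive rate per group. The templates, identity terms, and the deliberately biased `toy_biased_classifier` below are all hypothetical.

```python
from typing import Callable, Dict, List

# Hypothetical non-abusive templates and identity terms (assumptions for this sketch).
TEMPLATES = ["{} are great engineers", "I met some {} today", "those {} live next door"]
IDENTITY_TERMS = {"female": ["women", "girls"], "male": ["men", "boys"]}


def false_positive_rate(predict: Callable[[str], bool], sentences: List[str]) -> float:
    """Share of non-abusive sentences that the classifier wrongly flags as abusive."""
    flagged = sum(1 for sentence in sentences if predict(sentence))
    return flagged / len(sentences)


def fpr_by_group(predict: Callable[[str], bool]) -> Dict[str, float]:
    """False positive rate on the template set, computed separately per identity group."""
    rates = {}
    for group, terms in IDENTITY_TERMS.items():
        sentences = [template.format(term) for template in TEMPLATES for term in terms]
        rates[group] = false_positive_rate(predict, sentences)
    return rates


def fpr_gap(rates: Dict[str, float]) -> float:
    """Spread between the most and least penalized group; 0.0 means equal rates."""
    return max(rates.values()) - min(rates.values())


def toy_biased_classifier(sentence: str) -> bool:
    """Stand-in for a trained model: flags any sentence containing the token 'women'."""
    return "women" in sentence.split()


rates = fpr_by_group(toy_biased_classifier)
gap = fpr_gap(rates)
```

In practice `predict` would wrap a trained model, and a bias-reduction step would be judged by whether it shrinks the gap without hurting accuracy on real data.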
Created on 07 Oct. 2023
