MUSER: MUltimodal Stress Detection using Emotion Recognition as an Auxiliary Task
AI-generated Key Points
- Automatic detection of human stress is crucial for AI agents involved in affective computing and human-computer interaction.
- Stress and emotion are both human affective states, with stress having significant implications on the regulation and expression of emotion.
- MUSER is a transformer-based model architecture and a novel multi-task learning algorithm with speed-based dynamic sampling strategy that explores the inter-dependence between stress and emotion.
- The method was evaluated on the Multimodal Stressed Emotion (MuSE) dataset, which includes both stress and emotion labels, making it an ideal benchmark for an in-depth analysis of their inter-dependence.
- MUSER makes four main contributions: demonstrating the inter-dependence between stress and emotion via quantitative analyses on linguistic and acoustic features; establishing a state-of-the art stress detection model with a transformer structure as well as a novel speed-based dynamic sampling strategy for multi-task learning; achieving superior results on the MuSE dataset via multi-task training with both stress and emotion labels; showing that their speed-based dynamic sampling significantly outperforms other widely used methods.
- Previous studies have explored unimodal approaches such as textual modality or acoustic features for unimodal stress detection, but multimodal features usually result in better performances.
- MUSER provides an effective solution for detecting human stress using multiple modalities.
Authors: Yiqun Yao, Michalis Papakostas, Mihai Burzo, Mohamed Abouelenien, Rada Mihalcea
Abstract: The capability to automatically detect human stress can benefit artificial intelligent agents involved in affective computing and human-computer interaction. Stress and emotion are both human affective states, and stress has proven to have important implications on the regulation and expression of emotion. Although a series of methods have been established for multimodal stress detection, limited steps have been taken to explore the underlying inter-dependence between stress and emotion. In this work, we investigate the value of emotion recognition as an auxiliary task to improve stress detection. We propose MUSER -- a transformer-based model architecture and a novel multi-task learning algorithm with speed-based dynamic sampling strategy. Evaluations on the Multimodal Stressed Emotion (MuSE) dataset show that our model is effective for stress detection with both internal and external auxiliary tasks, and achieves state-of-the-art results.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.