LoRA-like Calibration for Multimodal Deception Detection using ATSFace Data

AI-generated keywords: Deception Detection

AI-generated Key Points

  • Deception detection on human videos has gained attention
  • AI models in this domain lack interpretability
  • A new attention-aware neural network model has been introduced
  • The model assesses visual, audio, and text features to identify deceptive cues
  • The model achieves 92% accuracy on a real-life trial dataset
  • The model provides insights into the underlying process by indicating attention focus in videos
  • An experiment involving university students was conducted to enrich the study
  • A calibration method inspired by Low-Rank Adaptation (LoRA) was introduced to refine deception detection accuracy
  • A new dataset called ATSFace was created, consisting of 309 video clips
  • The dataset involved posing questions and recording participants' responses using an iPhone 14 Pro in Chinese language
  • Transcripts of sample deceptive and truthful statements were obtained using an automatic speech recognition (ASR) system called CapCut
  • The research presents a comprehensive approach to deception detection on human videos.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shun-Wen Hsiao, Cheng-Yuan Sun

10 pages, 9 figures
License: CC BY 4.0

Abstract: Recently, deception detection on human videos is an eye-catching techniques and can serve lots applications. AI model in this domain demonstrates the high accuracy, but AI tends to be a non-interpretable black box. We introduce an attention-aware neural network addressing challenges inherent in video data and deception dynamics. This model, through its continuous assessment of visual, audio, and text features, pinpoints deceptive cues. We employ a multimodal fusion strategy that enhances accuracy; our approach yields a 92\% accuracy rate on a real-life trial dataset. Most important of all, the model indicates the attention focus in the videos, providing valuable insights on deception cues. Hence, our method adeptly detects deceit and elucidates the underlying process. We further enriched our study with an experiment involving students answering questions either truthfully or deceitfully, resulting in a new dataset of 309 video clips, named ATSFace. Using this, we also introduced a calibration method, which is inspired by Low-Rank Adaptation (LoRA), to refine individual-based deception detection accuracy.

Submitted to arXiv on 04 Sep. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2309.01383v1

Recently, deception detection on human videos has gained significant attention due to its potential applications. While AI models in this domain have shown high accuracy, they often lack interpretability. To address this issue, a new attention-aware neural network model has been introduced. This model tackles the challenges associated with video data and deception dynamics by continuously assessing visual, audio, and text features to identify deceptive cues. By employing a multimodal fusion strategy, the model achieves an impressive 92% accuracy rate on a real-life trial dataset. One of the key contributions of this research is that the model not only detects deceit but also provides valuable insights into the underlying process by indicating the attention focus in the videos. To enrich the study further, an experiment involving university students answering questions either truthfully or deceitfully was conducted. This resulted in a new dataset called ATSFace, which consists of 309 video clips. To refine individual-based deception detection accuracy, a calibration method inspired by Low-Rank Adaptation (LoRA) was introduced. The evaluation of this approach involved creating a new dataset called ATSFace1 through a multimodal approach to deception detection. The dataset was collected by posing general questions about school life and finance to university students who were instructed to respond truthfully initially. They were then asked to create fictitious narratives about selected topics and provide honest narratives for other topics. The experiment involved recording participants using an iPhone 14 Pro in a 1080p HD/30fps format while they responded to questions in Chinese. The resulting dataset consisted of 309 videos, with 147 being deceptive and 162 being truthful clips. The average duration of these videos was 23.32 seconds. Transcripts of sample deceptive and truthful statements from the dataset were obtained using an automatic speech recognition (ASR) system called CapCut. These transcripts comprised 35,069 words in total, averaging 113 words per transcript. Overall, this research presents a comprehensive approach to deception detection on human videos. The attention-aware neural network model, combined with the multimodal fusion strategy and calibration method, achieves high accuracy in detecting deceit while providing insights into the underlying process.
Created on 24 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.