LoRA-like Calibration for Multimodal Deception Detection using ATSFace Data

AI-generated keywords: Deception Detection

AI-generated Key Points

Deception detection on human videos has gained attention
AI models in this domain lack interpretability
A new attention-aware neural network model has been introduced
The model assesses visual, audio, and text features to identify deceptive cues
The model achieves 92% accuracy on a real-life trial dataset
The model provides insights into the underlying process by indicating attention focus in videos
An experiment involving university students was conducted to enrich the study
A calibration method inspired by Low-Rank Adaptation (LoRA) was introduced to refine deception detection accuracy
A new dataset called ATSFace was created, consisting of 309 video clips
The dataset involved posing questions and recording participants' responses using an iPhone 14 Pro in Chinese language
Transcripts of sample deceptive and truthful statements were obtained using an automatic speech recognition (ASR) system called CapCut
The research presents a comprehensive approach to deception detection on human videos.

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shun-Wen Hsiao, Cheng-Yuan Sun

arXiv: 2309.01383v1 - DOI (cs.CV)

10 pages, 9 figures

License: CC BY 4.0

Abstract: Recently, deception detection on human videos is an eye-catching techniques and can serve lots applications. AI model in this domain demonstrates the high accuracy, but AI tends to be a non-interpretable black box. We introduce an attention-aware neural network addressing challenges inherent in video data and deception dynamics. This model, through its continuous assessment of visual, audio, and text features, pinpoints deceptive cues. We employ a multimodal fusion strategy that enhances accuracy; our approach yields a 92\% accuracy rate on a real-life trial dataset. Most important of all, the model indicates the attention focus in the videos, providing valuable insights on deception cues. Hence, our method adeptly detects deceit and elucidates the underlying process. We further enriched our study with an experiment involving students answering questions either truthfully or deceitfully, resulting in a new dataset of 309 video clips, named ATSFace. Using this, we also introduced a calibration method, which is inspired by Low-Rank Adaptation (LoRA), to refine individual-based deception detection accuracy.

Submitted to arXiv on 04 Sep. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2309.01383v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

Recently, deception detection on human videos has gained significant attention due to its potential applications. While AI models in this domain have shown high accuracy, they often lack interpretability. To address this issue, a new attention-aware neural network model has been introduced. This model tackles the challenges associated with video data and deception dynamics by continuously assessing visual, audio, and text features to identify deceptive cues. By employing a multimodal fusion strategy, the model achieves an impressive 92% accuracy rate on a real-life trial dataset. One of the key contributions of this research is that the model not only detects deceit but also provides valuable insights into the underlying process by indicating the attention focus in the videos. To enrich the study further, an experiment involving university students answering questions either truthfully or deceitfully was conducted. This resulted in a new dataset called ATSFace, which consists of 309 video clips. To refine individual-based deception detection accuracy, a calibration method inspired by Low-Rank Adaptation (LoRA) was introduced. The evaluation of this approach involved creating a new dataset called ATSFace1 through a multimodal approach to deception detection. The dataset was collected by posing general questions about school life and finance to university students who were instructed to respond truthfully initially. They were then asked to create fictitious narratives about selected topics and provide honest narratives for other topics. The experiment involved recording participants using an iPhone 14 Pro in a 1080p HD/30fps format while they responded to questions in Chinese. The resulting dataset consisted of 309 videos, with 147 being deceptive and 162 being truthful clips. The average duration of these videos was 23.32 seconds. Transcripts of sample deceptive and truthful statements from the dataset were obtained using an automatic speech recognition (ASR) system called CapCut. These transcripts comprised 35,069 words in total, averaging 113 words per transcript. Overall, this research presents a comprehensive approach to deception detection on human videos. The attention-aware neural network model, combined with the multimodal fusion strategy and calibration method, achieves high accuracy in detecting deceit while providing insights into the underlying process.

- Deception detection on human videos has gained attention
- AI models in this domain lack interpretability
- A new attention-aware neural network model has been introduced
- The model assesses visual, audio, and text features to identify deceptive cues
- The model achieves 92% accuracy on a real-life trial dataset
- The model provides insights into the underlying process by indicating attention focus in videos
- An experiment involving university students was conducted to enrich the study
- A calibration method inspired by Low-Rank Adaptation (LoRA) was introduced to refine deception detection accuracy
- A new dataset called ATSFace was created, consisting of 309 video clips
- The dataset involved posing questions and recording participants' responses using an iPhone 14 Pro in Chinese language
- Transcripts of sample deceptive and truthful statements were obtained using an automatic speech recognition (ASR) system called CapCut
- The research presents a comprehensive approach to deception detection on human videos.

Deception detection means figuring out if someone is being honest or not. AI models are computer programs that try to do this, but sometimes it's hard to understand how they work. A new model has been made that pays attention to different things in videos to find clues about deception. This model was tested on real videos and got 92% of the answers right. It also helps us understand how it works by showing what it pays attention to in the videos. The researchers did an experiment with university students to learn more. They also made a special dataset with videos and used a speech recognition system to get transcripts of what people said in the videos. This research gives us a good way to detect deception in human videos." Definitions- Deception detection: Figuring out if someone is being honest or not. - AI models: Computer programs that try to do tasks like humans. - Interpretability: Understanding how something works. - Neural network: A type of computer program inspired by the human brain. - Accuracy: How often something is correct. - Dataset: A collection of information used for study or analysis. - Transcripts: Written records of what people say. - Speech recognition system: A computer program that can understand spoken words.

Deception Detection on Human Videos: A Comprehensive Approach

Experiment Setup

To enrich the study further, an experiment involving university students answering questions either truthfully or deceitfully was conducted. This resulted in a new dataset called ATSFace, which consists of 309 video clips. To refine individual-based deception detection accuracy, a calibration method inspired by Low-Rank Adaptation (LoRA) was introduced. The evaluation of this approach involved creating a new dataset called ATSFace1 through a multimodal approach to deception detection. The dataset was collected by posing general questions about school life and finance to university students who were instructed to respond truthfully initially. They were then asked to create fictitious narratives about selected topics and provide honest narratives for other topics. The experiment involved recording participants using an iPhone 14 Pro in a 1080p HD/30fps format while they responded to questions in Chinese. The resulting dataset consisted of 309 videos, with 147 being deceptive and 162 being truthful clips. The average duration of these videos was 23

Created on 24 Nov. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

70.4%

Voting-based Multimodal Automatic Deception Detection

cs.LG

61.9%

Self Multi-Head Attention for Speaker Recognition

cs.SD

61.2%

Question Answering Survey: Directions, Challenges, Datasets, Evaluation Matri…

cs.CL

60.3%

VindLU: A Recipe for Effective Video-and-Language Pretraining

cs.CV

59.5%

TextMI: Textualize Multimodal Information for Integrating Non-verbal Cues in …

cs.CL

59.5%

Instruction Tuning for Large Language Models: A Survey

cs.CL

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.