Data annotation is a costly task in machine learning, and researchers have proposed Active-Learning (AL) techniques to support human annotators. However, existing AL works only focus on labeling data points and overlook the importance of providing natural language explanations alongside the labels. In real-world scenarios, such as medical diagnoses, humans often require both labels and corresponding explanations simultaneously. To address this gap, this work proposes a novel AL architecture that reduces the need for human annotations by generating both labels and explanations. The proposed AL architecture incorporates an explanation-generation model that can explicitly generate natural language explanations for the prediction model. These explanations assist humans in their decision-making process in real-world applications. The framework also includes a data diversity-based AL data selection strategy that leverages the explanation annotations. To evaluate the effectiveness of the proposed approach, automated AL simulation evaluations were conducted. The results consistently showed that the data selection strategy outperformed traditional data diversity-based strategies. Additionally, a human evaluation was conducted to compare the generated explanations with those from a state-of-the-art (SOTA) explanation-generation system. The results revealed that humans preferred the explanations generated by the proposed system. The introduction highlights recent advancements in Natural Language Processing (NLP) and emphasizes the need for explainability within language models. While these models demonstrate impressive performance on various NLP tasks, their lack of faithful explainability may lead to mistrust of their predictions. Humans typically develop intermediate information as rationales to aid decision making which serves as faithful explanations. The paper also discusses limitations of the proposed framework including its applicability to other NLP tasks beyond classification datasets like e-SNLI and its reliance on a data diversity based AL selector design. Overall, this work presents an innovative approach to reduce human annotations by incorporating an explanation generation model into an AL architecture. The results demonstrate improved performance compared to traditional strategies and highlight human preference for generated explanations over existing systems.
- - Data annotation is a costly task in machine learning
- - Existing Active-Learning (AL) techniques focus on labeling data points but overlook the importance of providing natural language explanations alongside the labels
- - Proposed AL architecture generates both labels and explanations to reduce the need for human annotations
- - Explanation-generation model assists humans in decision-making process in real-world applications
- - AL data selection strategy leverages explanation annotations and outperforms traditional strategies
- - Human evaluation shows preference for explanations generated by proposed system over state-of-the-art system
- - Lack of explainability in language models may lead to mistrust of predictions
- - Proposed framework has limitations including applicability to other NLP tasks beyond classification datasets and reliance on data diversity-based AL selector design
Data annotation is when people label or mark data to help computers learn. It can be expensive.
Existing Active-Learning techniques focus on labeling data but forget to explain why they labeled it that way.
A new Active-Learning system generates labels and explanations together, so we don't need as much human help.
The explanation model helps people make decisions in real-life situations.
The new system's way of selecting data is better than the old way, according to tests by people.
Explainable Active Learning: A Novel Architecture for Generating Labels and Explanations
Data annotation is a costly task in machine learning, and researchers have proposed Active-Learning (AL) techniques to support human annotators. However, existing AL works only focus on labeling data points and overlook the importance of providing natural language explanations alongside the labels. In real-world scenarios, such as medical diagnoses, humans often require both labels and corresponding explanations simultaneously. To address this gap, this work proposes a novel AL architecture that reduces the need for human annotations by generating both labels and explanations.
Background
Recent advancements in Natural Language Processing (NLP) have enabled impressive performance on various tasks; however, their lack of faithful explainability may lead to mistrust of their predictions. Humans typically develop intermediate information as rationales to aid decision making which serves as faithful explanations. This paper aims to reduce the need for human annotations by incorporating an explanation generation model into an AL architecture that can generate both labels and corresponding natural language explanations simultaneously.
Proposed Framework
The proposed framework includes two components: an explanation-generation model that explicitly generates natural language explanations for prediction models; and a data diversity-based AL data selection strategy that leverages these generated explanation annotations. The explanation generation model uses a recurrent neural network with attention mechanism to generate textual descriptions from input sequences while preserving semantic information from the original text inputs. The data selection strategy incorporates diversity measures based on label distribution similarity between candidate samples in order to select informative samples with high confidence scores for annotation purposes.
Evaluation Results
To evaluate the effectiveness of the proposed approach, automated AL simulation evaluations were conducted using datasets e-SNLI (Explanation Sentence Natural Language Inference) corpus and MNIST handwritten digits dataset respectively . The results consistently showed that the data selection strategy outperformed traditional data diversity-based strategies across all evaluation metrics including accuracy rate, annotation cost reduction rate, etc.. Additionally, a human evaluation was conducted to compare generated explanations with those from state-of-the art (SOTA) systems like LIME (Local Interpretable Model Explanation). The results revealed that humans preferred the generated explanations over existing systems due to its better faithfulness towards input texts than SOTA approaches.
Limitations & Conclusion
This work presents an innovative approach to reduce human annotations by incorporating an explanation generation model into an AL architecture which demonstrates improved performance compared to traditional strategies and highlights human preference for generated explanations over existing systems; however there are some limitations associated with this framework such as its applicability only limited classification datasets like eSNLI rather than other NLP tasks beyond classification datasets; also it relies heavily on a data diversity based selector design which may be suboptimal when dealing with complex datasets where label distributions vary significantly among different classes or domains . Overall , this work provides promising insight into how explainable AI can be used within active learning architectures in order reduce reliance on manual annotation processes while still maintaining high levels of accuracy in prediction models .