Text Classification via Large Language Models
AI-generated Key Points
- CARP enhances the performance of large-scale Language Models (LLMs) in text classification tasks.
- LLMs like GPT-3 underperform compared to fine-tuned models in text classification due to their lack of reasoning ability and limited number of tokens allowed in in-context learning.
- CARP adopts a progressive reasoning strategy by prompting LLMs to identify superficial clues and inducing a diagnostic reasoning process for making final decisions.
- CARP utilizes a fine-tuned model on a supervised dataset for kNN demonstration search during the in-context learning phase.
- CARP achieves new state-of-the-art performances on four out of five widely-used text-classification benchmarks: SST-2, AGNews, R8, and R52. It performs comparably to state-of-the-art models on MR benchmark.
- CARP shows impressive abilities in low-resource and domain-adaptation setups, achieving comparable performances with only 16 examples per class compared to supervised models with 1,024 examples per class.
- The related work section discusses large language models (LLMs) categorized into encoder-only models like BERT and decoder-only models like GPT.
- CARP significantly improves text classification performance by incorporating clue-based prompting and reasoning strategies into LLMs.
- Future work aims to explore CARP in other natural language understanding tasks.
Authors: Xiaofei Sun, Xiaoya Li, Jiwei Li, Fei Wu, Shangwei Guo, Tianwei Zhang, Guoyin Wang
Abstract: Despite the remarkable success of large-scale Language Models (LLMs) such as GPT-3, their performances still significantly underperform fine-tuned models in the task of text classification. This is due to (1) the lack of reasoning ability in addressing complex linguistic phenomena (e.g., intensification, contrast, irony etc); (2) limited number of tokens allowed in in-context learning. In this paper, we introduce \textbf{C}lue \textbf{A}nd \textbf{R}easoning \textbf{P}rompting (CARP). CARP adopts a progressive reasoning strategy tailored to addressing the complex linguistic phenomena involved in text classification: CARP first prompts LLMs to find superficial clues (e.g., keywords, tones, semantic relations, references, etc), based on which a diagnostic reasoning process is induced for final decisions. To further address the limited-token issue, CARP uses a fine-tuned model on the supervised dataset for $k$NN demonstration search in the in-context learning, allowing the model to take the advantage of both LLM's generalization ability and the task-specific evidence provided by the full labeled dataset. Remarkably, CARP yields new SOTA performances on 4 out of 5 widely-used text-classification benchmarks, 97.39 (+1.24) on SST-2, 96.40 (+0.72) on AGNews, 98.78 (+0.25) on R8 and 96.95 (+0.6) on R52, and a performance comparable to SOTA on MR (92.39 v.s. 93.3). More importantly, we find that CARP delivers impressive abilities on low-resource and domain-adaptation setups. Specifically, Specifically, using 16 examples per class, CARP achieves comparable performances to supervised models with 1,024 examples per class.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.