Text Classification via Large Language Models

AI-generated keywords: Clue And Reasoning Prompting (CARP)

AI-generated Key Points

  • CARP enhances the performance of large-scale Language Models (LLMs) in text classification tasks.
  • LLMs like GPT-3 underperform compared to fine-tuned models in text classification due to their lack of reasoning ability and limited number of tokens allowed in in-context learning.
  • CARP adopts a progressive reasoning strategy by prompting LLMs to identify superficial clues and inducing a diagnostic reasoning process for making final decisions.
  • CARP utilizes a fine-tuned model on a supervised dataset for kNN demonstration search during the in-context learning phase.
  • CARP achieves new state-of-the-art performances on four out of five widely-used text-classification benchmarks: SST-2, AGNews, R8, and R52. It performs comparably to state-of-the-art models on MR benchmark.
  • CARP shows impressive abilities in low-resource and domain-adaptation setups, achieving comparable performances with only 16 examples per class compared to supervised models with 1,024 examples per class.
  • The related work section discusses large language models (LLMs) categorized into encoder-only models like BERT and decoder-only models like GPT.
  • CARP significantly improves text classification performance by incorporating clue-based prompting and reasoning strategies into LLMs.
  • Future work aims to explore CARP in other natural language understanding tasks.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Xiaofei Sun, Xiaoya Li, Jiwei Li, Fei Wu, Shangwei Guo, Tianwei Zhang, Guoyin Wang

Pre-print
License: CC BY 4.0

Abstract: Despite the remarkable success of large-scale Language Models (LLMs) such as GPT-3, their performances still significantly underperform fine-tuned models in the task of text classification. This is due to (1) the lack of reasoning ability in addressing complex linguistic phenomena (e.g., intensification, contrast, irony etc); (2) limited number of tokens allowed in in-context learning. In this paper, we introduce \textbf{C}lue \textbf{A}nd \textbf{R}easoning \textbf{P}rompting (CARP). CARP adopts a progressive reasoning strategy tailored to addressing the complex linguistic phenomena involved in text classification: CARP first prompts LLMs to find superficial clues (e.g., keywords, tones, semantic relations, references, etc), based on which a diagnostic reasoning process is induced for final decisions. To further address the limited-token issue, CARP uses a fine-tuned model on the supervised dataset for $k$NN demonstration search in the in-context learning, allowing the model to take the advantage of both LLM's generalization ability and the task-specific evidence provided by the full labeled dataset. Remarkably, CARP yields new SOTA performances on 4 out of 5 widely-used text-classification benchmarks, 97.39 (+1.24) on SST-2, 96.40 (+0.72) on AGNews, 98.78 (+0.25) on R8 and 96.95 (+0.6) on R52, and a performance comparable to SOTA on MR (92.39 v.s. 93.3). More importantly, we find that CARP delivers impressive abilities on low-resource and domain-adaptation setups. Specifically, Specifically, using 16 examples per class, CARP achieves comparable performances to supervised models with 1,024 examples per class.

Submitted to arXiv on 15 May. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2305.08377v1

, , , , This paper presents Clue And Reasoning Prompting (CARP) as a solution to enhance the performance of large-scale Language Models (LLMs) in text classification tasks. While LLMs like GPT-3 have achieved remarkable success, they still underperform compared to fine-tuned models in text classification. This is due to their lack of reasoning ability in addressing complex linguistic phenomena and the limited number of tokens allowed in in-context learning. To overcome these limitations, CARP adopts a progressive reasoning strategy tailored specifically for handling complex linguistic phenomena involved in text classification. It first prompts LLMs to identify superficial clues such as keywords, tones, semantic relations, and references. Based on these clues, CARP induces a diagnostic reasoning process for making final decisions. In addition, CARP utilizes a fine-tuned model on a supervised dataset for kNN demonstration search during the in-context learning phase. This allows the model to leverage both the generalization ability of LLMs and the task-specific evidence provided by the labeled dataset. The results demonstrate that CARP achieves new state-of-the-art performances on four out of five widely-used text-classification benchmarks: SST-2, AGNews, R8, and R52. It also performs comparably to state-of-the-art models on MR benchmark. Notably, CARP shows impressive abilities in low-resource and domain-adaptation setups. With only 16 examples per class, CARP achieves comparable performances to supervised models with 1,024 examples per class. The related work section discusses large language models (LLMs) and categorizes them into encoder-only models like BERT and decoder-only models like GPT. These models follow the pre-training then fine-tuning paradigm for NLP tasks. In conclusion, CARP significantly improves text classification performance by incorporating clue-based prompting and reasoning strategies into large-scale Language Models (LLMs). The approach achieves state-of-the-art results on multiple benchmarks and demonstrates promising capabilities in low-resource and domain-adaptation scenarios. Future work aims to explore CARP in other natural language understanding tasks.
Created on 07 Jan. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.