A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models

AI-generated keywords: Controllable Text Generation

AI-generated Key Points

  • Controllable Text Generation (CTG) is a rapidly growing area in natural language generation (NLG)
  • Transformer-based Pre-trained Language Models (PLMs) have emerged as a new paradigm in NLG
  • Controllability of PLMs needs to be guaranteed due to the lower interpretability of deep neural networks
  • Researchers have focused on controllable text generation using transformer-based PLMs
  • Deep learning-based methods, such as GANs and Energy-based Models, have shown potential in text generation
  • Large-scale pre-trained Language Models (PLMs), such as BERT, RoBERTa, GPT, T5, and mBART, have become a new paradigm in NLP since 2018
  • PLMs can generate high-quality texts with fine-tuning for downstream tasks and specific constraints without external domain knowledge
  • This paper presents a systematic critical review of common tasks, main approaches, and evaluation methods in CTG using PLM-based models
  • Improving interpretability and controllability of PLMs is a hot research topic in generating text
  • The paper focuses on PLM-based methods as they are becoming mainstream in CTG research
  • The survey aims to help researchers understand the landscape and cutting-edge methods in PLM-based CTG.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Hanqing Zhang, Haolin Song, Shaoyu Li, Ming Zhou, Dawei Song

License: CC ZERO 1.0

Abstract: Controllable Text Generation (CTG) is emerging area in the field of natural language generation (NLG). It is regarded as crucial for the development of advanced text generation technologies that are more natural and better meet the specific constraints in practical applications. In recent years, methods using large-scale pre-trained language models (PLMs), in particular the widely used transformer-based PLMs, have become a new paradigm of NLG, allowing generation of more diverse and fluent text. However, due to the lower level of interpretability of deep neural networks, the controllability of these methods need to be guaranteed. To this end, controllable text generation using transformer-based PLMs has become a rapidly growing yet challenging new research hotspot. A diverse range of approaches have emerged in the recent 3-4 years, targeting different CTG tasks which may require different types of controlled constraints. In this paper, we present a systematic critical review on the common tasks, main approaches and evaluation methods in this area. Finally, we discuss the challenges that the field is facing, and put forward various promising future directions. To the best of our knowledge, this is the first survey paper to summarize CTG techniques from the perspective of PLMs. We hope it can help researchers in related fields to quickly track the academic frontier, providing them with a landscape of the area and a roadmap for future research.

Submitted to arXiv on 14 Jan. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2201.05337v1

Controllable Text Generation (CTG) is a rapidly growing area in natural language generation (NLG) that aims to develop advanced text generation technologies that are more natural and better suited for practical applications. In recent years, transformer-based Pre-trained Language Models (PLMs) have emerged as a new paradigm in NLG, allowing for the generation of diverse and fluent text. However, the controllability of these methods needs to be guaranteed due to the lower interpretability of deep neural networks. To address this challenge, researchers have focused on controllable text generation using transformer-based PLMs. Various approaches have been proposed targeting different CTG tasks that require different types of controlled constraints. Deep learning-based methods, such as Generative Adversarial Networks (GANs) and Energy-based Models, have shown potential in text generation by learning low-dimensional dense vectors that represent linguistic features. However, these methods heavily rely on large-scale datasets, posing challenges for supervised and cross-domain text generation tasks. Large-scale pre-trained Language Models (PLMs), such as BERT, RoBERTa, GPT, T5, and mBART, have become a new paradigm in NLP since 2018. These models leverage unsupervised learning based on the Transformer structure to learn semantic and syntactical knowledge from large corpora. They can generate high-quality texts with fine-tuning for downstream tasks. Moreover, PLMs can generate text with specific constraints without external domain knowledge. This paper presents a systematic critical review of common tasks, main approaches, and evaluation methods in this area. It also discusses the challenges faced by the field and suggests promising future directions. However, PLMs are still black boxes with limited interpretability and controllability. Improving their interpretability and controllability has become a hot research topic in generating text using PLM-based models. This paper focuses on PLM-based methods as they are becoming mainstream in CTG research. It provides a comprehensive review of the current literature, including representative application tasks, main approaches, and evaluation methodologies. The paper also discusses future research directions. Overall, this survey aims to help researchers quickly understand the landscape and cutting-edge methods in PLM-based CTG.
Created on 29 Oct. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.