A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models

AI-generated keywords: Controllable Text Generation

AI-generated Key Points

  • Controllable Text Generation (CTG) is a rapidly growing area in natural language generation (NLG)
  • CTG focuses on developing advanced text generation technologies to meet specific constraints in practical applications
  • One popular approach in CTG is the use of large-scale pre-trained language models (PLMs), particularly transformer-based PLMs
  • Limited interpretability of deep neural networks poses challenges to ensuring controllability in these methods
  • The paper presents a systematic critical review of common tasks, main approaches, and evaluation methods in CTG using transformer-based PLMs
  • PPLM is highlighted as a representative method that trains an attribute discriminator to steer the PLM toward text with a desired attribute; the attribute model is either a user-specified bag of words or a single-layer classifier
  • PPLM significantly improves attribute alignment at a slight cost in text fluency, as measured by perplexity
  • MEGATRON-CNTRL combines external knowledge and PLMs for controllable story generation using predictors, retrievers, and rankers
  • PAIR is described as a content-controlled text generation framework that uses BERT to construct content plans and BART to fill masked tokens in generated text templates without modifying their structure
  • Experimental results demonstrate improved relevance and coherence between key phrases and generated texts with PAIR
  • Challenges faced by the field of CTG include reliance on large datasets for deep learning based methods and potential bias in handcrafted features
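
Perplexity, the fluency measure named in the key points above, is the exponential of the average negative log-likelihood per token, so lower values mean the evaluating language model finds the text more predictable. The sketch below is only an illustration: the function name and the per-token probabilities are hypothetical and are not taken from the paper.

```python
import math
from typing import Sequence


def perplexity(token_log_probs: Sequence[float]) -> float:
    """Return exp of the mean negative natural-log probability per token."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Hypothetical per-token probabilities assigned by an evaluator LM to two texts:
# one from an unconstrained decoder, one from an attribute-steered decoder.
baseline = [math.log(p) for p in (0.5, 0.25, 0.5, 0.25)]
steered = [math.log(p) for p in (0.25, 0.25, 0.25, 0.25)]

print(perplexity(baseline))  # lower perplexity: judged more fluent
print(perplexity(steered))   # higher perplexity: judged less fluent
```

In practice the log-probabilities would come from a separate evaluation model scoring the generated text, which is an assumption about the setup rather than a detail stated on this page.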

Authors: Hanqing Zhang, Haolin Song, Shaoyu Li, Ming Zhou, Dawei Song

Accepted by the ACM Computing Surveys journal
License: CC ZERO 1.0

Abstract: Controllable Text Generation (CTG) is an emerging area in the field of natural language generation (NLG). It is regarded as crucial for the development of advanced text generation technologies that better meet the specific constraints in practical applications. In recent years, methods using large-scale pre-trained language models (PLMs), in particular the widely used transformer-based PLMs, have become a new paradigm of NLG, allowing generation of more diverse and fluent text. However, due to the limited level of interpretability of deep neural networks, the controllability of these methods needs to be guaranteed. To this end, controllable text generation using transformer-based PLMs has become a rapidly growing yet challenging new research hotspot. A diverse range of approaches has emerged in the past 3-4 years, targeting different CTG tasks that require different types of controlled constraints. In this paper, we present a systematic critical review of the common tasks, main approaches, and evaluation methods in this area. Finally, we discuss the challenges that the field is facing, and put forward various promising future directions. To the best of our knowledge, this is the first survey paper to summarize the state-of-the-art CTG techniques from the perspective of Transformer-based PLMs. We hope it can help researchers and practitioners in the related fields to quickly track the academic and technological frontier, providing them with a landscape of the area and a roadmap for future research.

Submitted to arXiv on 14 Jan. 2022

Results of the summarizing process for the arXiv paper: 2201.05337v5

Controllable Text Generation (CTG) is a rapidly growing area of natural language generation (NLG) that develops text generation technologies able to meet specific constraints in practical applications. One popular approach is to use large-scale pre-trained language models (PLMs), particularly transformer-based PLMs, which generate more diverse and fluent text. However, the limited interpretability of deep neural networks makes it hard to guarantee controllability. In this paper, the authors present a systematic critical review of common tasks, main approaches, and evaluation methods in CTG using transformer-based PLMs, covering techniques that have emerged over the past 3-4 years for CTG tasks with different types of control constraints.

The authors highlight PPLM as a representative method. It trains an attribute discriminator that steers the PLM toward text with the desired attribute; the attribute model is either a user-specified bag of words or a single-layer classifier. PPLM significantly improves attribute alignment at a slight cost in text fluency, as measured by perplexity.

Another framework discussed is MEGATRON-CNTRL, which combines external knowledge with PLMs for controllable story generation. A keyword predictor proposes keywords for the next sentence from the story context, and a knowledge retriever fetches relevant knowledge-enhanced sentences from an external knowledge base. A ranker then selects the most relevant sentences, which are fed into GPT-2 (a type of PLM) together with the story context to generate the next sentence. Human evaluation shows high control success rates when steering with keywords.

The survey also covers PAIR, a content-controlled text generation framework in which BERT constructs content plans and BART fills masked tokens in generated text templates without modifying their structure. An iterative refinement algorithm within the sequence-to-sequence model improves generation quality through flexible editing. Experimental results demonstrate improved relevance and coherence between the key phrases and the generated texts.

The authors also discuss the challenges facing CTG, such as the reliance of deep-learning-based methods on large datasets and the potential bias of handcrafted features. They highlight the emergence of PLMs as a new paradigm capable of end-to-end learning and of generating high-quality text without external domain knowledge, while noting that further research is needed to address their limitations and to explore promising future directions.
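
The keyword predictor, knowledge retriever, ranker, and generator flow described above for controllable story generation can be sketched as a small pipeline. This is a toy illustration under loud assumptions: every function here is a hypothetical stand-in (word-length heuristics and word overlap instead of trained models), and the `generator` argument is any callable that maps a prompt to text, standing in for the PLM.

```python
from typing import Callable, List


def tokenize(text: str) -> List[str]:
    """Lowercase words with surrounding punctuation stripped."""
    words = (w.strip(".,!?").lower() for w in text.split())
    return [w for w in words if w]


def predict_keywords(context: str, k: int = 2) -> List[str]:
    # Stand-in predictor: the k longest distinct words (ties broken alphabetically).
    words = sorted(set(tokenize(context)), key=lambda w: (-len(w), w))
    return words[:k]


def retrieve(keywords: List[str], knowledge_base: List[str]) -> List[str]:
    # Stand-in retriever: keep sentences that mention any keyword.
    return [s for s in knowledge_base if any(k in tokenize(s) for k in keywords)]


def rank(context: str, candidates: List[str], top_n: int = 1) -> List[str]:
    # Stand-in ranker: order candidates by word overlap with the story context.
    ctx = set(tokenize(context))
    scored = sorted(candidates, key=lambda s: -len(ctx & set(tokenize(s))))
    return scored[:top_n]


def next_sentence(context: str, knowledge_base: List[str],
                  generator: Callable[[str], str]) -> str:
    # Predict keywords, retrieve and rank knowledge, then condition the generator
    # on the selected knowledge followed by the story context.
    keywords = predict_keywords(context)
    knowledge = rank(context, retrieve(keywords, knowledge_base))
    prompt = " ".join(knowledge + [context])
    return generator(prompt)
```

Swapping a different keyword list into the pipeline changes which knowledge is retrieved and therefore what the generator is conditioned on, which is the sense in which keywords act as the control signal in this sketch.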
Created on 29 Oct. 2023

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.