A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
AI-generated Key Points
- Controllable Text Generation (CTG) is a rapidly growing area in natural language generation (NLG)
- CTG focuses on developing advanced text generation technologies to meet specific constraints in practical applications
- One popular approach in CTG is the use of large-scale pre-trained language models (PLMs), particularly transformer-based PLMs
- Limited interpretability of deep neural networks poses challenges to ensuring controllability in these methods
- The paper presents a systematic critical review of common tasks, main approaches, and evaluation methods in CTG using transformer-based PLMs
- PPLM is highlighted as a representative method: it trains an attribute discriminator, either a user-specified bag of words or a single learned layer, to guide the PLM toward text that carries the desired attribute
- PPLM achieves significant improvement in attribute alignment but slightly decreases text fluency measured by perplexity
- MEGATRON-CNTRL combines external knowledge with PLMs for controllable story generation, using a keyword predictor, a knowledge retriever, and a knowledge ranker
- PAIR is proposed as a content-controlled text generation framework that uses BERT to construct content plans and BART to fill the masked tokens in the resulting text templates without modifying their structure
- Experimental results show that PAIR improves the relevance and coherence between key phrases and generated texts
- Challenges facing the field of CTG include the reliance of deep-learning-based methods on large datasets and potential bias in handcrafted features
Authors: Hanqing Zhang, Haolin Song, Shaoyu Li, Ming Zhou, Dawei Song
Abstract: Controllable Text Generation (CTG) is an emerging area in the field of natural language generation (NLG). It is regarded as crucial for the development of advanced text generation technologies that better meet the specific constraints in practical applications. In recent years, methods using large-scale pre-trained language models (PLMs), in particular the widely used transformer-based PLMs, have become a new paradigm of NLG, allowing generation of more diverse and fluent text. However, due to the limited level of interpretability of deep neural networks, the controllability of these methods needs to be guaranteed. To this end, controllable text generation using transformer-based PLMs has become a rapidly growing yet challenging new research hotspot. A diverse range of approaches has emerged in the last 3-4 years, targeting different CTG tasks that require different types of controlled constraints. In this paper, we present a systematic critical review of the common tasks, main approaches, and evaluation methods in this area. Finally, we discuss the challenges that the field is facing, and put forward various promising future directions. To the best of our knowledge, this is the first survey paper to summarize the state-of-the-art CTG techniques from the perspective of Transformer-based PLMs. We hope it can help researchers and practitioners in the related fields to quickly track the academic and technological frontier, providing them with a landscape of the area and a roadmap for future research.