Multitasking Framework for Unsupervised Simple Definition Generation

AI-generated keywords: Simple Definition Generation Language Learning Low Literacy Levels Multitasking Framework Prompt Learning Techniques

AI-generated Key Points

  • Authors propose Simple Definition Generation (SDG) task to assist language learners and individuals with low literacy levels
  • Lack of learner's dictionaries in many languages poses a challenge for SDG
  • Introduce multitasking framework called SimpDefiner using standard dictionary and corpus of simple texts
  • Parameter sharing scheme between decoders enables generation of complex and simple definitions simultaneously
  • Incorporate text reconstruction task to control complexity and language modeling task to enhance performance
  • Evaluation on novel test set in English aligning Oxford Dictionary (OD) and Oxford Advanced Learner's Dictionary (OALD)
  • SimpDefiner outperforms other models in generating accurate and straightforward definitions
  • Future aim to combine method with prompt learning techniques for further complexity conditioning
  • Research accepted by ACL 2022 for presentation at main conference
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Cunliang Kong, Yun Chen, Hengyuan Zhang, Liner Yang, Erhong Yang

Accepted by ACL 2022 (main conference)
License: CC BY 4.0

Abstract: The definition generation task can help language learners by providing explanations for unfamiliar words. This task has attracted much attention in recent years. We propose a novel task of Simple Definition Generation (SDG) to help language learners and low literacy readers. A significant challenge of this task is the lack of learner's dictionaries in many languages, and therefore the lack of data for supervised training. We explore this task and propose a multitasking framework SimpDefiner that only requires a standard dictionary with complex definitions and a corpus containing arbitrary simple texts. We disentangle the complexity factors from the text by carefully designing a parameter sharing scheme between two decoders. By jointly training these components, the framework can generate both complex and simple definitions simultaneously. We demonstrate that the framework can generate relevant, simple definitions for the target words through automatic and manual evaluations on English and Chinese datasets. Our method outperforms the baseline model by a 1.77 SARI score on the English dataset, and raises the proportion of the low level (HSK level 1-3) words in Chinese definitions by 3.87%.

Submitted to arXiv on 24 Mar. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2203.12926v1

The authors propose a novel task of Simple Definition Generation (SDG) to assist language learners and individuals with low literacy levels. The lack of learner's dictionaries in many languages poses a significant challenge for this task. To address this issue, the authors introduce a multitasking framework called SimpDefiner that utilizes a standard dictionary with complex definitions and a corpus of simple texts. By carefully designing a parameter sharing scheme between two decoders, the framework can generate both complex and simple definitions simultaneously. Additionally, the authors incorporate a text reconstruction task to control text complexity and a language modeling task to enhance the decoder's performance. To evaluate the effectiveness of their proposed framework, the authors construct a novel test set in English by aligning two dictionaries - Oxford Dictionary (OD) and Oxford Advanced Learner's Dictionary (OALD). Automatic and manual evaluations demonstrate that the SimpDefiner framework outperforms other models and generation-simplification pipelines in terms of generating accurate and straightforward definitions. Moving forward, the authors aim to explore combining their current method with prompt learning techniques to allow users to condition the complexity of generated definitions further. This research was accepted by ACL 2022 for presentation at its main conference.
Created on 28 Apr. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.