Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing
AI-generated Key Points
- Investigating the use of Large Language Models (LLMs) for Neural Machine Translation (NMT)
- Initial experiments showed performance degradation when fine-tuning LLMs directly for translation
- Proposing to adapt LLMs as Automatic Post-Editors (APE) instead of direct translators
- Applying Low-Rank-Adapter fine-tuning to APE, yielding significant improvements on sentence- and document-level metrics and generalizing to out-of-domain data
- Achieving a state-of-the-art accuracy of 89% on the ContraPro test set, which assesses the resolution of pronoun ambiguities in English-to-German translation
- Demonstrating that, in manual post-editing for document-level translation, human corrections supplied as reference context significantly reduce the edits required for subsequent translations
- Exploring Chunk-Based and Batched Sliding Window approaches for handling documents during translation
- Highlighting the potential of LLMs as APEs to improve translation quality at both the sentence and document level
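The key points name Chunk-Based and Batched Sliding Window approaches without describing them. The sketch below shows one plausible reading of the two splitting strategies: non-overlapping chunks versus overlapping windows that advance one sentence at a time. The function names, the `size` parameter, and the stride of one are assumptions made for illustration; the paper's actual procedure may differ.

```python
from typing import List


def chunk_based(sentences: List[str], size: int) -> List[List[str]]:
    """Split a document into consecutive, non-overlapping chunks.

    Hypothetical helper: `size` is the assumed number of sentences per chunk.
    The last chunk may be shorter than `size`.
    """
    if size < 1:
        raise ValueError("size must be at least 1")
    return [sentences[i:i + size] for i in range(0, len(sentences), size)]


def sliding_windows(sentences: List[str], size: int) -> List[List[str]]:
    """Split a document into overlapping windows with a stride of one sentence.

    Hypothetical helper: each window shares size - 1 sentences with the
    previous one, so every sentence is seen together with its neighbours.
    """
    if size < 1:
        raise ValueError("size must be at least 1")
    if not sentences:
        return []
    if len(sentences) <= size:
        # The document fits into a single window.
        return [list(sentences)]
    return [sentences[i:i + size] for i in range(len(sentences) - size + 1)]


doc = ["s1", "s2", "s3", "s4", "s5"]
chunks = chunk_based(doc, 2)
windows = sliding_windows(doc, 3)
```

Under these assumptions, chunking processes each sentence exactly once, while the sliding window processes most sentences several times in exchange for more surrounding context per window.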
Authors: Sai Koneru, Miriam Exel, Matthias Huck, Jan Niehues
Abstract: Large Language Models (LLMs) have demonstrated considerable success in various Natural Language Processing tasks, but they have yet to attain state-of-the-art performance in Neural Machine Translation (NMT). Nevertheless, their significant performance in tasks demanding a broad understanding and contextual processing shows their potential for translation. To exploit these abilities, we investigate using LLMs for MT and explore recent parameter-efficient fine-tuning techniques. Surprisingly, our initial experiments find that fine-tuning for translation purposes even led to performance degradation. To overcome this, we propose an alternative approach: adapting LLMs as Automatic Post-Editors (APE) rather than direct translators. Building on the LLM's exceptional ability to process and generate lengthy sequences, we also propose extending our approach to document-level translation. We show that leveraging Low-Rank-Adapter fine-tuning for APE can yield significant improvements across both sentence and document-level metrics while generalizing to out-of-domain data. Most notably, we achieve a state-of-the-art accuracy rate of 89% on the ContraPro test set, which specifically assesses the model's ability to resolve pronoun ambiguities when translating from English to German. Lastly, we investigate a practical scenario involving manual post-editing for document-level translation, where reference context is made available. Here, we demonstrate that leveraging human corrections can significantly reduce the number of edits required for subsequent translations. An interactive demo for integrating manual feedback is available at https://huggingface.co/spaces/skoneru/contextual_refinement_ende
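The 89% figure is an accuracy on a contrastive test set. As a rough illustration of how such an accuracy is commonly computed, the sketch below counts an example as correct when the model scores the reference translation above every contrastive variant (for instance, the same sentence with a wrong pronoun). The function name, the data layout, and the scores are hypothetical, and the authors' exact evaluation protocol may differ from this standard setup.

```python
from typing import Sequence, Tuple

# Each example pairs the model score of the correct translation with the
# scores of its contrastive (incorrect) variants. Higher means more likely,
# for instance a log-probability. This layout is an assumption.
Example = Tuple[float, Sequence[float]]


def contrastive_accuracy(examples: Sequence[Example]) -> float:
    """Fraction of examples where the correct translation outscores all variants."""
    if not examples:
        raise ValueError("need at least one example")
    correct = sum(
        1
        for reference_score, contrastive_scores in examples
        if all(reference_score > score for score in contrastive_scores)
    )
    return correct / len(examples)


# Made-up scores for illustration only, not results from the paper.
toy_examples = [
    (-1.0, [-2.0, -3.0]),  # reference wins
    (-2.5, [-2.0, -4.0]),  # one contrastive variant scores higher
    (-0.5, [-0.7]),        # reference wins
]
accuracy = contrastive_accuracy(toy_examples)
```

Ties are counted as errors here, which is a conservative design choice rather than something the source specifies.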