Comparing Formulaic Language in Human and Machine Translation: Insight from a Parliamentary Corpus

AI-generated keywords: Neural Machine Translation Human Translations Parliamentary Corpus Text Genres Collocational Bigrams

AI-generated Key Points

Study aims to replicate previous research comparing neural machine translations to human translations
Previous study found that neural machine translations have more formulaic sequences with high-frequency words, but fewer with rare words compared to human translations
Researchers used a parliamentary corpus to replicate the findings
Corpus was translated from French to English using DeepL, Google Translate, and Microsoft Translator
Results confirmed previous observations but with less pronounced differences
Suggests that using text genres resulting in more literal translations (e.g., parliamentary corpora) is preferable for comparing human and machine translations
Google translations had fewer highly collocational bigrams than DeepL and Microsoft translations
Findings provide insights into differences between neural machine translation systems and emphasize the importance of considering text genre when evaluating translation quality.

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yves Bestgen

arXiv: 2206.10919v1 - DOI (cs.CL)

Presented at ParlaCLARIN III: Workshop on Creating, Enriching and Using Parliamentary Corpora

License: CC BY 4.0

Abstract: A recent study has shown that, compared to human translations, neural machine translations contain more strongly-associated formulaic sequences made of relatively high-frequency words, but far less strongly-associated formulaic sequences made of relatively rare words. These results were obtained on the basis of translations of quality newspaper articles in which human translations can be thought to be not very literal. The present study attempts to replicate this research using a parliamentary corpus. The text were translated from French to English by three well-known neural machine translation systems: DeepL, Google Translate and Microsoft Translator. The results confirm the observations on the news corpus, but the differences are less strong. They suggest that the use of text genres that usually result in more literal translations, such as parliamentary corpora, might be preferable when comparing human and machine translations. Regarding the differences between the three neural machine systems, it appears that Google translations contain fewer highly collocational bigrams, identified by the CollGram technique, than Deepl and Microsoft translations.

Submitted to arXiv on 22 Jun. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2206.10919v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

This study aims to replicate previous research that compared neural machine translations to human translations. The previous study found that neural machine translations contain more strongly-associated formulaic sequences made of high-frequency words, but fewer strongly-associated formulaic sequences made of rare words, compared to human translations. In this study, the researchers used a parliamentary corpus to see if the findings could be replicated. The text in the corpus was translated from French to English using three well-known neural machine translation systems: DeepL, Google Translate, and Microsoft Translator. The results confirmed the observations from the news corpus but with less pronounced differences. This suggests that using text genres that typically result in more literal translations, such as parliamentary corpora, may be preferable when comparing human and machine translations. Additionally, the study found that Google translations contained fewer highly collocational bigrams than DeepL and Microsoft translations. These findings provide insights into the differences between neural machine translation systems and highlight the importance of considering text genre when evaluating translation quality.

- Study aims to replicate previous research comparing neural machine translations to human translations
- Previous study found that neural machine translations have more formulaic sequences with high-frequency words, but fewer with rare words compared to human translations
- Researchers used a parliamentary corpus to replicate the findings
- Corpus was translated from French to English using DeepL, Google Translate, and Microsoft Translator
- Results confirmed previous observations but with less pronounced differences
- Suggests that using text genres resulting in more literal translations (e.g., parliamentary corpora) is preferable for comparing human and machine translations
- Google translations had fewer highly collocational bigrams than DeepL and Microsoft translations
- Findings provide insights into differences between neural machine translation systems and emphasize the importance of considering text genre when evaluating translation quality.

Researchers wanted to do a study that is similar to another study. The other study compared translations done by computers to translations done by people. They found that computer translations have more common words, but fewer uncommon words compared to human translations. The researchers used a big collection of documents from the government to do their study. They translated the documents from French to English using three different computer programs: DeepL, Google Translate, and Microsoft Translator. The results of the new study confirmed what they found in the previous study, but the differences were not as big. This means it's better to use certain types of texts when comparing computer and human translations. Google Translate had fewer pairs of words that often go together compared to DeepL and Microsoft Translator. These findings help us understand how computer translation systems are different and remind us that we need to think about the type of text when judging translation quality." Definitions- Neural machine translations: Translations done by computers using artificial intelligence. - Formulaic sequences: Groups of words that are commonly used together. - High-frequency words: Words that are used often. - Rare words: Words that are not used very often. - Corpus: A large collection of written or spoken texts. - Literal translations: Translations where each word is translated exactly as it is in the original language. - Collocational bigrams: Pairs of words that often appear together in a specific order.

Comparing Neural Machine Translations to Human Translations

In recent years, neural machine translation (NMT) systems have become increasingly popular for quickly and accurately translating text from one language to another. However, it is still unclear how these translations compare to those of humans. A recent study sought to replicate previous research that compared NMTs with human translations in order to gain a better understanding of the differences between them.

Previous Research Findings

The previous study found that NMTs contain more strongly-associated formulaic sequences made up of high-frequency words than human translations do, but fewer strongly-associated formulaic sequences made up of rare words. This suggests that while NMTs may be good at translating common phrases and expressions, they may struggle when it comes to more complex or rarer language.

This Study's Methodology

To see if these findings could be replicated, the researchers used a parliamentary corpus as their source material for this study. The text was translated from French into English using three well-known NMT systems: DeepL, Google Translate, and Microsoft Translator.

Results

The results confirmed the observations from the news corpus but with less pronounced differences. This suggests that using text genres that typically result in more literal translations—such as parliamentary corpora—may be preferable when comparing human and machine translations. Additionally, the study found that Google translations contained fewer highly collocational bigrams than DeepL and Microsoft translations. These findings provide insights into the differences between NMT systems and highlight the importance of considering text genre when evaluating translation quality.

Conclusion

Overall, this study provides valuable insight into how different types of texts are translated by both humans and machines alike. It also highlights the importance of considering text genre when evaluating translation quality as different genres can produce very different results depending on which system is used for translation purposes.

Created on 25 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

57.0%

Direct Speech Translation for Automatic Subtitling

cs.CL

55.8%

ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitt…

cs.CL

55.1%

How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation

cs.CL

54.1%

Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation w…

cs.CL

53.7%

BLEU, METEOR, BERTScore: Evaluation of Metrics Performance in Assessing Criti…

cs.CL

52.6%

News Summarization and Evaluation in the Era of GPT-3

cs.CL

52.5%

LLM-powered Data Augmentation for Enhanced Crosslingual Performance

cs.CL

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.