BERT-DRE: BERT with Deep Recursive Encoder for Natural Language Sentence Matching

AI-generated keywords: Natural Language Sentence Matching (NLSM)

AI-generated Key Points

  • The paper presents a deep neural architecture for Natural Language Sentence Matching (NLSM) to identify semantic similarity in input text pairs.
  • NLSM models predict a category or scale value for a pair of input texts to indicate their similarity or relationship.
  • Two main steps are used: designing a model to obtain proper representation of the text and developing a matching mechanism to extract complex interactions.
  • The authors collected a Persian Religious question matching dataset containing 18,000 samples with two questions, appropriate answers, and match or not-match labels for each question pair.
  • BERT-DRE model was implemented by adding recursive encoder module to BERT and achieved an F1-score of 90.27% on the test data when trained and evaluated using annotated religious dataset making it the strongest model among those studied.
  • Deep neural networks are preferred approach in NLSM field and researchers use convolutional and recurrent neural networks to determine semantic features and relationships among sentences.
  • BERT-DRE outperforms its predecessor achieving an accuracy score of 90.29% when employing proposed architecture on same dataset.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Ehsan Tavan, Ali Rahmati, Maryam Najafi, Saeed Bibak, Zahed Rahmati

License: CC BY 4.0

Abstract: This paper presents a deep neural architecture, for Natural Language Sentence Matching (NLSM) by adding a deep recursive encoder to BERT so called BERT with Deep Recursive Encoder (BERT-DRE). Our analysis of model behavior shows that BERT still does not capture the full complexity of text, so a deep recursive encoder is applied on top of BERT. Three Bi-LSTM layers with residual connection are used to design a recursive encoder and an attention module is used on top of this encoder. To obtain the final vector, a pooling layer consisting of average and maximum pooling is used. We experiment our model on four benchmarks, SNLI, FarsTail, MultiNLI, SciTail, and a novel Persian religious questions dataset. This paper focuses on improving the BERT results in the NLSM task. In this regard, comparisons between BERT-DRE and BERT are conducted, and it is shown that in all cases, BERT-DRE outperforms BERT. The BERT algorithm on the religious dataset achieved an accuracy of 89.70%, and BERT-DRE architectures improved to 90.29% using the same dataset.

Submitted to arXiv on 03 Nov. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2111.02188v2

This paper presents a deep neural architecture for Natural Language Sentence Matching (NLSM) that aims to identify the semantic similarity of input text pairs. The objective of NLSM models is to predict a category or scale value for a pair of input texts, which indicates their similarity or relationship. To achieve this goal, NLSM models generally use two main steps: designing a model to obtain a proper representation of whatever text will be analyzed so that it can extract semantic features from it, and by using the representation obtained from texts, developing a matching mechanism to extract complex interactions. In this study, the authors collected a Persian Religious question matching dataset containing 18,000 samples with two questions, appropriate answers, and match or not-match labels for each question pair. The dataset was used for designing a chatbot by crawling religious questions from religious question-answering websites and annotating two similar questions for each question and generating dissimilar questions automatically. They implemented BERT with Deep Recursive Encoder (BERT-DRE) model by adding recursive encoder module to BERT (Devlin et al., 2019). Three Bi-LSTM layers with residual connection were used to design a recursive encoder and an attention module was used on top of this encoder. To obtain the final vector, a pooling layer consisting of average and maximum pooling was used. The authors evaluated BERT-DRE with related models using the introduced religious and benchmark datasets such as SNLI, FarsTail, MultiNLI, SciTail. It was noted that their BERT-DRE model achieved an F1-score of 90.27% on the test data when trained and evaluated using annotated religious dataset making it the strongest model among those studied. Furthermore, in order to better evaluate the BERT-DRE model's performance on other datasets such as SNLI, MultiNLI, FarsTail, and SciTail they achieved appropriate F1-scores. The authors investigated related models to the field of NLSM and found that deep neural networks are the preferred approach. To determine semantic features and relationships among sentences, researchers use convolutional and recurrent neural networks. The structure of convolutional neural networks (CNNs) make them very capable of extracting local features while recurrent neural networks can extract temporal features by considering natural language texts as a sequence of words. The Long Short Term Memory (LSTM), a variant of RNN’s is capable of extracting long-term dependency in an appropriate way. In conclusion, this paper focuses on improving BERT results in NLSM task by adding deep recursive encoder to BERT so called BERT with Deep Recursive Encoder (BERT-DRE). The authors experiment their model on four benchmarks: SNLI; FarsTail; MultiNLI; SciTail; as well as novel Persian religious questions dataset. Comparisons between BERT-DRE and BERT are conducted showing that in all cases BERT-DRE outperforms its predecessor - achieving an accuracy score 89.70% using only the original algorithm versus 90.29% when employing proposed architecture on same dataset..
Created on 27 Apr. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.