Counts@IITK at SemEval-2021 Task 8: SciBERT Based Entity And Semantic Relation Extraction For Scientific Data

AI-generated keywords: SemEval 2021 Task 8 MeasEval span extraction classification relation extraction

AI-generated Key Points

  • System developed for SemEval 2021 Task 8 (MeasEval)
  • Utilized SciBERT with [CLS] token embedding and a CRF layer
  • Achieved an overall F1-overlap score of 0.432, ranking fifth on the leaderboard
  • Implementation of the system is available on Github
  • Background information on related work in entity extraction and relation extraction using LSTM CRF, BERT, and CRF layers
  • Task setup for SemEval 2021 Task 8: articles manually annotated for quantities, measured entities, properties, qualifiers, and units
  • Pre-processing steps using SciSpaCy to split paragraphs into sentences for input to the SciBERT model
  • Training dataset included paragraphs with quantities, measured entities, properties, and qualifiers; evaluation set used separate paragraphs for testing
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Akash Gangwar, Sabhay Jain, Shubham Sourav, Ashutosh Modi

Accepted at SemEval 2021 Task 8, 7 Pages (5 Pages main content + 1 page for references + 1 Page Appendix)
License: CC BY-NC-SA 4.0

Abstract: This paper presents the system for SemEval 2021 Task 8 (MeasEval). MeasEval is a novel span extraction, classification, and relation extraction task focused on finding quantities, attributes of these quantities, and additional information, including the related measured entities, properties, and measurement contexts. Our submitted system, which placed fifth (team rank) on the leaderboard, consisted of SciBERT with [CLS] token embedding and CRF layer on top. We were also placed first in Quantity (tied) and Unit subtasks, second in MeasuredEntity, Modifier and Qualifies subtasks, and third in Qualifier subtask.

Submitted to arXiv on 03 Apr. 2021

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2104.01364v1

This paper presents the system developed for SemEval 2021 Task 8 (MeasEval), which focuses on extracting and classifying spans and relations to identify quantities, attributes of quantities, and related information in scientific data. The submitted system utilized SciBERT with [CLS] token embedding and a CRF layer, achieving an overall F1-overlap score of 0.432 and ranking fifth on the leaderboard. The top-performing system on the leaderboard achieved an F1-overlap score of 0.519. The implementation of the system is available on Github. The paper provides background information on related work in entity extraction and relation extraction using models like LSTM CRF, BERT, and CRF layers. It also discusses the task setup for SemEval 2021 Task 8, which includes articles from various sub-domains manually annotated for quantities, measured entities, properties, qualifiers, and units. The system overview details the pre-processing steps using SciSpaCy to split paragraphs into sentences for input to the SciBERT model. The training dataset consisted of paragraphs with quantities, measured entities, properties, and qualifiers while the evaluation set included a separate set of paragraphs for testing. Overall,this paper contributes to semantic relation extraction in scientific data by participating in MeasEval Task 8 at SemEval 2021 and providing insights into system performance analysis.
Created on 18 Nov. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.