Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text

AI-generated keywords: NLP IR Sanskrit Text Analysis Knowledge Graphs Ayurvedic Texts

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Knowledge bases (KBs) are valuable resources for NLP and IR tasks
  • Current state-of-the-art in Sanskrit NLP does not allow for automated construction of KBs
  • Authors manually annotate Sanskrit text to create a knowledge graph (KG)
  • KG consists of 410 entities and 764 relationships from the chapter Dhanyavarga of Bhavaprakashanighantu
  • Elaborate ontology is developed to capture semantics of entity and relationship types
  • 31 query templates are designed to facilitate querying of the KG
  • Sangrahaka framework is customized for manual annotation and querying processes
  • Entire system, including dataset, is available at https://sanskrit.iitk.ac.in/ayurveda/
  • Manually annotated KG will aid in future development and testing of NLP tools and enable further study of Bhavaprakasanighantu text
  • Paper presented at World Sanskrit Conference (WSC) 2022 with a total length of 19 pages including appendixes
  • Research focuses on applying NLP techniques to IR tasks related to Sanskrit texts, specifically Ayurveda domain
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Hrishikesh Terdalkar, Arnab Bhattacharya, Madhulika Dubey, Ramamurthy S, Bhavna Naneria Singh

World Sanskrit Conference (WSC) 2022
19 pages including appendix

Abstract: Knowledge bases (KB) are an important resource in a number of natural language processing (NLP) and information retrieval (IR) tasks, such as semantic search, automated question-answering etc. They are also useful for researchers trying to gain information from a text. Unfortunately, however, the state-of-the-art in Sanskrit NLP does not yet allow automated construction of knowledge bases due to unavailability or lack of sufficient accuracy of tools and methods. Thus, in this work, we describe our efforts on manual annotation of Sanskrit text for the purpose of knowledge graph (KG) creation. We choose the chapter Dhanyavarga from Bhavaprakashanighantu of the Ayurvedic text Bhavaprakasha for annotation. The constructed knowledge graph contains 410 entities and 764 relationships. Since Bhavaprakashanighantu is a technical glossary text that describes various properties of different substances, we develop an elaborate ontology to capture the semantics of the entity and relationship types present in the text. To query the knowledge graph, we design 31 query templates that cover most of the common question patterns. For both manual annotation and querying, we customize the Sangrahaka framework previously developed by us. The entire system including the dataset is available from https://sanskrit.iitk.ac.in/ayurveda/ . We hope that the knowledge graph that we have created through manual annotation and subsequent curation will help in development and testing of NLP tools in future as well as studying of the Bhavaprakasanighantu text.

Submitted to arXiv on 01 Feb. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2202.00216v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text," authors Hrishikesh Terdalkar, Arnab Bhattacharya, Madhulika Dubey, Ramamurthy S., and Bhavna Naneria Singh discuss the importance of knowledge bases (KB) in natural language processing (NLP) and information retrieval (IR) tasks. They emphasize that while KBs are valuable resources for researchers seeking information from texts, the current state-of-the-art in Sanskrit NLP does not allow for automated construction of knowledge bases due to limited availability and accuracy of tools and methods. To address this limitation, the authors describe their efforts in manually annotating Sanskrit text to create a knowledge graph (KG). Specifically, they choose the chapter Dhanyavarga from Bhavaprakashanighantu of the Ayurvedic text Bhavaprakasha for annotation. The resulting KG consists of 410 entities and 764 relationships. Given that Bhavaprakashanighantu is a technical glossary text describing various properties of different substances, the authors develop an elaborate ontology to capture the semantics of entity and relationship types present in the text. Additionally, they design 31 query templates covering common question patterns to facilitate querying of the KG. For both manual annotation and querying processes, the authors customize the Sangrahaka framework previously developed by them. The entire system including dataset used in their work is made available at https://sanskrit.iitk.ac.in/ayurveda/. The authors anticipate that this manually annotated and curated KG will aid in future development and testing of NLP tools as well as enable further study of Bhavaprakasanighantu text. The paper was presented at World Sanskrit Conference (WSC) 2022 by Hrishikesh Terdalkar et al., with a total length of 19 pages including appendixes. The authors' research focuses on exploring how NLP techniques can be applied to IR tasks related to Sanskrit texts specifically those belonging to Ayurveda domain. They hope that their work will provide useful insights into developing better NLP tools for analyzing such texts as well as facilitating further studies on these texts through providing access to a manually annotated knowledge graph created using their proposed framework.
Created on 26 Sep. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.