Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text

AI-generated keywords: NLP, IR, Sanskrit Text Analysis, Knowledge Graphs, Ayurvedic Texts

AI-generated Key Points

⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

Knowledge bases (KBs) are valuable resources for NLP and IR tasks
Current state-of-the-art in Sanskrit NLP does not allow for automated construction of KBs
Authors manually annotate Sanskrit text to create a knowledge graph (KG)
KG consists of 410 entities and 764 relationships from the chapter Dhanyavarga of Bhavaprakashanighantu
Elaborate ontology is developed to capture semantics of entity and relationship types
31 query templates are designed to facilitate querying of the KG
Sangrahaka framework is customized for manual annotation and querying processes
Entire system, including dataset, is available at https://sanskrit.iitk.ac.in/ayurveda/
Manually annotated KG will aid future development and testing of NLP tools and enable further study of the Bhavaprakashanighantu text
Paper presented at the World Sanskrit Conference (WSC) 2022; 19 pages including appendix
Research focuses on applying NLP techniques to IR tasks for Sanskrit texts, specifically in the Ayurveda domain
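
To make the idea of an ontology-constrained knowledge graph concrete, here is a minimal Python sketch. It is not the authors' implementation and does not use the Sangrahaka API. The entity types, relation labels, and entity names (`SUBSTANCE`, `HAS_PROPERTY`, `grain-A`, and so on) are hypothetical placeholders, not values taken from the paper's ontology or dataset. The sketch only shows how an ontology can restrict which entity types a relationship may connect.

```python
from dataclasses import dataclass

# Hypothetical ontology: relation label -> (allowed source type, allowed target type).
# These labels are illustrative assumptions, not the paper's actual ontology.
RELATION_TYPES = {
    "HAS_PROPERTY": ("SUBSTANCE", "PROPERTY"),
    "IS_VARIANT_OF": ("SUBSTANCE", "SUBSTANCE"),
}


@dataclass(frozen=True)
class Entity:
    name: str
    type: str


@dataclass(frozen=True)
class Relation:
    source: Entity
    label: str
    target: Entity


class KnowledgeGraph:
    """A tiny in-memory graph that checks each relation against the ontology."""

    def __init__(self):
        self.entities = set()
        self.relations = set()

    def add_relation(self, source, label, target):
        if label not in RELATION_TYPES:
            raise ValueError(f"unknown relation type: {label}")
        expected = RELATION_TYPES[label]
        if (source.type, target.type) != expected:
            raise ValueError(f"{label} expects {expected}, got {(source.type, target.type)}")
        self.entities.update((source, target))
        self.relations.add(Relation(source, label, target))


def build_example_graph():
    """Build a toy graph from made-up annotations."""
    kg = KnowledgeGraph()
    grain = Entity("grain-A", "SUBSTANCE")
    variety = Entity("grain-A-variety", "SUBSTANCE")
    kg.add_relation(grain, "HAS_PROPERTY", Entity("light", "PROPERTY"))
    kg.add_relation(grain, "HAS_PROPERTY", Entity("sweet", "PROPERTY"))
    kg.add_relation(variety, "IS_VARIANT_OF", grain)
    return kg
```

Calling `build_example_graph()` yields a graph whose `entities` and `relations` sets can be counted in the same way the paper reports its totals. A relation whose endpoint types do not match the ontology is rejected with a `ValueError`.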


Authors: Hrishikesh Terdalkar, Arnab Bhattacharya, Madhulika Dubey, Ramamurthy S, Bhavna Naneria Singh

World Sanskrit Conference (WSC) 2022

arXiv: 2202.00216v1 (cs.IR)

19 pages including appendix

License: NONEXCLUSIVE-DISTRIB 1.0

Abstract: Knowledge bases (KB) are an important resource in a number of natural language processing (NLP) and information retrieval (IR) tasks, such as semantic search, automated question-answering etc. They are also useful for researchers trying to gain information from a text. Unfortunately, however, the state-of-the-art in Sanskrit NLP does not yet allow automated construction of knowledge bases due to unavailability or lack of sufficient accuracy of tools and methods. Thus, in this work, we describe our efforts on manual annotation of Sanskrit text for the purpose of knowledge graph (KG) creation. We choose the chapter Dhanyavarga from Bhavaprakashanighantu of the Ayurvedic text Bhavaprakasha for annotation. The constructed knowledge graph contains 410 entities and 764 relationships. Since Bhavaprakashanighantu is a technical glossary text that describes various properties of different substances, we develop an elaborate ontology to capture the semantics of the entity and relationship types present in the text. To query the knowledge graph, we design 31 query templates that cover most of the common question patterns. For both manual annotation and querying, we customize the Sangrahaka framework previously developed by us. The entire system including the dataset is available from https://sanskrit.iitk.ac.in/ayurveda/ . We hope that the knowledge graph that we have created through manual annotation and subsequent curation will help in development and testing of NLP tools in future as well as studying of the Bhavaprakasanighantu text.

Submitted to arXiv on 01 Feb. 2022

Results of the summarizing process for the arXiv paper: 2202.00216v1

In "Semantic Annotation and Querying Framework based on Semi-structured Ayurvedic Text," Hrishikesh Terdalkar, Arnab Bhattacharya, Madhulika Dubey, Ramamurthy S., and Bhavna Naneria Singh discuss the role of knowledge bases (KBs) in natural language processing (NLP) and information retrieval (IR) tasks. KBs are valuable to researchers seeking information from texts, but the current state of the art in Sanskrit NLP does not allow automated KB construction, because the required tools and methods are either unavailable or not accurate enough.

To address this limitation, the authors manually annotate Sanskrit text to create a knowledge graph (KG). They choose the chapter Dhanyavarga from Bhavaprakashanighantu, part of the Ayurvedic text Bhavaprakasha. The resulting KG consists of 410 entities and 764 relationships. Because Bhavaprakashanighantu is a technical glossary that describes various properties of different substances, the authors develop an elaborate ontology to capture the semantics of the entity and relationship types present in the text. They also design 31 query templates that cover common question patterns. For both annotation and querying, they customize the Sangrahaka framework, which they developed previously. The entire system, including the dataset, is available at https://sanskrit.iitk.ac.in/ayurveda/.

The authors expect the manually annotated and curated KG to aid future development and testing of NLP tools and to enable further study of the Bhavaprakashanighantu text. More broadly, their research explores how NLP techniques can be applied to IR tasks for Sanskrit texts, specifically those in the Ayurveda domain. The paper was presented at the World Sanskrit Conference (WSC) 2022 and is 19 pages long, including the appendix.

Summary

- Knowledge bases (KBs) are important resources for tasks involving language processing and information retrieval.
- Currently, there is no advanced technology available for automatically creating KBs in Sanskrit language.
- Authors manually analyze Sanskrit text to create a knowledge graph (KG), which includes entities and relationships from a specific chapter of a book called Bhavaprakashanighantu.
- A detailed ontology is developed to capture the meaning of different types of entities and relationships in the KG.
- The system provides 31 pre-designed question templates to help users query the KG easily.

Definitions

- Knowledge bases (KBs): Valuable resources that contain organized information used for tasks related to understanding and retrieving information from languages.
- NLP: Short for Natural Language Processing, it refers to using computers to understand and process human language.
- IR tasks: Information Retrieval tasks involve finding relevant information from large collections of data or documents.
- Sanskrit: An ancient Indian language with rich literature and texts.
- Automated construction: Using machines or computers to build something without human intervention.
- Manually annotate: To carefully read and mark important information in a text by hand.
- Knowledge graph (KG): A visual representation of knowledge that shows how different concepts are connected through relationships.
- Entities: Objects or things that exist, such as people, places, or objects.
- Relationships: Connections or associations between entities that show how they are related or interact with each other.
- Ontology: A formal representation of knowledge

Background Information

The authors choose the chapter Dhanyavarga from Bhavaprakashanighantu, part of the Ayurvedic text Bhavaprakasha, for annotation. The resulting KG consists of 410 entities and 764 relationships. Because Bhavaprakashanighantu is a technical glossary that describes various properties of different substances, the authors develop an elaborate ontology to capture the semantics of the entity and relationship types present in the text. They also design 31 query templates that cover common question patterns. For both manual annotation and querying, they customize the Sangrahaka framework, which they developed previously. The entire system, including the dataset, is available at https://sanskrit.iitk.ac.in/ayurveda/.
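
A query template can be thought of as a fixed question pattern with a slot that the user fills in. The following Python sketch illustrates that idea over a toy list of triples. It is an assumption-laden illustration, not one of the paper's 31 templates and not Sangrahaka's query mechanism. The triples, the template names, and the `run_template` helper are all hypothetical.

```python
# Toy triples (subject, relation, object). All names are hypothetical
# placeholders and are not taken from the paper's dataset.
TRIPLES = [
    ("grain-A", "HAS_PROPERTY", "light"),
    ("grain-A", "HAS_PROPERTY", "sweet"),
    ("grain-B", "HAS_PROPERTY", "heavy"),
    ("grain-B", "HAS_PROPERTY", "sweet"),
]

# Each template pairs a natural-language question with a triple pattern.
# "{X}" is the slot filled by the user; "?" marks the position to return.
TEMPLATES = {
    "properties_of": (
        "Which properties does {X} have?",
        ("{X}", "HAS_PROPERTY", "?"),
    ),
    "substances_with": (
        "Which substances have the property {X}?",
        ("?", "HAS_PROPERTY", "{X}"),
    ),
}


def run_template(name, value, triples=TRIPLES):
    """Fill the template's slot with `value` and return (question, answers)."""
    question, pattern = TEMPLATES[name]
    # Substitute the user-supplied value into the pattern.
    bound = tuple(value if part == "{X}" else part for part in pattern)
    answers = []
    for triple in triples:
        # A triple matches when every non-"?" position is equal.
        if all(b == "?" or b == t for b, t in zip(bound, triple)):
            answers.extend(t for b, t in zip(bound, triple) if b == "?")
    return question.format(X=value), sorted(answers)
```

For example, `run_template("properties_of", "grain-A")` returns the filled-in question together with the sorted list of matching properties. Keeping the question text and the graph pattern side by side means a new question type needs only a new template entry, with no change to the matching logic.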

Research Goals

The authors' research explores how NLP techniques can be applied to IR tasks for Sanskrit texts, specifically those in the Ayurveda domain. They hope their work will inform the development of better NLP tools for analyzing such texts and, by providing access to a manually annotated knowledge graph built with their framework, support further study of these texts.

Paper Presentation

The paper was presented at the World Sanskrit Conference (WSC) 2022 by Hrishikesh Terdalkar et al. It is 19 pages long, including the appendix.

Conclusion

In conclusion, the paper shows how semantic annotation can be used to create a knowledge graph from semi-structured Ayurvedic text. The graph can support natural language processing tasks, and the curated dataset, with its annotated entities and the relationships between them, makes further study of such texts easier.

Created on 26 Sep. 2023
