GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding

AI-generated keywords: Large Language Models

AI-generated Key Points

  • Integrating large language models (LLMs) with knowledge graphs enhances reasoning capabilities
  • Multi-step inferences over real-world knowledge graphs while minimizing hallucination are crucial for evolving models
  • Existing LLMs have limited capacity for reasoning over domain-specialized interconnected entities
  • Introduction of a fine-tuning framework for developing Graph-aligned Language Models (GLaM) addresses the technical gap
  • GLaM transforms knowledge graphs into text representation with labeled question-answer pairs, expanding reasoning capacity
  • High-value applications in science, security, and e-commerce can benefit from enhanced reasoning capabilities of GLaM
  • Training setup involved splitting natural language questions and answers into training and test sets using datasets from DBLP, ACM, MAG, etc.
  • Microsoft Deepspeed framework was utilized for supervised prompt and response fine-tuning using base models like Llama-7b-chat-hf
  • Evaluation tasks included fact recall testing and inverse fact recall testing across UMLS and DBLP datasets
  • Results showed promising performance of GLaM in both fact recall tasks
  • Development of Graph-aligned Language Models bridges the gap between text-based models and structured data representations to enable effective reasoning over domain-specific knowledge graphs
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Stefan Dernbach, Khushbu Agarwal, Alejandro Zuniga, Michael Henry, Sutanay Choudhury

Published in AAAI Spring Symposium: AAAI-MAKE 2024
License: CC BY 4.0

Abstract: Integrating large language models (LLMs) with knowledge graphs derived from domain-specific data represents an important advancement towards more powerful and factual reasoning. As these models grow more capable, it is crucial to enable them to perform multi-step inferences over real-world knowledge graphs while minimizing hallucination. While large language models excel at conversation and text generation, their ability to reason over domain-specialized graphs of interconnected entities remains limited. For example, can we query a LLM to identify the optimal contact in a professional network for a specific goal, based on relationships and attributes in a private database? The answer is no--such capabilities lie beyond current methods. However, this question underscores a critical technical gap that must be addressed. Many high-value applications in areas such as science, security, and e-commerce rely on proprietary knowledge graphs encoding unique structures, relationships, and logical constraints. We introduce a fine-tuning framework for developing Graph-aligned LAnguage Models (GLaM) that transforms a knowledge graph into an alternate text representation with labeled question-answer pairs. We demonstrate that grounding the models in specific graph-based knowledge expands the models' capacity for structure-based reasoning. Our methodology leverages the large-language model's generative capabilities to create the dataset and proposes an efficient alternate to retrieval-augmented generation styled methods.

Submitted to arXiv on 09 Feb. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.06764v3

, , , , Integrating large language models (LLMs) with knowledge graphs derived from domain-specific data is a significant advancement in enhancing reasoning capabilities. The ability to perform multi-step inferences over real-world knowledge graphs while minimizing hallucination is crucial as these models continue to evolve. While LLMs excel at conversation and text generation, their capacity for reasoning over domain-specialized interconnected entities is limited. For instance, querying a LLM to identify the optimal contact in a professional network based on relationships and attributes in a private database is currently beyond existing methods. To address this technical gap, a fine-tuning framework for developing Graph-aligned Language Models (GLaM) has been introduced. This framework transforms knowledge graphs into an alternate text representation with labeled question-answer pairs, expanding the models' capacity for structure-based reasoning. By grounding the models in specific graph-based knowledge, high-value applications in areas such as science, security, and e-commerce can benefit from enhanced reasoning capabilities. For training the GLaM, focus was placed on a dataset extracted from DBLP, ACM, MAG, and other sources containing paper citations, abstracts, authors, publication years, venues, and titles. The training setup involved splitting natural language questions and answers into training and test sets for both UMLS and DBLP datasets. Microsoft Deepspeed framework was utilized for supervised prompt and response fine-tuning using base models like Llama-7b-chat-hf. Evaluation tasks included fact recall testing the model's ability to remember domain-level facts seen during training and inverse fact recall testing its capability to infer reverse relationships from trained facts. The results demonstrated that GLaM showed promising performance in both fact recall tasks across UMLS and DBLP datasets. In conclusion, the development of Graph-aligned Language Models represents a significant step towards enabling large language models to reason effectively over domain-specific knowledge graphs. By bridging the gap between text-based models and structured data representations like knowledge graphs, GLaM holds great potential for advancing various applications requiring complex reasoning abilities.
Created on 20 Jul. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.