Customized Information and Domain-centric Knowledge Graph Construction with Large Language Models

AI-generated keywords: Technology Intelligence

AI-generated Key Points

  • Comprehensive framework for technology intelligence and planning using knowledge graphs in cyber-physical systems
  • Methodology involves text mining process including information retrieval, keyphrase extraction, semantic network creation, and topic map visualization
  • Querying a technology intelligence database with over 176 million news articles, patents, and research documents for relevant information extraction
  • Knowledge graph construction pipeline utilizing tools like Owlready2, REBEL, and ChatGPT to create OWL Ontology Graph
  • Reasoning procedures based on GENIAL! Basic Ontology schema to enhance consistency and structure of the resulting knowledge graph
  • Innovation Graph database containing 50 million research publications, 81 million internet news articles, and 45 million patent filings from the past decade
  • Resources provided include research articles graph on GitHub, keyphrases extraction, semantic networks, knowledge graphs in OWL and HTML formats created with ChatGPT
  • Scalable approach for technology intelligence and planning in cyber-physical systems with significant improvements in class recognition compared to existing approaches
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Frank Wawrzik, Matthias Plaue, Savan Vekariya, Christoph Grimm

Presented at CAIPI Workshop at AAAI 2024
License: CC BY 4.0

Abstract: In this paper we propose a novel approach based on knowledge graphs to provide timely access to structured information, to enable actionable technology intelligence, and improve cyber-physical systems planning. Our framework encompasses a text mining process, which includes information retrieval, keyphrase extraction, semantic network creation, and topic map visualization. Following this data exploration process, we employ a selective knowledge graph construction (KGC) approach supported by an electronics and innovation ontology-backed pipeline for multi-objective decision-making with a focus on cyber-physical systems. We apply our methodology to the domain of automotive electrical systems to demonstrate the approach, which is scalable. Our results demonstrate that our construction process outperforms GraphGPT as well as our bi-LSTM and transformer REBEL with a pre-defined dataset by several times in terms of class recognition, relationship construction and correct "sublass of" categorization. Additionally, we outline reasoning applications and provide a comparison with Wikidata to show the differences and advantages of the approach.

Submitted to arXiv on 30 Sep. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2409.20010v1

, , , , In this paper, we present a comprehensive framework for technology intelligence and planning that leverages knowledge graphs to provide structured information for actionable insights in the realm of cyber-physical systems. The methodology involves a text mining process that encompasses information retrieval, keyphrase extraction, semantic network creation, and topic map visualization. By querying a technology intelligence database with over 176 million news articles, patents, and research documents, relevant information is extracted to facilitate data exploration. Keyphrases are utilized to create a semantic network which is visualized as a topic map, enabling interactive exploration of the domain. The highest scoring documents serve as input for the knowledge graph construction (KGC) pipeline. The KGC pipeline involves converting articles into .txt files which are transformed into an OWL Ontology Graph (.owl file) using tools like Owlready2 and transformer models such as REBEL and ChatGPT. The resulting knowledge graph is subjected to reasoning procedures based on the GENIAL! Basic Ontology schema to enhance consistency and structure. This approach ensures improved results in machine learning applications. The Innovation Graph database contains 50 million research publications, 81 million internet news articles, and 45 million patent filings from the past decade. Document genres from these sources are utilized to enrich the knowledge base of innovation and technology. Resources provided include the generated graph of research articles on GitHub along with extracted keyphrases, semantic networks, knowledge graphs in OWL and HTML formats created with ChatGPT. Additionally, an ontology for Innovation Knowledge Graph in OWL and HTML formats is available along with Wikidata queries related to the electronic domain. Overall, this framework offers a scalable approach for technology intelligence and planning in cyber-physical systems by harnessing knowledge graphs supported by advanced language models and reasoning procedures based on established ontologies. The methodology showcases significant improvements in class recognition, relationship construction, and categorization compared to existing approaches like GraphGPT or bi-LSTM models. Furthermore, it demonstrates adaptability across domains while emphasizing human oversight for content quality assurance at critical stages of development.
Created on 04 Mar. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.