In this paper, we present a comprehensive framework for technology intelligence and planning that uses knowledge graphs to provide structured, actionable information about cyber-physical systems. The methodology is a text mining process comprising information retrieval, keyphrase extraction, semantic network creation, and topic map visualization. Relevant documents are retrieved by querying a technology intelligence database of over 176 million news articles, patents, and research documents. Keyphrases extracted from them form a semantic network, which is visualized as a topic map for interactive exploration of the domain. The highest-scoring documents serve as input for the knowledge graph construction (KGC) pipeline, which converts articles into .txt files and transforms them into an OWL Ontology Graph (.owl file) using tools like Owlready2 and transformer models such as REBEL and ChatGPT. The resulting knowledge graph is subjected to reasoning procedures based on the GENIAL! Basic Ontology schema to enhance consistency and structure, which is intended to improve results in downstream machine learning applications. The Innovation Graph database contains 50 million research publications, 81 million internet news articles, and 45 million patent filings from the past decade; these document genres enrich the knowledge base of innovation and technology. Resources provided include the generated graph of research articles on GitHub, along with extracted keyphrases, semantic networks, and knowledge graphs in OWL and HTML formats created with ChatGPT. An ontology for the Innovation Knowledge Graph in OWL and HTML formats is also available, along with Wikidata queries related to the electronic domain.
Overall, this framework offers a scalable approach for technology intelligence and planning in cyber-physical systems by harnessing knowledge graphs supported by advanced language models and reasoning procedures based on established ontologies. The methodology showcases significant improvements in class recognition, relationship construction, and categorization compared to existing approaches like GraphGPT or bi-LSTM models. Furthermore, it demonstrates adaptability across domains while emphasizing human oversight for content quality assurance at critical stages of development.
- Comprehensive framework for technology intelligence and planning using knowledge graphs in cyber-physical systems
- Methodology involves text mining process including information retrieval, keyphrase extraction, semantic network creation, and topic map visualization
- Querying a technology intelligence database with over 176 million news articles, patents, and research documents for relevant information extraction
- Knowledge graph construction pipeline utilizing tools like Owlready2, REBEL, and ChatGPT to create OWL Ontology Graph
- Reasoning procedures based on GENIAL! Basic Ontology schema to enhance consistency and structure of the resulting knowledge graph
- Innovation Graph database containing 50 million research publications, 81 million internet news articles, and 45 million patent filings from the past decade
- Resources provided include the research articles graph on GitHub, extracted keyphrases, semantic networks, and knowledge graphs in OWL and HTML formats created with ChatGPT
- Scalable approach for technology intelligence and planning in cyber-physical systems, with reported improvements in class recognition, relationship construction, and categorization compared to existing approaches
Summary
1. A plan using special technology to understand and plan for smart machines.
2. Steps include reading texts, finding important words, making connections between ideas, and showing topics visually.
3. Searching a big database for useful information from news, patents, and research papers.
4. Building a map of knowledge with tools like Owlready2, REBEL, and ChatGPT.
5. Using a special system to make the knowledge map better and more organized.
Definitions
- Comprehensive framework: A detailed plan that covers everything needed
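- Technology intelligence: Gathering and analyzing information about new technologies to support decisions
- Cyber-physical systems: Systems in which computers and software monitor and control physical machines or processes, often over a network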
- Knowledge graphs: Maps that show how different pieces of information are related
- Querying: Asking questions or searching for specific information
- Ontology Graph: A formal, machine-readable description of the concepts in a specific subject area and the relationships between them
- Reasoning procedures: Logical steps taken to reach conclusions based on available information
- GENIAL! Basic Ontology schema: A structured way of organizing information for better understanding
- Innovation Graph database: A collection of data related to new ideas or inventions
- Scalable approach: A method that can grow or adapt easily
Introduction
Technology intelligence and planning are crucial aspects of any organization's success, especially in the fast-paced world of cyber-physical systems. With the rapid advancement of technology, it has become increasingly challenging to keep track of new developments and make informed decisions. In this research paper, a comprehensive framework is presented that leverages knowledge graphs to provide structured information for actionable insights in the realm of cyber-physical systems.
Methodology
The methodology involves a text mining process that encompasses information retrieval, keyphrase extraction, semantic network creation, and topic map visualization. This approach utilizes a technology intelligence database with over 176 million news articles, patents, and research documents. By querying this vast database, relevant information is extracted to facilitate data exploration.
Keyphrases are then utilized to create a semantic network which is visualized as a topic map. This allows for interactive exploration of the domain and provides valuable insights into relationships between different concepts.
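The step from keyphrases to a semantic network can be illustrated with a minimal sketch. It assumes that two keyphrases are linked when they occur in the same document and that the edge weight is the number of shared documents; the summary does not state how the network is actually built, so this co-occurrence rule, the function names, and the sample keyphrases are all illustrative assumptions.

```python
from collections import Counter
from itertools import combinations


def build_semantic_network(doc_keyphrases, min_weight=1):
    """Return {(a, b): weight}, where weight counts documents containing both keyphrases.

    Assumption: co-occurrence within a document defines an edge. The source
    does not specify the linking rule.
    """
    edges = Counter()
    for phrases in doc_keyphrases:
        # Normalize and deduplicate so each pair is counted once per document.
        unique = sorted({p.lower().strip() for p in phrases})
        for a, b in combinations(unique, 2):
            edges[(a, b)] += 1
    return {pair: w for pair, w in edges.items() if w >= min_weight}


def neighbors(network, phrase):
    """List (neighbor, weight) pairs for one keyphrase, strongest links first."""
    linked = []
    for (a, b), weight in network.items():
        if phrase == a:
            linked.append((b, weight))
        elif phrase == b:
            linked.append((a, weight))
    return sorted(linked, key=lambda item: (-item[1], item[0]))


# Hypothetical keyphrases for three documents (not taken from the paper).
docs = [
    ["digital twin", "sensor fusion", "edge computing"],
    ["digital twin", "edge computing"],
    ["sensor fusion", "predictive maintenance"],
]
network = build_semantic_network(docs)
```

A topic map viewer could then lay out `network` as a weighted graph, with `neighbors` backing the interactive drill-down from one keyphrase to its related concepts.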
Knowledge Graph Construction (KGC) Pipeline
The highest scoring documents from the previous step serve as input for the knowledge graph construction pipeline. This involves converting articles into .txt files which are then transformed into an OWL Ontology Graph (.owl file) using tools like Owlready2 and transformer models such as REBEL and ChatGPT.
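As a rough illustration of the text-to-graph step, the sketch below parses relation triples from a linearized string of the form `<triplet> head <subj> tail <obj> relation`, which is the output layout REBEL is generally described as producing, and serializes them as Turtle text with OWL vocabulary. It deliberately uses only the standard library rather than Owlready2, and the function names, the `http://example.org/kg#` namespace, and the sample string are assumptions made for this example, not part of the paper's pipeline.

```python
import re


def parse_linearized_triples(text):
    """Parse '<triplet> head <subj> tail <obj> relation' sequences into (head, relation, tail)."""
    triples = []
    text = re.sub(r"</?s>|<pad>", " ", text)  # drop sequence and padding markers
    for chunk in text.split("<triplet>")[1:]:
        head, _, rest = chunk.partition("<subj>")
        head = " ".join(head.split())
        # One head may be followed by several '<subj> tail <obj> relation' groups.
        for part in rest.split("<subj>"):
            tail, sep, relation = part.partition("<obj>")
            tail, relation = " ".join(tail.split()), " ".join(relation.split())
            if sep and head and tail and relation:
                triples.append((head, relation, tail))
    return triples


def to_local_name(label):
    """Turn a free-text label into a safe local name for a prefixed IRI."""
    return re.sub(r"[^A-Za-z0-9]+", "_", label).strip("_")


def triples_to_turtle(triples, base="http://example.org/kg#"):
    """Serialize triples as Turtle. Simplification: every entity becomes an
    individual and every relation an object property."""
    entities = sorted({t[0] for t in triples} | {t[2] for t in triples})
    relations = sorted({t[1] for t in triples})
    lines = [
        f"@prefix : <{base}> .",
        "@prefix owl: <http://www.w3.org/2002/07/owl#> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        "",
    ]
    for rel in relations:
        lines.append(f":{to_local_name(rel)} a owl:ObjectProperty .")
    for ent in entities:
        label = ent.replace("\\", "\\\\").replace('"', '\\"')
        lines.append(f':{to_local_name(ent)} a owl:NamedIndividual ; rdfs:label "{label}" .')
    for head, rel, tail in triples:
        lines.append(f":{to_local_name(head)} :{to_local_name(rel)} :{to_local_name(tail)} .")
    return "\n".join(lines)


# Hand-written sample in the assumed layout (not real model output).
sample = "<s><triplet> lidar unit <subj> sensor <obj> instance of <subj> autonomous vehicle <obj> part of</s>"
triples = parse_linearized_triples(sample)
turtle = triples_to_turtle(triples)
```

In the described pipeline the equivalent classes, properties, and individuals would be created through Owlready2 and saved as an .owl file; the Turtle text here only makes the intermediate structure visible.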
Reasoning procedures based on the GENIAL! Basic Ontology schema are then applied to enhance consistency and structure within the resulting knowledge graph. This ensures improved results in machine learning applications.
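To make the idea of a consistency-enhancing reasoning step concrete, here is a minimal sketch of one kind of check a schema-based reasoner performs: inferring superclasses transitively and flagging individuals that end up in two classes declared disjoint. The class names and the disjointness axiom below are hypothetical and are not taken from the GENIAL! Basic Ontology; an actual OWL reasoner covers far more than this.

```python
def superclasses(cls, subclass_of):
    """Return all ancestors of cls under a {child: {parents}} mapping."""
    seen, stack = set(), [cls]
    while stack:
        for parent in subclass_of.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def find_inconsistencies(instance_types, subclass_of, disjoint_pairs):
    """Flag individuals whose asserted plus inferred classes violate a disjointness axiom."""
    problems = []
    for individual, types in sorted(instance_types.items()):
        inferred = set(types)
        for t in types:
            inferred |= superclasses(t, subclass_of)
        for a, b in disjoint_pairs:
            if a in inferred and b in inferred:
                problems.append((individual, a, b))
    return problems


# Hypothetical schema fragment (illustrative names only).
subclass_of = {
    "Sensor": {"HardwareComponent"},
    "HardwareComponent": {"PhysicalEntity"},
    "Algorithm": {"InformationEntity"},
}
disjoint_pairs = [("PhysicalEntity", "InformationEntity")]

# Extracted assertions; the second individual was typed inconsistently.
instance_types = {
    "lidar_unit": {"Sensor"},
    "kalman_filter": {"Algorithm", "Sensor"},
}
problems = find_inconsistencies(instance_types, subclass_of, disjoint_pairs)
```

Flagged individuals such as `kalman_filter` are natural candidates for the human review the framework calls for at critical stages.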
Innovation Graph Database
The Innovation Graph database draws on three document genres to enrich the knowledge base of innovation and technology: 50 million research publications, 81 million internet news articles, and 45 million patent filings from the past decade, about 176 million documents in total.
Resources Available
As part of this framework's implementation, several resources are made available to users. These include:
1) Generated graph of research articles on GitHub
2) Extracted keyphrases
3) Semantic networks
4) Knowledge graphs in OWL and HTML formats created with ChatGPT
5) Ontology for Innovation Knowledge Graph in OWL and HTML formats
6) Wikidata queries related to the electronic domain.
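The Wikidata queries themselves are not reproduced in this summary. As a hedged illustration of what such a query might look like, the sketch below builds a SPARQL string that lists every item in the subclass hierarchy under a chosen root concept, using the standard Wikidata property `P279` (subclass of) and the label service. The function name is hypothetical, and the root item identifier must be looked up on Wikidata; none is assumed here.

```python
import re

# Public SPARQL endpoint of the Wikidata Query Service.
WIKIDATA_SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"


def build_subclass_query(root_qid, limit=50):
    """Build a SPARQL query listing items that are (transitive) subclasses of root_qid.

    Hypothetical helper: the paper's actual queries are not shown in the source.
    """
    if not re.fullmatch(r"Q[1-9]\d*", root_qid):
        raise ValueError(f"not a Wikidata item identifier: {root_qid!r}")
    return f"""
SELECT ?item ?itemLabel WHERE {{
  ?item wdt:P279* wd:{root_qid} .
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en". }}
}}
LIMIT {int(limit)}
""".strip()
```

The resulting string can be pasted into the Wikidata Query Service or sent to `WIKIDATA_SPARQL_ENDPOINT` with an HTTP client; the prefixes `wd:`, `wdt:`, `wikibase:`, and `bd:` are predefined by that service.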
Results
The methodology presented in this research paper showcases significant improvements in class recognition, relationship construction, and categorization compared to existing approaches like GraphGPT or bi-LSTM models. Furthermore, it demonstrates adaptability across domains while emphasizing human oversight for content quality assurance at critical stages of development.
Conclusion
In conclusion, this framework offers a scalable approach for technology intelligence and planning in cyber-physical systems by harnessing knowledge graphs supported by advanced language models and reasoning procedures based on established ontologies. It provides valuable insights into the ever-evolving world of technology and enables organizations to make informed decisions based on structured information. With the resources made available, this framework can be easily implemented and adapted across various domains.