SemiKong: Curating, Training, and Evaluating A Semiconductor Industry-Specific Large Language Model

AI-generated keywords: Large Language Models SemiKong Natural Language Generation (NLG) expert feedback semiconductor manufacturing

AI-generated Key Points

  • Large Language Models (LLMs) have shown promise in addressing challenges within the semiconductor industry
  • SemiKong is the first industry-specific LLM for semiconductors, providing a foundation for developing tailored proprietary models
  • Focus of SemiKong 1.0 is on understanding etching problems at an expert level
  • Human evaluation of Natural Language Generation (NLG) algorithms is crucial but costly and lacks reproducibility
  • Automatic metrics like BLEU and ROUGE are not always reliable, leading to the introduction of LLMs as evaluators
  • Framework leveraging expert feedback proposed to enhance assessment reliability in complex domains like semiconductors
  • Collaboration with semiconductor experts to develop an ontology for structuring semiconductor manufacturing processes systematically
  • Contributions include creating a comprehensive semiconductor-related text corpus (SemiKong-Corpus), developing industry-specific LLM (SemiKong), advancing evaluation approaches through expert feedback integration, and highlighting significance of industry-specific LLMs in improving AI-driven solutions for semiconductor manufacturing tasks
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Christopher Nguyen, William Nguyen, Atsushi Suzuki, Daisuke Oku, Hong An Phan, Sang Dinh, Zooey Nguyen, Anh Ha, Shruti Raghavan, Huy Vo, Thang Nguyen, Lan Nguyen, Yoshikuni Hirayama

On-going work
License: CC BY 4.0

Abstract: Large Language Models (LLMs) have demonstrated the potential to address some issues within the semiconductor industry. However, they are often general-purpose models that lack the specialized knowledge needed to tackle the unique challenges of this sector, such as the intricate physics and chemistry of semiconductor devices and processes. SemiKong, the first industry-specific LLM for the semiconductor domain, provides a foundation that can be used to develop tailored proprietary models. With SemiKong 1.0, we aim to develop a foundational model capable of understanding etching problems at an expert level. Our key contributions include (a) curating a comprehensive corpus of semiconductor-related texts, (b) creating a foundational model with in-depth semiconductor knowledge, and (c) introducing a framework for integrating expert knowledge, thereby advancing the evaluation process of domain-specific AI models. Through fine-tuning a pre-trained LLM using our curated dataset, we have shown that SemiKong outperforms larger, general-purpose LLMs in various semiconductor manufacturing and design tasks. Our extensive experiments underscore the importance of developing domain-specific LLMs as a foundation for company- or tool-specific proprietary models, paving the way for further research and applications in the semiconductor domain. Code and dataset will be available at https://github.com/aitomatic/semikong

Submitted to arXiv on 21 Nov. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2411.13802v1

Large Language Models (LLMs) have shown promise in addressing challenges within the semiconductor industry. However, their general-purpose nature often lacks the specialized knowledge required for this sector. To fill this gap, SemiKong - the first industry-specific LLM for semiconductors - has been developed to provide a foundation for developing tailored proprietary models. With SemiKong 1.0, the focus is on understanding etching problems at an expert level. In evaluating Natural Language Generation (NLG) algorithms, human evaluation is crucial but can be costly and lack reproducibility. Automatic metrics like BLEU and ROUGE are not always reliable, leading to the introduction of LLMs as evaluators. However, these methods assume that LLMs can inherently understand and evaluate knowledge, which may not always be the case in complex domains like semiconductors. To address this limitation, a framework leveraging expert feedback is proposed to enhance assessment reliability and create a high-quality benchmark for the semiconductor domain. Semiconductor manufacturing involves intricate processes that require specialized knowledge for effective execution. Collaborating with semiconductor experts, an ontology has been developed to systematically structure semiconductor manufacturing processes. This collaboration aims to bridge the gap between AI researchers' expertise in AI and their lack of domain-specific knowledge in semiconductor manufacturing. The scope of this work includes curating a large-scale semiconductor-specific text corpus (SemiKong-Corpus), developing SemiKong as a foundation model focusing on etching problems in the semiconductor industry, fine-tuning SemiKong on industry-relevant data for process optimization and control tasks, introducing a framework to leverage expert feedback for evaluating domain-specific AI models, comparing SemiKong's performance with general-purpose LLMs, and discussing potential applications of industry-specific LLMs in semiconductor manufacturing. Overall contributions include creating a comprehensive semiconductor-related text corpus (SemiKong-Corpus), developing an industry-specific LLM (SemiKong) tailored to address specific challenges in semiconductors, advancing evaluation approaches through expert feedback integration, and highlighting the significance of industry-specific LLMs in improving AI-driven solutions for semiconductor manufacturing tasks.
Created on 16 Apr. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.