Empowering LLMs by hybrid retrieval-augmented generation for domain-centric Q&A in smart manufacturing

Wan, Yuwei, Chen, Zheyuan, Liu, Ying

, Chen, Chong and Packianather, Michael

2025. Empowering LLMs by hybrid retrieval-augmented generation for domain-centric Q&A in smart manufacturing. Advanced Engineering Informatics 65 , 103212. 10.1016/j.aei.2025.103212

[thumbnail of 1-s2.0-S1474034625001053-main.pdf]

Preview

PDF - Published Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Download (6MB) | Preview

Official URL: http://dx.doi.org/10.1016/j.aei.2025.103212

Abstract

Large language models (LLMs) have shown remarkable performances in generic question-answering (QA) but often suffer from domain gaps and outdated knowledge in smart manufacturing (SM). Retrieval-augmented generation (RAG) based on LLMs has emerged as a potential approach by incorporating an external knowledge base. However, conventional vector-based RAG delivers rapid responses but often returns contextually vague results, while knowledge graph (KG)-based methods offer structured relational reasoning at the expense of scalability and efficiency. To address these challenges, a hybrid KG-Vector RAG framework that systematically integrates structured KG metadata with unstructured vector retrieval is proposed. Firstly, a metadata-enriched KG was constructed from domain corpora by systematically extracting and indexing structured information to capture essential domain-specific relationships. Secondly, semantic alignment was achieved by injecting domain-specific constraints to refine and enhance the contextual relevance of the knowledge representations. Lastly, a layered hybrid retrieval strategy was employed that combined the explicit reasoning capabilities of the KG with the efficient search power of vector-based similarity methods, and the resulting outputs were integrated via prompt engineering to generate comprehensive, context-aware responses. Evaluated on design for additive manufacturing (DfAM) tasks, the proposed approach achieved 77.8% exact match accuracy and 76.5% context precision. This study establishes a new paradigm for industrial LLM systems, which demonstrates that hybrid symbolic-neural architectures can overcome the precision-scalability trade-off in mission-critical manufacturing applications. Experimental results indicated that integrating structured KG information with vector-based retrieval and prompt engineering can enhance retrieval accuracy, contextual relevance, and efficiency in LLM-based Q&A systems for SM.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Schools > Engineering
Publisher:	Elsevier
ISSN:	1474-0346
Funders:	National Natural Science Foundation of China
Date of First Compliant Deposit:	3 March 2025
Date of Acceptance:	13 March 2025
Last Modified:	06 Mar 2025 10:15
URI:	https://orca.cardiff.ac.uk/id/eprint/176569

Actions (repository staff only)

Edit Item

Altmetric

Dimensions

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)