A Systematic Review of Retrieval-Augmented Generation for Enhancing Domain-Specific Knowledge in Large Language Models

Authors

  • Murtiyoso Universitas Amikom Purwokerto, Indonesia
  • Imam Tahyudin Universitas Amikom Purwokerto, Indonesia
  • Berlilana Universitas Amikom Purwokerto, Indonesia

DOI:

10.33395/sinkron.v9i2.14824

Keywords:

information retrieval, large language model, retrieval augmented generation

Abstract

This literature review examines the use of Retrieval-Augmented Generation (RAG) in enhancing Large Language Models (LLM) for domain-specific knowledge. RAG integrates retrieval techniques with generative models to access external knowledge sources, addressing the limitations of LLMs in handling specialized information. By leveraging external data, RAG improves the accuracy and relevance of generated content, making it particularly useful in fields that require detailed and up-to-date knowledge. This review highlights the effectiveness of RAG in overcoming challenges such as data sparsity and the dynamic nature of specialized knowledge. Furthermore, it discusses the potential of RAG to enhance LLM performance, scalability, and the ability to generate contextually accurate responses in knowledge-intensive applications. Key challenges and future research directions in the implementation of RAG for domain-specific knowledge are also identified.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Abdulnazar, A., Roller, R., Schulz, S., & Kreuzthaler, M. (2024). Large Language Models for Clinical Text Cleansing Enhance Medical Concept Normalization. IEEE Access, 12(September), 147981–147990. https://doi.org/10.1109/ACCESS.2024.3472500

Alkhalaf, M., Yu, P., Yin, M., & Deng, C. (2024). Applying generative AI with retrieval augmented generation to summarize and extract key clinical information from electronic health records. Journal of Biomedical Informatics, 156(May), 104662. https://doi.org/10.1016/j.jbi.2024.104662

Arslan, M., Mahdjoubi, L., & Munawar, S. (2024). Driving sustainable energy transitions with a multi-source RAG-LLM system. Energy and Buildings, 324(July), 114827. https://doi.org/10.1016/j.enbuild.2024.114827

Chen, G., Alsharef, A., Ovid, A., Albert, A., & Jaselskis, E. (2025). Meet2Mitigate: An LLM-powered framework for real-time issue identification and mitigation from construction meeting discourse. Advanced Engineering Informatics, 64(December 2024), 103068. https://doi.org/10.1016/j.aei.2024.103068

Chen, L. C., Pardeshi, M. S., Liao, Y. X., & Pai, K. C. (2025). Application of retrieval-augmented generation for interactive industrial knowledge management via a large language model. Computer Standards and Interfaces, 94(December 2024), 103995. https://doi.org/10.1016/j.csi.2025.103995

Cho, S., Park, J., & Um, J. (2024). Development of Fine-Tuned Retrieval Augmented Language Model specialized to manual books on machine tools. IFAC-PapersOnLine, 58(19), 187–192. https://doi.org/10.1016/j.ifacol.2024.09.157

Fateen, M., Wang, B., & Mine, T. (2024). Beyond Scores: A Modular RAG-Based System for Automatic Short Answer Scoring with Feedback. IEEE Access, 12(November), 185371–185385. https://doi.org/10.1109/ACCESS.2024.3508747

Heredia Álvaro, J. A., & Barreda, J. G. (2025). An advanced retrieval-augmented generation system for manufacturing quality control. Advanced Engineering Informatics, 64(May 2024). https://doi.org/10.1016/j.aei.2024.103007

Hernandez-Salinas, B., Terven, J., ChaveZ-Urbiola, E. A., Cordova-Esparza, D. M., Romero-Gonzalez, J. A., Arguelles, A., & Cervantes, I. (2024). IDAS: Intelligent Driving Assistance System using RAG. IEEE Open Journal of Vehicular Technology, 5(August), 1139–1165. https://doi.org/10.1109/OJVT.2024.3447449

Jiang, J., Yan, L., Liu, H., Xia, Z., Wang, H., Yang, Y., & Guan, Y. (2025). Knowledge assimilation: Implementing knowledge-guided agricultural large language model. Knowledge-Based Systems, 314(January). https://doi.org/10.1016/j.knosys.2025.113197

Krishnamurthy, V., & Balaji, V. (2024). Yours Truly: A Credibility Framework for Effortless LLM-powered Fact Checking. IEEE Access, 12(November). https://doi.org/10.1109/ACCESS.2024.3520187

Lee, J., Ahn, S., Kim, D., & Kim, D. (2024). Performance comparison of retrieval-augmented generation and fine-tuned large language models for construction safety management knowledge retrieval. Automation in Construction, 168(PB), 105846. https://doi.org/10.1016/j.autcon.2024.105846

Liu, R., Zou, Z., Chen, S., Liu, Y., & Wan, J. (2025). Harnessing AI for understanding scientific literature: Innovations and applications of chat-agent system in battery recycling research. Materials Today Energy, 49(January), 101818. https://doi.org/10.1016/j.mtener.2025.101818

Liu, X., Erkoyuncu, J. A., Fuh, J. Y. H., Lu, W. F., & Li, B. (2025). Knowledge extraction for additive manufacturing process via named entity recognition with LLMs. Robotics and Computer-Integrated Manufacturing, 93(November 2024), 102900. https://doi.org/10.1016/j.rcim.2024.102900

Mustapha, K. B. (2025). A survey of emerging applications of large language models for problems in mechanics, product design, and manufacturing. Advanced Engineering Informatics, 64(July 2024), 103066. https://doi.org/10.1016/j.aei.2024.103066

Nayinzira, J. P., & Adda, M. (2024). SentimentCareBot: Retrieval-Augmented Generation Chatbot for Mental Health Support with Sentiment Analysis. Procedia Computer Science, 251, 334–341. https://doi.org/10.1016/j.procs.2024.11.118

Paneru, B., Thapa, B., & Paneru, B. (2024). Leveraging AI in ayurvedic agriculture: A RAG chatbot for comprehensive medicinal plant insights using hybrid deep learning approaches. Telematics and Informatics Reports, 16(August), 100181. https://doi.org/10.1016/j.teler.2024.100181

Qin, L., Chen, Q., Zhou, Y., Chen, Z., Li, Y., Liao, L., Li, M., Che, W., & Yu, P. S. (2025). A survey of multilingual large language models. Patterns, 6(1), 101118. https://doi.org/10.1016/j.patter.2024.101118

Shao, J., Tong, J., Wu, Q., Guo, W., Li, Z., Lin, Z., & Zhang, J. (2024). WirelessLLM: Empowering Large Language Models Towards Wireless Intelligence. Journal of Communications and Information Networks, 9(2), 99–112. https://doi.org/10.23919/JCIN.2024.10582827

Uhm, M., Kim, J., Ahn, S., Jeong, H., & Kim, H. (2025). Effectiveness of retrieval augmented generation-based large language models for generating construction safety information. Automation in Construction, 170(November 2024), 105926. https://doi.org/10.1016/j.autcon.2024.105926

Downloads


Crossmark Updates

How to Cite

Murtiyoso, M., Tahyudin, I., & Berlilana, B. (2025). A Systematic Review of Retrieval-Augmented Generation for Enhancing Domain-Specific Knowledge in Large Language Models. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 9(2), 969-977. https://doi.org/10.33395/sinkron.v9i2.14824