PV-RAG:基于LLM的药物警戒检索增强生成模型
PV-RAG: Retrieval-Augmented Generation Model for Pharmacovigilance Based on Large Language Models
DOI: 10.12677/sea.2025.144068, PDF,    国家社会科学基金支持
作者: 陈文浩, 张鸿红, 薛 景, 刘帅博, 查星宇:南京邮电大学计算机学院,江苏 南京;朱云霞*, 翁艺航:南京邮电大学物联网学院,江苏 南京;魏建香:南京邮电大学图书馆,江苏 南京
关键词: 大语言模型检索增强生成药物警戒智能问答回答幻觉LLM RAG Pharmacovigilance Intelligent Q&A Answer Hallucination
摘要: 目的/意义:随着大语言模型(LLM)在自然语言处理任务中的广泛应用,其在处理专业领域问答任务中表现出局限性。药品因其“治病且可能致病”的双重特征,须在安全警戒范围内进行使用,而传统大模型受训练数据影响难以有效表达用药安全复杂且动态的信息,易导致数据缺失引发的回答幻觉问题。方法/过程:本文提出了一种面向药物警戒的检索增强生成模型(PV-RAG),其综合应用大语言模型LangChain框架、稀疏与稠密检索混合策略以及改进的查询改写机制,显著提升检索生成结果的准确性。结果/结论:对比实验结果表明,PV-RAG模型能够精准提取药品说明书中的关键信息,在药物警戒问答任务中表现出显著优势,为药物警戒智能化问答提供了新的技术支持和应用前景。
Abstract: Purpose/Significance: With the widespread application of large language models (LLMs) in natural language processing tasks, their limitations in handling domain-specific question-answering tasks have become evident. Pharmaceuticals, characterized by their dual nature of “treating diseases while potentially causing harm”, must be used within strict safety monitoring frameworks. Traditional large models, constrained by training data, struggle to accurately represent the complex and dynamic nature of drug safety information, often leading to hallucinated responses due to data gaps. Methods/Process: This study proposes a Pharmacovigilance-oriented Retrieval-Augmented Generation model (PV-RAG), which integrates the LangChain framework for LLMs, a hybrid sparse-dense retrieval strategy, and an improved query rewriting mechanism to significantly enhance the accuracy of retrieval-generated responses. Results/Conclusions: Comparative experiments demonstrate that the PV-RAG model can precisely extract critical information from drug manuals and exhibits superior performance in pharmacovigilance question-answering tasks. This approach provides new technical support and application prospects for intelligent pharmacovigilance systems.
文章引用:陈文浩, 朱云霞, 张鸿红, 薛景, 刘帅博, 查星宇, 翁艺航, 魏建香. PV-RAG:基于LLM的药物警戒检索增强生成模型[J]. 软件工程与应用, 2025, 14(4): 772-783. https://doi.org/10.12677/sea.2025.144068

参考文献

[1] Patil, R. and Gudivada, V. (2024) A Review of Current Trends, Techniques, and Challenges in Large Language Models (LLMs). Applied Sciences, 14, Article No. 2074.
[2] Hadi, M.U., Qureshi, R., Shah, A., et al. (2023) Large Language Models: A Comprehensive Survey of Its Applications, Challenges, Limitations, and Future Prospects.
[3] 国家药品不良反应监测中心. 国家药品不良反应监测年度报告(2023年) [EB/OL]. 国家药品监督管理局药品评价中心网站, 2024.
https://www.cdr-adr.org.cn/drug_1/aqjs_1/drug_aqjs_sjbg/202403/t20240326_50614.html, 2025-04-02.
[4] 刘泽垣, 王鹏江, 宋晓斌, 等. 大语言模型的幻觉问题研究综述[J]. 软件学报, 2025(3): 1152-1185.
[5] 赵静, 汤文玉, 霍钰, 等. 大模型检索增强生成(RAG)技术浅析[J]. 中国信息化, 2024(10): 71-72+70.
[6] Gao, Y., Xiong, Y., Gao, X., et al. (2023) Retrieval-Augmented Generation for Large Language Models: A Survey.
[7] 祝晓雨, 张伟光, 孙树森, 等. 中国药物警戒的发展及文献计量分析[J]. 医药导报, 2019, 38(6): 820-825.
[8] 王青宇, 陈壮. 四大国际药物警戒数据库概述及其应用[J]. 药物不良反应杂志, 2023, 25(8): 497-503.
[9] Miao, J., Thongprayoon, C., Suppadungsuk, S., Garcia Valencia, O.A. and Cheungpasitporn, W. (2024) Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications. Medicina, 60, Article No. 445. [Google Scholar] [CrossRef] [PubMed]
[10] Singhal, K., Azizi, S., Tu, T., Mahdavi, S.S., Wei, J., Chung, H.W., et al. (2023) Large Language Models Encode Clinical Knowledge. Nature, 620, 172-180. [Google Scholar] [CrossRef] [PubMed]
[11] Wei, J., Yang, C., Song, X., et al. (2024) Long-Form Factuality in Large Language Models.
[12] Yang, H., Li, S. and Gonçalves, T. (2024) Enhancing Biomedical Question Answering with Large Language Models. Information, 15, Article No. 494. [Google Scholar] [CrossRef
[13] Yagnik, N., Jhaveri, J., Sharma, V., et al. (2024) Medlm: Exploring Language Models for Medical Question Answering Systems.
[14] Karpukhin, V., Oguz, B., Min, S., Lewis, P., Wu, L., Edunov, S., et al. (2020) Dense Passage Retrieval for Open-Domain Question Answering. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), November 2020, 6769-6781. [Google Scholar] [CrossRef
[15] 张康, 陈明, 顾凡. 自编码器预训练和多表征交互的段落重排序模型[J]. 计算机应用研究, 2023, 40(12): 3643-3650.
[16] 李洁, 李正芳, 邹垚, 等. 大语言模型多语言词对齐能力评测方法[J]. 西南民族大学学报(自然科学版), 2024, 50(6): 681-688.
[17] Peng, W., Li, G., Jiang, Y., Wang, Z., Ou, D., Zeng, X., et al. (2024). Large Language Model Based Long-Tail Query Rewriting in Taobao Search. Companion Proceedings of the ACM Web Conference 2024, Singapore, 13-17 May 2024, 20-28.[CrossRef