MultiRAG: A Knowledge-guided Framework for Mitigating Hallucination in Multi-source Retrieval Augmented Generation

Wu, Wenlong; Wang, Haofen; Li, Bohan; Huang, Peixuan; Zhao, Xinzhe; Liang, Lei

计算机科学 > 信息检索

arXiv:2508.03553 (cs)

[提交于 2025年8月5日 ]

标题：多源检索增强生成中的幻觉缓解知识引导框架MultiRAG

标题： MultiRAG: A Knowledge-guided Framework for Mitigating Hallucination in Multi-source Retrieval Augmented Generation

Authors:Wenlong Wu, Haofen Wang, Bohan Li, Peixuan Huang, Xinzhe Zhao, Lei Liang

摘要：检索增强生成（RAG）已成为解决大型语言模型（LLMs）中幻觉问题的有前景的方案。然而，整合多个检索源虽然可能更具有信息量，但会引入新的挑战，这些挑战可能会悖论性地加剧幻觉问题。这些挑战主要体现在两个方面：多源数据的稀疏分布阻碍了逻辑关系的捕捉，以及不同源之间的固有不一致性导致信息冲突。为了解决这些问题，我们提出了MultiRAG，这是一种新颖的框架，旨在通过知识引导的方法减轻多源检索增强生成中的幻觉问题。我们的框架引入了两项关键创新：（1）一个知识构建模块，采用多源线图来高效聚合不同知识源之间的逻辑关系，有效解决数据分布稀疏的问题；（2）一个复杂的检索模块，实现了多级置信度计算机制，在图级别和节点级别进行评估，以识别并消除不可靠的信息节点，从而减少由源间不一致引起的幻觉。在四个多领域查询数据集和两个多跳问答数据集上的大量实验表明，MultiRAG显著提高了复杂多源场景下知识检索的可靠性和效率。 \textcolor{blue}\{我们的代码可在 https://github.com/wuwenlong123/MultiRAG 获取。

摘要： Retrieval Augmented Generation (RAG) has emerged as a promising solution to address hallucination issues in Large Language Models (LLMs). However, the integration of multiple retrieval sources, while potentially more informative, introduces new challenges that can paradoxically exacerbate hallucination problems. These challenges manifest primarily in two aspects: the sparse distribution of multi-source data that hinders the capture of logical relationships and the inherent inconsistencies among different sources that lead to information conflicts. To address these challenges, we propose MultiRAG, a novel framework designed to mitigate hallucination in multi-source retrieval-augmented generation through knowledge-guided approaches. Our framework introduces two key innovations: (1) a knowledge construction module that employs multi-source line graphs to efficiently aggregate logical relationships across different knowledge sources, effectively addressing the sparse data distribution issue; and (2) a sophisticated retrieval module that implements a multi-level confidence calculation mechanism, performing both graph-level and node-level assessments to identify and eliminate unreliable information nodes, thereby reducing hallucinations caused by inter-source inconsistencies. Extensive experiments on four multi-domain query datasets and two multi-hop QA datasets demonstrate that MultiRAG significantly enhances the reliability and efficiency of knowledge retrieval in complex multi-source scenarios. \textcolor{blue}{Our code is available in https://github.com/wuwenlong123/MultiRAG.

评论：	被ICDE 2025收录的研究论文
主题：	信息检索 (cs.IR) ; 计算与语言 (cs.CL)
引用方式：	arXiv:2508.03553 [cs.IR]
	(或者 arXiv:2508.03553v1 [cs.IR] 对于此版本)
	https://doi.org/10.48550/arXiv.2508.03553

提交历史

来自： Wu Wenlong [查看电子邮件]
[v1] 星期二， 2025 年 8 月 5 日 15:20:52 UTC (2,314 KB)

计算机科学 > 信息检索

标题：多源检索增强生成中的幻觉缓解知识引导框架MultiRAG

标题： MultiRAG: A Knowledge-guided Framework for Mitigating Hallucination in Multi-source Retrieval Augmented Generation

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 信息检索

标题： 多源检索增强生成中的幻觉缓解知识引导框架MultiRAG 显示英文标题

标题： MultiRAG: A Knowledge-guided Framework for Mitigating Hallucination in Multi-source Retrieval Augmented Generation

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：多源检索增强生成中的幻觉缓解知识引导框架MultiRAG