Graphs Don't Stay Secret: Practical Subgraph Reconstruction Attacks on Defended Graph RAG
图并非保密:对防御图RAG的实用子图重构攻击
Minkyoo Song, Jaehan Kim, Myungchul Kang, Hanna Kim, Seungwon Shin, Sooel Son
专题命中 知识库问答 :图RAG子图重构攻击
AI总结 提出GRASP攻击,通过多轮查询从防御的图RAG系统中重构子图,达到82.9 F1,并评估防御措施。
详情
基于图的检索增强生成(Graph RAG)越来越多地用于支持LLM应用,通过从知识图谱中检索的结构化知识增强用户查询。虽然Graph RAG改善了关系推理,但它引入了一个研究不足的威胁:攻击者可以从目标RAG系统的知识图谱中重构子图,从而推断隐私并复制精心策划的知识资产。我们表明,即使有简单的基于提示的防护,现有攻击对Graph RAG也基本无效,因为这些攻击暴露了明确的窃取意图,因此容易被轻量级的安全提示抑制。我们识别了在现实防护下进行实用Graph RAG提取的三个技术挑战,并引入了GRASP,一种黑盒、多轮子图重构攻击。GRASP (i) 将提取重新定义为上下文处理任务,(ii) 通过每条记录的标识符强制执行格式合规、基于实例的输出,以减少幻觉并保留关系细节,以及(iii) 使用发现感知调度器多样化目标驱动的攻击查询,以在严格的查询预算内操作。在两个真实知识图谱、四个安全对齐的LLM和多个Graph RAG框架上,GRASP在先前方法失败的情况下实现了最强的类型忠实重构,达到82.9 F1。我们进一步评估了防御措施,并提出了两种缓解方法,可在不损失效用的情况下有效降低重构保真度。
Graph-based retrieval-augmented generation (Graph RAG) is increasingly deployed to support LLM applications by augmenting user queries with structured knowledge retrieved from a knowledge graph. While Graph RAG improves relational reasoning, it introduces a largely understudied threat: adversaries can reconstruct subgraphs from a target RAG system's knowledge graph, enabling privacy inference and replication of curated knowledge assets. We show that existing attacks are largely ineffective against Graph RAG even with simple prompt-based safeguards, because these attacks expose explicit exfiltration intent and are therefore easily suppressed by lightweight safe prompts. We identify three technical challenges for practical Graph RAG extraction under realistic safeguards and introduce GRASP, a closed-box, multi-turn subgraph reconstruction attack. GRASP (i) reframes extraction as a context-processing task, (ii) enforces format-compliant, instance-grounded outputs via per-record identifiers to reduce hallucinations and preserve relational details, and (iii) diversifies goal-driven attack queries using a discovery-aware scheduler to operate within strict query budgets. Across two real-world knowledge graphs, four safety-aligned LLMs, and multiple Graph RAG frameworks, GRASP attains the strongest type-faithful reconstruction where prior methods fail, reaching up to 82.9 F1. We further evaluate defenses and propose two mitigations that effectively reduce reconstruction fidelity without utility loss.