RAG / 检索增强生成 - arXivDaily 专题

2606.18497 2026-06-18 cs.CR 新提交 85%

Ghost Vectors: Soft-Deleted Embeddings Remain Reconstructible in HNSW Vector Databases

幽灵向量：HNSW向量数据库中软删除的嵌入仍然可重构

Chandranil Chakraborttii, Jackeline García Alvarado, Sitora Abdulofizova, Shivanshu Dwivedi

专题命中向量检索：HNSW向量数据库软删除的安全漏洞

AI总结研究揭示HNSW向量数据库的软删除机制存在安全漏洞，被标记删除的向量仍可通过存储层恢复，并提出基于加密密钥轮换的防护方案。

Comments 13 pages, 5 figures, 12 tables. Prepared for submission

详情

AI中文摘要

检索增强生成（RAG）使大型语言模型能够访问外部和私有语料库，以生成事实性、领域特定的响应。现代RAG流水线使用分层可导航小世界（HNSW）向量数据库进行高效的相似性搜索。当用户请求数据删除时，系统通常仅将记录标记为已删除，而嵌入在磁盘上物理保持不变。这种软删除操作在GDPR第17条和HIPAA等数据擦除和保留要求下引发了合规性问题。对三种HNSW实现的分析证实，通过访问存储层的原始索引文件（绕过API访问），已删除的向量在物理上仍然可恢复。使用无需领域特定微调的Vec2Text反演模型，我们在多个真实世界数据集和数据模态上展示了这一漏洞。在维基百科在世人物数据集（BLP）上，我们成功恢复了25.5%的精确人名和46.4%的地理位置（ROUGE-L 0.185）。在高度结构化的敏感数据（NIH Synthea数据集）上，患者年龄和性别标记的恢复率达到100%（ROUGE-L 0.290）。在软删除的图像嵌入上，我们在组织病理学切片上展示了100%的组织分类（p=1.02e-07），在人脸嵌入上top-1身份恢复率达到99%（p<0.01）。本工作引入了Epoch密钥轮换，即加密向量并在删除时丢弃密钥。Epoch密钥轮换将观察到的PII恢复降至0%，并在2.5毫秒内完成500个已删除向量的处理（约0.005毫秒/记录）。此外，它还生成ECDSA签名的加密证明，作为删除事件的可审计记录。

英文摘要

Retrieval-augmented generation (RAG) allows large language models to access external and private corpora for factual, domain-specific responses. Modern RAG pipelines use hierarchical navigable small world (HNSW) vector databases for efficient similarity search. When a user requests data deletion, the systems typically only mark the record as deleted, leaving the embedding on disk physically unchanged. This soft-delete operation raises compliance concerns under data-erasure and retention requirements such as GDPR Article 17 and HIPAA. Analysis on three HNSW implementations confirms that deleted vectors remain physically recoverable by accessing the raw index files at the storage layer, bypassing API access. Using the Vec2Text inversion model without domain-specific fine-tuning, we show this vulnerability on multiple real-world datasets and data modalities. On Wikipedia biographical living persons dataset (BLP), we successfully recover 25.5% of exact person names and 46.4% of geographic locations (ROUGE-L 0.185). Recovery reaches 100% for both patient age and gender markers (ROUGE-L 0.290) on highly structured, sensitive data (NIH Synthea dataset). On soft-deleted image embeddings, we show 100% tissue classification on histopathology patches (p=1.02e-07) and top-1 identity recovery reaches 99% on facial embeddings (p<0.01). This work introduces Epoch Key Rotation, which encrypts vectors and discards the key upon deletion. Epoch key rotation reduces observed PII recovery to 0% and completes in 2.5 ms for 500 deleted vectors (approximately 0.005 ms/record). Additionally, it generates an ECDSA-signed cryptographic proof as an auditable record of the deletion event.

URL PDF HTML ☆

赞 0 踩 0

2606.18897 2026-06-18 cs.IR cs.AI 新提交 70%

SAERec: Constructing Fine-grained Interpretable Intents Priors via Sparse Autoencoders for Recommendation

SAERec：通过稀疏自编码器为推荐构建细粒度可解释意图先验

Jiangnan Xia, Xuansheng Wu, Yu Yang, Xin Wang, Ninghao Liu

发表机构 * University of Georgia（佐治亚大学）； Shanghai AI Laboratory（上海人工智能实验室）； The Education University of Hong Kong（香港教育大学）； Jilin University（吉林大学）； The Hong Kong Polytechnic University（香港理工大学）

专题命中向量检索：利用稀疏自编码器构建意图先验用于推荐

AI总结提出SAERec模型，利用稀疏自编码器从大型语言模型文本嵌入中解耦出细粒度可解释意图，作为先验指导推荐，并通过多分支注意力机制融合个人与公共意图，提升推荐性能与可解释性。

详情

AI中文摘要

基于意图的推荐系统通过建模用户行为背后的动机来提高准确性和可解释性，已获得广泛关注。现有模型大多通过聚类或原型学习直接从用户序列中推导意图，但它们对序列质量敏感，需要预设意图数量，且缺乏明确的语义基础。这些问题导致意图集不完整且粗糙，限制了推荐效果。本文提出用于基于意图的推荐的稀疏自编码器（SAERec），一种新颖的推荐模型，它从文本语料库中自动构建细粒度且可解释的意图空间来指导推荐。SAERec不将文本视为辅助信号，而是将其作为高信息密度的意图构建证据。具体而言，我们首先利用稀疏自编码器（SAE）从大型语言模型（LLM）的潜在空间中提取一组全面的细粒度可解释意图，通过解耦和解释文本嵌入，将意图相关语义与文本噪声分离。然后，对于每个用户，我们从该集合中检索相关意图作为先验来指导推荐，包括匹配用户当前兴趣的个人意图和捕捉用户间共享的一般项目模式（如质量、价格）的公共意图。最后，为了将检索到的意图集成到序列建模中，我们提出了一种多分支注意力机制，用于捕获时间依赖性并注入个人和公共意图信号，随后通过自适应融合层构建最终的用户表示以进行推荐。在公共数据集上的大量实验证明了SAERec的优越性，它持续优于最先进的基线，同时提供人类可理解的解释。

英文摘要

Intent-based recommender systems have gained significant attention for improving accuracy and interpretability by modeling the underlying motivations behind user behaviors. Most existing models derive intents directly from user sequences via clustering or prototype learning. However, they are sensitive to sequence quality, require presetting the number of intents, and lack explicit semantic grounding. These issues lead to an incomplete and coarse intent set and limit the effectiveness of recommendation. In this paper, we propose the Sparse Autoencoder for intent-based recommendation (SAERec), a novel recommender that automatically constructs a fine-grained and interpretable intent space from a textual corpus to guide recommendation. Rather than treating texts as side signals, SAERec leverages them as high information density evidence for intent construction. Specifically, we first extract a comprehensive set of fine-grained interpretable intents from the latent space of large language models (LLMs) by using a sparse autoencoder (SAE) to disentangle and interpret text embeddings, which isolates intent-related semantics from textual noise. Then, for each user, we retrieve relevant intents from this set as priors to guide recommendation. It contains personal intents matching a user's current interests and public intents capturing general item patterns shared across users (e.g., quality, price). Finally, to integrate retrieved intents into sequence modeling, we propose a multi-branch attention mechanism that captures temporal dependencies and injects both personal and public intent signals, followed by an adaptive fusion layer to construct the final user representation for recommendation. Extensive experiments on public datasets demonstrate the superiority of SAERec, consistently outperforming state-of-the-art baselines while providing human-understandable explanations.

URL PDF HTML ☆

赞 0 踩 0