arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2601.14662 2026-04-21 cs.AI cs.MA

Query-Efficient Agentic Graph Extraction Attacks on GraphRAG Systems

Shuhua Yang, Jiahao Zhang, Yilong Wang, Dongwon Lee, Suhang Wang

Comments To be published in ACL Main 2026

详情

英文摘要

Graph-based retrieval-augmented generation (GraphRAG) systems construct knowledge graphs over document collections to support multi-hop reasoning. While prior work shows that GraphRAG responses may leak retrieved subgraphs, the feasibility of query-efficient reconstruction of the hidden graph structure remains unexplored under realistic query budgets. We study a budget-constrained black-box setting where an adversary adaptively queries the system to steal its latent entity-relation graph. We propose AGEA (Agentic Graph Extraction Attack), a framework that leverages a novelty-guided exploration-exploitation strategy, external graph memory modules, and a two-stage graph extraction pipeline combining lightweight discovery with LLM-based filtering. We evaluate AGEA on medical, agriculture, and literary datasets across Microsoft-GraphRAG and LightRAG systems. Under identical query budgets, AGEA significantly outperforms prior attack baselines, recovering up to 90% of entities and relationships while maintaining high precision. These results demonstrate that modern GraphRAG systems are highly vulnerable to structured, agentic extraction attacks, even under strict query limits. The code is available at https://github.com/shuashua0608/AGEA.

URL PDF HTML ☆

赞 0 踩 0

2601.10384 2026-04-21 cs.SD

RSA-Bench: Benchmarking Audio Large Models in Real-World Acoustic Scenarios

Yibo Zhang, Liang Lin, Kaiwen Luo, Shilinlu Yan, Jin Wang, Yaoqi Guo, Yitian Chen, Yalan Qin, Zhenhong Zhou, Kun Wang, Li Sun

2601.10306 2026-04-21 cs.AI cs.CL

Evidence-Augmented Policy Optimization with Reward Co-Evolution for Long-Context Reasoning

Xin Guan, Zijian Li, Shen Huang, Pengjun Xie, Jingren Zhou, Jiuxin Cao

2601.09825 2026-04-21 cs.LG

Eluder dimension: localise it!

Alireza Bakhtiari, Alex Ayoub, Samuel Robertson, David Janz, Csaba Szepesvári

Comments This version corrects a significant error in the published NeurIPS proceedings version. We thank Marc Abeille for bringing the error to our attention

2601.08841 2026-04-21 cs.CL cs.AI cs.DL

Triples and Knowledge-Infused Embeddings for Clustering and Classification of Scientific Documents

Mihael Arcan

2601.07711 2026-04-21 cs.CL

Is Agentic RAG worth it? An experimental comparison of RAG approaches

Pietro Ferrazzi, Milica Cvjeticanin, Alessio Piraccini, Davide Giannuzzi

Comments Accepted at ACL 2026 (Industry Track)

2601.07155 2026-04-21 cs.LG cs.AI

Stable On-Policy Distillation through Adaptive Target Reformulation

Ijun Jang, Jewon Yeom, Juan Yeo, Hyunggu Lim, Taesup Kim

Comments 10 pages, 5 figures, Accepted to Findings of ACL 2026

2601.06931 2026-04-21 cs.CV cs.AI cs.CL

Measuring Social Bias in Vision-Language Models with Face-Only Counterfactuals from Real Photos

Haodong Chen, Qiang Huang, Jiaqi Zhao, Qiuping Jiang, Xiaojun Chang, Jun Yu

Comments 18 pages, 18 figures, and 3 tables

2601.06803 2026-04-21 cs.CL cs.CV

Forest Before Trees: Latent Superposition for Efficient Visual Reasoning

Yubo Wang, Juntian Zhang, Yichen Wu, Yankai Lin, Nils Lukas, Yuhan Liu

Comments Accepted by ACL 2026 Main Conference

2601.05543 2026-04-21 cs.CL cs.SD eess.AS

Closing the Modality Reasoning Gap for Speech Large Language Models

Chaoren Wang, Heng Lu, Xueyao Zhang, Shujie Liu, Yan Lu, Jinyu Li, Zhizheng Wu

Comments Accepted by ACL 2026 Main Conference

2601.05053 2026-04-21 cs.AI cs.CL

Reinforced Efficient Reasoning via Semantically Diverse Exploration

Ziqi Zhao, Zhaochun Ren, Jiahong Zou, Liu Yang, Zhiwei Xu, Xuri Ge, Zhumin Chen, Xinyu Ma, Daiting Shi, Shuaiqiang Wang, Dawei Yin, Xin Xin

Comments Accepted at ACL 2026 Main

2601.04809 2026-04-21 cs.AI

SCALER:Synthetic Scalable Adaptive Learning Environment for Reasoning

Caijun Xu, Changyi Xiao, Zhongyuan Peng, Xinrun Wang, Yixin Cao

Comments 22 pages,5 figures

2601.04740 2026-04-21 cs.CL cs.AI

StealthGraph: Exposing Domain-Specific Risks in LLMs through Knowledge-Graph-Guided Harmful Prompt Generation

Huawei Zheng, Xinqi Jiang, Sen Yang, Shouling Ji, Yingcai Wu, Dazhen Deng

2601.04695 2026-04-21 cs.AI cs.LG

Tape: A Cellular Automata Benchmark for Evaluating Rule-Shift Generalization in Reinforcement Learning

Enze Pan

Comments ICML reject and seeking for NeurIPS

2601.04448 2026-04-21 cs.CL cs.AI

Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models

San Kim, Gary Geunbae Lee

Comments 18 pages

2601.04052 2026-04-21 cs.RO cs.CL

Stable Language Guidance for Vision-Language-Action Models

Zhihao Zhan, Yuhao Chen, Jiaying Zhou, Qinhan Lyu, Hao Liu, Keze Wang, Liang Lin, Guangrun Wang

Comments Accepted to ACL2026 main conference

2601.04043 2026-04-21 cs.CL

When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life

Xinyue Lou, Jinan Xu, Jingyi Yin, Xiaolong Wang, Zhaolu Kang, Youwei Liao, Yixuan Wang, Xiangyu Shi, Fengran Mo, Su Yao, Kaiyu Huang

Comments Accepted by ACL 2026 (Findings)

2601.04029 2026-04-21 cs.CL

SpeakerSleuth: Can Large Audio-Language Models Judge Speaker Consistency across Multi-turn Dialogues?

Jonggeun Lee, Junseong Pyo, Gyuhyeon Seo, Yohan Jo

Comments Accepted at ACL 2026 (Main)

2601.03938 2026-04-21 cs.LG cs.AI cs.CL

FOREVER: Forgetting Curve-Inspired Memory Replay for Language Model Continual Learning

Yujie Feng, Hao Wang, Jian Li, Xu Chu, Zhaolu Kang, Yiran Liu, Yasha Wang, Philip S. Yu, Xiao-Ming Wu

Comments ACL 2026 Camera-ready

2601.03331 2026-04-21 cs.CV cs.AI cs.LG

MMErroR: A Benchmark for Erroneous Reasoning in Vision-Language Models

Yang Shi, Yifeng Xie, Minzhe Guo, Liangsi Lu, Mingxuan Huang, Jingchao Wang, Zhihong Zhu, Boyan Xu, Zhiqi Huang

Comments Accepted by ACL 2026 Main

2601.03190 2026-04-21 cs.CL

Maximizing Local Entropy Where It Matters: Prefix-Aware Localized LLM Unlearning

Naixin Zhai, Pengyang Shao, Binbin Zheng, Yonghui Yang, Fei Shen, Long Bai, Xun Yang

Comments Accepted to ACL 2026 main

2601.03154 2026-04-21 cs.CL

Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective

Beiduo Chen, Tiancheng Hu, Caiqi Zhang, Robert Litschko, Anna Korhonen, Barbara Plank

Comments Accepted by ACL 2026 Findings, 21 pages, 10 figures

2601.02970 2026-04-21 cs.CL cs.LG

Reliability-Aware Adaptive Self-Consistency for Efficient Sampling in LLM Reasoning

Junseok Kim, Nakyeong Yang, Kyungmin Min, Kyomin Jung

Comments ACL 2026, Code is available at https://github.com/junseokkim00/ReASC

2601.02933 2026-04-21 cs.CL cs.HC

Pearmut: Human Evaluation of Translation Made Trivial

Vilém Zouhar, Tom Kocmi

Comments typeset with Typst

2601.01943 2026-04-21 cs.LG

SynRXN: An Open Benchmark and Curated Dataset for Computational Reaction Modeling

Tieu-Long Phan, Nhu-Ngoc Nguyen Song, Peter F. Stadler

Comments 31 pages (including references), 3 figures, 7 tables

详情

DOI: 10.1038/s41597-026-07260-w
Journal ref: Scientific Data 13, 625 (2026)

英文摘要

We present SynRXN, a unified benchmarking framework and open-data resource for computer-aided synthesis planning (CASP). SynRXN decomposes end-to-end synthesis planning into five task families, covering reaction rebalancing, atom-to-atom mapping, reaction classification, reaction property prediction, and synthesis route design. Curated, provenance-tracked reaction corpora are assembled from heterogeneous public sources into a harmonized representation and packaged as versioned datasets for each task family, with explicit source metadata, licence tags, and machine-readable manifests that record checksums, and row counts. For every task, SynRXN provides transparent splitting functions that generate leakage-aware train, validation, and test partitions, together with standardized evaluation workflows and metric suites tailored to classification, regression, and structured prediction settings. For sensitive benchmarking, we combine public training and validation data with held-out gold-standard test sets, and contamination-prone tasks such as reaction rebalancing and atom-to-atom mapping are distributed only as evaluation sets and are explicitly not intended for model training. Scripted build recipes enable bitwise-reproducible regeneration of all corpora across machines and over time, and the entire resource is released under permissive open licences to support reuse and extension. By removing dataset heterogeneity and packaging transparent, reusable evaluation scaffolding, SynRXN enables fair longitudinal comparison of CASP methods, supports rigorous ablations and stress tests along the full reaction-informatics pipeline, and lowers the barrier for practitioners who seek robust and comparable performance estimates for real-world synthesis planning workloads.

URL PDF HTML ☆

赞 0 踩 0

2512.24827 2026-04-21 cs.LG

Inter-Agent Relative Representations for Multi-Agent Option Discovery

Raul D. Steleac, Mohan Sridharan, David Abel

2512.24086 2026-04-21 cs.CV

RainFusion2.0: Temporal-Spatial Awareness and Hardware-Efficient Block-wise Sparse Attention

Aiyue Chen, Yaofu Liu, Junjian Huang, Guang Lian, Yiwu Yao, Wangli Lan, Jing Lin, Zhixin Ma, Tingting Zhou

2512.23786 2026-04-21 cs.CV cs.RO

Bridging the Ex-Vivo to In-Vivo Gap: Synthetic Priors for Monocular Depth Estimation in Specular Surgical Environments

Ankan Aich, Emma D. Ryan, Kris Moe, Isaac Schmale, Li-Xing Man, Yangming Lee

2512.23405 2026-04-21 cs.LG stat.ML

On the Sample Complexity of Learning for Blind Inverse Problems

Nathan Buskulic, Luca Calatroni, Lorenzo Rosasco, Silvia Villa

详情

英文摘要

Blind inverse problems arise in many experimental settings where both the signal of interest and the forward operator are (partially) unknown. In this context, methods developed for the non-blind case cannot be adapted in a straightforward manner due to identifiability issues and symmetric solutions inherent to the blind setting. Recently, data-driven approaches have been proposed to address such problems, demonstrating strong empirical performance and adaptability. However, these methods often lack interpretability and are not supported by theoretical guarantees, limiting their reliability in domains such as applied imaging where a blind approach often relates to a calibration of the acquisition device. In this work, we shed light on learning in blind inverse problems within the insightful framework of Linear Minimum Mean Square Estimators (LMMSEs). We provide a theoretical analysis, deriving closed-form expressions for optimal estimators and extending classical recovery results to the blind setting. In particular, we establish equivalences with tailored Tikhonov-regularized formulations, where the regularization structure depends explicitly on the distributions of the unknown signal, of the noise, and of the random forward operator. We also show how the reconstruction error converges as the noise and the randomness of the operator diminish when we use a source condition assumption. Furthermore, we derive finite-sample error bounds that characterize the performance of the learned estimators as a function of the noise level, problem conditioning, and number of available samples. These bounds explicitly quantify the impact of operator randomness and show explicitly the dependence of the associated convergence rates to this randomness factors. Finally, we validate our theoretical findings through illustrative exemplar numerical experiments that confirm the predicted convergence behavior.

URL PDF HTML ☆

赞 0 踩 0

2512.21510 2026-04-21 cs.LG cs.CV

Missing Pattern Tree based Decision Grouping and Ensemble for Enhancing Pair Utilization in Deep Incomplete Multi-View Clustering

Jie Xu, Wenyuan Yang, Yazhou Ren, Lifang He, Philip S. Yu, Xiaofeng Zhu