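arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.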

2604.20452 2026-04-23 cs.IR cs.CL

HaS: Accelerating RAG through Homology-Aware Speculative Retrieval

Peng Peng, Weiwei Lin, Wentai Wu, Xinyang Wang, Yongheng Liu

Comments Accepted by ICDE 2026

Abstract

Retrieval-Augmented Generation (RAG) expands the knowledge boundary of large language models (LLMs) at inference by retrieving external documents as context. However, retrieval becomes increasingly time-consuming as the knowledge databases grow in size. Existing acceleration strategies either compromise accuracy through approximate retrieval, or achieve marginal gains by reusing results of strictly identical queries. We propose HaS, a homology-aware speculative retrieval framework that performs low-latency speculative retrieval over restricted scopes to obtain candidate documents, followed by validating whether they contain the required knowledge. The validation, grounded in the homology relation between queries, is formulated as a homologous query re-identification task: once a previously observed query is identified as a homologous re-encounter of the incoming query, the draft is deemed acceptable, allowing the system to bypass slow full-database retrieval. Benefiting from the prevalence of homologous queries under real-world popularity patterns, HaS achieves substantial efficiency gains. Extensive experiments demonstrate that HaS reduces retrieval latency by 23.74% and 36.99% across datasets with only a 1-2% marginal accuracy drop. As a plug-and-play solution, HaS also significantly accelerates complex multi-hop queries in modern agentic RAG pipelines. Source code is available at: https://github.com/ErrEqualsNil/HaS.
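The accept-or-fall-back control flow described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation (see their repository for that). It assumes that queries arrive as embedding vectors, that the "restricted scope" is simply the history of previously answered queries, and that a cosine-similarity threshold stands in for homologous query re-identification. The names `SpeculativeRetriever`, `full_retrieve`, and `threshold` are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class SpeculativeRetriever:
    """Toy speculative retrieval: reuse documents of a similar past query.

    Assumption: a similarity threshold is used here as a stand-in for the
    paper's homologous query re-identification step.
    """

    def __init__(self, full_retrieve, threshold=0.9):
        self.full_retrieve = full_retrieve  # slow full-database search (hypothetical callable)
        self.threshold = threshold          # acceptance threshold (assumed value)
        self.history = []                   # list of (query_vector, documents)
        self.full_calls = 0                 # how often the slow path was taken

    def retrieve(self, query_vec):
        # Speculative step: search only the restricted scope (past queries).
        best_docs, best_sim = None, -1.0
        for past_vec, docs in self.history:
            sim = cosine(query_vec, past_vec)
            if sim > best_sim:
                best_docs, best_sim = docs, sim

        # Validation step: accept the draft only if a past query is close enough.
        if best_docs is not None and best_sim >= self.threshold:
            return best_docs

        # Draft rejected: fall back to the slow full retrieval and remember it.
        self.full_calls += 1
        docs = self.full_retrieve(query_vec)
        self.history.append((query_vec, docs))
        return docs
```

In this sketch, latency is saved only when the draft is accepted, so the benefit depends on how often similar queries recur, which matches the abstract's point about query popularity patterns.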


2604.20436 2026-04-23 cs.SE cs.AI

Shift-Up: A Framework for Software Engineering Guardrails in AI-native Software Development -- Initial Findings

Petrus Lipsanen, Liisa Rannikko, François Christophe, Konsta Kalliokoski, Vlad Stirbu, Tommi Mikkonen

Comments This paper has been accepted for presentation at the VibeX 2026 International Workshop on Vibe Coding and Vibe Researching

2604.20417 2026-04-23 cs.IR cs.AI

Semantic Recall for Vector Search

Leonardo Kuffo, Ioanna Tsakalidou, Roberta De Viti, Albert Angel, Jiří Iša, Rastislav Lenhardt

Comments Proceedings of the 49th International ACM SIGIR Conference on Research and Development in Information Retrieval

2604.20401 2026-04-23 cs.CR cs.AI

Onyx: Cost-Efficient Disk-Oblivious ANN Search

Deevashwer Rathee, Jean-Luc Watson, Zirui Neil Zhao, G. Edward Suh, Raluca Ada Popa

2604.20389 2026-04-23 cs.CR cs.AI

CyberCertBench: Evaluating LLMs in Cybersecurity Certification Knowledge

Gustav Keppler, Ghada Elbez, Veit Hagenmeyer

2604.20372 2026-04-23 physics.flu-dyn cs.AI cs.LG nlin.PS

AI models of unstable flow exhibit hallucination

Ramdhan Wibawa, Birendra Jha

2604.20304 2026-04-23 cond-mat.mtrl-sci cs.AI

LLM-guided phase diagram construction through high-throughput experimentation

Ryo Tamura, Haruhiko Morito, Yuna Oikawa, Guillaume Deffrennes, Shoichi Matsuda, Naruki Yoshikawa, Tomoaki Takayama, Taichi Abe, Koji Tsuda, Kei Terayama

Comments 39 pages

2604.20301 2026-04-23 stat.ML cs.LG stat.CO stat.ME

Properties and limitations of geometric tempering for gradient flow dynamics

Francesca Romana Crucinio, Sahani Pathiraja

Comments Accepted at TMLR https://openreview.net/forum?id=IP0w5LdcxC

2604.20296 2026-04-23 stat.ML cs.LG

Online Survival Analysis: A Bandit Approach under Cox PH Model

Yang Xu, Wenbin Lu, Rui Song

2604.20270 2026-04-23 eess.AS cs.SD

Embedding-Based Intrusive Evaluation Metrics for Musical Source Separation Using MERT Representations

Paul A. Bereuter, Alois Sontacchi

Comments Presented at DAGA 2026 (Annual German Conference on Acoustics)

2604.20269 2026-04-23 cs.CR cs.AI

Text Steganography with Dynamic Codebook and Multimodal Large Language Model

Jianxin Gao, Ruohan Lei, Wanli Peng

2604.20263 2026-04-23 q-bio.QM cs.AI cs.LG

AROMA: Augmented Reasoning Over a Multimodal Architecture for Virtual Cell Genetic Perturbation Modeling

Zhenyu Wang, Geyan Ye, Wei Liu, Man Tat Alexander Ng

Comments Accepted to ACL 2026 as a Findings paper. Zhenyu Wang and Geyan Ye are equal contributors; Geyan Ye is the corresponding author and project lead

2604.20245 2026-04-23 cs.IT cs.CR cs.CV eess.IV math.IT

Secure Rate-Distortion-Perception: A Randomized Distributed Function Computation Approach for Realism

Gustaf Åhlgren, Onur Günlü

Comments 20 pages, 6 figures, (submitted) journal version

2604.20211 2026-04-23 cs.SE cs.AI cs.CR

Towards Secure Logging: Characterizing and Benchmarking Logging Code Security Issues with LLMs

He Yang Yuan, Xin Wang, Kundi Yao, An Ran Chen, Zishuo Ding, Zhenhao Li

Comments Accepted at FSE 2026 Research Papers Track

2604.20179 2026-04-23 cs.CR cs.AI cs.SE

Taint-Style Vulnerability Detection and Confirmation for Node.js Packages Using LLM Agent Reasoning

Ronghao Ni, Mihai Christodorescu, Limin Jia

Comments 19 pages, 6 figures

2604.20154 2026-04-23 eess.IV cs.CV cs.LG

Maximum Likelihood Reconstruction for Multi-Look Digital Holography with Markov-Modeled Speckle Correlation

Xi Chen, Arian Maleki, Shirin Jalali

2604.20147 2026-04-23 math.OC cs.LG

Robust Out-of-Distribution Stochastic Optimization

Xianyu Li, Huan Xu, Xiaolin Huang, Chao Shang

2604.20146 2026-04-23 cs.IR cs.CL

SAKE: Self-aware Knowledge Exploitation-Exploration for Grounded Multimodal Named Entity Recognition

Jielong Tang, Xujie Yuan, Jiayang Liu, Jianxing Yu, Xiao Dong, Lin Chen, Yunlai Teng, Shimin Di, Jian Yin

Comments 23 pages, 12 figures

2604.20145 2026-04-23 cs.DB cs.LG

Pre-Execution Query Slot-Time Prediction in Cloud Data Warehouses: A Feature-Scoped Machine Learning Approach

Prashant Kumar Pathak

Comments 10 pages, 3 figures, 2 tables. Independent research

2604.20143 2026-04-23 math.NA cs.LG cs.NA physics.comp-ph

Machine learning moment closure models for the radiative transfer equation IV: enforcing symmetrizable hyperbolicity in two dimensions

Juntao Huang

2604.20134 2026-04-23 cs.CR cs.AI cs.CL

AgentSOC: A Multi-Layer Agentic AI Framework for Security Operations Automation

Joyjit Roy, Samaresh Kumar Singh

Comments 7 pages, 6 figures, 2 tables. Peer-reviewed paper published in IEEE ICAIC 2026 (IEEE Xplore)

2604.20092 2026-04-23 cs.HC cs.RO

Heterogeneous Layered Structures Can Modulate Human Softness Perception

Yuno Higuchi, Yosuke Iwashita, Yuji Ohgi, Masashi Nakatani

Comments 7 pages, 7 figures

2604.20070 2026-04-23 cs.HC cs.AI cs.CE

Auditing and Controlling AI Agent Actions in Spreadsheets

Sadra Sabouri, Zeinabsadat Saghi, Run Huang, Sujay Maladi, Esmeralda Eufracio, Sumit Gulwani, Souti Chattopadhyay

Comments 11 pages, 5 figures

2604.20011 2026-04-23 cs.CY cs.AI cs.CL cs.HC

Frictionless Love: Associations Between AI Companion Roles and Behavioral Addiction

Vibhor Agarwal, Ke Zhou, Edyta Paulina Bogucka, Daniele Quercia

Comments Accepted at the ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2026

2604.20003 2026-04-23 q-bio.QM cs.AI cs.LG

scpFormer: A Foundation Model for Unified Representation and Integration of the Single-Cell Proteomics

Qifeng Zhou, Lei Yu, Yuzhi Guo, Yuwei Miao, Hehuan Ma, Wenliang Zhong, Lin Xu, Junzhou Huang

2604.19993 2026-04-23 cs.AR cs.LG

Algorithm and Hardware Co-Design for Efficient Complex-Valued Uncertainty Estimation

Zehuan Zhang, Mark Chen, He Li, Wayne Luk

Comments Accepted to 63rd ACM/IEEE Design Automation Conference (DAC '26). 7 pages, 6 figures

2604.19984 2026-04-23 cs.CY cs.AI cs.CL

Bias in the Tails: How Name-conditioned Evaluative Framing in Resume Summaries Destabilizes LLM-based Hiring

Huy Nghiem, Phuong-Anh Nguyen-Le, Sy-Tuyen Ho, Hal Daume

Comments First version, 43 pages

2604.19971 2026-04-23 cs.HC cs.AI

Semantic Prompting: Agentic Incremental Narrative Refinement through Spatial Semantic Interaction

Xuxin Tang, Ibrahim Tahmid, Eric Krokos, Kirsten Whitley, Xuan Wang, Chris North

Comments 9 pages, 7 figures, accepted by ACM AVI 2026

2604.19925 2026-04-23 econ.GN cs.AI cs.CY cs.HC q-fin.EC

Behavioral Transfer in AI Agents: Evidence and Privacy Implications

Shilei Luo, Zhiqi Zhang, Hengchen Dai, Dennis Zhang

2604.19856 2026-04-23 cs.AR cs.AI cs.LG

ChipCraftBrain: Validation-First RTL Generation via Multi-Agent Orchestration

Cagri Eryilmaz

Comments 17 pages, 6 figures. Preprint

Abstract

Large Language Models (LLMs) show promise for generating Register-Transfer Level (RTL) code from natural language specifications, but single-shot generation achieves only 60-65% functional correctness on standard benchmarks. Multi-agent approaches such as MAGE reach 95.9% on VerilogEval yet remain untested on harder industrial benchmarks such as NVIDIA's CVDP, lack synthesis awareness, and incur high API costs. We present ChipCraftBrain, a framework combining symbolic-neural reasoning with adaptive multi-agent orchestration for automated RTL generation. Four innovations drive the system: (1) adaptive orchestration over six specialized agents via a PPO policy over a 168-dim state (an alternative world-model MPC planner is also evaluated); (2) a hybrid symbolic-neural architecture that solves K-map and truth-table problems algorithmically while specialized agents handle waveform timing and general RTL; (3) knowledge-augmented generation from a 321-pattern base plus 971 open-source reference implementations with focus-aware retrieval; and (4) hierarchical specification decomposition into dependency-ordered sub-modules with interface synchronization. On VerilogEval-Human, ChipCraftBrain achieves 97.2% mean pass@1 (range 96.15-98.72% across 7 runs, best 154/156), on par with ChipAgents (97.4%, self-reported) and ahead of MAGE (95.9%). On a 302-problem non-agentic subset of CVDP spanning five task categories, we reach 94.7% mean pass@1 (286/302, averaged over 3 runs), a 36-60 percentage-point lift per category over the published single-shot baseline; we additionally lead three of four categories shared with NVIDIA's ACE-RTL despite using roughly 30x fewer per-problem attempts. A RISC-V SoC case study demonstrates hierarchical decomposition generating 8/8 lint-passing modules (689 LOC) validated on FPGA, where monolithic generation fails entirely.
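As a quick arithmetic check, the pass@1 percentages quoted in the abstract can be reproduced from the stated problem counts. The helper `pass_rate` is hypothetical, and the count 150/156 for the lower end of the range is an inference from the reported 96.15%, not a figure given in the abstract.

```python
def pass_rate(passed, total):
    """Percentage of problems passed, rounded to two decimals."""
    return round(100.0 * passed / total, 2)


# Best VerilogEval-Human run, stated as 154/156 in the abstract.
best_run = pass_rate(154, 156)

# Lower end of the reported range; 150/156 is inferred, not stated.
worst_run = pass_rate(150, 156)

# CVDP subset, stated as 286/302 in the abstract.
cvdp = pass_rate(286, 302)

print(best_run, worst_run, cvdp)
```

The three values agree with the reported 98.72%, 96.15%, and 94.7%.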
