arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.19351 2026-04-23 cs.CL

DASH-KV: Accelerating Long-Context LLM Inference via Asymmetric KV Cache Hashing

Jinyu Guo, Zhihan Zhang, Yutong Li, Jiehui Xie, Md. Tamim Iqbal, Dongshen Han, Lik-Hang Lee, Sung-Ho Bae, Jie Zou, Yang Yang, Chaoning Zhang

Comments Accepted by ACL 2026 (Findings)

详情

英文摘要

The quadratic computational complexity of the standard attention mechanism constitutes a fundamental bottleneck for large language models in long-context inference. While existing KV cache compression methods alleviate memory pressure, they often sacrifice generation quality and fail to address the high overhead of floating-point arithmetic. This paper introduces DASH-KV, an innovative acceleration framework that reformulates attention as approximate nearest-neighbor search via asymmetric deep hashing. Under this paradigm, we design an asymmetric encoding architecture that differentially maps queries and keys to account for their distinctions in precision and reuse characteristics. To balance efficiency and accuracy, we further introduce a dynamic mixed-precision mechanism that adaptively retains full-precision computation for critical tokens. Extensive experiments on LongBench demonstrate that DASH-KV significantly outperforms state-of-the-art baseline methods while matching the performance of full attention, all while reducing inference complexity from O(N^2) to linear O(N). The code is available at https://github.com/Zhihan-Zh/DASH-KV

URL PDF HTML ☆

赞 0 踩 0

2604.19245 2026-04-23 cs.CL cs.AI

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

Clara Lachenmaier, Hannah Bultmann, Sina Zarrieß

Comments Preprint accepted at ACL Main Conference 2026

2604.19054 2026-04-23 cs.CV

Evaluation of Winning Solutions of 2025 Low Power Computer Vision Challenge

Zihao Ye, Yung-Hsiang Lu, Xiao Hu, Shuai Zhang, Taotao Jing, Xin Li, Zhen Yao, Bo Lang, Zhihao Zheng, Seungmin Oh, Hankyul Kang, Seunghun Kang, Jongbin Ryu, Kexin Chen, Yuan Qi, George K Thiruvathukal, Mooi Choo Chuah

Comments 11 pages, 8 figures, 4 tables

2604.18878 2026-04-23 cs.CL

LegalBench-BR: A Benchmark for Evaluating Large Language Models on Brazilian Legal Decision Classification

Pedro Barbosa de Carvalho Neto

Comments 8 pages, 1 figure. Preprint. First public benchmark for Brazilian legal text classification. Dataset and model available on Hugging Face

2604.18644 2026-04-23 cs.LG cs.AI

FASE : A Fairness-Aware Spatiotemporal Event Graph Framework for Predictive Policing

Pronob Kumar Barman, Pronoy Kumar Barman, Plaban Kumar Barman, Rohan Mandar Salvi

2604.18570 2026-04-23 cs.LG cs.AI cs.CL

A multimodal and temporal foundation model for virtual patient representations at healthcare system scale

Andrew Zhang, Tong Ding, Sophia J. Wagner, Caiwei Tian, Ming Y. Lu, Rowland Pettit, Joshua E. Lewis, Alexandre Misrahi, Dandan Mo, Long Phi Le, Faisal Mahmood

详情

英文摘要

Modern medicine generates vast multimodal data across siloed systems, yet no existing model integrates the full breadth and temporal depth of the clinical record into a unified patient representation. We introduce Apollo, a multimodal temporal foundation model trained and evaluated on over three decades of longitudinal hospital records from a major US hospital system, composed of 25 billion records from 7.2 million patients, representing 28 distinct medical modalities and 12 major medical specialties. Apollo learns a unified representation space integrating over 100 thousand unique medical events in our clinical vocabulary as well as images and clinical text. This "atlas of medical concepts" forms a computational substrate for modeling entire patient care journeys comprised of sequences of structured and unstructured events, which are compressed by Apollo into virtual patient representations. To assess the potential of these whole-patient representations, we created 322 prognosis and retrieval tasks from a held-out test set of 1.4 million patients. We demonstrate the generalized clinical forecasting potential of Apollo embeddings, including predicting new disease onset risk up to five years in advance (95 tasks), disease progression (78 tasks), treatment response (59 tasks), risk of treatment-related adverse events (17 tasks), and hospital operations endpoints (12 tasks). Using feature attribution techniques, we show that model predictions align with clinically-interpretable multimodal biomarkers. We evaluate semantic similarity search on 61 retrieval tasks, and moreover demonstrate the potential of Apollo as a multimodal medical search engine using text and image queries. Together, these modeling capabilities establish the foundation for computable medicine, where the full context of patient care becomes accessible to computational reasoning.

URL PDF HTML ☆

赞 0 踩 0

2604.18562 2026-04-23 cs.CV

AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation

Rui Qian, Chuanhang Deng, Qiang Huang, Jian Xiong, Mingxuan Li, Yingbo Zhou, Wei Zhai, Jintao Chen, Dejing Dou

Comments This work has been accepted to ACL 2026, please refer to https://github.com/rui-qian/AnchorSeg

2604.17931 2026-04-23 cs.AI

LiteResearcher: A Scalable Agentic RL Training Framework for Deep Research Agent

Wanli Li, Bince Qu, Bo Pan, Jianyu Zhang, Zheng Liu, Pan Zhang, Wei Chen, Bo Zhang

Comments Preprint. Under review

2604.17555 2026-04-23 cs.AI cs.CL cs.IR

CoSearch: Joint Training of Reasoning and Document Ranking via Reinforcement Learning for Agentic Search

Hansi Zeng, Liam Collins, Bhuvesh Kumar, Neil Shah, Hamed Zamani

2604.17517 2026-04-23 cs.AI cs.CR

From Admission to Invariants: Measuring Deviation in Delegated Agent Systems

Marcelo Fernandez

Comments 21 pages, 6 figures. 3rd paper (Paper 2) in the 6-paper Agent Governance Series (Papers 0-5). Zenodo: https://doi.org/10.5281/zenodo.19672589. Companion: P0 (arXiv:2604.17511), P1/ACP (arXiv:2603.18829), P3 (zenodo.19672597), P4 (zenodo.19672608), P5/RAM (zenodo.19669430)

详情

DOI: 10.5281/zenodo.19672589

英文摘要

Autonomous agent systems are governed by enforcement mechanisms that flag hard constraint violations at runtime. The Agent Control Protocol identifies a structural limit of such systems: a correctly-functioning enforcement engine can enter a regime in which behavioral drift is invisible to it, because the enforcement signal operates below the layer where deviation is measurable. We show that enforcement-based governance is structurally unable to determine whether an agent behavior remains within the admissible behavior space A0 established at admission time. Our central result, the Non-Identifiability Theorem, proves that A0 is not in the sigma-algebra generated by the enforcement signal g under the Local Observability Assumption, which every practical enforcement system satisfies. The impossibility arises from a fundamental mismatch: g evaluates actions locally against a point-wise rule set, while A0 encodes global, trajectory-level behavioral properties set at admission time. An agent can therefore drift -- systematically shifting its behavioral distribution away from admission-time expectations -- while every individual action remains within the permitted action space. We define the Invariant Measurement Layer (IML), which bypasses this limitation by retaining direct access to the generative model of A0, restoring observability precisely in the region where enforcement is structurally blind. We prove an information-theoretic impossibility for enforcement-based monitoring and show IML detects admission-time drift with provably finite detection delay. Validated across four settings: three drift scenarios (300 and 1000 steps), a live n8n webhook pipeline, and a LangGraph StateGraph agent -- enforcement triggers zero violations while IML detects each drift type within 9-258 steps of drift onset.

URL PDF HTML ☆

赞 0 踩 0

2604.16914 2026-04-23 cs.CV eess.IV

Unified Ultrasound Intelligence Toward an End-to-End Agentic System

Chen Ma, Yunshu Li, Junhu Fu, Shuyu Liang, Yuanyuan Wang, Yi Guo

Comments Accepted by ISBI2026. 5 pages, 2 figures

2604.16879 2026-04-23 cs.CV

Adaptive Forensic Feature Refinement via Intrinsic Importance Perception

Jiazhen Yang, Junjun Zheng, Kejia Chen, Xiangheng Kong, Jie Lei, Zunlei Feng, Bingde Hu, Yang Gao

2604.16607 2026-04-23 cs.CL cs.AI

Spotlights and Blindspots: Evaluating Machine-Generated Text Detection

Kevin Stowe, Kailash Patil

Comments 15 pages, 4 figures, 4 tables

2604.15451 2026-04-23 cs.CV

Weak-to-Strong Knowledge Distillation Accelerates Visual Learning

Baiang Li, Wenhao Chai, Felix Heide

Comments 18 pages, 7 figures

2604.15153 2026-04-23 cs.CL cs.AI

Compressing Sequences in the Latent Embedding Space: $K$-Token Merging for Large Language Models

Zihao Xu, John Harvill, Ziwei Fan, Yizhou Sun, Hao Ding, Hao Wang

Comments Under Review

2604.14980 2026-04-23 cs.AI cs.CL cs.HC

Hybrid Decision Making via Conformal VLM-generated Guidance

Debodeep Banerjee, Burcu Sayin, Stefano Teso, Andrea Passerini

2604.14593 2026-04-23 cs.CL cs.AI

Mechanistic Decoding of Cognitive Constructs in Large Language Models

Yitong Shou, Manhao Guan

2604.14128 2026-04-23 cs.CL cs.AI cs.LG

Rhetorical Questions in LLM Representations: A Linear Probing Study

Louie Hong Yao, Vishesh Anand, Yuan Zhuang, Tianyu Jiang

Comments 18 pages, 15 figures, accepted to ACL 2026

2604.14116 2026-04-23 cs.AI cs.CL

TREX: Automating LLM Fine-tuning via Agent-Driven Tree-based Exploration

Zerun Ma, Guoqiang Wang, Xinchen Xie, Yicheng Chen, He Du, Bowen Li, Yanan Sun, Wenran Liu, Kai Chen, Yining Li

2604.13871 2026-04-23 cs.LG cs.SY eess.SY

Hardware-Efficient Neuro-Symbolic Networks with the Exp-Minus-Log Operator

Eymen Ipek

Comments This paper has been withdrawn by the authors due to the discovery of a fundamental limitation in EML method

2604.13533 2026-04-23 cs.RO cs.CV

Evolvable Embodied Agent for Robotic Manipulation via Long Short-Term Reflection and Optimization

Jianzong Wang, Botao Zhao, Yayun He, Junqing Peng, Xulong Zhang

Comments This work has been accepted for publication in the Proceedings of the 2026 International Joint Conference on Neural Networks (IJCNN 2026)

2604.12752 2026-04-23 cs.CV

Scaling In-Context Segmentation with Hierarchical Supervision

T. Camaret Ndir, Marco Reisert, Robin T. Schirrmeister

2604.12652 2026-04-23 cs.CV cs.AI

PromptEcho: Annotation-Free Reward from Vision-Language Models for Text-to-Image Reinforcement Learning

Jinlong Liu, Wanggui He, Peng Zhang, Mushui Liu, Hao Jiang, Pipei Huang

2604.10960 2026-04-23 cs.AI

RAG-KT: Cross-platform Explainable Knowledge Tracing with Multi-view Fusion Retrieval Generation

Zhiyi Duan, Hongyu Yuan, Rui Liu

2604.10647 2026-04-23 cs.RO

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

Shaqi Luo, Yuanyuan Li, Youhao Hu, Chenhao Yu, Chaoran Xu, Jiachen Zhang, Guocai Yao, Tiejun Huang, Ran He, Zhongyuan Wang

2604.10063 2026-04-23 cs.CL

Mirroring Minds: Asymmetric Linguistic Accommodation and Diagnostic Identity in ADHD and Autism Reddit Communities

Saad Mankarious, Nour Zeid, Iyad Ait Hou, Rebecca Hwa, Aya Zirikly

2604.09563 2026-04-23 cs.AI cs.CL cs.LG

Seven simple steps for log analysis in AI systems

Magda Dubois, Ekin Zorer, Maia Hamin, Joe Skinner, Alexandra Souly, Jerome Wynne, Harry Coppock, Lucas Sato, Sayash Kapoor, Sunishchal Dev, Keno Juchems, Kimberly Mai, Timo Flesch, Lennart Luettgau, Charles Teague, Eric Patey, JJ Allaire, Lorenzo Pacchiardi, Jose Hernandez-Orallo, Cozmin Ududec

2604.08948 2026-04-23 cs.CL

TaxPraBen: A Scalable Benchmark for Structured Evaluation of LLMs in Chinese Real-World Tax Practice

Gang Hu, Yating Chen, Haiyan Ding, Wang Gao, Jiajia Huang, Min Peng, Qianqian Xie, Kun Yue

2604.08712 2026-04-23 cs.AI

Model Space Reasoning as Search in Feedback Space for Planning Domain Generation

James Oswald, Daniel Obolensky, Volodymyr Varha, Vasilije Dragovic, Kavitha Srinivas, Harsha Kokel, Michael Katz, Shirin Sohrabi

Comments Accepted at ICLR 2026 the 2nd Workshop on World Models: Understanding, Modelling and Scaling

2604.07798 2026-04-23 cs.AI

Lightweight LLM Agent Memory with Small Language Models

Jiaquan Zhang, Chaoning Zhang, Shuxu Chen, Zhenzhen Huang, Pengcheng Zheng, Zhicheng Wang, Ping Guo, Fan Mo, Sung-Ho Bae, Jie Zou, Jiwei Wei, Yang Yang

Comments Accepted by ACL 2026 (main)