arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.11462 2026-04-14 cs.AI

Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning

Xiaozhe Li, Tianyi Lyu, Yizhao Yang, Liang Shan, Siyi Yang, Ligao Zhang, Zhuoyi Huang, Qingwen Liu, Yang Li

详情

英文摘要

Large Language Models (LLMs) struggle with long-horizon tasks due to the "context bottleneck" and the "lost-in-the-middle" phenomenon, where accumulated noise from verbose environments degrades reasoning over multi-turn interactions. To address this issue, we introduce a symbiotic framework that decouples context management from task execution. Our architecture pairs a lightweight, specialized policy model, ContextCurator, with a powerful frozen foundation model, TaskExecutor. Trained via reinforcement learning, ContextCurator actively reduces information entropy in the working memory. It aggressively prunes environmental noise while preserving reasoning anchors, that is, sparse data points that are critical for future deductions. On WebArena, our framework improves the success rate of Gemini-3.0-flash from 36.4% to 41.2% while reducing token consumption by 8.8% (from 47.4K to 43.3K). On DeepSearch, it achieves a 57.1% success rate, compared with 53.9%, while reducing token consumption by a factor of 8. Remarkably, a 7B ContextCurator matches the context management performance of GPT-4o, providing a scalable and computationally efficient paradigm for autonomous long-horizon agents.

URL PDF HTML ☆

赞 0 踩 0

2604.11447 2026-04-14 cs.RO cs.SY eess.SY

Safe Human-to-Humanoid Motion Imitation Using Control Barrier Functions

Wenqi Cai, John Abanes, Nikolaos Evangeliou, Anthony Tzes

2604.11446 2026-04-14 cs.LG cs.AI cs.CL

Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration

Zhipeng Chen, Tao Qian, Wayne Xin Zhao, Ji-Rong Wen

Comments Working in progress

2604.11444 2026-04-14 cs.CV

HuiYanEarth-SAR: A Foundation Model for High-Fidelity and Low-Cost Global Remote Sensing Imagery Generation

Yongxiang Liu, Jie Zhou, Yafei Song, Tianpeng Liu, Li Liu

2604.11423 2026-04-14 cs.RO

Dyadic Partnership(DP): A Missing Link Towards Full Autonomy in Medical Robotics

Nassir Navab, Zhongliang Jiang

2604.11422 2026-04-14 cs.LG cs.AI

Emulating Non-Differentiable Metrics via Knowledge-Guided Learning: Introducing the Minkowski Image Loss

Filippo Quarenghi, Ryan Cotsakis, Tom Beucler

详情

英文摘要

The ``differentiability gap'' presents a primary bottleneck in Earth system deep learning: since models cannot be trained directly on non-differentiable scientific metrics and must rely on smooth proxies (e.g., MSE), they often fail to capture high-frequency details, yielding ``blurry'' outputs. We develop a framework that bridges this gap using two different methods to deal with non-differentiable functions: the first is to analytically approximate the original non-differentiable function into a differentiable equivalent one; the second is to learn differentiable surrogates for scientific functionals. We formulate the analytical approximation by relaxing discrete topological operations using temperature-controlled sigmoids and continuous logical operators. Conversely, our neural emulator uses Lipschitz-convolutional neural networks to stabilize gradient learning via: (1) spectral normalization to bound the Lipschitz constant; and (2) hard architectural constraints enforcing geometric principles. We demonstrate this framework's utility by developing the Minkowski image loss, a differentiable equivalent for the integral-geometric measures of surface precipitation fields (area, perimeter, connected components). Validated on the EUMETNET OPERA dataset, our constrained neural surrogate achieves high emulation accuracy, completely eliminating the geometric violations observed in unconstrained baselines. However, applying these differentiable surrogates to a deterministic super-resolution task reveals a fundamental trade-off: while strict Lipschitz regularization ensures optimization stability, it inherently over-smooths gradient signals, restricting the recovery of highly localized convective textures. This work highlights the necessity of coupling such topological constraints with stochastic generative architectures to achieve full morphological realism.

URL PDF HTML ☆

赞 0 踩 0

2604.11419 2026-04-14 cs.AI cs.CR

Beyond RAG for Cyber Threat Intelligence: A Systematic Evaluation of Graph-Based and Agentic Retrieval

Dzenan Hamzic, Florian Skopik, Max Landauer, Markus Wurzenberger, Andreas Rauber

2604.11416 2026-04-14 cs.LG

Exact Certification of Neural Networks and Partition Aggregation Ensembles against Label Poisoning

Ajinkya Mohgaonkar, Lukas Gosch, Mahalakshmi Sabanayagam, Debarghya Ghoshdastidar, Stephan Günnemann

Comments Workshop on Principled Design for Trustworthy AI @ ICLR 2026

2604.11415 2026-04-14 cs.CV

Observe Less, Understand More: Cost-aware Cross-scale Observation for Remote Sensing Understanding

Zhenghao Xie, Jing Xiao, Zhenqi Wang, Kexin Ma, Liang Liao, Gui-Song Xia, Mi Wang

2604.11411 2026-04-14 cs.CV

Online Reasoning Video Object Segmentation

Jinyuan Liu, Yang Wang, Zeyu Zhao, Weixin Li, Song Wang, Ruize Han

2604.11410 2026-04-14 cs.LG cs.SY eess.SY

Active Bayesian Inference for Robust Control under Sensor False Data Injection Attacks

Axel Andersson, György Dán

Comments 8 pages, 4 figures. This work has been submitted to the IEEE for possible publication

2604.11402 2026-04-14 cs.CV

Scene Change Detection with Vision-Language Representation Learning

Diwei Sheng, Vijayraj Gohil, Satyam Gaba, Zihan Liu, Giles Hamilton-Fletcher, John-Ross Rizzo, Yongqing Liang, Chen Feng

2604.11401 2026-04-14 cs.CV

GS4City: Hierarchical Semantic Gaussian Splatting via City-Model Priors

Qilin Zhang, Jinyu Zhu, Olaf Wysocki, Benjamin Busam, Boris Jutzi

2604.11400 2026-04-14 cs.RO cs.CV

EagleVision: A Multi-Task Benchmark for Cross-Domain Perception in High-Speed Autonomous Racing

Zakhar Yagudin, Murad Mebrahtu, Ren Jin, Jiaqi Huang, Yujia Yue, Dzmitry Tsetserukou, Jorge Dias, Majid Khonji

2604.11399 2026-04-14 cs.CV cs.CL

Reasoning Resides in Layers: Restoring Temporal Reasoning in Video-Language Models with Layer-Selective Merging

Zihang Fu, Haonan Wang, Jian Kang, Kenji Kawaguchi, Jiaying Wu

2604.11395 2026-04-14 cs.CV

Video-based Heart Rate Estimation with Angle-guided ROI Optimization and Graph Signal Denoising

Gan Pei, Junhao Ning, Boqiu Shen, Yan Zhu, Menghan Hu

Comments This paper has been accepted by ICASSP 2026

2604.11389 2026-04-14 cs.CV

ConvFormer3D-TAP: Phase/Uncertainty-Aware Front-End Fusion for Cine CMR View Classification Pipelines

Nafiseh Ghaffar Nia, Vinesh Appadurai, Suchithra V., Chinmay Rane, Daniel Pittman, James Carr, Adrienne Kline

2604.11386 2026-04-14 cs.RO cs.CV

ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation

Yiran Qin, Jiahua Ma, Li Kang, Wenzhan Li, Yihang Jiao, Xin Wen, Xiufeng Song, Heng Zhou, Jiwen Yu, Zhenfei Yin, Xihui Liu, Philip Torr, Yilun Du, Ruimao Zhang

Comments 14 pages, 8 figures, 4 tables; supplementary material included; Project page: https://faceong.github.io/ComSim/

2604.11378 2026-04-14 cs.AI cs.SY eess.SY

From Agent Loops to Structured Graphs:A Scheduler-Theoretic Framework for LLM Agent Execution

Hu Wei

Comments 51 pages, 4 figures

2604.11376 2026-04-14 cs.CV cs.AI

From Redaction to Restoration: Deep Learning for Medical Image Anonymization and Reconstruction

Adrienne Kline, Abhijit Gaonkar, Daniel Pittman, Chris Kuehn, Nils Forkert

2604.11374 2026-04-14 cs.CV cs.CL

What Do Vision-Language Models Encode for Personalized Image Aesthetics Assessment?

Koki Ryu, Hitomi Yanaka

Comments To appear at ACL 2026 findings

2604.11373 2026-04-14 cs.RO cs.AI

Minimal Embodiment Enables Efficient Learning of Number Concepts in Robot

Zhegong Shangguan, Alessandro Di Nuovo, Angelo Cangelosi

2604.11365 2026-04-14 cs.AI cs.CL

Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories

Peiyang Liu, Zhirui Chen, Xi Wang, Di Liang, Youru Li, Zhi Cai, Wei Ye

2604.11359 2026-04-14 cs.AI cs.LG

CoRe-ECG: Advancing Self-Supervised Representation Learning for 12-Lead ECG via Contrastive and Reconstructive Synergy

Zehao Qin, Xiaojian Lin, Ping Zhang, Hongliang Wu, Xinkang Wang, Guangling Liu, Bo Chen, Wenming Yang, Guijin Wang

2604.11355 2026-04-14 cs.CV

LEADER: Learning Reliable Local-to-Global Correspondences for LiDAR Relocalization

Jianshi Wu, Minghang Zhu, Dunqiang Liu, Wen Li, Sheng Ao, Siqi Shen, Chenglu Wen, Cheng Wang

Comments Accepted to CVPR 2026 (Highlight)

2604.11351 2026-04-14 cs.RO

WM-DAgger: Enabling Efficient Data Aggregation for Imitation Learning with World Models

Anlan Yu, Zaishu Chen, Peili Song, Zhiqing Hong, Haotian Wang, Desheng Zhang, Tian He, Yi Ding, Daqing Zhang

2604.11349 2026-04-14 cs.RO

Learning Racket-Ball Bounce Dynamics Across Diverse Rubbers for Robotic Table Tennis

Thomas Gossard

2604.11348 2026-04-14 cs.CV

LoGo-MR: Screening Breast MRI for Cancer Risk Prediction by Efficient Omni-Slice Modeling

Xin Wang, Yuan Gao, George Yiasemis, Antonio Portaluri, Zahra Aghdam, Muzhen He, Luyi Han, Yaofei Duan, Chunyao Lu, Xinglong Liang, Tianyu Zhang, Vivien van Veldhuizen, Yue Sun, Tao Tan, Ritse Mann, Jonas Teuwen

2604.11334 2026-04-14 cs.AI

Dynamic Summary Generation for Interpretable Multimodal Depression Detection

Shiyu Teng, Jiaqing Liu, Hao Sun, Yu Li, Shurong Chai, Ruibo Hou, Tomoko Tateyama, Lanfen Lin, Yen-Wei Chen

2604.11332 2026-04-14 cs.CV cs.AI

A Compact and Efficient 1.251 Million Parameter Machine Learning CNN Model PD36-C for Plant Disease Detection: A Case Study

Shkelqim Sherifi

Comments 17 pages, 24 figures