arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.19536 2026-04-22 cs.RO

LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

Xiangchen Wang, Weiye Zhu, Teng Wang, TianTian Geng, Zekai Zhang, Zhiyuan Qi, Jinyu Yang, Feng Zheng

Comments 8 pages, 4 figures

详情

英文摘要

Recent navigation systems achieve strong benchmark results, yet real-world deployment often remains visibly stop-and-go. This bottleneck arises because the sense-inference-execution loop is still blocking: after each new observation, the controller must wait for sensing, transmission, and inference before motion can continue. Reducing action-generation cost alone therefore does not remove redundant waiting. To address this issue, we present LiveVLN, a training-free framework for more continuous embodied navigation by augmenting pretrained VLM navigators with multi-step action continuation. Instead of pausing for each full sense-and-inference round, LiveVLN overlaps execution with the processing of newly arrived observations, allowing refreshed future actions to be handed off before the current executable prefix is exhausted. This design keeps actions continuously available during motion, reducing idle waiting and enabling smoother online execution. The framework operates at runtime and can be integrated with compatible pretrained VLM navigators. Across R2R and RxR, LiveVLN preserves benchmark performance while reducing waiting time and improving action availability. In real-world deployments, it cuts average episode waiting time by up to $77.7\%$ and shortens wall-clock episode time by $12.6\%$ on StreamVLN and $19.6\%$ on NaVIDA, yielding more coherent execution during deployment. Code is available at https://github.com/NIneeeeeem/LiveVLN.

URL PDF HTML ☆

赞 0 踩 0

2604.19523 2026-04-22 cs.AI

Revac: A Social Deduction Reasoning Agent

Mihir Shriniwas Arya, Avinash Anish, Aditya Ranjan

2604.19522 2026-04-22 cs.RO

GenerativeMPC: VLM-RAG-guided Whole-Body MPC with Virtual Impedance for Bimanual Mobile Manipulation

Marcelino Julio Fernando, Miguel Altamirano Cabrera, Jeffrin Sam, Yara Mahmoud, Konstantin Gubernatorov, Dzmitry Tsetserukou

Comments 6 pages, 7 figures

2604.19520 2026-04-22 cs.AI

SimDiff: Depth Pruning via Similarity and Difference

Yuli Chen, Shuhao Zhang, Fanshen Meng, Bo Cheng, Jiale Han, Qiang Tong, Xiulei Liu

2604.19518 2026-04-22 cs.LG cs.SY eess.SY

Accelerating Optimization and Machine Learning through Decentralization

Ziqin Chen, Zuang Wang, Yongqiang Wang

2604.19516 2026-04-22 cs.AI

From Experience to Skill: Multi-Agent Generative Engine Optimization via Reusable Strategy Learning

Beining Wu, Fuyou Mao, Jiong Lin, Cheng Yang, Jiaxuan Lu, Yifu Guo, Siyu Zhang, Yifan Wu, Ying Huang, Fu Li

Comments ACL 2026 Findings

2604.19514 2026-04-22 cs.LG cs.AI cs.CR cs.SI

When Graph Structure Becomes a Liability: A Critical Re-Evaluation of Graph Neural Networks for Bitcoin Fraud Detection under Temporal Distribution Shift

Saket Maganti

Comments Code to be released soon

2604.19510 2026-04-22 cs.CV

Evaluating Histogram Matching for Robust Deep learning-Based Grapevine Disease Detection

Ruben Pascual, Inés Hernández, Salvador Gutiérrez, Javier Tardaguila, Pedro Melo-Pinto, Daniel Paternain, Mikel Galar

2604.19509 2026-04-22 cs.RO cs.MA

Assessing VLM-Driven Semantic-Affordance Inference for Non-Humanoid Robot Morphologies

Jess Jones, Raul Santos-Rodriguez, Sabine Hauert

Comments AAMAS 2026 (main track), 9 pages, 4 figures

2604.19508 2026-04-22 cs.CL

Bangla Key2Text: Text Generation from Keywords for a Low Resource Language

Tonmoy Talukder, G M Shahariar

Comments 18 pages, uses lrec2026.sty

2604.19489 2026-04-22 cs.CV cs.CY

Seeing Candidates at Scale: Multimodal LLMs for Visual Political Communication on Instagram

Michael Achmann-Denkler, Mario Haim, Christian Wolff

Comments An earlier version was presented at #SMSociety 2024 (London)

2604.19488 2026-04-22 cs.AI

CoDA: Towards Effective Cross-domain Knowledge Transfer via CoT-guided Domain Adaptation

Jianzhi Yan, Le Liu, Buzhou Tang, Yang Xiang, Dongning Sun, Zhiming Li

Comments 12 pages, 6 figures

2604.19485 2026-04-22 cs.LG cs.AI cs.CL

EVPO: Explained Variance Policy Optimization for Adaptive Critic Utilization in LLM Post-Training

Chengjun Pan, Shichun Liu, Jiahang Lin, Dingwei Zhu, Jiazheng Zhang, Shihan Dou, Songyang Gao, Zhenhua Han, Binghai Wang, Rui Zheng, Xuanjing Huang, Tao Gui, Yansong Feng

2604.19480 2026-04-22 cs.CV

Deep sprite-based image models: An analysis

Zeynep Sonat Baltacı, Romain Loiseau, Mathieu Aubry

2604.19477 2026-04-22 cs.SD cs.CL

Deep Supervised Contrastive Learning of Pitch Contours for Robust Pitch Accent Classification in Seoul Korean

Hyunjung Joo, GyeongTaek Lee

2604.19473 2026-04-22 cs.CV

TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation

Hongyu Zhang, Yufan Deng, Zilin Pan, Peng-Tao Jiang, Bo Li, Qibin Hou, Zhiyang Dou, Zhen Dong, Daquan Zhou

Comments ICLR 2026, code available at: https://github.com/Hong-yu-Zhang/TS-Attn

2604.19469 2026-04-22 cs.RO cs.SY eess.SY

Wrench-Aware Admittance Control for Unknown-Payload Manipulation

Hossein Gholampour, Logan E. Beaver

2604.19464 2026-04-22 cs.CL cs.AI

LePREC: Reasoning as Classification over Structured Factors for Assessing Relevance of Legal Issues

Fanyu Wang, Xiaoxi Kang, Paul Burgess, Aashish Srivastava, Chetan Arora, Adnan Trakic, Lay-Ki Soon, Md Khalid Hossain, Lizhen Qu

Comments Accepted by ACL 2026 Main Conference

2604.19459 2026-04-22 cs.AI cs.CL cs.LO

Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning

Kyuhee Kim, Auguste Poiroux, Antoine Bosselut

Comments 25 pages, 4 figures, 22 tables. Published at the VerifAI-2 Workshop, ICLR 2026 (non-archival). Code and data: https://github.com/koreankiwi99/formalization-gaming

2604.19457 2026-04-22 cs.AI

Four-Axis Decision Alignment for Long-Horizon Enterprise AI Agents

Vasundra Srininvasan

Comments 21 pages, 5 figures, 8 tables. PDFLaTeX. Code and artifacts: https://github.com/vasundras/decision-alignment-long-horizon-agents

详情

英文摘要

Long-horizon enterprise agents make high-stakes decisions (loan underwriting, claims adjudication, clinical review, prior authorization) under lossy memory, multi-step reasoning, and binding regulatory constraints. Current evaluation reports a single task-success scalar that conflates distinct failure modes and hides whether an agent is aligned with the standards its deployment environment requires. We propose that long-horizon decision behavior decomposes into four orthogonal alignment axes, each independently measurable and failable: factual precision (FRP), reasoning coherence (RCS), compliance reconstruction (CRR), and calibrated abstention (CAR). CRR is a novel regulatory-grounded axis; CAR is a measurement axis separating coverage from accuracy. We exercise the decomposition on a controlled benchmark (LongHorizon-Bench) covering loan qualification and insurance claims adjudication with deterministic ground-truth construction. Running six memory architectures, we find structure aggregate accuracy cannot see: retrieval collapses on factual precision; schema-anchored architectures pay a scaffolding tax; plain summarization under a fact-preservation prompt is a strong baseline on FRP, RCS, EDA, and CRR; and all six architectures commit on every case, exposing a decisional-alignment axis the field has not targeted. The decomposition also surfaced a pre-registered prediction of our own, that summarization would fail factual recall, which the data reversed at large magnitude, an axis-level reversal aggregate accuracy would have hidden. Institutional alignment (regulatory reconstruction) and decisional alignment (calibrated abstention) are under-represented in the alignment literature and become load-bearing once decisions leave the laboratory. The framework transfers to any regulated decisioning domain via two steps: build a fact schema, and calibrate the CRR auditor prompt.

URL PDF HTML ☆

赞 0 踩 0

2604.19453 2026-04-22 cs.LG

ZC-Swish: Stabilizing Deep BN-Free Networks for Edge and Micro-Batch Applications

Suvinava Basak

2604.19451 2026-04-22 cs.LG stat.ML

Heterogeneity-Aware Personalized Federated Learning for Industrial Predictive Analytics

Yuhan Hu, Xiaolei Fang

2604.19447 2026-04-22 cs.CL

'The Order in the Horse's Heart': A Case Study in LLM-Assisted Stylometry for the Discovery of Biblical Allusion in Modern Literary Fiction

Ewan Cameron

Comments 39 pages, 1 figure

2604.19445 2026-04-22 cs.CV

LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results

Xiang Chen, Hao Li, Jiangxin Dong, Jinshan Pan, Xin Li, Xin He, Naiwei Chen, Shengyuan Li, Fengning Liu, Haoyi Lv, Haowei Peng, Yilian Zhong, Yuxiang Chen, Shibo Yin, Yushun Fang, Xilei Zhu, Yahui Wang, Chen Lu, Kaibin Chen, Xu Zhang, Xuhui Cao, Jiaqi Ma, Ziqi Wang, Shengkai Hu, Yuning Cui, Huan Zhang, Shi Chen, Bin Ren, Lefei Zhang, Guanglu Dong, Qiyao Zhao, Tianheng Zheng, Chunlei Li, Lichao Mou, Chao Ren, Wangzhi Xing, Xin Lu, Enxuan Gu, Jingxi Zhang, Diqi Chen, Qiaosi Yi, Bingcai Wei, Mingyu Liu, Pengyu Wang, Ce Liu, Miaoxin Guan, Boyu Chen, Hongyu Li, Jian Zhu, Xinrui Luo, Ziyang He, Jiayu Wang, Yichen Xiang, Huayi Qi, Haoyu Bian, Yiran Li, Sunlichen Zhou

Comments CVPR Workshops 2026; https://lowlevelcv.com/

2604.19444 2026-04-22 cs.LG

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

Thomas Zollo, Jimmy Wang, Richard Zemel

Comments 41 pages, 14 tables, 12 figures

2604.19440 2026-04-22 cs.CL cs.NE

What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search

Xinhao Zhang, Xi Chen, François Portet, Maxime Peyrard

Comments 9 pages, 8 figures, Accepted at Findings of ACL 2026

2604.19432 2026-04-22 cs.CV

DINO Eats CLIP: Adapting Beyond Knowns for Open-set 3D Object Retrieval

Xinwei He, Yansong Zheng, Qianru Han, Zhichuan Wang, Yuxuan Cai, Yang Zhou, Jingbo Xia, Yulong Wang, Jinhai Xiang, Xiang Bai

Comments Accepted to CVPR 2026

2604.19420 2026-04-22 cs.CV

TESO: Online Tracking of Essential Matrix by Stochastic Optimization

Jaroslav Moravec, Radim Šára, Akihiro Sugimoto

Comments Accepted at CVPR 2026 (Oral)

2604.19419 2026-04-22 cs.RO cs.NA math.DG math.DS math.NA physics.class-ph

Forward Dynamics of Variable Topology Mechanisms - The Case of Constraint Activation

Andreas Mueller

2604.19412 2026-04-22 cs.CV cs.CL

VCE: A zero-cost hallucination mitigation method of LVLMs via visual contrastive editing

Yanbin Huang, Yisen Li, Guiyao Tie, Xiaoye Qu, Pan Zhou, Hongfei Wang, Zhaofan Zou, Hao Sun, Xuelong Li

Comments ICASSP 2026