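arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and related fields.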

2412.03594 2026-04-23 cs.CL cs.AI cs.DC cs.LG

BatchLLM: Optimizing Large Batched LLM Inference with Global Prefix Sharing and Throughput-oriented Token Batching

Zhen Zheng, Xin Ji, Taosong Fang, Fanghao Zhou, Chuanjie Liu, Gang Peng

Comments Accepted at MLSys 2026

Abstract

Large language models (LLMs) increasingly play an important role in a wide range of information processing and management tasks in industry. Many of these tasks are performed in large batches or even offline, and their key performance indicator is throughput. These tasks usually exhibit prefix sharing, where different prompt inputs partially share a common prefix. However, existing LLM inference engines tend to optimize for streaming requests and are limited in supporting large batched tasks with prefix sharing. Existing solutions use an LRU-based cache to reuse the KV context of a common prefix between requests, but under this implicit cache management, KV context that is about to be reused may be prematurely evicted. Moreover, streaming-oriented systems do not leverage request-batch information and cannot mix decoding tokens with prefill chunks well in batched scenarios, and thus fail to saturate the GPU. We propose BatchLLM to address these problems. BatchLLM explicitly identifies the common prefixes globally. Requests sharing the same prefix are scheduled together to maximize reuse of the KV context. BatchLLM reorders the requests, scheduling those with a larger ratio of decoding first to better mix the decoding tokens with later prefill chunks, and applies memory-centric token batching to enlarge the token-batch sizes, which helps increase GPU utilization. Extensive evaluation shows that BatchLLM outperforms vLLM and SGLang by $1.3\times$ to $10.8\times$ on a set of microbenchmarks and a typical industry workload under different hardware environments. Code is available at https://github.com/microsoft/MixLLM/tree/batchllm_vllm_064.


2604.20675 2026-04-23 cs.LG

Improving clinical interpretability of linear neuroimaging models through feature whitening

Sara Petiton, Antoine Grigis, Raphaël Vock, Edouard Duchesnay

2604.20666 2026-04-23 cs.CL cs.AI

ORPHEAS: A Cross-Lingual Greek-English Embedding Model for Retrieval-Augmented Generation

Ioannis E. Livieris, Athanasios Koursaris, Alexandra Apostolopoulou, Konstantinos Kanaris, Dimitris Tsakalidis, George Domalis

Comments This paper has been accepted for presentation at Engineering Applications and Advances of Artificial Intelligence 2026 (EAAAI'26)

2604.20659 2026-04-23 cs.LG cs.AI

GRPO-VPS: Enhancing Group Relative Policy Optimization with Verifiable Process Supervision for Effective Reasoning

Jingyi Wang, Lei Zhu, Tengjin Weng, Song-Li Wu, Haochen Tan, Jierun Chen, Chaofan Tao, Haoli Bai, Lu Hou, Lifeng Shang, Xiao-Ping Zhang

2604.20652 2026-04-23 cs.AI cs.HC econ.GN q-fin.EC

Large Language Models Outperform Humans in Fraud Detection and Resistance to Motivated Investor Pressure

Nattavudh Powdthavee

Comments 36 pages

2604.20651 2026-04-23 cs.AI

CHORUS: An Agentic Framework for Generating Realistic Deliberation Data

A. Koursaris, G. Domalis, A. Apostolopoulou, K. Kanaris, D. Tsakalidis, I. E. Livieris

Comments This paper has been accepted for presentation at Engineering Applications and Advances of Artificial Intelligence 2026

2604.20627 2026-04-23 cs.LG cs.RO

Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning

Aravind Venugopal, Jiayu Chen, Xudong Wu, Chongyi Zheng, Benjamin Eysenbach, Jeff Schneider

Comments ICLR 2026

2604.20622 2026-04-23 cs.AI cs.LG cs.MA

pAI/MSc: ML Theory Research with Humans on the Loop

Mahmoud Abdelmoneum, Pierfrancesco Beneventano, Tomaso Poggio

Comments 34 pages, 7 tables

2604.20614 2026-04-23 cs.LG math.DS math.OC stat.ML

Too Sharp, Too Sure: When Calibration Follows Curvature

Alessandro Morosini, Matea Gjika, Tomaso Poggio, Pierfrancesco Beneventano

Comments 33 pages, 23 figures

2604.20606 2026-04-23 cs.CV cs.AI

Beyond ZOH: Advanced Discretization Strategies for Vision Mamba

Fady Ibrahim, Guangjun Liu, Guanghui Wang

2604.20601 2026-04-23 cs.AI cs.CL

Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning

Zoya Volovikova, Nikita Sorokin, Dmitriy Lukashevskiy, Aleksandr Panov, Alexey Skrynnik

2604.20596 2026-04-23 cs.LG cs.CR

Differentially Private Clustered Federated Learning with Privacy-Preserving Initialization and Normality-Driven Aggregation

Jie Xu, Haaris Mehmood, Rogier Van Dalen, Karthikeyan Saravanan, Mete Ozay

Comments Accepted to ICASSP 2026 (Oral)

2604.20594 2026-04-23 cs.CV

Physics-Informed Conditional Diffusion for Motion-Robust Retinal Temporal Laser Speckle Contrast Imaging

Qian Chen, Yuehao Chen, Qiang Wang, Lei Zhu, Yanye Lu, Qiushi Ren

2604.20591 2026-04-23 cs.CV

Structure-Augmented Standard Plane Detection with Temporal Aggregation in Blind-Sweep Fetal Ultrasound

Keli Niu, He Zhao, Qianhui Men

2604.20586 2026-04-23 cs.LG cs.SY eess.SY

A Hierarchical MARL-Based Approach for Coordinated Retail P2P Trading and Wholesale Market Participation of DERs

Patrick Wilk, Ethan Cantor, Yikui Liu, Jie Li

Comments 11 pages, 6 figures, 7 tables

2604.20585 2026-04-23 cs.CV

On the Impact of Face Segmentation-Based Background Removal on Recognition and Morphing Attack Detection

Eduarda Caldeira, Guray Ozgur, Fadi Boutros, Naser Damer

Comments Accepted at FG 2026

2604.20574 2026-04-23 cs.CV

Where are they looking in the operating room?

Keqi Chen, Séraphin Baributsa, Lilien Schewski, Vinkle Srivastav, Didier Mutter, Guido Beldi, Sandra Keller, Nicolas Padoy

Abstract

Purpose: Gaze-following, the task of inferring where individuals are looking, has been widely studied in computer vision, advancing research in visual attention modeling, social scene understanding, and human-robot interaction. However, gaze-following has never been explored in the operating room (OR), a complex, high-stakes environment where visual attention plays an important role in surgical workflow analysis. In this work, we introduce the concept of gaze-following to the surgical domain, and demonstrate its great potential for understanding clinical roles, surgical phases, and team communications in the OR. Methods: We extend the 4D-OR dataset with gaze-following annotations, and extend the Team-OR dataset with gaze-following annotations and new team communication activity annotations. Then, we propose novel approaches to address clinical role prediction, surgical phase recognition, and team communication detection using a gaze-following model. For role and phase recognition, we propose a gaze heatmap-based approach that relies solely on gaze predictions; for team communication detection, we train a spatial-temporal model in a self-supervised way that encodes gaze-based clip features, and then feed the features into a temporal activity detection model. Results: Experimental results on the 4D-OR and Team-OR datasets demonstrate that our approach achieves state-of-the-art performance on all downstream tasks. Quantitatively, our approach obtains F1 scores of 0.92 for clinical role prediction and 0.95 for surgical phase recognition. Furthermore, it significantly outperforms existing baselines in team communication detection, improving on the previous best performance by over 30%. Conclusion: We introduce gaze-following in the OR as a novel research direction in surgical data science, highlighting its great potential to advance surgical workflow analysis in computer-assisted interventions.


2604.20570 2026-04-23 cs.CV

Exploring Spatial Intelligence from a Generative Perspective

Muzhi Zhu, Shunyao Jiang, Huanyi Zheng, Zekai Luo, Hao Zhong, Anzhou Li, Kaijun Wang, Jintao Rong, Yang Liu, Hao Chen, Tao Lin, Chunhua Shen

Comments Accepted by CVPR 2026. Project page: https://aim-uofa.github.io/GSI-Bench/

2604.20564 2026-04-23 cs.CL

Where Reasoning Breaks: Logic-Aware Path Selection by Controlling Logical Connectives in LLMs Reasoning Chains

Seunghyun Park, Yuanyuan Lei

2604.20560 2026-04-23 cs.CL

LLM StructCore: Schema-Guided Reasoning Condensation and Deterministic Compilation

Serhii Zabolotnii

Comments 16 pages, 1 figure, 5 tables. Preprint of a paper accepted to the Third Workshop on Patient-oriented Language Processing (CL4Health), co-located with LREC-COLING 2026

2604.20557 2026-04-23 cs.RO

Passive Variable Impedance For Shared Control

Maximilian Mühlbauer, Nepomuk Werner, Ribin Balachandran, Thomas Hulin, João Silvério, Freek Stulp, Alin Albu-Schäffer

Comments submitted for publication at the IEEE Robotics and Automation Letters (RA-L)

2604.20556 2026-04-23 cs.CL cs.AI

LayerTracer: A Joint Task-Particle and Vulnerable-Layer Analysis framework for Arbitrary Large Language Model Architectures

Yuhang Wu, Qinyuan Liu, Qiuyang Zhao, Qingwei Chong

Comments 5 pages, 3 figures

2604.20549 2026-04-23 cs.CL cs.AI

Toward Cross-Lingual Quality Classifiers for Multilingual Pretraining Data Selection

Yassine Turki, Vinko Sabolčec, Bettina Messmer, Martin Jaggi

Comments Accepted at the 3rd Workshop on Navigating and Addressing Data Problems for Foundation Models (DATA-FM @ ICLR 2026). 31 pages, 4 figures

2604.20548 2026-04-23 cs.CL cs.AI cs.DL cs.IR

Enhancing Research Idea Generation through Combinatorial Innovation and Multi-Agent Iterative Search Strategies

Shuai Chen, Chengzhi Zhang

Comments Scientometrics

2604.20545 2026-04-23 cs.AI

Measuring the Machine: Evaluating Generative AI as Pluralist Sociotechnical Systems

Rebecca L. Johnson

Comments PhD Thesis - Author formatted. Original available on the University of Sydney library website

2604.20544 2026-04-23 cs.CV cs.AI

Evian: Towards Explainable Visual Instruction-tuning Data Auditing

Zimu Jia, Mingjie Xu, Andrew Estornell, Jiaheng Wei

Comments Accepted at ACL 2026

2604.20535 2026-04-23 cs.CL cs.HC

Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines

Hawau Olamide Toyin, Mutiah Apampa, Toluwani Aremu, Humaid Alblooshi, Ana Rita Valente, Gonçalo Leal, Zhengjun Yue, Zeerak Talat, Hanan Aldarmaki

Comments Submitted to Interspeech 2026

2604.20531 2026-04-23 cs.CL

Effects of Cross-lingual Evidence in Multilingual Medical Question Answering

Anar Yeginbergen, Maite Oronoz, Rodrigo Agerri

2604.20511 2026-04-23 cs.LG cs.AI cs.CL cs.CV cs.CY

CHASM: Unveiling Covert Advertisements on Chinese Social Media

Jingyi Zheng, Tianyi Hu, Yule Liu, Zhen Sun, Zongmin Zhang, Zifan Peng, Wenhan Dong, Xinlei He

Comments NeurIPS 2025 (Datasets and Benchmarks Track)

2604.20505 2026-04-23 cs.LG

Explicit Dropout: Deterministic Regularization for Transformer Architectures

Vidhi Agrawal, Illia Oleksiienko, Alexandros Iosifidis