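arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems science, and more.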

2510.07043 2026-04-20 cs.LG

COMPASS: Benchmarking Constrained Optimization in LLM Agents

Tian Qin, Felix Bai, Ting-Yao Hu, Raviteja Vemulapalli, Hema Swetha Koppula, Zhiyang Xu, Bowen Jin, Mert Cemri, Jiarui Lu, Zirui Wang, Meng Cao

Abstract

Human decision-making often involves constrained optimization. As LLM agents are deployed to assist with real-world tasks like travel planning, shopping, and scheduling, they must mirror this capability. We introduce COMPASS, a benchmark that evaluates whether LLM agents can perform constrained optimization in realistic travel planning settings. To succeed in these tasks, agents must engage in multi-turn conversations with users to gather task information and use tools to gather information from the database. Agents must then propose a solution that not only satisfies hard constraints but also optimizes the user's utility objective. Evaluating state-of-the-art models, we reveal a significant feasible-optimal gap: while models achieve 70-90% feasibility (constraint satisfaction), they reach only 20-60% optimality (utility optimization). Our analysis shows that tool use is not the bottleneck. Instead, the core limitation is insufficient exploration of the search space, with success strongly correlating with information gathered. Coding agents show a promising approach to mitigate this gap. Together, COMPASS provides a testbed for developing LLM agents that can truly mirror human decision-making by both satisfying constraints and optimizing objectives.
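
The distinction between feasibility and optimality in the abstract above can be illustrated with a small sketch. This is not the benchmark's scoring code: the `Hotel` type, the single budget constraint, and the use of rating as the utility are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hotel:
    """Hypothetical candidate option; not a COMPASS data structure."""
    name: str
    price: float
    rating: float


def is_feasible(hotel: Hotel, budget: float) -> bool:
    # Hard constraint (assumed for illustration): price must fit the budget.
    return hotel.price <= budget


def evaluate(proposal: Hotel, candidates: list[Hotel], budget: float) -> tuple[bool, bool]:
    """Return (feasible, optimal) for a proposed option.

    A proposal is optimal only if it is feasible AND no other feasible
    candidate has a strictly higher utility (here: rating).
    """
    feasible = is_feasible(proposal, budget)
    feasible_ratings = [h.rating for h in candidates if is_feasible(h, budget)]
    best = max(feasible_ratings, default=None)
    optimal = feasible and best is not None and proposal.rating == best
    return feasible, optimal


candidates = [
    Hotel("A", price=120.0, rating=4.1),
    Hotel("B", price=150.0, rating=4.6),
    Hotel("C", price=300.0, rating=4.9),  # best rating, but over budget
]

# Proposing "A" satisfies the budget but misses the better feasible option "B".
print(evaluate(candidates[0], candidates, budget=200.0))
# Proposing "B" is both feasible and optimal under this budget.
print(evaluate(candidates[1], candidates, budget=200.0))
```

An agent that stops searching after finding "A" would count toward feasibility but not optimality, which is the kind of gap the abstract attributes to insufficient exploration.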

2510.06953 2026-04-20 cs.AI cs.CL

Revisiting the Uniform Information Density Hypothesis in LLM Reasoning

Minju Gwak, Guijin Son, Jaehyung Kim

Comments ACL 2026 Findings

2509.25897 2026-04-20 cs.CL cs.AI cs.CY

RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity

Jisu Shin, Hoyun Song, Juhyun Oh, Changgeon Ko, Eunsu Kim, Chani Jung, Alice Oh

Comments Accepted to Findings of ACL 2026

2509.21623 2026-04-20 cs.CL cs.AI cs.LG

OjaKV: Context-Aware Online Low-Rank KV Cache Compression

Yuxuan Zhu, David H. Yang, Mohammad Mohammadi Amiri, Keerthiram Murugesan, Tejaswini Pedapati, Pin-Yu Chen

Abstract

The expanding long-context capabilities of large language models are constrained by a significant memory bottleneck: the key-value (KV) cache required for autoregressive generation. This bottleneck is substantial; for instance, a Llama-3.1-8B model processing a 32K-token prompt at a batch size of 4 requires approximately 16GB for its KV cache, a size exceeding the model's weights. While KV-cache compression via low-rank projection is a promising direction, existing methods rely on a static, offline-learned subspace that performs poorly under data distribution shifts. To overcome these limitations, we introduce OjaKV, a novel framework that integrates a strategic hybrid storage policy with online subspace adaptation. First, OjaKV recognizes that not all tokens are equally important for compression; it preserves the crucial first and most recent tokens in full-rank, maintaining high-fidelity anchors for attention. Second, for the vast majority of intermediate tokens, it applies low-rank compression by incrementally adapting the projection basis using Oja's algorithm for online principal component analysis. This adaptation involves a comprehensive update during prompt prefilling and lightweight periodic updates during decoding, ensuring the subspace remains aligned with the evolving context. Crucially, our framework is fully compatible with modern attention modules like FlashAttention. Experiments demonstrate that OjaKV maintains or even improves zero-shot accuracy at high compression ratios. In particular, OjaKV achieves its strongest gains on very long-context benchmarks that require complex reasoning, highlighting the importance of online subspace adaptation in dynamically tracking context shifts. These results establish our hybrid framework as a practical, plug-and-play solution for memory-efficient long-context inference without requiring model fine-tuning.
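
The 16GB figure quoted in the abstract above can be reproduced with back-of-the-envelope arithmetic. The sketch below assumes values not stated in the abstract: 32 transformer layers, 8 key-value heads, a head dimension of 128 (the publicly documented Llama-3.1-8B configuration), 16-bit storage, and "32K" read as 32,768 tokens.

```python
def kv_cache_bytes(
    num_layers: int,
    num_kv_heads: int,
    head_dim: int,
    seq_len: int,
    batch_size: int,
    bytes_per_element: int = 2,  # assumed 16-bit (fp16/bf16) storage
) -> int:
    """Size of an uncompressed KV cache in bytes.

    The leading factor of 2 accounts for storing both keys and values.
    """
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_element
    return per_token * seq_len * batch_size


# Assumed Llama-3.1-8B configuration (not given in the abstract).
total = kv_cache_bytes(
    num_layers=32,
    num_kv_heads=8,
    head_dim=128,
    seq_len=32 * 1024,
    batch_size=4,
)
print(f"{total / 2**30:.1f} GiB")
```

Under these assumptions each token costs 128 KiB of cache, so a single 32K-token sequence takes 4 GiB and a batch of four takes 16 GiB, consistent with the abstract's estimate.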

2509.21010 2026-04-20 cs.LG cs.AI

Bridging the phenotype-target gap for molecular generation via multi-objective reinforcement learning

Haotian Guo, Hui Liu

2509.19104 2026-04-20 cs.LG stat.ML

Online Distributionally Robust LLM Alignment via Regression to Relative Reward

Sharan Sahu, Martin T. Wells

Comments 70 pages, 7 figures, 1 table

2509.17292 2026-04-20 cs.CL cs.AI

Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distortion Detection

Jun Seo Kim, Hyemi Kim, Woo Joo Oh, Hongjin Cho, Hochul Lee, Hye Hyeon Kim

Comments Accepted to the main conference of ACL 2026

2509.11982 2026-04-20 cs.LG cs.CY

Examining the Relationship between Scientific Publishing Activity and Hype-Driven Financial Bubbles: A Comparison of the Dot-Com and AI Eras

Aksheytha Chelikavada, Casey C. Bennett

2509.05489 2026-04-20 cs.LG

Self-Aligned Reward: Towards Effective and Efficient Reasoners

Peixuan Han, Adit Krishnan, Gerald Friedland, Jiaxuan You, Chris Kong

2509.01944 2026-04-20 cs.RO cs.CV

AutoDrive-R$^2$: Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving

Zhenlong Yuan, Chengxuan Qian, Jing Tang, Rui Chen, Zijian Song, Lei Sun, Xiangxiang Chu, Yujun Cai, Dapeng Zhang, Shuo Li

2508.21675 2026-04-20 cs.CL cs.CV cs.GR

Is this chart lying to me? Automating the detection of misleading visualizations

Jonathan Tonglet, Jan Zimny, Tinne Tuytelaars, Iryna Gurevych

Comments Camera-ready version accepted at ACL 2026 Main conference. Code and data available at: https://github.com/UKPLab/acl2026-misviz

2508.21322 2026-04-20 cs.RO

Robust Real-Time Coordination of CAVs: A Distributed Optimization Framework under Uncertainty

Haojie Bai, Tingting Zhang, Cong Guo, Yang Wang, Xiongwei Zhao, Hai Zhu

Comments Accepted by IEEE TVT

2508.16739 2026-04-20 cs.CV

Two-Stage Framework for Efficient UAV-Based Wildfire Video Analysis with Adaptive Compression and Fire Source Detection

Yanbing Bai, Rui-Yang Ju, Lemeng Zhao, Junjie Hu, Jianchao Bi, Erick Mas, Shunichi Koshimura

Comments IEEE JSTARS; Extended Journal Version of IROS 2024

2508.01345 2026-04-20 cs.CV

Predicting Video Slot Attention Queries from Random Slot-Feature Pairs

Rongzhen Zhao, Jian Li, Juho Kannala, Joni Pajarinen

Comments Accepted to AAAI 2026

2507.18756 2026-04-20 cs.LG cs.IR

Exploitation Over Exploration: Unmasking the Bias in Linear Bandit Recommender Offline Evaluation

Pedro R. Pires, Gregorio F. Azevedo, Pietro L. Campos, Rafael T. Sereicikas, Tiago A. Almeida

Comments Published in RecSys'25, 10 pages, 3 figures

2507.16727 2026-04-20 cs.AI

Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints

Zhenyun Yin, Shujie Wang, Xuhong Wang, Xingjun Ma, Yinchun Wang

Comments Accepted by ACL 2026

2507.02935 2026-04-20 cs.CL cs.AI cs.MA

Theory of Mind in Action: The Instruction Inference Task in Dynamic Human-Agent Collaboration

Fardin Saad, Pradeep K. Murukannaiah, Munindar P. Singh

Comments 66 pages with appendix, 10 figures (Appendix: 26 Figures), 11 tables. Code available at: https://github.com/fardinsaad/Tomcat-LLM

2506.20020 2026-04-20 cs.AI cs.CL

Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning

Saloni Dash, Amélie Reymond, Emma S. Spiro, Aylin Caliskan

Comments ACL Findings 2026

2506.19840 2026-04-20 cs.CV

GenHSI: Controllable Generation of Human-Scene Interaction Videos

Zekun Li, Rui Zhou, Rahul Sajnani, Xiaoyan Cong, Daniel Ritchie, Srinath Sridhar

Abstract

Large-scale pre-trained video diffusion models have exhibited remarkable capabilities in diverse video generation. However, existing solutions face several challenges in generating long videos with rich human-scene interactions (HSI), including unrealistic dynamics and affordance, lack of subject identity preservation, and the need for expensive training. To this end, we propose GenHSI, a training-free method for controllable generation of long HSI videos with 3D awareness. Taking inspiration from movie animation, we subdivide the video synthesis into three stages: (1) script writing, (2) pre-visualization, and (3) animation. Given an image of a scene and a character with a user description, we use these three stages to generate long videos that preserve human identity and provide rich and plausible HSI. Script writing converts a complex text prompt involving a chain of HSI into simple atomic actions that are used in the pre-visualization stage to generate 3D keyframes. To synthesize plausible human interaction poses in 3D keyframes, we utilize pre-trained 2D inpainting diffusion models to generate plausible 2D human interactions based on view canonicalization, which eliminates the need for multi-view fitting in previous works. We then extend these interactions to 3D using robust iterative optimization, informed by contact cues and reasoning from VLMs. Prompted by these 3D keyframes, the pretrained video diffusion models can better generate consistent long videos with plausible dynamics and affordance in a 3D-aware manner. We are the first to synthesize a long video sequence with a chain of HSI actions without training based on the image references of the scene and character. Experiments demonstrate that our method can generate HSI videos that effectively preserve scene content and character identity with plausible human-scene interaction from a single image scene.

2506.15064 2026-04-20 cs.LG cs.NA cs.NE math.NA

HiPreNets: High-Precision Neural Networks through Progressive Training

Ethan Mulle, Wei Kang, Qi Gong

2506.13215 2026-04-20 cs.CV

DVP-MVS++: Synergize Depth-Normal-Edge and Harmonized Visibility Prior for Multi-View Stereo

Zhenlong Yuan, Dapeng Zhang, Zehao Li, Chengxuan Qian, Jianing Chen, Yinda Chen, Kehua Chen, Tianlu Mao, Zhaoxin Li, Hao Jiang, Zhaoqi Wang

2506.10315 2026-04-20 cs.LG

PyLO: Towards Accessible Learned Optimizers in PyTorch

Paul Janson, Benjamin Therien, Quentin Anthony, Xiaolong Huang, Abhinav Moudgil, Eugene Belilovsky

Comments Accepted at MLSys Conference 2026

2506.08125 2026-04-20 cs.LG cs.CL

Not All Tokens Matter: Towards Efficient LLM Reasoning via Token Significance in Reinforcement Learning

Hanbing Liu, Lang Cao, Yuanyi Ren, Mengyu Zhou, Haoyu Dong, Xiaojun Ma, Shi Han, Dongmei Zhang

2506.03103 2026-04-20 cs.CV

DyTact: Capturing Dynamic Contacts in Hand-Object Manipulation

Xiaoyan Cong, Angela Xing, Chandradeep Pokhariya, Rao Fu, Srinath Sridhar

Comments 3DV 2026 Oral, Webpage: https://ivl.cs.brown.edu/research/dytact

2505.23383 2026-04-20 cs.LG

Automated Modeling Method for Pathloss Model Discovery

Ahmad Anaqreh, Shih-Kai Chou, Blaž Bertalanič, Mihael Mohorčič, Thomas Lagkas, Carolina Fortuna

DOI: 10.1109/TMC.2026.3683609

Abstract

Modeling propagation is the cornerstone for designing and optimizing next-generation wireless systems, with particular emphasis on the 5G-and-beyond era. Traditional modeling methods have long relied on statistics-based techniques to characterize propagation behavior across different environments. With the expansion of wireless communication systems, there is a growing demand for methods that guarantee the accuracy and interpretability of modeling. Artificial intelligence (AI)-based techniques, in particular, are increasingly being adopted to overcome this challenge, although interpretability is not assured with most of these methods. Inspired by recent advancements in AI, this paper proposes a novel approach that accelerates the discovery of path loss models while maintaining interpretability. The proposed method automates the formulation, evaluation, and refinement of the model, facilitating its discovery. We examine two techniques: one based on Deep Symbolic Regression, offering full interpretability, and the second based on Kolmogorov-Arnold Networks, providing two levels of interpretability. Both approaches are evaluated on two synthetic and two real-world datasets. Our results show that Kolmogorov-Arnold Networks achieve a coefficient of determination R^2 close to 1 with minimal prediction error, while Deep Symbolic Regression generates compact models with moderate accuracy. Moreover, on the selected examples, we demonstrate that automated methods outperform traditional methods, achieving up to a 75% reduction in prediction errors and offering accurate and explainable solutions with the potential to increase the efficiency of discovering next-generation path loss models.
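
For readers unfamiliar with the metric named in the abstract above, the coefficient of determination is defined as one minus the ratio of the residual sum of squares to the total sum of squares. The following is a minimal sketch of that standard definition; the sample values are made up for illustration and are not path loss data from the paper.

```python
def r_squared(y_true: list[float], y_pred: list[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((t - mean) ** 2 for t in y_true)               # total sum of squares
    return 1.0 - ss_res / ss_tot


# Illustrative values only.
measured = [1.0, 2.0, 3.0]
print(r_squared(measured, [1.0, 2.0, 3.0]))  # perfect predictions
print(r_squared(measured, [2.0, 2.0, 2.0]))  # always predicting the mean
```

A value close to 1 means the model explains nearly all of the variance in the measurements, while a model that only predicts the mean scores 0.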

2505.21569 2026-04-20 cs.LG cs.AI cs.CL

ChemAmp: Amplified Chemistry Tools via Composable Agents

Zhucong Li, Powei Chang, Jin Xiao, Zhijian Zhou, Qianyu He, Jiaqing Liang, Fenglei Cao, Xu Yinghui, Yuan Qi

Comments Accepted to ACL 2026 Findings; Code available at https://github.com/Chang-pw/ChemAmp

2505.16176 2026-04-20 cs.AI cs.CL

Dynamic Sampling that Adapts: Self-Aware Iterative Data Persistent Optimization for Mathematical Reasoning

Jun Rao, Xuebo Liu, Hexuan Deng, Zepeng Lin, Zixiong Yu, Jiansheng Wei, Xiaojun Meng, Min Zhang

Comments ACL 2026 Findings

2505.13792 2026-04-20 cs.CL cs.AI

Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation

Siddhant Bhambri, Upasana Biswas, Subbarao Kambhampati

Comments Accepted at The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)

2505.11274 2026-04-20 cs.AI cs.CL

SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning

Zheng Li, Qingxiu Dong, Jingyuan Ma, Di Zhang, Kai Jia, Zhifang Sui

2504.20966 2026-04-20 cs.LG

Softpick: No Attention Sink, No Massive Activations with Rectified Softmax

Zayd M. K. Zuhri, Erland Hilman Fuadi, Alham Fikri Aji

Comments Updated to camera-ready version