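arXivDaily daily academic digest: synchronized with the full arXiv dataset, with AI-generated summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering and systems, and other fields.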

Shangbin Feng, Yuyang Bai, Ziyuan Yang, Yike Wang, Zhaoxuan Tan, Jiajie Yan, Zhenyu Lei, Wenxuan Ding, Weijia Shi, Haojin Wang, Zhenting Qi, Yuru Jiang, Heng Wang, Chengsong Huang, Yu Fei, Jihan Yao, Yilun Du, Luke Zettlemoyer, Yejin Choi, Yulia Tsvetkov

Comments Moco is available at https://github.com/BunsenFeng/model_collaboration

2601.18231 2026-04-21 cs.LG cs.AI

Rethinking Cross-Modal Fine-Tuning: Optimizing the Interaction Between Feature Alignment and Target Fitting

Trong Khiem Tran, Manh Cuong Dao, Phi Le Nguyen, Thao Nguyen Truong, Trong Nghia Hoang

Comments Accepted at AISTATS 2026

2601.17230 2026-04-21 cs.CL cs.LG

CaseFacts: A Benchmark for Legal Fact-Checking and Precedent Retrieval

Akshith Reddy Putta, Jacob Devasier, Chengkai Li

2601.17172 2026-04-21 cs.CL cs.AI cs.CY cs.LG

Who Gets Which Message? Auditing Demographic Bias in LLM-Generated Targeted Text

Tunazzina Islam

Comments Accepted at Findings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026). Camera-ready

2601.15724 2026-04-21 cs.CV cs.AI

VideoThinker: Building Agentic VideoLLMs with LLM-Guided Tool Reasoning

Chenglin Li, Qianglong Chen, Feng Han, Yikun Wang, Xingxi Yin, Yan Gong, Ruilin Li, Yin Zhang, Jiaqi Wang

2601.15690 2026-04-21 cs.AI stat.AP

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Jiaxin Zhang, Wendi Cui, Zhuohang Li, Lifu Huang, Bradley Malin, Caiming Xiong, Chien-Sheng Wu

Comments This paper has been accepted by ACL 2026

2601.15625 2026-04-21 cs.LG cs.AI

Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors

Zhiwei Zhang, Fei Zhao, Rui Wang, Zezhong Wang, Bin Liang, Jiakang Wang, Yao Hu, Shaosheng Cao, Kam-Fai Wong

Comments 9 pages, 4 figures, 4 tables. Accepted to ACL 2026 Main Conference

2601.13707 2026-04-21 cs.CV cs.AI cs.LG

Attention-space Contrastive Guidance for Efficient Hallucination Mitigation in LVLMs

Yujin Jo, Sangyoon Bae, Taesup Kim

Comments Accepted at CVPR 2026 Findings

2601.13099 2026-04-21 cs.CL

Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs

Abdellah El Mekki, Samar M. Magdy, Houdaifa Atou, Ruwa AbuHweidi, Baraah Qawasmeh, Omer Nacar, Thikra Al-hibiri, Razan Saadie, Hamzah Alsayadi, Nadia Ghezaiel Hammouda, Alshima Alkhazimi, Aya Hamod, Al-Yas Al-Ghafri, Wesam El-Sayed, Asila Al sharji, Mohamad Ballout, Anas Belfathi, Karim Ghaddar, Serry Sibaee, Alaa Aoun, Areej Asiri, Lina Abureesh, Ahlam Bashiti, Majdal Yousef, Abdulaziz Hafiz, Yehdih Mohamed, Emira Hamedtou, Brakehe Brahim, Rahaf Alhamouri, Youssef Nafea, Aya El Aatar, Walid Al-Dhabyani, Emhemed Hamed, Sara Shatnawi, Fakhraddin Alwajih, Khalid Elkhidir, Ashwag Alasmari, Abdurrahman Gerrio, Omar Alshahri, AbdelRahim A. Elmadany, Ismail Berrada, Amir Azad Adli Alkathiri, Fadi A Zaraket, Mustafa Jarrar, Yahya Mohamed El Hadj, Hassan Alhuzali, Muhammad Abdul-Mageed

Comments Accepted to ACL 2026 Main; Project resources will be available here: https://github.com/UBC-NLP/Alexandria

2601.11886 2026-04-21 cs.CL

Faithfulness vs. Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence

Kaijie Mo, Siddhartha Venkatayogi, Chantal Shaib, Ramez Kouzy, Wei Xu, Byron C. Wallace, Junyi Jessy Li

Comments Accepted to Findings of ACL 2026

2601.11038 2026-04-21 cs.CL

Budget-Aware Anytime Reasoning with LLM-Synthesized Preference Data

Xuanming Zhang, Shwan Ashrafi, Aziza Mirsaidova, Amir H. Rezaeian, Miguel Ballesteros, Lydia B. Chilton, Zhou Yu, Dan Roth

Comments ACL 2026 Findings, 13 pages, 3 figures, 1 table

2601.09173 2026-04-21 cs.LG cs.CL q-bio.QM stat.ML

Geometric Stability: The Missing Axis of Representations

Prashant C. Raju


English abstract

Representational similarity analysis and related methods have become standard tools for comparing the internal geometries of neural networks and biological systems. These methods measure what is represented, the alignment between two representational spaces, but not whether that structure is robust. We introduce geometric stability, a distinct dimension of representational quality that quantifies how reliably a representation's pairwise distance structure holds under perturbation. Our metric, Shesha, measures self-consistency through split-half correlation of representational dissimilarity matrices constructed from complementary feature subsets. A key formal property distinguishes stability from similarity: Shesha is not invariant to orthogonal transformations of the feature space, unlike CKA and Procrustes, enabling it to detect compression-induced damage to manifold structure that similarity metrics cannot see. Spectral analysis reveals the mechanism: similarity metrics collapse after removing the top principal component, while stability retains sensitivity across the eigenspectrum. Across 2463 encoder configurations in seven domains -- language, vision, audio, video, protein sequences, molecular profiles, and neural population recordings -- stability and similarity are empirically uncorrelated ($ρ=-0.01$). A regime analysis shows this independence arises from opposing effects: geometry-preserving transformations make the metrics redundant, while compression makes them anti-correlated, canceling in aggregate. Applied to 94 pretrained models across 6 datasets, stability exposes a "geometric tax": DINOv2, the top-performing model for transfer learning, ranks last in geometric stability on 5/6 datasets. Contrastive alignment and hierarchical architecture predict stability, providing actionable guidance for model selection in deployment contexts where representational reliability matters.
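The split-half idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' Shesha implementation: the function names (`rdm_upper`, `split_half_stability`), the Euclidean distance, the Spearman rank correlation, the random equal-size feature splits, and the averaging over splits are all assumptions chosen for illustration.

```python
import numpy as np


def rdm_upper(X):
    """Upper triangle of the pairwise Euclidean distance matrix of the rows of X."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    d = np.sqrt(np.maximum(d2, 0.0))  # clip tiny negatives from round-off
    iu = np.triu_indices(X.shape[0], k=1)
    return d[iu]


def ranks(v):
    """Ordinal ranks of v (ties are not averaged; fine for continuous data)."""
    r = np.empty(len(v))
    r[np.argsort(v)] = np.arange(len(v))
    return r


def split_half_stability(X, n_splits=20, seed=0):
    """Mean rank correlation between RDMs built from complementary feature halves.

    Hypothetical stand-in for a split-half stability score, not the paper's metric.
    """
    rng = np.random.default_rng(seed)
    n_features = X.shape[1]
    scores = []
    for _ in range(n_splits):
        perm = rng.permutation(n_features)
        half_a, half_b = perm[: n_features // 2], perm[n_features // 2:]
        ra = ranks(rdm_upper(X[:, half_a]))
        rb = ranks(rdm_upper(X[:, half_b]))
        # Pearson correlation of ranks is the Spearman correlation
        scores.append(np.corrcoef(ra, rb)[0, 1])
    return float(np.mean(scores))


# Toy data (assumed): a shared low-dimensional structure versus pure noise
rng = np.random.default_rng(1)
latent = rng.normal(size=(50, 3))
structured = latent @ rng.normal(size=(3, 64)) + 0.05 * rng.normal(size=(50, 64))
noise = rng.normal(size=(50, 64))

stable_score = split_half_stability(structured)
noise_score = split_half_stability(noise)
```

In this toy setup the structured representation should score much higher than the noise one, because both feature halves see the same latent geometry. Since the halves are defined on the feature axes, rotating the feature space before splitting can change the score, which matches the abstract's point that the metric is not invariant to orthogonal transformations.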


2601.08276 2026-04-21 cs.AI

ACE-Router: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Zhiyuan Yao, Zishan Xu, Yifu Guo, Zhiguang Han, Cheng Yang, Shuo Zhang, Weinan Zhang, Xingshan Zeng, Weiwen Liu

2601.06767 2026-04-21 cs.CL cs.AI cs.LG

GanitLLM: Difficulty-Aware Bengali Mathematical Reasoning through Curriculum-GRPO

Shubhashis Roy Dipta, Khairul Mahbub, Nadia Najjar

Comments Accepted at ACL 2026 (Findings)

2601.06328 2026-04-21 cs.AI

C-World: A Computer Use Agent Environment Creator

Ziqiao Xi, Shuang Liang, Qi Liu, Jiaqing Zhang, Letian Peng, Fang Nan, Meshal Nayim, Tianhui Zhang, Rishika Mundada, Lianhui Qin, Biwei Huang, Kun Zhou

Comments Submitted to ACL 2026. 12 pages, 4 figures. Ziqiao Xi and Shuang Liang contributed equally to this work

2601.05654 2026-04-21 cs.CL cs.AI

Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction

Sejun Park, Yoonah Park, Jongwon Lim, Yohan Jo

Comments This paper has been accepted for publication at Findings of ACL 2026

2601.05062 2026-04-21 cs.CL cs.AI cs.LG

Compositional Steering of Large Language Models with Steering Tokens

Gorjan Radevski, Kiril Gashteovski, Giwon Hong, Carolin Lawrence, Goran Glavaš

Comments Accepted at ACL 2026

2601.04745 2026-04-21 cs.AI cs.IR

KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

Tingyu Wu, Zhisheng Chen, Ziyan Weng, Shuhe Wang, Chenglong Li, Shuo Zhang, Sen Hu, Silin Wu, Qizhen Lan, Huacan Wang, Ronghao Chen

2601.04744 2026-04-21 cs.SD cs.AI

Semi-Supervised Diseased Detection from Speech Dialogues with Multi-Level Data Modeling

Xingyuan Li, Mengyue Wu

Comments Accepted for publication as a Findings paper at the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)

2601.04609 2026-04-21 cs.CL

When More Words Say Less: Decoupling Length and Specificity in Image Description Evaluation

Rhea Kapur, Robert Hawkins, Elisa Kreiss

2601.04278 2026-04-21 cs.CL cs.AI cs.CR cs.LG

From Domains to Instances: Dual-Granularity Data Synthesis for LLM Unlearning

Xiaoyu Xu, Minxin Du, Zitong Li, Zi Liang, Zhibiao Guo, Shiyu Zhang, Peizhao Hu, Qingqing Ye, Haibo Hu

Comments ACL 2026 (Findings), accepted to appear

2601.03559 2026-04-21 cs.CL

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Shidong Cao, Hongzhan Lin, Yuxuan Gu, Ziyang Luo, Jing Ma

Comments DiffCoT improves multi-step LLM reasoning by applying diffusion-based iterative denoising to correct intermediate Chain-of-Thought steps

Journal ref The 64th Annual Meeting of the Association for Computational Linguistics 2026

2601.03043 2026-04-21 cs.CL cs.AI cs.LG

Lil: Less is Less When Applying Post-Training Sparse-Attention Algorithms in Long-Decode Stage

Junhao Hu, Fangze Li, Mingtao Xu, Feifan Meng, Shiju Zhao, Tiancheng Hu, Ting Peng, Anmin Liu, Wenrui Huang, Chenxu Liu, Ziyue Hua, Tao Xie

2601.02735 2026-04-21 cs.LG cs.DS cs.PF

Revisiting Forest Proximities via Sparse Leaf-Incidence Kernels

Adrien Aumon, Guy Wolf, Kevin R. Moon, Jake S. Rhodes

2601.00514 2026-04-21 cs.AI cs.CL

The Illusion of Insight in Reasoning Models

Liv G. d'Aliberti, Manoel Horta Ribeiro

2601.00296 2026-04-21 cs.CV

TimeColor: Flexible Reference Colorization via Temporal Concatenation

Bryan Constantine Sadihin, Yihao Meng, Michael Hua Wang, Matteo Jiahao Chen, Hang Su

Comments Our project page is available at https://bconstantine.github.io/TimeColor/

2512.20249 2026-04-21 cs.LG cs.CV eess.IV

Unified Multimodal Brain Decoding via Cross-Subject Soft-ROI Fusion

Xuanyu Hu

Comments 15 pages, 2 figures, 4 tables

2512.20033 2026-04-21 cs.CV

FlashLips: 100-FPS Mask-Free Latent Lip-Sync using Reconstruction Instead of Diffusion or GANs

Andreas Zinonos, Michał Stypułkowski, Antoni Bigata, Stavros Petridis, Maja Pantic, Nikita Drobyshev

2512.16055 2026-04-21 cs.CV cs.RO

Driving in Corner Case: A Real-World Adversarial Closed-Loop Evaluation Platform for End-to-End Autonomous Driving

Jiaheng Geng, Jiatong Du, Xinyu Zhang, Ye Li, Panqu Wang, Yanjun Huang

Comments Update some experimental details

2512.11988 2026-04-21 cs.CV

CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction

Xianghui Xie, Bowen Wen, Yan Chang, Hesam Rabeti, Jiefeng Li, Ye Yuan, Gerard Pons-Moll, Stan Birchfield

Comments CVPR2026 camera ready version. Project page: https://nvlabs.github.io/CARI4D/


MoCo: A One-Stop Shop for Model Collaboration Research