arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.06004 2026-05-08 cs.LG cs.AI math.ST stat.TH

A Fine-Grained Understanding of Uniform Convergence for Halfspaces

Aryeh Kontorovich, Kasper Green Larsen

详情

英文摘要

We study the fine-grained uniform convergence behavior of halfspaces beyond worst-case VC bounds. For inhomogeneous halfspaces in $\mathbb{R}^d$ with $d\ge 2$, we show that standard first-order VC bounds are essentially tight: even consistent hypotheses can incur population error $Θ(d\ln(n/d)/n)$, and in the agnostic setting the deviation scales as $\sqrt{τ\ln(1/τ)}$ at true error $τ$. In contrast, homogeneous halfspaces in $\mathbb{R}^2$ exhibit a markedly different behavior. In the realizable case, every hypothesis consistent with the sample has error $O(1/n)$. In the agnostic case, we prove a bandwise, log-free deviation bound on each dyadic risk band via a critical-wedge localization argument. Unioning over bands incurs only a $\ln\ln n$ overhead, and we establish a matching lower bound showing this overhead is unavoidable. Together, these results give a fine-grained and nearly complete picture of uniform convergence for halfspaces, revealing sharp dimensional and structural thresholds.

URL PDF HTML ☆

赞 0 踩 0

2605.05994 2026-05-08 cs.LG

DiBA: Diagonal and Binary Matrix Approximation for Neural Network Weight Compression

Nobutaka Ono

2605.05990 2026-05-08 cs.CV cs.AI

iPhoneBlur: A Difficulty-Stratified Benchmark for Consumer Device Motion Deblurring

Abdullah Al Shafi, Kazi Saeed Alam

Comments 21 Pages, 12 figures

2605.05985 2026-05-08 cs.AI cs.MA q-bio.QM

BioResearcher: Scenario-Guided Multi-Agent for Translational Medicine

Remigiusz Kinas, Joanna Krawczyk, Rafał Powalski, Przemysław Pietrzak, Agnieszka Kowalewska, Krzysztof Kolmus, Maciej Sypetkowski, Łukasz Smoliński, Tomasz Jetka

Comments 5 pages (main text), 21 pages (appendix), 8 figures, 11 tables

2605.05982 2026-05-08 cs.SD

Do Melody and Rhythm Coevolve?

Harin Lee, Rainer Polak, Manuel Anglada-Tort, Marc Schönwiesner, Minsu Park, Nori Jacoby

Comments 6 pages, 3 figures, to be included in Proceedings of the Annual Meeting of the Cognitive Science Society

2605.05980 2026-05-08 cs.AI

TACT: Mitigating Overthinking and Overacting in Coding Agents via Activation Steering

Yuan Sui, Yulin Chen, Yibo Li, Xue Jiang, Yufei He, Yihong Dong, Xiaoxin He, Tianyu Gao, Bryan Hooi

Comments Work in progress

2605.05979 2026-05-08 cs.CV

Prompt-Free and Efficient SAM2 Adaptation for Biomedical Semantic Segmentation via Dual Adapters

Hinako Mitsuoka, Kazuhiro Hotta

Comments Accepted by ICIP2026

2605.05977 2026-05-08 cs.AI

BehaviorGuard: Online Backdoor Defense for Deep Reinforcement Learning

Yinbo Yu, Xueyu Yin, Jiadai Wang, Chunwei Tian, Sai Xu, Qi Zhu, Daoqiang Zhang

Comments 11 pages

2605.05975 2026-05-08 cs.LG physics.flu-dyn

Physical Fidelity Reconstruction via Improved Consistency-Distilled Flow Matching for Dynamical Systems

Sicheng Ma, Tianyue Yang, Xiuzhe Wu, Xiao Xue

2605.05967 2026-05-08 cs.LG math.OC stat.ML

Sharper Guarantees for Misspecified Kernelized Bandit Optimization

Davide Maran, Csaba Szepesvári

2605.05965 2026-05-08 cs.LG cs.AI

Beyond Uniform Credit Assignment: Selective Eligibility Traces for RLVR

Chaoli Mou, Zhan Zhuang, Xinning Chen, Yu Zhang

2605.05963 2026-05-08 cs.AI cs.CL

TheraAgent: Self-Improving Therapeutic Agent for Precise and Comprehensive Treatment Planning

Junkai Li, Yunghwei Lai, Tianyi Zhu, Zheng Long Lee, Weizhi Ma, Yang Liu

Comments Accepted to ACL 2026

2605.05962 2026-05-08 cs.CL

Tatarstan Toponyms: A Bilingual Dataset and Hybrid RAG System for Geospatial Question Answering

Mullosharaf K. Arabov

Comments Preprint

2605.05960 2026-05-08 cs.RO

Plug-and-Play Label Map Diffusion for Universal Goal-Oriented Navigation

Zhixuan Shen, Yijie Zeng, Shengxiang Luo, Tianrui Li, Haonan Luo

Comments 21 pages, 10 figures, Extended Version of accepted ICML 2026 Paper

2605.05959 2026-05-08 cs.AI cs.DC cs.LG

From Coordinate Matching to Structural Alignment: Rethinking Prototype Alignment in Heterogeneous Federated Learning

Xinghao Wu, Jianwei Niu, Guogang Zhu, Xuefeng Liu, Shaojie Tang, Jiayuan Zhang

Comments 14 pages, 10 figures, 9 tables

详情

英文摘要

Heterogeneous federated learning (HtFL) aims to enable collaboration among clients that differ in both data distributions and model architectures. Prototype-based methods, which communicate class-level feature centers (prototypes) instead of full model parameters, have recently shown strong potential for HtFL. Existing prototype-based HtFL methods typically reuse the MSE-based or cosine-based alignment mechanism developed for homogeneous FL when aligning client-specific representations with global prototypes. These approaches are essentially coordinate alignment, where representations of clients are forced to match the global prototypes in the embedding space in an element-wise manner. Such alignment implicitly assumes that all clients should map their representations into the feature subspace defined by the global prototypes. This assumption is reasonable in homogeneous FL, where all clients share the same feature extractor. However, it becomes problematic in HtFL, since heterogeneous feature extractors naturally induce client-specific feature subspaces, and forcing all clients to optimize within a single global subspace unnecessarily suppresses their learning capacity. We observe that coordinate alignment implicitly couples two distinct objectives: aligning inter-class semantic structure, which is directly beneficial for classification, and enforcing a shared feature basis, which is unnecessary and even harmful under model heterogeneity. Building on this insight, we design FedSAF, which shifts the alignment objective from absolute coordinates to inter-class relational structure. We demonstrate that structural alignment consistently outperforms coordinate alignment in heterogeneous settings. Experiments on multiple benchmarks show that our structural alignment outperforms state-of-the-art prototype-based HtFL methods by up to 3.52\%.

URL PDF HTML ☆

赞 0 踩 0

2605.05955 2026-05-08 cs.CL cs.CV

TableVista: Benchmarking Multimodal Table Reasoning under Visual and Structural Complexity

Zheyuan Yang, Liqiang Shang, Junjie Chen, Xun Yang, Chenglong Xu, Bo Yuan, Chenyuan Jiao, Yaoru Sun, Yilun Zhao

Comments ACL 2026 Findings

2605.05953 2026-05-08 cs.CL cs.AI

Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits

Erik Nielsen, Elia Cunegatti, Marcus Vukojevic, Giovanni Iacca

2605.05951 2026-05-08 cs.AI

HaM-World: Soft-Hamiltonian World Models with Selective Memory for Planning

Haoyun Tang, Haodong Cui, Keyao Xu, Kun Wang, Zhandong Mei

Comments 22 pages, 5 figures. Code: https://github.com/HaoyunT/HaM_World

2605.05950 2026-05-08 cs.CL

Lightweight Stylistic Consistency Profiling: Robust Detection of LLM-Generated Textual Content for Multimedia Moderation

Siyuan Li, Aodu Wulianghai, Xi Lin, Xibin Yuan, Qinghua Mao, Guangyan Li, Xiang Chen, Jun Wu, Jianhua Li

2605.05097 2026-05-08 cs.LG cs.AI cs.CL

Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics

Andreas Pattichis, Constantine Dovrolis

Comments Preprint. 9 pages, 2 figures

2605.05014 2026-05-08 cs.CV

CARD: A Multi-Modal Automotive Dataset for Dense 3D Reconstruction in Challenging Road Topography

Gasser Elazab, Frank Neuhaus, Tilman Koß, Malte Splietker, Aditya Date, Michael Unterreiner, Maximilian Jansen, Olaf Hellwich

Comments Accepted at CVPR 2026 (Highlight). Project page: https://card.content.cariad.digital

2605.02241 2026-05-08 cs.AI cs.CL cs.ET

Zero-Shot Confidence Estimation for Small LLMs: When Supervised Baselines Aren't Worth Training

Luong N. Nguyen

2605.01699 2026-05-08 cs.LG cs.AI cs.CR cs.NE

Probe-Geometry Alignment: Erasing the Cross-Sequence Memorization Signature Below Chance

Anamika Paul Rupa, Anietie Andy

详情

英文摘要

Recent attacks show that behavioural unlearning of large language models leaves internal traces recoverable by adversarial probes. We characterise where this retention lives and show it can be surgically removed without measurable capability cost. Our central protocol is a leave-one-out cross-sequence probe that tests whether a memorisation signature generalises across held-out sequences. The signature is real and consistent across scale: memorisation-specific gaps of +0.32, +0.19, +0.30 on Pythia-70M, GPT-2 medium, and Mistral-7B; on Pythia-70M, the random-initialisation control collapses to -0.04 at the deepest layer where the pretrained signature peaks. The probe direction is causally separable from recall -- projecting it out collapses the signature locally (+0.44 -> -0.19) while behavioural recall barely changes -- and a probe trained on naturally memorised content does not classify fine-tuning-injected secrets, marking two representationally distinct regimes. We then introduce probe-geometry alignment (PGA), a surgical erasure that aligns activations along the probe's live readout direction at each depth. PGA drives the cross-sequence probe below random chance at all four scales tested (toy depth-4: 0.17; Pythia-70M: 0.07; Mistral-7B: 0.45; GPT-2 medium: 0.06 via MD-PGA k=2) and remains robust to six adversarial probe variants. Against a re-fitting attacker who trains a fresh probe on PGA-treated activations, we extend PGA adversarially, defeating the re-fit probe at every memorisation-relevant depth while preserving five zero-shot capability benchmarks within 2.8 percentage points per task (mean Δacc = +0.2pp). The cross-sequence signature is a real, causally separable, regime-specific property of pretrained representations -- removable below chance with a single rank-one intervention per depth at no measurable capability cost.

URL PDF HTML ☆

赞 0 踩 0

2605.01627 2026-05-08 cs.LG

Importance-Guided Basis Selection for Low-Rank Decomposition of Large Language Models

Daniel Agyei Asante, Ernie Chang, Yang Li

2605.01291 2026-05-08 cs.LG

Congestion-Aware Dynamic Axonal Delay for Spiking Neural Networks

Dewei Bai, Hongxiang Peng, Yunyun Zeng, Ziyu Zhang, Hong Qu

2605.00649 2026-05-08 cs.LG

Model Compression with Exact Budget Constraints via Riemannian Manifolds

Michael Helcig, Dan Alistarh

2605.00199 2026-05-08 cs.CL cs.AI cs.IR cs.LG

RSAT: Structured Attribution Makes Small Language Models Faithful Table Reasoners

Jugal Gajjar, Kamalasankari Subramaniakuppusamy

Comments 8 pages, 8 tables, 9 figures, and a 3-page Appendix. Accepted at the SURGeLLM Workshop at ACL 2026 and will be included in the proceedings

2604.28192 2026-05-08 cs.RO cs.CV

LaST-R1: Reinforcing Robotic Manipulation via Adaptive Physical Latent Reasoning

Hao Chen, Jiaming Liu, Zhonghao Yan, Nuowei Han, Renrui Zhang, Chenyang Gu, Jialin Gao, Ziyu Guo, Siyuan Qian, Yinxi Wang, Peng Jia, Shanghang Zhang, Pheng-Ann Heng

2604.27607 2026-05-08 cs.CL

JaiTTS: A Thai Voice Cloning Model

Jullajak Karnjanaekarin, Pontakorn Trakuekul, Narongkorn Panitsrisit, Sumana Sumanakul, Vichayuth Nitayasomboon, Nithid Guntasin, Thanavin Denkavin, Attapol T. Rutherford

2604.27155 2026-05-08 cs.LG

Generalizing the Geometry of Model Merging Through Frechet Averages

Marvin F. da Silva, Mohammed Adnan, Felix Dangel, Sageev Oore