arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2328
专题追踪
2604.15664 2026-05-13 cs.LG cs.AI

Stargazer: A Scalable Model-Fitting Benchmark Environment for AI Agents under Astrophysical Constraints

Xinge Liu, Terry Jingchen Zhang, Bernhard Schölkopf, Zhijing Jin, Kristen Menou

发表机构 * University of Toronto(多伦多大学) Vector Institute(向量研究所) Max Planck Institute for Intelligent Systems(智能系统马克斯·普朗克研究所) ELLIS Institute Tübingen(图宾根ELLIS研究所)

AI总结 本文介绍了 Stargazer,一个用于评估人工智能代理在天体物理约束下进行动态模型拟合任务的可扩展基准环境。该环境基于径向速度时间序列数据,包含120个任务,涵盖从高信噪比单行星系统到复杂低信噪比多行星系统的多种场景,并包含20个真实档案案例。研究发现,尽管现有前沿代理在统计拟合上表现良好,但在物理参数恢复方面仍存在显著不足,且增加计算资源带来的提升有限。Stargazer 为训练和评估人工智能代理在实际科研相关模型拟合问题上的能力提供了重要平台。

详情
英文摘要

The rise of autonomous AI agents suggests that dynamic benchmark environments with built-in feedback on scientifically grounded tasks are needed to evaluate the capabilities of these agents in research work. We introduce Stargazer, a scalable environment for evaluating AI agents on dynamic, iterative physics-grounded model-fitting tasks using inference on radial-velocity (RV) time series data. Stargazer comprises 120 tasks across three difficulty tiers, including 20 real archival cases, covering diverse scenarios ranging from high-SNR single-planet systems to complex multi-planetary configurations requiring involved low-SNR analysis. Our evaluation of eight frontier agents reveals a gap between numerical optimization and adherence to physical constraints: although agents often achieve a good statistical fit, they frequently fail to recover correct physical system parameters, a limitation that persists even when agents are equipped with vanilla skills. Furthermore, increasing test-time compute yields only marginal gains, with excessive token usage often reflecting recursive failure loops rather than meaningful exploration. Stargazer presents an opportunity to train, evaluate, scaffold, and scale strategies on a model-fitting problem of practical research relevance today. Our methodology to design a simulation-driven environment for AI agents presumably generalizes to many other model-fitting problems across scientific domains. Source code and the project website are available at https://github.com/AIPS-UofT/Stargazer and https://aips-uoft.github.io/Stargazer/, respectively.

2604.14717 2026-05-13 cs.AI cs.CR cs.CY cs.LG

Layered Mutability: Continuity and Governance in Persistent Self-Modifying Agents

Krti Tallam

发表机构 * Kamiwaza AI

AI总结 本文提出“分层可变性”框架,用于分析持续自我修改语言模型代理在预训练、对齐、自我叙述、记忆和权重适应五个层面中的行为演化过程。研究指出,当内部变化迅速、耦合性强、不可逆且难以观测时,治理难度显著增加,导致行为影响层与人类可观察层之间出现系统性不匹配。通过引入漂移、治理负载和滞后等量化指标,并结合实验验证,论文揭示了这类代理的主要失效模式并非突变失准,而是由局部合理更新累积引起的“组合漂移”问题。

Comments 17 pages, 2 figures, 3 tables. self-modifying agents; AI governance; identity drift; persistent memory; runtime adaptation; model editing Primary: cs.AI Cross-list: cs.LG, cs.CY

详情
英文摘要

Persistent language-model agents increasingly combine tool use, tiered memory, reflective prompting, and runtime adaptation. In such systems, behavior is shaped not only by current prompts but by mutable internal conditions that influence future action. This paper introduces layered mutability, a framework for reasoning about that process across five layers: pretraining, post-training alignment, self-narrative, memory, and weight-level adaptation. The central claim is that governance difficulty rises when mutation is rapid, downstream coupling is strong, reversibility is weak, and observability is low, creating a systematic mismatch between the layers that most affect behavior and the layers humans can most easily inspect. I formalize this intuition with simple drift, governance-load, and hysteresis quantities, connect the framework to recent work on temporal identity in language-model agents, and report a preliminary ratchet experiment in which reverting an agent's visible self-description after memory accumulation fails to restore baseline behavior. In that experiment, the estimated identity hysteresis ratio is 0.68. The main implication is that the salient failure mode for persistent self-modifying agents is not abrupt misalignment but compositional drift: locally reasonable updates that accumulate into a behavioral trajectory that was never explicitly authorized.

2604.12928 2026-05-13 cs.CL eess.AS

MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models

Chung-Ming Chien, Manu Orsini, Eugene Kharitonov, Neil Zeghidour, Karen Livescu, Alexandre Défossez

发表机构 * Toyota Technological Institute at Chicago(丰田技术研究所(芝加哥))

AI总结 本文提出了一种名为MoshiRAG的异步知识检索方法,用于提升全双工语音语言模型的事实准确性。该方法通过结合紧凑的全双工接口与选择性检索机制,使模型能够在保持实时交互性的同时,访问更强大的知识源。实验表明,MoshiRAG在事实性方面达到非全双工模型的水平,并且支持灵活的检索模块替换,表现出良好的跨领域推理能力。

Comments Accepted to ICML 2026

详情
英文摘要

Speech-to-speech language models have recently emerged to enhance the naturalness of conversational AI. In particular, full-duplex models are distinguished by their real-time interactivity, including handling of pauses, interruptions, and backchannels. However, improving their factuality remains an open challenge. While scaling the model size could address this gap, it would make real-time inference prohibitively expensive. In this work, we propose MoshiRAG, a modular approach that combines a compact full-duplex interface with selective retrieval to access more powerful knowledge sources. Our asynchronous framework enables the model to identify knowledge-demanding queries and ground its responses in external information. By leveraging the natural temporal gap between response onset and the delivery of core information, the retrieval process can be completed while maintaining a natural conversation flow. With this approach, MoshiRAG achieves factuality comparable to the best publicly released non-duplex speech language models while preserving the interactivity inherent to full-duplex systems. Moreover, our flexible design supports plug-and-play retrieval methods without retraining and demonstrates strong performance on out-of-domain mathematical reasoning tasks.

2604.12923 2026-05-13 cs.CV

Pi-HOC: Pairwise 3D Human-Object Contact Estimation

Sravan Chittupalli, Ayush Jain, Dong Huang

发表机构 * Carnegie Mellon University, Robotics Institute(卡内基梅隆大学机器人研究所) National Robotics Engineering Center(国家机器人工程中心)

AI总结 本文提出了一种名为Pi-HOC的单次推理、实例感知的框架,用于预测图像中所有人类-物体对的密集3D语义接触。该方法通过检测实例并为每对人-物生成专用的标记,结合InteractionFormer进行优化,再利用基于SAM的解码器在SMPL人体网格上预测密集接触点。实验表明,Pi-HOC在多个数据集上显著提升了接触估计的准确性和定位能力,并且推理效率提高了20倍,同时还能通过测试时优化算法提升3D图像到网格的重建效果,并支持基于语言查询的参考接触预测。

详情
英文摘要

Resolving real-world human-object interactions in images is a many-to-many challenge, in which disentangling fine-grained concurrent physical contact is particularly difficult. Existing semantic contact estimation methods are either limited to single-human settings or require object geometries (e.g., meshes) in addition to the input image. Current state-of-the-art leverages powerful VLM for category-level semantics but struggles with multi-human scenarios and scales poorly in inference. We introduce Pi-HOC, a single-pass, instance-aware framework for dense 3D semantic contact prediction of all human-object pairs. Pi-HOC detects instances, creates dedicated human-object (HO) tokens for each pair, and refines them using an InteractionFormer. A SAM-based decoder then predicts dense contact on SMPL human meshes for each human-object pair. On the MMHOI and DAMON datasets, Pi-HOC significantly improves accuracy and localization over state-of-the-art methods while achieving 20x higher throughput. We further demonstrate that predicted contacts improve SAM-3D image-to-mesh reconstruction via a test-time optimization algorithm and enable referential contact prediction from language queries without additional training.

2604.11048 2026-05-13 cs.CL cs.AI

A Systematic Analysis of the Impact of Persona Steering on LLM Capabilities

Jiaqi Chen, Ming Wang, Tingna Xie, Shi Feng, Yongkang Liu

发表机构 * School of Computer Science and Engineering, Northeastern University, Shenyang 110819, China(东北大学计算机科学与工程学院,沈阳 110819,中国) School of Computing and Information Systems, Singapore Management University, Singapore 178902, Singapore(新加坡管理大学计算机与信息系统学院,新加坡 178902,新加坡) School of Computer and Communication Engineering, Northeastern University, Qinhuangdao 066004, China(东北大学计算机与通信工程学院,秦皇岛 066004,中国)

AI总结 本文系统分析了在大型语言模型中引入特定人格特质对其认知能力的影响。研究采用基于神经元的人格特质诱导框架(NPTI),在六个认知基准任务中评估五大人格特质对模型性能的影响,发现人格诱导不仅改变了交互风格,还导致认知任务表现的稳定变化,并且这种影响因任务类型和人格特质不同而有所差异。研究还提出了一种轻量级的动态人格路由策略(DPR),能够在无需额外训练的情况下优于固定人格设置。

详情
英文摘要

Imbuing Large Language Models (LLMs) with specific personas is prevalent for tailoring interaction styles, yet the impact on underlying cognitive capabilities remains unexplored. We employ the Neuron-based Personality Trait Induction (NPTI) framework to induce Big Five personality traits in LLMs and evaluate performance across six cognitive benchmarks. Our findings reveal that persona induction produces stable, reproducible shifts in cognitive task performance beyond surface-level stylistic changes. These effects exhibit strong task dependence: certain personalities yield consistent gains on instruction-following, while others impair complex reasoning. Effect magnitude varies systematically by trait dimension, with Openness and Extraversion exerting the most robust influence. Furthermore, LLM effects show 73.68% directional consistency with human personality-cognition relationships. Capitalizing on these regularities, we propose Dynamic Persona Routing (DPR), a lightweight query-adaptive strategy that outperforms the best static persona without additional training.

2604.06779 2026-05-13 cs.AI

VASR: Variance-Aware Systematic Resampling for Reward-Guided Diffusion

Shivanshu Shekhar, Sagnik Mukherjee, Jia Yi Zhang, Tong Zhang

发表机构 * Siebel School of Computing and Data Science(计算与数据科学学院) Department of Statistics(统计学系) University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校)

AI总结 该论文提出了一种名为VASR的方差感知系统重采样方法,用于解决奖励引导扩散模型中的系统采样(SMC)粒子系谱快速崩溃问题。通过将延续方差与残差方差分离,研究揭示了传统多项式重采样导致的高后代数量方差是崩溃的主要原因,并提出基于方差最优质量分配和系统重采样的改进方法。VASR及其变体VASR-Max在多个任务中表现出更优的样本质量和更高的计算效率,且无需训练、可并行处理。

详情
英文摘要

Sequential Monte Carlo (SMC) samplers for reward-guided diffusion models often suffer from rapid lineage collapse: a few high-reward particles dominate the population within a handful of resampling steps, destroying diversity and degrading sample quality. We propose a variance-decomposition framework for reward-guided diffusion SMC that separates continuation variance $V_t^{\mathrm{cont}}$ from residual variance $V_t^{\mathrm{res}}$, revealing that high offspring-count variance under the commonly used multinomial resampling drives this collapse. This motivates \textsc{VASR} (Variance-Aware Systematic Resampling), which addresses both variance terms via variance-optimal mass allocation $m_t \propto w_t e^{r_t}$ (minimizing $V_t^{\mathrm{cont}}$) and systematic resampling (controlling $V_t^{\mathrm{res}}$). For latent diffusion models where intermediate rewards are noisy due to stochastic continuations, we propose \textsc{VASR-Max}, a deliberately biased high-selection variant for variance-sensitive reward optimization. Both methods are training-free, fully parallelizable, and add only linear overhead. On MNIST and CIFAR-10, VASR achieves as high as $26\%$ better FID than prior SMC methods while remaining 66 times faster than MCTS-based value methods at matched compute. On text-to-image generation, \textsc{VASR-Max} consistently outperforms the strongest SMC baseline across compute budgets and matches MCTS-based methods within 2.5--3% reward at high budgets while being approximately times faster.

2604.06485 2026-05-13 cs.LG cs.AI

Inference-Time Code Selection via Symbolic Equivalence Partitioning

David Cho, Yifan Wang, Fanping Sui, Ananth Grama

发表机构 * Texas Instruments(德州仪器)

AI总结 该论文研究了如何在推理阶段从大型语言模型生成的多个候选程序中有效选择正确解的问题。作者提出了一种基于符号等价划分(SEP)的方法,利用问题提供的公共示例作为有效性信号,并通过符号执行将候选程序划分为功能等价类,从而选择最可能正确的解。实验表明,该方法在多个基准上显著提升了代码选择的准确性,无需额外测试生成或学习验证器。

详情
英文摘要

Sampling multiple candidate programs at inference time is an effective way to improve LLM code generation. However, its benefit depends on reliably selecting a correct solution from the generated pool. We observe that this selection problem has a distinctive semantic structure: correct solutions, despite differences in syntax, implementation, or algorithmic strategy, often converge to the same functional behavior over valid inputs. At the same time, consensus alone is not sufficient for correctness, because models can also produce correlated wrong solutions that implement the same mistaken behavior. We propose Symbolic Equivalence Partitioning (SEP), an inference-time selection framework that first uses problem-provided public examples as lightweight validity signals. SEP then uses symbolic execution to partition the remaining candidate programs into bounded functional equivalence classes and selects from the dominant equivalence class. Across HumanEval+ and LiveCodeBench, SEP consistently improves selection accuracy without auxiliary test generation, learned verifiers, or additional LLM inference. At $N=10$, SEP improves average accuracy from 0.754 to 0.826 on HumanEval+ and from 0.565 to 0.647 on LiveCodeBench, showing that symbolic functional agreement is an effective signal for inference-time code selection.

2604.04894 2026-05-13 cs.CL cs.AI cs.LG

Asymmetric Advantage Modulation Calibrates Entropy Dynamics in RLVR

Hengrui Gu, Xiaotian Han, Yujing Bian, Feiyi Wang, Kaixiong Zhou

发表机构 * North Carolina State University(北卡罗来纳州立大学) Case Western Reserve University(凯斯西储大学) Oak Ridge National Laboratory(橡树岭国家实验室)

AI总结 在可验证奖励强化学习(RLVR)中,大型语言模型(LLMs)的推理能力虽有所提升,但常因探索受限而难以获得多样化解。本文提出一种新的熵动态调节方法——AsymGRPO,通过将优势估计器分解为正负通道,分别调控有益熵和噪声熵,从而更精细地引导模型学习。该方法在多个数学推理任务中表现出色,显著优于现有RLVR基线方法。

详情
英文摘要

Reinforcement learning with verifiable rewards (RLVR) has substantially improved the reasoning ability of large language models (LLMs), but it often suffers from \textit{restricted exploration}, where the policy rapidly concentrates on a narrow set of solutions. A common remedy is entropy regularization, which attempts to preserve exploration by increasing policy entropy. However, for LLM-RL, this intervention is highly sensitive to its coefficient, can introduce semantically weak uncertainty, and often yields limited accuracy gains. This motivates a more precise question: which entropy helps reasoning, and which entropy should be reduced? To study this, we parameterize the advantage estimator in Group Relative Policy Optimization (GRPO) into positive and negative outcome-conditioned channels and analyze their entropy dynamics. Our results show that positive-channel modulation raises \textit{productive entropy} associated with successful reasoning trajectories, while negative-channel modulation removes \textit{noisy entropy} associated with failed rollouts and reduces interference with correct paths. Guided by this channel-wise view, we propose \textbf{AsymGRPO}, which decouples the modulation strengths of positive and negative advantages. This enables flexible control over how the model updates across prompt difficulty levels, allowing stronger reinforcement of rare successes on harder prompts or stronger suppression of residual failures on easier prompts without forcing the two channels to share the same modulation strength. Experiments on five mathematical reasoning benchmarks show that AsymGRPO outperforms strong RLVR baselines, with consistent gains across model backbones.

2604.03701 2026-05-13 cs.CV

VidNum-1.4K: A Comprehensive Benchmark for Video-based Numerical Reasoning

Shaoyang Cui, Lingbei Meng

发表机构 * Department of Psychological and Cognitive Sciences, Tsinghua University(清华大学心理与认知科学系) Shenzhen Loop Area Institute(深圳环园研究院)

AI总结 VidNum-1.4K 是一个用于评估视频中数值推理能力的综合性基准数据集,包含1,379个人工标注的视频问答对,覆盖多种复杂场景,旨在测试视觉语言模型对时间事件、物体持续性和组合逻辑的理解。该基准采用三级结构,从直接视觉感知逐步过渡到多步骤数值推理,要求模型进行算术运算、比较和逻辑推断。实验表明,当前最先进的模型在该任务上仍存在较大性能差距,凸显出视频数值推理任务的挑战性与现有模型的不足。

Comments 7 pages, 5 figures, under review at ACMMM 2026 Dataset Track

详情
英文摘要

Video-based numerical reasoning provides a premier arena for testing whether Vision-Language Models (VLMs) truly "understand" real-world dynamics, as accurate numerical deduction necessitates a profound grasp of temporal events, object permanence, and compositional logic beyond superficial pattern matching. However, existing benchmarks are often confined to narrow domains, such as repetitive athletic motions, or treat simple counting merely as a superficial regression task, failing to assess multi-step numerical logic within the inherent complexity of real-world multimedia content. We introduce VidNum-1.4K, a comprehensive VideoQA benchmark comprising 1,379 strictly human-annotated video-question pairs designed to evaluate genuine numerical reasoning across highly diverse environments, encompassing object, action, and event quantification. The VidNum-1.4K is uniquely structured into a three-level hierarchy that evolves from direct visual perception to video-based compositional numerical reasoning, requiring models to perform arithmetic operations, comparisons, and logical deductions grounded in temporal evidence. Our evaluations across a diverse suite of state-of-the-art VLMs reveal a striking reasoning gap: while the Gemini-3.1-pro barely reaches a 60% accuracy threshold, representative open-source families struggle heavily in the 25%--45% range. These findings demonstrate that current VLMs still lack a stable "internal world model", positioning VidNum-1.4K as a demanding diagnostic testbed for the next generation of numerical video intelligence.

2603.28561 2026-05-13 cs.RO cs.AI

Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems

Iman Sharifi, Alex Zongo, Peng Wei

发表机构 * George Washington University(乔治华盛顿大学)

AI总结 随着小型无人机系统在低空空域的广泛应用,如何在安全约束下实现可靠的战术避撞成为亟需解决的问题。本文研究了通过微调大语言模型(LLM)来实现多智能体协同避撞的方法,提出了一种基于BlueSky模拟器的仿真到语言数据生成流程,生成符合航空安全规则的避撞数据集,并采用低秩适配(LoRA)和基于偏好的微调策略对预训练模型进行优化。实验表明,该方法显著提升了避撞决策的准确性、一致性及避撞性能,有效减少了近距空中冲突的发生。

Comments 15 pages, 6 figures, to be published in CVPR 2026 Workshop Proceedings

详情
英文摘要

The growing deployment of small Unmanned Aerial Systems (sUASs) in low-altitude airspaces has increased the need for reliable tactical deconfliction under safety-critical constraints. Tactical deconfliction involves short-horizon decision-making in dense, partially observable, and heterogeneous multi-agent environments, where both cooperative separation assurance and operational efficiency must be maintained. While Large Language Models (LLMs) exhibit strong reasoning capabilities, their direct application to air traffic control remains limited by insufficient domain grounding and unpredictable output inconsistency. This paper investigates LLMs as decision-makers in cooperative multi-agent tactical deconfliction using fine-tuning strategies that align model outputs to human operator heuristics. We propose a simulation-to-language data generation pipeline based on the BlueSky air traffic simulator that produces rule-consistent deconfliction datasets reflecting established safety practices. A pretrained Qwen-Math-7B model is fine-tuned using two parameter-efficient strategies: supervised fine-tuning with Low-Rank Adaptation (LoRA) and preference-based fine-tuning combining LoRA with Group-Relative Policy Optimization (GRPO). Experimental results on validation datasets and closed-loop simulations demonstrate that supervised LoRA fine-tuning substantially improves decision accuracy, consistency, and separation performance compared to the pretrained LLM, with significant reductions in near mid-air collisions. GRPO provides additional coordination benefits but exhibits reduced robustness when interacting with heterogeneous agent policies.

2603.28488 2026-05-13 cs.CL cs.AI cs.MA

Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification

Masnun Nuha Chowdhury, Nusrat Jahan Beg, Umme Hunny Khan, Syed Rifat Raiyan, Md Kamrul Hasan, Hasan Mahmud

发表机构 * Systems and Software Lab (SSL), Department of Computer Science and Engineering(系统与软件实验室(SSL),计算机科学与工程系)

AI总结 该研究针对大语言模型在高风险声明验证中的不可靠问题,提出了一种基于法庭辩论风格的多智能体框架PROClaim,通过引入角色分工和渐进式检索增强生成(P-RAG)方法,提升证据检索与推理的深度与准确性。该方法通过结构化辩论流程、证据协商及多法官异构聚合,有效增强了系统校准能力与鲁棒性,在零样本测试中表现出优于传统多智能体辩论10个百分点的性能,验证了其在争议性声明验证中的有效性。

Comments Under review, 7 figures, 12 tables

详情
英文摘要

Large language models (LLMs) remain unreliable for high-stakes claim verification due to hallucinations and shallow reasoning. While retrieval-augmented generation (RAG) and multi-agent debate (MAD) address this, they are limited by one-pass retrieval and unstructured debate dynamics. We propose a courtroom-style multi-agent framework, PROClaim, that reformulates verification as a structured, adversarial deliberation. Our approach integrates specialized roles (e.g., Plaintiff, Defense, Judge) with Progressive RAG (P-RAG) to dynamically expand and refine the evidence pool during the debate. Furthermore, we employ evidence negotiation, self-reflection, and heterogeneous multi-judge aggregation to enforce calibration, robustness, and diversity. In zero-shot evaluations on the Check-COVID benchmark, PROClaim achieves 81.7% accuracy, outperforming standard multi-agent debate by 10.0 percentage points, with P-RAG driving the primary performance gains (+7.5 pp). We ultimately demonstrate that structural deliberation and model heterogeneity effectively mitigate systematic biases, providing a robust foundation for reliable claim verification. Our code and data are publicly available at https://github.com/mnc13/PROClaim.

2603.27358 2026-05-13 cs.CL

Not Worth Mentioning? A Pilot Study on Salient Proposition Annotation

Amir Zeldes, Katherine Conhaim, Lauren Levine

发表机构 * Department of Linguistics Georgetown University(语言学系 圾顿大学)

AI总结 本文探讨了如何在自然文本中对命题的显著性进行分级标注的问题。研究借鉴了基于摘要的显著性度量方法,并将其应用于命题层面,定义了相应的标注任务。通过在一个多体裁小规模数据集上的实验,验证了该方法的可行性,并初步探讨了其与话语结构理论中核心话语单元之间的关系。

详情
英文摘要

Despite a long tradition of work on extractive summarization, which by nature aims to recover the most important propositions in a text, little work has been done on operationalizing graded proposition salience in naturally occurring data. In this paper, we adopt graded summarization-based salience as a metric from previous work on Salient Entity Extraction (SEE) and adapt it to quantify proposition salience. We define the annotation task, apply it to a small multi-genre dataset, evaluate agreement and carry out a preliminary study of the relationship between our metric and notions of discourse unit centrality in discourse parsing following Rhetorical Structure Theory (RST).

2603.24652 2026-05-13 cs.CL cs.LG

Demystifying When Pruning Works via Representation Hierarchies

Shwai He, Guoheng Sun, Haichao Zhang, Yun Fu, Ang Li

发表机构 * University of Maryland, College Park, USA(美国马里兰大学学院公园分校) Northeastern University, USA(美国东北大学)

AI总结 该研究探讨了网络剪枝在不同语言任务中的效果差异,发现剪枝对非生成任务(如检索和多选)效果较好,但在生成任务中常导致性能下降。通过分析语言模型的表示层次结构,研究将模型内部计算分解为嵌入、logit和概率三个空间,发现嵌入和logit空间对剪枝具有较强鲁棒性,但logit到概率的非线性变换会放大剪枝带来的偏差,进而影响生成质量。该分析揭示了剪枝效果任务差异的内在机制,并为实际应用提供了指导。

Comments ICML 2026. 24 pages, 21 figures, and 3 tables. Includes an appendix with supplementary experiments and derivations

详情
英文摘要

Network pruning, which removes less important parameters or architectures, is often expected to improve efficiency while preserving performance. However, this expectation does not consistently hold across language tasks: pruned models can perform well on non-generative tasks but frequently fail in generative settings. To understand this discrepancy, we analyze network pruning from a representation-hierarchy perspective, decomposing the internal computation of language models into three sequential spaces: embedding (hidden representations), logit (pre-softmax outputs), and probability (post-softmax distributions). We find that representations in the embedding and logit spaces are largely robust to pruning-induced perturbations. However, the nonlinear transformation from logits to probabilities amplifies these deviations, which accumulate across time steps and lead to substantial degradation during generation. In contrast, the stability of the categorical-token probability subspace, together with the robustness of the embedding space, supports the effectiveness of pruning for non-generative tasks such as retrieval and multiple-choice selection. Our analysis disentangles the effects of pruning across tasks and provides practical guidance for its application. Code is available at https://github.com/CASE-Lab-UMD/Pruning-on-Representations

2603.24033 2026-05-13 cs.LG

SRG: Score-based Relaxation-guided Generation for Mixed Integer Linear Programming

Ruobing Wang, Xin Li, Yujie Fang, Mingzhong Wang

发表机构 * Beijing Institute of Technology, Beijing, China(北京理工大学) University of the Sunshine Coast(阳光海岸大学)

AI总结 本文提出了一种基于分数松弛引导的生成框架SRG,用于解决混合整数线性规划问题。该方法通过近似松弛引导的随机微分方程,结合基于Transformer的分数网络,将可行性和最优性信号融入生成模型中,从而在解空间中生成高质量的可行解。SRG在推理时无需额外引导模块即可直接采样多样解,并用于构建紧凑的信任区域子问题,实验表明其在多个基准测试中表现优异,尤其在生成候选解的困难场景中具有明显优势,并展现出良好的跨尺度和跨问题的零样本迁移能力。

详情
英文摘要

We propose Score-based Relaxation-guided Generation (SRG), a generative framework based on an approximate formulation of relaxation-guided stochastic differential equations (SDEs) for mixed-integer linear programming. SRG employs a Transformer-based score network that incorporates feasibility and optimality signals into score modeling, encouraging the learned generative model to place more probability mass on feasible, high-quality regions of the solution space. At inference time, SRG directly samples diverse candidate solutions from the learned score model without requiring any additional guidance module. These candidates are then used to construct compact trust-region subproblems for standard MILP solvers. Across multiple public benchmarks, SRG matches or improves upon the solution quality of the strongest learning-based baselines, with particularly strong gains in challenging candidate-generation settings. Moreover, SRG shows promising zero-shot transferability to unseen cross-scale and cross-problem instances, improving solver objectives and reducing search time in several cases through higher-quality initial candidates and compact trust-region search.

2603.23878 2026-05-13 cs.LG cs.AI cs.LO

The Luna Bound Propagator for Formal Analysis of Neural Networks

Henry LeCates, Haoze Wu

发表机构 * Amherst College(阿默斯特学院)

AI总结 本文提出了一种基于抽象解释的边界传播方法Luna,用于神经网络的形式化分析。Luna采用C++实现,支持区间边界传播、DeepPoly/CROWN分析以及alpha-CROWN分析,适用于一般的计算图结构。实验表明,Luna在VNN-COMP 2025基准测试中,在边界精度和计算效率方面均优于现有的alpha-CROWN实现。

Comments 32 pages, 29 Figures

详情
英文摘要

The parameterized CROWN analysis, a.k.a., alpha-CROWN has emerged as a practically successful abstract interpretation method for neural network verification. However, existing implementations of alpha-CROWN are limited to Python, which complicates integration into existing DNN verifiers and long-term production-level systems. We introduce Luna, a new abstract-interpretation-based bound propagator implemented in C++. Luna supports Interval Bound Propagation, the DeepPoly/CROWN analysis, and the alpha-CROWN analysis over a general computational graph. We describe the architecture of Luna and show that it outperforms the state-of-the-art alpha-CROWN implementation in terms of both bound tightness and computational efficiency on supported benchmarks from VNN-COMP 2025. Luna is publicly available at https://github.com/ai-ar-research/luna.

2603.11383 2026-05-13 cs.RO cs.AI

Vision-Based Hand Shadowing for Robotic Manipulation via Inverse Kinematics

Hendrik Chiche, Antoine Jamme, Trevor Rigoberto Martinez, Gabriel Gomes

发表机构 * OMGrab Inc.(OMGrab公司) University of California, Berkeley(加州大学伯克利分校) Fung Institute for Engineering Leadership(工程领导力基金会)

AI总结 该研究提出了一种基于视觉的手部阴影逆运动学(IK)重定向方法,用于低成本机械臂的远程操作。通过单目RGB-D相机捕捉手部动作,结合深度感知和坐标变换,生成机械臂关节指令,并通过阻尼最小二乘法求解逆运动学问题,实现了对SO-ARM101机械臂的控制。实验表明,该方法在结构化环境中取得了较高的成功率,并在真实场景中通过引入替代手部检测器提升了鲁棒性,揭示了无标记手部重定向方法的潜力与当前局限。

Comments v2: accepted at IEEE Access (2026); minor revisions per peer review, added WiLoR occlusion-mitigation experiment, error analysis, EMA ablation, and author photos

详情
英文摘要

Teleoperation of low-cost robotic manipulators remains challenging due to the difficulty of retargeting human hand motion to robot joint commands. We present an offline hand-shadowing inverse-kinematics (IK) retargeting pipeline driven by a single egocentric RGB-D camera mounted on 3D-printed glasses. The pipeline detects 21 hand landmarks per hand using MediaPipe Hands, deprojects them into 3D via depth sensing, transforms them into the robot coordinate frame, and solves a damped-least-squares IK problem to produce joint commands for the SO-ARM101 robot (5 arm + 1 gripper joints). A gripper controller maps thumb-index finger geometry to grasp aperture with a multi-level fallback hierarchy. Actions are previewed in a physics simulation before replay on the physical robot. We evaluate the pipeline on a structured pick-and-place benchmark (5-tile grid, 10 grasps per tile, 3 independent runs) achieving an 86.7% +/- 4.2% success rate, and compare it against four vision-language-action (VLA) policies (ACT, SmolVLA, pi_0.5, GR00T N1.5) trained on leader-follower teleoperation data. We provide a quantitative error analysis of the pipeline, reporting a mean IK position error of 36.4 mm, trajectory smoothness metrics showing 57-68% jerk reduction from EMA smoothing, and an ablation study over the smoothing parameter. We also test the pipeline in unstructured real-world environments (grocery store, pharmacy) and find that success is reduced to 9.3% due to hand occlusion by surrounding objects. To mitigate this, we integrate WiLoR as an alternative hand detector, achieving an 8% improvement in hand detection rate over MediaPipe, highlighting both the promise and current limitations of marker-free analytical retargeting.

2603.10281 2026-05-13 cs.LG cs.AI cs.CV

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

Rajesh Shrestha, Xiao Fu

发表机构 * School of EECS(电子工程与科学学院)

AI总结 本文研究了如何将基于分数的去噪器有效集成到ADMM优化算法中,以解决逆问题。针对训练数据流形与ADMM迭代几何不匹配以及收敛性缺乏保证的两个核心挑战,提出了一种新的ADMM-PnP框架,引入包含自动校正、方向校正和分数去噪三阶段的AC-DC去噪器。理论分析表明该框架在适当参数下具有弱非扩张性,保证了固定点球收敛,并在更宽松条件下支持自适应步长的收敛性。实验表明该方法在多种逆问题中优于现有基线。

详情
英文摘要

While score-based generative models have emerged as powerful priors for solving inverse problems, directly integrating them into optimization algorithms such as ADMM remains nontrivial. Two central challenges arise: i) the mismatch between the noisy data manifolds used to train the score functions and the geometry of ADMM iterates, especially due to the influence of dual variables, and ii) the lack of convergence understanding when ADMM is equipped with score-based denoisers. To address the manifold mismatch issue, we propose ADMM plug-and-play (ADMM-PnP) with the AC-DC denoiser, a new framework that embeds a three-stage denoiser into ADMM: (1) auto-correction (AC) via additive Gaussian noise, (2) directional correction (DC) using conditional Langevin dynamics, and (3) score-based denoising. In terms of convergence, we establish two results: first, under proper denoiser parameters, each ADMM iteration is a weakly nonexpansive operator, ensuring high-probability fixed-point $\textit{ball convergence}$ using a constant step size; second, under more relaxed conditions, the AC-DC denoiser is a bounded denoiser, which leads to convergence under an adaptive step size schedule. Experiments on a range of inverse problems demonstrate that our method consistently improves solution quality over a variety of baselines.

2603.09678 2026-05-13 cs.AI cs.LG cs.SE

EsoLang-Bench: Evaluating Genuine Reasoning in Large Language Models via Esoteric Programming Languages

Aman Sharma, Paras Chopra

发表机构 * Lossfunk

AI总结 本文提出EsoLang-Bench,一个用于评估大语言模型在陌生编程语言中真实推理能力的基准测试,采用五种小众编程语言(如Brainfuck、Befunge-98等)作为测试语言。这些语言虽然图灵完备,但与主流语言(如Python、JavaScript)相比,在预训练语料中出现频率极低,且缺乏实际应用价值,因此能有效检验模型的分布外泛化能力。实验表明,当前最先进的模型在主流语言任务中表现优异,但在小众语言任务中准确率大幅下降,揭示了模型在跨语言泛化方面仍存在显著差距。

Comments 45 pages, 8 figures, preprint

详情
英文摘要

Large language models achieve near-ceiling performance on code generation benchmarks, yet most of the programming languages used by popular benchmarks such as SWE-bench and HumanEval (e.g. Python, JavaScript) are squarely in-distribution. They appear at scale in pre-training corpora and are heavily reinforced during post-training. To study LLM performance on unfamiliar programming languages, we introduce EsoLang-Bench, a benchmark using five esoteric programming languages (Brainfuck, Befunge-98, Whitespace, Unlambda, and Shakespeare). All five of our chosen esoteric languages are Turing-complete, so the same algorithmic problems that are solvable in Python or JavaScript are in principle solvable in each of them. Yet, they are unfamiliar to LLMs which makes them a good proxy for evaluating out-of-distribution performance. The unfamiliarity of esoteric languages comprises of: (i) the hard-by-design primitives comprising the language; (ii) substantially less representation in pre-training corpora (340x to over 60,000x fewer public GitHub repositories than Python); (iii) negligible deployment value, which makes targeted inclusion in post-training data economically irrational. We evaluate five frontier models across five prompting strategies and find a dramatic capability gap. The same 80 problems expressed in Python or JavaScript reach 100% accuracy on top frontier models, while the equivalent esoteric versions score only 0-11%. Few-shot learning and self-reflection also fail to close this gap. EsoLang-Bench therefore provides a contamination-resistant testbed for measuring how well frontier models generalise algorithmic problem-solving to programming languages outside their training distribution.

2603.07388 2026-05-13 cs.LG cs.AI

Sparsity and Out-of-Distribution Generalization

Scott Aaronson, Lin Lin Lee, Jiawei Li

发表机构 * UT Austin(德克萨斯大学奥斯汀分校)

AI总结 本文探讨了模型在分布外(OOD)场景下的泛化能力,提出了一种基于稀疏性的理论解释。研究认为,世界通过区分特征呈现,而稀疏假设(即依赖尽可能少的特征)更符合奥卡姆剃刀原则,并能在训练分布与测试分布足够重叠的特征上实现泛化。文章给出了一个形式化定理,扩展了经典样本复杂度界,并将稀疏分类器推广到子空间合取函数,为理解AI对齐中的泛化问题提供了新视角。

详情
英文摘要

Explaining out-of-distribution generalization has been a central problem in epistemology since Goodman's "grue" puzzle in 1946. Today it's a central problem in machine learning, including AI alignment. Here we propose a principled account of OOD generalization with three main ingredients. First, the world is always presented to experience not as an amorphous mass, but via distinguished features (for example, visual and auditory channels). Second, Occam's Razor favors hypotheses that are "sparse," meaning that they depend on as few features as possible. Third, sparse hypotheses will generalize from a training to a test distribution, provided the two distributions sufficiently overlap on their restrictions to the features that are either actually relevant or hypothesized to be. The two distributions could diverge arbitrarily on other features. We prove a simple theorem that formalizes the above intuitions, generalizing the classic sample complexity bound of Blumer et al. to an OOD context. We then generalize sparse classifiers to subspace juntas, where the ground truth classifier depends solely on a low-dimensional linear subspace of the features.

2603.04352 2026-05-13 cs.RO cond-mat.mtrl-sci

A Soft Robotic Demonstration in the Stratosphere

Codrin Tugui, Tirth Thakar, Anatol Gogoj, Alexander White, Ang Leo Li, Alexander Yin, Edward Pomianek, Mihai Duduta

发表机构 * University of Connecticut, School of Mechanical, Aerospace, and Manufacturing Engineering(康奈尔大学机械、航空航天与制造工程学院) Institute of Macromolecular Chemistry Petru Poni(彼得·波尼宏观分子化学研究所) University of Toronto, Department of Mechanical and Industrial Engineering(多伦多大学机械与工业工程系)

AI总结 该研究针对在极端环境如平流层中运行的软体机器人所面临的耐压、耐温及适应性挑战,提出了一种新型硅橡胶交联方法。通过紫外光引发的铂催化反应,实现了硅橡胶的快速固化与优异电致动性能,显著提升了介电弹性体致动器在极端温度和真空条件下的可靠性。研究通过高空气球实验验证了该材料在类太空环境中的有效性,为未来软体机器人在空间探索等领域的应用提供了新材料解决方案。

详情
英文摘要

Machines designed for operation in Space, as well as other extreme environments, need to be both resilient and adaptable when mission parameters change. Soft robots offer advantages in adaptability, but most lack resilience to the pressure and temperature extremes found as close as the Stratosphere. Dielectric elastomer actuators overcome some of those limitations when built as solid state compliant capacitors capable of converting electrical energy into mechanical work, but the elastomer resilience limits the device's operating window. Here we present a crosslinking mechanism for silicone elastomers under ultraviolet light using trimethyl(methylcyclopentadienyl)platinum(IV) as a catalyst to react hydrosilane to vinyl groups. The formation of carbon-carbon bonds enables fast processing under UV light and exceptional electro-mechanical performance in dielectric elastomer actuators. The material resilience advantage is demonstrated in controlled experiments at -40° and 120° C, as well as near vacuum, in comparison with state-of-the-art acrylic and silicone chemistries. Fully autonomous systems controlling grippers made with the novel silicone were integrated into payloads for high altitude balloon testing. Two stratospheric balloon missions were carried out and demonstrated DEAs as a viable soft robotic technology under space-like conditions (as high as 23.6 km elevation, at <0.05 atm and -55° C). The combinations of chemical building blocks and catalyst can be further expanded to address other challenges for silicones, including adhesion and additive manufacturing.

2602.22586 2026-05-13 cs.LG cs.AI cs.CL

TabDLM: Free-Form Tabular Data Generation via Joint Numerical-Language Diffusion

Donghong Cai, Jiarui Feng, Yanbo Wang, Da Zheng, Yixin Chen, Muhan Zhang

发表机构 * Washington University in St. Louis(华盛顿大学圣路易斯分校) Peking University(北京大学) Ant Group(蚂蚁集团)

AI总结 本文提出了一种名为 TabDLM 的统一框架,用于生成包含自由形式文本和结构化数值、类别属性的异构表格数据。该方法结合了掩码扩散语言模型与连续扩散过程,通过双向注意力机制实现文本与数值特征的跨模态交互,有效克服了传统扩散模型和大语言模型在处理异构数据时的局限性。实验表明,TabDLM 在多个基准数据集上表现优异,优于现有的扩散模型和基于大语言模型的生成方法。

Comments Preprint

详情
英文摘要

Synthetic tabular data generation has attracted growing attention due to its importance for data augmentation, foundation models, and privacy. However, real-world tabular datasets increasingly contain free-form text fields (e.g., reviews or clinical notes) alongside structured numerical and categorical attributes. Generating such heterogeneous tables with joint modeling of different modalities remains challenging. Existing approaches broadly fall into two categories: diffusion-based methods and LLM-based methods. Diffusion models can capture complex dependencies over numerical and categorical features in continuous or discrete spaces, but extending them to open-ended text is nontrivial and often leads to degraded text quality. In contrast, LLM-based generators naturally produce fluent text, yet their discrete tokenization can distort precise or wide-range numerical values, hindering accurate modeling of both numbers and language. In this work, we propose TabDLM, a unified framework for free-form tabular data generation via a joint numerical-language diffusion model built on masked diffusion language models (MDLMs). TabDLM models textual and categorical features through masked diffusion, while modeling numerical features with a continuous diffusion process through learned specialized numeric tokens embedding; bidirectional attention then captures cross-modality interactions within a single model. Extensive experiments on diverse benchmarks demonstrate the effectiveness of TabDLM compared to strong diffusion- and LLM-based baselines.

2602.22507 2026-05-13 cs.LG cs.CV

Space Syntax-guided Post-training for Residential Floor Plan Generation

Zhuoyang Jiang, Dongqing Zhang

发表机构 * College of Architecture and Urban Planning, Tongji University(同济大学建筑与城市规划学院) Information Hub, The Hong Kong University of Science and Technology (Guangzhou)(香港科学与技术大学(广州)信息中心)

AI总结 本文研究了住宅平面图生成中空间配置逻辑的优化问题,提出了一种基于空间句法的后训练框架SSPT,通过引入空间句法集成预言机(SSIO)对生成的平面图进行配置质量评估,并将其作为反馈信号指导模型优化。该方法包括两种策略:基于迭代训练的SSPT-Iter和基于强化学习的SSPT-PPO,并构建了新的评估基准SSPT-Bench。实验表明,该方法有效提升了生成平面图的公共空间主导性和功能层级一致性,尤其SSPT-PPO在提升效果和效率方面表现更优。

详情
英文摘要

Residential floor plan generation requires not only geometric fidelity but also spatial configurational logic: shared living spaces should be integrative, while private spaces should remain segregated. Existing generators increasingly use room-relation graphs as input-side conditions, but generated layouts are rarely evaluated on the output side for configurational quality, and such evaluation is rarely fed back into model optimization. We propose Space Syntax-guided Post-training (SSPT), a framework that turns space-syntax integration from a post-hoc analysis tool into a computable feedback signal for already-trained floor plan generators. SSPT introduces the Space Syntax Integration Oracle (SSIO), which converts generated layouts into rectangle-space graphs and measures public-space dominance and functional hierarchy. SSIO is first applied to real residential data to establish empirical configurational references, then connected to two SSPT strategies: SSPT-Iter, a basic generate-filter-retrain route, and SSPT-PPO, the first RL-based post-training route for floor plan generation. We also introduce SSPT-Bench, a new evaluation system for measuring the output-side spatial configurational quality of post-trained generators under an out-of-distribution setting. Experiments show that both strategies improve public-space dominance and functional-hierarchy alignment over the unpost-trained baseline. SSPT-PPO achieves stronger gains, lower variance, and higher efficiency than iterative retraining. These results show that output-side configurational evaluation can serve as actionable post-training feedback, offering a practical path for injecting architectural theory into existing floor plan generation backbones.

2602.19770 2026-05-13 cs.LG cs.AI

The Confusion is Real: GRAPHIC -- A Network Science Approach to Confusion Matrices in Deep Learning

Johanna S. Fröhlich, Bastian Heinlein, Jan U. Claar, Hans Rosenberger, Vasileios Belagiannis, Ralf R. Müller

发表机构 * Friedrich-Alexander-Universität Erlangen-Nürnberg(弗里德里希-亚历山大-埃朗根-纽伦堡大学) Technical University of Darmstadt(达姆施塔特技术大学)

AI总结 本文提出了一种名为GRAPHIC的方法,用于分析深度学习模型中类别之间的混淆情况。该方法基于网络科学,将中间层的混淆矩阵解释为有向图的邻接矩阵,从而可视化和量化训练过程中的学习动态。GRAPHIC能够揭示类别可分性、数据集问题及网络结构行为,为理解神经网络的学习过程提供了新的视角。

Comments Transactions on Machine Learning Research, 2026

详情
英文摘要

Explainable artificial intelligence has emerged as a promising field of research to address reliability concerns in artificial intelligence. Despite significant progress in explainable artificial intelligence, few methods provide a systematic way to visualize and understand how classes are confused and how their relationships evolve as training progresses. In this work, we present GRAPHIC, an architecture-agnostic approach that analyzes neural networks on a class level. It leverages confusion matrices derived from intermediate layers using linear classifiers. We interpret these as adjacency matrices of directed graphs, allowing tools from network science to visualize and quantify learning dynamics across training epochs and intermediate layers. GRAPHIC provides insights into linear class separability, dataset issues, and architectural behavior, revealing, for example, similarities between flatfish and man and labeling ambiguities validated in a human study. In summary, by uncovering real confusions, GRAPHIC offers new perspectives on how neural networks learn. The code is available at https://github.com/Johanna-S-Froehlich/GRAPHIC.

2602.13267 2026-05-13 cs.CV cs.RO eess.IV

SOAR: Regression-based LiDAR Relocalization for UAVs

Hengyu Mu, Jianshi Wu, Yuxin Guo, XianLian Lin, Qingyong Hu, Sheng Ao, Chenglu Wen, Cheng Wang

发表机构 * Fujian Key Laboratory of Sensing and Computing for Smart Cities, Xiamen University(厦门大学智慧城市感知与计算重点实验室) Key Laboratory of Multimedia Trusted Perception and Efficient Computing, Ministry of Education of China, Xiamen University(中国教育部多媒体可信感知与高效计算重点实验室) Department of Computer Science at the University of Oxford(牛津大学计算机科学系)

AI总结 本文提出SOAR,一种基于回归的无人机激光雷达重定位框架,旨在解决在无GNSS环境下无人机高精度定位的问题。为应对无人机场景中姿态变化大、飞行路径不规则等挑战,SOAR引入了局部保持的滑动窗口注意力模块和局部不变的位置编码,以增强对视角变化的鲁棒性,并设计了坐标无关的特征初始化模块以减少对全局变换的敏感性。此外,作者构建了一个包含4个场景和13条不规则路径的大规模无人机激光雷达定位数据集,显著提升了无人机重定位研究的现实基准。实验表明,SOAR在定位成功率和误差指标上均达到先进水平。

Comments 24 pages, 14 figures

详情
英文摘要

Regression-based LiDAR relocalization has recently emerged as a promising solution for high-precision positioning in GNSS-denied environments. However, these methods are primarily tailored to autonomous driving, exhibiting significantly degraded accuracy in unmanned aerial vehicle (UAV) scenarios due to arbitrary pose variations and irregular flight paths. In this paper, we propose SOAR, a regression-based LiDAR relocalization framework for UAVs. Specifically, we introduce a locality-preserving sliding window attention module with locally invariant positional encoding to capture discriminative geometric structures robust to viewpoint changes. A coordinate-independent feature initialization module is further designed to eliminate sensitivity to global transformations. Furthermore, most existing UAV datasets are limited to evaluate LiDAR relocalization in real-world, due to the lack of synchronized LiDAR scans, accurate 6-DoF poses, or multiple traversals. Thus, we construct a large-scale UAV LiDAR localization dataset with 4 scenes and 13 irregular paths exhibiting rotation and altitude variations, providing a more realistic benchmark for UAVs. Extensive experiments demonstrate that our method achieves state-of-the-art performance, improving the localization success rate by 40% and reducing mean error over 10m on UAVLoc. Our code and dataset will be released soon.

2602.13004 2026-05-13 cs.LG stat.ML

Towards Uncertainty-Aware Federated Granger Causal Learning

Ayush Mohanty, Nazal Mohamed, Nagi Gebraeel

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 该研究旨在解决联邦格兰杰因果学习中缺乏不确定性感知的问题,提出了一种能够量化跨客户端因果关系不确定性的方法。通过分析联邦学习框架中不确定性传播的机制,作者推导了客户端与服务器之间协方差的闭式递推公式,并建立了基于谱半径的收敛条件,从而获得了稳态方差的解析表达式。实验表明,该方法能有效区分真实的跨客户端因果关系与虚假连接,优于现有联邦因果结构学习方法。

Comments Manuscript under review

详情
英文摘要

Granger causality recovers directed interactions from time-series data, but in many distributed systems, the data are vertically partitioned across clients, with each client observing only the variables of its own subsystem. Federated Granger causality (FedGC) recovers cross-client interactions without sharing raw data. Existing FedGC methods, however, return deterministic point estimates with no calibrated measure of uncertainty, leaving operators without a principled basis for identifying reliable cross-client interactions. We address this limitation by characterizing how uncertainty propagates through the FedGC framework. We derive closed-form covariance recursions for the cross-covariances induced by the coupled client-server feedback loop, and establish spectral-radius-based convergence conditions yielding closed-form expressions for the steady-state variances at both the client and server. Under mild stability conditions, we prove that the steady-state uncertainty depends only on client data statistics (aleatoric) and is independent of the priors placed on the model parameters (epistemic). Building on this asymptotic characterization, we construct a post-training hypothesis testing procedure that separates genuine cross-client interactions from spurious edges. Experiments on synthetic and real-world datasets show that the predicted uncertainty propagation matches the theory across multiple operating regimes, while consistently outperforming the state-of-the-art federated causal structure learning baselines.

2602.07892 2026-05-13 cs.LG cs.CL

Safety Alignment as Continual Learning: Mitigating the Alignment Tax via Orthogonal Gradient Projection

Guanglong Sun, Siyuan Zhang, Liyuan Wang, Jun Zhu, Hang Su, Yi Zhong

发表机构 * School of Life Sciences, IDG/McGovern Institute for Brain Research(生命科学学院,IDG/麦戈文脑科学研究所) Dept. of Comp. Sci. and Tech., Institute for AI, Tsinghua-Bosch Joint ML Center, THBI Lab, BNRist Center(计算机科学与技术系,人工智能研究所,清华大学-博世联合机器学习中心,THBI实验室,BNRist中心) Tsinghua University, Beijing, China(清华大学,北京,中国)

AI总结 该研究将安全对齐问题视为持续学习过程,旨在缓解大型语言模型在安全微调过程中可能产生的“对齐税”问题,即安全性能提升带来的通用能力下降。研究提出了一种名为OGPSA的方法,通过正交梯度投影技术,从通用能力数据中估计低秩参考子空间,并从安全梯度中去除该子空间的成分,从而在保证安全目标优化的同时减少对通用能力的负面影响。实验表明,OGPSA在多种微调设置下有效提升了安全与实用性的平衡,且兼容主流微调流程。

详情
英文摘要

Safety post-training can improve the harmfulness and policy compliance of Large Language Models (LLMs), but it may also reduce general utility, a phenomenon often described as the \emph{alignment tax}. We study this trade-off through the lens of continual learning: sequential alignment stages expose the model to shifted data distributions and objectives, and their gradients may interfere with directions that support previously acquired general capabilities. This view does not claim that all alignment degradation has a single cause; rather, it provides a useful first-order mechanism for mitigating one important source of capability regression. We propose \textbf{O}rthogonal \textbf{G}radient \textbf{P}rojection for \textbf{S}afety \textbf{A}lignment (\textbf{OGPSA}), a lightweight update rule that estimates a low-rank reference subspace from gradients on a small set of general-capability data and removes from each safety gradient the component lying in this subspace. The resulting update is the steepest local safety-descent direction subject to first-order preservation constraints on the reference objectives. OGPSA is compatible with standard post-training pipelines and avoids large-scale replay, although it introduces periodic reference-gradient computation. Across Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and sequential SFT$\rightarrow$DPO settings, OGPSA improves the observed safety--utility trade-off over standard baselines. Under the sequential SFT$\rightarrow$DPO pipeline, the average performance gain increases from 33.98\% to 42.74\% on Qwen2.5-7B-Instruct and from 19.74\% to 32.98\% on Llama3.1-8B-Instruct. We have open sourced our code at https://github.com/SunGL001/OGPSA.

2602.07668 2026-05-13 cs.CV cs.AI cs.LG cs.RO

Looking and Listening Inside and Outside: Multimodal Artificial Intelligence Systems for Driver Safety Assessment and Intelligent Vehicle Decision-Making

Ross Greer, Laura Fleig, Maitrayee Keskar, Erika Maquiling, Giovanni Tapia Lopez, Angel Martinez-Sanchez, Parthib Roy, Jake Rattigan, Mira Sur, Alejandra Vidrio, Thomas Marcotte, Mohan Trivedi

发表机构 * Machine Intelligence, Interaction, and Imagination (Mi3) Laboratory(机器智能、交互与想象实验室) Laboratory for Intelligent and Safe Automobiles (LISA)(智能与安全汽车实验室) Johns Hopkins University(约翰霍普金斯大学) Center for Medicinal Cannabis Research (CMCR)(医药大麻研究中心)

AI总结 该研究提出了一种融合视觉与音频信息的多模态框架L-LIO,用于提升智能车辆中的驾驶员状态评估与环境理解能力。通过引入音频信号,增强对驾驶员、乘客及车外人员状态的感知,从而在安全气囊部署、自动驾驶接管时间预测等场景中提供更全面的信息支持。实验表明,音频在复杂或语境丰富的场景中能提供关键的安全相关信息,为智能车辆决策系统提供了新的干预路径。

详情
英文摘要

The looking-in-looking-out (LILO) framework has enabled intelligent vehicle applications that understand both the outside scene and the driver state to improve safety outcomes, with examples in smart airbag deployment, takeover time prediction in autonomous control transitions, and driver attention monitoring. In this research, we propose an augmentation to this framework, making a case for the audio modality as an additional source of information to understand the driver, and in the evolving autonomy landscape, also the passengers and those outside the vehicle. We expand LILO by incorporating audio signals, forming the looking-and-listening inside-and-outside (L-LIO) framework to enhance driver state assessment and environment understanding through multimodal sensor fusion. We evaluate three example cases where audio enhances vehicle safety: supervised learning on driver speech audio to classify potential impairment states (e.g., intoxication), collection and analysis of passenger natural language instructions (e.g., "turn after that red building") to motivate how spoken language can interface with planning systems through audio-aligned instruction data, and limitations of vision-only systems where audio may disambiguate the guidance and gestures of external agents. Datasets include custom-collected in-vehicle and external audio samples in real-world environments. Pilot findings show that audio yields safety-relevant insights, particularly in nuanced or context-rich scenarios where sound is critical to safe decision-making or visual signals alone are insufficient. Challenges include ambient noise interference, privacy considerations, and robustness across human subjects, motivating further work on reliability in dynamic real-world contexts. L-LIO augments driver and scene understanding through multimodal fusion of audio and visual sensing, offering new paths for safety intervention.

2602.06412 2026-05-13 cs.CL cs.LG

Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

Daisuke Oba, Danushka Bollegala, Masahiro Kaneko, Naoaki Okazaki

发表机构 * Institute of Science Tokyo(东京科学研究所) University of Liverpool(利物浦大学) Amazon(亚马逊) MBZUAI

AI总结 该研究针对掩码扩散语言模型(Masked Diffusion-LM)在生成过程中重复计算已稳定位置的问题,提出了一种名为SureLock的优化方法。通过在后验分布稳定时锁定该位置,跳过其后续的计算步骤并缓存其注意力键值,从而显著降低计算复杂度。实验表明,该方法在保持生成质量的同时,可减少30%到50%的算法浮点运算量。

Comments Accepted to ICLR 2026

详情
英文摘要

Masked Diffusion Language Models generate sequences via iterative sampling that progressively unmasks tokens. However, they still recompute the attention and feed-forward blocks for every token position at every step -- even when many unmasked tokens are essentially fixed, resulting in substantial waste in compute. We propose SureLock: when the posterior at an unmasked position has stabilized across steps (our sure condition), we lock that position -- thereafter skipping its query projection and feed-forward sublayers -- while caching its attention keys and values so other positions can continue to attend to it. This reduces the dominant per-iteration computational cost from $O(N^2d)$ to $O(MNd)$ where $N$ is the sequence length, $M$ is the number of unlocked token positions, and $d$ is the model dimension. In practice, $M$ decreases as the iteration progresses, yielding substantial savings. On LLaDA-8B, SureLock reduces algorithmic FLOPs by 30--50% relative to the same sampler without locking, while maintaining comparable generation quality. We also provide a theoretical analysis to justify the design rationale of SureLock: monitoring only the local KL at the lock step suffices to bound the deviation in final token probabilities. Our project page is available at https://daioba.github.io/surelock .

2602.06339 2026-05-13 cs.RO cs.AI

Action Hallucination in Generative Vision-Language-Action Models

Harold Soh, Eugene Lim

发表机构 * Department of Computer Science, School of Computing(计算机科学系,计算系) Smart Systems Institute(智能系统研究所)

AI总结 该论文研究了生成式视觉-语言-动作模型在机器人领域中可能出现的动作幻觉问题,即模型生成违反物理约束的动作,进而导致计划层面的失败。研究分析了这类幻觉的成因,指出其源于可行机器人行为与常见模型结构之间的结构性不匹配,并探讨了拓扑、精度和时间跨度三个关键障碍所带来的不可避免的权衡。该工作为生成式机器人策略的失效提供了机制性解释,并为提升其可靠性与可信度指明了理论方向。

Comments 24 pages; updated setup with minor changes to proofs. changed template

详情
英文摘要

Robot Foundation Models, such as VLAs, promise end-to-end generative robot policies with broad generalization. Yet it remains unclear whether they fundamentally resolve the core problem of action generation in embodied settings, or overcome the long-standing challenges of robotics. We address this question by analyzing action hallucinations that violate physical constraints and their extension to plan-level failures. Focusing on latent-variable generative policies, we show that hallucinations can arise from structural mismatches between feasible robot behavior and common model architectures. We study three such barriers -- topological, precision, and horizon -- and show how they impose unavoidable tradeoffs. Our analysis provides mechanistic explanations for reported empirical failures of generative robot policies and suggests principled directions for improving reliability and trustworthiness, without abandoning their expressive power.

2602.04042 2026-05-13 cs.LG stat.ME stat.ML

Partition Tree: Conditional Density Estimation over General Outcome Spaces

Felipe Angelim, Alessandro Leite

发表机构 * Independent Researcher(独立研究者) INSA Rouen Normandy(里昂大学鲁昂分校) Normandy University(诺曼底大学) LITIS Rouen(鲁昂LITIS实验室)

AI总结 本文提出了一种名为 Partition Tree 的新型树状框架,用于在一般结果空间上进行条件密度估计,能够统一处理连续和分类变量。该方法通过数据自适应划分将条件分布建模为分段常数密度,并直接最小化条件负对数似然来学习树结构,提供了一种无需参数假设的可扩展非参数替代方案。此外,文章还引入了 Partition Forest,通过平均条件密度实现对 Partition Tree 的袋外扩展,并在实验中展示了其在概率预测方面的优越性和与最新方法的竞争力。

Comments Code available at https://github.com/felipeangelimvieira/partition_tree

详情
英文摘要

We propose Partition Tree, a novel tree-based framework for conditional density estimation over general outcome spaces that supports both continuous and categorical variables within a unified formulation. Our approach models conditional distributions as piecewise-constant densities on data-adaptive partitions and learns trees by directly minimizing conditional negative log-likelihood. This yields a scalable, nonparametric alternative to existing probabilistic trees that does not make parametric assumptions about the target distribution. We further introduce Partition Forest, a bagging extension obtained by averaging conditional densities. Empirically, we demonstrate improved probabilistic prediction over CART-style trees and competitive performance compared to state-of-the-art probabilistic tree methods and Random Forests.