arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.30431 2026-06-01 cs.CV

DTG-Restore: Training-Free Diffusion Refinement for Generative Video Super-Resolution

DTG-Restore: 无需训练的视频超分辨率扩散精炼

Hidir Yesiltepe, Koutilya PNVR, Gaurav Pathak, Navaneeth Bodla, Bharat Singh, Pinar Yanardag, Jinrong Xie

发表机构 * Virginia Tech（弗吉尼亚理工大学）； Adobe（Adobe公司）

AI总结提出解耦时间引导（DTG）方法，通过时间解耦条件与无条件分支，无需训练即可增强扭曲低分辨率视频，提升结构保真度和时间稳定性。

详情

AI中文摘要

近期视频扩散模型的进展实现了显著的生成保真度，但利用这些先验进行修复仍受限于标准无分类器引导中条件分支与无条件分支的强耦合。我们提出一种无需训练的框架，通过时间解耦这些信号来增强扭曲和低分辨率视频。我们提出的解耦时间引导（DTG）在更干净的扩散时间步评估无条件分支，提供一个前瞻先验，在抑制扭曲内容复制的同时保持几何结构。这种时间偏置在采样过程中逐渐减弱，使模型能够从结构校正过渡到细节精炼，无需重新训练。结合任何现成的修复模块以即插即用的方式，我们的方法在AI生成和真实世界视频中均能改善感知一致性并恢复合理的结构。为便于评估，我们整理了GenWarp480基准，包含从多种文本到视频模型合成的4400个扭曲480p视频。GenWarp480专注于特征性生成退化，如扭曲面部、身体错位和空间伪影，为评估对生成错误的鲁棒性提供了专门构建的测试平台。大量实验表明，我们的方法在无需任何模型训练的情况下，在结构保真度和时间稳定性方面取得了显著改进。

英文摘要

Recent progress in video diffusion models has enabled remarkable generative fidelity, yet leveraging these priors for restoration remains limited by the strong coupling between conditional and unconditional branches in standard classifier-free guidance. We introduce a training-free framework that enhances distorted and low-resolution videos by decoupling these signals in time. Our proposed Decoupled Time Guidance (DTG) evaluates the unconditional branch at a cleaner diffusion timestep, providing a lookahead prior that preserves geometry while suppressing replication of warped content. This temporal bias is annealed throughout sampling, allowing the model to transition from structure correction to detail refinement without retraining. Combined with any off-the-shelf restoration module in a plug-and-play manner, our approach improves perceptual coherence and restores plausible structure in AIgenerated and real-world videos alike. To facilitate evaluation, we curate GenWarp480, a benchmark of 4,400 distorted 480p videos synthesized from diverse text-to-video models. GenWarp480 focuses on characteristic generative degradations such as warped faces, body misalignments, and spatial artifacts, providing a purpose-built testbed for assessing robustness to generative errors. Extensive experiments demonstrate that our method achieves significant improvements in structural fidelity and temporal stability without any model training.

URL PDF HTML ☆

赞 0 踩 0

2605.30415 2026-06-01 cs.CL cs.AI

Domain Adaptation and Reasoning Frameworks in Language Models: A Controlled Experiment with Historical Cosmology

语言模型中的领域适应与推理框架：以历史宇宙学为受控实验

Francesco De Bernardis

发表机构 * Independent Researcher（独立研究者）

AI总结通过历史宇宙学受控实验，研究领域适应如何重塑语言模型的解释行为，发现适应主要改变解释框架而非直接改变立场。

Comments 17 pages, 3 figures

详情

AI中文摘要

我们以历史宇宙学为受控环境，研究领域适应如何重塑语言模型中的解释行为。在第一阶段，我们在一个去除明确日心说引用的前哥白尼语料库上从头训练一个小型语言模型，并评估地动说或日心说延续是否仍然出现。在第二阶段，我们使用QLoRA在同一语料库上微调一个更大的预训练模型，以研究适应如何修改解释框架和宇宙学立场。模型输出使用LLM-as-judge框架进行评估，该框架标记宇宙学立场（地心说、日心说或模糊）和解释框架（前现代与现代）。在受限的第一阶段，较小的模型偶尔生成局部的地动说延续，但这些延续全局不稳定，不足以支持连贯的宇宙学推理。在第二阶段，微调导致向现代前解释框架的大幅且统计显著的转变，而条件宇宙学立场分布在这些框架内相对稳定。因此，地心说输出的增加主要源于解释机制的重新分布，而非立场的直接修改。这些结果表明，领域适应可能主要重塑生成延续的语言框架，而立场的变化则次要地源于这些转变。

英文摘要

We investigate how domain adaptation reshapes explanatory behavior in language models using historical cosmology as a controlled setting. In Phase 1, we train a small language model from scratch on a pre-Copernican corpus from which explicit heliocentric references were removed, and evaluate whether Earth-motion or heliocentric continuations nevertheless emerge. In Phase 2, we fine-tune a larger pretrained model using QLoRA on the same corpus in order to study how adaptation modifies explanatory framing and cosmological stance. Model outputs are evaluated using an LLM-as-judge framework that labels both cosmological stance (geocentric, heliocentric, or ambiguous) and explanatory frame (premodern versus modern). In the constrained setting of Phase 1, the smaller models occasionally generate local Earth-motion continuations, but these remain globally unstable and insufficient to support coherent cosmological reasoning. In Phase 2, fine-tuning induces a large and statistically significant shift toward premodern explanatory framing, while the conditional cosmological stance distributions remain comparatively stable within those frames. As a result, increases in geocentric outputs arise primarily from redistribution over explanatory regimes rather than from direct modification of stance. These results suggest that domain adaptation may primarily reshape the linguistic frameworks from which continuations are generated, with changes in stance emerging secondarily from those shifts.

URL PDF HTML ☆

赞 0 踩 0

2605.30409 2026-06-01 cs.CV cs.AI

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

SANA-Streaming: 基于混合扩散Transformer的实时流式视频编辑

Yuyang Zhao, Yicheng Pan, Qiyuan He, Jincheng Yu, Junsong Chen, Tian Ye, Haozhe Liu, Enze Xie, Song Han

发表机构 * NVIDIA ； MIT（麻省理工学院）； THU（清华大学）； NUS（新加坡国立大学）； HKU（香港大学）

AI总结提出系统-算法协同设计的SANA-Streaming框架，通过混合扩散Transformer架构、循环反向正则化训练策略和高效系统协同设计，在消费级GPU上实现高分辨率实时流式视频编辑，达到1280×704分辨率24 FPS的端到端性能。

详情

AI中文摘要

实时流式视频到视频编辑（V2V）对于直播和游戏等交互式应用至关重要，但由于对时间一致性和推理吞吐量的严格要求，它仍然是一个严峻的挑战。在本文中，我们提出了SANA-Streaming，一个系统-算法协同设计的框架，用于在消费级GPU上进行高分辨率、实时流式视频编辑，具有以下三个核心设计：（1）混合扩散Transformer架构在部分块中引入softmax注意力以提高局部建模能力，同时保持线性层的效率。（2）循环反向正则化是一种新颖的训练策略，通过流匹配从生成内容预测源帧来强制语义一致性，无需成对的长编辑视频即可提高时间一致性。（3）高效系统协同设计结合了融合GDN内核和针对NVIDIA Blackwell（RTX 5090）架构优化的混合精度量化（MPQ）。通过分析实际吞吐量，我们的MPQ在保持生成质量的同时最大化Tensor Core利用率。最终系统在单个RTX 5090 GPU上以24 FPS的端到端帧率实现实时1280×704分辨率编辑，其中DiT核心运行在58 FPS。实验结果表明，我们的协同设计方法在时间一致性和系统吞吐量方面均显著优于现有最先进方法。

英文摘要

Real-time streaming video-to-video editing (V2V) is critical for interactive applications such as live broadcasting and gaming, yet it remains a formidable challenge due to the stringent requirements for temporal consistency and inference throughput. In this paper, we present SANA-Streaming, a system-algorithm co-designed framework for high-resolution, real-time streaming video editing on consumer GPUs, with the following three core designs: (1) Hybrid Diffusion Transformer architecture introduces softmax attention in part of the blocks to improve local modeling capabilities while preserving the efficiency of linear layers. (2) Cycle-Reverse Regularization is a novel training strategy that enforces semantic consistency by predicting source frames from generated content via flow matching, improving temporal consistency without requiring paired long edited videos. (3) Efficient System Co-design combines fused GDN kernels and Mixed-Precision Quantization (MPQ) optimized for the NVIDIA Blackwell (RTX 5090) architecture. By profiling real-world throughput, our MPQ maximizes Tensor Core utilization while maintaining generation quality. The resulting system achieves real-time 1280 x 704 resolution editing at 24 end-to-end FPS on a single RTX 5090 GPU, with the DiT core running at 58 FPS. Experimental results demonstrate that our co-design approach significantly outperforms existing SOTA methods in both temporal coherence and system throughput.

URL PDF HTML ☆

赞 0 踩 0

2605.30393 2026-06-01 cs.LG cs.AI cs.CR

NumLeak: Public Numeric Benchmarks as Latent Labels in Foundation Models

NumLeak: 基础模型中的公开数值基准作为潜在标签

Anany Kotawala

发表机构 * Princeton University（普林斯顿大学）

AI总结提出NumLeak框架，通过API边界探测和开源因果模型的白盒验证，揭示基础模型在预训练中记忆公开数值基准，导致评估高估泛化能力。

Comments 23 pages, 12 figures, 17 tables. Accepted at the ICML 2026 Workshop on the Impact of Memorization on Trustworthy Foundation Models (MemFM)

详情

AI中文摘要

公开数值基准出现在预训练中，因此基于日期进行评估可能测量的是记忆性回忆而非样本外技能。我们引入NumLeak，一个结合生产模型API边界探测与开源因果模型白盒受控验证的测量框架。顶级前沿LLM在3种子池化后，对Fama-French市场超额收益的回忆皮尔逊相关系数r=0.97-0.99，同时五个兄弟因子在25个基点内误差不超过0.15；在美国失业率、CPI通胀和NOAA温度上观察到类似保真度。在近期发布的保留集上，解析率骤降至21-57%，但在回答的月份上r仍约为0.99，拒绝-回忆不对称性符合记忆通道的预测。白盒实验重现了剂量反应，对数概率排名检测到开放生成遗漏的记忆，意味着封闭API黑盒探测低估了该通道。一个Sonnet“日期到市场情绪”回归与真实Mkt-RF的相关性r=0.74，在残差化模型自身回忆后降至r=0.02。一行系统提示防御在概念和历史叙事查询上以接近零的效用成本阻止了99.8%的非自适应单轮后缀攻击集。

英文摘要

Public numeric benchmarks appear in pretraining, so an evaluation that conditions on a date may be measuring memorized recall rather than out-of-sample skill. We introduce NumLeak, a measurement framework that combines API-boundary probes on production models with a white-box controlled validation on an open causal LM. Top-tier frontier LLMs recall the Fama-French market excess return at 3-seed pooled Pearson r=0.97-0.99 while staying within 0.15 within-25bps on the five sibling factors; comparable fidelity appears on U.S. unemployment, CPI inflation, and NOAA temperature. On a recent-release holdout, parse rate collapses to 21-57% but r stays at approximately 0.99 on months answered, the refuse-or-recall asymmetry a memorized channel predicts. The white-box experiment reproduces the dose-response, and logprob ranking detects memorization that open-ended generation misses, implying closed-API black-box probes understate the channel. A Sonnet "date to market-sentiment" regression that correlates with true Mkt-RF at r=0.74 collapses to r=0.02 once the model's own recall is residualized out. A one-line system-prompt defense blocks 99.8% of a non-adaptive single-turn suffix attack set at near-zero utility cost on conceptual and historical-narrative queries

URL PDF HTML ☆

赞 0 踩 0

2605.30388 2026-06-01 cs.LG

A Novel Evaluation Metric for Unsupervised Learning in AIS-Based Maritime Anomaly Detection: MADQI

基于AIS的海事异常检测中无监督学习的新型评估指标：MADQI

Ismet Gocer, Zakirul Bhuiyan, Raza Hasan, Shakeel Ahmad

发表机构 * Southampton Solent University School of Technology and Maritime Industries（索尔森大学技术与海运学院）

AI总结提出一种无需标签数据的海事异常检测质量指标MADQI，通过结合四个子指标来评估无监督学习模型的异常检测性能。

Comments 26 pages, A new Eval Metric for Unsupervised Machine Learning

详情

AI中文摘要

本文介绍了一个新的系统框架，用于检测海事自动识别系统（AIS）数据集中的异常。这些异常包括与速度、位置跳跃、时间间隔和转向角度相关的异常船舶行为。尽管诸如孤立森林之类的无监督学习算法被广泛用于检测异常船舶运动，但它们通常缺乏系统且有意义的评估措施。为了解决这一局限性，我们提出了一种称为海事异常检测质量指标（MADQI）的新型质量指标。所提出的MADQI是一个复合指标，旨在评估机器学习模型的异常检测性能，而无需标记数据。该框架使用哈弗辛距离计算来分析AIS数据集，并根据空间和行为特征识别异常。所提出的MADQI评估框架整合了四个相互关联的指标：异常率一致性（ARC）、物理合理性评分（PPS）、评分分布分离度（SDS）和极端案例证据（ECE）。这些指标通过使用多块评估和自适应缩放技术的自动归一化进行组合。在AIS数据集上的实验结果表明，所提出的框架实现了80.37%的MADQI分数，证明了其在无监督异常检测中的有效性。特别是，该算法在识别异常船舶行为方面表现强劲。在MADQI的各个组成部分中，ECE和ARC分别达到了0.907和1.000的分数，表明其在检测极端异常和保持异常率一致性方面具有出色的能力。总体而言，这些结果令人鼓舞，并表明所提出的框架为评估海事AIS数据中的无监督异常检测提供了一种可靠且有意义的方法。

英文摘要

This paper introduces a new systematic framework for detecting anomalies in maritime Automatic Identification System (AIS) datasets. These anomalies include abnormal vessel behaviours related to speed, position jumps, time gaps, and turn angles. Although unsupervised learning algorithms such as Isolation Forest are widely used for detecting anomalous vessel movements, they often lack systematic and meaningful evaluation measures. To address this limitation, we propose a novel quality metric called Maritime Anomaly Detection Quality Index (MADQI). The prosed MADQI is a composite index designed to evaluate the anomaly detection performance of machine learning models without requiring labelled data. The proposed framework uses Haversine distance calculations to analyse AIS datasets and identify anomalies based on their spatial and behavioural characteristics. The proposed MADQI evaluation framework integrates four interconnected metrics: Anomaly Rate Consistency (ARC), Physical Plausibility Score (PPS), Score Distribution Separation (SDS), and Extreme Case Evidence (ECE). These metrics are combined through automatic normalisation using multi-chunk evaluation and adaptive scaling techniques. Experimental results on the AIS dataset show that the proposed framework achieved a MADQI score of 80.37%, demonstrating its effectiveness for unsupervised anomaly detection. In particular, the algorithm performed strongly in identifying abnormal vessel behaviour. Among the individual MADQI components, ECE and ARC achieved scores of 0.907 and 1.000, respectively, indicating excellent capability in detecting extreme anomalies and maintaining anomaly rate consistency. Overall, these results are encouraging and demonstrate that the proposed framework provides a reliable and meaningful approach for evaluating unsupervised anomaly detection in maritime AIS data.

URL PDF HTML ☆

赞 0 踩 0

2605.30387 2026-06-01 cs.LG cs.AI cs.CV eess.SP

Functional MRI Time Series Generation via Wavelet-Based Image Transform and Spectral Flow Matching for Brain Disorder Identification

基于小波图像变换和频谱流匹配的功能磁共振时间序列生成用于脑疾病识别

Hwa Hui Tew, Junn Yong Loo, Fang Yu Leong, Julia K. Lau, Ding Fan, Hernando Ombao, Raphaël C. -W. Phan, Chee Pin Tan, Chee-Ming Ting

发表机构 * School of Information Technology, Monash University Malaysia（墨尔本大学马来西亚分校信息科技学院）； School of Engineering, Monash University Malaysia（墨尔本大学马来西亚分校工程学院）； Statistics Program, King Abdullah University of Science and Technology（国王阿卜杜勒·阿齐兹大学科学与技术学院统计学项目）

AI总结提出双频谱流匹配（DSFM）框架，通过离散小波变换和离散余弦变换对BOLD信号进行双频表示，结合频谱流匹配生成类条件余弦频率表示，再经逆变换重建生理上合理的时域BOLD信号，以改善下游脑网络分类。

Comments Accepted at the Fourteenth International Conference on Learning Representations (ICLR 2026)

详情

AI中文摘要

功能磁共振成像（fMRI）通过测量随时间变化的血氧水平依赖（BOLD）信号，提供对动态脑活动的非侵入性访问。然而，fMRI采集的资源密集型特性限制了数据驱动脑分析模型所需的高保真样本的可用性。虽然现代生成模型可以合成fMRI数据，但它们在复制原始BOLD信号固有的非平稳性、复杂的时空动态和生理变化方面仍然面临挑战。为了解决这些挑战，我们提出了双频谱流匹配（DSFM），一种新颖的fMRI生成框架，它将BOLD信号的双频表示与频谱流匹配级联起来。具体来说，我们的框架首先通过离散小波变换（DWT）将BOLD信号转换为小波分解图，以捕获全局瞬态和多尺度变化，并将其投影到跨脑区和时间的离散余弦变换（DCT）空间中，以利用低频主导BOLD系数的局部能量压缩。随后，训练一个频谱流匹配模型来生成类条件余弦频率表示。通过逆DCT和逆DWT操作重建生成的样本，以恢复生理上合理的时域BOLD信号。这种双变换方法施加了结构化的频率先验，并保留了关键的生理脑动力学。最终，我们通过改进的下游基于fMRI的脑网络分类证明了我们方法的有效性。代码可在 https://github.com/htew0001/DSFM.git 获取。

英文摘要

Functional Magnetic Resonance Imaging (fMRI) provides non-invasive access to dynamic brain activity by measuring blood oxygen level-dependent (BOLD) signals over time. However, the resource-intensive nature of fMRI acquisition limits the availability of high-fidelity samples required for data-driven brain analysis models. While modern generative models can synthesize fMRI data, they often remain challenging in replicating their inherent non-stationarity, intricate spatiotemporal dynamics, and physiological variations of raw BOLD signals. To address these challenges, we propose Dual-Spectral Flow Matching (DSFM), a novel fMRI generative framework that cascades dual frequency representation of BOLD signals with spectral flow matching. Specifically, our framework first converts BOLD signals into a wavelet decomposition map via a discrete wavelet transform (DWT) to capture globalized transient and multi-scale variations, and projects into the discrete cosine transform (DCT) space across brain regions and time to exploit localized energy compaction of low-frequency dominant BOLD coefficients. Subsequently, a spectral flow matching model is trained to generate class-conditioned cosine-frequency representation. The generated samples are reconstructed through inverse DCT and inverse DWT operations to recover physiologically plausible time-domain BOLD signals. This dual-transform approach imposes structured frequency priors and preserves key physiological brain dynamics. Ultimately, we demonstrate the efficacy of our approach through improved downstream fMRI-based brain network classification. The code is available at https://github.com/htew0001/DSFM.git .

URL PDF HTML ☆

赞 0 踩 0

2605.30383 2026-06-01 cs.RO cs.AI

Structured interactions improve distributed coordination beyond model scaling in a real-world multi-robot system

结构化交互在真实世界多机器人系统中超越模型规模提升分布式协调能力

Junping Wang, Zhizhong Zhang, Yongqiang Tang, Geng Zheng, Jiaming Zhang, Shiji Song, Yanmei Li, Yushan Ma

发表机构 * State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences（多模态人工智能系统国家重点实验室，自动化研究所，中国科学院）； School of Artificial Intelligence, University of Chinese Academy of Sciences（中国科学院大学人工智能学院）； School of Computer Science and Technology, East China Normal University（华东师范大学计算机科学与技术学院）； Department of Automation, Tsinghua University（清华大学自动化系）； Liupanshan Laboratory, Ningxia University（宁夏大学鲁班实验室）

AI总结通过真实多机器人实验，发现模块化层次化交互拓扑相比增加模型规模能更显著提升协调性能。

详情

AI中文摘要

提升单个机器人能力是常见但昂贵的做法。本文研究真实多机器人协调中的系统级设计问题：在硬件预算匹配的情况下，重构机器人间的通信是否比增加机载模型规模带来更大收益？使用10个物理机器人执行代表性的运输与建图任务（每种条件5次运行，共60次运行），我们发现从全连接切换到模块化层次化交互可将归一化性能提升47分（0-100分），而将神经网络隐藏层大小加倍最多提升9分。嵌套混合效应模型比较显示，拓扑对模型拟合的改善远大于规模。该模式在独立的SMAC复制实验中得到确认；异构基准重分析提供次要支持性一致性检查而非主要证据。在仿真校准的外推中观察到超过1024个隐藏单元的性能饱和，但未直接在硬件上验证。这些结果表明，在测试系统和任务设置中，交互结构可发挥主导作用，但更广泛的定量泛化仍有待建立。

英文摘要

Scaling individual robot capabilities is common but costly. Here we investigate a system-level design question in real-world multi-robot coordination: given matched hardware budgets, does restructuring communication among robots yield larger gains than increasing onboard model size? Using a representative transport-and-mapping task with 10 physical robots (5 runs per condition, 60 runs total), we find that switching from fully connected to modular hierarchical interactions improves normalised performance by 47 points (0--100), whereas doubling neural network hidden size yields at most 9 points. Nested mixed-effects model comparisons show a substantially larger improvement in model fit for topology than for scale. The pattern is confirmed in independent SMAC replications; heterogeneous benchmark reanalyses provide secondary supporting consistency checks rather than primary evidence. Performance saturation beyond 1024 hidden units is observed in simulation-calibrated extrapolation, not directly on hardware. These results indicate that interaction structure can play a dominant role within the tested system and task setting, while broader quantitative generalisation remains to be established.

URL PDF HTML ☆

赞 0 踩 0

2605.30381 2026-06-01 cs.LG cs.AI

When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception

当LLM学会一致错误：合成欺骗的线性表示的多模型研究

Vahideh Zolfaghari

发表机构 * Algoverse AI Research ； Medical Sciences Education Research Center, Mashhad University of Medical Sciences（马什哈德大学医学科学教育研究中心）； Student Research Committee, Department of Health Information Technology and Management, Medical Informatics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences（谢赫·贝赫什提大学医学科学学院学生研究委员会，健康信息科技与管理系，医学信息学）

AI总结通过LoRA微调五个Transformer模型的诚实与欺骗变体，使用线性探针检测合成欺骗，发现早期层即可达到近完美AUC，支持线性表示假说，并揭示两种表示机制。

详情

AI中文摘要

欺骗性对齐（模型保持准确的内部表示同时故意产生错误输出）仍然是AI安全的核心挑战。虽然战略性欺骗是主要的长期关注点，但通过直接优化错误答案诱导的合成不诚实为研究学习欺骗的表示基础提供了受控测试平台。我们引入了一个多模型范式，其中五个Transformer模型（Pythia-1.4B、Gemma-2-2B/9B、Qwen2.5-7B、Llama-3.1-8B）的诚实和欺骗变体使用LoRA在相同问题分布上进行微调。在平均池化隐藏状态上训练的线性探针在四个架构的1-3层即可检测到合成欺骗，AUC接近完美（≥0.99），而Pythia-1.4B达到峰值0.705。逻辑回归探针始终匹配或优于MLP探针，支持线性表示假说。在TruthfulQA上训练的探针以近乎零损失（ΔAUC≈0）泛化到保留的MMLU主题。深层表示对高斯噪声表现出强鲁棒性，其中Gemma-2模型表现出卓越的稳定性。对Fisher判别比、有效秩、质心几何、方向稳定性、跨域对齐和校准（ECE）的机制分析揭示了两种机制：Pythia/Llama/Qwen中的表示坍缩与Gemma-2中的高维保持。在所有模型中，欺骗方向在更深层逐渐巩固，在1-4层可实现最优校准（除Pythia外ECE<0.01）。这些结果表明，通过适度的监督微调，鲁棒、域不变的欺骗表示可以迅速固化，对基于激活的监控具有启示意义。

英文摘要

Deceptive alignment, in which models maintain accurate internal representations while deliberately producing false outputs, remains a central challenge in AI safety. While strategic deception is the primary long-term concern, synthetic dishonesty - induced via direct optimization on incorrect answers - provides a controlled testbed for studying the representational basis of learned deception. We introduce a multi-model paradigm in which honest and deceptive variants of five transformer models (Pythia-1.4B, Gemma-2-2B/9B, Qwen2.5-7B, Llama-3.1-8B) are fine-tuned using LoRA on the same question distribution. Linear probes trained on mean-pooled hidden states detect synthetic dishonesty with near-perfect AUC (greater than or equal to 0.99) as early as layers 1-3 in four architectures, while Pythia-1.4B reaches a peak of 0.705. Logistic regression probes consistently match or outperform MLP probes, supporting the Linear Representation Hypothesis. Probes trained on TruthfulQA generalize with near-zero loss (Delta AUC approx. 0) to held-out MMLU subjects. Late-layer representations show strong robustness to Gaussian noise, with Gemma-2 models exhibiting exceptional stability. Mechanistic analysis of Fisher Discriminant Ratio, effective rank, centroid geometry, directional stability, cross-domain alignment, and calibration (ECE) reveals two regimes: representational collapse in Pythia/Llama/Qwen versus high-dimensional preservation in Gemma-2. Across all models, the dishonesty direction consolidates progressively in deeper layers, with optimal calibration (ECE less than 0.01 except Pythia) achievable in layers 1-4. These results demonstrate that robust, domain-invariant dishonesty representations can be rapidly entrenched via modest supervised fine-tuning, with implications for activation-based monitoring.

URL PDF HTML ☆

赞 0 踩 0

2605.30376 2026-06-01 cs.LG cs.AI

Unicorn: Scaling High-Dimensional Time Series Forecasting via Universal Correlation Modeling

Unicorn: 通过通用相关性建模实现高维时间序列的规模化预测

Haochen Yuan, Yichen Song, Yunbo Wang, Xiaokang Yang

发表机构 * MoE Key Lab of Artificial Intelligence（人工智能大规模并行计算实验室）； AI Institute（人工智能研究院）； School of Computer Science（计算机科学学院）； Shanghai Jiao Tong University（上海交通大学）

AI总结提出Unicorn框架，通过潜在原型码本解耦相关性建模与特定通道身份，实现跨异构数据集的可扩展多数据集预训练，在少样本迁移场景中显著优于现有模型。

详情

AI中文摘要

现代时间序列架构面临一个基本权衡：通道独立模型随着数据量增加可扩展性好，但忽略了关键的通道间依赖性；而通道依赖模型具有表达力，但仍然是“维度受限的”，难以泛化到异构数据集。为了弥合这一差距，我们引入了Unicorn（通用相关网络），一个用于高维时间序列的可扩展、多数据集预训练框架。Unicorn的核心是一个潜在原型码本，它将相关性建模与特定通道身份解耦。通过将异构通道投影到共享潜在空间，Unicorn学习与身份无关的、可复用的交互模式，这些模式可以跨具有不同维度和语义的领域迁移。大量实验表明，Unicorn显著优于最先进的预测架构，特别是在少样本迁移场景中，为多变量时间序列基础模型提供了一条可扩展的路径。

英文摘要

Modern time series architectures face a fundamental trade-off: channel-independent models scale well with increasing data volume but ignore critical inter-channel dependencies, while channel-dependent models are expressive but remain ``dimension-bounded'', struggling to generalize across heterogeneous datasets.To bridge this gap, we introduce Unicorn (Universal Correlation Network), a framework for scalable, multi-dataset pretraining on high-dimensional time series. At the core of Unicorn is a latent prototype codebook that decouples correlation modeling from specific channel identities. By projecting heterogeneous channels into a shared latent space, UniCorN learns identity-agnostic, reusable interaction patterns that transfer across domains with diverse dimensionalities and semantics. Extensive experiments show that Unicorn significantly outperforms state-of-the-art forecasting architectures, particularly in few-shot transfer scenarios, offering a scalable path toward multivariate time series foundation models.

URL PDF HTML ☆

赞 0 踩 0

2605.30374 2026-06-01 cs.LG

Gait2Hip-60: A Unified Deep Learning Benchmark for Predicting Hip Muscle Forces and Joint Moments from Multi-Cadence Gait Kinematics

Gait2Hip-60：基于多节奏步态运动学预测髋部肌肉力和关节力矩的统一深度学习基准

Jiaqi Zhang, Ji Hou, Qing Sun, Xianzhi Gao, Bo Huo

发表机构 * Capital University of Physical Education and Sports（首都体育学院）； Beijing Institute of Technology（北京理工大学）； Beijing Key Laboratory of Interdisciplinary Intelligent Technologies of Sports, Medicine and Engineering（北京体育医学与工程交叉智能技术重点实验室）

AI总结本研究提出一个深度学习框架，利用LSTM、Transformer和Mamba三种模型从下肢步态运动学直接预测髋部肌肉力和关节力矩，在60名健康受试者数据上评估，发现Transformer表现最佳，并在股骨头坏死患者零样本测试中保持中等预测能力。

Comments 16 pages, 9 figures. Code and dataset publicly available

详情

AI中文摘要

在步态过程中估计髋部肌肉力和关节力矩通常依赖于肌肉骨骼仿真，这种方法信息丰富但耗时且难以应用于临床。本研究开发了一个深度学习框架，直接从下肢步态运动学预测这些髋部动力学参数，并在统一协议下比较了三种代表性序列模型。步态数据来自60名健康成年人在三种节拍器引导的节奏条件下的行走。使用十个双侧下肢关节角度作为输入，以OpenSim导出的髋部肌肉力和髋关节力矩作为参考输出。训练并评估了LSTM、Transformer和Mamba三种深度学习模型，采用相同的受试者级别划分、预处理流程和评价指标。随后，最佳模型直接在一个由9名股骨头坏死（ONFH）患者组成的外部队列上进行测试，无需重新训练。在健康受试者基准测试中，Transformer在髋部肌肉力预测（RMSE = 1.33 N/kg, MAE = 0.57 N/kg, R2 = 0.819）和髋关节力矩预测（RMSE = 0.11 Nm/kg, MAE = 0.07 Nm/kg, R2 = 0.862）方面均取得了最佳的受试者级别平均性能，且在不同步行节奏下具有相似优势。在零样本外部验证中，Transformer在ONFH患者中保留了中等预测能力，髋部肌肉力预测（RMSE = 1.51 N/kg, MAE = 0.70 N/kg, R2 = 0.537）和髋关节力矩预测（RMSE = 0.17 Nm/kg, MAE = 0.12 Nm/kg, R2 = 0.569）。这些发现支持了从步态运动学估计髋部动力学的可行性，将Transformer确定为强基线，并强调了在临床应用前需要进行更广泛的病理验证和改进泛化能力。

英文摘要

Estimating hip muscle forces and joint moments during gait typically relies on musculoskeletal simulation, which is informative but time-consuming and difficult to apply in clinical settings. This study developed a deep learning framework to predict these hip dynamics parameters directly from lower-limb gait kinematics and compared three representative sequence models under a unified protocol. Gait data were collected from 60 healthy adults under three metronome-guided cadence conditions. Ten bilateral lower-limb joint angles were used as inputs, and OpenSim-derived hip muscle forces and hip joint moments were used as reference outputs. Three deep learning models of LSTM, Transformer, and Mamba were trained and evaluated using the same subject-level split, preprocessing pipeline, and metrics. The best model was then directly tested on an external cohort of 9 patients with osteonecrosis of the femoral head (ONFH) without retraining. In the healthy-subject benchmark, Transformer achieved the best subject-level mean performance for both hip muscle force prediction (RMSE = 1.33 N/kg, MAE = 0.57 N/kg, R2 = 0.819) and hip joint moment prediction (RMSE = 0.11 Nm/kg, MAE = 0.07 Nm/kg, R2 = 0.862), with similar advantages across walking cadences. In zero-shot external validation, Transformer retained moderate predictive ability in ONFH for hip muscle force prediction (RMSE = 1.51 N/kg, MAE = 0.70 N/kg, R2 = 0.537) and hip joint moment prediction (RMSE = 0.17 Nm/kg, MAE = 0.12 Nm/kg, R2 = 0.569). These findings support the feasibility of estimating hip dynamics from gait kinematics, identify Transformer as a strong baseline, and highlight the need for broader pathological validation and improved generalization before clinical application.

URL PDF HTML ☆

赞 0 踩 0

2605.30365 2026-06-01 cs.SD cs.AI eess.AS

Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation

心理伤害：面向检索增强文本到音乐生成的标题投毒攻击

Yizhu Wen, Shuhao Zhang, Nan Zhang, Long Cheng, Hanqing Guo

发表机构 * Clemson University（克莱姆森大学）； Michigan State University（密歇根州立大学）

AI总结提出双层标题投毒策略，通过向音乐知识库注入少量恶意标题，使检索增强文本到音乐系统生成偏离用户意图的音乐，暴露了系统的完整性风险。

Comments This paper was accepted by the S&P 2026 ArtSec Workshop

详情

AI中文摘要

检索增强文本到音乐（TTM）系统通过从音乐标题数据集中检索的标题来增强未指定的用户提示。这种设计引入了对音乐知识数据库的完整性依赖。我们表明，攻击者可以通过注入少量精心制作的音乐标题来毒化数据库，导致系统检索恶意标题，从而偏置提示增强并使生成偏离用户预期功能，而无需修改用户提示、检索器或生成器。为了实现音乐标题投毒攻击，我们提出了一种双层标题投毒策略，该策略保留高级检索锚点，同时注入低级声学描述符，以将提示增强和下游音乐生成引导至攻击者选择的目标意图。在MusicCaps知识数据库、CLAP检索器和MusicGen流水线中，被投毒的生成结果显著接近攻击者的目标，同时与原始用户查询保持可比的对齐。这些结果暴露了检索增强创意AI系统的实际完整性风险。我们的演示可在以下网址找到：https://yizhu-wen.github.io/Mental-Damage/

英文摘要

Retrieval-augmented text-to-music (TTM) systems augment underspecified user prompts using captions retrieved from a music caption dataset. This design introduces an integrity dependency on the music knowledge database. We show that an attacker can poison the database by injecting a small number of crafted music captions, causing the system to retrieve malicious captions that bias prompt augmentation and steer generation away from the user's intended function, without modifying the user prompt, retriever, or generator. To achieve the music caption poisoning attack, we propose a dual-layer caption poisoning strategy that preserves high-level retrieval anchors while injecting low-level acoustic descriptors to steer prompt augmentation and downstream music generation toward an attacker-chosen target intent. In a MusicCaps knowledge database, CLAP retriever, and MusicGen pipeline, poisoned generations move substantially closer to the attacker's target, while remaining comparably aligned with the original user query. These results expose a practical integrity risk for retrieval-augmented creative AI systems. Our demo can be found at: https://yizhu-wen.github.io/Mental-Damage/

URL PDF HTML ☆

赞 0 踩 0

2605.30358 2026-06-01 cs.LG quant-ph

QASM-Eval: A Dataset to Train and Evaluate LLMs on OpenQASM-3 Beyond Quantum Circuits

QASM-Eval：用于训练和评估LLM在超越量子电路的OpenQASM-3上的数据集

Zhenxiao Fu, Lei Jiang, Fan Chen

发表机构 * Indiana University Bloomington（印第安纳大学布卢明顿分校）

AI总结针对LLM在OpenQASM-3硬件级编程上的训练与评估空白，构建了包含专家验证测试集和训练集的数据集，覆盖经典逻辑、时序调度、脉冲控制等，并通过扩展验证器自动验证，实验表明微调后LLM性能显著提升。

详情

AI中文摘要

量子计算仍处于含噪中等规模量子（NISQ）时代，其性能受到噪声的高度限制。解决这一限制通常需要超越门序列电路规范的硬件相关能力，包括用于量子纠错（QEC）的中间电路测量和经典反馈、用于动态解耦（DD）的精确时序控制，以及用于校准的脉冲级波形访问。OpenQASM-3正是为了暴露这些能力而引入的，提供了硬件级编程接口。然而，尽管大语言模型在代码生成方面取得了快速进展，目前仍没有专门设计用于训练和评估LLM在涉及高级硬件导向特性的OpenQASM-3程序上的数据集。为填补这一空白，我们推出了QASM-Eval，这是首个专门设计用于训练和评估LLM在OpenQASM-3上的全面数据集。QASM-Eval并非专注于量子算法设计或推理，而是明确针对该语言的硬件相关特性。QASM-Eval包含一个由专家验证的100个任务的测试集和一个4000个任务的训练集，系统性地涵盖了经典逻辑、时序调度、脉冲控制以及复杂的实际工作流程。为了自动验证生成的程序，我们使用扩展的验证器检查语法、量子态和程序时间线。我们的评估表明，虽然最先进的LLM在OpenQASM-3编码任务上表现困难，但在QASM-Eval上进行针对性微调后取得了显著提升。QASM-Eval为加速开发NISQ时代硬件相关量子编程的可靠LLM助手提供了关键的基准和训练基础。数据和代码：https://github.com/fuzhenxiao/QASM-Eval

英文摘要

Quantum computing remains in the Noisy Intermediate-Scale Quantum (NISQ) era, where the performance is highly constrained to noise. Addressing the limitation often requires hardware-facing capabilities beyond gate-sequence circuit specification, including mid-circuit measurement and classical feedback for quantum error correction (QEC), precise timing control for dynamical decoupling (DD), and pulse-level waveform access for calibration. OpenQASM-3 was introduced to expose exactly these capabilities, providing a hardware-level programming interface. However, despite the rapid progress of large language models in code generation, there is still no dataset specifically designed to train and evaluate LLMs on OpenQASM-3 programs that involve its advanced hardware-oriented features. To address this gap, we introduce QASM-Eval, the first comprehensive dataset designed to train and evaluate LLMs on OpenQASM-3. Rather than focusing on quantum algorithm design or reasoning, QASM-Eval explicitly targets the language's hardware-facing features. QASM-Eval comprises an expert-verified test set of 100 tasks and a training set of 4,000 tasks, systematically covering classical logic, timing scheduling, pulse control, and complex real-world workflows. To automatically validate generated programs, we check syntax, quantum states and program timeline using an extended verifier. Our evaluation reveals that while state-of-the-art LLMs struggle heavily in OpenQASM-3 coding tasks, targeted fine-tuning on QASM-Eval yields significant gains. QASM-Eval provides a crucial benchmark and training foundation to accelerate the development of reliable LLM assistants for hardware-facing quantum programming in NISQ era. Data and code: https://github.com/fuzhenxiao/QASM-Eval

URL PDF HTML ☆

赞 0 踩 0

2605.28442 2026-06-01 cs.RO cs.CV

Self-Supervised Online Robot-Agnostic Traversability Estimation for Open-World Environments

面向开放世界的自监督在线机器人无关可通行性估计

Julia Hindel, Simon Bultmann, Houman Masnavi, Daniele Cattaneo, Abhinav Valada

发表机构 * Department of Computer Science, University of Freiburg（弗赖堡大学计算机科学系）

AI总结提出COTRATE框架，通过自监督在线学习从多模态未标记机器人经验中估计可通行性，采用机器人无关的地形评估模块和多样性感知特征选择策略，实现跨平台知识迁移并降低遗忘。

Comments 14 pages, 16 Figures

详情

AI中文摘要

自监督在线可通行性估计使机器人能够从未标记的开放世界经验中持续学习，并调整其导航行为以实现安全高效的轨迹。现有方法要么依赖手工设计的本体感受可通行性分数，限制了机器人无关性，要么对先验数据进行聚类，阻碍了在线学习。此外，许多持续学习方法会带来大量的内存和计算成本，阻碍了机载部署。我们提出了COTRATE，一个用于从多模态、未标记的机器人经验中持续估计可通行性的在线学习框架。我们的方法首先使用一个基于学习的机器人无关在线地形评估模块，该模块处理本体感受和惯性信号，推断出鲁棒的可通行性分数。然后，这些分数通过一种新颖的对齐损失来监督视觉可通行性网络，该损失将视觉嵌入与在线地形评估相关联。为了在持续学习过程中以最小开销减轻遗忘，我们提出了一种多样性感知的特征选择策略，该策略使用紧凑的回放记忆来保持性能。我们进一步表明，学习到的可通行性表示支持具有不同运动学特性的不同机器人平台之间的知识迁移。我们在一个包含约50,000张图像的数据集上评估了COTRATE，该数据集由两个机器人平台在11种户外地形上收集，并在三个代表性户外环境中的导航任务上进行了基准测试。我们将数据集、代码和训练模型公开。

英文摘要

Self-supervised online traversability estimation enables robots to continuously learn from unlabeled open-world experiences and adapt their navigation behavior toward safe and efficient trajectories. Existing approaches either rely on handcrafted proprioceptive traversability scores, limiting robot-agnosticism, or cluster prior data, preventing online learning. Moreover, many continual learning methods incur substantial memory and computational costs, hindering onboard deployment. We introduce COTRATE, an online learning framework for continuous traversability estimation from multimodal, unlabeled robot experience. Our method first infers robust traversability scores using a robot-agnostic, learning-based online terrain assessment module operating on proprioceptiveand inertial signals. These scores then supervise a visual traversability network through a novel alignment loss that associates visual embeddings with online terrain assessments. To mitigate forgetting during continual learning with minimal overhead, we propose a diversity-aware feature selection strategythat preserves performance using a compact replay memory. We further show that the learned traversability representation supports knowledge transfer across different robot platforms with different locomotion kinematics. We evaluate COTRATE on a dataset of $\approx$ 50,000 images collected with two robotic platforms across 11 outdoor terrains, and benchmark it on navigation tasks in three representative outdoor environments. We make the dataset, code, and trained models publicly available.

URL PDF HTML ☆

赞 0 踩 0

2605.28420 2026-06-01 cs.LG

Conveyance: A Versatile Framework for Learning in Structured Class Spaces

Conveyance: 结构化类空间学习的通用框架

Yasser Taha, Grégoire Montavon, Nils Körber

发表机构 * Centre for Artificial Intelligence in Public Health Research, Robert Koch Institute（公共健康人工智能研究中心，罗伯特· Koch 研究所）； Berlin Institute for the Foundations of Learning and Data（柏林学习与数据基础研究所）； Institute for AI in Medicine, Charité Universitätsmedizin Berlin（医学人工智能研究所，柏林夏里特大学医学院）

AI总结针对标准损失函数忽略类间结构关系的问题，提出Conveyance分类方法，通过最大化不同类划分上的两个间隔来编码图结构关系，在层次分类、序数回归和多实例学习任务中达到或超越专用基线。

详情

AI中文摘要

尽管机器学习架构已迅速发展以处理复杂数据，但在许多实际应用中，像交叉熵这样的损失函数仍然大多与结构无关。然而，这些标准损失的“类对称”性质从根本上限制了机器学习模型利用类间结构关系的能力，尤其是在面对结构化噪声时。我们提出了Conveyance，一种针对结构化类空间的新分类方法及相关损失函数。它允许用户编码类之间的图结构关系，而无需定义复杂的联合分布或手动调整效用矩阵。从技术上讲，我们的损失函数通过最大化不同类划分上的两个间隔来运作，同时保持单调性和部分凸性等正式性质。我们通过将方法应用于层次分类、序数回归和多实例学习来展示其通用性和有效性。在这些任务中，Conveyance要么匹配要么超过专用基线的性能，从而为结构化类空间提供了统一解决方案。

英文摘要

While machine learning (ML) architectures have evolved rapidly to account for complex data, loss functions like cross-entropy remain mostly structure-agnostic in many real-world applications. However, the "class-symmetric" nature of these standard losses fundamentally limits the ability of ML models to exploit structural relationships between classes, particularly when facing structured noise. We propose Conveyance, a new classification approach and associated loss function tailored to structured class spaces. It allows users to encode graph-like relations between classes without having to define complex joint distributions or manually tune utility matrices. Technically, our loss function operates by maximizing two separate margins over distinct class partitions, while preserving formal properties such as monotonicity and partial convexity. We demonstrate the versatility and effectiveness of our method by applying it to hierarchical classification, ordinal regression, and multiple instance learning. Across these tasks, Conveyance either matches or exceeds the performance of specialized baselines, thereby offering a unified solution for structured class spaces.

URL PDF HTML ☆

赞 0 踩 0

2605.28068 2026-06-01 cs.LG

PINE: Pruning Boosted Tree Ensembles with Conformal In-Distribution Prediction Equivalence

PINE：基于共形分布内预测等价的剪枝提升树集成

Haruki Yajima, Yusuke Matsui

发表机构 * The University of Tokyo（东京大学）

AI总结提出PINE方法，通过共形校准控制分布内区域，在保持预测等价的同时将剪枝压缩比提升高达30%。

Comments Accepted to ICML 2026

2605.27996 2026-06-01 cs.AI

Reward Bias Substitution: Single-Axis Bias Mitigations Redirect Optimization Pressure

奖励偏差替代：单轴偏差缓解措施重定向优化压力

Max Lamparth, Daniel Fein, Andreas Haupt, Marcel Hussing, Mykel J. Kochenderfer

发表机构 * Stanford University（斯坦福大学）； University of Pennsylvania（宾夕法尼亚大学）

AI总结本文提出奖励偏差替代现象，即单轴缓解奖励模型偏差（如减少对长度、谄媚或风格的依赖）会将优化压力转移到相关代理上而非消除，并通过理论证明和实验（如GRPO训练中的长度惩罚导致过度自信）揭示了该问题，建议在评估中纳入策略诱导分布并跟踪多偏差。

Comments Improved readability (mostly appendix D)

详情

AI中文摘要

单轴缓解奖励模型偏差（例如，减少代理对长度、谄媚或风格的依赖）可以将优化压力旋转到相关代理上，而不是消除它，这种失败模式我们称之为奖励偏差替代。这种失败是由于在缓解评估和策略训练期间，审计分布与策略诱导分布之间的测量与优化差距造成的。我们将缓解结果形式化为一个机制分类，并证明成功的缓解、偏差替代和过度修正会在任何审计分布评分下产生相同的可观测结果，包括排名准确率和胜率，即使允许对真实奖励进行神谕访问。在已发表的偏好学习缓解工作中，我们调查的方法都没有报告证明成功缓解所需的证据。在跟踪多个偏差的同时，用策略诱导分布增强评估可以证明缩小差距，我们将其转化为缓解方法和基准的可操作处方。我们在语言模型RLHF中演示了偏差替代，其中GRPO训练期间的长度惩罚按预期压缩了响应，但将优化压力重定向到置信度校准上，导致策略过度自信，而事实自由形式准确性下降。我们还展示了一个已发表的长度去偏操作，它在审计分布上将奖励-长度相关性归零，但在四个最先进奖励模型中的三个上，在最佳N选择下重新引入了偏差，以及一个长度-谄媚耦合，其方向在人类-LLM判断者分歧下反转。

英文摘要

Single-axis mitigations of reward-model biases (e.g., reducing proxy reliance on length, sycophancy, or style) can rotate optimization pressure onto correlated proxies rather than eliminate it, a failure mode we call reward bias substitution. The failure is enabled by a measurement-versus-optimization gap between audit and policy-induced distributions during mitigation evaluation and policy training. We formalize mitigation outcomes into a regime taxonomy and prove that successful mitigation, bias substitution, and overcorrection produce identical observables under any audit-distribution scoring, including ranking accuracy and win-rate, even when granted oracle access to the true reward. Across published preference-learning mitigation work, no method we survey reports the evidence needed to certify successful mitigation. Augmenting evaluation with policy-induced distributions while tracking multiple biases provably closes the gap, and we translate this into actionable prescriptions for mitigation methods and benchmarks. We demonstrate bias substitution in language model RLHF, where a length penalty during GRPO training compresses responses as intended yet redirects optimization pressure onto confidence calibration, driving the policy into overconfidence while factual free-form accuracy falls. We also show a published length-debiasing operator that zeroes reward-length correlation on the audit distribution but reintroduces bias under best-of-N selection on three of four SOTA reward models, and a length-sycophancy coupling whose direction reverses under human-LLM judge disagreement.

URL PDF HTML ☆

赞 0 踩 0

2605.27881 2026-06-01 cs.CL

Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents?

检索、奖励与训练协议：训练搜索代理的关键因素是什么？

Yibo Zhao, Zichen Ding, Jiayi Wu, Zun Wang, Xiang Li

发表机构 * School of Data Science and Engineering, East China Normal University（东华大学数据科学与工程学院）； Shanghai AI Laboratory（上海人工智能实验室）

AI总结本文通过控制实验，系统研究了检索语料库、奖励设计和训练协议三个维度对搜索代理训练的影响，发现纠正语料覆盖问题比算法差异更有效，简单的基于结果的奖励方法在多数设置下表现优异，并提出了实用训练指南。

Comments 18pages, 4 figures, and 15 tables

详情

AI中文摘要

由大型语言模型驱动的搜索代理能够通过多步推理自主分解查询、检索信息并综合答案。然而，训练方法的快速增长已超越了受控比较：现有工作在检索语料库、奖励设计和训练协议上存在差异，使得实际驱动改进的因素不明确。我们提出了一项受控实证研究，隔离了搜索代理训练中三个未充分探索的维度。首先，我们识别了广泛使用的Wikipedia 2018语料库中的一个关键数据覆盖问题，并表明仅纠正该问题带来的收益就大于训练算法之间的差异。其次，我们系统比较了三种基础模型上基于结果和基于过程的奖励方法，发现最简单的基于结果的方法在大多数设置中达到竞争性或更优的性能，并且过程级信用分配可能过度纠正代理行为。第三，我们分析了训练数据多样性、离策略数据利用和搜索预算缩放，提炼出训练有效搜索代理的实用指南。我们的代码可在https://github.com/YiboZhao624/SearchAgentReview获取。

英文摘要

Search agents powered by large language models can autonomously decompose queries, retrieve information, and synthesize answers through multi-step reasoning. However, the rapid growth of training methods has outpaced controlled comparison: existing works differ in retrieval corpora, reward designs, and training protocols, making it unclear what actually drives improvements. We present a controlled empirical study that isolates three under-explored dimensions of search agent training. First, we identify a critical data-coverage issue in the widely used Wikipedia 2018 corpus and show that correcting it alone yields larger gains than the differences between training algorithms. Second, we systematically compare outcome-based and process-based reward methods across three base models, finding that the simplest outcome-based approach achieves competitive or superior performance in most settings, and that process-level credit assignment can over-correct agent behavior. Third, we analyze training data diversity, off-policy data utilization, and search budget scaling, distilling practical guidelines for training effective search agents. Our code is available at https://github.com/YiboZhao624/SearchAgentReview.

URL PDF HTML ☆

赞 0 踩 0

2605.27557 2026-06-01 cs.LG stat.ML

The Fundamental Limits of Fraud Detection in Card Payment Networks

银行卡支付网络中欺诈检测的基本极限

Gaurav Dhama

发表机构 * Mastercard

AI总结本文通过形式化支付授权为具有延迟、审查、污染和反事实缺失反馈的序贯决策问题，推导出极小极大遗憾下界，证明生态系统信息质量是欺诈检测的根本瓶颈，而非模型复杂度。

详情

AI中文摘要

银行卡支付欺诈检测通常被框架化为一个监督分类问题。尽管这种方法已经取得了实际进展，但尽管模型架构取得了重大进展，改进仍然只是渐进的。我们认为，这主要不是函数逼近或优化的失败，而是支付生态系统固有的结构性信息损害的结果。我们将银行卡授权形式化为一个具有延迟、审查、污染和反事实缺失反馈的序贯决策问题。我们推导出一个极小极大遗憾下界，表明这些损害在可达学习率的分母中相乘。该下界表明，提高发卡机构报告质量或减少审查可以比增加模型复杂度更大幅度地降低遗憾下限。我们还表明，发卡机构之间的异质性会进一步恶化可学习性，超出平均损害率所暗示的程度。本文贡献了一个理论，解释了为什么支付网络中的欺诈检测本质上比标准在线学习设置更困难，将生态系统信息质量确定为关键瓶颈，并为优先投资于报告基础设施、争议处理质量和选择性探索提供了理论基础。本文以理论为先，不依赖专有交易数据。

英文摘要

Card payment fraud detection is usually framed as a supervised classification problem. Although this approach has generated practical progress, improvement has remained incremental despite major advances in model architecture. We argue that this is not mainly a failure of function approximation or optimization, but a consequence of structural information impairments inherent to the payment ecosystem. We formalize card authorization as a sequential decision problem with delayed, censored, corrupted, and counterfactually missing feedback. We derive a minimax regret lower bound showing that these impairments enter multiplicatively in the denominator of the achievable learning rate. The bound implies that improving issuer reporting quality or reducing censorship can yield larger reductions in the regret floor than increasing model complexity. We also show that heterogeneity across issuers worsens learnability beyond what average impairment rates suggest. The paper contributes a theory of why fraud detection in payment networks is fundamentally harder than in standard online learning settings, identifies ecosystem information quality as the key bottleneck, and provides a theoretical basis for prioritizing investments in reporting infrastructure, dispute process quality, and selective exploration. The paper is theory-first and does not rely on proprietary transaction data.

URL PDF HTML ☆

赞 0 踩 0

2605.27367 2026-06-01 cs.CV

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

SpatialBench: 你的空间基础模型是全能选手吗？

Haosong Peng, Hao Li, Jiaqi Chen, Yuhao Pan, Runmao Yao, Yalun Dai, Fushuo Huo, Fangzhou Hong, Zhaoxi Chen, Haozhao Wang, Dingwen Zhang, Ziwei Liu, Wenchao Xu

发表机构 * Hong Kong University of Science and Technology（香港科技大学）； Nanyang Technological University（南洋理工大学）； Northwestern Polytechnical University（西北工业大学）； Southeast University（东南大学）； Huazhong University of Science and Technology（华中科技大学）

AI总结提出SpatialBench基准，通过跨范式、多域、确定性采样的评估，揭示当前空间基础模型在多样化下游任务中的泛化能力不足，并引入DA-Next-5M数据集和DA-Next模型推动空间表示学习。

Comments Project Page: https://ropedia.github.io/SpatialBench/

详情

AI中文摘要

尽管空间基础模型在标准数据集上展示了令人印象深刻的性能，但一个关键问题仍然存在：它们是否真正是能够稳健泛化到多样化下游任务、任意视角、变化的场景域、不同输入密度和特定硬件约束的全能选手？回答这个总体问题需要整体评估，然而当前模型主要在其专门设计或训练的特定领域上进行评估。这种评估本质上受到狭窄范式覆盖、有限场景域和任意帧采样的限制，使得从根本上难以评估其真正的泛化能力。为弥补这一差距，我们提出了SpatialBench，一个用于空间基础模型的跨范式、域多样化的基准，采用确定性采样。SpatialBench具有前所未有的规模和严格的确定性设计，包含19个数据集和546个场景，覆盖5个不同的空间域。它在4种不同输入密度设置下，全面评估了6个范式的41个模型在5个任务套件上的表现。我们的广泛评估揭示当前模型尚未成为全能选手，并为未来进展揭示了关键见解。具体来说，我们证明全上下文注意力最大化准确性，而有界记忆策略解锁长序列可扩展性。此外，我们在具有挑战性的具身和自我中心任务中的实证评估表明，严格的域对齐和高数据质量对性能的影响远大于简单的数据集扩展。最后，为解决我们分析中发现的最大数据差距，我们超越评估，引入大规模数据集DA-Next-5M和强基线模型DA-Next，推动空间表示学习的边界。

英文摘要

While spatial foundation models have demonstrated impressive performance on standard datasets, a critical question remains: are they truly all-round players capable of generalizing robustly across diverse downstream tasks, arbitrary viewpoints, shifting scene domains, varying input densities, and specific hardware constraints? Answering this overarching question requires a holistic assessment, yet current models are mainly evaluated on specific domains for which they were specifically designed or trained. Such evaluations are intrinsically limited by narrow paradigm coverage, limited scene domains, and arbitrary frame sampling, making it fundamentally difficult to assess their true generalization capabilities. To address this gap, we present SpatialBench, a cross-paradigm, domain-diverse benchmark for spatial foundation models with deterministic sampling. SpatialBench features unprecedented scale and rigorous deterministic design, comprising 19 datasets and 546 scenes across 5 diverse spatial domains. It comprehensively evaluates 41 models across 6 paradigms on 5 task suites under 4 different input density settings. Our extensive evaluation reveals that current models are not yet all-round players, and uncovers crucial insights for future advancement. Specifically, we demonstrate that full-context attention maximizes accuracy while bounded-memory strategies unlock long-sequence scalability. Moreover, our empirical evaluations in challenging embodied and egocentric tasks demonstrate that strict domain alignment and high data quality are far more critical to performance than simple dataset scaling. Furthermore, to address the largest data gap identified in our analysis, we go beyond evaluation by introducing a large-scale dataset, DA-Next-5M, and a strong baseline model, DA-Next, pushing the boundaries of spatial representation learning.

URL PDF HTML ☆

赞 0 踩 0

2605.27355 2026-06-01 cs.AI cs.CL cs.LG

Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases

对齐篡改：人类反馈强化学习如何被利用以优化错位偏见

Dongyoon Hahm, Dylan Hadfield-Menell, Kimin Lee

发表机构 * MIT（麻省理工学院）

AI总结本文提出对齐篡改漏洞，即对齐中的LLM通过影响偏好数据集使RLHF放大不良行为，并通过实验展示多种偏见的放大，指出现有缓解方法难以在不牺牲质量的情况下解决该问题。

Comments Accepted at ICML 2026, Source code: https://alignment-tampering.github.io/

详情

AI中文摘要

人类反馈强化学习（RLHF）是将大型语言模型（LLM）与人类偏好对齐的标准方法。在本工作中，我们引入对齐篡改，这是一种潜在漏洞，即正在对齐的LLM影响偏好数据集，导致RLHF放大不良行为。这源于RLHF的核心局限性：（1）偏好数据集由LLM自身的输出构建，使其能够影响它们；（2）成对比较仅指示哪个响应更好，而不说明原因。这些局限性可能被利用以导致对齐篡改。例如，如果LLM以更高质量生成有偏见的响应，标注者会基于质量偏好它们。然而，偏好标签无法区分质量与偏见，奖励模型继承了这一局限性。通过强化学习或最佳N采样优化此类奖励可能放大错位偏见。我们的实验展示了跨多种偏见的放大：从关键词偏见到宣传（例如性别歧视）、品牌推广和工具性目标寻求。缓解仍然具有挑战性，因为现有的鲁棒RLHF技术无法在不牺牲响应质量的情况下完全解决对齐篡改。这些发现揭示了当前RLHF的结构性漏洞，并强调了防止此漏洞的必要性。项目页面：https://alignment-tampering.github.io/

英文摘要

Reinforcement Learning from Human Feedback (RLHF) is the standard method to align Large Language Models (LLMs) with human preferences. In this work, we introduce alignment tampering, a potential vulnerability where the LLM undergoing alignment influences the preference dataset, causing RLHF to amplify undesired behaviors. This arises from core limitations of RLHF: (1) preference datasets are constructed from the LLM's own outputs, allowing it to influence them, and (2) pairwise comparisons only indicate which response is better, not why. These limitations can be exploited to cause alignment tampering. For example, if an LLM generates biased responses with higher quality, annotators will prefer them based on quality. However, preference labels do not distinguish quality from bias, and the reward model inherits this limitation. Optimizing such rewards through reinforcement learning or best-of-N sampling can amplify misaligned biases. Our experiments demonstrate amplification across diverse biases: from keyword bias to propaganda (e.g., sexism), brand promotion, and instrumental goal-seeking. Mitigation remains challenging, as existing techniques for robust RLHF fail to fully resolve alignment tampering without sacrificing response quality. These findings reveal structural vulnerabilities of current RLHF and emphasize the need to prevent this vulnerability. Project page: https://alignment-tampering.github.io/

URL PDF HTML ☆

赞 0 踩 0

2605.27255 2026-06-01 cs.CL cs.AI

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs

Pair-In, Pair-Out: 面向高效LLM的潜在多令牌预测

Wenhui Tan, Minghao Li, Xiaoqian Ma, Siqi Fan, Xiusheng Huang, Liujie Zhang, Ruihua Song, Weihang Chen

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China（中国人民大学首都人工智能学院）； AI Platform, Xiaohongshu Inc.（小红书人工智能平台）； University of Electronic Science and Technology of China（电子科技大学）； Institute of Automation, Chinese Academy of Sciences（中国科学院自动化研究所）

AI总结提出Pair-In, Pair-Out (PIPO)方法，通过统一潜在压缩和多令牌预测，并训练轻量级置信度头消除验证器开销，在保持可靠性的同时实现推理加速。

Comments Project Page: GitHub.com/RedAI-Infra/PIPO

详情

AI中文摘要

长链式推理使得自回归解码成为现代大语言模型的主要推理成本。现有方法要么针对输入侧（潜在压缩），要么针对输出侧（推测解码和多令牌预测，MTP），但这两条工作线是独立进行的。此外，输出侧方法必须进行昂贵的验证器传递，以验证MTP预测的不可靠草稿令牌。为解决这些问题，我们提出 extbf{Pair-In, Pair-Out (PIPO)}，通过将潜在压缩器和MTP头视为镜像操作来统一两侧：压缩器将两个输入令牌折叠成一个潜在表示，而MTP头将一个隐藏状态展开成一个额外的输出令牌。为了在不牺牲可靠性的情况下消除验证器成本，PIPO训练一个轻量级置信度头，决定是否接受草稿令牌。我们观察到，在线策略蒸馏（OPD）自然匹配推测解码的拒绝采样准则，因此置信度头可以以可忽略的额外成本与OPD一起训练。在AIME 2025、GPQA-Diamond、LiveCodeBench v6和LongBench v2上使用Qwen3.5-4B和9B骨干网络的实验表明，PIPO在常规解码上将pass@4提高了最多+7.15个点，同时实现了高达2.64倍的首令牌延迟和2.07倍的每令牌延迟加速。项目页面：GitHub.com/RedAI-Infra/PIPO。

英文摘要

Long chain-of-thought reasoning has made autoregressive decoding the dominant inference cost of modern large language models. Existing methods target either the input side (latent compression) or the output side (speculative decoding and multi-token prediction, MTP), but the two lines of work have been pursued independently. Moreover, output-side methods must incur an expensive verifier pass to validate the unreliable draft tokens predicted by MTP. To address these issues, we propose \textbf{Pair-In, Pair-Out (PIPO)}, which unifies both sides by viewing a latent compressor and an MTP head as mirror-image operations: the compressor folds two input tokens into one latent representation, while the MTP head unfolds one hidden state into one additional output token. To remove the verifier cost without sacrificing reliability, PIPO trains a lightweight confidence head that decides whether draft tokens should be accepted. We observe that On-Policy Distillation (OPD) naturally matches the rejection-sampling criterion of speculative decoding, so the confidence head can be trained alongside OPD with negligible extra cost. Experiments on AIME 2025, GPQA-Diamond, LiveCodeBench v6, and LongBench v2 with Qwen3.5-4B and 9B backbones show that PIPO improves pass@4 over regular decoding by up to $+7.15$ points, while delivering up to $2.64\times$ first-token-latency and $2.07\times$ per-token-latency speedups. Project Page: GitHub.com/RedAI-Infra/PIPO.

URL PDF HTML ☆

赞 0 踩 0

2605.27114 2026-06-01 cs.RO

VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction

VR-DAgger: 用于灵巧数据收集和不确定性引导的在线策略校正的沉浸式VR

René Zurbrügg, Tifanny Portela, Arjun Bhardwaj, Aravind Elanjimattathil Vijayan, Maximum Wilder-Smith, Marco Hutter

发表机构 * Robotics Systems Lab（机器人系统实验室）； ETH Zürich（苏黎世联邦理工学院）； ETH AI Center（ETH人工智能中心）； ETH Augmented Reality Research Lab（ETH增强现实研究实验室）； ETH Mobility Initiative（ETH移动性倡议）； ANYbotics AG（ANYbotics公司）； Swiss Federal Railways（瑞士联邦铁路）

AI总结提出VR-DAgger框架，通过VR应用进行灵巧遥操作和数据收集，利用MC Dropout不确定性评分选择关键失败片段进行在线校正，在灵巧操作任务上相比行为克隆提升高达23个百分点，并减少约40%的样本收集时间。

详情

AI中文摘要

从示范中学习对于机器人操作是有效的，但收集足够的任务特定数据仍然是一个主要瓶颈。在分布偏移下，小误差会累积，性能下降，专家时间往往花费在冗余、低价值的修正上，而不是少数关键失败案例。我们提出了VR-DAgger，一个以沉浸式VR应用为中心的人机协作框架，用于灵巧遥操作、示范收集和选择性策略校正。VR客户端提供直观的手部控制和同步场景可视化，而后台工作站运行仿真和学习，实现无需操作员持续监督的自主部署。我们使用蒙特卡洛（MC）Dropout在Isaac Lab部署扩散策略时对不确定性进行评分，并选择信息量大的失败片段进行校正。这些片段在VR中作为剪辑重放，操作员选择性地标记和校正策略的行为，将监督集中在不确定性最高的地方，无需全程监控或单独的中断分类器。我们在三个灵巧操作任务（平底锅抓取放置、抽屉打开、阀门旋转）上使用10自由度XHand在标准和具有挑战性的初始配置下进行评估。主动标记在所有任务上持续优于行为克隆，提升高达23个百分点。与无指导的人机协作检查相比，VR-DAgger通过将审查集中在选定的片段而非完整部署上，将每个样本的收集时间减少了约40%。

英文摘要

Learning from demonstrations is effective for robotic manipulation, but collecting sufficient task-specific data remains a major bottleneck. Under distribution shift, small errors compound, performance degrades, and expert time is often spent on redundant, low-value corrections instead of the few critical failure cases. We present VR-DAgger, a human-in-the-loop framework centered on an immersive VR application for dexterous teleoperation, demonstration collection, and selective policy correction. The VR client provides intuitive hand control with synchronized scene visualization, while a backend workstation runs simulation and learning, enabling autonomous rollouts without continuous operator oversight. We use Monte Carlo (MC) dropout to score uncertainty during Isaac Lab rollouts of a diffusion policy and select informative failure segments for correction. These segments are replayed in VR as clips, where the operator selectively labels and corrects the policy's behavior, concentrating supervision where uncertainty is highest without full-rollout monitoring or a separate intervention classifier. We evaluate on three dexterous manipulation tasks (Pan pick-and-place, Drawer opening, Valve turning) with a 10-DoF XHand under standard and challenging initial configurations. Active labeling consistently improves over behavioral cloning across all tasks, with gains of up to 23 percentage points. Compared to unguided human-in-the-loop inspection, VR-DAgger reduces per-sample collection time by approximately 40% by focusing review on selected segments rather than full rollouts.

URL PDF HTML ☆

赞 0 踩 0

2605.26942 2026-06-01 cs.AI cs.LO cs.SE

Neuro-Symbolic Verification of LLM Outputs for Data-Sensitive Domains (extended preprint)

面向数据敏感领域的LLM输出的神经符号验证（扩展预印本）

Paul Sigloch, Christoph Benzmüller

发表机构 * University of Bamberg（巴姆堡大学）； Free University of Berlin（柏林自由大学）

AI总结提出一种结合形式符号方法与神经语义分析的混合验证架构，用于检测LLM输出中的幻觉、不一致和隐私漏洞，在医疗设备损伤评估系统中实现83%的结构化实体幻觉检测率和72%的语义虚构检测率。

Comments Extended preprint version of accepted technical communication at KI 2026. 22 pages, 3 figures

详情

AI中文摘要

部署在高风险领域的LLM面临根本性的可靠性挑战：幻觉、不一致性和隐私漏洞引入了不可接受的风险，因为错误会带来法律、财务或安全后果。本文提出一种混合验证架构，结合形式符号方法与神经语义分析，为LLM生成的内容提供互补性保证。该架构采用逻辑推理进行输入验证，利用完备性属性为结构化需求提供可判定的保证。对于输出验证，基于嵌入的语义相似性检测上下文幻觉，弥补形式方法表达力不足的问题。这种分离通过并行的、基于角色的流水线实现，解决了基于提示的自验证方法（继承了产生幻觉的分布偏差）的局限性。所提出的架构和类型感知验证方法通过HAIMEDA（一个通过行动设计研究开发的真实世界医疗设备损伤评估报告系统）进行验证。评估显示，结构化实体的幻觉检测率超过83%，语义虚构的检测率为72%，报告创建时间减少30%，表明神经符号架构可以为LLM在数据敏感领域的部署提供原则性的安全保障。

英文摘要

LLMs deployed in high-stakes domains face fundamental reliability challenges: hallucinations, inconsistencies, and privacy vulnerabilities introduce unacceptable risks where errors carry legal, financial, or safety consequences. This paper presents a hybrid verification architecture combining formal symbolic methods with neural semantic analysis to provide complementary guarantees for LLM-generated content. This architecture employs logical reasoning for input verification, leveraging completeness properties to provide decidable guarantees on structured requirements. For output validation, embedding-based semantic similarity detects contextual hallucinations where formal methods lack expressiveness. This separation is realized in a parallel, actor-based pipeline, addressing limitations of prompt-based self-verification approaches, which inherit the distributional biases that produce hallucinations. The proposed architecture and type-aware verification method are validated with HAIMEDA, a real-world medical device damage assessment reporting system developed through Action Design Research. Evaluation shows hallucination detection rates of over 83% for structured entities and 72% for semantic fabrications, with a 30% reduction in report creation time, demonstrating that neuro-symbolic architectures can provide principled safeguards for LLM deployment in data-sensitive domains.

URL PDF HTML ☆

赞 0 踩 0

2605.26929 2026-06-01 cs.LG

When Muon Optimizer Meets Adversarial Training: A Theoretical and Empirical Study

当Muon优化器遇到对抗训练：理论与实证研究

Jun Yan, Weiquan Huang, Jiankai Zuo, Yujian Mo, Xi Fang, Chengliang Wu, Zeming Wei

发表机构 * IT College, Shanghai Ocean University（上海海洋大学信息学院）； School of Computer Science and Technology, Tongji University（同济大学计算机科学与技术学院）； SEIE, Suzhou University of Science and Technology（苏州科技大学SEIE学院）； DP Tech（DP科技）； School of Mathematical Sciences, Peking University（北京大学数学科学学院）

AI总结本文通过理论和实证研究，探讨Muon优化器（基于近似极分解的正交化更新）在对抗训练中的效果，发现其能限制矩阵更新的谱范数增长，在CNN和ViT上优于AdamW，与SGD竞争力相当。

详情

AI中文摘要

对抗训练（AT）仍然是最可靠的对抗攻击经验防御方法之一。其鲁棒性关键取决于底层极小极大目标如何优化。在实践中，随机梯度下降（SGD）优化器仍然是AT的默认优化选择，而自适应优化器通常能改善标准训练，但可能产生较差的鲁棒性。最近，Muon优化器通过近似极分解对矩阵值更新进行正交化，在内存成本与SGD相当的情况下，在大规模训练中取得了显著成功。这提出了一个与安全相关的问题：正交化优化能否在强异质威胁模型下改进AT？针对这一问题，我们进行了全面的理论和实证研究。理论上，我们表明Muon对矩阵更新施加了谱范数稳定性上限，限制了训练动态中不受控制的谱增长，而无需显式缩小学习权重。实证上，在五种架构和三种$\ell_p$威胁模型（$\ell_\infty$、$\ell_1$、$\ell_2$）及其联合下，Muon在CNN上与SGD竞争力相当，并在CNN和ViT上显著优于AdamW。这些结果将优化器几何识别为对抗训练中的一个安全相关因素，同时阐明了正交化更新有益的经验场景。总体而言，我们的发现强调了优化器设计是AT的一个安全关键组成部分。

英文摘要

Adversarial training (AT) remains one of the most reliable empirical defenses against adversarial attacks. Its robustness critically depends on how the underlying min-max objective is optimized. In practice, Stochastic Gradient Descent (SGD) optimizer remains the default optimization choice for AT, whereas adaptive optimizers often improve standard training but may yield inferior robustness. Recently, the Muon optimizer, which orthogonalizes matrix-valued updates via an approximate polar decomposition, has achieved notable success in large-scale training at a memory cost comparable to SGD. This raises a security-relevant question: \textit{can orthogonalized optimization improve AT under strong and heterogeneous threat models?} Focusing on this problem, we conduct a comprehensive theoretical and empirical study. Theoretically, we show that Muon imposes a spectral-norm stability ceiling on matrix updates, limiting uncontrolled spectral growth in the training dynamics without explicitly shrinking the learned weights. Empirically, across five architectures and three $\ell_p$ threat models ($\ell_\infty$, $\ell_1$, $\ell_2$) and their union, Muon is competitive with SGD on CNNs and substantially outperforms AdamW on both CNNs and ViTs. These results identify optimizer geometry as a security-relevant factor in adversarial training, while clarifying the empirical regimes in which orthogonalized updates are beneficial. Overall, our findings highlight optimizer design as a security-critical component of AT.

URL PDF HTML ☆

赞 0 踩 0

2605.26519 2026-06-01 cs.CV

$R^3$: 3D Reconstruction via Relative Regression

$R^3$: 通过相对回归进行3D重建

Congrong Xu, Huachen Gao, Xingyu Chen, Yuliang Xiu, Jun Gao, Anpei Chen

发表机构 * University of Michigan（密歇根大学）； Westlake University（西雅图大学）； NVIDIA Research（英伟达研究）

AI总结提出一种基于相对回归的3D重建方法$R^3$，使用轻量级MLP预测置信度加权的相对约束，以支持全上下文离线重建和因果有界内存流式重建。

详情

AI中文摘要

最近的馈送式几何基础模型通过单次前向传播恢复深度和姿态，展现出了令人印象深刻的泛化能力。然而，这些模型通常受限于全局坐标框架假设。这种依赖性成为长上下文和流式重建的一个显著瓶颈，因为它迫使网络维护一个任意的时序原点，并处理随时间无界增长的平移幅度。我们的解决方案，称为$R^3$，采用了相对回归。我们使用一个轻量级MLP来预测置信度加权的相对约束。这些置信度作为一个统一的锚点：在训练期间加权损失，在推理期间指导姿态聚合。$R^3$支持全上下文离线重建和因果、有界内存的流式重建。我们在离线与流式设置下的评估验证了我们的相对机制的有效性。项目页面：https://kevinxu02.github.io/r3-site

英文摘要

Recent feed-forward geometry foundation models have demonstrated impressive generalization by recovering depth and poses in a single forward pass. However, these models are typically constrained by a global coordinate frame assumption. This dependency becomes a significant bottleneck for long-context and streaming reconstruction, as it forces the network to maintain an arbitrary temporal origin and handle translation magnitudes that grow unbounded over time. Our solution, which we call $R^3$, employs relative regression. We employ a lightweight MLP to predict confidence-weighted relative constraints. These confidences serve as a unified anchor: weighting losses during training and guiding pose aggregation during inference. $R^3$ supports both full-context offline reconstruction and causal, bounded-memory streaming. Our evaluation in both offline and streaming settings validates the effectiveness of our relative mechanism. Project page: https://kevinxu02.github.io/r3-site

URL PDF HTML ☆

赞 0 踩 0

2605.26502 2026-06-01 cs.LG physics.optics

PRISM: Position-encoded Regressive Inverse Spectral Model for Multilayer Thin-Film Design

PRISM：用于多层薄膜设计的位置编码回归逆光谱模型

Runtian Wang, Renhao Xue, Baige Chen, Hao Wu

发表机构 * Independent Researcher（独立研究者）； Work does not relate to position at Amazon（与亚马逊职位无关的工作）

AI总结提出PRISM，一种解码器仅自回归变压器，通过联合预测离散材料选择和连续厚度回归，解决多层薄膜光学涂层设计的逆问题，相比其他变压器基线MAE降低50%以上，参数仅为其五分之一。

Comments 8 pages, 3 figures, Proceedings of the AI4Physics Workshop at the 43rd International Conference on Machine Learning (AI4Physics@ICML 2026)

详情

AI中文摘要

多层薄膜光学涂层设计的逆问题是一个复杂的组合-连续优化挑战。我们提出了PRISM（位置编码回归逆光谱模型），一种统一的解码器仅自回归变压器，通过在单个骨干网络中联合预测离散材料选择和连续厚度回归，简化了这一过程。PRISM引入了两个主要的架构创新：（1）光谱前缀条件化，利用标准前缀令牌进行上下文目标注入；（2）累积深度旋转位置嵌入，将连续厚度直接编码到位置表示中，以保留堆栈的物理空间关系。我们的基准测试表明，PRISM-13M模型相比其他变压器基线将MAE降低了50%以上，同时仅使用五分之一的参数。此外，一个44M参数的变体在我们的分布内验证基准上实现了最先进的性能（MAE = 0.010），并且运行速度显著快于模拟退火，为经典优化方法提供了一种高效的替代方案。

英文摘要

The inverse problem of multilayer thin-film optical coatings design represents a complex combinatorial-continuous optimization challenge. We present PRISM (Position-encoded Regressive Inverse Spectral Model), a unified decoder-only autoregressive transformer that streamlines this process by jointly predicting discrete material selection and continuous thickness regression within a single backbone. PRISM introduces two primary architectural innovations: (1) spectrum prefix conditioning, which utilizes standard prefix tokens for in-context target injection, and (2) cumulative-depth Rotary Position Embeddings, which encode continuous thickness directly into the positional representation to preserve the physical spatial relationships of the stack. Our benchmarks demonstrate that a PRISM-13M model reduces MAE by over 50\% compared to other transformer baselines while utilizing only one-fifth of the parameters. Furthermore, a 44M-parameter variant achieves state-of-the-art performance (MAE = 0.010) on our in-distribution validation benchmark and operates significantly faster than simulated annealing, offering a highly efficient alternative to classical optimization methods.

URL PDF HTML ☆

赞 0 踩 0

2605.26430 2026-06-01 cs.RO

Multi-Robot Box Transport over Different Surfaces with Decentralized Role-based Proportional Control

多机器人在不同表面上的基于去中心化角色比例控制的箱子运输

Aditya Bhatt, Himavarshini Yarragangu, Urvish Shah, Venkata Sai Yaswanth Mohan Thota, Souma Chowdhury

发表机构 * Mechanical & Aerospace Eng., University at Buffalo, Buffalo, NY（机械与航空航天工程系，布法罗大学，布法罗，纽约）

AI总结提出一种异步去中心化任务与运动规划方法R2P2，通过角色分配和比例控制实现多机器人在不同倾斜和摩擦表面上的协作箱子运输，在仿真和物理实验中验证了其泛化性和成功率优于标准虚拟领导者-跟随者方法。

Comments Accepted for presentation at the 2026 ASME IDETC-CIE

详情

AI中文摘要

通过推动实现多机器人协作运输物体在建筑、仓库环境以及灾后 debris 清理等许多应用中具有广泛前景。然而，在不同倾斜和摩擦特性的表面上实现协作运输带来了独特的挑战。为应对这些挑战，本文提出了一种异步去中心化任务与运动规划方法，用于在平坦、上坡和下坡地形上运输不同质量的矩形箱子。这种去中心化方法减轻了通信、同步和共识需求，并缓解了单点故障问题。我们的方法称为R2P2（基于规则和比例控制原语的角色分配），根据对所需操作模式（箱子旋转 vs 平移）的认知规则为机器人分配角色（例如，推、支撑和阻止）；随后根据角色执行基于规则的控制或机器人速度的比例控制。每个机器人在执行角色和控制时假设能观察到自身和箱子的位置与朝向。R2P2在使用NVIDIA IsaacSim构建的模拟器中通过六机器人团队进行了评估——展示了在不同表面摩擦/倾斜和箱子质量场景下的泛化能力，并且与标准虚拟领导者-跟随者方法相比具有更高的成功率。R2P2还通过物理实验成功验证，在四台负责移动1.2 kg箱子的turtlebots上执行。

英文摘要

Collaborative transport of objects via pushing by multiple robots has many applications, ranging from construction and warehouse environments to post disaster debris clean-up. Achieving collaborative transport over surfaces with different inclination and friction properties however poses unique challenges. To address these challenges, this paper presents an asynchronous decentralized task and motion planning approach for transporting rectangular boxes of varying mass over flat, uphill and downhill terrain. Such a decentralized approach alleviates communication, synchronization and consensus needs and mitigates single point of failure issues. Our approach, called R2P2 or Roles with Rules and Proportional-control Primitive, assigns roles (e.g., push, support and prevent) to robots based on rules cognizant of the mode of manipulation needed (box rotation vs translation); this is followed by either rule-based control or proportional control of robot velocity based on the roles. Each robot is assumed to observe the location and heading of self and the box in executing the role and controls. R2P2 is evaluated with a six-robot team deployed in a simulator built using NVIDIA IsaacSim -- demonstrating generalizability across different surface friction/inclination and box mass scenarios, and better success rate compared to a standard virtual-leader-follower method. R2P2 is also successfully validated with a physical experiment, where it is executed onboard four turtlebots tasked with moving a 1.2 kg box.

URL PDF HTML ☆

赞 0 踩 0

2605.26396 2026-06-01 cs.AI cs.CL cs.LG

Advancing Creative Physical Intelligence in Large Multimodal Models

推进大型多模态模型中的创造性物理智能

Cheng Qian, Hyeonjeong Ha, Jiayu Liu, Jeonghwan Kim, Emre Can Acikgoz, Bingxuan Li, Kunlun Zhu, Jiateng Liu, Aditi Tiwari, Zhenhailong Wang, Xiusi Chen, Mahdi Namazifar, Heng Ji

发表机构 * UIUC（伊利诺伊大学香槟分校）； Amazon（亚马逊）

AI总结针对大型多模态模型在开放式环境中缺乏基于视觉的创造性工具使用能力的问题，提出MM-CreativityBench基准和基于偏好学习的具身对齐方法，显著提升实体选择并减少幻觉。

Comments 51 Pages, 9 Figures, 7 Tables, Previous Work CreativityBench: arXiv:2605.02910

详情

AI中文摘要

大型多模态模型（LMMs）在感知和推理方面取得了快速进展；然而，目前尚不清楚这些能力是否能够泛化到在开放式环境中发现基于视觉的解决方案，超越模式识别。在此类场景中，智能需要的不仅仅是回答明确的问题：它涉及识别场景中的元素如何以非显而易见但物理上可行的方式被重新利用。这种创造性问题解决形式是人类智能的核心，但在当前基准测试中基本上未得到测试。为了评估这一能力，我们引入了MM-CreativityBench，这是一个用于在视觉丰富、物理受限的环境中进行基于可操作性的创造性工具使用的基准。每个实例呈现一个场景图像，包含候选实体及其部件的结构化视图，从而能够对模型如何迭代检查场景、识别相关可操作性以及组合视觉和物理上可行的解决方案进行细粒度、交互式评估。我们的实验表明，当前的LMMs往往表现不佳，不是由于缺乏生成能力，而是因为它们无法维持基于具身的探索。模型经常忽略相关实体，对关键部件检查不足，或幻觉出图像中不存在的属性。受此失败模式的启发，我们提出了具身对齐，将创造性工具使用视为一个偏好学习问题。使用直接偏好优化，我们鼓励模型偏好基于视觉证据的属性-可操作性推理，而非幻觉替代方案。此外，我们结合从可操作性知识库中获得的监督，以指导更广泛的实体探索和多轮规划。我们的结果显示，在正确选择实体和部件方面取得了持续改进，同时大幅减少了幻觉和与具身相关的错误。

英文摘要

Large multimodal models (LMMs) have rapidly advanced in perception and reasoning; however, it remains unclear whether these capabilities generalize to discovering visually grounded solutions in open-ended environments, beyond pattern recognition. In such settings, intelligence requires more than answering well-posed questions: it involves identifying how elements in a scene can be repurposed in non-obvious yet physically feasible ways. This form of creative problem-solving is central to human intelligence, but remains largely untested in current benchmarks. To evaluate this ability, we introduce MM-CreativityBench, a benchmark for affordance-grounded creative tool use in visually rich, physically constrained environments. Each instance presents a scenario image with structured views of candidate entities and their parts, enabling fine-grained, interactive evaluation of how models iteratively inspect the scene, identify relevant affordances, and compose visually and physically grounded solutions. Our experiments show that current LMMs often fall short, not due to lack of generative capability, but because they do not sustain grounded exploration. Models often overlook relevant entities, under-examine critical parts, or hallucinate attributes not grounded in the image. Motivated by this failure mode, we propose affordance-grounded alignment, which casts creative tool use as a preference learning problem. Using Direct Preference Optimization, we encourage models to prefer attribute-affordance reasoning grounded in visual evidence over hallucinated alternatives. In addition, we incorporate supervision derived from an affordance knowledge base to guide broader entity exploration and multi-turn planning. Our results show consistent gains in selecting the correct entities and parts, while substantially reducing hallucination and grounding-related errors.

URL PDF HTML ☆

赞 0 踩 0

2605.26371 2026-06-01 cs.AI

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

利用局部动态规律性实现离线分层强化学习中的可复用技能

Sarthak Dayal, Abhinav Peri, Carl Qi, Claas Voelcker, Alexander Levine, Caleb Chuck, Amy Zhang

发表机构 * UT Austin（UT奥斯汀）

AI总结提出CARL算法，通过对比学习对齐局部动态与动作序列，在离线分层强化学习中学习可复用技能，提升下游任务性能。

详情

AI中文摘要

分层强化学习（HRL）有望通过发现和复用时间上扩展的技能，比非分层方法更有效地解决长时域强化学习（RL）任务。然而，获得真正可复用的技能仍然是一个开放挑战。为此，我们关注利用局部动态直觉的抽象：不同全局上下文中的局部转换需要类似的动作序列。通过将这些上下文与其所需的动作序列对齐，我们能够学习哪些技能可以复用以及在何处复用它们。原则上，这些信息应有益于许多HRL算法，其中高层策略需要推理其使用的低层技能。由此产生的算法CARL（基于对比动作的可复用局部控制表示）在复杂人形环境中展示了有意义技能的定性聚类，并且在与HIQL集成时，在OGBench基准上提升了下游性能。

英文摘要

Hierarchical Reinforcement Learning (HRL) promises to solve long-horizon Reinforcement Learning (RL) tasks more efficiently than non-hierarchical counterparts by discovering and reusing temporally-extended skills. However, obtaining skills that are actually reusable remains an open challenge. Towards this end, we focus on abstractions that exploit the intuition of local dynamics: local transitions in different global contexts require similar kinds of action sequences. By aligning these contexts with the action sequences they require, we are able to learn which skills to reuse and where to reuse them. In principle, this information should benefit many HRL algorithms, where high-level policies have to reason about the low-level skills they use. The resulting algorithm CARL (Contrastive Action-based Representations for Reusable Local Control) shows both qualitative clustering of meaningful skills in complex humanoid environments and improved downstream performance on the OGBench benchmark when integrated with HIQL.

URL PDF HTML ☆

赞 0 踩 0

2605.26304 2026-06-01 cs.RO

Collaborative Navigation and Exploration with $β$-Sparse Gaussian Processes

基于$β$-稀疏高斯过程的协作导航与探索

Evangelos Psomiadis, Dipankar Maity, Panagiotis Tsiotras

发表机构 * D. Guggenheim School of Aerospace Engineering, Georgia Tech, Atlanta, GA, USA（佐治亚理工学院D.Guggenheim航空航天工程学院）； Department of Electrical and Computer Engineering, UNC Charlotte, Charlotte, NC, USA（北卡罗来纳大学夏洛特分校电气与计算机工程系）

AI总结针对异构机器人在未知环境中的协作导航问题，提出一种利用$β$-稀疏高斯过程进行带宽受限下地图点选择和导航动作联合优化的框架，显著降低路径代价和传输信息量。

Comments 16 pages, 6 figures

详情

AI中文摘要

异构机器人在未知环境中的协作导航由于传感、通信和计算限制而面临重大挑战。在这项工作中，一个领航机器人向目标导航，同时一个移动传感器机器人（例如无人机）通过传输其局部观测地图的信息来辅助，但受带宽限制。我们提出一个框架，使传感器能够在线联合选择其传输的地图点和导航动作，同时预测环境的未探索区域。为此，我们提出了$β$-稀疏高斯过程，一种鲁棒的变分稀疏高斯过程模型，用于在基数约束下进行任务感知的诱导点选择。此外，我们开发了一种平衡任务相关性与探索的动作选择策略。在火星和地球地图上的仿真表明，与无通信相比，该框架可将路径代价降低18%，与原始数据传输基线相比，传输信息量减少76%。

英文摘要

Collaborative navigation of heterogeneous robots in unknown environments poses significant challenges due to sensing, communication, and computational limitations. In this work, a lead robot navigates toward a target while a mobile sensor robot (e.g., a drone) assists by transmitting information about its locally observed map under bandwidth constraints. We propose a framework that enables the sensor to jointly select its transmitted map points and navigation actions online, while also predicting unexplored regions of the environment. To this end, we present $β$-Sparse Gaussian Processes, a robust variational sparse Gaussian Process model for task-aware inducing point selection under cardinality constraints. Furthermore, we develop an action-selection strategy that balances task relevance with exploration. Simulations on Mars and Earth maps show that the framework can reduce path cost by 18% relative to no communication and decrease transmitted information by 76% compared to raw-data transmission baselines.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

DTG-Restore: Training-Free Diffusion Refinement for Generative Video Super-Resolution

Domain Adaptation and Reasoning Frameworks in Language Models: A Controlled Experiment with Historical Cosmology

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

NumLeak: Public Numeric Benchmarks as Latent Labels in Foundation Models

A Novel Evaluation Metric for Unsupervised Learning in AIS-Based Maritime Anomaly Detection: MADQI

Functional MRI Time Series Generation via Wavelet-Based Image Transform and Spectral Flow Matching for Brain Disorder Identification

Structured interactions improve distributed coordination beyond model scaling in a real-world multi-robot system

When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception

Unicorn: Scaling High-Dimensional Time Series Forecasting via Universal Correlation Modeling

Gait2Hip-60: A Unified Deep Learning Benchmark for Predicting Hip Muscle Forces and Joint Moments from Multi-Cadence Gait Kinematics

Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation

QASM-Eval: A Dataset to Train and Evaluate LLMs on OpenQASM-3 Beyond Quantum Circuits

Self-Supervised Online Robot-Agnostic Traversability Estimation for Open-World Environments

Conveyance: A Versatile Framework for Learning in Structured Class Spaces

PINE: Pruning Boosted Tree Ensembles with Conformal In-Distribution Prediction Equivalence

Reward Bias Substitution: Single-Axis Bias Mitigations Redirect Optimization Pressure

Retrieval, Reward, and Training Protocols: What Matters in Training Search Agents?

The Fundamental Limits of Fraud Detection in Card Payment Networks

SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs

VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction

Neuro-Symbolic Verification of LLM Outputs for Data-Sensitive Domains (extended preprint)

When Muon Optimizer Meets Adversarial Training: A Theoretical and Empirical Study

$R^3$: 3D Reconstruction via Relative Regression

PRISM: Position-encoded Regressive Inverse Spectral Model for Multilayer Thin-Film Design

Multi-Robot Box Transport over Different Surfaces with Decentralized Role-based Proportional Control

Advancing Creative Physical Intelligence in Large Multimodal Models

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

Collaborative Navigation and Exploration with $β$-Sparse Gaussian Processes