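2606.07289 2026-06-08 cs.LG cs.CV New submission

Where Rectified Flows Leak: Characterizing Membership Signals Along the Interpolation Path

Thomas Sesmat, Gabriel Meseguer-Brocal, Geoffroy Peeters

Affiliations: University of Amsterdam

AI summary: This paper analyses training-data membership signals along the interpolation path of Rectified Flows. It finds that the reconstruction gap between training and test data follows a bell-shaped curve, derives the peak location under Gaussian assumptions, validates that the structure is universal, and uses it to mount a membership inference attack.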

Comments ICML 2026 article, 9 main pages and 25 with annexes, 11 figures

Journal ref: 43rd International Conference on Machine Learning, Seoul, South Korea, 2026

Abstract

Understanding what generative models retain from training data remains challenging, with implications for copyright and privacy. Beyond verbatim reproduction, models can encode subtler traces of their training data that never surface in their outputs yet remain exploitable. We study this regime for Rectified Flows, which are increasingly used in deployed generative systems. We analyse the interpolation path $X_\lambda = (1-\lambda)X_0 + \lambda X_1$ that defines the Rectified Flow training. We show that a gap exists between the reconstruction of train and test data that follows a bell-shaped curve over $\lambda$, which accumulates during training, while the validation metrics remain stable. The signal has a maximum whose location we derive in closed form under Gaussian assumptions. We validate these predictions on both audio and images and show that the bell-shaped structure is universal, while the peak prediction holds when our assumptions are satisfied. As a proof of concept, we exploit this specific $\lambda$-resolved structure to perform a Membership Inference Attack, distinguishing members of the training set from non-members.
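
The interpolation path and a per-$\lambda$ train/test gap can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: it assumes that $X_0$ is standard Gaussian noise and $X_1$ is data, that the model predicts the velocity $X_1 - X_0$, and that the gap is defined as test error minus train error. The names `velocity_model`, `reconstruction_error`, and `gap_curve` are hypothetical.

```python
import random

def interpolate(x0, x1, lam):
    """Point on the path X_lambda = (1 - lambda) * X_0 + lambda * X_1."""
    return [(1.0 - lam) * a + lam * b for a, b in zip(x0, x1)]

def reconstruct_x1(x_lam, v_hat, lam):
    """Extrapolate from x_lambda to the X_1 endpoint with a predicted velocity.

    With the exact velocity X_1 - X_0 this returns X_1.
    """
    return [x + (1.0 - lam) * v for x, v in zip(x_lam, v_hat)]

def reconstruction_error(velocity_model, x1, lam, rng):
    """Mean squared error between x1 and its reconstruction at level lam."""
    x0 = [rng.gauss(0.0, 1.0) for _ in x1]  # assumption: X_0 is Gaussian noise
    x_lam = interpolate(x0, x1, lam)
    x1_hat = reconstruct_x1(x_lam, velocity_model(x_lam, lam), lam)
    return sum((a - b) ** 2 for a, b in zip(x1_hat, x1)) / len(x1)

def gap_curve(velocity_model, train_set, test_set, lambdas, seed=0):
    """Test-minus-train reconstruction error, one value per lambda."""
    rng = random.Random(seed)
    curve = []
    for lam in lambdas:
        train = sum(reconstruction_error(velocity_model, x, lam, rng)
                    for x in train_set) / len(train_set)
        test = sum(reconstruction_error(velocity_model, x, lam, rng)
                   for x in test_set) / len(test_set)
        curve.append(test - train)
    return curve
```

Under these assumptions, plotting `gap_curve` over a grid of $\lambda$ values for a trained model would be one way to look for the bell shape the abstract describes; a membership score for a single sample could then be read off near the peak.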

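2606.07254 2026-06-08 cs.LG cs.FL New submission

A Held-Out Transition-Pair Falsifier for Long-Horizon Non-Abelian State Tracking

Jeonghoon Lee

Affiliations: Attractor Dynamics

AI summary: To probe the limits of sequence models in non-commutative state tracking, the paper proposes a held-out transition-pair falsification protocol. A projected recurrent state model achieves perfect predictions over horizons of about one million steps, indicating that explicit non-commutative state composition is an effective inductive bias.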

Comments Technical preprint, 24 pages, 7 figures

Abstract

State tracking exposes a sharp limitation of sequence models: the relevant signal is often not a summary of observed tokens, but an ordered latent state that evolves through non-commutative transformations. We introduce a held-out transition-pair falsifier for finite non-Abelian group tracking. The protocol forbids selected ordered generator pairs during training and requires the same local patterns during evaluation, blocking one direct local-transition memorization pathway. In a controlled $S_3 \times S_3$ benchmark, a projected recurrent state model trained only on length-8 sequences produces error-free final-state predictions (perfect 250/250 per horizon) through evaluation horizons up to 1,048,576 tokens across five seeds. Matched native-readout baselines, including bag, GRU, and a single-configuration structured state-space model, remain near floor under the same protocol. Projection-matched GRU, structured SSM, and bag baselines equipped with analogous finite-group prototype readouts also remain near chance under the same split. Mechanism diagnostics show that hard projection coincides with low homomorphism error, low state-consistency drift, and non-trivial commutator separation, while softened projection collapses final-state accuracy. Clean-split audits verify zero verbatim reduced-word overlap and zero structural-template overlap between training and evaluation partitions. The evidence is scoped to this controlled finite-group falsifier rather than to a general architecture ranking. Within that regime, explicit projected non-commutative state composition acts as a useful inductive bias for long-horizon hidden-state tracking.
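
The held-out transition-pair idea can be illustrated with a small Python sketch. It is a hedged reconstruction of the protocol as the abstract describes it, not the paper's benchmark code: the generator set, the token names `a` to `d`, the reading of "ordered generator pairs" as adjacent token pairs, and the rejection sampler are all assumptions.

```python
import random

IDENTITY = (0, 1, 2)
SWAP = (1, 0, 2)    # transposition of elements 0 and 1
CYCLE = (1, 2, 0)   # 3-cycle

def compose(p, q):
    """Permutation 'q first, then p': (p o q)(i) = p[q[i]]."""
    return tuple(p[q[i]] for i in range(len(q)))

# Hypothetical generator set for S_3 x S_3: one swap and one cycle per factor.
GENERATORS = {
    "a": (SWAP, IDENTITY),
    "b": (CYCLE, IDENTITY),
    "c": (IDENTITY, SWAP),
    "d": (IDENTITY, CYCLE),
}

def final_state(tokens):
    """Hidden group element reached after applying the tokens in order."""
    left, right = IDENTITY, IDENTITY
    for token in tokens:
        g_left, g_right = GENERATORS[token]
        left, right = compose(g_left, left), compose(g_right, right)
    return (left, right)

def contains_forbidden(tokens, forbidden):
    """True if any adjacent ordered pair of tokens is in the forbidden set."""
    return any(pair in forbidden for pair in zip(tokens, tokens[1:]))

def sample_sequence(length, forbidden, rng, want_forbidden):
    """Rejection-sample a sequence that avoids (train) or contains (eval)
    at least one forbidden ordered pair."""
    names = sorted(GENERATORS)
    while True:
        tokens = [rng.choice(names) for _ in range(length)]
        if contains_forbidden(tokens, forbidden) == want_forbidden:
            return tokens
```

Training sequences would be drawn with `want_forbidden=False` and evaluation sequences with `want_forbidden=True`, so a model cannot succeed at evaluation by recalling local transitions it saw in training. Because `a` and `b` do not commute, `final_state(["a", "b"])` and `final_state(["b", "a"])` differ, which is what makes the ordered pairs meaningful.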

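2606.07253 2026-06-08 cs.AI econ.EM New submission

TOPSIS-RAD: Ranking According to Desires

Leonardo Fernandes Costa, Helder Gomes Costa, Diogo Lima, Brunno Rodrigues

Affiliations: Universidade Federal Fluminense; Leonardo Sistemas Consultoria LTDA

AI summary: Proposes TOPSIS-RAD, which introduces decision-maker-defined Vetoed Performance Levels and Desired Performance Levels to address three weaknesses of traditional TOPSIS: rankings misaligned with decision-maker requirements, sensitivity to outliers, and rank reversal.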

Comments 21 pages, 15 tables and 6 figures. The numerical results in the toy examples were computed with the Visual TOPSIS RAD app, available at https://topsis-ranking.vercel.app/. The toy-example data are also available at this URL and can be loaded in the app as the template "Article".

Abstract

Traditional TOPSIS derives its reference points -- the Positive Ideal Solution ($PIS$) and Negative Ideal Solution ($NIS$) -- from the observed alternative set, making rankings susceptible to misalignment with decision-maker (DM) requirements, sensitivity to outlier performances, and rank reversal. This paper proposes TOPSIS-RAD, which addresses these issues by incorporating two arrays of DM-defined reference levels. Vetoed Performance Levels ($VPL$) exclude non-viable alternatives before normalisation, preventing them from distorting the ranking frontiers. Desired Performance Levels ($DPL$) cap performances at the DM's desired level before normalisation, anchoring the $PIS$ in explicit aspirations rather than dataset extremes. Three toy examples demonstrate each mechanism: $VPL$ reshapes normalisation boundaries by removing a non-viable alternative; fixed $DPL$ frontiers stabilise rankings by limiting the influence of performances well above the desired level. The method preserves the familiar distance-based structure of TOPSIS while grounding the ranking in stable, DM-specified boundaries. Limitations and future research directions are also discussed.
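
The veto-then-cap mechanism can be sketched as follows. This is one possible reading of the abstract, not the authors' formulas: it assumes all criteria are benefit criteria (larger is better), that each criterion is rescaled linearly between its $VPL$ and $DPL$, and that the $PIS$ and $NIS$ sit at 1 and 0 on that scale. The function name `topsis_rad` and the weighted Euclidean distance are likewise illustrative choices.

```python
import math

def topsis_rad(alternatives, vpl, dpl, weights=None):
    """Rank alternatives using fixed, decision-maker-defined frontiers.

    alternatives: dict mapping a name to a list of performances.
    vpl, dpl: per-criterion vetoed and desired levels (requires dpl > vpl).
    Returns (name, closeness) pairs sorted from best to worst.
    """
    n = len(vpl)
    weights = weights or [1.0 / n] * n
    scores = {}
    for name, perf in alternatives.items():
        # Veto: drop any alternative below the vetoed level on some criterion.
        if any(x < v for x, v in zip(perf, vpl)):
            continue
        # Cap: performance above the desired level earns no extra credit.
        capped = [min(x, d) for x, d in zip(perf, dpl)]
        # Normalise against the fixed frontiers, not the observed data.
        norm = [(c - v) / (d - v) for c, v, d in zip(capped, vpl, dpl)]
        d_pos = math.sqrt(sum(w * (1.0 - z) ** 2 for w, z in zip(weights, norm)))
        d_neg = math.sqrt(sum(w * z ** 2 for w, z in zip(weights, norm)))
        scores[name] = d_neg / (d_pos + d_neg)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

ranking = topsis_rad(
    {"A": [9.0, 9.0], "B": [100.0, 5.0], "C": [1.0, 10.0]},
    vpl=[2.0, 2.0],
    dpl=[10.0, 10.0],
)
```

In this sketch each alternative's score depends only on its own performances and the fixed levels, so adding or removing other alternatives cannot reorder the rest. That is the intuition behind the stability the abstract claims, though the paper's actual normalisation may differ.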

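2606.07249 2026-06-08 cs.CV New submission

Reconstructing Multi-Decadal Forest Disturbances: A Spatio-Temporal Transformer Approach

Linus Scheibenreif, Anton Raichuk, Maxim Neumann

Affiliations: Google DeepMind

AI summary: Proposes a spatio-temporal Transformer framework that models temporal trajectories and spatial neighbourhoods jointly. Using Landsat and Sentinel-1/2 data, it reconstructs forest disturbance maps of the United States for 1984-2022, reaching high precision on a manually annotated validation set while reducing spatial artifacts.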

Abstract

Accurate monitoring of forest disturbances is essential for understanding carbon dynamics and land management, yet traditional approaches typically rely on pixel-wise analysis of satellite time-series, ignoring spatial context. We present a deep learning framework that maps 38 years (1984-2022) of forest disturbance across the contiguous United States by modeling temporal trajectories and spatial neighborhoods simultaneously. By leveraging a vision transformer architecture, our approach effectively filters noise from weak supervision signals to produce spatially coherent disturbance maps. We perform exhaustive evaluations across multiple satellites (Landsat, Sentinel-1, Sentinel-2) and temporal windows (38 years and the more recent 6 years), validating performance against a novel, manually annotated validation dataset (n=300) and an independent fire perimeter dataset (n=706). The results highlight the complexity of the task: while our spatio-temporal model demonstrates high precision (up to 98.2% for ±1 year detection on MTBS and up to 71.3% on the CONUS validation datasets, with F1-scores up to 75.8% and 47.3%, respectively) and effectively reduces spatial artifacts, it exhibits performance trade-offs across different disturbance regimes compared to pixel-wise baselines. Our method offers a promising foundation for consistent forest monitoring.

2606.07244 2026-06-08 cs.RO cs.AI cs.CV New submission

Beyond Waypoints: A Trajectory-Centric Waypointing Paradigm for Vision-Language Navigation

Haoxiang Shi, Xiang Deng, Haoyu Zhang, Qiaohui Chu, Yaowei Wang, Liqiang Nie

Affiliations: Harbin Institute of Technology (Shenzhen); Pengcheng Laboratory

AI summary: Proposes the Trajectory Waypoint paradigm, which predicts executable trajectories with a TSDF-guided diffusion policy to address unreachable waypoints and planning-control inconsistency in VLN-CE, and outperforms the baselines on the VLN-CE benchmark.

Abstract

Vision-Language Navigation in Continuous Environments (VLN-CE) requires agents to follow natural-language instructions while navigating in real-world-like environments. Most VLN-CE approaches adopt a three-stage framework: a waypoint predictor proposes navigable waypoints, and a navigator selects the best waypoint, with a low-level controller executing the movement to it. However, this decoupled paradigm often leads to unreachable waypoints or inconsistencies between planning and control. In this work, instead of predicting isolated waypoints, we introduce a novel paradigm called Trajectory Waypoint, which grounds each candidate waypoint in an executable trajectory. To realize this, we design a Trajectory Waypoint Predictor formulated as a TSDF-guided diffusion policy, which steers trajectory generation away from obstacles, inherently ensuring the reachability of the predicted waypoints. We further propose a trajectory-enhanced navigator that injects the associated trajectory as additional information for planning, enabling strict consistency between high-level semantic decisions and low-level execution. Extensive experiments on the VLN-CE benchmark show that our Trajectory Waypoint paradigm achieves superior performance over the baselines.

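2606.07240 2026-06-08 cs.CL cs.SD New submission

KIT's Submission to Cross-Lingual Voice Cloning in IWSLT 2026

Seymanur Akti, Alexander Waibel

Affiliations: Karlsruhe Institute of Technology (KIT); Carnegie Mellon University (CMU); KIT Campus Transfer (KCT)

AI summary: To address accent shift and domain-specific vocabulary in cross-lingual voice cloning, the system builds on the FishAudio-S2-Pro multilingual text-to-speech model and adds language-tag prompting, reinforcement-learning fine-tuning, and reference-conditioned vocabulary matching, improving intelligibility and naturalness.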

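2606.07239 2026-06-08 cs.LG New submission

Adversarial Creation and Detection of AI-Generated Social Bot Content

Mykola Trokhymovych, Ricardo Baeza-Yates, Alessandro Flammini, Diego Saez-Trumper, Filippo Menczer

Affiliations: Universitat Pompeu Fabra; Observatory on Social Media, Indiana University; KTH Royal Institute of Technology

AI summary: Proposes an adversarial approach that simulates malicious users impersonating real people, builds a multilingual, cross-platform paired dataset, and trains detection models that significantly outperform existing methods.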

2606.07217 2026-06-08 cs.RO cs.CV cs.LG New submission

Robotic Policy Adaptation via Weight-Space Meta-Learning

Christian Bianchi, Siamak Yousefi, Alessio Sampieri, Andrea Roberti, Luca Rigazio, Fabio Galasso, Luca Franco

Affiliations: ItalAI; University of Verona; Sapienza University of Rome

AI summary: Proposes WIZARD, a framework that uses weight-space meta-learning to generate task-specific LoRA parameters from a language instruction and a demonstration video, adapting to new tasks without fine-tuning and improving performance on LIBERO by up to about 14x.

Abstract

Vision-Language-Action (VLA) models are emerging as a promising paradigm for robotic manipulation, enabling general-purpose policies trained from large corpora of demonstrations and action labels. However, adapting these models to new tasks still typically requires task-specific demonstrations, action annotations, and additional fine-tuning, making deployment costly and difficult to scale. We propose WIZARD, a weight-space meta-learning framework that sidesteps task-specific fine-tuning by generating task-specific LoRA parameters for a frozen VLA policy. Given only a language instruction and a short demonstration video, WIZARD predicts the corresponding adaptation weights in a single forward pass, without target-task action labels or test-time optimization. During meta-training, WIZARD learns to map task evidence directly to expert LoRA updates, capturing relationships between tasks in weight space. Experiments on LIBERO show that WIZARD improves performance by up to ~2x on unseen dataset collections and up to ~14x on unseen tasks. On a Franka Emika Panda, WIZARD consistently improves over a real-domain adapted baseline, showing that generated adapters provide task-level specialization beyond simulation.
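
For readers unfamiliar with LoRA, the sketch below shows what "generating adaptation weights for a frozen policy" amounts to at the level of one weight matrix. It covers only the standard low-rank update $W' = W + s\,BA$, written in plain Python; how WIZARD actually predicts the factors from the instruction and video is not described in the abstract, so the factors here are hand-written stand-ins and the function names are hypothetical.

```python
def matmul(a, b):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def apply_lora(frozen_w, lora_b, lora_a, scale=1.0):
    """Return W + scale * (B @ A) without modifying the frozen weight W.

    frozen_w: d_out x d_in, lora_b: d_out x r, lora_a: r x d_in.
    """
    delta = matmul(lora_b, lora_a)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(frozen_w, delta)]

# Stand-in rank-1 factors; in a weight-space method these would be the
# output of the generator network rather than constants.
frozen = [[1.0, 0.0], [0.0, 1.0]]
adapted = apply_lora(frozen, lora_b=[[1.0], [0.0]], lora_a=[[0.0, 2.0]])
```

Since only the small factors change per task, a single forward pass of a generator that outputs `lora_b` and `lora_a` is enough to specialise the frozen policy, which matches the "no test-time optimization" property the abstract emphasises.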

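2606.07211 2026-06-08 cs.RO cs.AI New submission

An Abstract Architecture for Explainable Autonomy in Hazardous Environments

Matt Luckcuck, Hazel M Taylor, Marie Farrell

Affiliations: Maynooth University; University of Manchester

AI summary: Proposes an abstract architecture that supports autonomous systems in explaining their behaviour, aiming to strengthen user trust through explainability by design, and illustrates its application with an example from the civil nuclear industry.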

Comments Originally published 20th of October 2022 at the Second International Workshop on Requirements Engineering for Explainable Systems (RE4ES), which was hosted by the International Requirements Engineering Conference 2022

DOI: 10.1109/REW56159.2022.00027

Abstract

Autonomous robotic systems are being proposed for use in hazardous environments, often to reduce the risks to human workers. In the immediate future, it is likely that human workers will continue to use and direct these autonomous robots, much like other computerised tools but with more sophisticated decision-making. Therefore, one important area on which to focus engineering effort is ensuring that these users trust the system. Recent literature suggests that explainability is closely related to how trustworthy a system is. Like safety and security properties, explainability should be designed into a system, instead of being added afterwards. This paper presents an abstract architecture that supports an autonomous system explaining its behaviour (explainable autonomy), providing a design template for implementing explainable autonomous systems. We present a worked example of how our architecture could be applied in the civil nuclear industry, where both workers and regulators need to trust the system's decision-making capabilities.

2606.07210 2026-06-08 cs.SD cs.CR New submission

A Large-Scale Per-Speaker Analysis of Re-identification Risk in Speech Anonymization

Orane Dufour, Paul Magron, Mickael Rouvier, Emmanuel Vincent

Affiliations: Université de Lorraine, CNRS, Inria, LORIA; LIA, Avignon University

AI summary: A large-scale per-speaker analysis shows that re-identification risk in speech anonymization varies widely across individuals and is determined jointly by the attacker, the anonymizer, and the amount of available speech, which challenges the notion of intrinsic speaker-level privacy risk.

Comments Accepted to Interspeech

Abstract

Speech anonymization is commonly evaluated using average-case metrics such as the equal error rate, which can hide large disparities in re-identification risks across individuals. In this paper, we conduct a large-scale per-speaker privacy analysis using a linkability-based metric under a worst-case scenario. Nearly 5,000 speakers are evaluated across multiple anonymization systems, attacker architectures, and conversation lengths. While linkability scores are highly polarized at the speaker level, the sets of easy to re-identify and hard to re-identify speakers vary substantially across configurations. We show that no single factor explains speaker vulnerability. Instead, the re-identification risk emerges from the interaction between the attacker, the anonymizer, and the amount of available speech. These results challenge the notion of intrinsic speaker-level privacy risks and emphasize the need for evaluation protocols that are explicitly conditioned on the attacker and anonymizer.

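2606.07207 2026-06-08 cs.SD cs.LG eess.AS New submission

Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development

Zixi Li, Youzhen Li

Affiliations: Sun Yat-sen University; Datawhale

AI summary: Proposes the Eisbach log-barrier, which uses the entropy of the DiT output's spatial energy distribution as a loss weight. In supervised diffusion training it modulates the gradient step size, promoting thematic development, acoustic differentiation, and textural diversity while avoiding mode collapse.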

Abstract

Confidence-based loss weighting is usually avoided in generative models because it accelerates errors when the model is confidently wrong, but this intuition breaks down in supervised diffusion training. We introduce the Eisbach log-barrier, a parameter-free weight derived from the entropy of the DiT output's spatial energy distribution: high entropy damps the gradient, while low entropy preserves it. Applied to LoRA fine-tuning of Stable Audio 3 Medium on MusicCaps, it unexpectedly yields stronger thematic development, clearer acoustic differentiation, and higher textural diversity than unweighted training, the opposite of mode collapse. This works because in supervised diffusion the gradient direction is locked to ground truth, so confidence only scales the step size, and because temporal entropy downweights flat samples while preserving high-contrast ones. The result is an online, self-referential data curriculum that emerges purely from the forward pass, with analyzed noise-level dynamics and testable predictions.

2606.07196 2026-06-08 cs.LG New submission

Structure-Preserving Correction Learning for Sparse Bayesian Inference in Brain Source Imaging

Marco Morik, Xiao Ruiting, Shinichi Nakajima, Stefan Haufe, Ismail Huseynov

Affiliations: Berlin Institute for the Foundations of Learning and Data (BIFOLD); Technische Universität Berlin; RIKEN Center for Advanced Intelligence Project (AIP); Physikalisch-Technische Bundesanstalt; Charité – Universitätsmedizin Berlin

AI summary: Proposes a structure-preserving correction-learning method that unfolds a classical joint hyperparameter solver into a trainable neural network, learning the update mechanism while retaining the Bayesian structure and improving reconstruction performance and convergence in M/EEG brain source imaging.

Comments preprint

Abstract

Classical sparse Type-II Bayesian methods for M/EEG brain imaging support joint estimation of source and noise hyperparameters, but rely on fixed iterative update rules. Although these updates are principled and interpretable, their dynamics cannot be adapted from data. We propose to learn the update mechanism itself while preserving the underlying Bayesian structure by unfolding a classical joint hyperparameter-learning solver into a trainable neural architecture whose layers mirror the original iterations. The resulting framework is initialized to recover the classical solver exactly before training and is enriched through progressively more expressive correction-learning mechanisms, ranging from learnable biases to adaptive MLP and attention-based contextual refinements. In this way, training does not replace Bayesian inference with a black-box predictor, but instead learns structured correction terms while retaining the interpretability and model-based character of the original update dynamics. Structured correction learning therefore aims to improve empirical reconstruction performance without replacing the original model-based inference mechanism. Experimental results show that the learned correction variants improve reconstruction performance and convergence behavior over the baseline unfolded solver while preserving its algorithmic transparency.

2606.07193 2026-06-08 cs.RO New submission

Shield-Loco: Shielding Locomotion Policies with Predictive Safety Filtering

Aditya Shirwatkar, Sebastian Sanokowski, Shishir Kolathaya, Aaron Johnson, Majid Khadiv

Affiliations: Robert Bosch Center for Cyber Physical Systems; Indian Institute of Science; Munich Institute of Robotics and Machine Intelligence (MIRMI); Technical University of Munich; Department of Computer Science & Automation; Department of Mechanical Engineering; Carnegie Mellon University; Institute for Advanced Study

AI summary: Proposes a predictive safety filter that optimises contact sequences with a full-physics model, reducing safety violations for quadruped robots in dense, cluttered environments while preserving task performance.

Abstract

Reinforcement learning (RL) policies enable dynamic legged locomotion but lack mechanisms to avoid violations of safety constraints that are absent during training. Large-scale offline safe learning is impractical for covering all edge cases. Existing safety frameworks either rely on reduced-order models that cannot reason about whole-body behaviors or require conservative recovery controllers that degrade task performance. We propose a predictive safety filter that post-hoc filters the nominal contact locations fed to the RL policy. When a collision is predicted, a sampling-based optimizer asynchronously searches for safer contact sequences using a full-physics model, while a learned value function bootstraps long-horizon returns. Our three algorithmic components (geometric projection of sampled contacts, momentum-augmented updates, and replica-exchange) make the optimization tractable in a discontinuous contact landscape. We validate the filter on a quadruped robot in dense, cluttered environments, both in simulation and in the real world, showing substantial reductions in safety violations with minimal deviation from the nominal input.

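2606.07190 2026-06-08 cs.CL New submission

From Correctness to Utility: Gain-Based Prefix Evaluation for LLM Reasoning

Yuhang Zhou, Yixin Cao, Guangnan Ye

Affiliations: Fudan University; Shanghai Innovation Institute

AI summary: Introduces the notion of prefix gain and trains a Prefix Utility Model (PUM) with a pairwise ranking objective to estimate how much a reasoning prefix raises the success rate, outperforming conventional correctness-based evaluation on mathematical reasoning tasks.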

Abstract

Reasoning prefixes shape the future trajectory of LLM problem solving, yet existing process reward models usually evaluate them through local step correctness. We argue that correctness is a useful but indirect proxy for the effect we ultimately care about: whether a prefix increases the probability of successful completion. We define this effect as prefix gain, the solve-rate improvement induced by conditioning a group of lightweight student models on a prefix, and use it to train a Prefix Utility Model (PUM) with a simple pairwise ranking objective. PUM learns outcome-grounded prefix utility and can score both complete trajectories and partial reasoning prefixes. Across Best-of-$N$ selection, beam search, and reinforcement learning on mathematical reasoning, PUM provides a strong prefix-level supervision signal, especially when candidate pools are large, search budgets increase, or rule-based rewards are sparse. We release all data, models, and code at https://zhiqix.github.io/pum-project-page.
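
Prefix gain and a pairwise ranking objective can be written down compactly. The sketch below is illustrative only: the abstract does not give the exact loss, so the logistic (Bradley-Terry style) form, the `margin` parameter, and all function names are assumptions, and outcomes are represented as lists of 0/1 solve indicators from the student models.

```python
import math

def solve_rate(outcomes):
    """Fraction of successful completions among 0/1 outcomes."""
    return sum(outcomes) / len(outcomes)

def prefix_gain(outcomes_with_prefix, outcomes_without_prefix):
    """Solve-rate improvement from conditioning the students on a prefix."""
    return solve_rate(outcomes_with_prefix) - solve_rate(outcomes_without_prefix)

def ranking_pairs(gains, margin=0.0):
    """(better, worse) prefix pairs whose gains differ by more than margin."""
    names = sorted(gains)
    return [(hi, lo) for hi in names for lo in names
            if gains[hi] - gains[lo] > margin]

def pairwise_ranking_loss(score_better, score_worse):
    """Numerically stable -log(sigmoid(score_better - score_worse))."""
    diff = score_better - score_worse
    if diff >= 0:
        return math.log1p(math.exp(-diff))
    return -diff + math.log1p(math.exp(diff))
```

A utility model trained this way only needs to order prefixes correctly rather than regress the gain value itself, which is one plausible reason to prefer a ranking objective when solve-rate estimates are noisy.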

2606.07186 2026-06-08 cs.RO cs.SE New submission

OPTIMUS-Prime: Minimal and Sufficient Concept Explanations for Deep Vision Models

Arthur Hoarau, Chenrui Zhu, Vu Linh Nguyen

Affiliations: Université de Lorraine; CentraleSupélec Loria; CNRS; Metz, France; Université de technologie de Compiègne UMR CNRS 7253 Heudiasyc; France

AI summary: Proposes OPTIMUS, a framework grounded in the theory of prime implicants that generates visual heatmap explanations satisfying sufficiency and minimality, with formal guarantees.

Abstract

The growing demand for transparency in automated decision-making has propelled eXplainable Artificial Intelligence (XAI) to the forefront of machine learning research. In computer vision, however, existing explanation methods often prioritize end-user accessibility at the expense of formal guarantees, leaving a critical gap between practical utility and theoretical rigor. In this paper, we address this gap by introducing OPTIMUS, a novel framework for generating concept-based visual explanations for deep classification models. OPTIMUS explanations take the form of visual heatmaps that not only remain interpretable to end users, but are grounded in the well-established theory of prime implicants, providing formal guarantees that have been largely absent from existing saliency-based methods. Specifically, OPTIMUS explanations satisfy two desirable properties: sufficiency, ensuring that the highlighted concepts provably guarantee the classifier's prediction, and minimality, ensuring that no strict subset of those concepts retains this guarantee. Together, these properties yield explanations that are both logically tight and visually coherent. We validate our approach on a visual classification benchmark, demonstrating that OPTIMUS heatmaps naturally and faithfully surface the decision-relevant concepts underlying model predictions.
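
The two properties named above, sufficiency and minimality, can be made concrete on a toy classifier over binary concepts. This is a generic prime-implicant illustration under stated assumptions, not the OPTIMUS algorithm: concepts are 0/1 values in a dictionary, sufficiency is checked by brute-force enumeration (feasible only for a handful of concepts), and `toy_classifier` is a made-up example.

```python
from itertools import product

def is_sufficient(classifier, instance, kept):
    """True if fixing the concepts in `kept` to their values in `instance`
    forces the same prediction for every setting of the other concepts."""
    target = classifier(instance)
    free = [c for c in instance if c not in kept]
    for values in product([0, 1], repeat=len(free)):
        candidate = dict(instance)
        candidate.update(zip(free, values))
        if classifier(candidate) != target:
            return False
    return True

def minimal_sufficient_concepts(classifier, instance):
    """Greedy deletion: drop each concept whose removal keeps sufficiency.

    Because any superset of a sufficient set is also sufficient, the result
    has no sufficient strict subset.
    """
    kept = set(instance)
    for concept in sorted(instance):
        trial = kept - {concept}
        if is_sufficient(classifier, instance, trial):
            kept = trial
    return kept

def toy_classifier(concepts):
    """Hypothetical rule: 'bird' needs both wings and a beak."""
    return "bird" if concepts["wings"] and concepts["beak"] else "other"

instance = {"wings": 1, "beak": 1, "fur": 0, "tail": 1}
explanation = minimal_sufficient_concepts(toy_classifier, instance)
```

Here the explanation keeps `wings` and `beak` and drops `fur` and `tail`, since the prediction is fixed once the first two are set. Different deletion orders can yield different minimal sets in general, so a real method would need a principled way to choose among them.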
