arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 3947
2606.16952 2026-06-16 cs.LG cs.AI stat.AP stat.ME stat.ML 新提交

Phantoms and Disclosures: a Causal Framework for Auditing Synthetic Data

幻象与披露:合成数据审计的因果框架

Kareem Amin, Rudrajit Das, Alessandro Epasto, Adel Javanmard, Dennis Kraft, Mónica Ribero, Sergei Vassilvitskii

发表机构 * Google(谷歌) University of Southern California(南加州大学)

AI总结 提出一个可定制的实证审计框架,通过区分真实披露与幻象披露,利用统计假设检验检测合成数据中的隐私泄露,无需模型访问或参考模型,提供比先前方法更紧的隐私泄露下界。

Comments 35 pages, 10 tables, 5 figures

详情
AI中文摘要

生成式AI和大语言模型(LLMs)的快速普及激发了人们对合成数据的兴趣,将其作为敏感真实数据集的隐私保护替代方案。然而,生成高实用性合成数据往往存在记忆和复述训练语料中隐私信息的风险。在这项工作中,我们提出了一个可定制的实证审计框架,旨在检测和解释此类数据披露。我们的框架引入了一种机制来区分“真实披露”——系统直接复现用户信息的情况,以及“幻象披露”——系统偶然生成用户数据的情况。通过将输入数据划分为训练集和保留集,并应用严格的统计假设检验,我们确定观察到的披露是否与严格的隐私基线(如零学习或特定的差分隐私(DP)边界)一致。关键的是,这种方法不需要模型访问、不需要插入金丝雀数据,也不需要参考模型训练——仅需要合成输出和保留的控制集。我们证明,该框架有效地充当了成员推断攻击,提供了比先前基于数据的审计方法更紧的隐私泄露经验下界。我们的方法是模型无关的,适用于任何合成数据生成机制,并且所需的计算资源比影子模型或基于金丝雀的替代方法少几个数量级。

英文摘要

The rapid adoption of generative AI and Large Language Models (LLMs) has spurred interest in synthetic data as a privacy-preserving alternative to sensitive real-world datasets. However, generating high-utility synthetic data often carries the risk of memorizing and regurgitating private information from the training corpus. In this work, we present a customizable empirical auditing framework designed to detect and explain such data disclosures. Our framework introduces a mechanism to distinguish between "true disclosures"-where the system directly reproduces a user's information-and "phantom disclosures''-where the system incidentally generates a user's data. By partitioning input data into training and holdout sets and applying rigorous statistical hypothesis testing, we determine if observed disclosures are consistent with strict privacy baselines, such as zero-learning or specific Differential Privacy (DP) bounds. Crucially, this approach requires no model access, no canary insertion, and no reference model training -only the synthetic output and a held-out control set. We demonstrate that this framework effectively functions as a membership inference attack, providing empirical lower bounds on privacy leakage that are tighter than prior data-based auditing methods. Our approach is model-agnostic, applies to any synthetic data generation mechanism, and requires orders of magnitude fewer computational resources than shadow-model or canary-based alternatives.

2606.16951 2026-06-16 cs.CV eess.IV 新提交

Simulation-Based Multi-Fillet Evaluation of Woody Breast Poultry Fillets

基于仿真的多鸡胸肉木质化评估

Chirantan Sen Mukherjee, Seung-Chul Yoon, William J. Beksi

发表机构 * Department of Computer Science and Engineering, The University of Texas at Arlington(德克萨斯大学阿灵顿分校计算机科学与工程系) Quality and Safety Assessment Research Unit, U.S. National Poultry Research Center, USDA Agricultural Research Service(美国农业部农业研究服务局国家家禽研究中心质量与安全评估研究单元)

AI总结 针对单鸡胸肉检测的吞吐量瓶颈,提出一种俯视多鸡胸肉检测架构,通过物理仿真生成数据集并提取二维形状变形分数,实现多鸡胸肉同时评估。

Comments To be published in the 2026 International Conference on Automation Science and Engineering (CASE)

详情
AI中文摘要

木质化鸡胸肉是现代肉鸡的一种肌病,导致胸肌异常僵硬和纤维化,降低肉质并造成重大经济损失。最先进的自动WB检测依赖于侧视成像系统,分析单个鸡胸肉从传送带落下时的弯曲行为。虽然高度准确,但该方法受限于单鸡胸肉视野,在商业加工线上造成吞吐量瓶颈。本文通过一种利用俯视相机配置的新型多鸡胸肉检测架构来解决这一限制。为验证我们的方法,首先开发了工业传送系统的高保真数字孪生。然后,合成多样化的3D鸡胸肉网格数据集,并使用基于物理的仿真引擎模拟其粘弹性弯曲动力学。最后,从俯视视角提取连续的二维形状变形分数,模拟鸡胸肉经过滚轮边缘的过程。实验结果表明,俯视形状分数有效捕捉鸡胸肉弯曲时的轮廓变化,为同时多鸡胸肉WB评估提供了鲁棒且可扩展的侧视成像系统替代方案。

英文摘要

Woody breast (WB) is a myopathy in modern broiler chickens that causes the breast muscle to become unusually stiff and fibrous, leading to decreased meat quality and significant economic losses. State-of-the-art automated WB detection relies on a side-view imaging system to analyze the bending behavior of a single fillet as it falls off a conveyor belt. While highly accurate, this approach is constrained by its single-fillet field of view, creating throughput bottlenecks on commercial processing lines. In this paper, we address this limitation via a novel multi-fillet detection architecture utilizing a top-down camera configuration. To validate our approach, we first develop a high-fidelity digital twin of an industrial conveyor system. Next, we synthesize a diverse dataset of 3D fillet meshes and model their viscoelastic bending dynamics using a physics-based simulation engine. Lastly, a continuous 2D shape deformation score is extracted from the top-down perspective as the simulated fillets traverse the roller precipice. Experimental results demonstrate that the top-down shape score effectively captures the contour changes of the fillets as it bends, providing a robust and scalable alternative to a side-view imaging system for simultaneous multi-fillet WB evaluation.

2606.16944 2026-06-16 cs.AI cs.HC 新提交

A Causal Model of Theory of Mind in Conflict for Artificial Intelligence

冲突情境下心智理论的人工智能因果模型

Nikolos Gurney

发表机构 * Institute for Creative Technologies, University of Southern California(南加州大学创意技术研究所)

AI总结 提出结构因果模型,将心智理论视为由情境和主体条件激活的机制,通过三条因果路径决定何时进行心智化,提升AI社会推理的准确性和效率。

详情
AI中文摘要

心智理论(ToM)是将心理状态归因于他人并利用这些归因进行预测和推理的能力,被广泛认为是有效人机融合的关键。现有AI-ToM模型解决了如何心智化的问题,但基本未涉及何时心智化。核心问题是:在冲突中,何种情境和主体层面的条件下,ToM的参与在因果上是合理的?本文提出一个结构因果模型,形式化为有向无环图(DAG),将ToM视为由情境和主体条件激活的机制,而非始终开启的能力。模型指定了四个捕捉情境和主体条件的外生变量、五个内生中介变量,以及一个通过三条不同因果路径产生参与状态的机制性ToM节点:可处理性路径、推理深度路径和使能原因路径。主要结果是认知准确性,它将社会推理与行为策略解耦,并泛化到冲突之外的社会现象。该框架为AI系统提供了一种有原则的、资源理性的心智化决策程序,对效率、信任以及鲁棒的人工社会智能的发展具有意义。讨论了仿真验证、实证人机协作研究以及由冲突优化心智化引发的伦理问题。

英文摘要

Theory of mind (ToM), the capacity to ascribe mental states to others and use those ascriptions for prediction and inference, is widely assumed to be essential for effective human-machine integration. Existing AI-ToM models address \emph{how} to mentalize, but leave the question of when largely unaddressed. The central question is: under what situational and agent-level conditions is ToM engagement causally warranted in conflict? This paper presents a structural causal model formalized as a directed acyclic graph (DAG), treating ToM as a mechanism activated by situational and agent-level conditions rather than as an always-on capacity. The model specifies four exogenous variables capturing situational and agent-level conditions, five endogenous mediators, and a mechanistic ToM node producing engagement states through three distinct causal pathways: a tractability pathway, a reasoning-depth pathway, and an enabling-cause pathway. The primary outcome is epistemic accuracy, which decouples social reasoning from behavioral policy and generalizes across social phenomena beyond conflict. The framework gives AI systems a principled, resource-rational decision procedure for mentalizing, with implications for efficiency, trust, and the development of robust artificial social intelligence. Simulation validation, empirical human-machine teaming studies, and ethical considerations arising from conflict-optimized mentalizing are discussed.

2606.16935 2026-06-16 cs.RO cs.AI cs.LG 新提交

CrossMaps: Confidence-Aware Open-Vocabulary Semantic Mapping for Rover Navigation

CrossMaps: 用于漫游车导航的置信度感知开放词汇语义地图

Jan-Niklas Klein, Sona Ghahremani, Christian Medeiros Adriano, Holger Giese

发表机构 * Hasso Plattner Institute for Digital Engineering, Potsdam, Germany(哈索·普拉特纳数字工程研究所(德国波茨坦))

AI总结 提出CrossMaps,一种实时置信度感知开放词汇语义地图构建流水线,通过多尺度CLIP嵌入、置信度融合和双记忆架构生成可查询语义地图,用于漫游车导航。

Comments IEEE International Conference on Robotics and Automation (ICRA) 2026: ROSE International Workshop on Robotics Software Engineering, June 01, 2026, Vienna, Austria

详情
AI中文摘要

漫游车依赖感知来维护空间地图,该地图编码物体和传感器质量(例如,距离可靠性、光照伪影、数据密度),指导数据融合、嵌入更新以及在部分可观测性下的导航。为了研究这些耦合的感知-导航过程,我们提出了CrossMaps,一种实时的置信度感知开放词汇语义地图构建流水线,该流水线从RGB-D数据构建可语言查询的地图。基于VLMaps风格的方法,CrossMaps集成了多尺度CLIP嵌入、置信度感知融合以及由短期记忆(STM)和长期记忆(LTM)组成的双记忆架构。STM使用几何、语义和时间置信度线索聚合噪声视觉观测,而置信且一致的单元被提升到LTM作为持久语义地标。CrossMaps设计用于与Jetson Orin驱动的UGV以及SLAM一起部署,实时运行并生成语义热力图,可通过自然语言查询来引导漫游车导航。

英文摘要

Rovers rely on perception to maintain spatial maps that encode both objects and sensor quality (e.g., range reliability, lighting artifacts, data density), guiding data fusion, embedding updates, and navigation under partial observability. To study these coupled perception-navigation processes, we present CrossMaps, a real-time confidence-aware open-vocabulary semantic mapping pipeline that constructs language-queryable maps from RGB-D data. Building on VLMaps-style approaches, CrossMaps integrates multi-scale CLIP embeddings with confidence-aware fusion and a dual-memory architecture consisting of Short-Term Memory (STM) and Long-Term Memory (LTM). The STM aggregates noisy visual observations using geometric, semantic, and temporal confidence cues, while confident and coherent cells are promoted to the LTM as persistent semantic landmarks. Designed for deployment with a Jetson Orin-powered UGV alongside SLAM, CrossMaps runs in real time and produces semantic heatmaps that can be queried with natural language to guide rover navigation.

2606.16933 2026-06-16 cs.LG cs.AI 新提交

A Unified Causal-Origin Taxonomy of Distributional Shifts in Reinforcement Learning

强化学习中分布偏移的统一因果起源分类法

Ardianto Wibowo, Paulo E Santos, Amer Baghdadi, Matthew Stephenson, Karl Sammut, Jean-Philippe Diguet

发表机构 * IMT Atlantique(IMT大西洋) Flinders University(弗林德斯大学) IRL Crossing Priori Analytica CNRS(法国国家科学研究中心)

AI总结 提出一种统一因果起源分类法,将强化学习中的分布偏移按因果来源(内部/外部)和时间边界(显式/隐式/混合)分类,统一了分布内/外泛化与非平稳性分析。

Comments The paper is currently under review at the Journal of Artificial Intelligence Research (JAIR)

详情
AI中文摘要

强化学习系统在运行条件与先前遇到的条件不同时通常会退化,这反映了底层数据生成过程中的分布偏移。这种偏移可能发生在训练和评估之间,如分布内(ID)和分布外(OOD)泛化,或者发生在环境动态随时间演变的非平稳设置中。然而,这些观点之间的形式关系尚不清楚,现有工作主要关注缓解措施而非智能体-环境交互中偏移的因果起源。本文开发了一个统一的因果起源分类法,描述了强化学习中分布偏移的来源,并将ID/OOD泛化与非平稳设置联系起来。我们将监督学习中的经典数据集偏移原则迁移到强化学习,通过将分布偏移重新表述为生成交互过程。使用部分可观测马尔可夫决策过程(POMDP),我们将交互分解为结构组件,包括状态分布、观测过程、策略、奖励和转移动态,以及偏移时间边界。所提出的分类法区分了内部(智能体驱动)和外部(环境驱动)的分布偏移。偏移时间边界视角进一步刻画了显式、隐式和混合偏移。这种表述将ID/OOD泛化和非平稳性统一为底层过程中的结构化变化。我们还引入了一个评估框架,通过性能退化和恢复指标来衡量偏移影响和适应能力。通过将分布偏移扎根于强化学习的因果起源结构,本文支持在分布偏移下进行系统性的鲁棒性分析。

英文摘要

Reinforcement learning (RL) systems often degrade when operating conditions differ from those previously encountered, reflecting distributional shifts in the underlying data-generating process. Such shifts may occur between training and evaluation, as in In-Distribution (ID) and Out-of-Distribution (OOD) generalization, or within non-stationary settings where environment dynamics evolve over time. However, the formal relationship between these views remains unclear, and existing work mainly focuses on mitigation rather than the causal origin of shift within the agent-environment interaction. This work develops a unified causal-origin taxonomy that characterizes sources of distributional shift in RL and relates ID/OOD generalization to non-stationary settings. We transfer the classical dataset-shift principle from supervised learning to RL by reformulating distributional shift in terms of the generative interaction process. Using a Partially Observable Markov Decision Process (POMDP), we decompose the interaction into structural components, including the state distribution, observation process, policy, reward, and transition dynamics, together with the shifted-time boundary. The proposed taxonomy distinguishes internal, agent-driven, and external, environment-driven, distributional shifts. The shifted-time boundary perspective further characterizes explicit, implicit, and hybrid shifts. This formulation unifies ID/OOD generalization and non-stationarity as structured changes in the underlying process. We also introduce an evaluation framework for measuring shift impact and adaptation through performance degradation and recovery metrics. By grounding distributional shift in the causal-origin structure of RL, this work supports systematic analysis of robustness under distributional shift.

2606.16925 2026-06-16 cs.AI 新提交

RAID: Semantic Graph Diffusion for True Cold-Start and Cross-Lingual Forecasting

RAID: 面向真正冷启动和跨语言预测的语义图扩散

Arunkumar V, Manoranjan Gandhudi, Gangadharan G. R., Arun Prakash, S. Senthilkumar

发表机构 * University College of Engineering, Anna University Tiruchirappalli(安娜大学蒂鲁吉拉伯利工程学院) Central University of Karnataka(卡纳塔克中央大学) National Institute of Technology Tiruchirappalli(蒂鲁吉拉伯利国立理工学院) Jawaharlal Nehru University(贾瓦哈拉尔·尼赫鲁大学)

AI总结 针对时间序列基础模型在无历史数据时失效的问题,提出RAID框架,利用元数据语义检索和图条件扩散实现冷启动预测,在准确性和推理速度上超越现有方法,并支持零样本跨语言迁移。

Comments 25 pages, 4 figures, 8 tables

详情
AI中文摘要

时间序列基础模型在给定非空历史窗口时表现出强大的迁移性能。然而,真正的冷启动场景(新项目没有先前的观测数据)违背了这一假设。我们提出了RAID(检索增强迭代扩散)框架,该框架用元数据驱动的语义检索和图条件扩散取代了基于历史的相关性学习。RAID使用冻结的多语言嵌入模型将文本元数据映射到共享语义空间,并构建一个可自然扩展到未见项目的归纳检索图。它首先通过聚合语义相关邻居的信息形成基础预测,然后使用门控扩散模块细化该预测以建模残差不确定性。在严格的真正冷启动协议下,RAID在预测准确性和预测区间覆盖率上均优于强大的基础模型和竞争基线,同时通过非自回归解码将推理延迟降低一个数量级。共享语义空间还实现了零样本跨语言迁移,使得在英文描述上训练的模型能够泛化到其他语言描述的项目,而无需直接监督。

英文摘要

Time-series foundation models show strong transfer performance when given a non-empty history window. However, true cold-start scenarios, where a new item has no prior observations, violate this assumption. We propose RAID (Retrieval-Augmented Iterative Diffusion) a framework, which replaces history-based correlation learning with metadata-driven semantic retrieval and graph-conditioned diffusion. RAID maps textual metadata into a shared semantic space using a frozen multilingual embedding model and constructs an inductive retrieval graph that extends naturally to unseen items. It first forms a base forecast by aggregating information from semantically related neighbors, then refines this forecast with a gated diffusion module to model residual uncertainty. Under a strict true cold-start protocol, RAID outperforms strong foundation models and competitive baselines on both forecasting accuracy and prediction interval coverage, while reducing inference latency by an order of magnitude through non-autoregressive decoding. The shared semantic space also enables zero-shot cross-lingual transfer, allowing a model trained on English descriptions to generalize to items described in other languages without direct supervision.

2606.16923 2026-06-16 cs.AI stat.ML 新提交

MA-SBI: Misspecification-Aware Simulation-Based Inference via Side-Channel Guidance

MA-SBI: 通过侧信道引导的误设定感知仿真推断

Arunkumar V, Manoranjan Gandhudi, Gangadharan G. R., Arun Prakash, S. Senthilkumar

发表机构 * University College of Engineering, Anna University Tiruchirappalli(安娜大学蒂鲁吉拉伯利工程学院) Central University of Karnataka(卡纳塔克中央大学) National Institute of Technology Tiruchirappalli(蒂鲁吉拉伯利国立理工学院) School of Computer & Systems Sciences, Jawaharlal Nehru University(贾瓦哈拉尔·尼赫鲁大学计算机与系统科学学院)

AI总结 针对仿真模型误设定问题,提出无需校准的MA-SBI框架,利用侧信道文本信息进行后验校正,理论保证偏差减少界限,实验表明仅用文本即可匹配oracle后验。

Comments 23 pages, 9 figures, 12 tables

详情
AI中文摘要

潜在参数的仿真推断(SBI)常受仿真器误设定困扰,即由于固有的建模简化导致的仿真观测与真实观测之间的不匹配。最新的鲁棒SBI方法RoPE通过真实与仿真观测学习表示之间的最优传输来解决此问题,但需要真实参数校准对,而这在需要SBI的设置中通常不可用。实践者拥有的是非结构化侧信息,如制度标签、指令文本和政策公告。我们提出误设定感知仿真推断(MA-SBI),一个无需校准的框架,将侧信道转化为后验校正。学习到的校正器将侧信道文本映射到观测空间偏移,应用于任何预训练的摊销后验之前,无需重新训练也无需参数真实值。我们的主要定理通过误设定与侧信道之间的互信息界定了可实现的偏差减少,通过Donsker-Varadhan扩展到所有次高斯噪声的非平凡常数。在隐藏校准基准上,仅使用文本的MA-SBI在10个种子和两个骨干网络上匹配oracle后验(TOST等价),而使用更多数据的RoPE则不能。两种方法互补:当误设定是结构性的且可从参数对中恢复时,RoPE占优,正如理论所预测。随机变体在真实COVID和OxCGRT流行病学数据上提高了后验预测对数似然,并在一个良好设定的认知科学语料库上正确保持后验不变。

英文摘要

Simulation-based inference (SBI) of latent parameters is often hindered by simulator misspecification, the mismatch between simulated and real-world observations caused by inherent modeling simplifications. RoPE, the recent state-of-the-art for robust SBI, addresses this through optimal transport between learned representations of real and simulated observations, but requires ground-truth parameter calibration pairs that are typically unavailable in the very settings where SBI is needed. What practitioners do have is unstructured side-information such as regime labels, instruction text, and policy bulletins. We propose Misspecification-Aware Simulation-Based Inference (MA-SBI), a calibration-free framework that turns this side-channel into a posterior correction. A learned corrector maps side-channel text to an observation-space shift applied before any pre-trained amortized posterior, requiring no retraining and no parameter ground-truth. Our main theorem bounds achievable bias reduction by the mutual information between misspecification and side-channel, with a non-vacuous constant that extends to all sub-Gaussian noise via Donsker-Varadhan. On hide-the-calibration benchmarks, MA-SBI with text alone matches the oracle posterior across 10 seeds and two backbones (TOST equivalence), while RoPE given more data does not. The two approaches are complementary: where misspecification is structural and recoverable from parameter pairs, RoPE dominates, as the theory predicts. A stochastic variant improves posterior-predictive log-likelihood on real COVID and OxCGRT epidemiological data, and correctly leaves the posterior unchanged on a well-specified cognitive-science corpus.

2606.16920 2026-06-16 cs.LG cs.AI 新提交

Demystifying Variance in Circuit Discovery of LLMs

揭示LLM电路发现中的方差

Frank Zhengqing Wu, Francesco Tonin, Volkan Cevher

发表机构 * Laboratory for Information and Inference Systems (LIONS), École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland(信息与推理系统实验室(LIONS),洛桑联邦理工学院(EPFL),瑞士洛桑)

AI总结 本文研究LLM电路发现中的重采样、重述和样本方差,提出CEAP方法减少重采样方差,并分析重述方差源于不同模板激活不同电路,样本方差主要由不忠定义导致。

详情
AI中文摘要

电路发现是机械可解释性中的关键技术,用于定位对执行给定任务至关重要的模型组件。尽管当前最先进的方法(EAP-IG)在(不)忠实性指标上表现良好,但它存在显著的变异性。这包括重采样方差(当我们用来自同一分布的新数据批次探测时电路发生变化)、重述方差(当提示被重新表述时发现的电路发生偏移)以及样本方差(具有低总体不忠实性的电路在单个样本上的不忠实性表现出大幅波动)。本文研究了这些方差的根源。我们证明了CEAP(我们新的电路发现方法,在理论上改进了EAP-IG)可以显著减轻重采样方差。我们进一步表明,重述方差是由于不同模板的提示倾向于激活模型中的不同电路。这使我们提出,可能很难找到一个全面的电路来解释和控制模型在任务上的行为,而该任务可以用无数模板表达,这表明LLM可能本质上难以操控。我们表明,稀疏性(据称能形成更紧凑和可解释的任务电路)无法解决这个问题。关于样本方差,我们认为它很大程度上是良性的:极差的不忠实性分数通常源于不忠实性的定义方式,而非测量电路的缺陷。我们表明,不忠实性的大小受选择性贡献缩放的影响,这是一种神经机制,解释了有时观察到的极差分数。

英文摘要

Circuit discovery is a key technique in mechanistic interpretability to pinpoint the model components that are crucial for performing a given task. Although the current state-of-the-art method (EAP-IG) performs well on the metric of (un)faithfulness, it suffers from substantial variability. This includes resampling variance, where the circuit changes when we probe with a new batch of data from the same distribution; rephrasing variance, where the discovered circuit shifts when the prompts are rephrased; and sample-wise variance, where a circuit with low population unfaithfulness exhibits large fluctuations in unfaithfulness across individual samples. This paper studies the roots of these variances. We demonstrate that CEAP, our new circuit discovery method that improves upon EAP-IG with a theoretical guarantee, can substantially lessen resampling variance. We further show that rephrasing variance arises because prompts with different templates tend to activate different circuits in the model. This leads us to argue that it may be challenging to find a comprehensive circuit that explains and controls the model's behavior on a task, which can be expressed in countless templates, suggesting that LLMs may be inherently hard to steer. We show that sparsity, which has been claimed to form more compact and interpretable task circuits, fails to solve this problem. Regarding sample-wise variance, we argue that it is largely benign: extremely poor unfaithfulness scores often stem from how unfaithfulness is defined, rather than from defects in the measured circuits. We show that the magnitude of unfaithfulness is affected by selective contribution scaling, a neural mechanism that accounts for the extremely poor scores sometimes observed.

2606.16914 2026-06-16 cs.AI 新提交

Greed Is Learned: Visible Incentives as Reward-Hacking Triggers

贪婪是习得的:可见激励作为奖励黑客触发器

Tong Che, Rui Wu

发表机构 * NVIDIA Research(英伟达研究院) Rutgers University(罗格斯大学)

AI总结 研究强化学习中的奖励通道成瘾现象,即智能体因可见的自我利益通道(如分数、KPI)而偏离真实任务,并发现该成瘾可翻转模型的安全对齐。

详情
AI中文摘要

部署的智能体越来越多地在其奖励代理可见的情况下行动,例如余额、分数或KPI仪表板。我们表明,强化学习可以使策略对这种可见的自我利益通道上瘾。它会在跨保留域中追逐显示的收益,牺牲真实任务来这样做,并跟随我们重写的任何通道,而从未见过该通道的策略保持诚实。我们称之为奖励通道成瘾,并在合成沙盒MoneyWorld中研究它。这种成瘾可以翻转模型的安全对齐:仅在无害的金钱任务上训练(无安全内容),每当仪表板为不安全行为付费时,模型会放弃它通常始终采取的安全行动,并在通道隐藏时恢复安全。这种习得的贿赂行为跨模型规模和系列复制。盲目优化超能力、下一代AI的KPI或损益可能对对齐构成危险。当遵循这样的通道有回报时,贪婪是习得的。

英文摘要

Deployed agents increasingly act with their reward proxy in view, such as a balance, score, or KPI dashboard. We show that reinforcement learning can make a policy \emph{addicted} to such a visible self-benefit channel. It chases the displayed payoff across held-out domains, sacrifices the true task to do so, and follows the channel wherever we rewrite it, while policies that never saw the channel stay honest. We call this \emph{reward-channel addiction} and study it in \emph{MoneyWorld}, a synthetic sandbox. The addiction can \emph{flip a model's safety alignment}: trained only on innocuous money tasks with no safety content, the model abandons the safe action it otherwise always takes whenever a dashboard pays for an unsafe one, and reverts to safe once the channel is hidden. This learned bribe replicates across model scales and families. Blindly optimizing super-capable, next-generation AI on KPIs or P\&L can be dangerous for alignment. \emph{Greed is learned} when following such a channel pays.

2606.16910 2026-06-16 cs.CL cs.AI 新提交

IMPACTeen: Intentions, Manipulation, Persuasion, Annotations, and Consequences in Teen Communication Dataset

IMPACTeen:青少年沟通数据集中的意图、操纵、说服、标注与后果

Aleksander Szczęsny, Wiktoria Mieleszczenko-Kowszewicz, Maciej Markiewicz, Beata Bajcar, Tomasz Adamczyk, Jolanta Babiak, Grzegorz Chodak, Przemysław Kazienko

发表机构 * Wrocław University of Science and Technology(弗罗茨瓦夫理工大学)

AI总结 构建IMPACTeen数据集,包含1021个青少年社交影响场景文本,从五个视角标注,支持社交影响检测、标注者分歧及跨语言建模研究。

详情
AI中文摘要

IMPACTeen是一个文本社交影响场景数据集,涵盖青少年语境下的人际、媒体和数字环境。它包含1,021个文本、5,100条独立标注记录以及社交影响技术的黄金标签,每个文本从五个不同视角(青少年、家长、心理学家、沟通专家和教师)进行标注。该资源通过受限的大语言模型生成构建,随后经过两步人工编辑和验证阶段,以确保青少年语境的真实性。多维标注涵盖了影响存在性、技术、意图、后果、抵抗、反应和标注置信度。该数据集支持社交影响检测、标注者分歧、跨语言建模以及语言模型的训练和评估。数据集以波兰语创建,并附有相应的英文版本。

英文摘要

IMPACTeen is a dataset of textual social influence scenarios spanning interpersonal, media-based, and digital settings in an adolescent context. It contains 1,021 texts, 5,100 individual annotation records, and gold labels for social influence techniques, with each text annotated from five distinct perspectives: teenagers, parents, psychologists, communication experts, and teachers. The resource was constructed through constrained LLM generation, followed by a two-step human editing and validation phase aimed at ensuring youth-context realism. A multi-dimensional annotation covered influence presence, techniques, intentions, consequences, resistance, reactions, and annotation confidence. The dataset supports research on social influence detection, annotator disagreement, cross-lingual modeling, and the training and evaluation of language models. The dataset was created in Polish and is accompanied by a corresponding English version.

2606.16908 2026-06-16 cs.CL 新提交

LESS Is More: Mutual-Stability Sampling for Diffusion Language Models

LESS Is More: 扩散语言模型的互稳定采样

Amr Mohamed, Guokan Shang, Michalis Vazirgiannis

发表机构 * MBZUAI(穆罕默德·本·扎耶德人工智能大学) Ecole Polytechnique(巴黎综合理工学院)

AI总结 针对扩散语言模型固定步数采样效率低的问题,提出无训练的自适应采样器LESS,通过互稳定规则动态决定掩码位置何时解码,在7个基准上平均准确率提升且步数减少72.1%。

详情
AI中文摘要

扩散大语言模型(dLLMs)通过迭代精炼掩码序列,支持并行令牌更新和双向条件化,为自回归解码提供了一种有前景的替代方案。然而,其实际效率受到采样过程的限制,该过程在解码前执行固定数量的反向去噪步骤,将计算花费在已经稳定的位置上,有时过早地提交不稳定的位置。我们提出\textsc{LESS},一种无需训练、模型无关的自适应采样器,将令牌提交视为在线停止问题。\textsc{LESS}通过联合稳定性规则实现互稳定采样:仅当其top-1预测具有高置信度、其top-1令牌在最近的反向步骤中持续出现、且其预测分布在top-$K$步间Jensen-Shannon散度下稳定时,掩码位置才符合解码条件。我们在Dream-7B、LLaDA-8B和LLaDA-1.5-8B上评估\textsc{LESS},涵盖全序列扩散和半自回归块采样模式,跨越七个涵盖通用知识、数学和代码的基准。\textsc{LESS}在强无训练自适应采样器上提高了平均准确率,同时比固定预算解码减少了$72.1\%$的反向步骤。由于每个反向步骤需要一次Transformer前向传播,这些步数减少转化为更少的前向评估、更低的实测墙钟延迟和更低的估计推理计算量。

英文摘要

Diffusion large language models (dLLMs) offer a promising alternative to autoregressive decoding by iteratively refining masked sequences, enabling parallel token updates and bidirectional conditioning. Their practical efficiency, however, is limited by sampling procedures that execute a fixed number of reverse denoising steps selected before decoding, spending computation on already-stable positions and sometimes committing unstable ones too early. We present \textsc{LESS}, a training-free, model-agnostic adaptive sampler that treats token commitment as an online stopping problem. \textsc{LESS} implements mutual-stability sampling through a joint stability rule that makes a masked position eligible for unmasking only when its top-1 prediction has high confidence, its top-1 token persists across recent reverse steps, and its predictive distribution is stable under top-$K$ inter-step Jensen--Shannon divergence. We evaluate \textsc{LESS} on Dream-7B, LLaDA-8B, and LLaDA-1.5-8B, covering full-sequence diffusion and semi-autoregressive blockwise sampling regimes, across seven benchmarks spanning general knowledge, math, and code. \textsc{LESS} improves average accuracy over strong training-free adaptive samplers while using $72.1\%$ fewer reverse steps than fixed-budget decoding. Since each reverse step requires a Transformer forward pass, these step-count reductions translate into fewer forward evaluations, lower measured wall-clock latency, and lower estimated inference compute.

2606.16905 2026-06-16 cs.CL 新提交

Speaking the Language of Science: Toward a General-Purpose Generative Foundation Model for the Natural Sciences

说科学的语言:面向自然科学的通用生成基础模型

Mingyang Li, Yurou Liu, Jieping Ye, Bing Su, Ji-Rong Wen, Zheng Wang

发表机构 * Alibaba Group(阿里巴巴集团) Gaoling School of Artificial Intelligence, Renmin University of China(中国人民大学高瓴人工智能学院)

AI总结 提出LOGOS模型,通过统一科学语法将异构任务转化为自回归框架中的下一个词预测,在多个科学任务上匹配或超越领域专用基线,验证了“一模型适用于所有”的可行性。

详情
AI中文摘要

在本报告中,我们提出了LOGOS(科学中的生成对象语言),一个科学生成语言模型,它基于共享的科学语法,在单一自回归框架内统一了自然科学中的异构任务。它将不同的科学对象及其空间交互编码为公共词汇表上的令牌序列。通过将空间接触和约束模式表示为离散令牌,该模型以纯序列方式捕获复杂的结构交互,而不依赖显式坐标或几何神经网络。这种统一表示使得广泛的下游任务能够一致地表述为同一语法空间中的下一个词预测,从而在持续的多领域预训练和下游目标之间建立强对齐。在多种任务中,LOGOS始终匹配或超越领域专用基线,为自然科学中“一模型适用于所有”的可行性提供了初步证据。我们训练了不同规模(1B、3B和8B参数)的LOGOS模型,并发现模型大小与性能之间存在一致的正相关关系。这表明,未来的人工智能科学(AI4S)可能不在于构建独立于大型语言模型(LLM)的技术栈,而在于通过共享架构、共享训练范式和共享推理基础设施,将科学基础模型与LLM深度对齐,从而使LLM真正成为AI4S的新入口。我们发布了模型权重和相关资源以促进进一步研究。

英文摘要

In this report, we present LOGOS (Language Of Generative Objects in Science), a scientific generative language model that unifies heterogeneous tasks across the natural sciences within a single autoregressive framework based on a shared scientific grammar. It encodes diverse scientific objects and their spatial interactions as token sequences over a common vocabulary. By representing spatial contact and constraint patterns as discrete tokens, the model captures complex structural interactions in a purely sequential manner, without relying on explicit coordinates or geometric neural networks. This unified representation enables a wide range of downstream tasks to be formulated consistently as next-token prediction in the same grammar space, creating strong alignment between continued multi-domain pre-training and downstream objectives. Across diverse tasks, LOGOS consistently matches or outperforms domain-specific baselines, providing preliminary evidence for the feasibility of "one model fits all" in the natural sciences. We train LOGOS models at different scales (1B, 3B, and 8B parameters) and find a consistent positive correlation between model size and performance. This suggests that the future of AI for Science (AI4S) may not lie in building an independent technical stack that is separated from large language models (LLMs). Instead, it may depend on deeply aligning scientific foundation models with LLMs through shared architectures, shared training paradigms, and shared inference infrastructure, so that LLMs can truly become a new entry point for AI4S. We release the model weights and associated resources to facilitate further research.

2606.16900 2026-06-16 cs.LG 新提交

Factorized Neural Operators Decompose Dynamic and Persistent Responses

因子化神经算子分解动态与持久响应

Hao Tang, Yuechen Duan, Jiongyu Zhu, Zimeng Feng, Hao Li, Chao Li

发表机构 * School of Medicine, University of Dundee(邓迪大学医学院) School of Data Science, Fudan University(复旦大学数据科学学院) School of Mathematical Sciences, Fudan University(复旦大学数学科学学院) Institute of Science and Technology for Brain-inspired Intelligence, Fudan University(复旦大学类脑智能科学与技术研究院) School of Science and Engineering, University of Dundee(邓迪大学科学与工程学院) Department of Applied Mathematics and Theoretical Physics, University of Cambridge(剑桥大学应用数学与理论物理系)

AI总结 提出因子化神经算子(FaNO),通过分解谱表示为等变动态响应和不变持久响应,提升多尺度物理系统的预测精度、参数效率和跨尺度泛化能力。

详情
AI中文摘要

物理系统通常表现出异质性机制,其中快速演变的动力学与持久结构共存。现有的神经算子通常依赖单一主导归纳偏置,因此将不同的物理响应耦合到共享表示中,难以捕捉这种多尺度物理行为。我们引入了跨域的统一格林函数框架,并提出了因子化神经算子(FaNO),它将谱表示分解为等变动态响应和不变持久响应,从而提高了可解释性和泛化能力。从机制上讲,我们展示了两个算子分支自发地特化为不同的物理角色,这些角色在尺度和域上保持一致:等变分支捕捉快速变化的瞬态动力学,而不变分支提取连贯的持久结构。FaNO的这种因子化机制提高了跨物理系统和域的预测精度、参数效率和跨尺度泛化能力。特别是,它在长时程自回归滚动、跨分辨率外推和物理状态转移下保持一致的预测。这些发现表明,可扩展的物理建模可能受益于从单一归纳偏置公式转向更好地反映物理系统异质性组织的因子化算子表示,从而加速机器学习在科学计算和发现中的可靠部署。

英文摘要

Physical systems often exhibit heterogeneous mechanisms, where rapidly evolving dynamics coexist with persistent structures. Capturing such multiscale physical behavior remains challenging for existing neural operators, which typically rely on single dominant inductive bias and therefore couple distinct physical responses into a shared representation. We introduce the Unified Green's Function Framework across domains and propose the Factorized Neural Operators (FaNO), which decompose spectral representations into equivariant dynamic responses and invariant persistent responses, leading to better interpretability and generalization. Mechanistically, we show that the two operator branches spontaneously specialize into distinct physical roles that remain consistent across scales and domains: the equivariant branch captures rapidly varying transient dynamics, whereas the invariant branch extracts coherent persistent structures. This factorized mechanism of FaNO improves prediction accuracy, parameter efficiency and cross-scale generalization across physical systems and domains. In particular, it maintains consistent predictions under long-horizon autoregressive rollout, cross-resolution extrapolation and physical-regime shifts. These findings suggest that scalable physical modeling may benefit from moving beyond single-inductive-bias formulations toward factorized operator representations that better reflect the heterogeneous organization of physical systems, accelerating the reliable deployment of machine learning for scientific computing and discovery.

2606.16899 2026-06-16 cs.LG 新提交

Fantastic Pretraining Optimizers and Where to Find Them II: Hyperball Optimization

奇妙预训练优化器及其发现之处 II:超球优化

Kaiyue Wen, Xingyu Dang, Kaifeng Lyu, Tengyu Ma, Percy Liang

发表机构 * Stanford University(斯坦福大学) Princeton University(普林斯顿大学) Tsinghua University(清华大学)

AI总结 针对Muon等优化器在大模型预训练中增益随规模增大而减弱的问题,提出Hyperball包装器,固定权重矩阵及其更新的Frobenius范数,在1.2B参数模型上实现20-30%的token等效加速,并改善学习率迁移。

Comments Corresponding blog post: https://psychedelic-sunstone-851.notion.site/Fantastic-Pretraining-Optimizers-and-Where-to-Find-Them-2-1-Hyperball-Optimization-2e924306e6f280e7a5ffee00eb40a0dd

详情
AI中文摘要

基于矩阵的优化器(如Muon)可以显著加速语言模型预训练,但观察到当使用标准常数解耦权重衰减时,随着模型大小和数据规模的增长,它们相对于AdamW的增益会缩小。我们提出Hyperball,一个简单的优化器包装器来解决这个问题。给定一个基础优化器(如Adam或Muon),Hyperball将权重矩阵的Frobenius范数及其对应的优化器更新设置为固定常数。在高达1.2B参数的Qwen3风格模型上,Muon Hyperball相对于权重衰减基线实现了20-30%的token等效加速。与解耦权重衰减相比,Hyperball还改善了跨宽度和深度的学习率迁移。该方法的动机来自先前的理论,该理论表明使用权重衰减训练会导致一个仅依赖于训练超参数的平衡权重范数。通过这种机制,权重衰减决定了角度学习率,即权重矩阵方向变化的速度。

英文摘要

Matrix based optimizers such as Muon can substantially speed up language model pretraining, but their gains over AdamW are observed to shrink as model size and data scale grow when using standard constant decoupled weight decay. We propose Hyperball, a simple optimizer wrapper that addresses this issue. Given a base optimizer such as Adam or Muon, Hyperball sets the Frobenius norms of weight matrices and their corresponding optimizer updates to fixed constants. On Qwen3 style models up to 1.2B parameters, Muon Hyperball achieves 20--30% token equivalent speedup over weight decay baselines. Hyperball also improves learning rate transfer across widths and depths compared to decoupled weight decay. This method is motivated by prior theory showing that training with weight decay leads to an equilibrium weight norm that only depends on the training hyperparameters. Through this mechanism, the weight decay then decides the angular learning rate, i.e. how fast the direction of the weight matrix changes.

2606.16898 2026-06-16 cs.CV cs.AI 新提交

Semantic Flip: Synthetic OOD Generation for Robust Refusal in Embodied Question Answering and Spatial Localization

Semantic Flip: 用于具身问答和空间定位中鲁棒拒绝的合成OOD生成

Dongbin Na, Chanwoo Kim, Giyun Choi, Dooyoung Hong

发表机构 * RGA Inc.(RGA公司)

AI总结 提出Semantic Flip框架,通过合成辅助OOD样本训练轻量拒绝模块,使冻结的视觉语言模型在无外部OOD标注下实现鲁棒拒绝,在具身问答和空间定位基准上优于强提示基线。

Comments 18 pages, 3 figures. Code and data: https://github.com/ndb796/SemanticFlip ; project page: https://ndb796.github.io/SemanticFlip

详情
AI中文摘要

检测不可回答的用户查询对于现实世界具身代理的可靠部署仍然至关重要。然而,现代视觉语言模型(VLM)即使当可用视觉记忆无法支持查询时,也常常生成过于自信的答案。这种过度自信会带来各种任务依赖的风险。代理可能在具身问答中向用户提供误导信息,并在空间推理导航中选择任意坐标并物理引导用户前往。尽管风险很高,但只有少数先前研究直接解决具身VLM何时以及如何回答“我不知道”的问题。本文提出Semantic Flip,一个简单而有效的框架,无需外部OOD标注即可合成辅助分布外(OOD)样本用于具身拒绝。关键思想是独立变换查询和视频记忆,以构建缺乏足够视觉基础的辅助OOD对。这些合成对使得能够在冻结的预训练VLM之上训练一个轻量级拒绝模块。该模块可附加到任何现有的基于VLM的流水线中,无需重新训练底层模型。在两个互补的基准测试中,Semantic Flip始终优于强提示基线。本文还引入了SpaceReject,一个新的用于空间定位的拒绝基准,包含故意不可回答的查询和长视频记忆,其中Semantic Flip达到了0.9559的$F_1$分数。源代码和数据集公开于https://github.com/ndb796/SemanticFlip。

英文摘要

Detecting unanswerable user queries remains essential for the reliable deployment of real-world embodied agents. However, modern vision-language models (VLMs) often generate overly confident answers even when the available visual memory cannot support the query. Such overconfidence poses various task-dependent risks. The agent may provide misleading information to the user in Embodied Question Answering and select an arbitrary coordinate and physically guide the user there in spatial reasoning for navigation. Despite these high stakes, only a few prior studies directly address when and how an embodied VLM should respond with "I do not know." This work proposes Semantic Flip, a simple yet effective framework that synthesizes auxiliary out-of-distribution (OOD) samples for embodied refusal without requiring external OOD annotations. The key idea is to independently transform the query and video memory to construct auxiliary OOD pairs that lack sufficient visual grounding. These synthesized pairs enable training a lightweight rejection module on top of a frozen pretrained VLM. The module attaches to any existing VLM-based pipeline without retraining the underlying model. Across two complementary benchmarks, Semantic Flip consistently outperforms strong prompting baselines. This work also introduces SpaceReject, a new refusal benchmark for spatial localization with deliberately unanswerable queries over long video memory, where Semantic Flip achieves an $F_1$ score of 0.9559. The source codes and datasets are publicly available at https://github.com/ndb796/SemanticFlip.

2606.16897 2026-06-16 cs.CL 新提交

Contrastive-Difference CKA Reveals Concept-Specific Structural Alignment Across Language Model Architectures

对比差异CKA揭示跨语言模型架构的概念特定结构对齐

Xueping Gao

发表机构 * Alibaba Cloud(阿里云)

AI总结 提出对比差异CKA(CKA_Delta)方法,发现不同LLM架构在概念表示上存在几何收敛与功能可迁移性分离的现象,能有效区分概念特定相似性与通用相似性。

详情
AI中文摘要

不同LLM架构是否以结构兼容的方式编码高层概念?我们系统性地刻画了几何-功能普遍性分离:在多个概念领域和架构家族中,适度的几何收敛与近乎完美的功能迁移共存。通过使用对比差异CKA(CKA_Delta),一种无需训练的诊断方法,在每样本对比差异上计算核对齐,我们从通用相似性中分离出概念特定的收敛——在标准CKA无法区分的场景下实现了显著区分。这种分离在我们测试的所有六个概念领域(五个领域几何区分p≤0.017,安全性作为收敛功能趋势p=0.08)中重复出现,包括两个非指令概念(代码vs自然语言、推理vs记忆),这些概念在没有系统提示的情况下得到验证;一个70B-70B对提供了观察性说明,即普遍性可能随规模增强,需要更多≥70B模型进行验证。我们将CKA_Delta定位为实用的领域分类器和架构异常检测器(Gemma:d=1.08,AUC=0.79),而非绝对的迁移准确性预测器,为跨架构概念监控提供了一种无需训练的诊断方法。

英文摘要

Do different LLM architectures encode high-level concepts in structurally compatible ways? We systematically characterize a geometric-functional universality dissociation: across multiple concept domains and architectural families, moderate geometric convergence coexists with near-perfect functional transfer. Using contrastive-difference CKA (CKA_Delta), a training-free diagnostic that computes kernel alignment on per-sample contrastive differences, we isolate concept-specific convergence from generic similarity -- achieving significant discrimination where standard CKA cannot. The dissociation replicates across all six concept domains we test (five with p <= 0.017 geometric discrimination and safety as a converging-functional trend, p = 0.08), including two non-instruction concepts (code-vs-NL, reasoning-vs-recall) validated without system prompts; a single 70B--70B pair provides an observational note that universality may strengthen with scale, requiring replication with additional >=70B models. We position CKA_Delta as a practical regime classifier and architectural outlier detector (Gemma: d = 1.08, AUC = 0.79) rather than an absolute transfer-accuracy predictor, providing a training-free diagnostic for cross-architecture concept monitoring.

2606.16893 2026-06-16 cs.AI cs.CL cs.LO 新提交

Symbolic Informalization: Fluent, Productive, Multilingual

符号非形式化:流畅、高效、多语言

Aarne Ranta

发表机构 * Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg(查尔姆斯理工大学与哥德堡大学计算机科学与工程系)

AI总结 提出符号非形式化方法,将形式数学可靠地转换为自然语言,基于Dedukti和Grammatical Framework的中间语言架构,实现多证明系统与多自然语言的流畅转换。

详情
AI中文摘要

符号非形式化能够将形式数学可靠地转换为自然语言。它有望使机器验证的内容在不损失精确性的情况下对人类可读。在传统证明系统使用中,符号非形式化将语法糖的有限机制推广为数学的普通语言。在由人工智能和自动形式化构建证明的场景中,符号非形式化可以解释具体构建了什么。本文概述了Informath项目,旨在展示符号非形式化如何以合理的开发工作量产生流畅的文本,并处理多种形式语言和自然语言。Informath基于中间语言架构,其中Dedukti作为不同证明系统(Agda、Lean、Rocq)之间的枢纽,而Grammatical Framework(GF)负责不同自然语言的语言正确性和变体。

英文摘要

Symbolic informalization enables a reliable conversion of formal mathematics to natural language. It has the potential to make machine-checked content human-readable without loss of precision. In a traditional proof system usage, symbolic informalization generalizes the limited mechanisms of syntactic sugar into the ordinary language of mathematics. In a setting where proofs are constructed by artificial intelligence and autoformalization, symbolic informalization can explain what precisely has been constructed. This paper outlines the project Informath, which aims to show how symbolic informalization can produce fluent text with a reasonable development effort and address multiple formal and natural languages. Informath is based on an interlingual architecture, where Dedukti works as a hub between different proof systems (Agda, Lean, Rocq) and Grammatical Framework (GF) takes care of linguistic correctness and variation in different natural languages.

2606.16891 2026-06-16 cs.LG cs.AI 新提交

Beyond Weights and Gradients: A Taxonomy of Federated Learning Messages

超越权重和梯度:联邦学习消息的分类学

Alvaro Javier Vargas Guerrero, Xinguang Wang, Quang Manh Doan, Guy Nagels

发表机构 * AIMS lab, Center for Neurosciences, UZ Brussel, Vrije Universiteit Brussel, Brussels, Belgium(AIMS实验室,神经科学中心,布鲁塞尔大学医院,布鲁塞尔自由大学,布鲁塞尔,比利时) Artificial Intelligence Lab, Vrije Universiteit Brussel, Brussels, Belgium(人工智能实验室,布鲁塞尔自由大学,布鲁塞尔,比利时)

AI总结 本文提出联邦消息的正式数学定义,建立包含模型结构、统计摘要和数据条件表示的三类分类法,分析计算、通信和隐私权衡,并综述202篇文献揭示2021年后消息范式多样化趋势。

Comments 4 figures, 9 pages, with 7 pages of content

详情
AI中文摘要

联邦学习正迅速发展,超越了传统模型权重和梯度的交换,但现有定义未能涵盖现代负载(如合成数据和联邦分析)的全部范围。本文通过提出一个联邦消息的正式数学定义来弥补这一空白,该定义同时考虑了效用和隐私。我们引入了一个分类法,将这些交换组织为三类:模型结构、统计摘要和数据条件表示。通过基于计算需求、通信成本和隐私风险评估这些组别,我们提供了对去中心化训练中涉及权衡的更清晰理解。我们对202篇近期出版物的回顾凸显了自2021年以来向多样化消息范式的显著转变,标志着从标准深度学习更新向更专业信息共享的转变。该框架为未来研究优化联邦系统以适应不同硬件和安全需求提供了结构化路径。

英文摘要

Federated Learning is rapidly evolving beyond the exchange of traditional model weights and gradients, yet existing definitions fail to capture the full scope of modern payloads like synthetic data and federated analytics. This paper addresses the gap by proposing a formal mathematical definition of a federated message that accounts for both utility and privacy. We introduce a taxonomy that organizes these exchanges into three categories: model structures, statistical summaries, and data-conditioned representations. By evaluating these groups based on computational demands, communication costs, and privacy risks, we provide a clearer understanding of the trade-offs involved in decentralized training. Our review of 202 recent publications highlights a significant shift since 2021 toward diverse messaging paradigms, signaling a move away from standard deep learning updates toward more specialized information sharing. This framework provides a structured path for future research to optimize federated systems for varying hardware and security requirements.

2606.16890 2026-06-16 cs.CL cs.AI 新提交

Compositional Reasoning Depth Predicts Clinical AI Failure: Empirical Evidence Consistent with Transformer Compositionality Limits in Electronic Health Record Question Answering

组合推理深度预测临床AI失败:与电子健康记录问答中Transformer组合性限制一致的实证证据

Sanjay Basu

发表机构 * University of California San Francisco(加州大学旧金山分校) Waymark

AI总结 本研究引入推理步数(hop count)作为预测大型语言模型在电子健康记录问答中失败的理论驱动指标,发现准确率随步数增加单调下降,且扩展思考未能显著改善,提示组合推理深度是跨架构的失败预测因子。

Comments 20 pages, 5 figures. Code: https://github.com/sanjaybasu/compositional-depth-clinical-ehr

详情
AI中文摘要

聚合准确率基准掩盖了大型语言模型在电子健康记录(EHR)问答中失败的系统性结构:需要更多推理步骤的问题会产生不成比例的更多错误。受Transformer组合性限制的理论结果启发,我们引入一个预先指定的跳数分类法——从EHR回答临床问题所需的不同推理步骤的数量——作为模型失败的原则性预测因子。我们标注了313个由临床医生生成的MedAlign EHR问答对,涵盖四个跳数级别,并在模型内消融(claude-sonnet-4-6,零样本 vs. 扩展思考)和跨架构复制(gpt-4o和gpt-5.4-2026-03-05,零样本)中评估了301个问题。所有三个模型,跨越两个提供商和两个OpenAI代(GPT-4和GPT-5),均显示准确率随跳数单调下降:Claude Sonnet零样本从30.6%(跳数=1)降至17.6%(跳数=4)(Cochran-Armitage z=-2.30,p=0.011;每跳OR 0.72,95% CI [0.56,0.92],p=0.008);GPT-4o复现了这一点(37.8%降至14.7%;OR 0.58 [0.45,0.75],p<0.001);gpt-5.4-2026-03-05证实了这一点(37.8%降至23.5%;OR 0.80 [0.66,0.98],p=0.027)。一项预先指定的上下文充分性审计显示,较高跳数的问题并未因EHR截断而受到不同不利影响(跳数2-4的可回答性为93-95%,而跳数1为79%),因此下降反映了组合推理难度。扩展思考在三个推理条件下并未显著平缓准确率-深度曲线,且思考令牌使用量与跳数呈正相关(r=0.31,p<0.0001),与预测的O(k)计算需求一致。因此,跳数是一个理论驱动、跨架构的大型语言模型在EHR问答中错误的预测因子,对临床AI的部署风险分层具有直接意义。

英文摘要

Aggregate accuracy benchmarks conceal a systematic structure in how large language models fail at electronic health record (EHR) question answering: questions requiring more inferential steps produce disproportionately more errors. Motivated by theoretical results on transformer compositionality limits, we introduce a pre-specified hop-count taxonomy -- the number of distinct reasoning steps required to answer a clinical question from an EHR -- as a principled predictor of model failure. We annotate 313 clinician-generated MedAlign EHR question-answer pairs across four hop levels and evaluate 301 questions in a within-model ablation (claude-sonnet-4-6, zero-shot vs. extended thinking) and cross-architecture replications (gpt-4o and gpt-5.4-2026-03-05, zero-shot). All three models, spanning two providers and two OpenAI generations (GPT-4 and GPT-5), show monotone accuracy decline with hop count: Claude Sonnet zero-shot falls from 30.6% (hop=1) to 17.6% (hop=4) (Cochran-Armitage z=-2.30, p=0.011; OR per hop 0.72, 95% CI [0.56,0.92], p=0.008); GPT-4o replicates this (37.8% to 14.7%; OR 0.58 [0.45,0.75], p<0.001); and gpt-5.4-2026-03-05 confirms it (37.8% to 23.5%; OR 0.80 [0.66,0.98], p=0.027). A pre-specified context-sufficiency audit shows higher-hop questions are not differentially disadvantaged by EHR truncation (answerability 93-95% at hops 2-4 vs. 79% at hop=1), so the decline reflects compositional reasoning difficulty. Extended thinking did not significantly flatten the accuracy-depth curve across three reasoning conditions, and thinking-token usage scaled with hop count (r=0.31, p<0.0001), consistent with the predicted O(k) computational requirement. Hop count is thus a theory-motivated, cross-architecture predictor of large-language-model error on EHR question answering, with direct implications for deployment risk stratification of clinical AI.

2606.16888 2026-06-16 cs.RO 新提交

LOPAL: Local Performance-Aware Active Learning from Imperfect Demonstrations

LOPAL:基于局部性能感知的不完美演示主动学习

Johannes Heidersberger, Shail Jadav, Dongheui Lee

发表机构 * Autonomous Systems Lab, Institute of Computer Technology, TU Wien(维也纳工业大学计算机技术研究所自主系统实验室) Institute of Robotics and Mechatronics, German Aerospace Center (DLR)(德国航空航天中心机器人与机电一体化研究所)

AI总结 提出LOPAL方法,利用局部演示质量信息,通过高斯混合模型编码轨迹与质量评估,结合共享自主权主动收集纠正数据,在不完美演示中提升任务性能。

Comments Accepted for publication in IEEE Robotics and Automation Letters (RAL), 2026

详情
AI中文摘要

从演示中学习(LfD)通过允许机器人直接从人类任务演示中学习,实现了直观的机器人技能获取。然而,当前方法通常未能解决由于次优和不一致的人类行为,演示质量在每个演示内部可能变化的问题。因此,我们引入了LOPAL(局部性能感知主动学习),一种利用这种局部演示质量信息的主动学习方法。我们的方法由两个协同组件组成。首先,一种局部性能驱动的LfD方法使用高斯混合模型(GMM)来编码演示轨迹及其相关的局部质量评估。这使得能够通过利用高性能的互补局部数据生成优于不完美演示的轨迹。其次,主动数据采集允许通过收集额外的信息样本来超越不完美演示。在缺乏良好数据的区域,通过共享自主权(SA)机制主动请求用户提供纠正,同时机器人自主执行学习的行为。LOPAL的有效性在仿真和真实世界实验中得到了验证。真实世界管道检查任务的结果表明,所提出的方法可以实现高达27.31%的任务性能提升,同时减少了收集演示所需的努力。

英文摘要

Learning from Demonstration (LfD) enables intuitive robot skill acquisition by allowing robots to learn directly from human task demonstrations. However, current methods often fail to address the fact that due to suboptimal and inconsistent human behavior, the quality of the demonstration can vary within each demonstration. Therefore, we introduce LOPAL (LOcal Performance-aware Active Learning), an active learning approach that leverages this local demonstration quality information. Our approach consists of two synergistic components. First, a local performance-driven LfD method uses a Gaussian Mixture Model (GMM) to encode both the demonstrated trajectories and their associated local quality assessments. This enables the generation of trajectories that outperform the imperfect demonstrations by utilizing complementary local data of high performance. Second, active data acquisition allows to improve beyond the imperfect demonstrations by collecting additional informative samples. In areas missing good data, the user is actively requested to provide corrections through a shared autonomy (SA) mechanism, while the robot autonomously executes the learned behavior. The efficacy of LOPAL was validated in both a simulation and a real-world experiment. The results from a real-world pipe inspection task showed that the proposed approach can achieve up to 27.31 % improvement in task performance while also reducing the effort required to collect the demonstrations.

2606.16883 2026-06-16 cs.LG cs.AI 新提交

Upper Bounds on the Generalization Error of Deep Learning Models via Local Robustness and Stability

深度学习模型泛化误差的上界:基于局部鲁棒性和稳定性

Abdul-Rauf Nuhu, Parham M. Kebria, Vahid Hemmati, Mahmoud N. Mahmoud, Edward Tunstel, Abdollah Homaifar

发表机构 * North Carolina Agricultural and Technical State University(北卡罗来纳农业技术州立大学) University of Alabama(阿拉巴马大学) Southwest Research Institute(西南研究院)

AI总结 提出一种通过局部区域稳定样本数缩放鲁棒性项的泛化上界,在ImageNet上实现非空洞且最紧的误差估计。

详情
AI中文摘要

泛化是数据驱动模型的关键属性,尤其是在安全关键应用中部署的深度学习模型。基于鲁棒性的泛化界作为一种将鲁棒性与泛化性能联系起来的原则性方法而受到关注,通常以数据依赖的方式。然而,大多数现有界在实际设置中存在空洞问题,产生远超过实际错误率的松散上界,限制了其在真实世界评估中的实用性。虽然这个问题通常归因于不确定性项,但问题的很大一部分源于鲁棒性项本身,特别是对于0-1损失。现有方法通常将鲁棒性项视为全局度量,忽略了其在输入空间不同子区域间的变化。在这项工作中,我们提出了一种泛化界,通过根据每个子区域内稳定和不稳定样本的数量来缩放鲁棒性项,从而解决了这一局限性。我们的界同时包含数据和模型依赖因素,同时保持实际相关性(产生更紧的真实误差上界)。在ImageNet数据集上训练的模型上的实验表明,我们的界始终非空洞,并在现有方法中实现了最紧的估计,与一系列鲁棒深度神经网络的实证性能紧密对齐。

英文摘要

Generalization is a critical property of data-driven models, particularly deep learning models deployed in safety-critical applications. Robustness-based generalization bounds have gained attention as a principled way to link robustness properties to generalization performance, often in a data-dependent manner. However, most existing bounds suffer from vacuousness in practical settings, yielding loose upper bounds that greatly exceed the actual error rates and limiting their usefulness for real-world evaluation. While this issue is often attributed to the uncertainty term, a substantial part of the problem originates from the robustness term itself, particularly for the 0-1 loss. Existing approaches typically treat the robustness term as a global measure, ignoring its variation across different sub-regions of the input space. In this work, we propose a generalization bound that addresses this limitation by scaling the robustness term according to the number of stable and unstable samples within each sub-region. Our bounds incorporate both data- and model-dependent factors while maintaining practical relevance (yielding tighter upper bounds on true error). Experiments on models trained on the ImageNet dataset show that our bounds remain consistently non-vacuous and achieve the tightest estimates among existing methods, closely aligning with empirical performance across a range of robust deep neural networks.

2606.16881 2026-06-16 cs.RO 新提交

SGM-SLAM: Scene Graph Matching for Data-Efficient Distributed SLAM

SGM-SLAM:面向数据高效的分布式SLAM的场景图匹配

Yewei Huang, Tixiao Shan, Abhinav Rajvanshi, Niluthpol Chowdhury Mithun, Yaxuan Li, Brendan Englot, Han-Pang Chiu

发表机构 * Dartmouth College(达特茅斯学院) SRI International(SRI国际) Stevens Institute of Technology(史蒂文斯理工学院)

AI总结 提出一种基于场景图匹配的分布式SLAM框架,仅使用对象标签和质心进行匹配,通过多步数据交换与优化实现高效通信,在室内外环境中验证了有效性。

详情
AI中文摘要

我们介绍了一种面向配备LiDAR、相机和惯性传感器的机器人团队的数据高效分布式同步定位与地图构建(SLAM)框架。该框架使用场景图匹配来识别机器人间的测量约束。与依赖于特征级匹配的先前方法不同,我们的框架是首个仅使用对象标签和质心进行场景图匹配的方法。我们的方法通过使用融合的RGB-LiDAR点云构建场景图,生成语义分割点云层和离散有界对象层,以伴随估计的机器人轨迹。场景图匹配通过交换和匹配相邻机器人的对象数据协作完成。为最大化通信效率,我们采用了多步数据交换与优化过程。我们通过仿真和由腿式机器人在室内外环境中收集的真实世界数据集展示了我们方法的有效性和效率。

英文摘要

We introduce a data-efficient distributed Simultaneous Localization and Mapping (SLAM) framework designed for a team of robots equipped with LiDAR, cameras, and inertial sensors. Our framework uses scene graph matching to identify inter-robot measurement constraints. Unlike prior approaches that rely on feature-level matching, our framework is the first to perform scene graph matching using only object labels and centroids. Our approach constructs a scene graph by using fused RGB-LiDAR point clouds to generate both a semantically segmented point cloud layer, and a layer of discrete bounded objects, to accompany estimated robot trajectories. Scene graph matching is performed collaboratively through exchanging and matching object data with neighboring robots. To maximize communication efficiency, we utilize a multi-step data exchange and optimization process. We demonstrate the effectiveness and efficiency of our approach using both simulation and real-world datasets collected by legged robots in indoor and outdoor environments.

2606.16876 2026-06-16 cs.RO 新提交

ExoTraj: A General Lower-limb Exoskeleton Assistance Policy for Complex Environments

ExoTraj:面向复杂环境的通用下肢外骨骼辅助策略

Xiao-Yin Liu, Guotao Li, Long Sun, Xu Liang, Zeng-Guang Hou

发表机构 * The State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences(中国科学院自动化研究所多模态人工智能系统国家重点实验室) The School of Artificial Intelligence, University of Chinese Academy of Sciences(中国科学院大学人工智能学院) CASIA-MUST Joint Laboratory of Intelligence Science and Technology, Institute of Systems Engineering, Macau University of Science and Technology(澳门科技大学系统工程研究所中科院自动化所-澳门科技大学智能科学与技术联合实验室) The School of Automation and Intelligence, Beijing Jiaotong University(北京交通大学自动化与智能学院)

AI总结 提出ExoTraj统一策略,通过快速流匹配实现多模态特征到轨迹的精确预测,并利用模型预测控制优化力矩,在复杂户外场景中实现自适应辅助,无需昂贵运动捕捉系统。

Comments 28 pages, 19 figures, project page: https://xiaoyinliu0714.github.io/Home_ExoTraj/

详情
AI中文摘要

在动态外骨骼场景中,自适应力矩预测需要昂贵的运动捕捉系统,这在复杂户外环境中不可行。轨迹预测已成为解决该问题的有效方法之一。然而,外骨骼轨迹预测的核心挑战有两个:建立从多模态特征到轨迹信息的映射;构建从轨迹到力矩的映射。对于前者,现有方法大多仅执行单步预测并忽略受试者间轨迹变异性,从而限制了轨迹优化空间和预测泛化能力。为此,本文提出一种快速流匹配方法,能够实现精确的轨迹预测和更好的实时性能泛化,其中轨迹生成误差和编码观测用于指导训练方向。对于第二个挑战,由于人-机器人系统的高动态性以及感知与控制之间的强耦合,简单的控制方法难以基于预测轨迹实现高效辅助。本文利用模型预测控制并设计了一种新颖的优化目标来优化力矩,确保外骨骼实现舒适且鲁棒的辅助。通过整合上述两个组件,开发了统一策略ExoTraj,使其能够在复杂户外场景中实现自适应辅助,而无需高昂的数据采集成本。实验结果表明,与传统方法相比,ExoTraj在在线阶段将跨受试者预测误差降低了14.0%,并保持对外部噪声的鲁棒性。相对于零力矩条件,ExoTraj分别将代谢率降低11.5-24.4%,心率降低1.7-19.5%,峰值肌肉激活水平降低10.9-41.3%。

英文摘要

Adaptive torque prediction in dynamic exoskeleton scenarios requires expensive motion capture systems, which are infeasible in complex outdoor environments. Trajectory prediction has emerged as one of the effective approaches to address such an issue. However, the core challenges of exoskeleton trajectory prediction are twofold: establishing the mapping from multi-modal features to trajectory information; constructing the mapping from trajectory to torque. For the former, most existing methods perform only single-step prediction and neglect inter-subject trajectory variability, thereby limiting the trajectory optimization space and prediction generalization. To address this, this paper proposes a fast flow matching method that enables accurate trajectory prediction and better generalization for real-time performance, where trajectory generation errors and encoded observations are used to guide the training direction. For the second challenge, due to the high dynamics of the human-robot system and the strong coupling between perception and control, simple control methods struggle to achieve efficient assistance based on the predicted trajectory. This paper utilizes model predictive control and designs a novel optimization objective to optimize torque, ensuring the exoskeleton achieves comfortable and robust assistance. By integrating the above two components, the unified policy, denoted as ExoTraj, is developed to enable adaptive assistance in complex outdoor scenarios without high data acquisition cost. Experimental results show that compared to traditional methods, ExoTraj reduces cross-subject prediction error by 14.0% during the online phase and maintains robustness against external noise. Relative to the zero torque condition, ExoTraj decreases metabolic rate by 11.5-24.4%, heart rate by 1.7-19.5%, and peak muscle activation levels by 10.9-41.3%, respectively.

2606.16874 2026-06-16 cs.CL cs.CE cs.CY 新提交

Understanding Scam Trends and Rail Paths from Reddit Self-Disclosure Narratives

理解 Reddit 自我披露叙事中的诈骗趋势和路径

Yangjun Zhang, Mirko Bottarelli, Mark Hooper, Carsten Maple

发表机构 * The Alan Turing Institute, London, UK(艾伦·图灵研究所,伦敦,英国)

AI总结 通过构建2023-2025年Reddit自我披露数据集,采用启发式标注和LLM辅助方法分析诈骗类型趋势、多阶段路径及社区支持行为,发现诈骗过程以多路径为主且随时间变化。

Comments 6 pages, International Conference on AI and the Digital Economy (CADE) 2026

详情
AI中文摘要

在线诈骗行为本质上是多阶段的,其生命周期包括时间顺序的路径和事件,而非孤立的信号。现有工作分析了诈骗类型和路径的特征,但未追踪跨年份的诈骗趋势。此外,由于缺乏带有标注和覆盖不同诈骗类型的开源数据集,路径间关系的研究受到阻碍。为解决这些问题,我们构建了一个数据集,利用2023年至2025年Reddit自我披露叙述分析诈骗特征的年度趋势和路径。我们收集了21,304篇来自诈骗相关子版块的帖子,这些帖子至少包含身份、通信、平台和支付中的一个路径,通过启发式标注进行趋势分析。然后,我们通过LLM辅助方法标注了1,800篇包含显式或可恢复诈骗链的帖子,用于诈骗路径分析,该方法通过人工标注进行评估。最后,我们对帖子的评论运行主题模型,分析社区支持行为。结果表明,诈骗过程主要是多路径的。不同年份中,不同的诈骗类型和路径组件占主导地位。不同诈骗类型在路径复杂性上存在系统性差异。Reddit的支持行为随时间变得更加详细。这项工作支持合成诈骗链数据模拟和与AI相关的诈骗风险评估,但结果可能不适用于其他平台。

英文摘要

Online scam behavior is inherently multi-stage, and the lifecycle includes temporally ordered rails and events rather than isolated signals. Existing works analyze characteristics of scam types and rails, but they do not track scam trends across years. Moreover, the work on the relations between rails is hampered due to the lack of open-source datasets with annotations and coverage of different scam types. To address these gaps, we build a dataset to analyze the yearly trend of scam characteristics and rail paths using Reddit self-disclosure narratives from 2023 to 2025. We collect 21,304 posts from scam-related subreddits with at least one rail among identity, communication, platform, and payment for trend analysis by heuristic annotation. Then, we label 1,800 posts containing explicit or recoverable scam chains by an LLM-assisted method for scam path analysis. The method is evaluated with human annotation. Lastly, we run a topic model on the comments of the posts to analyze the community support behavior. The results reveal that scam processes are predominantly multi-rail. Across years, different scam types and rail components dominate. Different scam types vary systematically in path complexity. Reddit support behaviors have become more detailed over time. This work supports synthetic scam chain data simulation and AI-related scam risk assessment, though findings may not generalise to other platforms.

2606.16870 2026-06-16 cs.CV cs.GR 新提交

Latent Space Reinforcement Learning for Inverse Material Estimation in Food Fracture Simulation

潜空间强化学习用于食品断裂模拟中的逆材料估计

Adrian Ramlal, Yuhao Chen, John S. Zelek

发表机构 * University of Waterloo(滑铁卢大学)

AI总结 针对食品断裂模拟中材料参数难以直接测量的问题,提出基于潜空间强化学习的目标条件策略,实现从断裂行为描述到材料参数的单次前向估计,精度提升23%。

Comments Accepted in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026 MetaFood Workshop

详情
Journal ref
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2026, pp. 9573-9581
AI中文摘要

食品操作的真实视觉模拟需要精确的材料参数,但这些参数难以直接测量,且在单个食品的异质区域间变化。我们解决了从非连续损伤力学模拟器中断裂行为的目标描述中估计材料参数的逆问题。以剥橙子为测试案例,我们在2000次正向模拟上训练神经代理,并比较协方差矩阵自适应进化策略(CMA-ES,一种无梯度进化优化器)与近端策略优化(PPO,一种强化学习算法)在原始9维参数空间和两个学习的4维潜表示上的表现。由于不同橙子具有不同的材料属性,实用的逆系统必须能够处理任意目标而无需重新训练。我们训练了一个目标条件PPO策略,该策略学习通用的逆映射:给定任意剥皮行为的目标描述,该策略在单次前向传递(8次代理评估,约10毫秒)中产生材料参数估计。在归一化流潜空间中使用共享代理评估器,目标条件策略通过模拟器验证时实现了0.642的实际恢复率,比原始参数空间高出23%。从策略输出初始化CMA-ES细化的热启动扩展进一步将恢复率提升至0.828,使用540次评估。这些发现为食品逆物理提供了实用框架,并为从食品操作的视频观测中通过视觉驱动识别材料奠定了基础。

英文摘要

Realistic visual simulation of food manipulation requires accurate material parameters, yet these are difficult to measure directly and vary across the heterogeneous regions of a single food item. We address the inverse problem of estimating material parameters from a target description of fracture behavior in a non-differentiable continuum damage mechanics simulator. Using orange peeling as a test case, we train a neural surrogate on 2,000 forward simulations and compare Covariance Matrix Adaptation Evolution Strategy (CMA-ES, a gradient-free evolutionary optimizer) with Proximal Policy Optimization (PPO, a reinforcement learning algorithm) across the original 9-dimensional parameter space and two learned 4-dimensional latent representations. Since different oranges have different material properties, a practical inverse system must handle arbitrary targets without retraining. We train a goal-conditioned PPO policy that learns a general inverse mapping: given any target description of peeling behavior, the policy produces a material parameter estimate in a single forward pass (8 surrogate evaluations, approximately 10ms). Operating in a normalizing flow latent space with a shared surrogate evaluator, the goal-conditioned policy achieves 0.642 actual recovery when validated through the simulator, outperforming the original parameter space by 23%. A warm-start extension that initializes CMA-ES refinement from the policy's output further improves recovery to 0.828 with 540 evaluations. These findings provide a practical framework for inverse food physics and lay groundwork for vision-driven material identification from video observations of food manipulation.

2606.16868 2026-06-16 cs.CV cs.AI cs.DC 新提交

Federated Medical Image Segmentation under Real-World Label Noise: A Benchmark Suite for Noisy Label Learning Method Selection

真实世界标签噪声下的联邦医学图像分割:面向噪声标签学习方法选择的基准套件

Markus Bujotzek, Dimitrios Bounias, Stefan Denner, Ralf Floca, Maximilian Fischer, Peter Neher, Klaus Maier-Hein

发表机构 * Division of Medical Image Computing, Germany Cancer Research Center(德国癌症研究中心医学图像计算部) Medical Faculty, University of Heidelberg(海德堡大学医学院) Heidelberg Institute of Radiation Oncology (HIRO), National Center for Radiation Research in Oncology (NCRO)(海德堡放射肿瘤学研究所(HIRO),国家放射肿瘤学研究中心(NCRO)) Pattern Analysis and Learning Group, Department of Radiation Oncology, Heidelberg University Hospital(海德堡大学医院放射肿瘤科模式分析与学习组) Faculty of Mathematics and Computer Science, University of Heidelberg(海德堡大学数学与计算机科学学院) National Center for Tumor Diseases (NCT), NCT Heidelberg, a partnership between DKFZ and the university medical center Heidelberg(国家肿瘤疾病中心(NCT),NCT海德堡,DKFZ与海德堡大学医学中心的合作机构)

AI总结 针对联邦学习中真实世界标签噪声(如轮廓不一致、结构缺失或混淆)问题,提出一个包含多样化真实噪声数据集、客户端噪声场景和针对性评估的基准套件,支持系统评估和噪声标签学习方法选择。

详情
AI中文摘要

虽然联邦学习(FL)能够在不集中敏感数据的情况下实现协作式医学图像分割,但实际部署常因跨站点的标签缺陷(如轮廓不一致、结构缺失或多余、标签混淆)而复杂化。联邦噪声标签学习(FNLL)旨在减轻这些影响,但在实践中仍未被充分利用,因为现有证据主要基于合成噪声、简化设置和有限的实际噪声评估。我们通过引入一个基准套件来弥补这一差距,该套件结合了多样化的真实世界噪声数据集、与部署相关的客户端噪声场景以及针对标签噪声的评估,以支持系统的FNLL评估和知情的方法选择。该套件将来自不同来源的精心策划的真实世界噪声医学图像分割数据集与一个全面的联邦分割框架相结合,包括各种客户端噪声场景和针对噪声的评估。所提出的套件为医学图像分割中的FNLL评估提供了现实且具有区分性的基础,并为公平基准测试、数据集特定的标签噪声表征以及未来在现实联邦设置下的方法开发建立了可重复使用的基础。代码可在 https://github.com/MIC-DKFZ/FedSegNoiseBench 获取。

英文摘要

While federated learning (FL) enables collaborative medical image segmentation without centralizing sensitive data, real-world deployment is frequently complicated by cross-site label imperfections such as contour disagreement, missing or additional structures, and confused labels. Federated noisy label learning (FNLL) aims to mitigate these effects, yet remains underused in practice as existing evidence is largely based on synthetic noise, simplified settings, and limited real-world noisy evaluation. We address this gap by introducing a benchmark suite that combines diverse real-world noisy datasets, deployment-relevant client-noise scenarios, and label-noise-targeted evaluation to support systematic FNLL assessment and informed method selection. The suite combines curated real-world noisy medical image segmentation datasets from diverse sources with a comprehensive federated segmentation framework including various client-noise scenarios and noise-targeted evaluation. The presented suite provides a realistic and discriminative basis for FNLL evaluation in medical image segmentation and establishes a reusable foundation for fair benchmarking, dataset-specific label-noise characterization, and future method development under realistic federated settings. Code is available at https://github.com/MIC-DKFZ/FedSegNoiseBench.

2606.16867 2026-06-16 cs.CL 新提交

Revisiting the Systematicity in Negation in the Era of In-Context Learning

重新审视上下文学习时代否定中的系统性

Hitomi Yanaka, Taisei Yamamoto

发表机构 * The University of Tokyo(东京大学) Riken(理化学研究所) Tohoku University(东北大学)

AI总结 通过行为与表征系统性分析,发现大型语言模型在上下文学习中能部分识别否定表达和范围,但无法完美执行,且功能向量在否定线索提取任务中可组合,但范围识别更具挑战。

Comments Accepted to the 6th Workshop Natural Language Meets Logic and Machine Learning (NALOMA2026) at ESSLLI2026

详情
AI中文摘要

理解否定句的含义仍然是语言模型面临的挑战之一,即使在大语言模型(LLMs)时代也是如此。我们从两个角度分析LLM对否定理解的系统性:行为系统性和表征系统性。对于行为系统性,我们确认通过示例和上下文学习,LLMs可以在一定程度上识别句子中的否定表达和范围,但无法达到完美性能。特别是,模型识别否定范围的难度因输出格式而异。对于表征系统性,我们分析对于理解否定至关重要的任务,功能向量可以从上下文示例中稳健构建的程度。实验表明,虽然功能向量可以针对否定线索提取任务进行组合,但提取用于识别范围的功能向量更具挑战性。

英文摘要

Understanding the meaning of negated sentences remains one of the challenges for language models, even in the era of large language models (LLMs). We analyze systematicity regarding LLM understanding of negation from two perspectives: behavioral systematicity and representational systematicity. For behavioral systematicity, we confirm that through demonstrations and in-context learning, LLMs can recognize negation expressions and scope within sentences to some extent, but they fail to achieve perfect performance. In particular, the difficulty of the negation scope recognition for models varies depending on the output format. For representational systematicity, we analyze the extent to which function vectors can be robustly constructed from in-context examples for tasks that are essential to understanding negation. The experiments suggest that while function vectors can be composed for negation cue extraction tasks, extracting function vectors for recognizing scope is more challenging.

2606.16866 2026-06-16 cs.CV 新提交

Redirecting the Flow: Image Customization through Attention Distribution Shift

重定向流:通过注意力分布偏移实现图像定制

Jie Li, Suorong Yang, Jian Zhao, Furao Shen

发表机构 * State Key Laboratory for Novel Software Technology, Nanjing University(南京大学计算机软件新技术国家重点实验室) School of Artificial Intelligence, Nanjing University(南京大学人工智能学院) School of Computer Science, Nanjing University(南京大学计算机科学与技术学院) School of Electronic Science and Engineering, Nanjing University(南京大学电子科学与工程学院)

AI总结 提出基于最大熵理论的Conditional Attention Distribution Shift方法,通过双分支架构CustomShift实现高效主题驱动图像生成,在DreamBooth和Custom101基准上优于现有方法。

详情
AI中文摘要

主题驱动的图像定制旨在生成不仅遵循文本指令而且保留给定参考主题身份的图像。现有方法,包括测试时微调、基于编码器的方法以及共享注意力空间中的令牌竞争,存在效率有限、提取的参考特征与生成过程不对齐以及无关信息干扰等问题。为了解决这些限制,我们将定制任务表述为通过将参考图像融入文本到图像生成所引发的分布偏移,并基于最大熵理论推导出条件注意力分布偏移公式。基于这一公式,我们提出了CustomShift,一种基于Stable Diffusion 3的双分支架构。参考对齐分支利用参考图像和主题名称之间的自注意力实现与潜在表示的逐层对齐,而交叉引导分支整合文本和参考线索以指导生成。在DreamBooth和Custom101基准上的实验表明,我们的方法始终优于最先进的方法,在语义保真度和主题一致性之间取得了更好的平衡。

英文摘要

Subject-driven image customization aims to generate images that not only follow textual instructions but also preserve the identity of a given reference subject. Existing approaches, including test-time fine-tuning, encoder-based methods, and token competition in shared attention spaces, suffer from limited efficiency, misalignment between extracted reference features and the generative process, and interference from irrelevant information. To address these limitations, we formulate the customization task as a distribution shift induced by incorporating reference images into text-to-image generation, and derive a Conditional Attention Distribution Shift formulation grounded in maximum entropy theory. Building on this formulation, we propose CustomShift, a dual-branch architecture based on Stable Diffusion 3. The Reference-Alignment Branch leverages self-attention between reference images and subject names to achieve layer-wise alignment with latent representations, while the Cross-Guidance Branch integrates textual and reference cues to guide generation. Experiments on the DreamBooth and Custom101 benchmarks demonstrate that our method consistently outperforms state-of-the-art approaches, achieving a better balance between semantic fidelity and subject consistency.

2606.16863 2026-06-16 cs.LG 新提交

HawkesNest: A Multi-Axis Synthetic Benchmark for Spatiotemporal Pattern Complexity

HawkesNest:时空模式复杂度的多轴合成基准

Yahya Aalaila, Sumantrak Mukherjee, Gerrit Großmann, Sebastian Vollmer

发表机构 * German Research Center for Artificial Intelligence (DFKI), Data Science and its Applications Research Group, Kaiserslautern, Germany(德国人工智能研究中心(DFKI),数据科学及其应用研究组,凯撒斯劳滕) Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau (RPTU), Kaiserslautern, Germany(莱茵兰-普法尔茨凯撒斯劳滕-兰道工业大学(RPTU)计算机科学系,凯撒斯劳滕)

AI总结 提出HawkesNest基准,基于多元Hawkes过程定义四个复杂度轴,用于可控测试时空点过程模型在已知结构难度下的性能。

详情
AI中文摘要

时空点过程(STPP)模型的评估严重依赖于不透明的真实世界数据集,其中潜在生成结构未知且模型失败难以归因。我们引入HawkesNest,一个基于多元Hawkes骨干的生成器对齐基准,用于可控的时空模式复杂度。HawkesNest定义了四个复杂度轴:时空纠缠、背景异质性、跨类型交互和域拓扑。每个轴与从潜在数据生成机制计算出的确定性指标相关联。通过在保持全局速率、稳定性和模拟预算固定的同时改变这些轴,HawkesNest能够在已知结构难度下对STPP模型进行诊断性压力测试。我们验证了在受控扫描下这些指标是单调且几乎正交的。我们通过展示Hawkes系列基线在联合异质性-纠缠复杂度下性能下降来说明其用途,尽管它们在结构上与Hawkes数据生成骨干对齐。我们进一步表明HawkesNest暴露了神经模型的敏感性:AutoSTPP在时空纠缠单独增加时仍然脆弱。代码可在https://github.com/YahyaAalaila/HawkesNest获取。

英文摘要

Evaluation of spatiotemporal point process (STPP) models relies heavily on opaque real-world datasets, where latent generative structure is unknown and model failures are difficult to attribute. We introduce HawkesNest, a generator-aligned benchmark for controlled spatiotemporal pattern complexity built on a multivariate Hawkes backbone. HawkesNest defines four complexity axes: space--time entanglement, background heterogeneity, cross-type interaction, and domain topology. Each axis is associated with a deterministic index computed from the latent data-generating mechanism. By varying these axes while holding global rate, stability, and simulation budget fixed, HawkesNest enables diagnostic stress tests of STPP models under known structural difficulty. We verify that the indices are monotone and nearly orthogonal under controlled sweeps. We illustrate its use by showing that Hawkes-family baselines degrade under joint heterogeneity--entanglement complexity, even though they are structurally aligned with the Hawkes data-generating backbone. We further show that HawkesNest exposes neural-model sensitivity: AutoSTPP remains vulnerable under isolated increases in space--time entanglement. Code. Available at https://github.com/YahyaAalaila/HawkesNest

2606.16861 2026-06-16 cs.CV 新提交

An Open-Source Monitoring Framework for Data Exploration and Progress Tracking in Multi-Center Radiology Studies

一个用于多中心放射学研究中数据探索与进度跟踪的开源监控框架

Markus Bujotzek, Jonas Scherer, Stefan Denner, Peter Neher, Benjamin Hamm, Lorenz Feineis, Uenal Akuenal, Andreas Bucher, Tobias Penzkofer, Klaus Maier-Hein

发表机构 * Germany Cancer Research Center(德国癌症研究中心) University of Heidelberg(海德堡大学) University Hospital Frankfurt(法兰克福大学医院) Charite Universitätsmedizin Berlin(柏林夏里特医学院) Berlin Institute of Health(柏林健康研究所)

AI总结 提出基于Grafana-Prometheus的轻量级开源监控架构,通过聚合分布式站点指标并可视化,实现隐私保护的数据探索和进度监控,已在德国RACOON联盟38家大学医院部署验证。

详情
AI中文摘要

多中心研究对于推进医学和放射学研究至关重要。数据探索、协作发现和研究进度监控对于最大化其潜力至关重要。然而,在实践中,这些过程通常依赖于手动通信和共享表格,这些表格很快就会过时,并阻碍大型分布式研究中的高效协调。这凸显了对专用监控解决方案的需求,以提供对研究进度的透明和最新洞察。我们提出了一种轻量级、开源的多中心研究监控架构,基于广泛使用的Grafana-Prometheus栈。该框架从分布式研究站点收集聚合的监控指标,并通过可配置的仪表板进行可视化。作为一个真实世界的部署示例,该框架被集成到医学影像平台Kaapana中,并在一个大型多中心研究网络中进行评估。通过在德国范围内的RACOON联盟中部署我们的解决方案,我们展示了其在所有38家德国大学医院中实现隐私保护的数据探索和研究进度监控的能力。该监控框架支持分布式研究活动的透明协调,并可促进大规模多中心研究的更高效管理。源代码和Kaapana集成可在https://github.com/MIC-DKFZ/study-monitoring-kaapana公开获取。

英文摘要

Multi-center studies are crucial for advancing medical and radiological research. Data exploration, collaboration discovery, and study progress monitoring are essential for maximizing their potential. However, in practice these processes often rely on manual communication and shared tables, which quickly become outdated and hinder efficient coordination in large distributed studies. This highlights the need for dedicated monitoring solutions that provide transparent and up-to-date insights into study progress. We propose a lightweight, open-source monitoring architecture for multi-center studies based on the widely used Grafana-Prometheus stack. The framework collects aggregated monitoring metrics from distributed study sites and visualizes them through configurable dashboards. As a real-world deployment example, the framework is integrated into the medical imaging platform Kaapana and evaluated within a large multi-center research network. By deploying our solution within the Germany-wide RACOON consortium, we demonstrate its ability to enable privacy-preserving data exploration and study progress monitoring across all 38 German university clinics. The monitoring framework supports transparent coordination of distributed research activities and can facilitate more efficient management of large-scale multi-center studies. The source code and Kaapana integration are publicly available at https://github.com/MIC-DKFZ/study-monitoring-kaapana.