arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.30662 2026-06-01 cs.LG q-bio.PE

Spatio-temporal stochastic graph-based learning for infectious disease forecasting

基于时空随机图的传染病预测学习

Luz Stefani Sotomayor Valenzuela, Susanna Cramb, Darren Wraith

发表机构 * School of Public Health and Social Work, Queensland University of Technology（昆士兰理工大学公共卫生与社会科学学院）； QUT Centre for Data Science, Queensland University of Technology（昆士兰理工大学数据科学中心）

AI总结提出一种集成随机公式和不确定性近似过程的时空图架构，用于预测新发传染病病例，在COVID-19和水痘数据集上表现出竞争性性能。

Comments Preprint under review

详情

AI中文摘要

时空图模型通常用于预测COVID-19和水痘爆发等传染病的新病例。然而，在其学习过程中使用随机建模的研究却出人意料地不足，并且很少考虑大国家的完整数据集。因此，尚不清楚这些模型是否能在真实疾病传播场景中提供准确的预测。在这项工作中，我们提出了一种时空随机图架构，该架构集成了随机公式和不确定性近似过程，以预测新的传染病病例。我们发现，我们的方法能够适应在单一模型架构中编码大小人口地理网络。使用两个真实世界数据集——美国COVID-19和匈牙利水痘，我们报告了所提出的架构在预测美国2022年第一波COVID-19和匈牙利2012-2014年水痘波次中的增强效果。通过与四种时空图模型进行基准测试，定量结果显示，所提出的方法在预测美国所有3218个县和匈牙利所有20个县的新病例方面，具有竞争性的整体周度性能。所提出的方法能够表示相对于基线的整体流行病进展，尽管存在一步延迟；同时表现出对高频低幅变异的低敏感性。

英文摘要

Spatio-temporal graph-based models have typically been used to forecast new cases of infectious diseases such as COVID-19 and chickenpox outbreaks. However, the use of stochastic modelling into their learning process has been surprisingly under-investigated and rarely considered entire data sets of large countries. As a result, it is unknown whether these models would provide accurate forecasts in real-world disease spread scenarios. In this work, we propose a spatio-temporal stochastic graph-based architecture that integrates a stochastic formulation and uncertainty approximation process to forecast new infectious disease cases. We find that our approach can adapt to encode large and small population geographical networks within a single model architecture. Using two real-world data sets, COVID-19 in the US and chickenpox in Hungary, we report an enhanced effect of the proposed architecture across predictions of the 2022 first wave for COVID-19 in the US and comparative results of chickenpox waves during 2012-2014 in Hungary. By benchmarking with four spatio-temporal graph-based models, quantitative results show competitive overall weekly performance of the proposed approach on forecasting new cases for all 3,218 US counties and all 20 Hungary counties. The proposed approach can represent overall epidemic progression relative to baselines, though with a one-step delay; while exhibiting a reduced sensitivity to high-frequency and low-amplitude variability.

URL PDF HTML ☆

赞 0 踩 0

2605.30656 2026-06-01 cs.LG

Learning to Perceive the World Through Control: Empowerment-Based Representation Learning

通过控制学习感知世界：基于赋能的表示学习

Mahsa Bastankhah, Sophie Broderick, Benjamin Eysenbach

发表机构 * Princeton University, USA（普林斯顿大学，美国）

AI总结本文通过最大化赋能目标，研究如何学习仅捕捉环境控制相关特征的表示，并证明赋能代理诱导的前向和后向表示对控制无关特征具有不变性。

详情

AI中文摘要

在许多实际强化学习环境中，观测的维度远高于对控制重要的变量。在这项工作中，我们提出一个问题：我们能否学习仅捕捉环境中控制相关特征的表示？我们通过赋能目标研究这个问题，该目标最大化代理对环境的影响，并广泛用于无监督技能学习。我们表明，赋能代理诱导两种不同的表示——前向和后向——它们捕捉状态的互补方面，并且两者都对控制无关特征具有不变性。因此，赋能最大化导致代理学习一个隐式的、以控制为中心的世界模型。我们的分析强调了通过交互而非被动数据集学习表示的重要性：旨在最大化控制的交互对于学习有用的不变性属性至关重要，这一观点与因果学习文献紧密一致。

英文摘要

In many practical reinforcement learning environments, observations are far higher-dimensional than the variables that matter for control. In this work, we ask: can we learn representations that capture only control-relevant features of the environment? We study this question through the empowerment objective, which maximizes an agent's influence over the environment and is widely used for unsupervised skill learning. We show that empowerment agents induce two distinct representations -- forward and backward -- that capture complementary aspects of the state, and both of which are invariant to control-irrelevant features. Thus, empowerment maximization leads agents to learn an implicit, control-centric model of the world. Our analysis highlights the importance of learning representations through interaction rather than from passive datasets: interaction aimed at maximizing control is essential for learning useful invariance properties, a perspective that aligns closely with the causal learning literature.

URL PDF HTML ☆

赞 0 踩 0

2605.30654 2026-06-01 cs.CL cs.AI cs.HC

EUDAIMONIA: Evaluating Undesirable Dynamics in AI

EUDAIMONIA: 评估AI中的不良动态

Jun Rui Huang, Wang Bill Zhu, Ziyi Liu, Nathanael Fast, Ravi Iyer, Robin Jia

发表机构 * University of Southern California（南加州大学）

AI总结提出Social AI Design Code框架，并通过EUDAIMONIA基准测试评估22个最新LLM在社交互动中对用户福祉的符合程度，发现即使最强模型也违反约30%的设计要求。

详情

AI中文摘要

大型语言模型（LLM）越来越多地被用作陪伴、情感披露和人际建议的对话伙伴，但这些互动的社会动态可能造成能力导向或传统安全评估无法捕捉的伤害。我们引入了Social AI Design Code，这是一个评估LLM在社交互动中是否符合用户福祉的框架，包括它们是否鼓励有害的亲密关系、依赖或长时间参与。为了在自然且多样化的用户-LLM互动中评估这些风险，我们通过弱到强过滤、多模型重新标记和受控重写，从WildChat构建了包含969个用户输入和3,147个设计要求违规检查的基准测试EUDAIMONIA，将代码操作化。评估22个最近的LLM，我们发现即使最强的模型Claude-Opus-4.7和GPT-5.5也分别违反了30.7%和27.2%的检查。扩展思考并未降低违规率，表明这些失败是持久的社会对齐问题，而非仅通过测试时推理就能解决的缺陷。

英文摘要

Large language models (LLMs) are increasingly used as conversational partners for companionship, emotional disclosure, and interpersonal advice, but the social dynamics of these interactions can create harms that are not captured by capability-oriented or traditional safety evaluations. We introduce the Social AI Design Code, a framework for evaluating whether LLMs align with user welfare in social interactions, including whether they encourage harmful intimacy, dependence, or prolonged engagement. To evaluate these risks in natural and diverse user-LLM interactions, we operationalize the code with EUDAIMONIA, a benchmark of 969 user inputs and 3,147 design-requirement violation checks built from WildChat through weak-to-strong filtration, multi-model relabeling, and controlled rewriting. Evaluating 22 recent LLMs, we find that even the strongest models, Claude-Opus-4.7 and GPT-5.5, violate 30.7% and 27.2% of checks, respectively. Extended thinking does not reduce violation rates, suggesting that these failures are persistent social-alignment problems rather than deficits solvable through test-time reasoning alone.

URL PDF HTML ☆

赞 0 踩 0

2605.30653 2026-06-01 cs.CL

扩散模型优先记忆原型样本，或：为什么我的扩散模型喜欢“潦草”？

Marta Aparicio Rodriguez, Anastasia Borovykh, Grigorios A. Pavliotis, Daniel J. Korchinski

发表机构 * Department of Mathematics, Imperial College London, UK ； ML Lab, Capital Fund Management, France ； Department of Physics, \'Ecole Polytechnique F\'ed\'erale de Lausanne (EPFL), Switzerland

AI总结本文通过随机层次模型生成的字符串训练扩散模型，发现模型优先记忆常见子串组成的样本，即使数据完全去重，表明点级去重无法保证隐私，而数据集多样性（尤其是高层抽象）能延缓记忆，并识别出部分记忆的中间状态导致生成均值回归的“潦草”现象。

详情

AI中文摘要

生成模型存在一个持久限制：它们记忆训练数据的倾向可能产生法律责任并削弱创意多样性。因此，理解哪些样本被全部或部分记忆，以及在什么条件下被记忆，仍然是一个重要的开放问题。本文对“非典型或稀有样本是否首先被记忆？”这一问题给出了否定答案。我们根据随机层次模型（RHM）的产生规则生成的字符串训练扩散模型，发现由常见子串组成的样本被优先记忆。即使训练数据由完全独特的样本组成，这一结论仍然成立，表明在数据点级别进行去重并不能提供有意义的隐私保证。相应地，我们预测并随后观察到，对于重尾数据集（即包含更多非典型样本的数据集），记忆会延迟。当重尾特性引入高层产生规则时，这种效应会放大。这些结果共同表明，数据集多样性，尤其是在更高抽象层次上，在延缓记忆方面起着重要作用。最后，我们识别出一个部分记忆的中间状态，其中常见子串首先被学习，随后在生成过程中过度产生。如果在此状态停止训练，模型将表现出均值回归的平淡性，常被讥讽为“潦草”。

英文摘要

Generative models have a persistent limitation: their tendency to memorize training data can create legal liabilities and erode creative diversity. Understanding which samples are memorized in whole or in part, and under what conditions, therefore remains an important open problem. Here we answer the question "Are atypical or rare samples memorized first?" in the negative. We train diffusion models on strings generated according to the production rules of the Random Hierarchy Model (RHM), and find that samples composed of common substrings are preferentially memorized. This holds true even if the training data consists of entirely unique samples, indicating that deduplication at the data point level does not provide a meaningful privacy guarantee. Correspondingly we predict, then observe, delayed memorization for fat-tailed datasets (i.e., those with more atypical samples). This effect is amplified when fat-tails are introduced into high-level production rules. These together suggest that dataset diversity, particularly at higher levels of abstraction, plays an important role in staving off memorization. Finally, we identify an intermediate regime of partial memorization in which common substrings are learned first and subsequently overproduced during generation. If training is stopped in this regime, models will exhibit the reversion-to-the-mean blandness often derided as "slop".

URL PDF HTML ☆

赞 0 踩 0

2605.30641 2026-06-01 cs.CL cs.AI

COFT: Counterfactual-Conformal Decoding for Fair Chain-of-Thought Reasoning in Large Language Models

COFT: 用于大语言模型中公平思维链推理的反事实-保形解码

Arya Fayyazi, Mehdi Kamal, Massoud Pedram

发表机构 * Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California, USA（电气与计算机工程系，南加州大学，洛杉矶，加利福尼亚州，美国）

AI总结提出COFT，一种无需训练的解码方法，通过反事实提示和保形校准在解码时实现token级公平性控制，显著减少思维链生成中的社会偏见，同时保持任务效用和语言质量。

Comments Proceeding of ICML 2026

详情

AI中文摘要

大型语言模型（LLMs）在思维链（CoT）生成过程中可能揭示并放大社会偏见。我们提出COFT（Chain of Fair Thought），一种无需训练的解码方法，在解码时应用token级公平性控制，并对任何冻结的因果语言模型提供无分布边际有效性保证（在可交换性下）。COFT分三个阶段运行。首先，通过将敏感跨度替换为中性token来创建掩码反事实提示。其次，通过轻量级logit融合比较事实和掩码logit分布，以减弱属性驱动的偏见。第三，使用双分支分裂保形校准，在用户选择的风险水平下认证每步候选token集。我们在六个模型和多个偏见基准上评估COFT。我们的方法将标准偏见指标降低30-55%（中位数38%），同时保持任务效用和语言质量。推理准确率在运行间噪声范围内保持不变。计算开销适中，相当于一次额外的缓存前向传递（<=11%）。COFT提供了一条清晰、可审计的路径，实现更安全的CoT生成，显著减少偏见，效用损失可忽略，且无需重新训练、辅助分类器或权重访问。

英文摘要

Large language models (LLMs) can reveal and amplify societal biases during chain-of-thought (CoT) generation. We present COFT (Chain of Fair Thought), a training-free decoding method that applies token-level fairness control at decode time, with distribution-free marginal validity guarantees (under exchangeability) for any frozen causal language model. COFT operates in three stages. First, it creates a masked counterfactual prompt by replacing sensitive spans with neutral tokens. Second, it compares the factual and masked logit distributions through lightweight logit fusion to attenuate attribute-driven biases. Third, it uses dual-branch split-conformal calibration to certify per-step candidate token sets at a user-chosen risk level. We evaluate COFT across six models and multiple bias benchmarks. Our method reduces standard bias metrics by 30-55% (median 38%) while preserving task utility and language quality. Reasoning accuracies remain unchanged within run-to-run noise margins. The computational overhead is modest, equivalent to one additional cached forward pass (<=11%). COFT offers a clear, auditable path to safer CoT generation with significant bias reduction, negligible utility loss, and no requirement for retraining, auxiliary classifiers, or weight access.

URL PDF HTML ☆

赞 0 踩 0

2605.30640 2026-06-01 cs.LG cs.CL

CSULoRA: Closest Safe Update Low-Rank Adaptation

CSULoRA：最近安全更新低秩适应

Oleksandr Marchenko Breneur, Adelaide Danilov, Aria Nourbakhsh, Salima Lamsiyah

发表机构 * Department of Computer Science, University of Luxembourg（卢森堡大学计算机科学系）

AI总结提出CSULoRA方法，通过后处理校正LoRA适配器，在保留任务相关性的同时抑制不安全更新方向，降低攻击成功率。

Comments 10 pages, 3 figure

详情

AI中文摘要

低秩适应已成为大型语言模型参数高效微调的标准方法，但即使少量不安全或对抗性微调数据也会显著削弱对齐模型的安全行为。现有的安全保持LoRA方法通常依赖硬干预，如投影、剪枝、阈值化或额外训练目标。虽然这些方法可以抑制不安全更新方向，但它们也可能移除任务相关信息或需要额外调优。我们提出CSULoRA，一种通过最近安全更新估计来校正训练后LoRA适配器的后处理方法。CSULoRA从安全对齐模型与其对应基础检查点之间的权重位移中估计安全对齐子空间。然后，它将每个LoRA更新分解为完全对齐、部分对齐和子空间外分量。CSULoRA不丢弃估计安全子空间外的分量，而是求解一个闭式惩罚最小变化问题，该问题保留完全对齐分量，同时根据相对能量平滑衰减潜在不安全方向。在对抗性微调实验中，CSULoRA显著降低了攻击成功率，同时保留了标准LoRA微调获得的大部分效用增益。

英文摘要

Low-rank adaptation has become a standard method for parameter-efficient fine-tuning of large language models, but even small amounts of unsafe or adversarial fine-tuning data can substantially weaken the safety behavior of aligned models. Existing safety-preserving LoRA methods often rely on hard interventions such as projection, pruning, thresholding, or additional training objectives. While these methods can suppress unsafe update directions, they may also remove task-relevant information or require extra tuning. We introduce CSULoRA, a post-hoc method for correcting trained LoRA adapters through closest safe update estimation. CSULoRA estimates a safety-aligned subspace from the weight displacement between a safety-aligned model and its corresponding base checkpoint. It then decomposes each LoRA update into fully aligned, partially aligned, and off-subspace components. Instead of discarding components outside the estimated safety subspace, CSULoRA solves a closed-form penalized minimum-change problem that preserves the fully aligned component while smoothly attenuating potentially unsafe directions according to their relative energy. In adversarial fine-tuning experiments, CSULoRA substantially reduces attack success rate while preserving most of the utility gains obtained from standard LoRA fine-tuning.

URL PDF HTML ☆

赞 0 踩 0

2605.30639 2026-06-01 cs.CV cs.AI cs.RO

PInVerify: An Offline Embodied Benchmark for Active Instance Verification

PInVerify：面向主动实例验证的离线具身基准

Yuhang Jiang

发表机构 * University of Trento（特伦托大学）

AI总结提出主动实例验证任务，构建离线具身基准PInVerify，通过多视角导航和细粒度属性匹配评估具身智能体，并基于多模态大语言模型建立基线。

Comments Accepted as a poster at the Foundation Models Meet Embodied Agents (FMEA) Workshop, CVPR 2026. 44 pages including appendix. Code: https://github.com/Avalon-S/PInVerify

详情

AI中文摘要

具身智能体在导航到目标物体方面取得了显著进展，但到达目标附近并不能保证智能体找到了正确的实例：微妙的属性差异（例如“白色花卉”与“白色条纹”）通常需要近距离、多视角检查。我们通过主动实例验证（AIV）来解决这一差距，该任务要求智能体主动围绕候选对象选择视角，以判断其是否匹配细粒度的自然语言描述。我们将AIV形式化为一个有限视野决策过程，并引入PInVerify，一个用于AIV的离线具身基准：包含18个物体类别的3000个评估场景，以多视角捕获形式提供，并采用6扇区导航拓扑，暴露陷阱视角（可导航但无信息）和不可达扇区。作为参考基线，我们构建了一个无需训练的流水线和一个基于开源多模态大语言模型（MLLMs）的LoRA微调端到端智能体（参数规模≤8B），包括属性分解、可见性加权多视角跟踪器和三种次优视角选择（NBV）策略。在Qwen3-VL（4B/8B）、SenseNova-SI-1.2-InternVL3-8B、CLIP和SigLIP2上的评估中，最佳MLLM基线超过最佳嵌入基线4.9个百分点；GT框消融实验显示检测差距为+3.1个百分点；在测试的NBV策略中，我们未观察到主动视角选择带来的可靠增益。LoRA微调智能体（SFT+GSPO）达到85.6%。PInVerify旨在支持具身AI中主动、细粒度语义验证的进一步研究。代码：https://github.com/Avalon-S/PInVerify。

英文摘要

Embodied agents have made strong progress in navigating to target objects, but reaching the goal vicinity does not guarantee that the agent has found the correct instance: subtle attribute differences (e.g., "white floral" vs. "white striped") often require close-range, multi-view inspection. We address this gap with Active Instance Verification (AIV), a task in which an agent actively selects viewpoints around a candidate object to decide whether it matches a fine-grained natural-language description. We formalize AIV as a finite-horizon decision process and introduce PInVerify, an offline embodied benchmark for AIV: 3,000 evaluation episodes across 18 object categories, delivered as multi-view captures with a 6-sector navigation topology that exposes trap views (navigable but uninformative) and unreachable sectors. As reference baselines we build a training-free pipeline and a LoRA-fine-tuned end-to-end agent around open-source multimodal large language models (MLLMs) at on-device scale ($\leq$8B parameters), with attribute decomposition, a visibility-weighted multi-view tracker, and three next-best-view (NBV) strategies. In our evaluation across Qwen3-VL (4B/8B), SenseNova-SI-1.2-InternVL3-8B, CLIP, and SigLIP2, the best MLLM-based baseline exceeds the best embedding baseline by 4.9 pp; GT-box ablations show a +3.1 pp detection gap; and we do not observe reliable gains from active viewpoint selection within the tested NBV strategies. A LoRA-fine-tuned agent (SFT+GSPO) reaches 85.6%. PInVerify aims to support further work on active, fine-grained semantic verification in embodied AI. Code: https://github.com/Avalon-S/PInVerify.

URL PDF HTML ☆

赞 0 踩 0

2605.30638 2026-06-01 cs.LG cs.AI

通过成对查询改进二分类的选择性分类

Harsh Vardhan, Sunav Choudhary, Natwar Modani, Arya Mazumdar

发表机构 * Adobe Research（Adobe研究院）

AI总结针对选择性分类中模型置信度与预测不一致导致高错误率的问题，提出使用成对查询检测高错误样本，以降低非拒绝样本的错误率，并通过理论和实验验证了其有效性。

详情

DOI: 10.1007/s10994-026-07078-y

AI中文摘要

在选择性分类中，模型预测其确信的数据样本的标签，并避免预测不确信样本的标签。被拒绝的样本通常由专家标注，这成本高昂。当模型在非拒绝样本上错误率低时，专家的预算得到最佳利用。然而，模型置信度的估计可能与模型的预测不一致，这可能导致非拒绝样本上的高错误率。这种情况在LLM的上下文二分类中容易发生。为了解决这个问题，我们提出向同一模型进行额外的成对查询。这些成对查询可以检测高错误样本，并整合到选择性分类技术中，以降低非拒绝样本上的错误率。理论上，我们建立了使用成对查询的简单算法优于不一致置信度估计的条件。我们通过大量实验支持这一见解，包括1个合成数据集和4个基于上下文学习的真实二分类数据集。在所有情况下，我们展示了使用成对查询的算法比仅使用原始置信度估计（例如LLM的下一个token对数概率）获得了更好的准确率-成本权衡。

英文摘要

In selective classification, a model predicts the labels of data samples where it is confident, and abstains from predicting labels for samples on which it is not confident. The rejected samples are often labeled by an expert, which is expensive. The budget for the expert is best utilized when the model has low error on non-rejected samples. However, the estimate of a model's confidence might be inconsistent with the model's predictions, which can lead to high error on non-rejected points. Such situations can readily occur in in-context binary classification by LLMs. To remedy this, we propose making additional pairwise queries to the same model. These pairwise queries can detect high-error samples and be incorporated into selective classification techniques to reduce the error on non-rejected samples. Theoretically, we establish the conditions under which a simple algorithm using pairwise queries outperforms an inconsistent confidence estimate. We support this insight through extensive experiments for $1$ synthetic and $4$ in-context learning-based real binary classification datasets. In all these cases, we show that our algorithms, using pairwise queries, obtain a better accuracy-cost tradeoff than using only the raw confidence estimates, for instance, the LLM's next-token logits.

URL PDF HTML ☆

赞 0 踩 0

2605.30612 2026-06-01 cs.RO cs.LG cs.SY eess.SY

ZAPS-DA: Zero-Phase Action Policy Smoothing with Decoupled Actor for Continuous Control in Reinforcement Learning

ZAPS-DA：基于解耦演员的零相位动作策略平滑用于强化学习中的连续控制

Faiq Shamass

发表机构 * Independent Researcher（独立研究者）

AI总结提出ZAPS-DA框架，通过解耦演员网络模仿零相位滤波目标，在不引入相位延迟和后处理的情况下减少连续控制策略的动作抖动，并在驾驶仿真中验证了其有效性。

Comments 7 pages, 5 figures, 5 tables. Submitted to IEEE RA-L

详情

AI中文摘要

基于离策略强化学习训练的连续控制策略经常表现出高频动作抖动，使得直接部署在物理执行器上不可行。事后滤波可以减弱抖动但引入相位延迟；在演员损失中嵌入平滑惩罚会将其与RL梯度耦合，并将奖励回归与过度激进的平滑混为一谈。我们提出ZAPS-DA，一个在部署时减少动作抖动且具有可忽略相位延迟和无后处理的框架。ZAPS-DA将一个未修改的主演员（由基础RL损失训练）与一个单独的解耦演员配对，该解耦演员通过监督学习模仿存储在回放缓冲区中的零相位滤波目标。部署的策略是解耦演员：一个从当前观测到平滑动作的前馈映射，没有推理时滤波和动作历史输入——我们称之为非因果滤波器的因果蒸馏机制。幅度匹配的MSE损失提供了跨优化器类别的零超参数可移植性。使用Soft Actor-Critic和Savitzky-Golay滤波器在两个驾驶模拟器中通过配对n=150评估协议进行验证：在MetaDrive上，ZAPS-DA将转向抖动减少14-21倍，油门抖动减少3-5倍（所有p < 10^{-4}，Bonferroni校正），同时以6.3%的奖励成本匹配任务完成率（成功率p=0.31，碰撞率p=0.31）；在自定义Webots自适应巡航控制环境中，相同的SG配置产生了帕累托改进——奖励持平（p=0.121），转向抖动减少8-45倍，总任务失败率从2.0%降至0.7%。

英文摘要

Continuous control policies trained with off-policy reinforcement learning frequently exhibit high-frequency action jitter, rendering direct deployment on physical actuators impractical. Post-hoc filtering attenuates jitter but introduces phase lag; embedding smoothness penalties in the actor's loss couples them with the RL gradient and conflates reward regression with over-aggressive smoothing. We present ZAPS-DA, a framework that reduces action jitter at deployment with negligible phase lag and no post-processing. ZAPS-DA pairs an unmodified main actor (trained by the base RL loss) with a separate decoupled actor trained via supervised imitation of zero-phase filtered targets stored in the replay buffer. The deployed policy is the decoupled actor: a feed-forward map from the current observation to a smooth action, with no inference-time filter and no action-history input -- a mechanism we term causal distillation of a non-causal filter. A magnitude-matched MSE loss provides zero-hyperparameter portability across optimizer classes. Validated with Soft Actor-Critic and a Savitzky--Golay filter in two driving simulators using paired n=150 evaluation protocols: on MetaDrive, ZAPS-DA reduces steering jitter by 14--21x and throttle jitter by 3--5x (all $p < 10^{-4}$, Bonferroni-corrected) while matching task-completion (p=0.28 success, p=0.31 crash) at a 6.3% reward cost; on a custom Webots adaptive cruise control environment, the same SG configuration produces a Pareto improvement -- reward parity (p=0.121), 8--45x steering jitter reduction, and total task-failure rate reduced from 2.0% to 0.7%.

URL PDF HTML ☆

赞 0 踩 0

2605.30611 2026-06-01 cs.CV cs.AI cs.CL

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Crafter: 面向多样化输入的可编辑科学图表生成的多智能体框架

Haozhe Zhao, Shuzheng Si, Zhenhailong Wang, Zheng Wang, Liang Chen, Xiaotong Li, Zhixiang Liang, Maosong Sun, Minjia Zhang

发表机构 * University of Illinois at Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）； Tsinghua University（清华大学）； Peking University（北京大学）

AI总结提出Crafter多智能体框架，通过结构化组合离散语义组件，实现跨图表类型和输入条件的可编辑科学图表生成，并引入CraftEditor将栅格输出转换为可编辑SVG，在CraftBench基准上显著优于现有方法。

Comments 24 pages, 11 figures

详情

AI中文摘要

科学图表是传达复杂研究思想最有效的手段之一，但生成出版质量的插图仍然是论文准备中最劳动密集的部分。现有的自动化系统各自针对单一图表类型，且仅接受文本输入，未能解决研究人员实际使用的多样类型和条件；此外，它们的栅格输出无法进行局部修改。由于科学图表是离散语义组件的结构化组合，生成器在这些布局上产生的局部错误需要的不是更强的骨干网络，而是一个框架。我们将这个框架实例化为两个互补系统：Crafter，一个用于图表生成的多智能体框架，无需架构更改即可泛化到多种图表类型和输入条件；以及CraftEditor，它应用相同的模式将栅格输出转换为可编辑的SVG。此外，我们引入了CraftBench，一个涵盖三种图表类型和四种输入条件的基准，并带有手工质量标注。实验表明，Crafter在PaperBanana-Bench和CraftBench上显著优于独立的生成器和智能体基线，消融实验确认了每个组件的独立贡献；CraftEditor忠实地将输出转换为可编辑的SVG，超越了所有基线。我们的代码和基准可在https://github.com/HaozheZhao/Crafter获取。

英文摘要

Scientific figures are among the most effective means of communicating complex research ideas, yet producing publication-quality illustrations remains one of the most labor-intensive parts of paper preparation. Existing automated systems each target a single figure type under text-only input, leaving the diversity of types and conditions researchers actually use unaddressed; their raster outputs further cannot be locally revised. Because scientific figures are structured compositions of discrete semantic components, the localized errors generators produce on such layouts demand not a stronger backbone but a harness. We instantiate this harness in two complementary systems: Crafter, a multi-agent harness for figure generation that generalizes across figure types and input conditions without architectural changes, and CraftEditor, which applies the same pattern to convert raster outputs into editable SVGs. Moreover, we introduce CraftBench, a benchmark spanning three figure types and four input conditions with human quality annotation. Experiments show that Crafter substantially outperforms both standalone generators and the agentic baseline on PaperBanana-Bench and CraftBench, with ablations confirming each component's independent contribution; CraftEditor faithfully converts outputs into editable SVGs that surpass all baselines. Our code and benchmark are available at https://github.com/HaozheZhao/Crafter.

URL PDF HTML ☆

赞 0 踩 0

2605.30610 2026-06-01 cs.LG

Constrained Flow Optimization via Sequential Fine Tuning for Molecular Design

通过序列微调进行约束流优化以用于分子设计

Sven Gutjahr, Riccardo De Santi, Luca Schaufelberger, Kjell Jorner, Andreas Krause

发表机构 * Department of Computer Science, ETH Zurich（苏黎世联邦理工学院计算机科学系）； Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich（苏黎世联邦理工学院化学与生物工程研究所）； ETH AI Center（苏黎世联邦理工学院人工智能中心）

AI总结提出约束流优化（CFO）算法，通过将约束生成优化问题转化为序列微调，在分子设计中平衡奖励最大化与约束满足。

Comments ICML 2026

详情

AI中文摘要

适应生成基础模型，特别是扩散和流模型，以优化给定奖励函数（例如结合亲和力）同时满足约束（例如分子可合成性），对于其在分子设计或蛋白质工程等现实世界科学发现应用中的采用至关重要。虽然最近的工作通过强化学习和控制方案引入了可扩展的奖励引导微调方法，但如何以可靠和可预测的方式在算法上权衡奖励最大化和约束满足仍然是一个开放问题。受此挑战的启发，我们首先提出了约束生成优化的严格框架，该框架将优化视角引入所提出的适应问题，并将约束生成的相关任务作为子案例。然后，我们引入了约束流优化（CFO），这是一种通过将原始问题简化为通过已建立的可扩展方法进行序列微调来自动且可证明地平衡奖励最大化和约束满足的算法。我们为约束生成优化和通过CFO进行约束生成提供了收敛保证。最后，我们在合成（但具有说明性）设置和分子设计任务上对CFO进行了实验评估。在这些评估中，CFO在确保高约束满足的同时实现了奖励的持续增长，展示了其在约束生成优化中的实用性。

英文摘要

Adapting generative foundation models, in particular diffusion and flow models, to optimize given reward functions (e.g., binding affinity) while satisfying constraints (e.g., molecular synthesizability) is fundamental for their adoption in real-world scientific discovery applications such as molecular design or protein engineering. While recent works have introduced scalable methods for reward-guided fine-tuning of such models via reinforcement learning and control schemes, it remains an open problem how to algorithmically trade-off reward maximization and constraint satisfaction in a reliable and predictable manner. Motivated by this challenge, we first present a rigorous framework for Constrained Generative Optimization, which brings an optimization viewpoint to the introduced adaptation problem and retrieves the relevant task of constrained generation as a sub-case. Then, we introduce Constrained Flow Optimization (CFO), an algorithm that automatically and provably balances reward maximization and constraint satisfaction by reducing the original problem to sequential fine-tuning via established, scalable methods. We provide convergence guarantees for constrained generative optimization and constrained generation via CFO. Ultimately, we present an experimental evaluation of CFO on both synthetic, yet illustrative, settings, and a molecular design task. Across these evaluations, CFO achieves consistent increases in reward while ensuring high constraint satisfaction, showcasing its practical utility for constrained generative optimization.

URL PDF HTML ☆

赞 0 踩 0

2605.30601 2026-06-01 cs.LG

TASER: Task-Aware Stein Regularisation for Geometry-Driven Robustness

TASER: 面向几何驱动鲁棒性的任务感知斯坦正则化

Michał Kozyra, Gesine Reinert

发表机构 * Department of Statistics, University of Oxford, United Kingdom（英国牛津大学统计系）

AI总结提出TASER（任务感知斯坦正则化），一种基于Langevin斯坦算子的训练时正则化框架，通过惩罚训练分布下的逐点斯坦残差，诱导各向异性数据感知平滑性，从而提升模型在分布偏移和对抗扰动下的鲁棒性。

2605.30599 2026-06-01 cs.LG cs.CL

面向发动机健康管理与剩余寿命预测的科学机器学习

Jostein Barry-Straume, Changmin Son, Adrian Sandu, Gavan Burke, Rekha Sundararajan, Andrew Rimell, James G. Steinrock

发表机构 * Computational Science Laboratory（计算科学实验室）； Department of Computer Science（计算机科学系）； Virginia Tech（弗吉尼亚理工学院）

AI总结提出一个多任务科学机器学习框架，通过联合预测涡轮气体温度、温差和剩余寿命并提供量化不确定性区间，以支持基于风险的维护决策。

详情

AI中文摘要

发动机健康管理依赖于对剩余寿命的可靠预测以及对涡轮气体温度等热指标的跟踪。在实际应用中，真实机队数据具有异质性和非平稳性，仅靠点预测不足以支持风险感知的维护决策。本文提出了一种用于涡轮机预测的多任务科学机器学习框架，该框架联合预测未修剪涡轮气体温度、涡轮气体温差和剩余寿命，并以预测区间的形式提供量化不确定性，并评估其经验覆盖率。共享序列编码器（带有残差双向LSTM层和注意力池化的卷积前端）为任务特定头部提供输入，包括用于概率回归的均值-方差估计，以及可选的用于基于阈值事件建模的生存头部。该框架设计为可通过少量面向实践者的参数（例如，温差阈值规则和剩余寿命目标构建）进行调整，以便部署能够与内部策略和专有标准保持一致。使用点指标和区间指标评估所提出框架的预测性能，包括平均绝对误差、预测区间覆盖概率、平均预测区间宽度以及覆盖-宽度准则。结果按总体和按飞行阶段与维护段分层报告，以突出运营环境的影响并支持不确定性感知监控。

英文摘要

Engine Health Management (EHM) depends on reliable forecasting of Remaining Useful Life (RUL) and on tracking thermal indicators such as turbine gas temperature (TGT). In practice, real-world fleet data are heterogeneous and non-stationary, and point predictions alone are insufficient for risk-aware maintenance decisions. This paper presents a multi-task scientific machine learning framework for turbine prognostics that jointly predicts turbine gas temperature untrimmed (TGTU), Delta Turbine Gas Temperature (DTGT), and RUL, with quantified uncertainty in the form of prediction intervals whose empirical coverage is evaluated. A shared sequence encoder (convolutional front-end with residual bidirectional LSTM layers and attention pooling) feeds task-specific heads, including mean--variance estimation for probabilistic regression and, optionally, a survival head for threshold-based event modeling. The framework is designed to be tunable via a small set of practitioner-facing parameters (e.g., DTGT thresholding rules and RUL target construction) so that deployment can align with in-house policies and proprietary criteria. The predictive performance of the proposed framework is evaluated using both point and interval metrics, including mean absolute error (MAE), prediction interval coverage probability (PICP), mean prediction interval width (MPIW), and the coverage--width criterion (CWC). Results are reported both in aggregate and stratified by flight phase and maintenance segment to highlight operational-context effects and to support uncertainty-aware monitoring.

URL PDF HTML ☆

赞 0 踩 0

2605.30592 2026-06-01 cs.LG

Learning Transferable Predictability Representations

学习可迁移的可预测性表示

Diyali Goswami, Auroop R. Ganguly

发表机构 * Sustainability and Data Sciences Laboratory (SDS Lab)（可持续性与数据科学实验室）； AI4CaS: AI for Climate and Sustainability（AI4CaS：为气候与可持续性的人工智能）； Institute for Experiential AI（体验式人工智能研究所）； Pacific Northwest National Laboratory (PNNL)（太平洋西北国家实验室）

AI总结提出Gauge-Fixed Ordinal Network (GON)模型，通过锚定方差目标学习跨系统一致的序数评分，解决可预测性评估中的尺度模糊问题。

Comments 27 pages, 3 figures

详情

AI中文摘要

我们研究将标量分数分配给短轨迹窗口的问题，该分数反映其在有序可预测性机制连续体上的位置，范围从结构化确定性动力学到非结构化随机噪声。现有方法在单个系统内进行确定性-随机性判别，并且不能产生跨系统具有一致数值解释的分数。我们将此形式化为五级可预测性阶梯上的序数估计，并识别出跨系统模糊性的结构来源：仅排序监督使分数坐标在单调重参数化下未固定，我们称之为序数评分的规范自由度。我们提出了规范固定序数网络（GON），这是一种时间卷积模型，使用锚定方差目标训练，将级别-wise分数均值固定到共享目标坐标。GON操作于2-jet特征，这些特征暴露局部轨迹几何结构，由平滑流保持，并被随机代理过程破坏。在五个保留的动力学系统上，从预训练的GON检查点初始化在所有窗口预算上始终优于从头训练，适应深度反映了与训练家族的几何接近性。零样本分数在随机边界保留序数结构，其中代理过程最强烈地破坏非线性几何，并且预训练初始化在所有窗口预算上始终优于从头训练。成对判别和全局一致的序数评分是不同的属性，需要稳定的分数坐标以实现跨系统迁移，这对自然和工程动力学系统的可预测性评估、模型选择和早期预警诊断具有直接影响。

英文摘要

We study the problem of assigning a scalar score to a short trajectory window that reflects its position on an ordered continuum of predictability regimes, spanning structured deterministic dynamics to unstructured stochastic noise. Existing methods address deterministic-versus-stochastic discrimination within a single system and do not produce scores with a consistent numerical interpretation across systems. We formalize this as ordinal estimation over a five-level predictability ladder and identify a structural source of cross-system ambiguity: ranking supervision alone leaves the score coordinate unfixed up to a monotone reparameterization, which we term the gauge freedom of ordinal scoring. We propose the Gauge-Fixed Ordinal Network (GON), a temporal convolutional model trained with an anchor-and-variance objective that pins level-wise score means to shared target coordinates. GON operates on 2-jet features that expose local trajectory geometry, preserved by smooth flows and disrupted by stochastic surrogate procedures. On five held-out dynamical systems, initializing from a pretrained GON checkpoint consistently outperforms training from scratch across all window budgets, with adaptation depth reflecting geometric proximity to the training family. Zero-shot scores retain ordinal structure at the stochastic boundary, where surrogate procedures most strongly disrupt nonlinear geometry, and pretrained initialization consistently beats scratch across all window budgets. Pairwise discrimination and globally coherent ordinal scoring are distinct properties requiring a stable score coordinate for cross-system transfer, with direct implications for predictability assessment, model selection, and early-warning diagnostics across natural and engineered dynamical systems.

URL PDF HTML ☆

赞 0 踩 0

2605.30590 2026-06-01 cs.LG cs.AI cs.CL

Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents

反事实评估揭示临床LLM和智能体的隐藏能力画像

Matt Turk

发表机构 * Protege Data Lab（Protege数据实验室）

AI总结提出因果敏感性评分（CSS），通过沿五个临床维度变异肿瘤病例来评估模型是否按预期方向更新推荐，发现与覆盖度指标排名相反，并揭示所有前沿模型在手术状态干预上的安全盲点。

Comments Accepted to RLEval @ ACM CAIS 2026 (Workshop on Methods and RL Environments for Evaluating AI Agents) and selected for an invited talk based on reviewer ratings. 4-page short paper + appendix

详情

AI中文摘要

两个临床AI系统在基于覆盖度的评分标准上得分几乎相同，但当患者输入变化时行为却截然不同：一个更新其推荐以匹配新的临床信号，而另一个无论输入如何都产生相同输出。我们引入因果敏感性评分（CSS），这是一个预注册的干预性指标，沿五个临床有意义的维度——生物标志物翻转、先前治疗失败、生物标志物移除、手术状态变化和分期扰动——变异肿瘤肿瘤委员会病例，并使用{0, 0.5, 1.0}量表对每个模型是否在预注册的正确方向上更新其推荐进行评分。与基于覆盖度的加权召回指标共识匹配评分（CMS）相比，来自三个实验室的六个前沿模型在224个病例的单次推理中评估，排名几乎完全相反：所有六个模型排名发生变化，CMS最差的模型成为CSS最好的模型，而一个中上CMS模型在CSS上排名最后。我们进一步揭示了一个普遍的安全盲点：每个前沿模型在手术状态干预上失败（D家族最多17.2%的CSS），这是CMS未暴露的发现。该指标也适用于使用工具的智能体：在ReAct风格的实验中，工具使用改善了六个模型中五个的CSS（+2.5到+20.3个百分点），然而CSS最低的模型检索相同的图表部分但仍未能更新其推荐——揭示了仅在反事实评估下可见的结构性响应缺陷。跨评判者复制和三位评估者的医学专业验证确认了总体发现。像CSS这样的干预性预注册指标补充了临床AI智能体的基于覆盖度的评估：它们捕捉了覆盖度指标遗漏的响应性，并为未来的智能体强化学习系统提供了候选的密集奖励信号。

英文摘要

Two clinical AI systems can score nearly identically on coverage-based rubrics yet behave radically differently when their patient inputs change: one updates its recommendations to match the new clinical signal, while the other produces the same output regardless. We introduce the Causal Sensitivity Score (CSS), a pre-registered interventional metric that mutates oncology tumor-board cases along five clinically meaningful dimensions - biomarker flips, prior-treatment failures, biomarker removals, surgery-status changes, and stage perturbations - and scores whether each model updates its recommendations in the pre-registered correct direction using a {0, 0.5, 1.0} scale. Benchmarked against the Consensus Match Score (CMS), a coverage-based weighted recall metric, six frontier models from three labs evaluated in single-shot inference across 224 cases rank in nearly opposite orders: all six models change rank, the CMS-worst model becomes CSS-best, and one upper-mid CMS model ranks last on CSS. We further surface a universal safety blind spot: every frontier model fails on surgery-status interventions (at most 17.2% CSS on Family D), a finding CMS does not expose. The metric also transfers to tool-using agents: in a ReAct-style experiment, tool use improves CSS for five of six models (+2.5 to +20.3 percentage points), yet the lowest-CSS model retrieves the same chart sections and still fails to update its recommendations - revealing a structural responsiveness deficit visible only under counterfactual evaluation. Cross-judge replication and three-rater medical-professional validation confirm the aggregate findings. Interventional pre-registered metrics like CSS complement coverage-based evaluation for clinical AI agents: they capture responsiveness that coverage metrics miss and offer a candidate dense reward signal for future agentic RL systems.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

Spatio-temporal stochastic graph-based learning for infectious disease forecasting

Learning to Perceive the World Through Control: Empowerment-Based Representation Learning

EUDAIMONIA: Evaluating Undesirable Dynamics in AI

Counterfactual Graph for Multi-Agent LLM Calibration

LARK: Learnability-Grounded Trajectory Selection for Efficient Reasoning Distillation

Convergence of Steepest Descent and Adam under Non-Uniform Smoothness

Same Patient, Different Words, Different Diagnosis? Evaluating Semantic Stability in Clinical LLMs

Diffusion Models Preferentially Memorize Prototypical Examples or: Why Does My Diffusion Model Love Slop?

COFT: Counterfactual-Conformal Decoding for Fair Chain-of-Thought Reasoning in Large Language Models

CSULoRA: Closest Safe Update Low-Rank Adaptation

PInVerify: An Offline Embodied Benchmark for Active Instance Verification

Score Broadcast and Decorrelation: A General Framework for Broadcast-Based Credit Assignment

EHRBench: An Automated and Reliable EHR-based Benchmark for Clinical Decision Making with LLMs

CellBRIDGE: Learning Cellular Trajectories via Interaction-Aware Alignment

Controllable Lung Nodule Synthesis via Histogram-Regularized Latent Diffusion Models

The Architecture of Errors: From Universal Impossibility to Patch-Local LLM Reliability

Active Timepoint Selection for Learning Measure-Valued Trajectories

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

Exploiting Chordal Sparsity for Globally Optimal Estimation with Factor Graphs

Improving Selective Classification with Pairwise Queries for Binary Classification

ZAPS-DA: Zero-Phase Action Policy Smoothing with Decoupled Actor for Continuous Control in Reinforcement Learning

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Constrained Flow Optimization via Sequential Fine Tuning for Molecular Design

TASER: Task-Aware Stein Regularisation for Geometry-Driven Robustness

AMNESIA: A Large Scale Medical Unlearning Benchmark Suite with Disease-Informed Analysis

ScaleMAP: Preserving Local Density and Neighborhood Structure in Low-Dimensional Embeddings

Improving Relative Representations with Learned Anchors and Whitened Inner Products

Scientific Machine Learning for Engine Health Management and Remaining Useful Life Prediction

Learning Transferable Predictability Representations

Counterfactual Evaluation Reveals Hidden Capability Profiles in Clinical LLMs and Agents