arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2075
专题追踪
2605.06977 2026-05-11 cs.LG cs.AI cs.IT math.IT stat.ML

$f$-Divergence Regularized RLHF: Two Tales of Sampling and Unified Analyses

$f$-Divergence Regularized RLHF:采样两则与统一分析

Di Wu, Chengshuai Shi, Jing Yang, Cong Shen

发表机构 * Department of Electrical and Computer Engineering, University of Virginia, Virginia, United States(弗吉尼亚大学电气与计算机工程系) Princeton Language and Intelligence, Princeton University, New Jersey, United States(普林斯顿大学语言与智能)

AI总结 本文提出基于一般$f$-散度正则化的在线RLHF框架,通过两种采样策略实现统一理论分析,证明了算法在$O(\log T)$ regret和$O(1/T)$ sub-optimality gap下的效率。

Comments ICML 2026

详情
AI中文摘要

强化学习从人类反馈(RLHF)已成为训练大语言模型的关键技术。尽管现有方法多依赖反向KL正则化,但近期研究开始探索替代的散度(如前向KL、卡方)作为正则化器。然而,一般$f$-散度正则化的统一理论理解仍不足。本文开发了在线RLHF的综合理论框架,采用整体视角提出两种算法,分别基于不同的采样原则。第一种扩展了经典乐观原则并引入精心设计的探索奖励,第二种则利用$f$-散度正则化下最优策略对奖励扰动的敏感性。理论分析显示可实现$O(\log T)$后悔和$O(1/T)$次优差距,证明了两种算法的效率,并为一般$f$-散度正则化的在线RLHF建立了首个性能界。

英文摘要

Reinforcement Learning from Human Feedback (RLHF) has become a cornerstone technique for post-training large language models. While most existing approaches rely on the reverse KL-regularization, recent empirical studies have begun exploring alternative divergences (e.g., forward KL, chi-squared) as regularizers in RLHF. However, a unified theoretical understanding of general $f$-divergence regularization remains under-explored. To fill this gap, this work develops a comprehensive theoretical framework for online RLHF with a general $f$-divergence regularized objective. Rather than treating each possible divergence function individually, we adopt a holistic perspective across the entire function class and propose two algorithms based on distinct sampling principles. The first extends the classical optimism principle with a carefully designed exploration bonus, while the second introduces a new method that exploits the sensitivity of the optimal policy to reward perturbations under $f$-divergence regularization. Theoretical analysis shows that $O(\log T)$ regret and $O(1/T)$ sub-optimality gap are achievable, establishing provable efficiency of both algorithms and, to the best of our knowledge, the first performance bounds for online RLHF under general $f$-divergence regularization.

2605.06966 2026-05-11 cs.RO cs.SE

Traffic Scenario Orchestration from Language via Constraint Satisfaction

通过约束满足实现基于语言的交通场景编排

Frieda Rong, Chris Zhang, Kelvin Wong, Raquel Urtasun

发表机构 * University of Toronto(多伦多大学) Waabi

AI总结 本文提出一种基于约束满足的交通场景编排方法,通过自然语言生成约束条件,利用现成求解器实现闭环测试中的精确场景控制,显著提高了场景编排成功率。

Comments 19 pages, 10 figures; full version of paper accepted for poster presentation at ICRA 2026

详情
AI中文摘要

自动驾驶车辆需要在模拟中进行大量测试,但生成驾驶场景的测试用例很耗时。所需的场景通常是分布外的,并对与被测AV策略的交互有精确要求。手动编程场景可以实现精确控制,但难以扩展。另一方面,统计模型可以利用计算和数据,但在分布外情况下难以实现精确控制。我们将场景编排视为一个约束求解问题,并提出一种语言输入、模拟输出的场景编排器,用于闭环测试自动驾驶车辆。我们的方法利用基础模型推理,将通用的自然语言描述转换为一组约束作为场景表示。这使得我们可以利用现成的求解器来求解满足精确测试意图的演员行为。在经过精心设计和多样化的场景描述基准测试中,我们的方法在编排成功率上显著优于基线方法。我们进一步证明,我们的闭环方法对于需要自我反应规范的场景尤其重要。

英文摘要

Autonomous vehicles (AVs) require extensive testing in simulation, but test case generation for driving scenarios is laborious. The desired scenarios are often out-of-distribution and have precise requirements on interactions with the AV policy under test. Manually programming scenarios allows for precise controllability but is difficult to scale. On the other hand, statistical models can leverage compute and data, but struggle with precise controllability when out-of-distribution. We cast scenario orchestration as a constraint-solving problem and present a language-in, simulation-out scenario orchestrator for closed-loop testing AVs. Our approach leverages foundation model reasoning to translate general, natural language descriptions into a set of constraints as a scenario representation. This then allows us to leverage off the shelf solvers to solve for actor behaviors which meet precise testing intentions in closed-loop. Under a benchmark of carefully crafted and diverse scenario descriptions, our approach greatly outperforms our baselines in orchestration success rate. We further show that our closed-loop approach is especially important for scenarios which require ego-reactive specifications.

2605.06957 2026-05-11 cs.AI

Learning and Reusing Policy Decompositions for Hierarchical Generalized Planning with LLM Agents

基于LLM代理的分层通用规划中的策略分解学习与重用

Shirin Sohrabi, Haritha Ananthakrishnan, Harsha Kokel, Kavitha Srinivas, Michael Katz

发表机构 * IBM

AI总结 本文提出一种结合通用规划与分层任务分解的动态策略学习方法,通过自动分解学习可重用的策略组件,提升任务执行效率与准确性。

详情
AI中文摘要

我们提出了一种动态策略学习方法,结合通用规划和分层任务分解,用于基于大语言模型(LLM)的智能体。我们的方法,即通用策略分层组件学习(HCL-GP),学习能够跨任务实例泛化的参数化策略,并自动从成功执行中提取可重用的组件,将其组织成组件库以生成组合策略。我们解决了三个挑战:(1)通过自动化分解学习组件;(2)将组件泛化以最大化重用;(3)通过语义搜索实现高效检索。在AppWorld基准测试中,我们的方法在正常任务上达到98.2%的准确率,在挑战任务上达到97.8%的准确率,比静态合成在挑战场景中提升了15.8个百分点。对于开源模型,动态重用使成功率达到62.5%,而无重用则接近零。这表明经典规划概念可以有效与LLM智能体结合,提升准确性和效率。

英文摘要

We present a dynamic policy-learning approach that combines generalized planning and hierarchical task decomposition for LLM-based agents. Our method, Hierarchical Component Learning for Generalized Policies (HCL-GP ), learns parameterized policies that generalize across task instances and automatically extracts reusable components from successful executions, organizing them into a component library for compositional policy generation. We address three challenges: (1) learning components through automated decomposition, (2) generalizing components to maximize reuse, and (3) efficient retrieval via semantic search. Evaluated on the AppWorld benchmark, our approach achieves 98.2% accuracy on normal tasks and 97.8% on challenge tasks with unseen applications, improving 15.8 points over static synthesis on challenging scenarios. For open-source models, dynamic reuse enables 62.5% success versus near-zero without reuse. This demonstrates that classical planning concepts can be effectively integrated with LLM agents for improved accuracy and efficiency.

2605.06955 2026-05-11 cs.LG cs.AI

Kurtosis-Guided Denoising Score Matching for Tabular Anomaly Detection

偏度引导的去噪分数匹配用于表格异常检测

Victor Livernoche, Jie Zan, Reihaneh Rabbany

发表机构 * McGill University(麦吉尔大学) Mila

AI总结 本文提出基于偏度的去噪分数匹配方法,通过每特征调整噪声水平以提升低密度区域覆盖和高密度区域精度,无需额外模型复杂度,在半监督和全无监督设置中均取得优异性能。

Comments 39 pages, 10 figures, 14 tables

详情
AI中文摘要

去噪分数匹配(DSM)通过训练神经网络从噪声污染样本中恢复分数函数(定义为对数密度的梯度)来学习数据分布。一旦训练完成,测试点的分数幅度反映该点与学习分布的一致性,成为自然的异常信号。关键的实践挑战是选择扰动尺度:噪声过少会导致稀疏区域分数估计不稳定,噪声过多会抹去局部结构并削弱异常敏感性。这在异常未知且无验证集的情况下更加复杂。我们引入基于偏度的噪声缩放(K-DSM),一种每特征方案,通过每个边缘分布的形状设置噪声水平,提高低密度区域的覆盖并提高高密度区域的精度,而无需额外模型复杂度。与之前声称多尺度或噪声条件训练是必要的不同,我们发现经过仔细训练的单尺度模型已经是强大的异常检测器。在标准表格异常检测基准上,K-DSM在半监督设置中取得最佳性能。当结合轻量级EMA-教师过滤规则,该规则在每次梯度步骤前移除低密度训练点时,也在全无监督(受污染)设置中取得强性能,表明简单的数据自适应噪声缩放能够实现稳健的异常检测,同时减少对超参数调优的依赖。

英文摘要

Denoising score matching (DSM) provides a way to learn data distributions by training a neural network to recover the score function, defined as the gradient of the log density, from noise-corrupted samples. Once trained, the score magnitude at a test point reflects how consistent that point is with the learned distribution, making it a natural anomaly signal. The key practical challenge is selecting the perturbation scale: too little noise yields unstable score estimates in sparse regions, while too much erases local structure and weakens anomaly sensitivity. This is compounded by the difficulty of hyperparameter tuning when anomalies are unknown and no validation set is available. We introduce kurtosis-based noise scaling (K-DSM), a per-feature scheme that sets noise levels from the shape of each marginal distribution, improving coverage of low-density regions and precision in high-density regions without extra model complexity. Contrary to prior claims that multi-scale or noise-conditioned training is necessary, we find that a carefully trained single-scale model is already a strong anomaly detector. On standard tabular anomaly detection benchmarks, K-DSM achieves state-of-the-art performance in the semi-supervised setting. When combined with a lightweight EMA-teacher filtering rule that removes low-density training points before each gradient step, it also achieves strong performance in the fully unsupervised (contaminated) setting, suggesting that simple, data-adaptive noise scaling enables robust anomaly detection while reducing reliance on hyperparameter tuning.

2605.06951 2026-05-11 cs.AI cs.LG cs.MA

Multi-Objective Constraint Inference using Inverse reinforcement learning

多目标约束推断使用逆强化学习

Syed Ihtesham Hussain Shah, Floris den Hengst, Aneta Lisowska, Annette ten Teije

发表机构 * Faculty of Sciences(科学学院)

AI总结 本文提出多目标约束推断框架MOCI,用于从异质专家轨迹中联合提取共享约束和个体偏好,有效处理不同目标下的多样且可能冲突的行为,实验证明其在预测性能和计算效率方面优于现有方法。

详情
AI中文摘要

约束推断被广泛认为是通过观察专家示范将强化学习代理对齐安全边界和操作指南的关键。然而,现有方法通常假设示范是同质的(即由单一专家或目标相同的多个专家生成)。它们还难以捕捉个体偏好,且常面临计算效率低下的问题。本文引入多目标约束推断(MOCI),一种新的框架,旨在从异质专家轨迹中联合提取共享约束和个体偏好,其中多个专家追求不同目标。MOCI有效建模和学习多样且可能冲突的行为。实验证明,MOCI在预测性能上显著优于现有基线方法,并在标准网格世界基准上保持竞争性的计算效率。这些结果确立了MOCI作为准确、灵活且计算可行的方法,用于现实中的约束推断和偏好学习任务。

英文摘要

Constraint inference is widely considered essential to align reinforcement learning agents with safety boundaries and operational guidelines by observing expert demonstrations. However, existing approaches typically assume homogeneous demonstrations (i.e., generated by a single expert or multiple experts with identical objectives). They also have limited ability to capture individual preferences and often suffer from computational inefficiencies. In this paper, we introduce Multi-Objective Constraint Inference (MOCI), a novel framework designed to jointly extract shared constraints and individual preferences from heterogeneous expert trajectories, where multiple experts pursue different objectives. MOCI effectively models and learns from diverse, and potentially conflicting, behaviors. Empirical evaluations demonstrate that MOCI significantly outperforms existing baselines, achieving improved predictive performance, and maintaining competitive computational efficiency on a standard grid-world benchmark. These results establish MOCI as an accurate, flexible, and computationally practical approach for real-world constraint inference and preference learning tasks.

2605.06947 2026-05-11 cs.LG

Rollback-Free Stable Brick Structures Generation

无需回滚的稳定砖块结构生成

Chenhui Xu, Ziyue Bai, Fuxun Yu, Heng Huang, Jinjun Xiong

发表机构 * University of Buffalo(布法罗大学) University of Maryland, College Park(马里兰大学学院公园分校) Microsoft(微软) University of Texas at San Antonio(德克萨斯大学圣安东尼奥分校)

AI总结 本文提出一种强化学习方法,通过训练时的策略优化而非测试时的纠正,实现稳定砖块结构的高效生成,提升生成质量和推理速度。

详情
AI中文摘要

尽管自回归模型在3D生成方面有所进展,但创建物理稳定的砖块结构仍面临挑战,因为需要严格考虑重力和连接性。现有方法在推理过程中依赖外部物理模拟器进行拒绝采样和逐块回滚,严重限制了效率。为此,我们提出了一种强化学习范式,将物理有效性检查从测试时的纠正转移到训练时的策略优化。通过利用装配级奖励,模型优化碰撞避免、全局连接性、结构互锁和形状一致性。该范式使模型能够内化物理先验,实现首个无需回滚的稳定砖块结构生成。实验结果表明,我们的方法在生成质量上达到最新水平,同时将推理速度提升多个数量级。我们的代码和数据集可在https://github.com/miniHuiHui/STABLE获得。我们的模型可在https://huggingface.co/miniHui/STABLE获得。

英文摘要

While autoregressive models have advanced 3D generation, creating physically stable brick structures remains a challenge due to the strict requirements of gravity and interconnectivity. Existing approaches rely on external physical simulators during inference to perform rejection sampling and brick-by-brick rollbacks, which severely bottlenecks efficiency. To address this, we propose a reinforcement learning paradigm that shifts physical validity enforcement from test-time correction to training-time policy optimization. By utilizing assembly-level rewards, the model optimizes for collision avoidance, global connectivity, structural interlocking, and shape conformity. This paradigm allows the model to internalize physical priors, enabling the first rollback-free generation of stable brick structures. Experimental results demonstrate that our approach achieves state-of-the-art generation quality while accelerating inference speed by orders of magnitude. Our code and dataset are available at https://github.com/miniHuiHui/STABLE. Our models are available at https://huggingface.co/miniHui/STABLE.

2605.06946 2026-05-11 cs.LG cs.AI

Adaptive Memory Decay for Log-Linear Attention

自适应记忆衰减的对数线性注意

Yaxita Amin, Helen Zichen Li, Mengfan Zhang, Samet Ayhan

发表机构 * University of Maryland(马里兰大学)

AI总结 本文提出通过轻量级两层MLP学习记忆衰减参数λ,实现按内容自适应的衰减,提升长距离记忆任务性能。

Comments 19 pages, 13 figures. Preprint

详情
AI中文摘要

序列模型面临记忆容量与计算效率之间的根本权衡。Transformer以二次成本实现表达性上下文建模,而线性注意和状态空间模型通过将上下文压缩为固定大小的隐藏状态在线性时间内运行,本质上限制了回忆能力。对数线性注意通过在Fenwick树层次结构中组织内存,在log-linear计算成本下以对数方式增长隐藏状态。然而,其记忆衰减参数λ固定且独立于输入,对所有层次分配均匀权重,引入了不必要的刚性。本文提出通过轻量级两层MLP直接从输入学习λ,产生按token和层次的衰减,适应内容而非位置。软plus激活使每个Fenwick树层次独立扩展,避免softmax引入的层次间竞争。此修改保持log-linear复杂度精确,并增加 negligible 参数开销。我们在关联回忆、选择性复制和语言建模上进行评估,发现输入依赖的衰减在长距离记忆设置中表现最佳,其中基线λ退化或完全崩溃。

英文摘要

Sequence models face a fundamental tradeoff between memory capacity and computational efficiency. Transformers achieve expressive context modeling at quadratic cost, while linear attention and state-space models run in linear time by compressing context into a fixed-size hidden state, inherently limiting recall. Log-linear attention navigates this tradeoff by organizing memory across a Fenwick tree hierarchy, growing its hidden state logarithmically with sequence length at log-linear compute cost. However, its memory decay parameter λ is fixed and independent of the input, assigning uniform weights across all hierarchy levels regardless of the content, which introduces unnecessary rigidity. We propose learning λ directly from the input via a lightweight two-layer MLP, producing per-token, per-level decay that adapts to content rather than position. A softplus activation lets each Fenwick tree level scale independently, avoiding the inter-level competition that softmax introduces. This modification preserves log-linear complexity exactly and adds negligible parameter overhead. We evaluate on associative recall, selective copying, and language modeling, finding that input-dependent decay consistently outperforms the baseline, with the largest gains in long-range memory settings where baseline λ degrades or collapses entirely.

2605.06943 2026-05-11 cs.LG

ProtoSSL: Interpretable Prototype Learning from Unlabeled Time-Series Data

ProtoSSL: 从未标记时间序列数据中学习可解释的原型

Steven Song, Sahil Sethi, Brett Beaulieu-Jones, Robert L. Grossman

发表机构 * Department of Computer Science(计算机科学系) Center for Translational Data Science(转化数据科学中心) Medical Scientist Training Program(医学科学家培训计划) Pritzker School of Medicine(皮尔兹克医学院) Center for Computational Medicine & Clinical AI(计算医学与临床人工智能中心) Section of Biomedical Data Science, Department of Medicine(医学部生物医学数据科学部门)

AI总结 ProtoSSL通过自监督学习从未标记时间序列数据中学习可解释的原型,提升低数据场景下的性能,并在人类评估中表现更优,扩展至音频分类。

详情
AI中文摘要

在需要预测性能和可解释性的时序领域,深度神经网络虽表现优异,但缺乏预测机制的洞察。基于投影的原型网络通过将预测基于代表性训练示例的相似性,实现案例解释和全局原型检查。然而,现有方法依赖标签监督,将原型绑定到特定任务,需大量标注数据。我们引入ProtoSSL,一种从未标注时间序列数据中学习可解释、基于投影的原型的新框架,并将其适应到下游任务。我们的关键思想是将动机发现与标签对齐分离。ProtoSSL首先通过直接应用于原型激活的自监督目标学习可重用的原型库,然后通过高效的分配过程将这些原型对齐到下游任务。在六个心电图(ECG)数据集中,ProtoSSL提高了标签效率,在少于256个标注示例的低数据场景中优于监督原型基线;在微调后,其在完整数据集规模上优于监督原型基线。在一项人类评估研究中,ProtoSSL生成的原型和基于原型的解释在直接标签监督下学习的原型中更受好评。我们进一步表明,该框架可扩展至音频分类。因此,ProtoSSL实现了从已知下游标签空间之前从未标注数据中学习可推广的原型,以及随后将可解释、基于投影的原型分配到新时间序列任务。

英文摘要

In time-series domains where both predictive performance and interpretability are essential, deep neural networks achieve strong results but provide limited insight into how their predictions are made. Projection-based prototype networks address this limitation by grounding predictions in similarity to representative training examples, enabling case-based explanations and global prototype inspection. However, existing approaches rely on label supervision, tying prototypes to a specific task and requiring large labeled datasets. We introduce ProtoSSL, a novel framework for learning interpretable, projection-based prototypes from unlabeled time-series data and adapting them to downstream tasks. Our key idea is to separate motif discovery from label alignment. ProtoSSL first learns a reusable prototype bank using a self-supervised objective applied directly to prototype activations, and then aligns these prototypes to downstream tasks through an efficient assignment procedure. Across six electrocardiography (ECG) datasets, ProtoSSL improves label efficiency, outperforming supervised prototype baselines in low-data regimes with as few as 256 labeled examples; with fine-tuning, ProtoSSL outperforms supervised prototype baselines at full dataset scale. In a human evaluation study, ProtoSSL produces prototypes and prototype-based explanations that are judged more favorably than those learned with direct label supervision. We further show that the framework extends to audio classification. Thus, ProtoSSL enables both learning generalizable prototypes from unlabeled data before the downstream label space is known, and subsequent assignment of interpretable, projection-grounded prototypes to new time-series tasks.

2605.06941 2026-05-11 cs.LG math.OC

Causal-Aware Foundation-Model for Bilevel Optimization in Discrete Choice Settings

面向离散选择场景的因果感知基础模型用于双层优化

Shivaram Subramanian, Zhengliang Xue, Markus Ettl, Yingdong Lu, Jayant Kalagnanam

发表机构 * IBM T.J. Watson Research Center(IBM T.J. Watson研究院)

AI总结 本文提出一种因果感知基础模型框架,用于离散选择环境中实时最优决策。通过约束三头价格优化网络解决服务提供者选择最优商品组合的问题,同时考虑用户个性化接受或拒绝选择。模型在模拟、合成和真实数据集上表现优异,应用于医疗、招标定价等领域,提升定价KPI。

详情
AI中文摘要

我们介绍了一种因果感知基础模型框架,用于离散选择环境中的实时最优决策。我们提出了一种约束三头价格优化(C3PO)网络,用于解决服务提供者选择最优商品组合的问题,同时考虑异质用户个性化接受或拒绝选择的问题。C3PO集成了价格模仿学习、多任务收入响应学习以及上下文学习价格弹性,以生成定价建议并遵守业务约束。在推理过程中,前沿模型提示从行为经济学文献中检索增强的弹性先验,以提高定价效果。我们使用模拟、合成和真实世界数据集展示了强大的上下文学习性能。C3PO是在由多个经典离散选择模型生成的模拟数据上训练的。模型在包含模拟客户群体和反事实动作和结果对的数据上进行训练,并在无访问底层偏好结构的随机生成选择环境中进行评估。训练后的模型在定价KPI上持续改进,随着客户价格敏感性增加,收益增加。我们还部署了调优的基础模型用于医疗、招标定价、航空附加服务定价等实际应用,实现了多个产品、市场和部门的显著收益。

英文摘要

We introduce a causal aware foundation-model framework for real time optimal decision making in discrete choice environments. We propose a constrained triple-head price optimization (C3PO) network to solve a bilevel decision problem in which a service provider selects an optimal assortment while heterogeneous users make personalized acceptance or rejection choices optimizing their own personalized preferences. C3PO integrates imitation learning of prices, multi-task learning of revenue responses, and in context learning of price elasticity to generate pricing recommendations while adhering to business constraints. During inference, frontier model prompting retrieves an enhanced elasticity prior for new products from behavioral economics literature, improving pricing effectiveness. We demonstrate strong in context learning performance using simulated, synthetic, and real-world datasets. C3PO is trained on simulated data generated from multiple classical discrete choice models in economics. The model is trained on data comprising simulated customer segments and counterfactual action and outcome pairs and evaluated on randomly generated choice environments with no access to the underlying preference structure. The trained model consistently improves the pricing KPIs, with gains increasing as customer price sensitivity increases. We also deploy the tuned foundation model for optimal pricing in real-world applications such as healthcare, tender pricing, airline ancillary pricing, and other domains, achieving substantial gains across multiple products, markets, and divisions.

2605.06939 2026-05-11 cs.LG stat.ME stat.ML

Bias and Uncertainty in LLM-as-a-Judge Estimation

大语言模型作为裁判的偏差与不确定性估计

James Fiedler

发表机构 * Indeed Inc.(Indeed公司)

AI总结 研究大语言模型作为裁判评估中偏差和不确定性的系统性问题,提出J和ΔJ作为诊断工具,分析共享校准比较的可靠性及报告指导。

详情
AI中文摘要

LLM-as-a-Judge评估已成为评估基础模型性能的标准工具。然而,通过简单估计器(即原始裁判输出)来表征性能存在系统性偏差。近期工作提出了修正此偏差的估计器,但其可靠性依赖于裁判质量以及模型比较中的校准稳定性。在比较模型时共享校准在实践中具有吸引力,但可能引入严重偏差,包括在比较估计指向错误方向且置信度高时的情况。我们通过分析结果、在裁判质量(J)和跨模型校准不稳定性(ΔJ)上的模拟,以及一个实际数据MMLU-Pro案例研究,探讨这些失败模式。我们提出J和ΔJ作为诊断工具,用于判断修正后的估计,尤其是共享校准比较时的可靠性,并提供LaaJ评估的报告指导。

英文摘要

LLM-as-a-Judge evaluation has become a standard tool for assessing base model performance. However, characterizing performance via the naive estimator, i.e., raw judge outputs, is systematically biased. Recent work has proposed estimators to correct this bias, but their reliability depends critically on judge quality and, for model comparisons, on calibration stability. Sharing calibration across compared models is practically attractive but can introduce severe bias, including cases where the comparison estimate points in the wrong direction with high apparent confidence. We study these failure modes through analytical results, simulations over judge quality ($J$) and cross-model calibration instability ($ΔJ$), and a real-data MMLU-Pro case study with sign reversal. We propose $J$ and $ΔJ$ as diagnostics for when corrected estimates, especially shared-calibration comparisons, are likely unreliable, and provide reporting guidance for LaaJ evaluation.

2605.06938 2026-05-11 cs.LG cs.AI

A Generalized Singular Value Theory for Neural Networks

神经网络的广义奇异值理论

Brian Charles Brown, Robert Bridges, David Grimsman, Mauricio Munoz, Sean Warnick

发表机构 * Department of Computer Science(计算机科学系) Brigham Young University AI Sweden

AI总结 本文基于Brown等人提出的广义奇异值分解理论,证明现代神经网络架构可通过广义SVD表示实现左可逆性,并提出数据驱动算法用于估计该表示。

详情
AI中文摘要

基于Brown等人[2025]提出的抽象广义奇异值分解(GSVD)理论,我们证明大多数现代神经架构允许广义SVD表示,在最终线性层之前具有左可逆性,且不改变输入输出行为。进一步地,输入输出行为的左可逆非线性部分可以被设计为规范保持,即嵌入(最终线性层前的激活)的扰动与输入扰动成比例,从而在特征空间中的距离可直接校准到输入空间中的距离。我们提供了一种数据驱动的算法来从训练模型中估计这种表示,并提出一种自然促进分解的模型架构。然后,我们提供了一个证明概念,表明学习到的表示可用于识别模型输入的对抗扰动,并开发了未来应用于模型偏见和可逆性等领域的理论。

英文摘要

Building on the abstract Generalized Singular Value Decomposition (GSVD) theory of Brown et al. [2025], we prove that most modern neural architectures admit a generalized SVD representation in which they are left-invertible before a final linear layer, with no change in input-output behavior. Furthermore, the left-invertible nonlinear portion of the input-output behavior can be made to be \emph{norm preserving}, meaning that perturbations in the left-invertible ``embedding'' (the activations prior to the final linear layer in this representation) correspond proportionally to changes in the input, i.e., distance in feature space can be calibrated directly to distance in input space. We provide a data-driven algorithm for estimating this representation from trained models and propose a model architecture that naturally facilitates the decomposition. We then provide a proof-of-concept that the learned representation can be used to identify adversarial perturbations to model inputs, and develop the theory necessary for future applications to areas such as model bias and invertibility.

2605.06937 2026-05-11 cs.LG

A Reproducible Optimisation Protocol for Calibrating Prompt-Based Large Language Model Workflows in Evidence Synthesis

一种可重复的优化协议用于校准基于提示的大型语言模型在证据综合中的工作流程

Teo Susnjak

发表机构 * School of Mathematical and Computational Sciences(数学与计算科学学院)

AI总结 本文提出了一种可重复的校准流程,用于基于提示的大型语言模型在结构化证据综合任务中的优化。方法将定义科学任务的规则与可变的提示框架分离,并通过标记或参考示例及显式任务指标优化框架,最终保存校准的工作流程作为可检查的制品。

详情
AI中文摘要

本文的方法文章提出了一种可重复的校准流程,用于基于提示的大型语言模型(LLMs)在结构化证据综合任务中的优化。该方法将定义科学任务的规则与可变提示框架分离,该框架框架并应用这些规则。它通过标记或参考示例以及显式任务指标优化该框架,然后将校准后的工作流程保存为可检查的制品,包括其规范、指标、设置和评估轨迹。示例代码使用DSPy和GEPA工具实例化该协议,但其底层逻辑可以转移到其他支持结构化任务定义、指标引导搜索和制品重用的提示优化框架中。标题和摘要筛选是工作验证案例,因为它提供了标记的基准数据和清晰的评估指标。演示的工作流程使用较小的学生LLM执行科学任务,并使用较大的反思LLM在校准期间引导提示优化过程。本工作展示了编译、制品往返以及优化预算如何影响较小的学生模型。

英文摘要

This methods article presents a reproducible calibration workflow for prompt-based large language models (LLMs) in structured evidence-synthesis tasks. The method separates the rules that define the scientific task from the mutable prompt harness that frames and applies them. It optimises that harness against labelled or reference examples and an explicit task metric, then preserves the calibrated workflow as an inspectable artefact with its specification, metric, settings, and evaluation traces. The example code instantiates the protocol with DSPy and GEPA tools, but the underlying logic can transfer to other prompt-optimisation frameworks that support structured task definitions, metric-guided search, and artefact reuse. Title and abstract screening is the worked validation case because it provides labelled benchmark data and clear evaluation metrics. The demonstrated workflow uses a smaller student LLM for performing the scientific task execution and a larger reflection LLM to steer the prompt optimisation process during calibration. This work shows compilation, artefact round-tripping, and how optimisation budget affects a smaller student model.

2605.06934 2026-05-11 cs.LG

Learned Lyapunov Shielding for Adaptive Control

基于学习的李雅普诺夫防护用于自适应控制

Giansalvo Cirrincione, Adriano Fagiolini

发表机构 * MIRPALab, Department of Engineering, University of Palermo(MIRPAL实验室、工程系、巴勒莫大学) Laboratoire LTI, Université de Picardie Jules Verne(LTI实验室、皮卡第大学朱利斯·韦尔纳大学) University of Palermo(巴勒莫大学)

AI总结 本文提出了一种结合学习组件的自适应控制器,通过结构二次李雅普诺夫函数、软演员-评论家策略和物理感知神经网络,实现安全过滤和动态补偿,验证了控制器的全局可行性、指数稳定性及收敛性。

详情
AI中文摘要

我们为Euler-Lagrange系统增强了Slotine-Li自适应控制器,加入了三个学习组件:一个结构二次李雅普诺夫函数V_ψ,其正定性来自Cholesky参数化;一个残差Soft Actor-Critic策略,通过有界扭矩修正补充解析基线;以及一个物理感知神经网络,用于估计未建模动力学。一个闭合形式的安全过滤器,基于单个仿射约束V_ψ+αV_ψ≤0,将每个策略输出投影到安全集,无需在线QP求解器。我们证明了在漂移衰减条件下过滤器的全局可行性;在精确防护下,指数稳定性,具有鲁棒扩展,其边距取决于PINN近似误差;三次尺度策略-证书-乘数更新几乎必然收敛到KKT点;并为证书的PAC泛化界。在具有非线性摩擦和可变负载的2-DOF机械臂上,学习证书贡献了大部分经验增益:在名义摩擦下跟踪误差降低41%,在激进摩擦下降低24%。7-DOF可扩展性研究在Franka Emika Panda上证实了在工业规模下的清洁收敛,识别了在何种条件下增益应和不应超过基于精确模型的基线,且记录了学习证书的warm-start病理现象,这对部署有实际影响。

英文摘要

We augment the Slotine--Li adaptive controller for Euler--Lagrange systems with three learned components: a structured-quadratic Lyapunov function \(V_ψ\) whose positive-definiteness follows from a Cholesky parameterization, a residual Soft Actor--Critic policy that adds bounded torque corrections to the analytic baseline, and a physics-informed neural network that estimates unmodeled dynamics. A closed-form safety filter, derived from the single affine constraint \(\dot V_ψ+ αV_ψ\le 0\), projects every policy output onto the safe set without requiring an online QP solver. We prove: global feasibility of the filter under a drift-decay condition on the control-degeneracy set; exponential stability under exact shielding, with a robust extension whose margin depends on the PINN approximation error; almost-sure convergence of the three-timescale policy--certificate--multiplier updates to a KKT point; and a PAC generalization bound for the certificate over compacts. On a 2-DOF manipulator with nonlinear friction and variable payload, the learned certificate accounts for most of the empirical gain: tracking error drops by 41\% on nominal friction and 24\% on aggressive friction at the centroid of the training distribution. A 7-DOF scalability study on a Franka Emika Panda confirms clean convergence of the full pipeline at industrial scale, identifies the conditions under which gains over exact model-based baselines should and should not be expected, and documents a warm-start pathology of the learned certificate that has practical implications for deployment.

2605.06931 2026-05-11 cs.LG

Target-Aware Data Augmentation for SAT Prediction

面向目标的数据增强用于SAT预测

Eshed Gal, Uri Ascher, Eldad Haber

发表机构 * Department of Computer Science(计算机科学系) The University of British Columbia(不列颠哥伦比亚大学) Department of Earth, Ocean and Atmospheric Sciences(地球、海洋和大气科学系)

AI总结 本文提出一种无需求解器的数据生成框架,通过生成正确标注的SAT和UNSAT实例,提升GNN在SAT预测中的性能。

详情
AI中文摘要

基于NP难问题的学习方法显示出越来越大的潜力,但其进展受到生成标记训练数据高成本的限制。在布尔可满足性(SAT)领域,标准流程依赖于求解器循环标注,这在问题规模扩大时表现不佳,并限制了可用监督的量。这一瓶颈阻碍了利用机器学习捕捉硬组合问题结构的更广泛目标。本文提出了一种面向目标、无需求解器的数据生成框架,通过构造生成实例正确标注SAT和UNSAT实例,消除了昂贵求解器调用的需要。我们的方法使生成实例与目标基准的结构特性对齐,使合成数据在下游学习中有效。我们进一步开发了一种线性规划感知的图神经网络(LPGNN)架构,该架构将约束违反残差纳入消息传递,使模型能够利用潜在的优化结构。这些贡献支持了一种以数据为中心的学习范式,其中可扩展、任务对齐的数据生成与模型设计同样关键。我们的方法在数据生成方面实现了数量级的加速,证明了与基准对齐的合成数据能够有效增强基于GNN的SAT预测的求解器标注数据集。

英文摘要

Learning-based approaches to NP-hard problems have shown increasing promise, but their progress is fundamentally constrained by the high cost of generating labeled training data. In domains such as Boolean satisfiability (SAT), standard pipelines rely on solver-in-the-loop labeling, which scales poorly with problem size and limits the amount of usable supervision. This bottleneck hinders the broader goal of leveraging machine learning to capture structure in hard combinatorial problems. In this work, we propose a target-aware, solver-free data generation framework for SAT that produces correctly labeled SAT and UNSAT instances by construction, eliminating the need for expensive solver calls. Our method aligns generated instances with the structural properties of a target benchmark, making synthetic data effective for downstream learning. We further develop a linear-programming-aware graph neural network (LPGNN) architecture that incorporates constraint-violation residuals into message passing, enabling the model to exploit underlying optimization structure. Together, these contributions support a data-centric paradigm for learning on NP-hard problems, where scalable, task-aligned data generation is as critical as model design. Our approach yields orders-of-magnitude speedups in data generation, demonstrating that benchmark-aligned synthetic data can effectively augment solver-labeled datasets for GNN-based SAT prediction.

2605.06927 2026-05-11 cs.CV cs.AI

XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling

XiYOLO:通过迭代架构搜索和缩放实现能量感知的目标检测

Tony Tran, Richie R. Suganda, Bin Hu

发表机构 * Department of Research Computing(研究计算系) University of Houston(休斯敦大学) Department of Electrical and Computer Engineering(电气与计算机工程系) Department of Engineering Technology(工程科技系)

AI总结 本文提出XiYOLO,通过迭代架构搜索和缩放,在异构边缘设备上实现能量感知的目标检测,实验显示其在能耗与准确率之间有更优的权衡。

详情
AI中文摘要

在异构边缘设备上的目标检测必须满足严格的能耗、延迟和内存约束,同时仍能为下游自主性提供可靠的感知。现有的能量感知NAS方法往往针对有限的部署设置,而真实能耗难以优化,因为其高度依赖设备且测量成本高。本文通过结合能量感知的XiResOFA搜索空间、两级能耗估计器和迭代搜索,识别出单一的高效基础架构。然后通过复合缩放将此基础设计转换为XiYOLO家族,适用于不同的部署预算,从而在稀疏硬件测量下实现可解释的准确率-能耗权衡。在PascalVOC、COCO和真实设备部署上的实验表明,XiYOLO在能耗-准确率权衡上优于YOLO基线。在PascalVOC上,中等规模的XiYOLO模型在GPU上比YOLOv12m减少20.6%的能耗,在NPU上减少35.9%。在COCO上,XiYOLO在GPU上比YOLOv12减少高达53.7%的能耗,在NPU上减少51.6%。所提出的两级估计器在少量样本适应下也提高了样本效率。

英文摘要

Object detection on heterogeneous edge devices must satisfy strict energy, latency, and memory constraints while still providing reliable perception for downstream autonomy. Existing energy-aware NAS methods often target limited deployment settings, while real energy remains difficult to optimize because it is highly device-dependent and costly to measure. We address these challenges with an energy-adaptive framework that combines an energy-aware XiResOFA search space, a two-stage energy estimator, and iterative search to identify a single energy-efficient base architecture. We then apply compound scaling to transform this base design into the XiYOLO family across deployment budgets, enabling interpretable accuracy-energy tradeoffs under sparse hardware measurements. Experiments on PascalVOC, COCO, and real-device deployment show that XiYOLO achieves a stronger energy-accuracy tradeoff than YOLO baselines. On PascalVOC, the medium XiYOLO model reaches 86.15 mAP50 while reducing energy relative to YOLOv12m by 20.6% on GPU and 35.9% on NPU. On COCO, XiYOLO reduces energy relative to YOLOv12 by up to 53.7% on GPU and 51.6% on NPU at the small scale. The proposed two-stage estimator also improves sample efficiency over a joint predictor under few-shot adaptation with only 2-20 target-device samples.

2605.06924 2026-05-11 cs.CV cs.AI

A$^2$RD: Agentic Autoregressive Diffusion for Long Video Consistency

A$^2$RD:基于代理的自回归扩散用于长视频一致性

Do Xuan Long, Yale Song, Min-Yen Kan, Tomas Pfister, Long T. Le

发表机构 * Google Cloud AI Research(谷歌云人工智能研究)

AI总结 A$^2$RD通过Retrieve-Synthesize-Refine-Update循环实现长视频一致性合成,提升视频生成的连贯性和叙事一致性。

Comments Project page: http://dxlong2000.github.io/AARD

详情
AI中文摘要

合成一致且连贯的长视频仍是基础挑战。现有方法在长视界下存在语义漂移和叙事崩溃问题。本文提出A$^2$RD,一种将创意合成与一致性执行解耦的代理自回归扩散架构。A$^2$RD将长视频合成视为闭环过程,通过检索-合成-细化-更新循环逐段生成和自我改进视频。其包含三个核心组件:(i) 多模态视频记忆追踪跨模态的视频进展;(ii) 自适应段生成在生成模式间切换以实现自然进展和视觉一致性;(iii) 分层测试时自我改进在帧和视频级别自我改进以防止误差传播。进一步引入LVBench-C基准,包含非线性实体和环境转换以压力测试长视界一致性。在公共和LVBench-C基准上,A$^2$RD在一致性上比最先进基线高出30%,叙事连贯性提高20%。人类评估证实这些收益,同时凸显运动和过渡平滑性的显著改进。

英文摘要

Synthesizing consistent and coherent long video remains a fundamental challenge. Existing methods suffer from semantic drift and narrative collapse over long horizons. We present A$^2$RD, an Agentic Auto-Regressive Diffusion architecture that decouples creative synthesis from consistency enforcement. A$^2$RD formulates long video synthesis as a closed-loop process that synthesizes and self-improves video segment-by-segment through a Retrieve--Synthesize--Refine--Update cycle. It comprises three core components: (i) Multimodal Video Memory that tracks video progression across modalities; (ii) Adaptive Segment Generation that switches among generation modes for natural progression and visual consistency; and (iii) Hierarchical Test-Time Self-Improvement that self-improves each segment at frame and video levels to prevent error propagation. We further introduce LVBench-C, a challenging benchmark with non-linear entity and environment transitions to stress-test long-horizon consistency. Across public and LVBench-C benchmarks spanning one- to ten-minute videos, A$^2$RD outperforms state-of-the-art baselines by up to 30% in consistency and 20% in narrative coherence. Human evaluations corroborate these gains while also highlighting notable improvements in motion and transition smoothness.

2605.06919 2026-05-11 cs.CL

Can LLMs Take Retrieved Information with a Grain of Salt?

大语言模型能否对获取的信息持谨慎态度?

Behzad Shayegh, Mohamed Osama Ahmed, Fred Tung, Leo Feng

发表机构 * RBC Borealis

AI总结 本文评估了八种LLM在上下文确定性服从方面的表现,发现其在不确定上下文中回忆先验知识、解释确定性表达和信任复杂上下文方面存在系统性限制,并提出结合先验提示、确定性重校准和上下文简化的方法,将服从误差降低了25%。

详情
AI中文摘要

大型语言模型已展示了出色的检索增强能力。然而,一个关键领域仍未被充分探索:它们在适当调整响应以匹配表达的上下文确定性方面的能力。在医学和金融等高风险领域,这一限制具有实际后果。我们评估了八种LLM在上下文确定性服从方面的表现,测量其调整响应以匹配表达的上下文确定性的能力。我们的分析揭示了系统性限制:LLM在观察不确定上下文后难以回忆先验知识,误解表达的确定性,并过度信任复杂上下文。为解决这些问题,我们提出了一种结合先验提示、确定性重校准和上下文简化的交互策略。这种方法平均将服从误差降低了25%,而无需修改模型权重,证明了交互设计在增强LLM可靠性方面的有效性。我们的贡献包括一个原则性的评估度量、对LLM不确定性处理的实证洞察,以及一种可移植的策略,以提高不同LLM的上下文确定性服从性。

英文摘要

Large language models have demonstrated impressive retrieval-augmented capabilities. However, a crucial area remains underexplored: their ability to appropriately adapt responses to the certainty of the retrieved information. It is a limitation with real consequences in high-stakes domains like medicine and finance. We evaluate eight LLMs on their context-certainty obedience, measuring how well they adjust responses to match expressed context certainty. Our analysis reveals systematic limitations: LLMs struggle to recall prior knowledge after observing an uncertain context, misinterpret expressed certainties, and overtrust complex contexts. To address these, we propose an interaction strategy combining prior reminders, certainty recalibration, and context simplification. This approach reduces obedience errors by 25% on average, without modifying model weights, demonstrating the efficacy of interaction design in enhancing LLM reliability. Our contributions include a principled evaluation metric, empirical insights into LLMs' uncertainty handling, and a portable strategy to improve context-certainty obedience across diverse LLMs.

2605.06916 2026-05-11 cs.LG

Tyche: One Step Flow for Efficient Probabilistic Weather Forecasting

Tyche:一步流用于高效的概率天气预报

Fan Xu, Yuan Gao, Kun Wang, Rui Su, Fenghua Ling, Hao Wu, Wanli Ouyang

发表机构 * Shanghai AI Lab(上海人工智能实验室)

AI总结 Tyche提出一种一步条件流模型,通过JVP正则化目标和Swin式变换器,在高维地理场中实现高效概率天气预报,实验显示其性能优于多步生成基线和ECMWF IFS集合预报。

详情
AI中文摘要

概率天气预报不仅需要准确的轨迹,还需要在合理的大气未来上校准分布。最近的数据驱动系统在确定性技能上取得了显著进展,扩散基的集合预报器显著提高了样本真实性和不确定性量化。然而,其推理成本随预报时间、集合大小和每个转换所需的去噪步骤数量而增加,使大规模操作集合昂贵。为此,我们提出了Tyche,一种一步条件流模型,用于高效的概率天气预报。Tyche通过目的地感知的平均速度流模型,将高斯噪声直接映射到未来天气状态,单次函数评估(1-NFE)。为了使这种一步传输在高维地理场中可学习,我们推导了一个JVP正则化的校正目标,强制源和目的地流时间步之间的时间自一致性,而无需显式形成雅可比矩阵。传输场由一个各向同性的Swin式变换器参数化,保留了细尺度空间结构,同时在全球网格上保持可扩展性。为了提高自回归预报下的集合可靠性,我们进一步引入了基于回放的微调阶段,结合课程CRPS校准监督。在ERA5数据上1.5°和6小时分辨率的实验显示,我们的Tyche仅使用一次NFE,其预报技能和校准性能与最先进的多步生成基线和操作ECMWF IFS集合相当。

英文摘要

Probabilistic weather forecasting requires not only accurate trajectories, but calibrated distributions over plausible atmospheric futures. Recent data-driven systems have achieved remarkable deterministic skill, and diffusion-based ensemble forecasters have substantially improved sample realism and uncertainty quantification. However, their inference cost scales with forecast horizon, ensemble size, and the number of denoising steps required for each transition, making large operational ensembles expensive. To address this, we present Tyche, a one-step conditional flow model for efficient probabilistic weather forecasting. Tyche models the conditional forecast distribution with a destination-aware average-velocity flow that maps Gaussian noise directly to future weather states in a single function evaluation (1-NFE). To make this one-step transport learnable in high-dimensional geophysical fields, we derive a JVP-regularized rectification objective that enforces temporal self-consistency across source and destination flow timesteps without explicitly forming Jacobians. The transport field is parameterized by an isotropic Swin-style transformer that preserves fine-scale spatial structure while remaining scalable on global grids. To improve ensemble reliability under autoregressive forecasting, we further introduce a rollout-based finetuning stage with curriculum CRPS calibration supervision. Experiments on ERA5 at 1.5$^\circ$ and 6-hour resolution show that our Tyche, using merely a single NFE, matches or exceeds the forecast skill and calibration of state-of-the-art multi-step generative baselines and the operational ECMWF IFS ensemble.

2605.06912 2026-05-11 cs.CV

Advancing Reliable Synthetic Video Detection: Insights from the SAFE Challenge

推进可靠的合成视频检测:来自SAFE挑战的见解

Kirill Trapeznikov, Gabriel Mancino-Ball, Jonathan Li, Paul Cummer, Jai Aslam, Danial Samadi Vahdati, Tai Nguyen, Matthew C. Stamm, Peter Bautista, Michael Davinroy, Laura Cassani, Jill Crisman

发表机构 * STR Drexel University(德鲁伊大学) Aptima, Inc.(Aptima公司) UL Research Institutes(UL研究机构)

AI总结 本文介绍了SAFE挑战的设计、数据集构建及评估方法,揭示了当前合成视频检测方法在跨生成器泛化和抗后处理攻击方面的进展与不足。

详情
AI中文摘要

生成视频技术的普及加剧了对可靠检测和表征合成媒体的需求。为应对这一挑战,我们组织了SAFE:合成视频检测挑战,与在ICCV 2025上举办的

英文摘要

The proliferation of generative video technologies has intensified the need for reliable methods to detect and characterize synthetic media. To address this challenge, we organized the \href{https://safe-video-2025.dsri.org}{SAFE: Synthetic Video Detection Challenge}, co-located with the \textit{Authenticity and Provenance in the Age of Generative AI (APAI) Workshop }at ICCV 2025. The competition invited participants to develop and evaluate algorithms capable of distinguishing real from synthetic videos under fully blind evaluation conditions with over 600 submissions from 12 teams over a 90 day span. Hosted on the Hugging Face platform, the challenge comprised two primary tasks: (1) detection of synthetic video content generated by diverse state-of-the-art models, and (2) detection of synthetic content following common post-processing operations such as resizing, re-compression, motion blur and others. The challenge data consisted of 13 modern high quality synthetic video models with generated content matched to real videos from 21 diverse and challenge sources, all adding up to 20 hours of 6,000 video samples. This paper describes the challenge design, dataset construction, evaluation methodology, and outcomes, offering insights into the generalization and robustness of contemporary synthetic video detection methods. Our findings highlight measurable progress in cross-generator generalization but also persistent vulnerabilities to post-processing artifacts. https://safe-video-2025.dsri.org

2605.06911 2026-05-11 cs.LG

Dual-Scale Temporal Fusion Reveals Structured Predictability in Subseasonal-to-Seasonal Temperature Prediction

双尺度时间融合揭示亚季节至季节温度预测中的结构可预测性

Elnaz Bashir, Jiali Wang, Lin Yan

发表机构 * Department of Computer Science, Iowa State University(艾奥瓦州立大学计算机科学系) Environmental Science Division, Argonne National Laboratory(阿贡国家实验室环境科学部)

AI总结 本文通过双尺度学习框架分离历史气候背景与近期天气演变,揭示亚季节至季节温度预测中可预测性的结构特征,提升预测稳定性与空间一致性。

Comments 10 pages, 5 figures

详情
AI中文摘要

亚季节至季节(S2S)温度预测,涵盖数周至数月,对农业、能源和极端天气风险管理至关重要,但其可靠性因季节和地区而异。本文表明,S2S可预测性由相互作用的时间成分、空间异质性和大尺度模式一致性组织,可通过显式表征和利用。本文开发了双尺度学习框架,分离日历对齐的历史气候背景与与预报时间匹配的近期天气演变,通过空间自适应融合结合,实现30至90天窗口内的稳定温度预测。学习的融合权重揭示,两种时间尺度的平衡随季节和地理变化系统性变化:冬季,年际背景主导高纬度和复杂地形,而夏季预测则在领域内更平衡。这种空间显式可预测性重构,而非简单的预报时间衰减,成为亚季节窗口内预测能力的主要决定因素。拓扑感知的结构约束进一步提高预测温度场的空间一致性,稳定复杂地形上的大尺度模式组织。这些结果将S2S可预测性重新定义为结构化、多尺度现象,为改进预测系统和指导其应用提供了更可解释的基础。

英文摘要

Subseasonal-to-seasonal (S2S) temperature forecasts, spanning several weeks to a few months, are critically needed in agriculture practice, energy planning, and extreme-weather induced risk management, yet their reliability varies substantially across seasons and regions. Forecast skill is often attributed primarily to lead time, but this perspective does not fully explain the spatiotemporal patterns of predictability. Here we show that S2S predictability is organized across interacting temporal components, spatial heterogeneity, and large-scale pattern coherence, and that this structure can be explicitly characterized and exploited. We develop a dual-scale learning framework that separates calendar-aligned historical climate context from lead-time matched recent weather evolution, combining them through spatially adaptive fusion to enable stable temperature forecasts across the 30 to 90-day window. The learned fusion weights reveal that the balance between these two temporal scales shifts systematically with season and geography: during winter, interannual context dominates over high latitudes and complex terrain where forecast is the most difficult, while summer predictions reflect a more balanced temporal contribution across the domain. This spatially explicit reorganization of predictability, rather than simple lead-time decay, emerges as the primary determinant of forecast skill within the subseasonal window. Topology-aware structural constraints further improve spatial coherence of predicted temperature fields, stabilizing large-scale pattern organization particularly over complex terrain. These results reframe S2S predictability as a structured, multi-scale phenomenon, providing a more interpretable foundation for improving forecast systems and informing their use in practice.

2605.06908 2026-05-11 cs.LG cs.AI

Same Signal, Opposite Meaning: Direction-Informed Adaptive Learning for LLM Agents

同信号,异意义:方向感知的自适应学习用于大语言模型代理

Ziming Li, Jiatan Huang, Xiaoguang Guo, Guilin Wang, Chuxu Zhang

发表机构 * University of Connecticut(康奈尔大学) New Jersey Institute of Technology(新泽西理工学院)

AI总结 本文提出DIAL方法,通过方向感知的自适应学习解决LLM代理中计算需求与计算适宜性差异问题,提升性能-成本平衡。

详情
AI中文摘要

适应性测试时间计算旨在仅在提升性能时调用额外计算。现有方法通常使用置信度、不确定性或难度基于的门控,假设门控信号通过计算需求到计算价值的固定方向。这使门控成为一种效用校准问题:门控信号应与额外计算是否改善最终结果对齐。我们显示这种对齐是不稳定的:同一信号在不同设置中预测滚出收益或损害,甚至在任务固定时在不同环境中和不同架构间反转。错误方向的门控可能通过精确选择有害状态来降低性能。这种反转反映了计算需求与计算适宜性之间的深层区别:高不确定性信号可能指示决策困难状态,滚出有助于比较替代方案,或干预不适宜状态,当前上下文不支持基于滚出的改进。在该双源模型下,固定方向门控在异构设置中不可靠。为此,我们提出DIAL(方向感知自适应学习),从信号无关的反事实探索中训练稀疏门控,学习每个(环境,架构)状态特征的效用方向。在六个环境和三个架构上,DIAL比固定方向基线实现更优的性能-成本平衡。

英文摘要

Adaptive test-time compute for LLM agents aims to invoke extra computation only when it improves performance. Existing methods typically use confidence-, uncertainty-, or difficulty-based gates, assuming a fixed direction from the gating signal through compute need to the value of computation. This makes gating a utility-calibration problem: gating signals should align with whether extra computation improves the final outcome over the base policy. We show that this alignment is unstable: the same signal predicts rollout benefit in one setting and rollout harm in another, with reversals across environments and backbones even when the task is fixed. Wrong-direction gates can therefore worsen performance by precisely selecting harmful states. This reversal reflects a deeper distinction between compute need and compute suitability: a high uncertainty signal may indicate decision-difficult states where rollouts help compare alternatives, or intervention-unsuitable states where the current context does not support useful rollout-based improvement. Under this two-source model, fixed-direction gates are unreliable across heterogeneous settings. To address this, we propose DIAL (Direction-Informed Adaptive Learning), a sparse gate trained from signal-agnostic counterfactual exploration to learn the utility direction of state features per (environment, backbone). Across six environments and three backbones, DIAL yields a stronger overall success-cost trade-off than fixed-direction baselines.

2605.06906 2026-05-11 cs.LG

TraXion: Rethinking Pre-training Frameworks for Mobility and Beyond

TraXion:重新思考移动性及其他领域的预训练框架

Shang-Ling Hsu, Mark Tenzer, Cyrus Shahabi, Khurram Shafique

发表机构 * University of Southern California(南加州大学) Novateur Research Solutions(Novateur研究解决方案)

AI总结 本文提出TraXion框架,针对多实体时空事件流(MESES)的三个特性设计预训练目标和架构,在六个公开移动数据集上表现优异,并成功应用于企业认证日志和ICU死亡预测等不同领域。

Comments 31 pages, 2 figures

详情
AI中文摘要

人类移动性与文本和通用时间序列有三个结构性差异:访问是元组值事件,其意义取决于位置、时间和活动的联合分布;用户在轨迹中携带持久签名;访问在不同用户之间不独立,因为共享地点的共现是主要信号。现有移动性预训练方法借鉴语言模型的目标,将轨迹视为句子、访问视为标记,这种类比在上述三个特性上均失效。这些特性定义了一个更广泛的类别,即多实体时空事件流(MESES),涵盖企业认证日志、电子健康记录等其他事件流领域,其中实体共享基础设施、日程或上下文。本文将这些特性精确化为三个公理,任何适用于MESES的预训练框架都应满足,并引入TraXion,其目标和架构共同设计以满足这些要求。单个TraXion检查点在每个数据集上均优于任务特定基线,在六个公开移动数据集上的四个任务(异常检测、下POI推荐、下访问预测、社交链接预测)中表现优异。相同的配方,应用于企业认证日志和ICU死亡预测,与先前工作相比表现匹配或更优,表明移动性、安全性和医疗等不同领域的事件流可在单一框架下建模。

英文摘要

Human mobility differs from text and from generic time series in three structural ways: visits are tuple-valued events whose meaning depends on the joint distribution over location, time, and activity; users carry persistent signatures across trajectories; and visits are not independent across users, since co-location at shared places is a primary signal. Existing pre-training recipes for mobility import objectives from language modeling, treating trajectories as sentences and visits as tokens, an analogy that fails against each of the three properties above. These properties define a broader class, multi-entity spatiotemporal event streams (MESES), spanning enterprise authentication logs, electronic health records, and other event-stream domains where entities share infrastructure, schedules, or contexts. We make the properties precise as three axioms that any pre-training framework for MESES should satisfy, and introduce TraXion, whose objectives and architecture are jointly designed to meet them. A single TraXion checkpoint per dataset beats task-specific baselines on every task across six public mobility datasets covering anomaly detection, next-POI recommendation, next-visit prediction, and social-link prediction. The same recipe, applied unchanged to enterprise authentication logs and ICU mortality prediction, matches or exceeds prior work on both, showing that event streams from domains as different as mobility, security, and healthcare can be modeled under a single framework.

2605.06905 2026-05-11 cs.LG

Conservative Flows: A New Paradigm of Generative Models

保守流:生成模型的新范式

Eshed Gal, Md Shahriar Rahim Siddiqui, Moshe Eliasof, Eldad Haber

发表机构 * The University of British Columbia(不列颠哥伦比亚大学) University of Cambridge(剑桥大学)

AI总结 本文提出一种新的生成模型范式,通过离散随机动力学保持数据分布不变,利用预训练流模型,开发了两种概率保持采样机制,验证了在合成数据集上的优越性。

详情
AI中文摘要

现代生成建模主要依赖从噪声先验到数据的传输。我们提出了一种替代范式,通过离散随机动力学在数据支持状态下生成,保持数据分布不变。该框架可利用任何预训练的流模型。我们开发了两种概率保持采样机制,即带有Metropolis调整的校正Langevin动力学和预测-校正流,直接在现有检查点上运行。我们在合成瑞士卷目标、ImageNet-256和Oxford Flowers-102上验证了该框架,结果显示我们的采样器在原始生成过程中持续改进。

英文摘要

Modern generative modeling is dominated by transport from a noise prior to data. We propose an alternative paradigm in which generation is performed by a discrete stochastic dynamics that leaves the data distribution invariant, initialized from data-supported states rather than from noise. The framework can utilize any pretrained flow model. We develop two probability-preserving sampling mechanisms, a corrected Langevin dynamics with a Metropolis adjustment and a predictor-corrector flow, that operate directly on existing checkpoints. We validate the framework on a synthetic Swiss-roll target, ImageNet-256 and Oxford Flowers-102, where our samplers consistently improve over the original generation procedures.

2605.06903 2026-05-11 cs.CL cs.AI

MELD: Multi-Task Equilibrated Learning Detector for AI-Generated Text

MELD:多任务平衡学习检测器用于AI生成文本

Chenjun Li, Cheng Wan, Johannes C. Paetzold

发表机构 * Cornell University(康奈尔大学) Weill Cornell Medicine(韦尔医学院) Cornell Tech(康奈尔科技)

AI总结 MELD通过多任务学习提升AI生成文本检测的鲁棒性与泛化能力,结合生成器、攻击类型和源域头,平衡损失函数以提高检测精度和低误报率。

Comments 17 pages, 6 figures

详情
AI中文摘要

大型语言模型如今已融入日常写作流程,可靠的人工智能生成文本检测对学术诚信、内容审核和溯源追踪至关重要。然而,实际中,检测器不仅要实现高整体AUROC在干净的分布内人类和AI文本上,还应具备对抗攻击的鲁棒性、迁移至未见过的生成器和领域的能力,并在低误报率(FPR)下运行。大多数现有检测器优化单一AI/人类目标,使表示学习生成器、攻击或领域结构的激励有限一旦二元任务饱和。我们引入MELD(多任务平衡学习检测器),一种用于AI生成文本的部署检测器,通过辅助监督增强二元检测。MELD将生成器家族、攻击类型和源域头连接到共享编码器,并通过学习同方差不确定性权重平衡四个损失。为提高鲁棒性,EMA教师在干净输入上进行预测,同时攻击增强的学生被蒸馏至教师。MELD进一步使用硬负样本对称排名损失以扩大AI生成文本与最混淆的人类文本之间的分数边缘。在推理时,所有辅助头都被丢弃,使MELD具有与标准检测器相同的接口和成本。在公共RAID排行榜上,MELD是最强的开源检测器,与领先的商业模型竞争,特别是在对抗攻击和低FPR下。在标准的保留测试基准上,MELD匹配或优于监督基线。我们进一步引入MELD-eval,一个由四个主要LLM提供商最近发布的聊天模型构建的保留评估池。无需额外微调,MELD在MELD-eval上实现99.9%的TPR在1% FPR下,而许多基线急剧下降。

英文摘要

Large language models are now embedded in everyday writing workflows, making reliable AI-generated text detection important for academic integrity, content moderation, and provenance tracking. In practice, however, a detector must do more than achieve high aggregate AUROC on clean, in-distribution human and AI text: it should remain robust to attacks and adversarial rewrites, transfer to unseen generators and domains, and operate at low false-positive rates (FPR). Most existing detectors optimize a single AI/Human objective, giving the representation little incentive to learn generator, attack, or domain structure once the binary task saturates. We introduce MELD (Multi-Task Equilibrated Learning Detector), a deployable detector for AI-generated text that enriches binary detection with auxiliary supervision. MELD attaches generator-family, attack-type, and source-domain heads to a shared encoder, and balances the four losses with learned homoscedastic uncertainty weights. To improve robustness, an EMA teacher predicts on clean inputs while an attack-augmented student is distilled toward the teacher. MELD further uses a hard-negative pairwise ranking loss to enlarge the score margin between AI-generated texts and the most confusable human texts. At inference, all auxiliary heads are discarded, giving MELD the same interface and cost as a standard detector. On the public RAID leaderboard, MELD is the strongest open-source detector and is competitive with leading commercial models, especially under attack and at low FPR. Across standard held-out benchmarks, MELD matches or outperforms supervised baselines. We further introduce MELD-eval, a held-out evaluation pool built from recent chat models released by four major LLM providers. Without additional finetuning, MELD achieves 99.9% TPR at 1% FPR on MELD-eval, while many baselines degrade sharply.

2605.06902 2026-05-11 cs.LG

Streaming Adversarial Robustness in Fuzzy ARTMAP: Mechanism-Aligned Evaluation, Progressive Training, and Interpretable Diagnostics

流式对抗鲁棒性在模糊ARTMAP中的研究:机制对齐的评估、渐进训练和可解释的诊断

Shane Cairns, Leonardo Enzo Brito da Silva, Sasha Petrenko, Donald C. Wunsch, Jian Liu

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系) Kummer Institute Center for Artificial Intelligence and Autonomous Systems (KICAIAS)(人工智能与自主系统中心(KICAIAS)) Missouri University of Science and Technology(密苏里科技大学)

AI总结 本文研究模糊ARTMAP在流式学习中的对抗鲁棒性,提出WB-Softmax攻击代理,并展示渐进两阶段选择训练在无回放鲁棒性上的优势,同时通过显式类别几何实现可解释的诊断。

Comments 35 pages, 3 figures, 11 tables. Preprint submitted to Neural Networks

详情
AI中文摘要

对抗鲁棒性在离线深度网络中已广泛研究,但对严格单次流式神经学习器知之甚少。本文研究基于类别竞争、补码编码、匹配跟踪和无回放原型更新的模糊ARTMAP架构的对抗鲁棒性。我们引入WB-Softmax,一种与ARTMAP类别竞争和映射场预测机制对齐的可微白盒攻击代理,并正式化一个流式评估原则,要求鲁棒性在最终部署模型上评估。在四个图像基准上,WB-Softmax在原始模糊ARTMAP模型上达到89-100%的攻击成功率。我们展示防御排名在不同协议下可能反转:离线对抗训练在转移攻击下可能表现强劲,但在适应性白盒评估下崩溃,而渐进两阶段选择训练提供最强的无回放鲁棒性。我们进一步展示ART的显式类别几何可解释分离崩溃和匹配分数倒置。这些结果提供了一个机制对齐、协议感知的框架,用于流式原型学习器的对抗鲁棒性。

英文摘要

Adversarial robustness has been studied extensively for offline deep networks, but less is known about strict single-pass streaming neural learners. This paper studies adversarial robustness in Fuzzy ARTMAP, an Adaptive Resonance Theory architecture based on category competition, complement coding, match tracking, and replay-free prototype updates. We introduce WB-Softmax, a differentiable white-box attack surrogate aligned with ARTMAP's category-competition and map-field prediction mechanism, and formalize a streaming evaluation principle requiring robustness to be assessed on the final deployed model. Across four image benchmarks, WB-Softmax achieves 89-100% attack success on vanilla Fuzzy ARTMAP models. We show that defense rankings can reverse across protocols: offline adversarial training may appear strong under transfer attacks yet collapse under adaptive white-box evaluation, whereas progressive two-stage selective training provides the strongest overall replay-free robustness. We further show that ART's explicit category geometry enables interpretable diagnosis of separation collapse and match-score inversion. These results provide a mechanism-aligned, protocol-aware framework for adversarial robustness in streaming prototype-based learners.

2605.06901 2026-05-11 cs.CL

Reflections and New Directions for Human-Centered Large Language Models

人类中心大语言模型的反思与新方向

Caleb Ziems, Dora Zhao, Rose E. Wang, Matthew Jörke, Ahmad Rushdi, Advit Deepak, Sunny Yu, Anshika Agarwal, Harshvardhan Agarwal, Gabriela Aranguiz-Dias, Aditri Bhagirath, Justine Breuch, Huanxing Chen, Ruishi Chen, Sarah Chen, Haocheng Fan, William Fang, Cat Gonzales Fergesen, Daniel Frees, Tian Gao, Ziqing Huang, Vishal Jain, Yucheng Jiang, Kirill Kalinin, Su Doga Karaca, Arpandeep Khatua, Teland La, Isabelle Levent, Miranda Li, Xinling Li, Yongce Li, Angela Liu, Minsik Oh, Nathan J. Paek, Anthony Qin, Emily Redmond, Michael J. Ryan, Aadesh Salecha, Xiaoxian Shen, Pranava Singhal, Shashanka Subrahmanya, Mei Tan, Irawadee Thawornbut, Michelle Vinocour, Xiaoyue Wang, Zheng Wang, Henry Jin Weng, Pawan Wirawarn, Shirley Wu, Sophie Wu, Yichen Xie, Patrick Ye, Sean Zhang, Yutong Zhang, Cathy Zhou, Yiling Zhao, James Landay, Diyi Yang

发表机构 * Stanford University(斯坦福大学)

AI总结 本文提出人类中心大语言模型框架,整合NLP、HCI和负责任的AI视角,强调在模型开发全流程中关注人类需求与价值观,通过案例研究探讨HCLLMs的未来工作影响。

详情
AI中文摘要

大型语言模型(LLMs)正日益塑造用户的私人和专业生活,应用于商业、教育、金融、医疗、法律和科学领域。随着全球影响力的增长,迫切需要以优先考虑技术能力与人类优先事项的方式构建、评估和部署这些系统。本文提出人类中心大语言模型(HCLLMs)框架,整合自然语言处理(NLP)、人机交互(HCI)和负责任的AI视角。考虑语言模型的伦理、经济和技术目标,我们认为模型开发者需要在模型开发的每一个阶段,严谨而细致地处理人类关切、偏好、价值观和目标。本文为开发者提供从系统设计到数据采集、模型训练、评估和负责任部署的全方位人类中心洞察和建议。最后,通过案例研究应用这些洞察,探讨HCLLMs的未来工作影响。

英文摘要

Large Language Models (LLMs) are increasingly shaping the private and professional lives of users, with numerous applications in business, education, finance, healthcare, law, and science. With this rise in global influence comes greater urgency to build, evaluate, and deploy these systems in a manner that prioritizes not only technical capabilities but also human priorities. This work presents a framework for developing Human-Centered Large Language Models (HCLLMs), which integrates perspectives from Natural Language Processing (NLP), Human-Computer Interaction (HCI), and responsible AI. Considering the ethics, economics, and technical objectives of language modeling, we argue that model developers need to address human concerns, preferences, values, and goals, not only during a cursory post-training stage, but rather with rigor and care at every stage of the pipeline. This paper offers human-centered insights and recommendations for developers at each stage, from system design to data sourcing, model training, evaluation, and responsible deployment. Then we conclude with a case study, applying these insights to understand the future of work with HCLLMs.

2605.06898 2026-05-11 cs.AI

Self-Programmed Execution for Language-Model Agents

语言模型代理的自编程执行

Luke J. O'Connor

发表机构 * Harvard Medical School(哈佛医学院)

AI总结 本文提出自编程执行(SPE)架构,使语言模型自身成为调度器,无需固定策略。通过agentic machines概念,实现无固定轮次策略的状态转移,并引入Spell语言处理程序自我编辑与重评估,实验表明前沿模型可完成复杂代理任务。

详情
AI中文摘要

现有语言模型代理的核心是固定调度程序负责轮次间状态转移。本文引入自编程执行(SPE),一种代理架构中模型完成本身即为调度程序,Harness评估此程序但不强加自身调度策略。我通过agentic machines概念形式化这一思想:SPE状态是从其中模型完成可加载嵌入式机器任一状态,意味着无固定轮次调度策略。在实践中实现SPE非 trivial,因为相同数据既是模型上下文又是可执行程序。因此我引入基于Lisp的Spell语言,其中程序可编辑并重评自身,效果表达如模型调用被结构化,使得重评编辑后的程序不重放其副作用。使用未训练于SPE或Spell的现有模型进行实验,显示前沿模型可在该模式下运作并完成挑战性代理任务。这些结果展示了LM如何在无固定调度策略下充当代理,并提出问题:训练于自编程执行的模型可能学会何种自调度策略。代码见https://github.com/lukejoconnor/spell。

英文摘要

At the heart of existing language model agents is a fixed orchestrator program responsible for the state transition between consecutive turns. This paper introduces self-programmed execution (SPE), an agent architecture in which the model completion is itself the orchestrator program, and the harness evaluates this program but does not impose its own orchestration policy. I formalize this idea using agentic machines: an SPE state is one from which a model completion can load any state of an embedded copy of the machine, meaning that it is subject to no fixed turn-to-turn orchestration policy. Realizing SPE in practice is nontrivial because the same data is both model context and executable program. I therefore introduce Spell, a Lisp-based language in which programs can edit and re-evaluate themselves, and effectful expressions like model invocations are structured such that re-evaluating an edited program does not replay its side effects. Experiments with existing models, not trained for SPE or Spell, show that frontier models can operate in this regime and accomplish challenging agentic tasks. These results demonstrate how an LM can act as an agent without any fixed orchestration policy, and they raise the question of what self-orchestration strategies might be learned by a model trained for self-programmed execution. Code is available at https://github.com/lukejoconnor/spell .

2605.06897 2026-05-11 cs.CL cs.AI cs.HC cs.MM cs.SD eess.AS

MIST: Multimodal Interactive Speech-based Tool-calling Conversational Assistants for Smart Homes

MIST:面向智能家居的多模态交互语音工具调用对话助手

Maximillian Chen, Xuanming Zhang, Michael Peng, Zhou Yu, Alexandros Papangelis, Yohan Jo

发表机构 * Columbia University(哥伦比亚大学) Seoul National University(首尔国立大学)

AI总结 本文提出MIST数据集,用于研究具备物理世界约束推理能力的多模态语音助手,发现开放和闭源多模态LLM在该任务上存在显著差距。

Comments Project Page: https://billyzhang24kobe.github.io/mist-smarthome/

详情
AI中文摘要

物联网设备在物理世界的兴起要求具备处理复杂用户体验的语音界面。尽管现代大语言模型已展示出强大的工具使用能力,但建模现实中的物联网设备仍是一个困难且研究不足的挑战,它结合了空间时间约束建模、语音输入、动态状态跟踪和混合主动交互模式。我们引入MIST(多模态交互语音工具调用数据集),一个基于物联网设备的合成多轮语音驱动代码生成任务。我们发现开放和闭源多模态LLM在MIST上存在显著差距,且即使前沿闭源LLM仍有较大提升空间。我们发布MIST和一个可扩展的数据生成框架,以构建相关数据集,促进混合主动语音助手的研究,这些助手能推理物理世界约束。

英文摘要

The rise of Internet of Things (IoT) devices in the physical world necessitates voice-based interfaces capable of handling complex user experiences. While modern Large Language Models (LLMs) already demonstrate strong tool-usage capabilities, modeling real-world IoT devices presents a difficult, understudied challenge which combines modeling spatiotemporal constraints with speech inputs, dynamic state tracking, and mixed-initiative interaction patterns. We introduce MIST (the Multimodal Interactive Speech-based Tool-calling Dataset), a synthetic multi-turn, voice-driven code generation task that operates over IoT devices. We find that there is a significant gap between open- and closed-weight multimodal LLMs on MIST, and that even frontier closed-weight LLMs have substantial headroom. We release MIST and an extensible data generation framework to build related datasets in order to facilitate research on mixed-initiative voice assistants which reason about physical world constraints.

2605.06895 2026-05-11 cs.AI

Mitigating Cognitive Bias in RLHF by Altering Rationality

通过调整理性度来缓解强化学习中的人类反馈认知偏差

Tiffany Horter, Andrew Markham, Niki Trigoni, Serena Booth

发表机构 * University of Oxford(牛津大学) Brown University(布朗大学)

AI总结 本文提出通过动态调整理性参数beta来缓解人类反馈中的认知偏差,提升奖励学习模型的理性度,即使在存在强烈偏见的偏好数据集上也能获得更理性的下游模型。

详情
AI中文摘要

如何使模型对即使不完美的反馈也具有鲁棒性?在人类反馈强化学习(RLHF)中,人类偏好用于训练奖励模型,该模型对响应分配标量值。由于这些奖励是通过成对比较推断得出的,这种学习依赖于假设的潜在奖励差异与观察到的偏好之间的关系,通常使用Boltzmann公式建模,其中理性参数beta决定了偏好反映奖励差异的一致性程度。在实践中,beta通常被视为固定常数,反映假设的均匀注释可靠性。然而,人类反馈在实践中并不那么简单:真实的人类判断受认知偏差影响,导致系统性偏离奖励一致行为,这在上下文中出现。为了解决这个问题,我们将理性度视为上下文和注释依赖的。我们设计了一种方法,在奖励学习过程中使用LLM-as-judge动态调整理性参数beta,以评估认知偏差的可能存在。这种方法有效降低了可能反映偏见或不可靠判断的比较权重。实证上,我们表明这种方法学习了一个更理性的下游模型,即使在具有强烈偏见的偏好数据集上进行微调时也是如此。

英文摘要

How can we make models robust to even imperfect human feedback? In reinforcement learning from human feedback (RLHF), human preferences over model outputs are used to train a reward model that assigns scalar values to responses. Because these rewards are inferred from pairwise comparisons, this learning depends on an assumed relationship between latent reward differences and observed preferences, typically modeled using a Boltzmann formulation in which a rationality parameter beta informs how consistently preferences reflect reward differences. In practice, beta is typically treated as a fixed constant that reflects assumed uniform annotator reliability. However, human feedback is not this simplistic in practice: real human judgments are shaped by cognitive biases, leading to systematic deviations from reward-consistent behavior that arise contextually. To address this, we treat rationality as context- and annotation-dependent. We design an approach to dynamically adjust the rationality parameter beta during reward learning using an LLM-as-judge to assess the likely presence of cognitive biases. This approach effectively downweights comparisons that are likely to reflect biased or unreliable judgments. Empirically, we show that this approach learns a more rational downstream model, even when finetuning on datasets with strongly biased preferences.

2605.06892 2026-05-11 cs.CV

Not All Tokens Need 40 Steps: Heterogeneous Step Allocation in Diffusion Transformers for Efficient Video Generation

并非所有标记都需要40步:扩散变换器中的异质步分配用于高效视频生成

Ernie Chu, Vishal M. Patel

发表机构 * Johns Hopkins University(约翰霍普金斯大学)

AI总结 本文提出HSA算法,通过根据速度动态分配不同时间空间标记的步数预算,提升视频生成效率,实验表明在加速运行时表现优于现有方法。

Comments Project page: https://ernestchu.github.io/hsa

详情
AI中文摘要

扩散变换器(DiTs)在视频生成质量上达到最先进的水平,但标准推理对序列中的每个标记应用相同的去噪步骤数导致计算成本极高。本文引入异质步分配(HSA),一种无需训练的推理算法,根据速度动态为不同时间空间标记分配不同的步数预算。为了解决由此产生的序列长度不匹配问题而不牺牲全局上下文,HSA引入了一种KV缓存同步机制,允许活跃标记能够参加完整序列的注意力计算,同时完全绕过不活跃标记。此外,我们推导出一种缓存欧拉更新,通过单次操作推进跳过标记的潜在状态,而无需额外的模型评估。我们在Wan-2和LTX-2模型上评估了HSA用于文本到视频(T2V)和图像到视频(I2V)生成。结果表明,HSA在激进加速条件下(如50%和25%的运行时间)显著优于先前的最先进缓存方法和 vanilla Flow Matching 基线。关键的是,HSA在无需昂贵的离线分析的情况下实现了更优的质量-运行时间帕累托前沿,即使在计算预算紧张的情况下也能稳健地保持结构完整性和生成质量。

英文摘要

Diffusion Transformers (DiTs) have achieved state-of-the-art video generation quality, but they incur immense computational cost because standard inference applies the same number of denoising steps uniformly to every token in the sequence. It is well known that human vision ignores vast amounts of redundant motion. Why, then, do our densest models treat every spatiotemporal token with equal priority? In this paper, we introduce Heterogeneous Step Allocation (HSA), a training-free inference algorithm that assigns varying step budgets to different spatiotemporal tokens based on their velocity dynamics. To resolve the resulting sequence-length mismatch without sacrificing global context, HSA introduces a KV-cache synchronization mechanism that allows active tokens to attend to the full sequence while entirely bypassing inactive tokens. Furthermore, we derive a cached Euler update that advances the latent states of skipped tokens in a single operation without additional model evaluations. We evaluate HSA on the Wan-2 and LTX-2 models for both text-to-video (T2V) and image-to-video (I2V) generation. Our results demonstrate that HSA significantly outperforms previous state-of-the-art caching methods and the vanilla Flow Matching baseline, especially at aggressive acceleration regimes (e.g., 50% and 25% runtimes). Crucially, HSA achieves a superior quality-runtime Pareto frontier without the need for expensive offline profiling, robustly preserving structural integrity and generation quality even under tight computational budgets. Project page: https://ernestchu.github.io/hsa