arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2367
专题追踪
2604.14889 2026-05-29 cs.AI

MemoSight: Unifying Context Compression and Multi Token Prediction for Reasoning Acceleration

MemoSight: 统一上下文压缩与多令牌预测以加速推理

Xinyu Liu, Xin Liu, Bo Jin, Runsong Zhao, Pengcheng Huang, Junhao Ruan, Bei Li, Chunyang Xiao, Chenglong Wang, Tong Xiao, Jingbo Zhu

发表机构 * School of Computer Science and Engineering, Northeastern University, China(东北大学计算机科学与工程学院) Meituan Inc.(美团公司) NiuTrans Research, Shenyang, China(牛译研所)

AI总结 提出 MemoSight 框架,通过特殊令牌和位置布局统一上下文压缩与多令牌预测,在保持思维链推理性能的同时减少 KV 缓存使用并提升推理速度。

详情
AI中文摘要

虽然思维链推理使大型语言模型能够解决具有挑战性的推理任务,但 KV 缓存的线性增长导致了大量的内存和推理开销。现有方法如上下文压缩和多令牌预测通过压缩历史令牌和并行生成未来令牌两个互补方向提高效率。然而,由于它们不同的训练范式和架构假设,有效结合它们仍然具有挑战性。在这项工作中,我们提出 MemoSight(基于记忆与前瞻的推理),一个统一框架,集成了上下文压缩和多令牌预测,以提高推理效率同时保持思维链性能。MemoSight 采用基于特殊令牌和令牌特定位置布局的共享极简设计,用于压缩和并行预测。在四个推理基准上的实验表明,与普通 SFT 基线相比,MemoSight 将 KV 缓存使用减少高达 66%,推理速度提升 56%,同时平均推理准确率下降不到 3%,相比现有的思维链压缩方法实现了更好的效率-准确率权衡。

英文摘要

While chain-of-thought (CoT) reasoning enables LLMs to solve challenging reasoning tasks, the linear growth of the KV cache leads to substantial memory and inference overhead. Existing approaches such as context compression and multi-token prediction (MTP) improve efficiency from two complementary directions by compressing historical tokens and generating future tokens in parallel. However, effectively combining them remains challenging due to their different training paradigms and architectural assumptions. In this work, we propose MemoSight (Memory-Foresight-Based Reasoning), a unified framework that integrates context compression and MTP to improve inference efficiency while preserving CoT performance. MemoSight adopts a shared minimalist design based on special tokens and token-specific positional layouts for both compression and parallel prediction. Experiments on four reasoning benchmarks show that, compared to the vanilla SFT baseline, MemoSight reduces KV cache usage by up to 66% and improves inference speed by 56%, while incurring less than a 3% drop in average reasoning accuracy, yielding a better efficiency-accuracy trade-off than existing CoT compression methods.

2604.13197 2026-05-29 cs.CL

Unleashing Implicit Rewards: Prefix-Value Learning for Distribution-Level Optimization

释放隐式奖励:前缀值学习用于分布级优化

Shiping Gao, Hongzhan Chen, Xiaojun Quan, Qifan Wang, Lifu Huang

发表机构 * Sun Yat-sen University(中山大学) Shenzhen Loop Area Institute(深圳环 Area 研究院) Meta AI University of California, Davis(加州大学戴维斯分校)

AI总结 提出隐式前缀值奖励模型(IPVRM)直接学习每个前缀的正确概率,并通过时序差分差异获得步骤信号,解决训练与推理不匹配问题;进一步引入分布级强化学习(DistRL)利用前缀值进行密集反事实更新,提升推理性能。

详情
AI中文摘要

过程奖励模型(PRM)为推理提供细粒度监督,但可靠的PRM通常需要步骤标注或繁重的验证流水线,使得它们在在线RL中扩展和刷新成本高昂。隐式PRM通过从轨迹级结果标签训练对数似然比奖励来降低这一成本。然而,对数比率在训练期间仅作为序列级聚合被约束,而推理时将其分解为部分前缀的token级或步骤级分数。这种训练-推理不匹配导致局部信用识别薄弱,因此分布级评分可能放大误导性优势。我们提出隐式前缀值奖励模型(IPVRM),直接从结果标签学习每个前缀最终正确的概率。然后通过连续前缀值之间的时序差分(TD)差异获得步骤信号,使训练目标与推理时使用对齐。IPVRM显著提高了ProcessBench上的步骤验证F1分数。为了在策略优化中利用这些前缀值,我们进一步引入分布级强化学习(DistRL),它将TD优势应用于采样token和高概率候选token,无需额外rollout即可提供密集反事实更新。实验表明,DistRL与不可靠隐式奖励结合时收益有限,但与IPVRM配对时持续改善下游推理。我们的方法实现可在https://github.com/gaoshiping/IPVRM获取。

英文摘要

Process reward models (PRMs) provide fine-grained supervision for reasoning, but reliable PRMs often require step annotations or heavy verification pipelines, making them costly to scale and refresh during online RL. Implicit PRMs reduce this cost by training log-likelihood-ratio rewards from trajectory-level outcome labels. However, the log-ratio is constrained only as a sequence-level aggregate during training, while inference decomposes it into token- or step-level scores for partial prefixes. This train-inference mismatch leaves local credits weakly identified, so distribution-wide scoring can amplify misleading advantages. We propose Implicit Prefix-Value Reward Model (IPVRM), which directly learns the probability of eventual correctness for each prefix from outcome labels. Step signals are then obtained as temporal-difference (TD) differences between consecutive prefix values, aligning the training target with inference-time use. IPVRM markedly improves step-verification F1 on ProcessBench. To exploit these prefix values during policy optimization, we further introduce Distribution-Level RL (DistRL), which applies TD advantages to both sampled tokens and high-probability candidate tokens, providing dense counterfactual updates without additional rollouts. Experiments show that DistRL brings limited gains with unreliable implicit rewards, but consistently improves downstream reasoning when paired with IPVRM. The implementation of our method is available at https://github.com/gaoshiping/IPVRM .

2604.10511 2026-05-29 cs.AI cs.CL

Thinking Fast, Thinking Wrong: Intuitiveness Modulates LLM Counterfactual Reasoning in Policy Evaluation

快思考,错思考:直觉性调节LLM在政策评估中的反事实推理

Yanjie He

发表机构 * Independent Researcher(独立研究者)

AI总结 本研究构建了一个基于经济学和社会科学实证案例的基准,通过8000次实验评估大型语言模型在政策评估中的反事实推理,发现链式思维提示在反直觉案例中效果显著减弱,且直觉性是主导因素,表明模型存在知识-推理分离。

Comments 10 pages, 6 figures, 6 tables

详情
AI中文摘要

大型语言模型(LLM)越来越多地用于因果和反事实推理,但它们在现实世界政策评估中的可靠性仍未得到充分探索。我们构建了一个包含40个实证政策评估案例的基准,这些案例来自经济学和社会科学,每个案例都基于同行评审的证据,并根据直觉性进行分类——即实证结果是否符合(明显)、相对于(模糊)或违背(反直觉)常见的先验预期。我们评估了四个前沿LLM,采用五种提示策略,进行了8000次实验试验,并使用混合效应逻辑回归分析结果。我们的发现揭示了三个关键结果:(1)链式思维(CoT)悖论,即链式思维提示在明显案例上显著提升性能,但在反直觉案例上这种收益大幅减弱(交互OR = 0.278,p < 0.001);(2)直觉性是主导因素,案例层面的方差超过模型选择或提示策略(ICC = 0.671);(3)知识-推理分离,基于引用的熟悉度与准确性无关(p = 0.84),表明模型拥有相关知识,但当结果与直觉相悖时无法利用这些知识进行推理。我们通过双过程理论(系统1与系统2)的视角来框架这些结果,并认为当前LLM的“慢思考”仅实现了对直觉先验的部分抑制——产生了深思熟虑推理的形式,但未能完全实现其实质。

英文摘要

Large language models (LLMs) are increasingly used for causal and counterfactual reasoning, yet their reliability in real-world policy evaluation remains underexplored. We construct a benchmark of 40 empirical policy evaluation cases drawn from economics and social science, each grounded in peer-reviewed evidence and classified by intuitiveness -- whether the empirical finding aligns with (obvious), is unclear relative to (ambiguous), or contradicts (counter-intuitive) common prior expectations. We evaluate four frontier LLMs across five prompting strategies with 8,000 experimental trials and analyze the results using mixed-effects logistic regression. Our findings reveal three key results: (1) a chain-of-thought (CoT) paradox, where chain-of-thought prompting dramatically improves performance on obvious cases but this benefit is substantially attenuated on counter-intuitive ones (interaction OR = 0.278, $p < 0.001$); (2) intuitiveness as the dominant factor, with case-level variance exceeding that of model choice or prompting strategy (ICC = 0.671); and (3) a knowledge-reasoning dissociation, where citation-based familiarity is unrelated to accuracy ($p = 0.84$), suggesting models possess relevant knowledge but fail to reason with it when findings contradict intuition. We frame these results through the lens of dual-process theory (System 1 vs. System 2) and argue that current LLMs' "slow thinking" achieves only partial inhibition of intuitive priors -- producing the form of deliberative reasoning without fully delivering its substance.

2604.10228 2026-05-29 cs.AI

SVSR: A Self-Verification and Self-Rectification Paradigm for Multimodal Reasoning

SVSR:一种用于多模态推理的自我验证与自我修正范式

Zhe Qian, Nianbing Su, Zhonghua Wang, Hebei Li, Zhongxing Xu, Yueying Li, Fei Luo, Zhuohan Ouyang, Yanbiao Ma

发表机构 * South China Agricultural University(华南农业大学) University of Glasgow(格拉斯哥大学) University of Electronic Science and Technology of China(电子科技大学) Monash University(莫纳什大学) University of Science and Technology of China(中国科学技术大学) National University of Defense Technology(国防科技大学) Renmin University of China(中国人民大学) South China Normal University(华南师范大学)

AI总结 提出SVSR框架,通过三阶段训练(统一偏好数据集构建、冷启动监督微调、半在线直接偏好优化)将自我验证与自我修正显式集成到多模态推理流程中,提升复杂视觉理解和多模态推理的鲁棒性与可靠性。

详情
AI中文摘要

当前多模态模型常存在浅层推理问题,导致因不完整或不一致的思维过程而产生错误。为解决这一局限,我们提出自我验证与自我修正(SVSR)统一框架,将自我验证和自我修正显式集成到模型的推理流程中,显著提升复杂视觉理解和多模态推理任务的鲁棒性与可靠性。SVSR基于一种新颖的三阶段训练范式。首先,通过精炼预训练视觉语言模型的推理轨迹,结合前向和后向推理嵌入自我反思信号,构建高质量统一偏好数据集。其次,在该数据集上进行冷启动监督微调,学习结构化、多步推理行为。第三,应用半在线直接偏好优化(Semi-online DPO)过程,通过强大的教师VLM筛选的高质量模型生成推理轨迹持续增强训练语料。该流程使模型能够学习、激发并精炼其自我验证与自我修正能力。跨多个基准的广泛实验表明,SVSR提升了推理准确性,并增强了对未见任务和问题类型的泛化能力。值得注意的是,经过显式自我反思推理训练后,模型还展现出改进的隐式推理能力,即使在没有显式推理轨迹的情况下也优于强基线。这些结果凸显了SVSR在构建更可靠、内省且认知对齐的多模态系统方面的潜力。

英文摘要

Current multimodal models often suffer from shallow reasoning, leading to errors caused by incomplete or inconsistent thought processes. To address this limitation, we propose Self-Verification and Self-Rectification (SVSR), a unified framework that explicitly integrates self-verification and self-rectification into the model's reasoning pipeline, substantially improving robustness and reliability in complex visual understanding and multimodal reasoning tasks. SVSR is built on a novel three-stage training paradigm. First, we construct a high-quality unified preference dataset by refining reasoning traces from pre-trained vision-language models, incorporating both forward and backward reasoning to embed self-reflective signals. Second, we perform cold-start supervised fine-tuning on this dataset to learn structured, multi-step reasoning behaviors. Third, we apply a Semi-online Direct Preference Optimization (Semi-online DPO) process, continuously augmenting the training corpus with high-quality, model-generated reasoning traces filtered by a powerful teacher VLM. This pipeline enables the model to learn, elicit, and refine its ability to self-verify and self-rectify. Extensive experiments across diverse benchmarks demonstrate that SVSR improves reasoning accuracy and enables stronger generalization to unseen tasks and question types. Notably, once trained with explicit self-reflective reasoning, the model also exhibits improved implicit reasoning ability, outperforming strong baselines even when no explicit reasoning traces are provided. These results highlight the potential of SVSR for building more dependable, introspective, and cognitively aligned multimodal systems.

2604.10219 2026-05-29 cs.AI

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

认知支点与视觉锚定:揭示并纠正多模态推理模型中的幻觉

Zhe Qian, Yanbiao Ma, Zhuohan Ouyang, Zhonghua Wang, Zhongxing Xu, Fei Luo, Xinyu Liu, Zongyuan Ge, Yike Guo, Jungong Han

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China(中国人民大学 龙城人工智能学院) South China Agricultural University(华南农业大学) South China Normal University(华南师范大学) Monash University(莫纳什大学) Jishou University(吉首大学) Hong Kong University of Science and Technology(香港科技大学) Tsinghua University(清华大学)

AI总结 针对多模态大推理模型在长链推理中易产生幻觉的问题,提出V-STAR训练范式,通过分层视觉注意力奖励和强制反思机制,将视觉锚定引入推理过程以减轻幻觉。

Comments TPAMI under review

详情
AI中文摘要

多模态大推理模型(MLRMs)通过测试时计算扩展在视觉推理方面取得了显著进展,但长链推理仍然容易出现幻觉。我们识别出一个称为“推理视觉真相脱节”(RVTD)的令人担忧的现象:幻觉与认知分叉点高度相关,这些分叉点通常表现出高熵状态。我们将这种脆弱性归因于视觉语义锚定的崩溃,这种崩溃位于网络中间层;具体来说,在这些高不确定性过渡期间,模型未能查询视觉证据,而是退回到语言先验。因此,我们主张从仅关注结果层面的监督转向增加细粒度的内部注意力引导。为此,我们提出V-STAR(视觉结构训练与注意力强化),一种轻量级、整体的训练范式,旨在内化视觉感知的推理能力。我们方法的核心是分层视觉注意力奖励(HVAR),集成在GRPO框架内。在检测到高熵状态时,该机制动态激励关键中间层的视觉注意力,从而将推理过程锚定回视觉输入。此外,我们引入了强制反思机制(FRM),一种轨迹编辑策略,通过在高熵认知分叉点触发反思并鼓励后续步骤与视觉输入进行验证,从而打破认知惯性,将外部去偏干预转化为减轻幻觉的内在能力。

英文摘要

Multimodal Large Reasoning Models (MLRMs) have achieved remarkable strides in visual reasoning through test time compute scaling, yet long chain reasoning remains prone to hallucinations. We identify a concerning phenomenon termed the Reasoning Vision Truth Disconnect (RVTD): hallucinations are strongly correlated with cognitive bifurcation points that often exhibit high entropy states. We attribute this vulnerability to a breakdown in visual semantic anchoring, localized within the network's intermediate layers; specifically, during these high uncertainty transitions, the model fails to query visual evidence, reverting instead to language priors. Consequently, we advocate a shift from solely outcome level supervision to augmenting it with fine grained internal attention guidance. To this end, we propose V-STAR (Visual Structural Training with Attention Reinforcement), a lightweight, holistic training paradigm designed to internalize visually aware reasoning capabilities. Central to our approach is the Hierarchical Visual Attention Reward (HVAR), integrated within the GRPO framework. Upon detecting high entropy states, this mechanism dynamically incentivizes visual attention across critical intermediate layers, thereby anchoring the reasoning process back to the visual input. Furthermore, we introduce the Forced Reflection Mechanism (FRM), a trajectory editing strategy that disrupts cognitive inertia by triggering reflection around high entropy cognitive bifurcation points and encouraging verification of subsequent steps against the visual input, thereby translating external debiasing interventions into an intrinsic capability for hallucination mitigation.

2604.00789 2026-05-29 cs.CL

Valency Classification of Mapudungun Verbal Roots. Established by the language's own morphotactics

马普切语动词词根的配价分类:基于该语言自身形态句法规则

Andrés Chandía

发表机构 * Department of Catalan Philology and General Linguistics University of Barcelona(加泰罗尼亚语言学与一般语言学系巴塞罗那大学)

AI总结 本文利用马普切语自身的形态句法规则,通过分析后缀与词根或动词词干的允许和限制组合,对已确认为动词的词根进行配价分类,旨在改进形态分析器并促进对马普切语动词配价问题的理解。

Comments 37 pages

详情
AI中文摘要

在先前的工作中,我们对被识别为动词的词根进行了词汇(重新)分类——或确认其给定类别——以准确确定其原始类别。在此基础上,本文利用马普切语自身的形态句法规则,具体通过考察马普切动词形式中各种后缀与词根或动词词干的允许和限制组合,对已确认为动词的马普切语词根进行了配价分类。与迄今为止的所有工作一样,本文呈现的结果旨在改进形态分析器(Dungupeyum),将所有经过验证的发现纳入系统。从理论角度来看,我们也希望有助于认识和理解与马普切语动词形式配价相关的问题。

英文摘要

In the previous work, a lexical (re)categorisation -- or confirmation of the given category -- of roots identified as verbal was undertaken to determine their original category accurately. Building on this, the present paper offers an account of the valency classification of those Mapudungun roots confirmed to be verbal, using the language's own morphotactics; specifically, by examining the permissible and restricted combinations of various suffixes with roots or verbal stems in the Mapuche verb form. As with all work conducted thus far, the results presented here aim to improve the morphological analyser (Dungupeyum) with all verified findings incorporated into the system. From a theoretical perspective, we also hope to contribute to the recognition and understanding of issues related to the valency of Mapuche verb forms.

2603.29954 2026-05-29 cs.CV

Detecting Unknown Objects via Energy-based Separation for Open World Object Detection

基于能量分离的未知物体检测用于开放世界目标检测

Jun-Woo Heo, Keonhee Park, Gyeong-Moon Park

发表机构 * Korea University, South Korea(韩国大学) Seoul National University, South Korea(首尔国立大学)

AI总结 提出DEUS框架,通过等角紧框架子空间未知分离和基于能量的已知区分损失,解决开放世界目标检测中未知物体检测和类别遗忘问题。

Comments 8 pages, Accepted at CVPR 2026

详情
AI中文摘要

在这项工作中,我们解决了开放世界目标检测(OWOD)问题。这一具有挑战性的场景要求检测器在不遗忘的情况下增量学习分类已知物体,同时在没有监督的情况下识别未知物体。先前的OWOD方法增强了未知发现过程,并采用记忆重放来缓解灾难性遗忘。然而,由于现有方法严重依赖检测器的已知类别预测来检测未知物体,它们难以有效学习和识别未知物体表示。此外,虽然记忆重放缓解了旧类别的遗忘,但往往牺牲了新学习类别的知识。为了解决这些限制,我们提出了DEUS(基于能量分离的未知检测),这是一个新颖的框架,应对开放世界目标检测的挑战。DEUS由等角紧框架(ETF)-子空间未知分离(EUS)和基于能量的已知区分(EKD)损失组成。EUS利用基于ETF的几何特性创建正交子空间,从而实现已知和未知物体表示的更干净分离。与仅考虑已知空间的先前基于能量的方法不同,EUS利用两个空间的能量来更好地捕捉未知物体的独特模式。此外,EKD损失强制先前和当前分类器之间的分离,从而在记忆重放期间最小化先前和新学习类别之间的知识干扰。我们在OWOD基准上彻底验证了DEUS,展示了在未知检测方面的显著性能改进,同时保持竞争力的已知类别性能。

英文摘要

In this work, we tackle the problem of Open World Object Detection (OWOD). This challenging scenario requires the detector to incrementally learn to classify known objects without forgetting while identifying unknown objects without supervision. Previous OWOD methods have enhanced the unknown discovery process and employed memory replay to mitigate catastrophic forgetting. However, since existing methods heavily rely on the detector's known class predictions for detecting unknown objects, they struggle to effectively learn and recognize unknown object representations. Moreover, while memory replay mitigates forgetting of old classes, it often sacrifices the knowledge of newly learned classes. To resolve these limitations, we propose DEUS (Detecting Unknowns via energy-based Separation), a novel framework that addresses the challenges of Open World Object Detection. DEUS consists of Equiangular Tight Frame (ETF)-Subspace Unknown Separation (EUS) and an Energy-based Known Distinction (EKD) loss. EUS leverages ETF-based geometric properties to create orthogonal subspaces, enabling cleaner separation between known and unknown object representations. Unlike prior energy-based approaches that consider only the known space, EUS utilizes energies from both spaces to better capture distinct patterns of unknown objects. Furthermore, EKD loss enforces the separation between previous and current classifiers, thus minimizing knowledge interference between previous and newly learned classes during memory replay. We thoroughly validate DEUS on OWOD benchmarks, demonstrating outstanding performance improvements in unknown detection while maintaining competitive known class performance.

2603.27667 2026-05-29 cs.SD cs.AI

EvA: An Evidence-First Audio Understanding Paradigm for LALMs

EvA: 一种面向LALM的以证据为先的音频理解范式

Xinyuan Xie, Shunian Chen, Zhiheng Liu, Yuhao Zhang, Zhiqiang Lv, Liyin Liang, Benyou Wang

发表机构 * The Chinese University of Hong Kong, Shenzhen(香港中文大学(深圳)) Didi Chuxing(滴滴出行)

AI总结 提出EvA双路径架构,通过分层聚合和非压缩时间对齐融合增强声学证据保留,并在统一零样本协议下在MMAU、MMAR和MMSU上取得最佳开源感知结果,支持以证据为先的假设。

详情
AI中文摘要

大型音频语言模型(LALM)在复杂声学场景中仍然存在困难,因为它们往往在推理开始前未能保留与任务相关的声学证据。我们将这种错误模式识别为证据瓶颈:最先进的系统在声学证据提取方面的缺陷大于下游推理,这表明上游感知通常是限制因素。为了解决这个问题,我们提出了EvA(以证据为先的音频),一种双路径架构,通过分层聚合和非压缩、时间对齐融合来增强声学证据保留。我们还构建了EvA-Perception,一个大规模训练集,包含约54K个事件排序描述和500K个基于证据的问答对。在统一的零样本协议下,EvA在MMAU、MMAR和MMSU上取得了最佳开源感知结果,在感知密集型分割上增益最大。对开放描述的人工评估进一步显示了改进的细粒度声学覆盖和描述质量。这些结果支持以证据为先的假设:更强的音频理解依赖于在推理前保留声学证据。项目地址:https://satsuki2486441738.github.io/EvA/。

英文摘要

Large Audio Language Models (LALMs) still struggle in complex acoustic scenes because they often fail to preserve task-relevant acoustic evidence before reasoning begins. We identify this error pattern as the evidence bottleneck: state-of-the-art systems show larger deficits in acoustic evidence extraction than in downstream reasoning, suggesting that upstream perception is often the limiting factor. To address this problem, we propose EvA (Evidence-First Audio), a dual-path architecture that enhances acoustic evidence preservation through hierarchical aggregation and non-compressive, time-aligned fusion. We also build EvA-Perception, a large-scale training set with about 54K event-ordered captions and 500K evidence-grounded QA pairs. Under a unified zero-shot protocol, EvA achieves the best open-source \emph{Perception} results on MMAU, MMAR, and MMSU, with the largest gains on perception-heavy splits. Human evaluation on open-ended captioning further shows improved fine-grained acoustic coverage and caption quality. These results support the evidence-first hypothesis: stronger audio understanding depends on preserving acoustic evidence before reasoning. Project can be found at https://satsuki2486441738.github.io/EvA/.

2603.23853 2026-05-29 cs.AI cs.MA

SCoOP: Semantic Consistent Opinion Pooling for Uncertainty Quantification in Multiple Vision-Language Model Systems

SCoOP: 多视觉-语言模型系统中用于不确定性量化的语义一致意见池化

Chung-En Johnny Yu, Brian Jalaian, Nathaniel D. Bastian

发表机构 * University of West Florida(西佛罗里达大学) United States Military Academy(美国军事学院)

AI总结 提出SCoOP框架,通过不确定性加权的线性意见池化聚合多个视觉-语言模型的输出,实现无训练的不确定性量化,有效检测幻觉并支持高不确定性样本的弃权。

Comments Accepted to ICLR 2026 Workshop on Agentic AI in the Wild: From Hallucinations to Reliable Autonomy

详情
AI中文摘要

结合多个视觉-语言模型(VLM)可以增强多模态推理和鲁棒性,但聚合异构模型的输出会放大不确定性并增加幻觉风险。我们提出SCoOP(语义一致意见池化),一种无需训练的不确定性量化(UQ)框架,通过不确定性加权的线性意见池化用于多VLM系统。核心思想是将每个VLM视为概率“专家”,采样多个输出,映射到统一空间,聚合它们的意见,并产生系统级不确定性分数。与先前为单模型设计的UQ方法不同,SCoOP明确测量跨多个VLM的集体系统级不确定性,从而实现对高不确定性样本的有效幻觉检测和弃权。在ScienceQA上,SCoOP在幻觉检测中实现了0.866的AUROC,优于基线(0.732-0.757)约10-13%。对于弃权,它达到了0.907的AURAC,超过基线(0.818-0.840)7-9%。尽管有这些提升,SCoOP相对于基线仅引入微秒级的聚合开销,与典型的VLM推理时间(秒级)相比微不足道。这些结果表明,SCoOP为不确定性感知聚合提供了一种高效且原则性的机制,推动了多模态AI系统的可靠性。我们的代码公开于https://github.com/chungenyu6/SCoOP。

英文摘要

Combining multiple Vision-Language Models (VLMs) can enhance multimodal reasoning and robustness, but aggregating heterogeneous models' outputs amplifies uncertainty and increases the risk of hallucinations. We propose SCoOP (Semantic-Consistent Opinion Pooling), a training-free uncertainty quantification (UQ) framework for multi-VLM systems through uncertainty-weighted linear opinion pooling. The core idea is to treat each VLM as a probabilistic "expert," sample multiple outputs, map them to a unified space, aggregate their opinions, and produce a system-level uncertainty score. Unlike prior UQ methods designed for single models, SCoOP explicitly measures collective, system-level uncertainty across multiple VLMs, enabling effective hallucination detection and abstention for highly uncertain samples. On ScienceQA, SCoOP achieves an AUROC of 0.866 for hallucination detection, outperforming baselines (0.732-0.757) by approximately 10-13%. For abstention, it attains an AURAC of 0.907, exceeding baselines (0.818-0.840) by 7-9%. Despite these gains, SCoOP introduces only microsecond-level aggregation overhead relative to the baselines, which is trivial compared to typical VLM inference time (on the order of seconds). These results demonstrate that SCoOP provides an efficient and principled mechanism for uncertainty-aware aggregation, advancing the reliability of multimodal AI systems. Our code is publicly available at https://github.com/chungenyu6/SCoOP.

2603.23234 2026-05-29 cs.AI cs.LG

MemCollab: Cross-Model Memory Collaboration via Contrastive Trajectory Distillation

MemCollab:通过对比轨迹蒸馏实现跨模型记忆协作

Yurui Chang, Yiran Wu, Qingyun Wu, Lu Lin

发表机构 * Pennsylvania State University(宾夕法尼亚州立大学) AG2AI

AI总结 针对不同骨干模型代理间共享记忆性能下降的问题,提出MemCollab框架,通过对比同一任务上不同模型生成的推理轨迹来蒸馏共享的抽象推理约束,并引入任务感知检索机制,提升异构代理的准确性和推理效率。

详情
AI中文摘要

LLM代理越来越依赖记忆机制来重用过去问题解决经验中的知识。然而,现有方法通常为单个代理构建记忆,并与同一底层模型重用,将存储的知识紧密耦合到特定模型的推理风格。在异构部署中,代理可能使用不同大小、架构或专业化的骨干模型实例化,这引发了一个关键问题:一个单一的记忆系统能否在不同骨干模型的代理之间共享?我们发现,简单的跨模型记忆传输可能会降低性能,因为存储的记忆常常将任务相关知识纠缠到模型特定的偏见中。为了解决这一挑战,我们提出了MemCollab,一个协作记忆框架,通过对比不同基于模型的代理在同一任务上生成的推理轨迹来构建共享的跨模型记忆。通过这一对比过程,MemCollab蒸馏出捕获共享任务级不变量的抽象推理约束,同时抑制模型特定的伪影。我们进一步引入了一种任务感知检索机制,根据任务类别调节记忆访问,确保在推理时只检索相关的约束。在数学推理和代码生成基准上的实验表明,MemCollab在不同代理(包括不同模型族设置)上一致地提高了准确性和推理效率。这些结果表明,协作构建的跨模型记忆可以作为异构基于LLM的代理的共享推理资源。

英文摘要

LLM agents increasingly rely on memory mechanisms to reuse knowledge from past problem-solving experiences. However, existing methods typically construct memory for a single agent and reuse it with the same underlying model, tightly coupling stored knowledge to model-specific reasoning styles. In heterogeneous deployments, where agents may be instantiated with backbone models of different sizes, architectures, or specializations, this raises a key question: can a single memory system be shared across agents with different backbone models? We find that naive cross-model memory transfer can degrade performance, because stored memories often entangle task-relevant knowledge with model-specific biases. To address this challenge, we propose MemCollab, a collaborative memory framework that builds shared cross-model memory by contrasting reasoning trajectories generated by different model-based agents on the same task. Through this contrastive process, MemCollab distills abstract reasoning constraints that capture shared task-level invariants while suppressing model-specific artifacts. We further introduce a task-aware retrieval mechanism that conditions memory access on task category, ensuring that only relevant constraints are retrieved at inference time. Experiments on mathematical reasoning and code generation benchmarks show that MemCollab consistently improves both accuracy and inference-time efficiency across diverse agents, including settings with different model families. These results demonstrate that collaboratively constructed cross-model memory can serve as a shared reasoning resource for heterogeneous LLM-based agents.

2603.23085 2026-05-29 cs.AI

When Models Learn to Ask Why: Adaptive Causal Reasoning for Trustworthy Medical Vision-Language Models

当模型学会问为什么:面向可信医疗视觉语言模型的自适应因果推理

Jianxin Lin, Chunzheng Zhu, Peter J. Kneuertz, Yunfei Bai, Yuan Xue

发表机构 * The Ohio State University(俄亥俄州立大学) Hunan University(湖南大学) Amazon(亚马逊)

AI总结 提出MedCausalX框架,通过因果推理链、自适应反射架构和轨迹级因果校正,解决医疗VLM中的虚假相关和推理不一致问题,显著提升诊断一致性和减少幻觉。

Comments Accepted by CVPR 2026 Findings

详情
AI中文摘要

视觉语言模型(VLM)通过整合视觉感知与语言推理,实现了可解释的医疗诊断。然而,现有的医疗思维链(CoT)模型缺乏明确表示和执行因果推理的机制,使其易受虚假相关影响,限制了临床可靠性。我们指出了医疗CoT推理中的三个核心挑战:如何自适应触发因果校正、构建高质量因果-虚假对比样本、以及保持推理轨迹的因果一致性。为应对这些挑战,我们提出MedCausalX,一个显式建模医疗VLM中因果推理链的端到端框架。我们首先引入CRMed数据集,提供细粒度解剖标注、结构化因果推理链和反事实变体,引导学习超越表面相关性的因果关系。基于CRMed,MedCausalX采用两阶段自适应反射架构,配备⟨causal⟩和⟨verify⟩标记,使模型能够自主决定何时以及如何进行因果分析和验证。最后,通过错误归因强化学习优化的轨迹级因果校正目标,细化推理链,使模型能够区分真正的因果依赖与捷径关联。在多个基准上的大量实验表明,MedCausalX持续优于最先进方法,诊断一致性提升+5.4分,幻觉减少超过10分,并达到最高的空间定位IoU,从而为因果基础的医疗推理设立了新标准。代码和数据集可在https://github.com/zhcz328/MedCausalX获取。

英文摘要

Vision-Language Models (VLMs) have enabled interpretable medical diagnosis by integrating visual perception with linguistic reasoning. Yet, existing medical chain-of-thought (CoT) models lack explicit mechanisms to represent and enforce causal reasoning, leaving them vulnerable to spurious correlations and limiting their clinical reliability. We pinpoint three core challenges in medical CoT reasoning: how to adaptively trigger causal correction, construct high-quality causal-spurious contrastive samples, and maintain causal consistency across reasoning trajectories. To address these challenges, we propose MedCausalX, an end-to-end framework explicitly models causal reasoning chains in medical VLMs. We first introduce the CRMed dataset providing fine-grained anatomical annotations, structured causal reasoning chains, and counterfactual variants that guide the learning of causal relationships beyond superficial correlations. Building upon CRMed, MedCausalX employs a two-stage adaptive reflection architecture equipped with $\langle$causal$\rangle$ and $\langle$verify$\rangle$ tokens, enabling the model to autonomously determine when and how to perform causal analysis and verification. Finally, a trajectory-level causal correction objective optimized through error-attributed reinforcement learning refines the reasoning chain, allowing the model to distinguish genuine causal dependencies from shortcut associations. Extensive experiments on multiple benchmarks show that MedCausalX consistently outperforms state-of-the-art methods, improving diagnostic consistency by +5.4 points, reducing hallucination by over 10 points, and attaining top spatial grounding IoU, thereby setting a new standard for causally grounded medical reasoning. The code and dataset are available at https://github.com/zhcz328/MedCausalX.

2603.23069 2026-05-29 cs.CL cs.AI

AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing

AuthorMix: 通过逐层适配器混合实现模块化作者风格迁移

Sarubi Thillainathan, Ji-Ung Lee, Michael Sullivan, Alexander Koller

发表机构 * Saarland University(萨尔兰大学)

AI总结 提出AuthorMix框架,通过训练特定风格的LoRA适配器并利用逐层适配器混合,仅需少量目标风格样本即可实现轻量级、模块化的作者风格迁移,在低资源场景下优于现有方法并显著提升语义保留。

Comments Under review

详情
AI中文摘要

作者风格迁移任务涉及在保留原文含义的同时,将文本重写为目标作者的风格。现有的风格迁移方法在大型语料库上训练单一模型以同时建模所有目标风格:这种高成本方法为目标特定适应提供的灵活性有限,并且常常为了风格迁移而牺牲语义保留。在本文中,我们提出了AuthorMix:一个轻量级、模块化且可解释的风格迁移框架。我们在少量高资源作者上训练个体、风格特定的LoRA适配器,通过学习的逐层适配器混合,仅使用少量目标风格训练示例,即可快速训练每个新目标的专门适应模型。AuthorMix在低资源目标上优于现有的最先进风格迁移基线以及GPT-5.1,在自动和人工评估中均获得最高总分,并显著提高了语义保留。

英文摘要

The task of authorship style transfer involves rewriting text in the style of a target author while preserving the meaning of the original text. Existing style transfer methods train a single model on large corpora to model all target styles at once: this high-cost approach offers limited flexibility for target-specific adaptation, and often sacrifices meaning preservation for style transfer. In this paper, we propose AuthorMix: a lightweight, modular, and interpretable style transfer framework. We train individual, style-specific LoRA adapters on a small set of high-resource authors, allowing the rapid training of specialized adaptation models for each new target via learned, layer-wise adapter mixing, using only a handful of target-style training examples. AuthorMix outperforms existing, SoTA style-transfer baselines-as well as GPT-5.1-for low-resource targets, achieving the highest overall score and substantially improving meaning preservation in both automatic and human evaluations.

2603.21746 2026-05-29 cs.CV

Getting to the Point: Pointing Improves LVLMs at Counting

直击要点:指向提升LVLMs的计数能力

Simone Alghisi, Massimo Rizzoli, Seyed Mahed Mousavi, Giuseppe Riccardi

发表机构 * Signals and Interactive Systems Lab, University of Trento(信号与交互系统实验室,特伦托大学)

AI总结 提出Point-then-Count方法,通过生成目标物体坐标进行零样本计数,在多个LVLM上取得最高准确率,并揭示坐标编码的空间信息是性能提升的关键。

详情
AI中文摘要

基于指向的方法将复杂任务分解为顺序的定位和推理步骤。给定查询,模型首先生成相关对象的坐标进行定位,然后基于这些点预测答案。虽然这种方法已被证明能提高大型视觉语言模型(LVLM)的性能,但其为何以及如何改善模型的视觉推理仍不清楚。在这项工作中,我们评估了基于指向的方法在视觉场景零样本计数任务中的表现。我们在最先进的LVLM上实验了多种微调和免训练方法,并将其与Point-then-Count(PtC)进行比较,其中模型首先生成目标对象的点坐标,然后预测其数量。我们的结果表明,PtC在评估方法中达到了最高准确率,预测的点在超过94%的情况下正确位于图像中(基于F1分数)。机制分析表明,性能提升源于预测坐标中编码的空间信息。然而,定位性能在不同图像区域存在差异,揭示了空间偏差。最后,结果表明PtC在合成和真实数据上都改善了分布外泛化,表明坐标有潜力帮助LVLM提升计数技能。

英文摘要

Pointing-based methods decompose complex tasks as sequential grounding and reasoning steps. Given a query, the model first grounds the relevant objects by generating their coordinates, and then predicts an answer conditioned on these points. While this approach has been shown to increase the performance of Large Vision-Language Models (LVLMs), it remains unclear why and how it improves the models' visual reasoning. In this work, we evaluate pointing-based methods in the task of zero-shot counting in visual scenes. We experiment with multiple fine-tuning and training-free approaches on state-of-the-art LVLMs, and compare them with Point-then-Count (PtC), where models first generate point coordinates for the target objects and then predict their count. Our results show that PtC achieves the highest accuracy among the evaluated approaches, with predicted points correctly grounded in the image in more than 94% of cases (based on F1-score). Mechanistic analyses show that gains arise from spatial information encoded in the predicted coordinates. Nevertheless, grounding performance varies across image regions, revealing spatial biases. Finally, the results indicate that PtC improves out-of-distribution generalization on both synthetic and real data, suggesting the potential of coordinates to help LVLMs improve their counting skills.

2603.21621 2026-05-29 cs.LG

Path-Space Mirror Descent for On-Policy Reinforcement Learning under the Generalized Schrödinger Bridge

广义薛定谔桥下在线强化学习的路径空间镜像下降

Yuehu Gong, Zeyuan Wang, Yulin Chen, Shutong Ding, Qingyuan Zhou, Yanwei Fu

发表机构 * School of Data Science, Fudan University(复旦大学数据科学学院) Laboratory for Big Data and Decision, National University of Defense Technology(国防科技大学大数据与决策实验室) ShanghaiTech University(上海科技大学) College of Computer Science and Artificial Intelligence, Fudan University(复旦大学计算机科学与人工智能学院) Shanghai Innovation Institute(上海创新研究院)

AI总结 针对生成式策略在在线策略优化中终端动作密度难处理的问题,提出GSB-MDPO方法,通过将策略优化建模为广义薛定谔桥问题并利用路径空间KL散度作为近端项,实现了无需显式终端似然评估的稳定更新。

详情
AI中文摘要

经典的在线策略算法如PPO和镜像下降策略优化通过可处理的动作似然提供稳定的近端策略更新,但通常使用简单的Gaussian策略,其在复杂连续控制任务中的表达能力有限。基于扩散和流模型的生成式策略提供了更具表达力的动作分布,但它们自然地定义了多步去噪路径上的分布,其终端动作密度通常是难以处理的,这造成了与基于似然的在线策略近端更新的不匹配。为了解决这种不匹配,我们引入了GSB-MDPO(广义薛定谔桥镜像下降策略优化),它将在线策略生成式策略优化表述为状态条件生成路径上的广义薛定谔桥问题,并通过镜像下降策略优化实例化得到的路径测度更新。关键洞察是GSB路径空间KL散度在MDPO中扮演了近端项的角色,同时上界了终端动作KL散度,从而无需显式终端动作似然评估即可直接控制执行的动作分布。在Playground和Gym-MuJoCo上的14个连续控制任务上的实验证明了GSB-MDPO的经验有效性,并支持路径空间正则化作为多步生成式策略的原则性近端更新。

英文摘要

Classical on-policy algorithms such as PPO and mirror descent policy optimization provide stable proximal policy updates through tractable action likelihoods, but are typically instantiated with simple Gaussian policies whose expressiveness can be limited in complex continuous-control tasks. Generative policies based on diffusion and flow models provide more expressive action distributions, but they naturally define distributions over multi-step denoising paths whose terminal action density is often intractable, creating a mismatch with likelihood-based on-policy proximal updates. To address this mismatch, we introduce \textbf{GSB-MDPO} (\emph{Generalized Schrödinger Bridge Mirror Descent Policy Optimization}), which formulates on-policy generative policy optimization as a Generalized Schrödinger Bridge problem over state-conditioned generation paths and instantiates the resulting path-measure update through mirror descent policy optimization. The key insight is that the GSB path-space KL plays the role of the proximal term in MDPO while upper-bounding the terminal action KL, enabling direct control of the executed action distribution without explicit terminal action likelihood evaluation. Experiments on 14 continuous-control tasks across Playground and Gym-MuJoCo demonstrate the empirical effectiveness of GSB-MDPO and support path-space regularization as a principled proximal update for multi-step generative policies.

2603.19828 2026-05-29 cs.AI

FormalEvolve: Neuro-Symbolic Evolutionary Search for Diverse Autoformalization

FormalEvolve: 用于多样化自动形式化的神经符号进化搜索

Haijian Lu, Wei Wang, Jing Liu

发表机构 * School of Artificial Intelligence, Xidian University(西安电子科技大学人工智能学院) Beijing Institute for General Artificial Intelligence(北京通用人工智能研究院)

AI总结 提出FormalEvolve,一种结合LLM变异、交叉、补丁修复和符号AST重写的神经符号进化搜索方法,将自动形式化重构为预算测试时搜索,通过维护可编译存档并报告去重语义接受库,在CombiBench和ProofNet上分别达到58.0%和84.9%的SH@100,并提升下游定理证明性能。

Comments 27 pages, 12 figures

详情
AI中文摘要

自动形式化旨在生成编译通过并忠实保留非正式数学预期含义的形式化陈述。然而,标准单输出评估协议将一个多对多问题简化为单输出预测任务。对于下游证明,这种粒度过于粗糙:形式化陈述不仅是忠实的翻译终点,还是一个面向证明者的接口,其结构可以在固定预算下改变证明搜索。因此,我们将自动形式化重新定义为预算测试时搜索:FormalEvolve维护一个可编译的存档以供重用,同时报告去重后的语义接受库用于评估和下游证明。它通过LLM驱动的变异、交叉、有界补丁修复和符号抽象语法树(AST)重写来扩展存档,以实现结构多样性。在生成器调用预算T=100且使用固定LLM语义判断器的情况下,FormalEvolve在CombiBench上达到58.0%的SH@100,在ProofNet上达到84.9%的SH@100,优于所有无存档对照,同时减少了跨问题的语义成功集中度。为了评估下游价值,我们在固定B=64证明器预算下评估生成的库,它们比匹配的无存档对照提高了定理完全证明;额外的更强基础语句生成实验表明,存档搜索收益在使用更强的种子和修复模型时仍然保持。手动忠实性审计对这些判断器正输出进行了校准。

英文摘要

Autoformalization aims to produce formal statements that compile and faithfully preserve the intended meaning of informal mathematics. Yet standard single-output evaluation protocols collapse a many-to-many problem into a single-output prediction task. For downstream proving, this granularity is too coarse: a formal statement is not merely a faithful translation endpoint, but also a prover-facing interface whose structure can alter proof search under a fixed budget. We therefore recast autoformalization as budgeted test-time search: FormalEvolve maintains a compilation-feasible archive for reuse, while reporting the deduplicated semantically accepted repertoire for evaluation and downstream proving. It expands the archive with LLM-driven mutation, crossover, bounded patch repair, and symbolic Abstract Syntax Tree (AST) rewrites for structural diversity. Under a generator-call budget of T=100 with a fixed LLM semantic judge, FormalEvolve reaches SH@100 of 58.0% on CombiBench and 84.9% on ProofNet, improving over all no-archive controls while reducing the cross-problem concentration of semantic successes. To assess downstream value, we evaluate the resulting repertoires under a fixed B=64 prover budget, where they improve theorem-complete proving over the matched no-archive control; additional stronger-base statement-generation experiments show that archive-search gains hold with stronger seed and repair models. Manual faithfulness audits calibrate these judge-positive outputs.

2603.17945 2026-05-29 cs.CL

ShapleyLaw: A Game-Theoretic Approach to Multilingual Scaling Laws

ShapleyLaw:多语言缩放定律的博弈论方法

Xuyang Cao, Qianying Liu, Chuan Xiao, Yusuke Oda, Jiayi Wang, Pontus Stenetorp, Daisuke Kawahara, Makoto Onizuka, Sadao Kurohashi, Shuyuan Zheng

发表机构 * NII LLMC Osaka University(大阪大学) Nara Institute of Science and Technology(奈良科学技術大學) University College London(伦敦大学学院) Kyoto University(京都大学) Waseda University(早稻田大学)

AI总结 将多语言预训练视为合作博弈,利用Shapley值量化跨语言迁移效应,提出ShapleyLaw缩放定律以优化语言混合比例。

Comments 18 pages

详情
AI中文摘要

在多语言预训练中,预训练模型的测试损失受预训练数据中每种语言的比例(即语言混合比例)的显著影响。多语言缩放定律可以预测不同语言混合比例下的测试损失,因此可用于估计最优比例。然而,当前的多语言缩放定律方法未衡量跨语言迁移效应,导致混合比例次优。本文将多语言预训练视为一个合作博弈,其中每种语言作为一个参与者共同贡献于预训练,并将由此带来的测试损失降低作为收益。因此,从合作博弈论的角度,我们通过每种语言在博弈中的贡献来量化其跨语言迁移,并提出一种基于博弈论的多语言缩放定律,称为ShapleyLaw。我们的实验表明,ShapleyLaw在模型性能预测和语言混合优化方面优于基线方法。

英文摘要

In multilingual pretraining, the test loss of a pretrained model is heavily influenced by the proportion of each language in the pretraining data, namely the \textit{language mixture ratios}. Multilingual scaling laws can predict the test loss under different language mixture ratios and can therefore be used to estimate the optimal ratios. However, the current approaches to multilingual scaling laws do not measure the \textit{cross-lingual transfer} effect, resulting in suboptimal mixture ratios. In this paper, we consider multilingual pretraining as a cooperative game in which each language acts as a player that jointly contributes to pretraining, gaining the resulting reduction in test loss as the payoff. Consequently, from the perspective of cooperative game theory, we quantify the cross-lingual transfer from each language by its contribution in the game, and propose a game-theoretic multilingual scaling law called \textit{ShapleyLaw}. Our experiments show that ShapleyLaw outperforms baseline methods in model performance prediction and language mixture optimization.

2603.14315 2026-05-29 cs.LG math.OC

Enhancing LLM Training via Spectral Clipping

通过谱裁剪增强大语言模型训练

Xiaowen Jiang, Andrei Semenov, Sebastian U. Stich

发表机构 * CISPA Helmholtz Center for Information Security(信息安全研究中心) EPFL(瑞士联邦理工学院)

AI总结 提出SPECTRA框架,通过对优化器更新进行谱裁剪以约束谱范数、对梯度进行预谱裁剪以抑制噪声尖峰,从而提升多种优化器在大语言模型预训练中的性能。

Comments v2: ICML 2026

详情
AI中文摘要

虽然基于谱的优化器(如Muon)直接对更新的谱进行操作,但标准自适应方法(如AdamW)没有考虑权重和梯度的谱结构,使它们容易受到大语言模型(LLM)训练中两个经验问题的影响:(i)优化器更新可能具有较大的谱范数,可能破坏训练稳定性并降低泛化能力;(ii)随机梯度噪声可能表现出稀疏的谱尖峰,少数主导奇异值远大于其余值。我们提出SPECTRA,一个通用框架,通过(i)对更新进行后谱裁剪以施加谱范数约束,(ii)可选地对梯度进行预谱裁剪以抑制谱噪声尖峰,来解决这些问题。我们证明后谱裁剪构成了一种具有谱范数约束和权重正则化的复合Frank-Wolfe方法。我们进一步分析了预谱裁剪如何缓解稀疏谱尖峰。我们通过Newton-Schulz迭代提出了高效的软谱裁剪,避免了昂贵的SVD。在LLM预训练上的实验表明,SPECTRA对各种优化器(包括AdamW、Signum、Mars和AdEMAMix)一致地改善了验证损失,其中表现最佳的变体达到了最先进的结果。使用SPECTRA训练的模型表现出更小的权重范数,证实了谱裁剪与正则化之间的联系。

英文摘要

While spectral-based optimizers like Muon operate directly on the spectrum of updates, standard adaptive methods such as AdamW do not account for the spectral structure of weights and gradients, leaving them vulnerable to two empirical issues in large language model (LLM) training: (i) the optimizer updates can have large spectral norms, potentially destabilizing training and degrading generalization; (ii) stochastic gradient noise can exhibit sparse spectral spikes, with a few dominant singular values much larger than the rest. We propose SPECTRA, a general framework addressing these by (i) post-spectral clipping of updates to enforce spectral-norm constraints (ii) optional pre-spectral clipping of gradients to suppress spectral noise spikes. We prove that post-clipping constitutes a Composite Frank-Wolfe method with spectral-norm constraints and weight regularization. We further analyze how pre-clipping mitigates sparse spectral spikes. We propose efficient soft spectral clipping via Newton-Schulz iterations, avoiding expensive SVD. Experiments on LLM pretraining show SPECTRA uniformly improves validation loss for various optimizers, including AdamW, Signum, Mars, and AdEMAMix, with the best-performing variants achieving state-of-the-art results. Models trained with SPECTRA exhibit smaller weight norms, confirming the link between spectral clipping and regularization.

2603.13249 2026-05-29 cs.CL cs.AI cs.CY

Steering at the Source: Style Modulation Heads for Robust Persona Control

源头操控:用于稳健角色控制的风格调制头

Yoshihiro Izawa, Gouki Minegishi, Koshi Eguchi, Sosuke Hosokawa, Kenjiro Taura

发表机构 * The University of Tokyo, Tokyo, Japan(东京大学) National Institute of Informatics Research(信息处理研究所) Development Center for Large Language Models, Japan(大型语言模型发展中心)

AI总结 本文通过识别并仅干预少量注意力头(风格调制头),在无需微调的情况下实现对大型语言模型角色和风格的稳健控制,同时显著缓解了残差流干预导致的连贯性下降问题。

Comments 8 main pages with appendix

详情
AI中文摘要

激活操控提供了一种计算高效的机制,无需微调即可控制大型语言模型(LLM)。虽然能有效控制目标特征(如角色),但连贯性下降仍然是安全和实际部署的主要障碍。我们假设这种下降源于对残差流的干预,该干预无差别地影响聚合特征,并无意中放大了非目标噪声。在这项工作中,我们识别出一组稀疏的注意力头(仅三个头),它们独立控制角色和风格形成,我们将其称为风格调制头。具体来说,这些头可以通过内部表示的几何分析进行定位,结合层间余弦相似度和头部贡献分数。我们证明,仅针对这些特定头的干预能够实现稳健的行为控制,同时显著减轻残差流操控中观察到的连贯性下降。更广泛地说,我们的发现表明,精确的组件级定位能够实现更安全、更精确的模型控制。

英文摘要

Activation steering offers a computationally efficient mechanism for controlling Large Language Models (LLMs) without fine-tuning. While effectively controlling target traits (e.g., persona), coherency degradation remains a major obstacle to safety and practical deployment. We hypothesize that this degradation stems from intervening on the residual stream, which indiscriminately affects aggregated features and inadvertently amplifies off-target noise. In this work, we identify a sparse subset of attention heads (only three heads) that independently govern persona and style formation, which we term Style Modulation Heads. Specifically, these heads can be localized via geometric analysis of internal representations, combining layer-wise cosine similarity and head-wise contribution scores. We demonstrate that intervention targeting only these specific heads achieves robust behavioral control while significantly mitigating the coherency degradation observed in residual stream steering. More broadly, our findings show that precise, component-level localization enables safer and more precise model control.

2603.12588 2026-05-29 cs.CV

SDF-Net: Structure-Aware Disentangled Feature Learning for Opticall-SAR Ship Re-identification

SDF-Net:面向光学-SAR船舶重识别的结构感知解耦特征学习

Furui Chen, Han Wang, Yuhan Sun, Jianing You, Yixuan Lv, Zhuang Zhou, Hong Tan, Shengyang Li

发表机构 * Technology and Engineering Center for Space Utilization, Chinese Academy of Sciences(中国科学院空间利用技术与工程中心) Key Laboratory of Space Utilization, Chinese Academy of Sciences(中国科学院空间利用重点实验室) University of Chinese Academy of Sciences(中国科学院大学) School of Software, Beihang University(北航软件学院)

AI总结 针对光学与SAR图像间辐射差异导致的船舶重识别挑战,提出SDF-Net,通过结构一致性约束和解耦特征学习,实现模态不变的身份特征提取,在HOSS-ReID数据集上达到最优性能。

详情
AI中文摘要

光学与合成孔径雷达(SAR)图像之间的跨模态船舶重识别(ReID)面临根本性挑战,即被动光学成像与相干主动雷达传感之间的严重辐射差异。现有方法主要依赖统计分布对齐或语义匹配,但往往忽略了一个关键的物理先验:船舶是刚性物体,其几何结构在不同传感模态下保持稳定,而纹理外观则高度依赖模态。本文提出SDF-Net,一种结构感知解耦特征学习网络,系统地将几何一致性引入光学-SAR船舶重识别。基于ViT骨干网络,SDF-Net引入结构一致性约束,从中间层提取尺度不变的梯度能量统计量,以稳健地锚定表示对抗辐射变化。在终端阶段,SDF-Net将学习到的表示解耦为模态不变的身份特征和模态特定的特征。然后通过无参数的加性残差融合整合这些解耦线索,有效增强判别能力。在HOSS-ReID数据集上的大量实验表明,SDF-Net持续优于现有最先进方法。代码和训练模型已在https://github.com/cfrfree/SDF-Net公开。

英文摘要

Cross-modal ship re-identification (ReID) between optical and synthetic aperture radar (SAR) imagery is fundamentally challenged by the severe radiometric discrepancy between passive optical imaging and coherent active radar sensing. While existing approaches primarily rely on statistical distribution alignment or semantic matching, they often overlook a critical physical prior: ships are rigid objects whose geometric structures remain stable across sensing modalities, whereas texture appearance is highly modality-dependent. In this work, we propose SDF-Net, a Structure-Aware Disentangled Feature Learning Network that systematically incorporates geometric consistency into optical--SAR ship ReID. Built upon a ViT backbone, SDF-Net introduces a structure consistency constraint that extracts scale-invariant gradient energy statistics from intermediate layers to robustly anchor representations against radiometric variations. At the terminal stage, SDF-Net disentangles the learned representations into modality-invariant identity features and modality-specific characteristics. These decoupled cues are then integrated through a parameter-free additive residual fusion, effectively enhancing discriminative power. Extensive experiments on the HOSS-ReID dataset demonstrate that SDF-Net consistently outperforms existing state-of-the-art methods. The code and trained models are publicly available at https://github.com/cfrfree/SDF-Net.

2603.11331 2026-05-29 cs.LG cs.AI

Jailbreak Scaling Laws for Large Language Models: Polynomial-Exponential Crossover

大型语言模型的越狱缩放定律:多项式-指数交叉

Indranil Halder, Annesya Banerjee, Cengiz Pehlevan

发表机构 * John A. Paulson School of Engineering And Applied Sciences, Harvard University(哈佛大学约翰·A·保罗森工程与应用科学学院) Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology(麻省理工学院脑科学与认知科学系) Speech and Hearing Bioscience and Technology, Harvard Medical School(哈佛医学院语音与听力生物科学与技术系) Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University(哈佛大学自然与人工智能研究学院) Center for Brain Science, Harvard University(哈佛大学脑科学中心)

AI总结 研究发现对抗性提示注入攻击可使攻击成功率从无注入时的缓慢多项式增长变为随推理样本数指数增长,并通过自旋玻璃模型从理论上解释了这一现象。

详情
AI中文摘要

对抗性攻击可以可靠地将安全对齐的大型语言模型引导至不安全行为。经验上,我们发现对抗性提示注入攻击可以将攻击成功率从无注入时观察到的缓慢多项式增长放大为随推理样本数指数增长。我们首先通过一组关于上下文安全生成分布的最小假设,确定了这两种机制的统计基础,并推导出两种缩放定律。为了进一步解释这一现象,我们提出了一个基于自旋玻璃系统的代理语言理论生成模型,该系统处于复制对称破缺状态,生成样本来自相关的吉布斯测度,并将低能、有偏大小的子集标记为不安全。我们分析展示了该模型如何自然实现最小假设。短注入提示对应于指向不安全簇中心的弱磁场,导致攻击成功率随推理样本数呈幂律缩放;而长注入提示(即强磁场)则导致指数缩放。我们在参数规模从3B到70B的广泛大型语言模型中观察到了定性一致的行为。特别是,主要趋势在多种攻击方法(如GCG和AutoDAN)以及基准数据集(如AdvBench和HarmBench)中保持稳定。

英文摘要

Adversarial attacks can reliably steer safety-aligned large language models toward unsafe behavior. Empirically, we find that adversarial prompt-injection attacks can amplify attack success rate from the slow polynomial growth observed without injection to exponential growth with the number of inference-time samples. We first identify a minimal statistical mechanism for these two regimes by giving a small set of assumptions on the distribution of safe generation across contexts under which both scaling laws follow. To explain this phenomenon further, we propose a theoretical generative model of proxy language in terms of a spin-glass system operating in a replica-symmetry-breaking regime, where generations are drawn from the associated Gibbs measure and a subset of low-energy, size-biased clusters is designated unsafe. We analytically show how this model naturally realizes the minimal assumptions. Short injected prompts correspond to a weak magnetic field aligned towards unsafe cluster centers and yield a power-law scaling of attack success rate with the number of inference-time samples, while long injected prompts, i.e., strong magnetic field, yield exponential scaling. We observe qualitatively consistent behavior across a broad range of large language models, spanning parameter scales from 3B to 70B. In particular, the main trends remain stable across multiple attack methods, such as GCG and AutoDAN, as well as across benchmark datasets such as AdvBench and HarmBench.

2603.10474 2026-05-29 cs.LG cs.NE cs.RO

Muscle Synergy Priors Enhance Biomechanical Fidelity in Predictive Musculoskeletal Locomotion Simulation

肌肉协同先验增强预测性肌肉骨骼运动模拟的生物力学保真度

Ilseung Park, Eunsik Choi, Jangwhan Ahn, Jooeun Ahn

发表机构 * Department of Mechanical Engineering(机械工程系) Carnegie Mellon University(卡内基梅隆大学) Department of Physical Education(体育系) Seoul National University(首尔国立大学) Lampe Joint Department of Biomedical Engineering(生物医学工程联合部门) UNC-Chapel Hill and NC State University(北卡罗来纳大学教堂山分校和北卡罗来纳州立大学)

AI总结 提出一种生理学启发的强化学习框架,通过肌肉协同约束控制,在有限实验数据下提高了预测性人体运动模拟的生物力学保真度和泛化能力。

Comments Added a manuscript footnote stating "Project page with supplementary videos: https://ces40320.github.io/WebHomepage__Walk-RL ."

详情
AI中文摘要

人类运动源于高维神经肌肉控制,这使得预测性肌肉骨骼模拟具有挑战性。我们提出了一种生理学启发的强化学习框架,利用肌肉协同约束控制。我们从少量地面行走试验的逆肌肉骨骼分析中提取了低维协同基,并将其作为动作空间,用于训练一个肌肉驱动的三维模型,该模型在可变速度、坡度和不平坦地形上进行训练。由此产生的控制器在0.7-1.8 m/s的速度和±6°的坡度上生成了稳定的步态,并再现了关节角度、关节力矩和地面反作用力的条件依赖性调节。与无约束控制器相比,协同约束控制减少了非生理性膝关节运动学,并将膝关节力矩曲线保持在实验包络内。在各种条件下,模拟的垂直地面反作用力与人体测量值强相关,肌肉激活时间大多落在受试者间变异范围内。这些结果表明,将神经生理结构嵌入强化学习可以在有限实验数据下提高预测性人体运动模拟的生物力学保真度和泛化能力。

英文摘要

Human locomotion emerges from high-dimensional neuromuscular control, making predictive musculoskeletal simulation challenging. We present a physiology-informed reinforcement-learning framework that constrains control using muscle synergies. We extracted a low-dimensional synergy basis from inverse musculoskeletal analyses of a small set of overground walking trials and used it as the action space for a muscle-driven three-dimensional model trained across variable speeds, slopes and uneven terrain. The resulting controller generated stable gait from 0.7-1.8 m/s and on $\pm$ 6$^{\circ}$ grades and reproduced condition-dependent modulation of joint angles, joint moments and ground reaction forces. Compared with an unconstrained controller, synergy-constrained control reduced non-physiological knee kinematics and kept knee moment profiles within the experimental envelope. Across conditions, simulated vertical ground reaction forces correlated strongly with human measurements, and muscle-activation timing largely fell within inter-subject variability. These results show that embedding neurophysiological structure into reinforcement learning can improve biomechanical fidelity and generalization in predictive human locomotion simulation with limited experimental data.

2603.07916 2026-05-29 cs.AI cs.DB cs.LG

Rel-MOSS: Towards Imbalanced Relational Deep Learning on Relational Databases

Rel-MOSS:面向关系数据库中不平衡关系深度学习的解决方案

Jun Yin, Peng Huo, Bangguo Zhu, Hao Yan, Senzhang Wang, Shirui Pan, Chengqi Zhang

发表机构 * Department of Data Science and Artificial Intelligence(数据科学与人工智能系) Hong Kong Polytechnic University(香港理工大学) School of Computer Science and Engineering(计算机科学与工程学院) Central South University(中南大学) School of Information and Communication Technology(信息与通信技术学院) Griffith University(格里菲斯大学) National Super Computing Center(国家超级计算中心)

AI总结 针对关系数据库中实体分类的类别不平衡问题,提出关系中心少数类合成过采样GNN(Rel-MOSS),通过关系门控控制器和关系引导的少数类合成器提升少数类表示,在12个数据集上平均平衡准确率提升2.46%,G-Mean提升4.00%。

详情
AI中文摘要

在最近的进展中,为了实现关系数据库(RDB)上完全数据驱动的学习范式,提出了关系深度学习(RDL),将RDB结构化为异构实体图,并采用图神经网络(GNN)作为预测模型。然而,现有的RDL方法忽略了RDB中关系数据的不平衡问题,可能导致少数实体表示不足,从而在实践中产生不可用的模型。在这项工作中,我们首次研究了RDB实体分类中的类别不平衡问题,并设计了以关系为中心的少数类合成过采样GNN(Rel-MOSS),以填补当前文献中的关键空白。具体来说,为了缓解少数类相关信息被多数类信息淹没的问题,我们设计了关系门控控制器来调节来自每个单独关系类型的邻域消息。基于关系门控表示,我们进一步提出了用于过采样的关系引导的少数类合成器,该合成器整合了实体关系签名以保持关系一致性。在12个实体分类数据集上的大量实验为Rel-MOSS的优越性提供了令人信服的证据,与最先进的RDL方法和处理类别不平衡的经典方法相比,在平衡准确率和G-Mean上分别平均提高了2.46%和4.00%。

英文摘要

In recent advances, to enable a fully data-driven learning paradigm on relational databases (RDB), relational deep learning (RDL) is proposed to structure the RDB as a heterogeneous entity graph and adopt the graph neural network (GNN) as the predictive model. However, existing RDL methods neglect the imbalance problem of relational data in RDBs and risk under-representing the minority entities, leading to an unusable model in practice. In this work, we investigate, for the first time, class imbalance problem in RDB entity classification and design the relation-centric minority synthetic over-sampling GNN (Rel-MOSS), in order to fill a critical void in the current literature. Specifically, to mitigate the issue of minority-related information being submerged by majority counterparts, we design the relation-wise gating controller to modulate neighborhood messages from each individual relation type. Based on the relational-gated representations, we further propose the relation-guided minority synthesizer for over-sampling, which integrates the entity relational signatures to maintain relational consistency. Extensive experiments on 12 entity classification datasets provide compelling evidence for the superiority of Rel-MOSS, yielding an average improvement of up to 2.46% and 4.00% in terms of Balanced Accuracy and G-Mean, compared with SOTA RDL methods and classic methods for handling class imbalance.

2603.07860 2026-05-29 cs.LG

Sparse Scheduled Diffusion Guidance for Inverse Problems

稀疏调度扩散引导用于逆问题

Abduragim Shtanchaev, Albina Ilina, Yazid Janati, Arip Asadulaev, Martin Takac, Eric Moulines

发表机构 * MBZUAI(穆扎伊人工智能研究院) Institute of Foundation Models(基础模型研究所) EPITA

AI总结 提出Spin方法,通过从中间时间步开始后验采样并仅在调度步骤应用轻量级校正,实现高效逆问题求解,在FFHQ和ImageNet上速度提升2-50倍且内存更低。

详情
AI中文摘要

预训练扩散模型是贝叶斯逆问题的有效先验,但使用这些先验进行后验采样通常成本高昂,因为数据一致性引导应用于整个反向轨迹。现有方法表明,有时可以避免通过去噪器的向量-雅可比乘积,但它们通常仍然依赖于整个轨迹的密集引导或昂贵的内部求解。我们提出了稀疏调度扩散引导用于逆问题(Spin),这是一种避免从纯噪声开始后验采样的求解器。Spin首先在中间时间步$t_*$从后验时间边际采样,然后将该状态作为引导反向扩散过程的热启动。在引导时间,Spin不是在每个去噪步骤强制执行测量约束,而是仅在调度的时间步应用轻量级校正,此时去噪器仍能清理伪影。由此产生的过程将先验细化与数据一致性解耦:先验提供去噪,而轻量级像素空间优化强制执行测量约束,无需通过去噪器或解码器进行反向传播。在FFHQ和ImageNet上的线性和非线性逆问题中,Spin以显著更好的运行时-内存曲线实现了有竞争力的重建质量,在像素空间模型上运行速度提高2倍,在潜在扩散模型上运行速度提高50倍,且内存成本更低。

英文摘要

Pretrained diffusion models are effective priors for Bayesian inverse problems, but posterior sampling with these priors is often costly because data-consistency guidance is applied throughout the full reverse trajectory. Existing methods have shown that vector-Jacobian products through the denoiser can sometimes be avoided, yet they typically still rely on dense guidance through the full trajectory or expensive inner solves. We introduce Sparse Scheduled Diffusion Guidance for Inverse Problems (Spin), a solver that avoids starting posterior sampling from pure noise. Spin first samples from a posterior time-marginal at an intermediate timestep $t_*$, and then uses that state as a warm start for a guided reverse diffusion process. At guidance time, instead of enforcing the measurement constraint at every denoising step, Spin applies lightweight corrections only at scheduled timesteps where the denoiser can still clean up artifacts. The resulting procedure decouples prior refinement from data consistency: the prior supplies denoising, while lightweight pixel-space optimization enforces the measurement constraint without backpropagation through the denoiser or decoder. Across linear and nonlinear inverse problems on FFHQ and ImageNet, Spin achieves competitive reconstruction quality with a substantially better runtime--memory profile, running 2x faster on pixel-space models and up to 50x faster on latent diffusion models, with lower memory costs.

2603.05488 2026-05-29 cs.CL cs.AI cs.LG

Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought

推理剧场:从思维链中分离模型信念

Siddharth Boppana, Annabel Ma, Max Loeffler, Raphael Sarfati, Eric Bigelow, Atticus Geiger, Owen Lewis, Jack Merullo

发表机构 * Harvard University, Cambridge, MA(哈佛大学,马萨诸塞州剑桥)

AI总结 通过激活探针、早期强制回答和思维链监控器分析,发现推理模型存在表演性思维链现象,并利用探针引导的早期退出实现高效计算。

详情
AI中文摘要

我们提供了推理模型中表演性思维链(CoT)的证据,即模型对其最终答案变得非常自信,但继续生成令牌而不揭示其内部信念。我们的分析比较了两个大型模型(DeepSeek-R1 671B 和 GPT-OSS 120B)中的激活探针、早期强制回答和思维链监控器,并发现了任务难度特定的差异:模型的最终答案可以从思维链中远早于监控器能够判断的激活中解码,特别是对于基于回忆的简单MMLU问题。我们将此与困难的多跳GPQA-Diamond问题中的真正推理进行对比。尽管如此,转折点(例如回溯、“啊哈”时刻)几乎只出现在探针显示大信念转变的响应中,表明这些行为追踪的是真正的不确定性,而不是学到的“推理剧场”。最后,探针引导的早期退出在MMLU上减少了高达80%的令牌,在GPQA-Diamond上减少了30%,且准确率相似,将注意力探针定位为检测表演性推理和实现自适应计算的高效工具。

英文摘要

We provide evidence of performative chain-of-thought (CoT) in reasoning models, where a model becomes strongly confident in its final answer, but continues generating tokens without revealing its internal belief. Our analysis compares activation probing, early forced answering, and a CoT monitor across two large models (DeepSeek-R1 671B & GPT-OSS 120B) and find task difficulty-specific differences: The model's final answer is decodable from activations far earlier in CoT than a monitor is able to say, especially for easy recall-based MMLU questions. We contrast this with genuine reasoning in difficult multihop GPQA-Diamond questions. Despite this, inflection points (e.g., backtracking, 'aha' moments) occur almost exclusively in responses where probes show large belief shifts, suggesting these behaviors track genuine uncertainty rather than learned "reasoning theater." Finally, probe-guided early exit reduces tokens by up to 80% on MMLU and 30% on GPQA-Diamond with similar accuracy, positioning attention probing as an efficient tool for detecting performative reasoning and enabling adaptive computation.

2603.05002 2026-05-29 cs.LG math.OC stat.ML

Non-Euclidean Gradient Descent Operates at the Edge of Stability

非欧几里得梯度下降在稳定性边缘运行

Rustem Islamov, Michael Crawshaw, Jeremy Cohen, Robert Gower

发表机构 * University of Basel(巴塞尔大学) George Mason University(乔治·马歇尔大学) Flatiron Institute(Flatiron研究所)

AI总结 本文通过方向光滑性解释梯度下降中的稳定性边缘现象,并将其推广到非欧几里得范数,定义广义尖锐度,实验表明非欧几里得梯度下降也表现出渐进尖锐化和阈值振荡。

详情
AI中文摘要

稳定性边缘(EoS)是一种现象,其中Hessian矩阵的尖锐度(最大特征值)在梯度下降(GD)中接近并徘徊在稳定性阈值$2/η$附近(步长为$η$)。尽管(表面上)违反了经典光滑性假设,但EoS在深度学习中已被广泛观察到,其理论基础仍不完整。我们通过方向光滑性[Mishkin et al., 2024]的视角提供了对EoS的解释。这种解释自然地扩展到非欧几里得范数,我们用它来定义任意范数下的广义尖锐度。我们的广义尖锐度度量包括先前研究的普通GD和预处理GD作为特例,以及尚未研究EoS的方法,例如$\ell_{\infty}$下降、块坐标下降、谱GD及其归一化版本。通过在神经网络上的实验,我们表明具有广义尖锐度的非欧几里得GD也表现出渐进尖锐化,随后在阈值$2/η$附近或之上振荡。在实践中,我们的框架提供了一种几何感知的谱诊断方法,可应用于广泛的非欧几里得梯度方法类别。

英文摘要

The Edge of Stability (EoS) is a phenomenon where the sharpness (largest eigenvalue) of the Hessian approaches and then hovers near the stability threshold $2/η$ during gradient descent (GD) with step size $η$. Despite (apparently) violating classical smoothness assumptions, EoS has been widely observed in deep learning, but its theoretical foundations remain incomplete. We provide an interpretation of EoS through the lens of Directional Smoothness [Mishkin et al., 2024]. This interpretation naturally extends to non-Euclidean norms, which we use to define generalized sharpness under an arbitrary norm. Our generalized sharpness measure includes previously studied vanilla GD and preconditioned GD as special cases, as well as methods for which EoS has not been studied, such as $\ell_{\infty}$-descent, Block CD, Spectral GD, and their normalized versions. Through experiments on neural networks, we show that non-Euclidean GD with our generalized sharpness also exhibits progressive sharpening followed by oscillations around or above the threshold $2/η$. Practically, our framework provides a geometry-aware spectral diagnostic that can be applied across a broad class of non-Euclidean gradient methods.

2603.04678 2026-05-29 cs.CL cs.AI

Post-Training Language Models for Crosslingual Consistency

后训练语言模型以实现跨语言一致性

Tianyu Liu, Jirui Qi, Mrinmaya Sachan, Ryan Cotterell, Raquel Fernández, Arianna Bisazza

发表机构 * ETH Zürich(苏黎世联邦理工学院) CLCG, University of Groningen(格罗宁根大学CLCG中心) University of Amsterdam(阿姆斯特丹大学)

AI总结 针对多语言模型对翻译等价提示响应不一致的问题,提出基于信息论的跨语言一致性定义,并开发后训练方法直接一致性优化(DCO)以提升一致性。

Comments ICML 2026. The first two authors contributed equally. Codes available at: https://github.com/Betswish/ConsistencyRL

详情
AI中文摘要

语言模型通常对跨语言的翻译等价提示响应不一致,这损害了多语言系统的可靠性。为了量化这一点,我们从信息论角度将跨语言一致性定义为模型响应分布与其跨语言往返推前分布之间的散度界。然后,我们引入惩罚一致性优化(PCO),这是一种后训练程序,将该散度与固定参考语言模型的Kullback-Leibler惩罚相结合。由于直接优化PCO需要昂贵的策略内展开,我们提出了一个易于处理的替代方案——直接一致性优化(DCO),它可以在策略外进行优化。在多种语言模型和26种语言中,DCO显著提高了跨语言一致性,优于现有方法,并实现了对低资源语言的有针对性的对齐。

英文摘要

Language models often respond inconsistently to translation-equivalent prompts across languages, undermining the reliability of multilingual systems. To quantify this, we give an information-theoretic definition of crosslingual consistency as a divergence bound between a model's response distribution and its round-trip pushforward across languages. We then introduce penalized consistency optimization (PCO), a post-training procedure that couples this divergence with a Kullback-Leibler penalty to a fixed reference language model. Because direct optimization of PCO requires expensive on-policy roll-outs, we propose a tractable surrogate, direct consistency optimization (DCO), which can be optimized off-policy. Across diverse language models and 26 languages, DCO significantly improves crosslingual consistency, outperforms existing methods, and enables targeted alignment of low-resource languages.

2603.04314 2026-05-29 cs.CV cs.AI

MOO: A Multi-view Oriented Observations Dataset for Viewpoint Analysis in Cattle Re-Identification

MOO:用于牛个体重识别视角分析的多视角观测数据集

William Grolleau, Achraf Chaouch, Astrid Sabourin, Guillaume Lapouge, Catherine Achard

发表机构 * Universite Paris-Saclay, CEA, List(巴黎-萨克雷大学,CEA,List) Sorbonne University, CNRS, ISIR(索邦大学,CNRS,ISIR)

AI总结 提出大规模合成多视角观测数据集MOO,通过128个均匀采样视角的1000头牛图像,量化视角变化对重识别的影响,并验证合成几何先验在真实场景中的迁移性。

Comments 6 pages, 3 figures, accepted to the CVPR 2026 Workshop on Computer Vision for Animal Behavior Tracking and Modeling (CV4Animals)

详情
AI中文摘要

动物重识别(ReID)由于视角变化面临严峻挑战,特别是在航空-地面(AG-ReID)场景中,模型需要跨越剧烈的高度变化匹配个体。然而,现有数据集缺乏精确的角度标注来系统分析这些几何变化。为此,我们引入了多视角观测(MOO)数据集,这是一个大规模合成AG-ReID数据集,包含从128个均匀采样视角捕获的1000头牛个体(128,000张标注图像)。利用这个受控数据集,我们量化了高度的影响,并识别出一个关键高度阈值,超过该阈值模型对未见视角的泛化能力显著提升。最后,我们在零样本和监督设置下验证了向真实世界应用的迁移性,展示了在四个真实牛数据集上的性能提升,并确认合成几何先验有效弥合了领域差距。总之,该数据集和分析为跨视角动物ReID的未来模型开发奠定了基础。MOO公开于https://github.com/TurtleSmoke/MOO。

英文摘要

Animal re-identification (ReID) faces critical challenges due to viewpoint variations, particularly in Aerial-Ground (AG-ReID) settings where models must match individuals across drastic elevation changes. However, existing datasets lack the precise angular annotations required to systematically analyze these geometric variations. To address this, we introduce the Multi-view Oriented Observation (MOO) dataset, a large-scale synthetic AG-ReID dataset of $1,000$ cattle individuals captured from $128$ uniformly sampled viewpoints ($128,000$ annotated images). Using this controlled dataset, we quantify the influence of elevation and identify a critical elevation threshold, above which models generalize significantly better to unseen views. Finally, we validate the transferability to real-world applications in both zero-shot and supervised settings, demonstrating performance gains across four real-world cattle datasets and confirming that synthetic geometric priors effectively bridge the domain gap. Collectively, this dataset and analysis lay the foundation for future model development in cross-view animal ReID. MOO is publicly available at https://github.com/TurtleSmoke/MOO.

2603.03805 2026-05-29 cs.LG cs.AI cs.DB

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

通过结构先验的合成预训练实现关系上下文学习

Yanbo Wang, Jiaxuan You, Chuan Shi, Muhan Zhang

发表机构 * Institute for Artificial Intelligence, Peking University(北京大学人工智能研究院) University of Illinois at Urbana-Champaign(伊利诺伊大学香槟分校) Institute of Computing Technology, Beijing University of Post(北京邮电大学计算机学院) State Key Laboratory of General Artificial Intelligence(通用人工智能国家重点实验室)

AI总结 提出RDB-PFN,首个仅通过合成数据训练的关系基础模型,利用结构因果模型生成多样关系数据库,实现对新数据库的即时上下文学习,在19个真实关系预测任务上优于现有表格基础模型。

详情
AI中文摘要

关系数据库是现代业务的支柱,但它们缺乏与文本或视觉领域相当的基础模型。一个关键障碍是高质量的关系数据库是私有的、稀缺的且结构异构,使得互联网规模的预训练不可行。为了克服这种数据稀缺性,我们引入了RDB-PFN,这是第一个完全通过合成数据训练的关系基础模型。受先验数据拟合网络的启发,其中从结构因果模型生成的合成数据能够实现单表推理,我们设计了一个关系先验生成器,从零开始创建无限多样的关系数据库流。在超过200万个合成单表和关系任务上进行预训练后,RDB-PFN通过真正的上下文学习学会即时适应任何新数据库。实验表明,RDB-PFN在19个真实世界的关系预测任务上实现了强大的少样本性能,优于在相同DFS线性化输入上评估的最先进的表格基础模型,同时使用轻量级架构和快速推理。代码可在https://github.com/MuLabPKU/RDBPFN获取。

英文摘要

Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce, and structurally heterogeneous, making internet-scale pre-training infeasible. To overcome this data scarcity, we introduce RDB-PFN, the first relational foundation model trained purely via synthetic data. Inspired by Prior-Data Fitted Networks (PFNs), where synthetic data generated from Structural Causal Models (SCMs) enables reasoning on single tables, we design a Relational Prior Generator to create an infinite stream of diverse RDBs from scratch. Pre-training on over 2 million synthetic single-table and relational tasks, RDB-PFN learns to adapt to any new database instantly via genuine in-context learning. Experiments show that RDB-PFN achieves strong few-shot performance on 19 real-world relational prediction tasks, outperforming state-of-the-art tabular foundation models evaluated on the same DFS-linearized inputs, while using a lightweight architecture and fast inference. The code is available at https://github.com/MuLabPKU/RDBPFN.

2603.03503 2026-05-29 cs.CV cs.LG

Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer for 200m Resolution Pan-Arctic Sea Ice Concentration Mapping and Uncertainty Estimation using Sentinel-1, RCM, and AMSR2 Data

地理加权弱监督贝叶斯高分辨率Transformer:利用Sentinel-1、RCM和AMSR2数据实现200米分辨率泛北极海冰密集度制图与不确定性估计

Mabel Heffring, Lincoln Linlin Xu

发表机构 * Department of Geomatics Engineering, Schulich School of Engineering, University of Calgary(地质工程系,Schulich 工程学院,卡尔加里大学)

AI总结 提出一种贝叶斯高分辨率Transformer模型,结合地理加权弱监督损失函数和决策级数据融合,利用Sentinel-1、RCM和AMSR2数据实现200米分辨率泛北极海冰密集度制图与不确定性量化。

Comments 23 pages, 20 figures

详情
AI中文摘要

尽管具有可靠对应不确定性的泛北极海冰高分辨率制图对于业务化海冰密集度(SIC)制图至关重要,但由于冰特征信号的细微性、SIC标签的不精确性、模型不确定性和数据异质性等关键挑战,这是一项艰巨的任务。本研究提出了一种新颖的贝叶斯高分辨率Transformer方法,利用Sentinel-1、RADARSAT星座任务(RCM)和先进微波扫描辐射计2(AMSR2)数据,实现200米分辨率泛北极SIC制图和不确定性量化。首先,为了改进微小和细微海冰特征(例如裂缝/水道、融池和浮冰)的提取,我们设计了一种新颖的高分辨率Transformer模型,该模型具有全局和局部模块,能够更好地区分海冰模式的细微差异。其次,为了解决低分辨率和非精确SIC标签的问题,我们设计了一种地理加权弱监督损失函数,在区域级别而非像素级别监督模型,并优先考虑纯开阔水和冰盖特征,同时减轻边缘冰区(MIZ)中模糊性的影响。第三,为了改进不确定性量化,我们设计了所提Transformer模型的贝叶斯扩展,将其参数视为随机变量,以更有效地捕获不确定性。第四,为了解决数据异质性,我们在决策级融合三种不同类型的数据(Sentinel-1、RCM和AMSR2),以改进SIC制图和不确定性量化。所提方法在2021年和2025年泛北极最小范围条件下进行了评估。结果表明,所提模型在使用Sentinel-1数据时实现了0.70的总体特征检测精度,同时保留了泛北极SIC模式(相对于ARTIST海冰产品,Sentinel-1 R² = 0.90)。

英文摘要

Although high-resolution mapping of pan-Arctic sea ice with reliable corresponding uncertainty is essential for operational sea ice concentration (SIC) charting, it is a difficult task due to key challenges, such as the subtle nature of ice signature features, inexact SIC labels, model uncertainty, and data heterogeneity. This study presents a novel Bayesian High-Resolution Transformer approach for 200 meter resolution pan-Arctic SIC mapping and uncertainty quantification using Sentinel-1, RADARSAT Constellation Mission (RCM), and Advanced Microwave Scanning Radiometer 2 (AMSR2) data. First, to improve small and subtle sea ice feature (e.g., cracks/leads, ponds, and ice floes) extraction, we design a novel high-resolution Transformer model with both global and local modules that can better discern the subtle differences in sea ice patterns. Second, to address low-resolution and inexact SIC labels, we design a geographically-weighted weakly supervised loss function to supervise the model at region level instead of pixel level, and to prioritize pure open water and ice pack signatures while mitigating the impact of ambiguity in the marginal ice zone (MIZ). Third, to improve uncertainty quantification, we design a Bayesian extension of the proposed Transformer model, treating its parameters as random variables to more effectively capture uncertainties. Fourth, to address data heterogeneity, we fuse three different data types (Sentinel-1, RCM, and AMSR2) at decision-level to improve both SIC mapping and uncertainty quantification. The proposed approach is evaluated under pan-Arctic minimum-extent conditions in 2021 and 2025. Results demonstrate that the proposed model achieves 0.70 overall feature detection accuracy using Sentinel-1 data, while also preserving pan-Arctic SIC patterns (Sentinel-1 R\textsuperscript{2} = 0.90 relative to the ARTIST Sea Ice product).

2603.02082 2026-05-29 cs.CL

What Exactly do Children Receive in Language Acquisition? A Case Study on CHILDES with Automated Detection of Filler-Gap Dependencies

儿童在语言习得中究竟获得了什么?基于CHILDES的填充词-空位依赖自动检测案例研究

Zhenghao Herbert Zhou, William Dai, Maya Viswanathan, Simon Charlow, R. Thomas McCoy, Robert Frank

发表机构 * Department of Linguistics, Yale University(耶鲁大学语言学系) Department of Computer Science, Yale University(耶鲁大学计算机科学系) Wu Tsai Institute, Yale University(耶鲁大学吴氏研究所)

AI总结 通过自动检测英语口语语料中的三种核心填充词-空位结构,量化儿童语言输入中的分布证据,并分析儿童产出轨迹,为先天语法知识与统计学习之争提供数据支持。

Comments Camera-ready version accepted to CoNLL 2026

详情
AI中文摘要

儿童对填充词-空位依赖的习得,一些研究者认为依赖于先天语法知识,而另一些则认为儿童导向言语中可用的分布证据足以解释。不幸的是,相关输入难以大规模细粒度量化,使得这一问题难以解决。我们提出一个系统,能够识别英语口语语料中的三种核心填充词-空位结构——主句wh-疑问句、嵌入式wh-疑问句和关系从句——并进一步识别提取位置(即主语、宾语或附加语)。我们的方法结合了成分分析和依存分析,利用它们在结构分类和提取位置识别上的互补优势。我们在人工标注数据上验证了该系统,发现其在大多数类别上表现良好。将该系统应用于57个英语CHILDES语料库,我们能够描述儿童在发育过程中接收的填充词-空位输入及其产出轨迹,包括特定结构的频率和提取位置不对称性。由此产生的细粒度标签为未来的习得研究和计算研究提供了基础,我们通过一个使用语言模型进行过滤语料训练的案例研究进行了演示。

英文摘要

Children's acquisition of filler-gap dependencies has been argued by some to depend on innate grammatical knowledge, while others suggest that the distributional evidence available in child-directed speech suffices. Unfortunately, the relevant input is difficult to quantify at scale with fine granularity, making this question difficult to resolve. We present a system that identifies three core filler-gap constructions in spoken English corpora -- matrix wh-questions, embedded wh-questions, and relative clauses -- and further identifies the extraction site (i.e., subject vs. object vs. adjunct). Our approach combines constituency and dependency parsing, leveraging their complementary strengths for construction classification and extraction site identification. We validate the system on human-annotated data and find that it scores well across most categories. Applying the system to 57 English CHILDES corpora, we are able to characterize children's filler-gap input and their filler-gap production trajectories over the course of development, including construction-specific frequencies and extraction-site asymmetries. The resulting fine-grained labels enable future work in both acquisition and computational studies, which we demonstrate with a case study using filtered corpus training with language models.