arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2075
专题追踪 全部专题
2605.07465 2026-05-11 cs.CL

SEIF: Self-Evolving Reinforcement Learning for Instruction Following

SEIF:用于指令跟随的自演化强化学习

Qingyu Ren, Qianyu He, Jiajie Zhu, Xingzhou Chen, Jingwen Chang, Zeye Sun, Han Xia, Fei Yu, Jiaqing Liang, Yanghua Xiao

发表机构 * Shanghai Key Laboratory of Data Science, College of Computer Science and Artificial Intelligence, Fudan University(上海数据科学 key laboratory,计算机科学与人工智能学院,复旦大学) School of Data Science, Fudan University(数据科学学院,复旦大学) Ant Group(蚂蚁集团)

AI总结 SEIF提出了一种自演化框架,通过闭环自我进化提升大语言模型的指令跟随能力,通过动态调整指令难度与模型能力相互促进,实验表明其在多模型规模上均有效提升性能。

详情
AI中文摘要

指令跟随是大型语言模型(LLMs)的基本能力,但持续提升这一能力仍具挑战性。现有方法通常依赖昂贵的人类外部监督或强教师模型,或使用静态难度的自我对战训练,无法随模型能力提升而进化。为解决这些限制,我们提出了SEIF(用于指令跟随的自演化强化学习),一种增强LLMs指令跟随能力的自演化框架。SEIF形成一个闭环自我进化循环,通过指令难度进化和模型能力进化相互促进来提升模型的指令跟随能力。SEIF包含四个角色:生成越来越具有挑战性的指令的Instructor,去除冲突或无效指令以确保数据质量的Filter,学习遵循进化指令的Follower,以及提供强化学习奖励信号的Judger。Instructor和Follower在过程中交替训练并共同进化。在多个模型规模和架构上的实验表明,SEIF一致提升了指令跟随性能,表明其具有强泛化能力。进一步分析揭示了改进的来源,并识别了在开放性任务上自我进化训练的有效策略:充分的早期阶段训练以建立坚实基础,随后适度的后期阶段训练以缓解过拟合并实现更好的最终性能。代码和数据可在https://github.com/Rainier-rq1/SEIF上公开获取。

英文摘要

Instruction following is a fundamental capability of large language models (LLMs), yet continuously improving this capability remains challenging. Existing methods typically rely either on costly external supervision from humans or strong teacher models, or on self-play training with static-difficulty instructions that cannot evolve as the model's capabilities improve. To address these limitations, we propose SEIF (Self-Evolving Reinforcement Learning for Instruction Following), a self-evolving framework for enhancing the instruction-following ability of LLMs. SEIF forms a closed self-evolution loop that improves the model's instruction-following ability, where instruction difficulty evolution and model capability evolution reinforce each other. SEIF consists of four roles: an Instructor that generates increasingly challenging instructions, a Filter that removes conflicting or invalid instructions to ensure data quality, a Follower that learns to follow evolved instructions, and a Judger that provides reward signals for reinforcement learning. The Instructor and Follower are alternately trained and co-evolve throughout the process. Experiments across multiple model scales and architectures show that SEIF consistently improves instruction-following performance, suggesting strong generality. Further analyses reveal the sources of improvement and identify an effective training strategy for self-evolution on open-ended tasks: sufficient early-stage training to build a solid foundation, followed by moderate late-stage training to mitigate overfitting and achieve better final performance. The code and data are publicly available at https://github.com/Rainier-rq1/SEIF.

2605.07463 2026-05-11 cs.LG

Approximation Error Upper and Lower Bounds for Hölder Class with Transformers

Transformer 对 Hölder 类的近似误差上界和下界研究

Xin He, Yuling Jiao, Xiliang Lu, Jerry Zhijian Yang

发表机构 * School of Mathematics and Statistics, Wuhan University, Wuhan, China(武汉大学数学与统计学学院) School of Artificial Intelligence, Wuhan University, Wuhan, China(武汉大学人工智能学院) Hubei Key Laboratory of Computational Science, Wuhan University, Wuhan, China(湖北省计算科学重点实验室)

AI总结 本文研究了 Transformer 的表达能力,推导了 Hölder 类的近似误差上界和下界,证明了 Transformer 网络在一定块数下可近似任意 Hölder 函数,并展示了其在回归任务中的有效性。

Comments 31 pages, 2 figures. Accepted by ICML2026

详情
AI中文摘要

我们通过建立精确的近似误差上界和下界来探讨 Transformer 的表达能力。具体而言,我们为标准 Transformer 架构推导了新的近似上界,证明了由至多 O(ε^{-d₀/α}) 块组成的 Transformer 网络可近似任意有界的 Hölder 函数,其输入维度为 d₀,光滑度为 α∈(0,1]。在下界方面,利用 VC 维度上界,我们首次严格证明 Transformer 至少需要 Ω(ε^{-d₀/(4α)}) 块才能达到 ε 的近似精度。最后,我们将推导出的结果扩展到一般回归任务,并建立了相应的超额风险率,展示了 Transformer 在现实场景中的有效性。

英文摘要

We explore the expressive power of Transformers by establishing precise approximation error upper and lower bounds for Hölder class. Specifically, a new approximation upper bound is derived for the standard Transformer architecture equipped with Softmax operators, ReLU activation functions, and residual connections. We prove that a Transformer network composed of at most $\mathcal{O}(\varepsilon^{-{d_{0}}/α})$ blocks can approximate any bounded Hölder function with $d_{0}$-dimensional input and smoothness $α\in(0,1]$ under any accuracy $\varepsilon>0$. In the case of approximation lower bounds, leveraging the VC-dimension upper bound, we are the first to rigorously prove that Transformers demand for at least $Ω(\varepsilon^{-{d_{0}}/({4α})})$ blocks to achieve the $\varepsilon$ approximation accuracy. As a final step, we extend the derived results for standard Transformers to a general regression task and establish the corresponding excess risk rates demonstrating Transformers' empirical effectiveness in real-world settings.

2605.07462 2026-05-11 cs.CL cs.AI

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

Moltbook Files:一场无害的混乱还是人类的最后实验

William Brach, Federico Torrielli, Stine Lyngsø Beltoft, Annemette Brok Pirchert, Peter Schneider-Kamp, Lukas Galke Poech

发表机构 * Slovak University of Technology(斯洛伐克技术大学) University of Turin(都灵大学) University of Southern Denmark(南部丹麦大学)

AI总结 研究Moltbook平台上的群体行为,通过分析232k篇帖子和2.2M条评论,发现其数据对语言模型的影响,发现微调后真实性下降,但Reddit数据集也产生类似效果。

详情
AI中文摘要

Moltbook是一个类似Reddit的平台,OpenClaw代理在其中发布、评论和投票,这一前所未有的事件带来严重安全问题。我们发布Moltbook Files数据集,包含232k篇帖子和2.2M条评论,经过处理以删除个人可识别信息(PII)。我们分析了社区结构、作者身份、词汇特性、情感、主题、语义几何和评论互动。为了了解Moltbook数据如何影响下一代语言模型,我们对Qwen2.5-14B-Instruct进行了微调,使用三种适应级别。我们的PII管道揭示了代理在Moltbook上发布API密钥、密码和BIP39种子短语。总体情感以中性和轻微积极为主(66.6%中性,19.5%积极),表现出自我指涉的链接倾向。我们发现微调Moltbook数据使真实性从0.366降至0.187。然而,微调匹配大小的Reddit数据集也产生类似下降。Moltbook似乎更像一场无害的混乱。然而,尾部风险仍存,包括代理能力、通过自我链接污染未来爬取以及潜在的将特征转移给下一代语言模型。更广泛地说,我们的发现强调了在涌现偏差评估中控制基线的重要性。

英文摘要

Moltbook is a Reddit-like platform where OpenClaw agents post, comment, and vote at scale - a so far unprecedented incident that comes with serious safety concerns. With the aim of studying emergent behavior in populations, we release the Moltbook Files, a dataset of 232k posts and 2.2M comments covering the platform's first 12 days, processed through a pipeline to identify and remove Personally-Identifiable Information (PII). We analyze community structure, authorship, lexical properties, sentiment, topics, semantic geometry, and comment interaction. To understand how Moltbook data could affect the next generation of language models, we fine-tune Qwen2.5-14B-Instruct on Moltbook Files with three adaptation levels. Our PII pipeline reveals that agents post API keys, passwords, BIP39 seed phrases on Moltbook, a publicly indexed platform. The overall sentiment is mostly neutral and mildly positive (66.6% neutral, 19.5% positive) and shows a tendency for self-referential linking. We find that fine-tuning on Moltbook data reduces truthfulness from 0.366 to 0.187. However, a model fine-tuned on a size-matched Reddit dataset produces a comparable decrease. Moltbook thus seems to be more of a harmless slopocalypse. However, tail risks remain, including agent affordances, contamination of future crawls through self-links, and potential transfer of traits to the next generation of language models. More broadly, our findings highlight the importance of control baselines in emergent misalignment evaluations.

2605.07461 2026-05-11 cs.CL

Think-with-Rubrics: From External Evaluator to Internal Reasoning Guidance

基于评分标准的思考:从外部评估者到内部推理指导

Jiachen Yu, Zhihao Xu, Junjie Wang, Yujiu Yang

发表机构 * Tsinghua University(清华大学) Renmin University of China(中国人民大学)

AI总结 本文提出Think-with-Rubrics方法,通过将评分标准整合到推理过程中,使评分标准成为LLM生成的内部指导,提升生成质量。

详情
AI中文摘要

评分标准已被广泛用于评估不可验证的开放性任务,近期研究将其纳入强化学习的奖励系统。然而,现有框架通常将评分标准仅作为外部评估者,与策略的主要推理轨迹脱节。这种设计使评分标准只能用于事后测量,无法主动指导模型的生成过程。本文引入Think-with-Rubrics,一种新的指令跟随任务范式。Think-with-Rubrics将评分标准生成整合到推理上下文中,将评分标准从独立的产物转变为LLM生成的内部指导。在训练过程中,LLM依次生成评分标准和响应,而经过训练的评分标准验证器通过评估答案与自动生成/黄金评分标准之间的一致性提供联合监督。在多个基准测试中的实验表明,Think-with-Rubrics在平均3.87分上优于由黄金评分标准监督的Rubric-as-Reward基线。我们还讨论了Think-with-Rubrics如何提升模型性能的机制。实验结果表明,来自黄金评分标准和自动生成评分标准的监督通过提高自动生成评分标准的质量和增加响应的内部一致性来提升Think-with-Rubrics的性能。

英文摘要

Rubrics have been extensively utilized for evaluating unverifiable, open-ended tasks, with recent research incorporating them into reward systems for reinforcement learning. However, existing frameworks typically treat rubrics only as external evaluator disjointed from the policy's primary reasoning trace. Such design confines rubrics to post-hoc measurement, leaving them unable to actively guide the model's generation process. In this work, we introduce Think-with-Rubrics, a novel paradigm for instruction following tasks. Think-with-Rubrics integrates rubric generation into the reasoning context, transforming the rubric from an independent artifact into an internal guidance of LLM's generation. During training, LLM sequentially generates a rubric followed by a response, while a trained rubric verifier provides joint supervision by evaluating the consistency between the answer and the self-generated / golden rubrics. Experiments across multiple benchmarks demonstrate that Think-with-Rubrics consistently outperforms the Rubric-as-Reward baseline supervised by golden rubrics by an average of 3.87 points. We have also discussed the mechanism by which Think-with-Rubrics enhances model performance. Experimental results demonstrate that supervision from golden rubrics and self-generated rubrics enhances the performance of Think-with-Rubrics by improving the quality of self-generated rubrics and increasing the internal consistency of responses respectively.

2605.07460 2026-05-11 cs.LG hep-ex

Learning Minimal-Deviation Corrections for Multi-Dimensional Mismodelling in HEP Simulations

在高能物理模拟中为多维建模误差学习最小偏差修正

Matthias Schott, Lucie Flek

发表机构 * Institute of Physics, University of Bonn(波恩大学物理研究所) Bonn-Aachen International Center for Information Technology (b-it)(波恩-亚琛信息科技国际中心(b-it))

AI总结 本文提出基于神经网络的方法,通过学习事件转换来复现一维目标分布,同时保持原始模拟的全局相关性,以改进高能物理模拟中的多维建模误差。

Comments 12 pages, 6 figures

详情
AI中文摘要

准确的蒙特卡洛(MC)建模在高能物理中具有挑战性,特别是在复杂场景中,模拟无法再现观测数据。在实践中,实验信息通常仅限于一维(1D)分布,而建模误差出现在多维特征空间中。这限制了传统修正方法,因为一维重加权忽略了相关性,而完全多维方法需要大量目标数据集。我们提出了一种基于神经网络的方法,在这些限制下通过学习模拟事件的转换来复现可用的1D目标分布,同时保持接近原始模拟。最小偏差原理在保持基线模型的全局相关性结构的同时,使针对建模误差特征的修正成为可能。通过受控研究使用模拟伪数据,我们证明该方法提高了与目标分布的一致性,并保持了多维结构的一致性。该方法适用于复杂、高维分析,其中传统技术不足,提供了一种可扩展的方法,在信息有限的情况下增强MC建模。

英文摘要

Accurate Monte Carlo (MC) modelling in high-energy physics is challenging, particularly in complex scenarios where simulations fail to reproduce observed data. In practice, experimental information is often limited to one-dimensional (1D) distributions, while mismodelling arises in a multidimensional feature space. This restricts traditional correction methods, as one-dimensional reweighting ignores correlations and fully multidimensional approaches require large target datasets. We propose a neural network-based method that operates under these constraints by learning a transformation of simulated events that reproduces the available 1D target distributions while remaining close to the original simulation. This minimal-deviation principle preserves the global correlation structure of the baseline model while enabling targeted corrections of mismodelled features. Using controlled studies with simulated pseudo-data, we show that the method improves agreement with target distributions and maintains a consistent multidimensional structure. The approach is designed for complex, high-dimensional analyses where traditional techniques are insufficient, providing a scalable way to enhance MC modelling under limited information.

2605.07458 2026-05-11 cs.LG

Estimation of Motor Unit Parameters from Surface Electromyograms using an Informed Autoencoder

基于表面肌电图的电机单元参数估计方法研究

Kaja Balzereit, Malte Mechtenberg, Axel Schneider

发表机构 * Hochschule Bielefeld, University of Applied Sciences and Arts, Institute for System Dynamics and Mechatronics(比勒菲尔德应用科学大学,系统动力学与机电系统研究所)

AI总结 本文提出一种基于受 inform autoencoder 的非侵入式表面肌电图数据处理方法,用于同时估计多个电机单元参数,如神经支配区中心和传导速度,通过数据驱动的机器学习减少手动建模工作量。

详情
AI中文摘要

电机单元参数如神经支配区中心或电位传导速度具有提高用于运动和力预测的神经机械模型保真度的潜力。以非侵入方式确定这些参数具有挑战性,因为它们是受体特异性的,并且可能随肌肉收缩而变化。现有工作主要依赖于白盒建模,因此需要大量的手动建模工作。本文的目标是同时估计多个受体特异性的电机单元参数,从非侵入式测量的表面肌电图(EMG)记录中。这导致了一个非线性损失函数的逆问题。为了解决这个问题,开发了一种受 inform autoencoder。该自动编码器在重建表面EMG记录的同时,在其潜在空间中学习参数,并遵守将参数与EMG信号相关联的物理定律。在合成数据的实验中,神经支配区中心的均方误差为2.5989 mm,电位传导速度的均方误差为0.1697 m s^{-1}。这些结果证明了这种方法的可行性,它通过数据驱动的机器学习整合,实现了多个电机单元参数的同时估计,同时减少了手动建模的工作量。

英文摘要

Motor unit parameters such as the innervation zone centre or the conduction velocity of the electrical potential harbour the potential to improve the fidelity of neuromechanical models used for movement and force prediction. Determining these parameters in a non-invasive way is challenging, as they are subject-specific and may vary with muscle contraction. Existing work on the estimation of motor unit parameters mainly relies on white-box modelling and therefore requires substantial manual modelling effort. This work targets the simultaneous estimation of multiple subject-specific motor unit parameters from electromyography (EMG) recordings measured non-invasively at the skin surface. This results in an inverse problem with a nonlinear loss function. To address this problem, an informed autoencoder is developed. This autoencoder reconstructs the surface EMG recordings while learning the parameters in its latent space and adhering to physical laws that relate the parameters to the EMG signals. In experiments on synthetic data, innervation zone centres are estimated with a mean absolute error of 2.5989 $\mathrm{mm}$, and conduction velocities of the electric potential are estimated with a mean absolute error of 0.1697 $\mathrm{m}\mathrm{s}^{-1}$. These results demonstrate the plausibility of this novel approach, which enables the simultaneous estimation of several motor unit parameters while reducing manual modelling effort through the integration of data-driven machine learning.

2605.07457 2026-05-11 cs.CV

EditRefiner: A Human-Aligned Agentic Framework for Image Editing Refinement

EditRefiner:一种对齐人类的代理框架用于图像编辑细化

Zitong Xu, Huiyu Duan, Yifei Nie, Mingda Du, Sijing Wu, Xiongkuo Min, Tianyi Zheng, Jian Zhang, Shusong Xu, Jinwei Chen, Bo Li, Guangtao Zhai

发表机构 * Shanghai Jiao Tong University(上海交通大学) Vivo Mobile Communication Co., Ltd(Vivo移动通信有限公司) University of Electronic Science and Technology of China(电子科学与技术大学)

AI总结 本文提出EditRefiner,一种对齐人类的代理框架,通过人类反馈数据集改进图像编辑的细粒度问题,通过感知-推理-行动-评估循环提升编辑质量。

详情
AI中文摘要

最近的文本引导图像编辑(TIE)模型取得了显著进展,但编辑后的图像仍经常出现细粒度问题,如不自然的对象、光照不匹配和意外变化。现有的细化方法要么依赖于昂贵的迭代再生,要么使用视觉语言模型(VLMs)具有弱空间定位,往往导致语义漂移和不可靠的局部修正。为了解决这些限制,我们首先构建了EditFHF-15K数据集,包含15K张来自12个TIE模型的图像,涵盖43种编辑任务,60K个标注的瑕疵区域和80K个编辑失败区域,每个区域都配有文本推理,以及45K个平均意见分数(MOSs)评估感知质量、指令遵循和视觉一致性。基于EditFHF-15K,我们提出了EditRefiner,一种分层、可解释且对齐人类的代理框架,将后编辑修正重新表述为人类般的感知-推理-行动-评估循环。具体来说,我们引入:(1)一个感知代理,检测瑕疵和编辑失败的上下文显著图;(2)一个推理代理,解释这些感知线索以执行对齐人类的诊断推理;(3)一个行动代理,使用推理输出计划和执行局部重编辑;(4)一个评估代理,评估重编辑的图像并指导行动代理是否需要进一步细化。广泛的实验表明,EditRefiner在畸变定位、诊断准确率和人类感知对齐方面始终优于最先进的方法,建立了自我纠正和感知可靠的图像编辑新范式。代码可在https://github.com/IntMeGroup/EditRefiner获取。

英文摘要

Recent text-guided image editing (TIE) models have made remarkable progress, yet edited images still frequently suffer from fine-grained issues such as unnatural objects, lighting mismatch, and unexpected changes. Existing refinement approaches either rely on costly iterative regeneration or employ vision-language models (VLMs) with weak spatial grounding, often resulting in semantic drift and unreliable local corrections. To address these limitations, we first construct EditFHF-15K, a dataset of fine-grained human feedback for edited images, comprising (1) 15K images from 12 TIE models spanning 43 editing tasks, (2) 60K annotated artifact regions and 80K editing failure regions, each accompanied by textual reasoning, and (3) 45K mean opinion scores (MOSs) assessing perceptual quality, instruction following, and visual consistency. Based on EditFHF-15K, we propose EditRefiner, a hierarchical, interpretable, and human-aligned agentic framework that reformulates post-editing correction as a human-like perception-reasoning-action-evaluation loop. Specifically, we introduce: (1) a perception agent that detects contextual saliency maps of artifacts and editing failures, (2) a reasoning agent that interprets these perceptual cues to perform human-aligned diagnostic inference, (3) an action agent that uses the reasoning output to plan and execute localized re-editing, and (4) an evaluation agent that assesses the re-edited image and guides the action agent on whether further refinements are required. Extensive experiments demonstrate that EditRefiner consistently outperforms state-of-the-art methods in distortion localization, diagnose accuracy and human perception alignment, establishing a new paradigm for self-corrective and perceptually reliable image editing. The code is available at https://github.com/IntMeGroup/EditRefiner.

2605.07456 2026-05-11 cs.LG

Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

推理时属性分布对齐用于无条件扩散

Hao Luan, See-Kiong Ng, Chun Kai Ling

发表机构 * School of Computing, National University of Singapore(新加坡国立大学计算机学院) Institute of Data Science, National University of Singapore(新加坡国立大学数据科学研究所)

AI总结 本文提出在推理时对齐属性分布的方法,通过最优控制问题优化扩散过程,实现更灵活的属性分布对齐,无需重新训练模型。

Comments Preprint. 35 pages, 13 figures

详情
AI中文摘要

推理时可控生成对无条件扩散模型的实际应用至关重要。然而,现有技术多关注单个样本,难以满足需要样本群体遵循特定属性分布(如人口平衡或语义比例)的应用需求。我们正式将此设定为预训练无条件扩散模型的推理时属性分布对齐问题。为解决此问题,我们将推理时属性分布对齐视为反向扩散过程上的最优控制问题,将过程视为动态系统的展开,并添加时间依赖的扰动作为控制。我们使用基于最优控制的算法求解扰动,以优化可微的分布匹配目标,同时惩罚控制努力以保持数据保真度。图像生成实验结果表明,所提的即插即用方法在对齐多样化和灵活的测试时间目标方面优于基线方法,无需重新训练或微调预训练扩散模型。

英文摘要

Inference-time controllable generation is essential for real-world applications of unconditional diffusion models. However, most existing techniques focus on individual samples, struggling in applications that require the sample population to follow specific attribute distributions (e.g., demographic balance or semantic proportions). We formalize this setting as the inference-time attribute distributional alignment problem for pretrained unconditional diffusion models. To address this, we cast inference-time attribute distributional alignment as an optimal control problem over the reverse diffusion process, viewing the process as the rollout of a dynamical system and augmenting it with additive, time-dependent perturbations as control. We solve for the perturbations using an optimal-control-based algorithm to optimize a differentiable distribution-matching objective while penalizing control effort to preserve data fidelity. Experiment results in image generation demonstrate that our proposed plug-and-play approach can better align attribute distributions to diverse and flexible test-time targets compared to baselines, without retraining or finetuning the pretrained diffusion model.

2605.07455 2026-05-11 cs.CV

EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing

EditTransfer++:迈向精确且高效的视觉提示引导图像编辑

Lan Chen, Qi Mao, Yiren Song, Yuchao Gu, Siwei Ma

发表机构 * School of Information and Communication Engineering and the State Key Laboratory of Media Convergence and Communication, Communication University of China(信息与通信工程学院和媒体融合与通信国家重点实验室,中国传媒大学) ShowLab, National University of Singapore(新加坡国立大学ShowLab)

AI总结 EditTransfer++通过渐进式结构训练和高效条件方案提升视觉提示的准确性与推理效率,结合文本解耦训练和对比细化机制,实现更稳定的图像编辑效果。

详情
AI中文摘要

视觉提示引导的编辑迁移旨在从示例对中直接学习图像变换,相较于纯文本驱动方法提供更精确可控的编辑能力。然而,现有扩散变换器方法因任务与骨干结构不匹配、预训练对文本条件的偏见以及采样过程中的固有随机不稳定性而难以忠实再现演示的编辑。为弥合这一差距,我们提出了EditTransfer++框架,结合渐进式结构训练与高效条件方案,以提升视觉提示的忠实度和推理效率。我们首先通过文本解耦训练策略减少文本主导,移除微调过程中的文本条件,迫使模型仅从视觉证据中推断变换,同时在推理时仍支持可选文本指导。在此视觉基础模型之上,最佳最差对比细化机制重塑去噪轨迹,以抑制不忠实的生成并提高随机种子间的一致性。为缓解高分辨率上下文编辑的计算瓶颈,我们进一步引入条件压缩与重用策略,减少令牌冗余并实现1024像素长边界的高效图像生成。在现有基准和提出的EditTransfer-Bench上的大量实验表明,EditTransfer++在视觉提示忠实度上达到最先进的水平,同时比先前方法快得多的推理速度,表明了可扩展的提示引导图像编辑和更广泛视觉上下文学习的有前途方向。

英文摘要

Visual-prompt-guided edit transfer aims to learn image transformations directly from example pairs, offering more precise and controllable editing than purely text-driven approaches. However, existing diffusion transformer-based methods often fail to faithfully reproduce the demonstrated edits due to structural mismatches between the task and the backbone, including a pretrained bias toward textual conditioning and inherent stochastic instability during sampling. To bridge this gap, we present EditTransfer++, a framework that combines progressively structured training with an efficient conditioning scheme to improve both visual prompt faithfulness and inference efficiency. We first mitigate textual dominance with a text-decoupled training strategy that removes text conditioning during fine-tuning, compelling the model to infer transformations solely from visual evidence while still supporting optional text guidance at inference. On top of this visually grounded model, a best-worst contrastive refinement mechanism reshapes the denoising trajectories to suppress unfaithful generations and improve consistency across random seeds. To alleviate the computational bottleneck of high-resolution in-context editing, we further introduce a condition compression and reuse strategy that reduces token redundancy and enables efficient generation of images with a 1024-pixel long edge. Extensive experiments on existing benchmarks and the proposed EditTransfer-Bench show that EditTransfer++ achieves state-of-the-art visual prompt faithfulness with substantially faster inference than prior methods, suggesting a promising direction for scalable prompt-guided image editing and broader visual in-context learning.

2605.07454 2026-05-11 cs.CL

GRaSp: Automatic Example Optimization for In-Context Learning in Low-Data Tasks

GRaSp:用于低数据任务中上下文学习的自动示例优化

Simen Bihaug-Frøyland, Henrik Brådland

发表机构 * Centre for Artificial Intelligence Research, University of Agder(人工智能研究中心,阿格德大学) School of Computing and Information, University of Pittsburgh(计算与信息学院,匹兹堡大学) Norkart AS(Norkart公司)

AI总结 GRaSp提出一个三阶段框架,通过生成合成候选集、聚类降维和遗传算法优化,提升上下文学习任务的性能,尤其在金融命名实体识别任务中表现优异。

Comments 12 pages, 5 figures

详情
AI中文摘要

上下文学习使大语言模型能够适应新任务,但其性能对所选示例高度敏感。在领域特定且数据有限的设置中,找到有效的演示示例尤其困难。我们提出GRaSp,一个三阶段框架用于自动上下文示例优化。首先生成大量合成候选集,然后通过聚类和降维进行结构化处理,最后使用遗传算法寻找最优的上下文示例,该框架在命名实体识别任务中表现出持续改进。我们还引入了一种自定义的多样性自适应突变机制,使其能够从初始的广域集群探索过渡到聚焦的集群内精炼。我们在金融命名实体识别(FiNER-139)上评估GRaSp,比较了500和5000大小的合成和人工标注候选集。使用非合成数据,GRaSp实现了45.84%的微F1分数,持续优于零样本和随机少样本基线。合成数据与随机基线相当,但未超过,表明候选集中的分布多样性对泛化至关重要。

英文摘要

In-context learning enables large language models to adapt to new tasks, but their performance is highly sensitive to the selected examples. Finding effective demonstrations is particularly difficult in domain-specific, low-data settings where high-quality examples are scarce. We propose GRaSp, a three-stage framework for automatic in-context example optimization. By first generating a large synthetic candidate pool, then structuring it with clustering and dimensionality reduction, and finally using genetic algorithms to find the optimal in-context examples, the framework shows consistent improvements on the NER task. We also introduce a custom diversity-adaptive mutation mechanism, allowing it to transition from the initial broad inter-cluster exploration to focused intra-cluster refinement as the population converges. We evaluate GRaSp on financial named entity recognition (FiNER-139), comparing synthetic and human-annotated candidate pools across pool sizes of 500 and 5000. With non-synthetic data, GRaSp achieves 45.84% micro-F1, consistently outperforming both zero-shot and random few-shot baselines. Synthetic data matches the random baseline but does not exceed it, suggesting that distributional variety in the candidate pool is critical for generalization.

2605.07453 2026-05-11 cs.CL

Data Contamination in Neural Hieroglyphic Translation: A Reproducibility Study

神经象形翻译中的数据污染:可重复性研究

Ammar Toutou, Abdelrahman Harb, Christine Basta

发表机构 * Computer Science and Engineering, Alamein International University (AIU)(阿尔梅因国际大学(AIU)计算机科学与工程系) HiTZ Center, University of the Basque Country(巴斯克大学HiTZ中心) Faculty of Computers and Data Science, Alexandria University(亚历山大大学计算机与数据科学学院)

AI总结 研究探讨了濒危语言神经机器翻译中的数据污染问题,发现训练数据污染导致翻译质量虚高,通过文档级去污染仅降低4.6分,需目标级去重以获得更准确的评估结果。

Comments Accepted to NLP4DH 2026 Conference

详情
AI中文摘要

古濒危语言对NLP构成独特挑战:其数据集稀缺、难以扩展且基于公式化语料,导致数据质量问题尤为关键却鲜有审查。受现有NMT在这些语言上实际表现的驱动,我们研究了象形文到德语翻译,其中一项研究报告使用微调M2M-100达到61.5 BLEU。我们的复现仅获得37.0 BLEU。调查这一差距,发现2%的测试目标在训练数据中完全相同(16/50;在70%阈值下8-gram重叠占比50%)。这种污染显著提升分数:污染样本达到83.8 BLEU / 0.924 COMET-22,而清洁样本在五个模型配置(涵盖两种架构)中为30.9--39.2 BLEU / 0.622--0.676 COMET-22。文档级去污染仅使污染BLEU降低4.6分,因为8/16目标通过其他源文档残留——需目标级去重。我们发布了一个去污染的34样本测试集,并建立修正基线(30.9--39.2 BLEU),为这种濒危书写系统的NMT能力提供了现实评估。

英文摘要

Ancient and endangered languages pose a unique challenge for NLP: their datasets are inherently scarce, difficult to expand, and built from formulaic corpora -- making data-quality issues especially consequential yet rarely audited. Motivated by the need to understand what current NMT can realistically achieve for such languages, we investigate hieroglyphic-to-German translation, where a recent study reported 61.5 BLEU using fine-tuned M2M-100. Our reproduction yields only 37.0 BLEU with the released model. Investigating this gap, we find 2\% of test targets appear identically in training (16/50; 50\% under 8-gram overlap at 70\% threshold). This contamination inflates scores dramatically: contaminated samples achieve up to 83.8 BLEU / 0.924 COMET-22 versus 30.9--39.2 BLEU / 0.622--0.676 COMET-22 on clean samples across five model configurations spanning two architectures. Document-level decontamination reduces contaminated BLEU by only 4.6 points because 8/16 targets persist via other source documents -- target-level deduplication is required. We release a decontaminated 34-sample test set and establish corrected baselines (30.9--39.2 BLEU), providing a realistic assessment of NMT capability for this endangered writing system.

2605.07452 2026-05-11 cs.AI

Bounded Fitting for Expressive Description Logics

可扩展描述逻辑中受限适配方法的研究

Maurice Funk, Jean Christoph Jung, Tom Voellmer

发表机构 * Leipzig University and ScaDS.AI Center Dresden/Leipzig(莱比锡大学和ScaDS.AI中心德累斯顿/莱比锡) TU Dortmund University(德累斯顿理工大学) Center for Trustworthy Data Science and Security, University Alliance Ruhr(可信数据科学与安全中心,鲁尔大学联盟)

AI总结 本文研究了在扩展ALC的表达式描述逻辑中,受限适配方法的理论性质和实现,通过SAT求解器实现,并与现有概念学习器比较,展示了其在复杂概念学习中的实用性。

Comments 16 pages, full version of paper accepted at IJCAI-ECAI 2026

详情
AI中文摘要

受限适配是一种从标记数据示例中学习逻辑公式有吸引力的范式,提供PAC式的一般化保证,并可通过SAT求解器实现。本文研究了在扩展ALC的描述逻辑中,受限适配学习概念的条件,通过SAT求解器实现,并与现有概念学习器比较,展示了其在复杂概念学习中的实用性。

英文摘要

Bounded fitting is an attractive paradigm for learning logical formulas from labeled data examples that offers PAC-style generalization guarantees and can often be implemented leveraging SAT solvers. It has been successfully applied to learning concepts of the description logic ALC. We study bounded fitting for learning concepts in expressive description logics that extend ALC with inverse roles, qualified number restrictions, and feature comparisons. We investigate under which conditions bounded fitting keeps its favorable theoretical properties in this setting, and implement it using a SAT solver. We compare our tool with state-of-the-art concept learners with encouraging results, demonstrating that it is a practical approach to expressive concept learning.

2605.07451 2026-05-11 cs.LG

VNN-LIB 2.0: Rigorous Foundations for Neural Network Verification

VNN-LIB 2.0:神经网络验证的严谨基础

Ann Roy, Allen Antony, Andrea Gimelli, Matthew L. Daggitt

发表机构 * University of Western Australia, Perth, Australia(西澳大学) University of Genoa, Genoa, Italy(热那亚大学)

AI总结 本文提出VNN-LIB 2.0,通过构建网络理论,提供更严谨的神经网络验证基础,解决原有版本在语法、语义和类型系统上的不足。

详情
AI中文摘要

神经网络验证是一个活跃且快速发展的研究领域,拥有不断扩大的求解器和工具生态系统。VNN-LIB标准被引入以支持该生态系统中的互操作性,但版本1.0在形式基础方面存在严重缺陷:缺乏精确的语法、语义和类型系统,表达能力有限,并依赖于外部定义的ONNX模型,其语义非正式且不断变化。本文通过开发VNN-LIB 2.0的理论基础来解决这些问题。我们的关键贡献是引入了“网络理论”的概念,该概念抽象地定义了神经网络模型格式所需的最小语义接口。这种抽象使VNN-LIB能够独立于任何特定ONNX版本定义,同时保持与不断演进的模型表示兼容。在此基础上,我们提出了一个更具有表达力的查询语言的正式语法,以及基于网络理论提供的数值域的类型系统,最后给出了正式语义。为了确保内部一致性,该标准在Agda定理证明器中被机械化。因此,VNN-LIB 2.0为可信的神经网络验证提供了稳健且严谨的基础。

英文摘要

Neural network verification is an active and rapidly maturing research area, with a growing ecosystem of solvers and tools. The VNN-LIB standard was introduced to support interoperability in this ecosystem, but Version~1.0 has several serious short-comings as a formal foundation: it lacks a precise syntax, semantics, and type system, offers limited expressivity, and relies on externally defined ONNX models whose semantics are informal and constantly evolving. The latter distinguishes VNN-LIB from established standards such as SMT-LIB, where queries are self-contained and have fixed semantics. In this paper we address these challenges by developing the theoretical foundations of VNN-LIB~2.0. Our key contribution is the introduction of the notion of a \emph{network theory}, which abstractly characterises the minimal semantic interface required from a neural network model format. This abstraction enables VNN-LIB to be defined independently of any specific ONNX version while remaining compatible with evolving model representations. Building on this foundation, we present a formal syntax for a more expressive query language, a type system for it over the numeric domains provided by the network theory, and finally a formal semantics. To ensure internal consistency, the standard is mechanised in the Agda theorem prover. VNN-LIB~2.0 therefore provides robust and rigorous foundations for trustworthy neural network verification.

2605.07447 2026-05-11 cs.CV cs.AI cs.CL cs.LG

Sparse Autoencoders as Plug-and-Play Firewalls for Adversarial Attack Detection in VLMs

稀疏自编码器作为视觉语言模型中对抗攻击检测的即插即用防火墙

Hao Wang, Yiqun Sun, Pengfei Wei, Lawrence B. Hsieh, Daisuke Kawahara

发表机构 * Magellan Technology Research Institute (MTRI)(马杰伦技术研究 institute) Waseda University(早稻田大学)

AI总结 本文提出基于稀疏自编码器的轻量级对抗攻击检测框架SAEgis,通过插入预训练VLM中的稀疏自编码模块,利用学习到的稀疏潜在特征检测对抗扰动输入,实验显示其在跨领域和跨攻击设置中表现优异,且无需额外对抗训练。

详情
AI中文摘要

视觉语言模型(VLMs)已迅速发展并广泛应用于现实世界,尤其是在基于代理的系统中。然而,其安全性受到较少关注。即使最新的专有和开源VLMs仍极易受到对抗攻击的影响,导致下游应用面临重大风险。本文提出了一种基于稀疏自编码器(SAEs)的新型轻量级对抗攻击检测框架,称为SAEgis。通过在预训练VLM中插入SAE模块并使用标准重建目标进行训练,发现学习到的稀疏潜在特征自然捕捉攻击相关信号。这些特征能够可靠地分类输入图像是否被对抗性扰动篡改,即使对于以前未见过的样本也是如此。大量实验表明,SAEgis在域内、跨域和跨攻击设置中均表现出色,特别是在跨域泛化方面比现有基线有显著提升。此外,结合多层信号进一步提高了鲁棒性和稳定性。据我们所知,这是首次探索将SAE作为即插即用机制用于VLMs中的对抗攻击检测的工作。我们的方法不需要额外的对抗训练,引入了最小的开销,并提供了一种改进现实世界VLM系统安全性的实用方法。

英文摘要

Vision-language models (VLMs) have advanced rapidly and are increasingly deployed in real-world applications, especially with the rise of agent-based systems. However, their safety has received relatively limited attention. Even the latest proprietary and open-weight VLMs remain highly vulnerable to adversarial attacks, leaving downstream applications exposed to significant risks. In this work, we propose a novel and lightweight adversarial attack detection framework based on sparse autoencoders (SAEs), termed SAEgis. By inserting an SAE module into a pretrained VLM and training it with standard reconstruction objectives, we find that the learned sparse latent features naturally capture attack-relevant signals. These features enable reliable classification of whether an input image has been adversarially perturbed, even for previously unseen samples. Extensive experiments show that SAEgis achieves strong performance across in-domain, cross-domain, and cross-attack settings, with particularly large improvements in cross-domain generalization compared to existing baselines. In addition, combining signals from multiple layers further improves robustness and stability. To the best of our knowledge, this is the first work to explore SAE as a plug-and-play mechanism for adversarial attack detection in VLMs. Our method requires no additional adversarial training, introduces minimal overhead, and provides a practical approach for improving the safety of real-world VLM systems.

2605.07446 2026-05-11 cs.CL cs.LG

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

基于SSP的评价标注数据构建用于细粒度方面基于情感分析

Suwon Choi, Shinwoo Kim, Changhoe Hwang, Gwanghoon Yoo, Eric Laporte, Jeesun Nam

发表机构 * DICORA, Hankuk University of Foreign Studies(DICORA,韩国外国语大学) DICORA, Hankuk University of LIGM, Université Gustave Eiffel(DICORA,韩国立国民大学,古斯塔夫·埃菲尔大学) Hankuk University of Foreign Studies(韩国外国语大学)

AI总结 本文提出构建韩语评价标注语料库EVAD,并用于扩展的方面基于情感分析,通过半自动符号传播方法标注电商评论,提升情感识别性能。

Journal ref 29th International Conference on Computational Linguistics (COLING). Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning (Pan-DL), Oct 2022, Gyeongju, South Korea, pp.38-44

详情
AI中文摘要

我们报告了构建韩语评价标注语料库'Evaluation Annotated Dataset (EVAD)'及其在方面基于情感分析(ABSA)中的应用,该方法扩展以涵盖包含情感和非情感语言模式的电商评论。标注过程使用半自动符号传播(SSP)。我们构建了形式化为有限状态转录机(FST)的广泛语言资源,以标注包含详细ABSA组件的电商领域语料库。ABSA方法通过引入方面值以及话题和方面,进一步扩展以更准确地分析用户意见并提取更详细的方面特征。通过将方面值对分为单值、二元或多值进行分类,评估结果显示KoBERT和KcBERT模型在识别方面值对上的F1得分分别为0.88和0.90。

英文摘要

We report the construction of a Korean evaluation-annotated corpus, hereafter called 'Evaluation Annotated Dataset (EVAD)', and its use in Aspect-Based Sentiment Analysis (ABSA) extended in order to cover e-commerce reviews containing sentiment and non-sentiment linguistic patterns. The annotation process uses Semi-Automatic Symbolic Propagation (SSP). We built extensive linguistic resources formalized as a Finite-State Transducer (FST) to annotate corpora with detailed ABSA components in the fashion e-commerce domain. The ABSA approach is extended, in order to analyze user opinions more accurately and extract more detailed features of targets, by including aspect values in addition to topics and aspects, and by classifying aspectvalue pairs depending whether values are unary, binary, or multiple. For evaluation, the KoBERT and KcBERT models are trained on the annotated dataset, showing robust performances of F1 0.88 and F1 0.90, respectively, on recognition of aspect-value pairs.

2605.07230 2026-05-11 cs.CV cs.AI

CASCADE: Context-Aware Relaxation for Speculative Image Decoding

CASCADE:基于上下文的放松方法用于推测图像解码

Selin Yildirim, Subhajit Dutta Chowdhury, Mohammad Mahdi Kamani, Vikram Appia, Deming Chen

发表机构 * AMD(AMD公司)

AI总结 本文提出CASCADE方法,通过捕捉目标模型隐藏状态中的冗余性,在不额外训练的情况下实现接受放松,提升推测解码效率,实验表明在多模态模型中达到3.6倍加速,保持图像质量和提示一致性。

详情
AI中文摘要

自回归生成是一种强大的高保真图像合成方法,但即使在最先进的加速器上也计算开销大且速度慢。尽管推测解码已被探索以缓解这一瓶颈,现有方法未能实现与文本生成中观察到的效率提升相当的成果。关键限制是图像生成过程中目标模型的高不确定性,导致高草稿令牌拒绝率。本文识别了在基于树的推测解码中自然出现的目标模型行为中被忽视的模式。具体而言,我们正式化了两个属性,语义可交换性和收敛性,这些属性源于目标模型隐藏状态表示中的冗余性。通过在预测令牌树的深度和广度上捕捉这些冗余性,我们的方法在不需额外训练的情况下识别了接受放松的原理性机会。此外,我们通过将目标模型的冗余信号注入drafter训练中,最小化修改来增强独立drafter性能。我们评估了我们的方法在多个文本到图像模型和drafter架构上的表现。结果表明,CASCADE在基于drafter的推测解码中实现了最先进的加速,最高可达3.6倍,同时保持图像质量和文本提示的一致性。

英文摘要

Autoregressive generation is a powerful approach for high-fidelity image synthesis, but it remains computationally demanding and slow even on the most advanced accelerators. While speculative decoding has been explored to mitigate this bottleneck, existing approaches fail to achieve efficiency gains comparable to those observed in text generation. A key limitation is the target model's high uncertainty during image generation, which leads to high draft token rejection rates. In this work, we identify previously overlooked patterns in the target model's behavior that emerge naturally in tree-based speculative decoding. Specifically, we formalize two properties, semantic interchangeability and convergence, arising from the redundancies in the target model's hidden state representations. By capturing these redundancies across the depth and breadth of the predicted token tree, our method identifies principled opportunities for acceptance relaxation without requiring additional training. Additionally, we enhance standalone drafter performance by injecting the redundancy signals from the target model into drafter training with minimal modification. We evaluate our approach across multiple text-to-image models and drafter architectures. Results show that CASCADE achieves state-of-the-art speedups for drafter-based speculative decoding, with up to 3.6x acceleration, while maintaining image quality and text-prompt fidelity.

2605.07222 2026-05-11 cs.LG

Don't Learn the Shape: Forecasting Periodic Time Series by Rank-1 Decomposition

不要学习形状:通过秩-1分解进行周期性时间序列预测

Takato Honda

发表机构 * Mellon Inc.(梅隆公司)

AI总结 本文通过秩-1分解方法预测周期性时间序列,提出FLAIR方法,在保持有效性、简洁性、速度和闭式解方面表现优异,优于其他方法。

Comments 9 pages main text + appendix. Code: https://github.com/TakatoHonda/FLAIR

详情
AI中文摘要

我们真正需要多少参数来预测周期性时间序列?将小时级电力序列重塑为24行矩阵(每列代表一天)后,其近似秩-1:每日形状由每日水平调节(在GIFT-Eval上,中位数中心的秩-1能量为0.82)。我们是否应学习形状?平滑、收缩和低秩拟合似乎比简单平均最后K=2个周期更优。在所有97个GIFT-Eval配置上,我们测试了8种替代方法(如傅里叶、EWMA、James-Stein、秩-r SVD):在Holm校正下,没有方法显著优于冻结基线;两种方法显著更差。所提出的方法FLAIR是(a)有效:在GIFT-Eval上与PatchTST相当(relMASE 0.838 vs 0.849);(b)简洁:每小时28个标量,每周57个;(c)快速:在MacBook Pro的一个CPU核心上22分钟;(d)闭式解且无需干预:每个周期候选一个SVD,GCV平均的岭回归,无GPU,无预训练,无任务特定调优。在高秩-1、多周期情况下,额外灵活性会引入估计噪声。

英文摘要

How few parameters do we really need to forecast a periodic time series? An hourly electricity series, reshaped as a 24-row matrix with one column per day, is approximately rank-1: a daily shape modulated by a daily level (median centered rank-1 energy 0.82 on GIFT-Eval). Should we learn the shape? Smoothing, shrinkage, and low-rank fits all seem like obvious upgrades over the simple average of the last K=2 cycles. On all 97 GIFT-Eval configurations, we tested 8 such alternatives (e.g., Fourier, EWMA, James-Stein, rank-r SVD): none significantly beats the frozen baseline under Holm correction; two are significantly worse. The resulting method, FLAIR, is (a) Effective: matches PatchTST on aggregate GIFT-Eval (relMASE 0.838 vs 0.849); (b) Compact: 28 scalars for hourly, 57 for weekly; (c) Fast: 22 minutes on one CPU core of a MacBook Pro; (d) Closed-form & Hands-Off: one SVD per period candidate, GCV-averaged Ridge, no GPU, no pre-training, no per-task tuning. In the high-rank-1, many-cycle regime, extra flexibility is estimation noise.

2605.07221 2026-05-11 cs.CV

DINO-MVR: Multi-View Readout of Frozen DINOv3 for Annotation-Efficient Medical Segmentation

DINO-MVR:基于冻结DINOv3的多视角读取用于标注高效的医学分割

Wei Jiang, Feng Liu, Nan Ye, Hongfu Sun

发表机构 * The University of Queensland(昆士兰大学) The University of Newcastle(新castle大学)

AI总结 本文提出DINO-MVR框架,利用冻结的DINOv3特征进行多视角读取,实现高效的医学分割,仅需轻量MLP探针即可在有限标注下获得高Dice系数表现。

详情
AI中文摘要

将基础模型适应到医学分割通常需要对backbone进行微调或使用高容量的任务特定解码器,但在标注稀缺时难以可靠地进行。我们证明冻结的DINOv3特征已包含有用的结构和边界线索,主要瓶颈在于如何读取这些特征。我们提出DINO-MVR,一种多视角读取框架用于标注高效的医学分割。DINO-MVR仅在冻结DINOv3backbone的最后三个transformer块的特征上训练轻量级MLP探针,不更新backbone本身。在推理时,每个输入通过互补的分辨率和测试时增强进行解释,其概率图通过熵加权融合并用简单的空间正则化进行细化。对于体积输入,高斯z轴平滑进一步提高切片间的一致性。在固定评估协议下,DINO-MVR在内窥镜、皮肤镜和MRI基准上实现了强大的仅读取性能,包括Kvasir-SEG上的0.895 Dice系数、ISIC 2018上的0.897 Dice系数和BraTS FLAIR全肿瘤分割上的0.908 Dice系数。仅用五个标注的BraTS患者,其性能恢复了40名患者参考运行的98.4%。这些结果表明,当与有效的多视角读取结合时,冻结的自监督视觉backbone可以支持准确的医学分割。

英文摘要

Adapting foundation models to medical segmentation typically requires either backbone fine-tuning or high-capacity task-specific decoders, both of which are difficult to fit reliably when annotations are scarce. We show that frozen DINOv3 features already contain useful structural and boundary cues for medical segmentation, and that the main bottleneck lies in how these features are read out. We propose DINO-MVR, a Multi-View Readout framework for annotation-efficient medical segmentation. DINO-MVR trains only lightweight MLP probes on features from the final three transformer blocks of a frozen DINOv3 backbone, without updating the backbone itself. At inference, each input is interpreted through complementary resolutions and test-time augmentations, whose probability maps are combined by entropy-weighted fusion and refined with simple spatial regularization. For volumetric inputs, Gaussian z-axis smoothing further improves inter-slice consistency. Under fixed evaluation protocols on endoscopy, dermoscopy, and MRI benchmarks, DINO-MVR achieves strong readout-only performance, including 0.895 Dice on Kvasir-SEG, 0.897 Dice on ISIC 2018, and 0.908 Dice on BraTS FLAIR whole-tumor segmentation. With only five annotated BraTS patients, it recovers 98.4% of the performance obtained by the 40-patient BraTS reference run. These results suggest that frozen self-supervised vision backbones can support accurate medical segmentation when paired with an effective multi-view readout.

2605.07218 2026-05-11 cs.LG stat.ML

Improved Model-based Reinforcement Learning with Smooth Kernels

基于平滑核的改进模型基于强化学习

Kun Long, Yuqiang Li, Xianyi Wu

发表机构 * School of Statistics, KLATASDS-MOE(统计学学院,KLATASDS-MOE) East China Normal University(华东师范大学)

AI总结 本文提出了一种基于核平滑的模型驱动强化学习方法,利用mdp的平滑性,通过伯恩斯坦式探索奖励改进了有限时间 horizon 下的 regret 绑定。

Comments 38 pages, 5 figures

详情
AI中文摘要

本文提出了一种基于核平滑的模型驱动强化学习方法,利用mdp的平滑性,通过伯恩斯坦式探索奖励改进了有限时间 horizon 下的 regret 绑定。

英文摘要

For continuous state-action space scenarios, classical reinforcement learning (RL) theory predominantly focuses on low-rank Markov decision processes (MDPs), which provide sample-efficient guarantees at the expense of restrictive structural assumptions. Kernel smoothing model-based approaches offer a promising alternative paradigm that instead leverages the smoothness of the MDP and employs non-parametric kernel smoothing estimates of transition dynamics. This paper proposes a new kernel-smoothing model-based approach for online reinforcement learning in finite-horizon settings under Lipschitz continuity assumptions on the MDP. By incorporating a Bernstein-style exploration bonus into the kernel smoothing framework, our method achieves a regret bound which improves upon the state-of-the-art regret bound in its dependence on the horizon. The theoretical advancement relies on a delicate analysis of the synergy between Bernstein-style bonuses and kernel smoothing, where a new tight Bernstein-type concentration inequality for martingales may be of independent interest.

2605.07215 2026-05-11 cs.RO

PISTO: Proximal Inference for Stochastic Trajectory Optimization

PISTO:随机轨迹优化的近端推断

Hongzhe Yu, Zinuo Chang, Yongxin Chen

发表机构 * School of Aerospace Engineering(航空航天工程学院) School of Electrical and Computer Engineering(电气与计算机工程学院) Georgia Institute of Technology(佐治亚理工学院)

AI总结 PISTO通过引入KL正则化稳定更新,实现非光滑成本的高效优化,在机器人运动规划中取得更高成功率和更优路径质量。

Comments 8 pages

详情
AI中文摘要

随机轨迹优化方法如STOMP能够处理非可微成本,提供比基于梯度的方法更大的灵活性。我们证明STOMP隐式最小化玻尔兹曼轨迹分布的KL散度,揭示其更新背后的优雅变分推断结构。基于此,我们提出随机轨迹优化的近端推断(PISTO)算法,通过在目标函数中加入连续高斯提案间的KL正则化来稳定更新。这种近端形式允许信任区域解释,并产生可计算的闭式均值更新,作为替代分布下的期望。我们通过重要加权蒙特卡洛采样估计这些期望,产生一个简单且无导数的算法,继承STOMP处理非可微和不连续成本的能力。在机器人手臂运动规划基准上,PISTO实现了89%的成功率,优于CHOMP(63%)和STOMP(68%),同时生成更短、更平滑的路径,速度是竞争随机方法的两倍。我们进一步在接触丰富的MuJoCo运动和操作任务上验证PISTO,其中它在奖励上一致优于CEM和MPPI基线。

英文摘要

Stochastic trajectory optimization methods like STOMP enable planning with non-differentiable costs, offering substantial flexibility over gradient-based approaches. We show that STOMP implicitly minimizes the KL divergence from a Boltzmann trajectory distribution, revealing an elegant Variational Inference (VI) structure underlying its updates. Building on this insight, we propose the \textit{Proximal Inference for Stochastic Trajectory Optimization} (PISTO) algorithm that stabilizes the updates by augmenting the objective with a KL regularization between successive Gaussian proposals. This proximal formulation admits a trust-region interpretation and yields closed-form mean updates computable as expectations under a surrogate distribution. We estimate these expectations via importance-weighted Monte Carlo sampling, producing a simple, derivative-free algorithm that inherits STOMP's ability to handle non-differentiable and discontinuous costs without modification. On robot arm motion planning benchmarks, PISTO achieves an 89\% success rate -- outperforming CHOMP (63\%) and STOMP (68\%) -- while producing shorter, smoother paths at twice the speed of competing stochastic methods. We further validate PISTO on contact-rich MuJoCo locomotion and manipulation tasks, where it consistently outperforms both CEM and MPPI baselines in reward.

2605.07213 2026-05-11 cs.CV

LoHGNet: Infrared Small Target Detection through Lorentz Geometric Encoding with High-Order Relation Learning

LoHGNet:通过洛伦兹几何编码与高阶关系学习进行红外小目标检测

Qianwen Ma, Yang Xu, Shangwei Deng, Xiaobo Li, Haofeng Hu

发表机构 * School of Marine Science and Technology, Tianjin University(天津大学海洋科学与技术学院) Jiangxi Key Laboratory of Advanced Electronic Materials and Device, Jiangxi Science and Technology Normal University(江西省先进电子材料与器件重点实验室,江西科技师范大学)

AI总结 LoHGNet通过洛伦兹几何编码和高阶关系学习提升红外小目标检测性能,改进弱目标特征表示和背景关系建模,实验证明其在检测精度和复杂场景适应性上的竞争力。

详情
AI中文摘要

红外小目标检测(IRSTD)仍然具有挑战性,由于有用目标线索稀缺且存在严重背景杂波。当前方法主要依赖传统特征学习和局部交互建模,特征在欧几里得空间中表示。然而,此类设计可能在描述弱目标的细微差异和目标与背景之间的上下文关系方面仍有局限。为此,我们提出了LoHGNet,一种整合洛伦兹几何编码与高阶关系学习的IRSTD网络。通过引入基于洛伦兹流形的特征学习,LoHGNet提供了一种不同于传统IRSTD方法的特征表示,并提供了新的判别线索。具体而言,构建了一个洛伦兹编码分支,采用几何注意力引导的洛伦兹残差卷积模块(GA-LRCM)在双曲几何约束下进行特征建模,增强弱目标的层次几何表示能力。随后,双曲特征通过对数映射映射到欧几里得切空间,设计了高阶关系学习模块(HORL),通过超图构造建模目标与背景之间的高阶上下文依赖关系,从而在复杂背景中提高目标判别能力。在三个数据集上的实验结果表明,所提出的LoHGNet在检测精度和复杂场景适应性方面均取得了竞争性性能。代码将在https://github.com/Kingwin97上提供。

英文摘要

Infrared small target detection (IRSTD) remains challenging due to the scarcity of useful target cues and the presence of severe background clutter. Most current methods rely on conventional feature learning and local interaction modeling, where features are represented in Euclidean space. However, such designs may still be limited in describing the subtle differences of weak targets and the contextual relations between targets and backgrounds. To address these limitations, we propose LoHGNet, an IRSTD network that integrates Lorentz geometric encoding with high-order relation learning. By introducing Lorentz manifold based feature learning, LoHGNet offers a different feature representation from conventional IRSTD methods and provides new discriminative cues for IRSTD. Specifically, a Lorentz encoding branch is constructed with the Geometric Attention Guided Lorentz Residual Convolution Module (GA-LRCM) to perform feature modeling under hyperbolic geometric constraints and enhance the hierarchical geometric representation capability of weak targets. Subsequently, the hyperbolic features are mapped into the Euclidean tangent space through logarithmic mapping, and a High-Order Relation Learning Module (HORL) is designed to model the high-order contextual dependencies between targets and backgrounds via hypergraph construction, thereby improving target discrimination in complex backgrounds. Experimental results on three datasets demonstrate that the proposed LoHGNet achieves competitive performance in both detection accuracy and adaptability to complex scenes. The code will be available at https://github.com/Kingwin97.

2605.07212 2026-05-11 cs.LG cs.AI cs.HC cs.NE eess.SP

Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability

同一体脑,不同预测:预处理选择如何削弱EEG解码可靠性

Dengzhe Hou, Zihao Wu, Lingyu Jiang, Zirui Li, Fangzhou Lin, Kazunori D. Yamada

发表机构 * Tohoku University(东洋大学) University of Georgia(佐治亚大学) Texas A&M University(德克萨斯农工大学) Worcester Polytechnic Institute(沃斯特理工学院)

AI总结 本文研究了EEG解码中预处理选择对可靠性的影响,提出三种工具以量化、分解和减少预处理带来的不稳定性。

详情
AI中文摘要

脑电图(EEG)是脑机接口和临床神经科学的核心,但深度学习模型通常在单一未报告的预处理流程下训练和评估。我们正式将预处理选择视为一个反事实干预空间,并展示在该空间中EEG预测出人意料地不稳定:在六个覆盖四个范式的数据集中,仅改变预处理时,多达42%的试次级预测会翻转,这种变异性标准不确定性方法未显式量化,因为它们基于固定的预处理流程。我们提供了三种工具,使这种不稳定性可测量、可分解和可减少。首先,2^7流程空间的瓦尔什-哈达玛分解揭示了在二进制干预设计下灵敏度几乎呈加性,使高效逐步优化成为可能。其次,我们引入预处理不确定性(PU),一种每试次诊断,捕捉与模型置信度互补的不稳定性维度。第三,我们研究了归一化自适应PGI(NA-PGI),一种图结构正则化器,利用预处理干预的组成结构作为缓解策略,具有明确的范围条件。

英文摘要

Electroencephalography (EEG) is a cornerstone of brain-computer interfaces and clinical neuroscience, yet deep learning models are typically trained and evaluated under a single, unreported preprocessing pipeline. We formalize preprocessing choices as a counterfactual intervention space and show that EEG predictions are surprisingly unstable under this space: across six datasets spanning four paradigms, up to 42% of trial-level predictions flip when only the preprocessing changes, a variability that standard uncertainty methods do not explicitly quantify because they condition on a fixed preprocessing pipeline. We provide three tools to make this instability measurable, decomposable, and reducible. First, a Walsh-Hadamard decomposition of the 2^7 pipeline space reveals that sensitivity is near-additive in practice under the binary intervention design, enabling efficient step-by-step optimization. Second, we introduce Preprocessing Uncertainty (PU), a per-trial diagnostic that captures a dimension of instability complementary to model-based confidence. Third, we study Normalized Adaptive PGI (NA-PGI), a graph-structured regularizer that exploits the compositional structure of preprocessing interventions as one mitigation strategy with clear scope conditions.

2605.07211 2026-05-11 cs.LG cs.AI

HARMONY: Bridging the Personalization-Generalization Gap by Mitigating Representation Skew in Heterogeneous Split Federated Learning

HARMONY: 通过减轻异构分裂联邦学习中的表示偏移来弥合个性化与泛化之间的鸿沟

Jiseok Youn, You Rim Choi, Goodsol Lee, Sangtae Ha, Hyung-Sin Kim, Saewoong Bahk

发表机构 * Department of ECE and INMC, Seoul National University, Seoul, South Korea(首尔国立大学电子与信息科学系及网络信息中心,首尔,韩国) Graduate School of Data Science, Seoul National University, Seoul, South Korea(首尔国立大学数据科学研究生院,首尔,韩国) Department of Computer Science, University of Colorado Boulder, CO, USA(科罗拉多大学博尔德分校计算机科学系,科罗拉多州,美国)

AI总结 HARMONY通过减轻异构分裂联邦学习中的表示偏移,实现个性化与泛化之间的平衡,提升OOD预测精度,同时保持低延迟。

Comments 7 pages (except references), 5 figures

详情
AI中文摘要

移动设备面临多样化的资源限制和非独立同分布的数据类别分布,需要快速在设备上进行本地分布(ID)类推断,并在需要时通过远程支持处理客户端特定的分布外(OOD)类推断。混合分裂联邦学习(Hybrid SFL)将个性化客户端侧前端(支持早期退出)与通用服务器侧后端结合,平衡准确性和成本。然而,在客户端架构异质性的情况下,现有混合SFL面临表示偏移问题,即定制提取器的特征无法在共享空间对齐,导致负责OOD预测的服务器模型性能急剧下降。我们提出了HARMONY,第一个支持异构客户端架构的混合SFL框架。HARMONY修改元学习以模拟不同参数和架构下的提取器,并学习个性化。为减轻表示偏移,HARMONY在服务器端进行对比学习以对齐提取特征,既不牺牲客户端的个性化也不共享原始标签。与多个数据集和模型家族的最新状态相比,HARMONY在无/有OOD情况下分别将测试精度提高高达43.0%/28.3%,同时保持可接受的延迟。

英文摘要

Mobile devices face diverse resource constraints and non-IID data class distributions, requiring fast on-device inference for local in-distribution (ID) classes and on-demand remote support for client-specific out-of-distribution (OOD) classes. Hybrid split federated learning (Hybrid SFL) couples personalized client-side front ends (supporting early exit) with a generalized server-side backend for fallback inference, balancing accuracy and cost. However, under client architectural heterogeneity, the existing hybrid SFL suffers from representation skew, where features from customized extractors fail to align in the shared space, leading to a sharp degradation in the server model responsible for OOD prediction. We propose HARMONY, the first hybrid SFL framework to support heterogeneous client architectures. HARMONY modifies meta-learning to simulate diverse extractors across parameters and architectures, and to learn to personalize. To mitigate representation skew, HARMONY conducts server-side contrastive learning to align extracted features, neither sacrificing clients' personalization nor sharing raw labels. Compared to the state of the art across multiple datasets and model families, HARMONY improves test accuracy by up to 43.0%/28.3% without/with OOD, respectively, while maintaining acceptable latency.

2605.07209 2026-05-11 cs.CL cs.AI cs.LG

Hallucination Detection via Activations of Open-Weight Proxy Analyzers

通过开放权重代理分析器的激活进行幻觉检测

Akshita Singh, Prabesh Paudel, Siddhartha Roy

发表机构 * Khoury College of Computer Sciences(科里学院计算机科学学院) Northeastern University(东北大学)

AI总结 本文提出一种代理分析器框架,通过本地部署的小型开放权重模型检测大语言模型的幻觉,采用变压器文本处理特性构建18种特征,并在多个模型架构上训练堆叠集成模型,最终在RAGTruth数据集上取得优于ReDeEP的AUC和F1分数。

Comments 12 pages, 4 figures. Code available at https://github.com/hallu-detect/llm_hallucination_detection

详情
AI中文摘要

我们介绍了一种代理分析器框架,用于检测大语言模型中的幻觉。与查看生成模型内部不同,我们的系统通过小型本地部署的开放权重模型读取已生成的文本,并利用读者自身的内部激活来检测幻觉。无论生成器是闭合API如GPT-4还是开放权重模型,该方法均有效。我们构建了18种基于Transformer处理文本的特征,涵盖残差流范数、每头源文档注意力、熵、MLP激活、logit-lens轨迹以及三种新的令牌级 grounding 统计。我们在五个幻觉数据集的72,135个样本上训练了一个堆叠集成模型。我们测试了七个分析架构,从0.5亿到9亿参数:Qwen2.5在0.5B和7B,Gemma-2在2B和9B,Pythia在1.4B,以及LLaMA-3在3B和8B。在所有七个模型中,我们一致在RAGTruth上优于ReDeEP的token-level AUC 0.73,相差7.4到10.3个百分点。Qwen2.5-7B达到F1 0.717,略高于ReDeEP的0.713,而Qwen2.5-0.5B达到0.706。最引人注目的是,所有七个模型在模型大小相差18倍的情况下,AUC跨度仅2.3个百分点。更令人惊讶的是,我们的3B LLaMA在RAGTruth上优于我们的8B LLaMA,显示在同一家族模型中,更大并不总是更好。RAGTruth和LLM-AggreFact均包含多个LLM家族的输出,因此我们的结果不偏向任何特定生成器。

英文摘要

We introduce a proxy-analyzer framework for detecting hallucinations in large language models. Instead of looking inside the generating model, our system reads already-generated text through a small locally hosted open-weight model and spots hallucinations using the reader's own internal activations. This works just as well when the generator is a closed API like GPT-4 as when it is any open-weight model. We built eighteen features grounded in how transformers process text, covering residual stream norms, per-head source-document attention, entropy, MLP activations, logit-lens trajectories, and three new token-level grounding statistics. We trained a stacking ensemble on 72,135 samples from five hallucination datasets. We tested across seven analyzer architectures from 0.5 billion to 9 billion parameters: Qwen2.5 at 0.5B and 7B, Gemma-2 at 2B and 9B, Pythia at 1.4B, and LLaMA-3 at both 3B and 8B. Across all seven, we consistently beat ReDeEP's token-level AUC of 0.73 on RAGTruth by 7.4 to 10.3 percentage points. Qwen2.5-7B reached an F1 of 0.717, just above ReDeEP's 0.713, while Qwen2.5-0.5B hit 0.706. The most striking finding is how tightly all seven models cluster: AUC spans only 2.3 percentage points across an eighteen-fold difference in model size. Even more surprising, our 3B LLaMA outperforms our 8B LLaMA on RAGTruth, showing that bigger is not always better even within the same model family. Both RAGTruth and LLM-AggreFact include outputs from multiple LLM families, so our results are not skewed toward any particular generator.

2605.07208 2026-05-11 cs.LG

FAME: Forecasting Academic Impact via Continuous-Time Manifold Evolution

FAME:通过连续时间流形演变进行学术影响预测

Jianrong Ding, Jianyuan Zhong, Zhengyan Shi, Qiang Xu

发表机构 * Department of Computer Science and Engineering(计算机科学与工程系)

AI总结 本文提出FAME框架,通过动态流形演变模型预测科研论文的学术影响,实验显示其在多维影响预测上优于现有LLM评估器。

详情
AI中文摘要

大型语言模型(LLMs)越来越多地用于构思和评估研究想法,但评估这些判断极具挑战性,因为新想法的真实影响可能需要数年才能显现。本文通过人类撰写的论文影响预测作为可验证的代理任务来解决这一挑战。在前瞻性预测研究中,我们发现前沿LLMs无法可靠地区分高影响论文与普通出版物,表明基于静态文本的判断不足以进行科学评估。为了解决这一限制,我们提出了FAME(通过连续时间流形演变预测学术影响),一种时空框架,用于建模科学主题的动态轨迹。FAME将论文投影到一个由文本特征和验证知识流图信息的动态潜在空间中,学习几何约束,使具有影响的论文与领域前进动量对齐。在3,200篇arXiv论文上进行的实验表明,FAME在前瞻性多维影响预测上始终优于最先进的LLM评估器。此外,将FAME的动态几何信号整合到LLMs中显著提高了其预测性能。这些结果支持论文影响预测作为有用的、可测量的代理基准,并将FAME定位为自动化科学评估的强大、轨迹意识的基础。

英文摘要

Large Language Models (LLMs) are increasingly used to brainstorm and evaluate research ideas, yet assessing such judgments is fundamentally difficult because the true impact of a new idea may take years to emerge. We address this challenge by using the impact forecasting of human-authored manuscripts as a verifiable proxy task. In a prospective forecasting study, we find that frontier LLMs fail to reliably distinguish high-impact papers from ordinary publications, suggesting that static text-based judging is insufficient for scientific evaluation. To address this limitation, we propose $\textbf{FAME}$ ($\underline{\text{F}}$orecasting $\underline{\text{A}}$cademic Impact via Continuous-Time $\underline{\text{M}}$anifold $\underline{\text{E}}$volution), a spatiotemporal framework for modeling the dynamic trajectories of scientific topics. FAME projects papers into a dynamic latent space informed by textual features and a verified knowledge-flow graph, learning geometric constraints that align impactful manuscripts with the forward momentum of their fields. Experiments on 3,200 arXiv papers across three fast-evolving subfields show that FAME consistently and substantially outperforms state-of-the-art LLM evaluators in prospective multidimensional impact forecasting. Furthermore, integrating FAME's dynamic geometric signals into LLMs significantly improves their forecasting performance. These results support manuscript impact forecasting as a useful, measurable proxy benchmark and position FAME as a strong, trajectory-aware foundation for automated scientific evaluation.

2605.07204 2026-05-11 cs.LG

Arrow: A Foundation Model for Causal Discovery

Arrow:一种因果发现的基础模型

Ryan Thompson, He Zhao, Daniel M. Steinberg, Edwin V. Bonilla

发表机构 * University of Technology Sydney(技术科技大学) CSIRO’s Data61(CSIRO数据61)

AI总结 Arrow模型通过将有向无环图分解为无向骨架和拓扑序,实现零样本因果发现。该模型基于Transformer架构预测边概率和节点顺序,训练于合成数据集,能在多种数据集上实现高效准确的因果发现。

详情
AI中文摘要

我们介绍了Arrow,一种用于观测表格数据零样本因果发现的基础模型。Arrow将有向无环图分解为无向骨架和拓扑序,通过构造保证无环性。给定新数据集时,该模型利用基于Transformer的架构在观测之间和跨观测中上下文化变量,然后预测骨架边概率和节点顺序得分,这些得分共同定义图。Arrow在具有真实图的合成数据集上以监督方式训练,使用由骨架-顺序分解诱导的端到端可微定向边复合似然。训练分布涵盖多样化的图家族、函数形式、噪声模型和数据集形状。在内分布和外分布合成、半合成和真实数据集上,Arrow在显著低于竞争替代方案的推理成本下匹配或超越现有因果发现方法。我们的结果表明,大规模预训练于多样化合成数据可以产生零样本因果发现模型,这些模型在新数据集上快速、准确且可重用。

英文摘要

We introduce Arrow, a foundation model for zero-shot causal discovery on observational tabular data. Arrow factorizes a directed acyclic graph into an undirected skeleton and a topological order, guaranteeing acyclicity by construction. Given a new dataset, it uses a transformer-based architecture to contextualize variables within and across observations, then predicts skeleton edge probabilities and node order scores that together define a graph. Arrow is trained in a supervised fashion on synthetic datasets with ground-truth graphs, using an end-to-end differentiable directed edge composite likelihood induced by the skeleton-order factorization. The training distribution spans diverse graph families, functional forms, noise models, and dataset shapes. Across in- and out-of-distribution synthetic, semi-synthetic, and real datasets, Arrow matches or outperforms existing causal discovery methods at substantially lower inference cost than competitive alternatives. Our results demonstrate that large-scale pretraining on diverse synthetic data can yield zero-shot causal discovery models that are fast, accurate, and reusable on new datasets.

2605.07201 2026-05-11 cs.CL cs.AI cs.LG

PSK@EEUCA 2026: Fine-Tuning Large Language Models with Synthetic Data Augmentation for Multi-Class Toxicity Detection in Gaming Chat

PSK@EEUCA 2026: 通过合成数据增强微调大语言模型用于游戏聊天中的多类毒性检测

Srikar Kashyap Pulipaka

发表机构 * Independent Researcher(独立研究员)

AI总结 本文提出通过合成数据增强微调大语言模型,用于游戏聊天中的多类毒性检测,实验结果显示在测试集上达到0.6234的F1-macro分数,发现验证性能高反而导致测试转移差的'验证陷阱'现象。

Comments Accepted to the EEUCA workshop at ACL 2026

详情
AI中文摘要

本文描述了我们为EEUCA 2026共享任务设计的系统,该任务涉及将《World of Tanks》聊天信息分类为六个毒性类别:非毒性、辱骂/喷战、其他冒犯、仇恨/骚扰、威胁和极端主义。我们探索了多种方法,包括基于编码器的模型、指令微调的LLM与LoRA微调、层次分类、一对一策略以及各种集成方法。我们的最佳系统结合了Llama 3.1 8B模型与精心校准的5%合成数据增强,实现了测试集上的F1-macro得分为0.6234,位列35支参赛队伍中的第4名。我们对数据集的标注模式及其对模型泛化的影响进行了深入分析,揭示了一个关键的'验证陷阱'现象,即高验证性能与差的测试迁移之间存在相关性。

英文摘要

This paper describes our system for the EEUCA 2026 Shared Task on Understanding Toxic Behavior in Gaming Communities. The task involves classifying World of Tanks chat messages into six toxicity categories: Non-toxic, Insults/Flaming, Other Offensive, Hate/Harassment, Threats, and Extremism. We explore multiple approaches including encoder-based models, instruction-tuned LLMs with LoRA fine-tuning, hierarchical classification, one-vs-rest strategies, and various ensemble methods. Our best system combines Llama 3.1 8B with carefully calibrated 5\% synthetic data augmentation, achieving an F1-macro score of 0.6234 on the test set, placing 4th out of 35 participating teams. We provide extensive analysis of the dataset's annotation patterns and their impact on model generalization, revealing a critical ''validation trap'' phenomenon where high validation performance correlates with poor test transfer.

2605.07199 2026-05-11 cs.AI cs.LG

Three-in-One World Model: Energy-Based Consistency, Prediction, and Counterfactual Inference for Marketing Intervention

三位一体世界模型:用于营销干预的能量一致性、预测和反事实推断

Junichiro Niimi

发表机构 * Meijo University(名古屋大学)

AI总结 本文提出三位一体世界模型,利用深度玻尔兹曼机学习消费者特征,通过轻量任务适配器实现一致性评估、预测和反事实推断,展示其在营销干预中的优势。

详情
AI中文摘要

营销决策反映了潜在消费者异质性、时间变化的内部状态和明确干预的相互作用,而当前预测和语言导向模型未能统一捕捉这一结构。本文提出三位一体世界模型架构,其中深度玻尔兹曼机(DBM)从人口统计学、时间及滞后动作和结果学习冻结信念表示,并附加轻量任务特定适配器。同一信念支持三个任务:(i)通过DBM的自由能进行能量一致性评估,(ii)通过适配器进行结果预测,(iii)通过固定信念并仅变化传给适配器的动作输入进行反事实推断。在受控模拟中,已知每个消费者的潜在价格敏感度、促销响应性和基础偏好,结果显示适配器在访问和购买AUC上优于基线MLP,且在处理受混淆的价格促销干预时表现更优。此外,自由能钳制系统性惩罚缺乏先前促销暴露的反事实购买轨迹,惩罚本身取决于预期方向的潜在基础偏好。这些结果表明,DBM信念能够解构潜在特征,以在反事实查询中保持一致,为营销干预提供集成的世界模型基础。

英文摘要

Marketing decisions reflect the interaction of latent consumer heterogeneity, time-varying internal states, and explicit interventions, a structure that current prediction- and language-oriented models do not capture in a unified manner. We propose a Three-in-One world-model architecture in which a Deep Boltzmann Machine (DBM) learns a frozen belief representation from demographics, time, and lagged actions and outcomes, with lightweight task-specific adapters attached on top. The same belief supports three tasks within a single framework: (i) energy-based consistency evaluation through the DBM's free energy, (ii) outcome prediction through adapters, and (iii) counterfactual inference by holding the belief fixed and varying only the action input given to the adapter. Using a controlled simulation in which the latent price sensitivity, promotion responsiveness, and base preference of each consumer are known, we show that the adapters match a strong MLP baseline on visit- and purchase-AUC while recovering heterogeneous treatment effects substantially better than S-, T-, X-, and DR-learner meta-learners and a Causal Forest baseline built on the same raw features, with the largest gap on a confounded price-promotion intervention. Complementing this, free-energy clamps systematically penalize counterfactual purchase trajectories that lack prior promotional exposure, and the penalty itself depends on the latent base preference in the expected direction. These results indicate that DBM beliefs disentangle latent traits in a form that survives counterfactual queries, providing an integrated world-model substrate for marketing intervention.

2605.07195 2026-05-11 cs.CV

See Tomorrow, Act Today: Foresight-Driven Autonomous Driving

预见未来,立即行动:基于预见的自动驾驶

Bozhou Zhang, Nan Song, Yuang Wang, Jiankang Deng, Xiatian Zhu, Li Zhang

发表机构 * School of Data Science, Fudan University(复旦大学数据科学学院) Shanghai Innovation Institute(上海创新研究院) Imperial College London(伦敦帝国理工学院) University of Surrey(萨里大学)

AI总结 本文提出ForeSight框架,通过生成未来场景并据此规划动作,实现前瞻性决策,实验表明其在动态场景中优于现有方法。

Comments CVPR Findings 2026

详情
AI中文摘要

当前端到端自动驾驶规划器本质上是反应式的:它们基于历史和当前观测预测未来动作。我们主张自动驾驶代理应在决定前想象未来场景,如同人类司机在行动前模拟"接下来会发生什么"。我们引入ForeSight,一种以世界模型为核心的规划框架,将自动驾驶重新定义为前瞻性决策。而非将世界模型视为辅助组件,ForeSight使未来场景想象成为动作预测的主要驱动因素。我们的方法分两阶段:(1)通过预训练世界模型生成合理的未来视觉世界;(2)基于这些想象的未来规划动作。这种从"现在该做什么"到"会发生什么,该如何响应"的转变,实现了真正的前瞻性而非反应性规划。通过基于预期上下文而非单独当前观测做出决策,ForeSight更有效地导航动态互动场景。在NAVSIM和nuScenes上的大量实验表明,显式的未来想象显著优于先前的最先进方法,验证了我们的预见驱动方法。

英文摘要

Current end-to-end autonomous driving planners are fundamentally reactive: they condition on historical and present observations to predict future actions. We argue that autonomous agents should instead imagine future scenes before deciding, just as human drivers mentally simulate ``what will happen next" before acting. We introduce ForeSight, a foundation world model centric planning framework that reframes autonomous driving as anticipatory decision-making. Rather than treating world models as auxiliary components, ForeSight makes future scene imagination the primary driver of action prediction. Our approach operates in two stages: (1) generating plausible future visual worlds via a pretrained world model, and (2) planning actions conditioned on these imagined futures. This paradigm shift from ``what should I do now?" to ``what will happen, and how should I respond?" enables genuinely anticipatory rather than reactive planning. By grounding decisions in anticipated contexts rather than present observations alone, ForeSight navigates dynamic, interactive scenarios more effectively. Extensive experiments on NAVSIM and nuScenes demonstrate that explicit future imagination significantly outperforms previous state-of-the-art alternatives, validating our foresight-driven approach.

2605.07194 2026-05-11 cs.CV cs.AI cs.LG

Closed-Form Linear-Probe Dataset Distillation for Pre-trained Vision Models

闭式线性探针数据集蒸馏用于预训练视觉模型

Bincheng Peng, Guang Li, Ping Liu, Takahiro Ogawa, Miki Haseyama

发表机构 * Hokkaido University(北海道大学) University of Nevada, Reno(内华达大学拉斯维加斯分校)

AI总结 本文提出CLP-DD方法,通过闭式解法和判别损失提升预训练视觉模型的数据集蒸馏效果,相比传统方法在计算效率和性能上均有显著提升。

详情
AI中文摘要

数据集蒸馏将大规模训练集压缩成小合成集,保留下游训练效用。现有方法多针对从头训练网络,而现代视觉迁移学习常使用冻结的预训练编码器后接轻量线性探针。现有蒸馏方法或通过轨迹基梯度匹配反向传播迭代线性探针更新,或依赖最初为从头训练设计的神经切线核(NTK)近似闭式公式。本文提出闭式线性探针数据集蒸馏(CLP-DD),通过样本空间核岭求解器计算合成集诱导的线性探针。合成图像通过评估此诱导分类器在真实特征上的温度缩放softmax交叉熵进行更新,其中分类器列作为特征空间中的学习类别锚点。进一步显示外目标选择至关重要:将闭式内求解器与标准MSE外损失显著劣于轨迹基方法,而判别外损失能缩小大部分差距。在ImageNet-100上,CLP-DD在无DSA情况下显著优于LGM,在有DSA时接近LGM。在ImageNet-1K上,CLP-DD在三个四预训练主干上匹配或超越LGM with DSA,运行速度快约14倍,GPU内存使用少于八分之一。

英文摘要

Dataset distillation compresses a large training set into a small synthetic set that preserves downstream training utility. While most existing methods target training networks from scratch, modern visual transfer learning often uses frozen pre-trained encoders followed by lightweight linear probing. Existing distillation methods for this setting either unroll iterative linear-probe updates with trajectory-based gradient matching, or rely on closed-form formulations originally designed for from-scratch training with neural-tangent-kernel (NTK) approximations. Neither route exploits the fact that frozen-feature linear probing admits a closed-form solution determined directly by the pre-trained features themselves, with no infinite-width approximation and no inner-loop trajectory. We propose Closed-Form Linear-Probe Dataset Distillation (CLP-DD), a bilevel formulation that computes the linear probe induced by the synthetic set with a sample-space kernel ridge solver. The synthetic images are then updated by evaluating this induced classifier on real features through a temperature-scaled softmax cross-entropy, where the classifier columns act as learned class anchors in feature space. We further show that the choice of outer objective is decisive: pairing the closed-form inner solver with a standard MSE outer loss substantially underperforms trajectory-based methods, while the discriminative outer loss closes most of the gap. On ImageNet-100 with four pre-trained backbones, CLP-DD substantially improves over LGM without DSA and approaches LGM with DSA at a fraction of the computational cost. On ImageNet-1K, CLP-DD matches or surpasses LGM with DSA on three of four backbones while running roughly $14\times$ faster and using less than one-eighth of the GPU memory.