arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 1719
2602.08857 2026-06-08 cs.LG cs.AI cs.CL 版本更新

Discovering Interpretable Algorithms by Decompiling Transformers to RASP

通过将Transformer反编译为RASP发现可解释算法

Xinting Huang, Aleksandra Bakalova, Satwik Bhattamishra, William Merrill, Michael Hahn

发表机构 * Saarland Informatics Campus, Saarland University(萨尔兰大学信息学院校区,萨尔兰大学) University of Oxford(牛津大学) Allen Institute for AI(人工智能研究所)

AI总结 提出一种将训练好的Transformer忠实重参数化为RASP程序,并通过因果干预发现小型充分子程序的方法,实验表明长度泛化的Transformer内部实现了简单可解释的RASP程序。

Comments 104 pages, 92 figures. Accepted for publication at ICML 2026

详情
AI中文摘要

近期研究表明,Transformer的计算可以在RASP编程语言家族中模拟。这些发现增进了对Transformer表达能力和泛化能力的理解。特别是,Transformer被建议在具有简单RASP程序的问题上精确实现长度泛化。然而,训练模型是否实际实现了简单的可解释程序仍是一个开放问题。在本文中,我们提出了一种从训练好的Transformer中提取此类程序的通用方法。其思想是将Transformer忠实地重参数化为RASP程序,然后应用因果干预来发现一个小的充分子程序。在算法和形式语言任务上训练的小型Transformer实验中,我们表明我们的方法通常能从长度泛化的Transformer中恢复简单且可解释的RASP程序。我们的结果提供了迄今为止最直接的证据,证明Transformer内部实现了简单的RASP程序。

英文摘要

Recent work has shown that the computations of Transformers can be simulated in the RASP family of programming languages. These findings have enabled improved understanding of the expressive capacity and generalization abilities of Transformers. In particular, Transformers have been suggested to length-generalize exactly on problems that have simple RASP programs. However, it remains open whether trained models actually implement simple interpretable programs. In this paper, we present a general method to extract such programs from trained Transformers. The idea is to faithfully re-parameterize a Transformer as a RASP program and then apply causal interventions to discover a small sufficient sub-program. In experiments on small Transformers trained on algorithmic and formal language tasks, we show that our method often recovers simple and interpretable RASP programs from length-generalizing transformers. Our results provide the most direct evidence so far that Transformers internally implement simple RASP programs.

2602.02600 2026-06-08 cs.LG cs.AI 版本更新

Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models

自回归与扩散语言模型中的逐步拒绝动态

Eliron Rahimi, Elad Hirshel, Rom Himelstein, Amit LeVi, Avi Mendelson, Chaim Baskin

发表机构 * Department of Computer Science, Technion – Israel Institute of Technology(技术学院计算机科学系,以色列技术学院) INSIGHT Lab, School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Israel(内斯坦实验室,贝内-加隆大学内加尔分校,以色列) Computer Science Department, University of Haifa, Haifa, Israel(海法大学计算机科学系,海法,以色列)

AI总结 研究扩散语言模型(DLM)与自回归(AR)模型在拒绝有害生成行为上的差异,发现扩散重掩码机制可促进恢复,提出逐步拒绝内部动态(SRI)信号,并基于此构建无需修改推理的越狱检测器。

Comments Preprint

详情
AI中文摘要

扩散语言模型(DLM)最近已成为自回归(AR)模型的有竞争力的替代方案,提供并行解码、竞争性生成质量以及越狱鲁棒性改善的初步证据。尽管取得了这些进展,但采样机制在塑造拒绝行为中的作用仍知之甚少。为填补这一空白,我们提出了一项关于逐步拒绝动态的全面研究。我们表明,扩散重掩码可以促进从有害中间生成中恢复,提供证据表明这种行为与采样机制相关,并证明从AR采样切换到扩散采样可提高越狱鲁棒性,包括在固定模型权重下。为了捕捉在文本层面不可观察的生成动态,我们提出了逐步拒绝内部动态(SRI)信号。与我们的文本层面发现一致,SRI表明恢复主要在AR采样下失败,这些失败在SRI空间中通常相对于无害生成表现为异常。基于这一观察,我们表明SRI能够实现一个简单的越狱检测器,该检测器无需修改推理,并且仅通过在良性SRI信号上训练即可泛化到未见攻击。我们的评估表明,该检测器匹配或超越现有越狱检测基线,同时增加可忽略的开销。

英文摘要

Diffusion language models (DLMs) have recently emerged as a competitive alternative to autoregressive (AR) models, offering parallel decoding, competitive generation quality, and initial evidence of improved jailbreak robustness. Despite this progress, the role of sampling mechanisms in shaping refusal behavior remains poorly understood. To address this gap, we present a comprehensive study of step-wise refusal dynamics. We show that diffusion remasking can promote recovery from harmful intermediate generations, provide evidence that this behavior is tied to the sampling mechanism, and demonstrate that switching from AR to diffusion sampling improves jailbreak robustness, including under fixed model weights. To capture generation dynamics not observable at the text level, we propose the Step-Wise Refusal Internal Dynamics (SRI) signal. Consistent with our text-level findings, SRI shows that recovery fails primarily under AR sampling, with these failures often appearing anomalous relative to harmless generations in the SRI space. Based on this observation, we show that SRI enables a simple jailbreak detector that does not modify inference and generalizes to unseen attacks by training only on benign SRI signals. Our evaluation shows that this detector matches or outperforms existing jailbreak detection baselines while adding negligible overhead.

2602.07025 2026-06-08 cs.CV cs.AI 版本更新

The Geometry of Representational Failures in Vision Language Models

视觉语言模型中表征失败的几何结构

Daniele Savietto, Declan Campbell, André Panisson, Marco Nurisso, Giovanni Petri, Jonathan D. Cohen, Alan Perotti

发表机构 * Dipartimento di Fisica, Università di Torino(都灵大学物理系) Princeton Neuroscience Institute and AI Lab, Princeton University(普林斯顿大学神经科学研究所和AI实验室) Intesa Sanpaolo AI Research(Intesa Sanpaolo AI研究中心) Dipartimento di Scienze Matematiche, Politecnico di Torino(都灵理工学院数学科学系) Network Science Institute, Northeastern University London, UK(伦敦大学东北方大学网络科学研究所)

AI总结 通过分析开源视觉语言模型的概念向量几何重叠,揭示多目标视觉任务中幻觉等错误与认知约束的关联,并提出基于干预的验证方法。

详情
AI中文摘要

视觉语言模型在多目标视觉任务中表现出令人困惑的失败,例如幻觉不存在的元素或未能识别干扰中最相似的物体。虽然这些错误反映了人类的认知约束,如“绑定问题”,但在人工系统中驱动这些错误的内部机制仍然知之甚少。在这里,我们通过分析开源视觉语言模型(Qwen、InternVL、Gemma)的表征几何结构,提出了一种机制性见解,比较了提炼“概念向量”(编码视觉概念的潜在方向)的方法。我们通过引导干预验证了概念向量,这些干预在简化和自然视觉任务中可靠地操纵模型行为(例如,强制模型将红色花朵感知为蓝色)。我们观察到这些向量之间的几何重叠与特定错误模式强相关,提供了一个有依据的定量框架来理解内部表征如何塑造模型行为并驱动视觉失败。

英文摘要

Vision-Language Models (VLMs) exhibit puzzling failures in multi-object visual tasks, such as hallucinating non-existent elements or failing to identify the most similar objects among distractions. While these errors mirror human cognitive constraints, such as the 'Binding Problem', the internal mechanisms driving them in artificial systems remain poorly understood. Here, we propose a mechanistic insight by analyzing the representational geometry of open-weight VLMs (Qwen, InternVL, Gemma), comparing methodologies to distill "concept vectors'' - latent directions encoding visual concepts. We validate our concept vectors via steering interventions that reliably manipulate model behavior in both simplified and naturalistic vision tasks (e.g., forcing the model to perceive a red flower as blue). We observe that the geometric overlap between these vectors strongly correlates with specific error patterns, offering a grounded quantitative framework to understand how internal representations shape model behavior and drive visual failures.

2602.01740 2026-06-08 cs.AI cs.CV cs.LG 版本更新

MACD: Model-Aware Contrastive Decoding via Counterfactual Data

MACD:基于反事实数据的模型感知对比解码

Qixin Xiao, Kun Zhou

发表机构 * University of Michigan, Ann Arbor, MI, USA(密歇根大学,安娜堡分校) University of California San Diego, La Jolla, CA, USA(加州大学圣地亚哥分校)

AI总结 提出MACD方法,利用视频语言模型自身反馈识别导致幻觉的目标区域,生成目标级反事实输入,结合对比解码减少幻觉,提升多模型在复杂场景下的准确性。

详情
AI中文摘要

视频语言模型(Video-LLMs)容易产生幻觉,当视觉证据薄弱、模糊或存在偏差时,会生成看似合理但无根据的内容。现有方法如对比解码(CD)依赖随机扰动构建对比数据以缓解幻觉,但往往未能针对驱动幻觉的视觉线索或模型弱点。我们提出基于模型感知反事实数据的对比解码(MACD),这是一种结合模型引导的反事实构建与对比解码的推理策略。MACD利用Video-LLM自身的反馈来识别最可能导致幻觉的目标区域,生成有针对性的目标级反事实输入,而非任意的帧或时间修改。这些反事实输入被整合到CD中,以在解码过程中强制进行基于证据的令牌选择。在EventHallusion、MVBench、Perception-test和Video-MME上的实验表明,MACD在包括Qwen和InternVL在内的多种Video-LLM上持续减少幻觉,同时保持或提高任务准确性,在涉及小目标、遮挡目标或共现目标的场景中尤其表现出显著优势。

英文摘要

Video language models (Video-LLMs) are prone to hallucinations, generating plausible but ungrounded content when visual evidence is weak, ambiguous, or biased. Existing methods, such as contrastive decoding (CD), rely on random perturbations to construct contrastive data for hallucination mitigation, but often fail to target the visual cues that drive hallucination or align with model weaknesses. We propose Model-Aware Counterfactual Data based Contrastive Decoding (MACD), an inference strategy that combines model-guided counterfactual construction with contrastive decoding. MACD uses the Video-LLM's own feedback to identify object regions most responsible for hallucination, generating targeted object-level counterfactual inputs rather than arbitrary frame or temporal modifications. These counterfactual inputs are integrated into CD to enforce evidence-grounded token selection during decoding. Experiments on EventHallusion, MVBench, Perception-test, and Video-MME show that MACD consistently reduces hallucination while maintaining or improving task accuracy across diverse Video-LLMs, including Qwen and InternVL, with especially strong gains in scenarios involving small, occluded, or co-occurring objects.

2602.06941 2026-06-08 cs.LG cs.AI cs.CL 版本更新

Endogenous Resistance to Activation Steering in Language Models

语言模型中激活引导的内生抵抗

Alex McKenzie, Keenan Pepper, Stijn Servaes, Martin Leitgab, Murat Cubuktepe, Mike Vaiana, Diogo de Lucena, Judd Rosenblatt, Michael S. A. Graziano

发表机构 * University of Washington(华盛顿大学)

AI总结 研究发现大型语言模型在任务不匹配的激活引导下能内生抵抗,通过显式重启恢复正确生成,并识别出相关稀疏自编码器潜在变量,可增强或削弱该抵抗。

详情
AI中文摘要

大型语言模型可以在生成过程中从任务不匹配的激活引导中恢复,产生显式的语言重启(例如,“等等,那不对”),并在引导扰动仍然活跃的情况下继续讨论主题。我们将此称为内生引导抵抗(ESR)。使用稀疏自编码器(SAE)潜在变量来引导模型激活,我们发现Llama-3.3-70B在\llamaseventyEsrRate\\%的情况下表现出显式ESR,而来自Llama-3和Gemma-2系列的较小模型则较少出现显式形式。两个对照实验将ESR分解为检测事件和持续抵抗组件,后者不能仅由最近的on-topic token条件化来完全解释。我们通过对比on-topic/off-topic搜索识别出\numOtdLatents{}个SAE潜在变量;将其零消融使多次尝试率降低\multiAttemptReductionPct\\%,随机潜在变量和保留提示对照支持特异性。ESR还可以通过元提示和基于合成自我纠正示例的微调来有意增强。ESR对安全性具有双重影响:它可能使模型对对抗性激活空间操纵更具抵抗力,但同样可能干扰有益的基于引导的干预,因为模型无法区分两者。代码可在\href{https://github.com/agencyenterprise/endogenous-steering-resistance}{github.com/agencyenterprise/endogenous-steering-resistance}获取。

英文摘要

Large language models can recover mid-generation from task-misaligned activation steering, producing explicit verbal restarts (e.g., ``wait, that's not right'') and continuing on-topic even while the steering perturbation remains active. We term this Endogenous Steering Resistance (ESR). Using sparse autoencoder (SAE) latents to steer model activations, we find that Llama-3.3-70B exhibits explicit ESR at \llamaseventyEsrRate\%, with smaller models from the Llama-3 and Gemma-2 families showing the explicit form less frequently. Two controls dissociate ESR into a detection event and a sustained-resistance component that conditioning on recent on-topic tokens does not fully explain. We identify \numOtdLatents{} SAE latents through contrastive on-topic/off-topic search; zero-ablating them reduces the multi-attempt rate by \multiAttemptReductionPct\%, with random-latent and held-out-prompt controls supporting specificity. ESR can also be deliberately enhanced through both meta-prompting and fine-tuning on synthetic self-correction examples. ESR has dual implications for safety: it could harden models against adversarial activation-space manipulation, but may equally interfere with beneficial steering-based interventions, since the model has no way to distinguish the two. Code is available at \href{https://github.com/agencyenterprise/endogenous-steering-resistance}{github.com/agencyenterprise/endogenous-steering-resistance}.

2512.17058 2026-06-08 cs.LG 版本更新

Universal consistency of the $k$-NN rule in metric spaces and Nagata dimension. III

度量空间和Nagata维数中$k$-NN规则的普适一致性. III

Vladimir G. Pestov

发表机构 * Department of Mathematics and Statistics, University of Ottawa(数学与统计学系,渥太华大学) Departamento de Matemática, Universidade Federal de Santa Catarina(数学系,圣卡塔琳娜联邦大学)

AI总结 本文证明了在完备可分度量空间中,$k$-最近邻分类器普适一致的充要条件是空间具有强Lebesgue-Besicovitch微分性质或Nagata的$\sigma$-有限维数,填补了最后缺失的环节。

Comments 22 pages, latex with ESAIM P&S macros, a second revision requested by the referee, with more accurate and detailed proofs, in particular, the referee pointed out the correct value of the Nagata dimension of R^2 which is 4

详情
AI中文摘要

我们建立了最后缺失的环节,使得能够用维数理论的组合术语和实分析的基本性质来描述那些完备可分度量空间$X$,其中$k$最近邻分类器是普适一致的。以下条件等价:(1) $k$-最近邻分类器在$X$中普适一致,(2) 强Lebesgue--Besicovitch微分性质在$X$中对每个局部有限Borel测度成立,(3) $X$在Jun-Iti Nagata意义下是$\sigma$-有限维的。等价关系(2)$\iff$(3)由Preiss (1983)宣布,而(3)$\Rightarrow$(2)的详细证明仅出现在Assouad和Quentin de Gromard (2006)中。(2)$\Rightarrow$(1)由Cérou和Guyader (2006)建立。我们证明了(1)$\Rightarrow$(3)。我们进一步表明,弱(而非强)Lebesgue--Besicovitch性质对于$k$-NN规则的一致性是不充分的,例如Heisenberg群就是一个反例(这里我们纠正了之前文章(Kumari and Pestov 2024)中的一个错误说法)。有点反直觉的是,存在一个与通常距离一致等价的实数直线上的度量,在该度量下$k$-NN分类器失效。最后,另一个可以添加到上述条件的等价条件是Cover--Hart性质:(4) $1$-最近邻分类器的误差渐近地至多是Bayes误差的两倍。

英文摘要

We establish the last missing link allowing to describe those complete separable metric spaces $X$ in which the $k$ nearest neighbour classifier is universally consistent, both in combinatorial terms of dimension theory and via a fundamental property of real analysis. The following are equivalent: (1) The $k$-nearest neighbour classifier is universally consistent in $X$, (2) The strong Lebesgue--Besicovitch differentiation property holds in $X$ for every locally finite Borel measure, (3) $X$ is sigma-finite dimensional in the sense of Jun-Iti Nagata. The equivalence (2)$\iff$(3) was announced by Preiss (1983), while a detailed proof of the implication (3)$\Rightarrow$(2) has only appeared in Assouad and Quentin de Gromard (2006). The implication (2)$\Rightarrow$(1) was established by Cérou and Guyader (2006). We prove the implication (1)$\Rightarrow$(3). We further show that the weak (instead of strong) Lebesgue--Besicovitch property is insufficient for the consistency of the $k$-NN rule, as witnessed, for example, by the Heisenberg group (here we correct a wrong claim made in the previous article (Kumari and Pestov 2024)). A bit counter-intuitively, there is a metric on the real line uniformly equivalent to the usual distance but under which the $k$-NN classifier fails. Finally, another equivalent condition that can be added to the above is the Cover--Hart property: (4) the error of the $1$-nearest neighbour classifier is asymptotically at most twice as bad as the Bayes error.

2602.05833 2026-06-08 cs.LG 版本更新

SecretFan: Synthesizing Realistic Data without Breaking Privacy

SecretFan: 在不破坏隐私的情况下合成真实数据

Laura Plein, Alexi Turcotte, Arina Hallemans, Andreas Zeller

发表机构 * CISPA Helmholtz Center for Information Security(CISPA赫尔姆霍尔茨信息安全部) Saarland University(萨尔兰州大学)

AI总结 提出将合成数据生成视为引导测试生成问题,结合生成对抗网络(GAN)的判别器和模糊测试生成器,在保护隐私的同时生成高可用性合成数据。

详情
AI中文摘要

需要合成训练和测试数据集,这些数据集能够复制原始数据集的统计分布,同时不损害其机密性。已有大量研究利用生成对抗网络(GAN)进行合成数据生成,但生成的模型要么不够准确,要么由于原始数据在训练过程中被利用,仍然容易受到成员推断攻击(MIA)或数据集重建攻击。在本文中,我们将合成数据生成视为引导测试生成或基于搜索的测试问题,而不是纯粹的生成建模任务。我们提出了一种基于搜索的、充分性引导的输入生成技术,灵感来自GAN,包括生成步骤和判别步骤;与GAN一样,判别使用在数据上训练的判别器模型,但生成部分我们不使用模型,而是使用模糊测试器。这样,原始(私有)数据仅在生成过程中间接利用,通过演化样本并用判别器确定“好样本”,我们可以生成遵循与原始数据集相同统计分布的隐私保护数据,从而获得与原始数据相似的效用。我们在八个用于评估最先进技术的数据集上评估了我们的方法,发现我们的技术生成的合成数据平均具有良好效用,同时具有较高的相似性得分,突显了结合经典生成和模型驱动判别的混合方法在生成隐私保护且有用的合成数据集方面的潜力。

英文摘要

There is a need for synthetic training and test datasets that replicate statistical distributions of original datasets without compromising their confidentiality. A lot of research has been done in leveraging Generative Adversarial Networks (GANs) for synthetic data generation, however the resulting models are either not accurate enough or are still vulnerable to membership inference attacks (MIA) or dataset reconstruction attacks since the original data has been leveraged in the training process. In this paper, we frame synthetic data generation as a guided test generation, or search-based testing problem rather than a purely generative modeling task. Ours is a search-based, adequacy-guided input generation technique inspired by GANs, with a generation step and a discrimination step; as in GAN, discrimination uses a discriminator model trained on the date, but instead of using models also for generation, we use a fuzzer. This way, the original (private) data is only indirectly leveraged in the generation process, and by evolving samples and determining "good samples" with the discriminator, we can generate privacy-preserving data that follows the same statistical distributions as the original dataset, leading to a similar utility as the original data. We evaluated our approach on eight datasets that have been used to evaluate the state-of-the-art techniques, finding that synthetic generated with our technique achieves good utility on average while also having good similarity scores, highlighting the potential of a mixed approach leveraging classical generation and model-driven discrimination for generating privacy-preserving, useful synthetic datasets.

2602.03160 2026-06-08 cs.AI cs.CL 版本更新

VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models

VALUEFLOW:迈向大语言模型中多元化和可引导的基于价值的对齐

Woojin Kim, Sieun Hyeon, Jusang Oh, Jaeyoung Do

发表机构 * Department of Electrical and Computer Engineering, Seoul National University(首尔国立大学电气与计算机工程系) Interdisciplinary Program in Artificial Intelligence, Seoul National University(首尔国立大学人工智能交叉学科项目)

AI总结 提出VALUEFLOW框架,通过分层价值嵌入、强度标注数据库和锚定评估器,实现大语言模型在价值强度上的可控对齐,解决现有方法在提取、评估和引导方面的不足。

Comments Accepted in ICML 2026 (Oral). Code available at https://github.com/AIDASLab/VALUEFLOW

详情
AI中文摘要

将大语言模型(LLMs)与人类价值的多元光谱对齐仍然是一个核心挑战:基于偏好的方法通常无法捕捉更深层次的动机原则。基于价值的方法提供了更原则性的路径,但仍存在三个差距:提取常常忽略层次结构,评估检测存在但未校准强度,并且LLMs在受控强度下的可引导性仍未得到充分理解。为解决这些限制,我们引入了VALUEFLOW,这是第一个统一框架,涵盖提取、评估和引导,并具有校准的强度控制。该框架整合了三个组件:(i) HIVES,一个层次化价值嵌入空间,捕捉理论和跨理论的价值结构;(ii) 价值强度数据库(VIDB),一个大规模资源,包含基于排序聚合得出的强度估计的价值标注文本;(iii) 一个基于锚点的评估器,通过将模型输出与VIDB面板进行排序,产生一致的强度分数。使用VALUEFLOW,我们在十个模型和四个价值理论上进行了全面的大规模研究,识别了可引导性的不对称性和多价值控制的组合规律。本文建立了一个可扩展的基础设施,用于评估和控制价值强度,推进了LLMs的多元化对齐。

英文摘要

Aligning Large Language Models (LLMs) with the diverse spectrum of human values remains a central challenge: preference-based methods often fail to capture deeper motivational principles. Value-based approaches offer a more principled path, yet three gaps persist: extraction often ignores hierarchical structure, evaluation detects presence but not calibrated intensity, and the steerability of LLMs at controlled intensities remains insufficiently understood. To address these limitations, we introduce VALUEFLOW, the first unified framework that spans extraction, evaluation, and steering with calibrated intensity control. The framework integrates three components: (i) HIVES, a hierarchical value embedding space that captures intra- and cross-theory value structure; (ii) the Value Intensity DataBase (VIDB), a large-scale resource of value-labeled texts with intensity estimates derived from ranking-based aggregation; and (iii) an anchor-based evaluator that produces consistent intensity scores for model outputs by ranking them against VIDB panels. Using VALUEFLOW, we conduct a comprehensive large-scale study across ten models and four value theories, identifying asymmetries in steerability and composition laws for multi-value control. This paper establishes a scalable infrastructure for evaluating and controlling value intensity, advancing pluralistic alignment of LLMs.

2602.02014 2026-06-08 cs.CV cs.AI cs.CL cs.LG 版本更新

Rethinking Genomic Modeling Through Optical Character Recognition

通过光学字符识别重新思考基因组建模

Hongxin Xiang, Pengsen Ma, Yunkang Cao, Di Yu, Haowen Chen, Xinyu Yang, Xiangxiang Zeng

发表机构 * National University of Singapore(新加坡国立大学) University of Science and Technology of China(中国科学技术大学)

AI总结 提出OpticalDNA框架,将DNA渲染为视觉布局,利用视觉语言模型进行OCR式基因组理解,实现高保真压缩和长序列高效处理,在450k碱基序列上以近20倍更少有效token超越基线模型。

Comments Accepted by ICML 2026

详情
AI中文摘要

最近的基因组基础模型大多采用大型语言模型架构,将DNA视为一维token序列。然而,穷举式顺序阅读在结构上与稀疏且不连续的基因组语义不匹配,导致在低信息背景上的计算浪费,并阻碍了面向长上下文的压缩理解。在此,我们提出OpticalDNA,一个基于视觉的框架,将基因组建模重新定义为光学字符识别(OCR)风格的文档理解。OpticalDNA将DNA渲染为结构化视觉布局,并训练一个具备OCR能力的视觉语言模型,该模型包含视觉DNA编码器和文档解码器,其中编码器生成紧凑、可重建的视觉token以实现高保真压缩。基于这种表示,OpticalDNA定义了基于提示条件的核心基因组原语目标——读取、区域定位、子序列检索和掩码跨度补全——从而学习到布局感知的DNA表示,在减少的有效token预算下保留细粒度的基因组信息。在多种基因组基准测试中,OpticalDNA持续优于最近的基线模型;在长达450k碱基的序列上,它以近20倍更少的有效token实现了最佳整体性能,并且仅调整256k可训练参数就超越了激活参数多达985倍的模型。

英文摘要

Recent genomic foundation models largely adopt large language model architectures that treat DNA as a one-dimensional token sequence. However, exhaustive sequential reading is structurally misaligned with sparse and discontinuous genomic semantics, leading to wasted computation on low-information background and preventing understanding-driven compression for long contexts. Here, we present OpticalDNA, a vision-based framework that reframes genomic modeling as Optical Character Recognition (OCR)-style document understanding. OpticalDNA renders DNA into structured visual layouts and trains an OCR-capable vision--language model with a visual DNA encoder and a document decoder, where the encoder produces compact, reconstructible visual tokens for high-fidelity compression. Building on this representation, OpticalDNA defines prompt-conditioned objectives over core genomic primitives-reading, region grounding, subsequence retrieval, and masked span completion-thereby learning layout-aware DNA representations that retain fine-grained genomic information under a reduced effective token budget. Across diverse genomic benchmarks, OpticalDNA consistently outperforms recent baselines; on sequences up to 450k bases, it achieves the best overall performance with nearly 20$\times$ fewer effective tokens, and surpasses models with up to 985$\times$ more activated parameters while tuning only 256k trainable parameters.

2602.00541 2026-06-08 cs.LG 版本更新

One Loss to Rule Them All: Marked Time-to-Event for Structured EHR Foundation Models

一个损失统治一切:结构化EHR基础模型的标记时间到事件

Zilin Jing, Vincent Jeanselme, Yuta Kobayashi, Simon A. Lee, Chao Pang, Aparajita Kashyap, Yanwei Li, Xinzhuo Jiang, Shalmali Joshi

发表机构 * Department of Computer Science, Columbia University(哥伦比亚大学计算机科学系) Department of Biomedical Informatics, Columbia University(哥伦比亚大学生物医学信息学系) Department of Computational Medicine, UCLA(洛杉矶大学计算医学系) Formation Bio

AI总结 提出ORA预训练目标,联合建模事件时间和关联测量,相比下一词预测和忽略连续测量的损失,在多个数据集和下游任务上产生更通用的表示,提升回归和时间到事件预测能力。

详情
AI中文摘要

电子健康记录(EHR)中捕获的临床事件是不规则采样的,可能由离散事件和数值测量(如实验室值或治疗剂量)混合组成。EHR的序列性质类似于自然语言,这促使使用下一词预测来训练事件上的EHR基础模型(FM)。然而,这种训练未能捕获EHR的完整结构。必须捕获给定事件发生的时间,但事件值(异常实验室)也会调节其他临床事件的可能性。大多数现有的EHR FM不联合建模这种可能性,无法捕获完整的观察过程,影响下游能力。我们提出ORA,一种标记时间到事件预训练目标,联合建模事件时间和相关测量。在多个数据集、下游任务和模型骨干上,该目标始终比下一词预测和忽略连续测量的预训练损失产生更可泛化的表示。重要的是,所提出的目标在传统分类评估之外带来改进,包括更好的回归和时间到事件预测。除了引入新的FM家族,我们的消融研究提出了更广泛的结论:考虑EHR结构的预训练目标对于扩展下游能力和泛化性至关重要。

英文摘要

Clinical events captured in Electronic Health Records (EHR) are irregularly sampled and may consist of a mixture of discrete events and numerical measurements, such as laboratory values or treatment dosages. The sequential nature of EHR, analogous to natural language, has motivated the use of next-token prediction to train prior EHR Foundation Models (FMs) over events. However, this training fails to capture the full structure of EHR. When a given event occurs must be captured, but the event value (abnormal lab) also modulates the likelihood of other clinical events. Most existing EHR FMs do not jointly model this likelihood and are unable to capture the full observation process, impacting downstream capabilities. We propose ORA, a marked time-to-event pretraining objective that jointly models event timing and associated measurements. Across multiple datasets, downstream tasks, and model backbones, this objective consistently yields more generalizable representations than next-token prediction and pretraining losses that ignore continuous measurements. Importantly, the proposed objective yields improvements beyond traditional classification evaluation, including better regression and time-to-event prediction. Beyond introducing a new family of FMs, our ablations suggest a broader takeaway: pretraining objectives that account for EHR structure are critical for expanding downstream capabilities and generalizability.

2602.00471 2026-06-08 cs.AI cs.CV 版本更新

Dual Latent Memory for Visual Multi-agent System

面向视觉多智能体系统的双潜在记忆

Xinlei Yu, Chengming Xu, Zhangquan Chen, Bo Yin, Cheng Yang, Yongbo He, Yihao Hu, Jiangning Zhang, Cheng Tan, Xiaobin Hu, Shuicheng Yan

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 提出L²-VMAS框架,通过双潜在记忆解耦感知与思考,并采用熵驱动主动触发机制,打破视觉多智能体系统的“扩展墙”,在提升准确率的同时大幅降低令牌消耗。

详情
AI中文摘要

尽管视觉多智能体系统(VMAS)有望通过智能体间协作增强综合能力,但经验证据揭示了一个反直觉的“扩展墙”:增加智能体轮次往往会降低性能,同时指数级增加令牌成本。我们将这一失败归因于以文本为中心的通信中固有的信息瓶颈,其中将感知和思维轨迹转换为离散自然语言不可避免地导致语义损失。为此,我们提出了\textbf{L}$\mathbf{^{2}}$\textbf{-VMAS},一种新颖的模型无关框架,通过双潜在记忆实现智能体间协作。此外,我们解耦了感知与思考,同时动态合成双潜在记忆。另外,我们引入了熵驱动的主动触发,用高效的按需内存访问取代被动信息传输。在骨干网络、规模和多智能体结构上的大量实验表明,我们的方法有效打破了“扩展墙”,具有卓越的可扩展性,平均准确率提高2.7-5.4%,同时令牌使用量减少21.3-44.8%。

英文摘要

While Visual Multi-Agent Systems (VMAS) promise to enhance comprehensive abilities through inter-agent collaboration, empirical evidence reveals a counter-intuitive "scaling wall": increasing agent turns often degrades performance while exponentially inflating token costs. We attribute this failure to the information bottleneck inherent in text-centric communication, where converting perceptual and thinking trajectories into discrete natural language inevitably induces semantic loss. To this end, we propose \textbf{L}$\mathbf{^{2}}$\textbf{-VMAS}, a novel model-agnostic framework that enables inter-agent collaboration with dual latent memories. Furthermore, we decouple the perception and thinking while dynamically synthesizing dual latent memories. Additionally, we introduce an entropy-driven proactive triggering that replaces passive information transmission with efficient, on-demand memory access. Extensive experiments among backbones, sizes, and multi-agent structures demonstrate that our method effectively breaks the "scaling wall" with superb scalability, improving average accuracy by 2.7-5.4% while reducing token usage by 21.3-44.8%.

2602.00163 2026-06-08 cs.CV q-bio.NC 版本更新

Deep Learning Pose Estimation for Multi-Label Recognition of Combined Hyperkinetic Movement Disorders

基于深度学习姿态估计的联合多动性运动障碍多标签识别

Laura Cif, Diane Demailly, Gabriella A. Horvàth, Juan Dario Ortigoza Escobar, Nathalie Dorison, Mayté Castro Jiménez, Cécile A. Hubsch, Thomas Wirth, Gun-Marie Hariz, Sophie Huby, Morgan Dornadic, Zohra Souei, Muhammad Mushhood Ur Rehman, Simone Hemm, Mehdi Boulayme, Eduardo M. Moraud, Jocelyne Bloch, Xavier Vasques

发表机构 * Lausanne University Hospital (CHUV) and University of Lausanne (UNIL)(日内瓦大学医院(CHUV)和日内瓦大学) Institut du Neurone(神经研究所) Department of Neurology, Clinique Beau Soleil, Institut Mutualiste Montpelliérain(神经科,贝索尔诊所,蒙彼利埃互益研究所) Department of Pediatrics, British Columbia Children’s Hospital(儿科,不列颠哥伦比亚儿童医院) Movement Disorders Unit, Pediatric Neurology Department, Institut de Recerca, Hospital Sant Joan de Déu(运动障碍科,儿童神经科,研究所,圣约翰德杜医院) European Reference Network for Rare Neurological Diseases (ERN-RND)(罕见神经系统疾病欧洲参考网络(ERN-RND)) U-703 Centre for Biomedical Research on Rare Diseases (CIBER-ER), Instituto de Salud Carlos III(罕见疾病生物医学研究中心(CIBER-ER),卡洛斯三世健康研究所) Pediatric Neurosurgery Department, CCMR Neurogenetique, European Reference Network Brainteam Member, Rothschild Foundation Hospital(小儿神经外科部门,CCMR神经遗传学,欧洲参考网络Brainteam成员,罗切什基金会医院) Department of Neurology, University Hospital of Strasbourg(神经科,斯特拉斯堡大学医院) Strasbourg Neuroscience Institute, Strasbourg University(斯特拉斯堡神经科学研究所,斯特拉斯堡大学) Institute of Genetics and Cellular biology(遗传学和细胞生物学研究所)

AI总结 针对多动性运动障碍(HMD)临床识别主观性强、表型重叠的问题,提出基于姿态的机器学习框架,从常规临床视频提取关键点时间序列并计算多维度运动学特征,实现多标签分类。

详情
AI中文摘要

多动性运动障碍(HMD),如肌张力障碍、震颤、舞蹈症、肌阵挛和抽动症,是儿童和成人中致残的运动表现。其波动性、间歇性和频繁共存的表达阻碍了临床识别和纵向监测,这些在很大程度上仍然是主观的且易受评估者间变异影响。目前仍缺乏客观且可扩展的方法来从常规临床视频中区分重叠的HMD表型。在此,我们开发了一个基于姿态的机器学习框架,将常规门诊视频转化为解剖学上有意义的关键点时间序列,并计算涵盖统计、时间、频谱以及高阶不规则性-复杂性特征的运动学描述符。

英文摘要

Hyperkinetic movement disorders (HMDs) such as dystonia, tremor, chorea, myoclonus, and tics are disabling motor manifestations across childhood and adulthood. Their fluctuating, intermittent, and frequently co-occurring expressions hinder clinical recognition and longitudinal monitoring, which remain largely subjective and vulnerable to inter-rater variability. Objective and scalable methods to distinguish overlapping HMD phenotypes from routine clinical videos are still lacking. Here, we developed a pose-based machine-learning framework that converts standard outpatient videos into anatomically meaningful keypoint time series and computes kinematic descriptors spanning statistical, temporal, spectral, and higher-order irregularity-complexity features.

2601.23207 2026-06-08 cs.LG cs.AI 版本更新

Learning to Execute Graph Algorithms Exactly with Graph Neural Networks

学习用图神经网络精确执行图算法

Muhammad Fetrat Qharabagh, Artur Back de Luca, George Giapitzakis, Kimon Fountoulakis

发表机构 * University of Waterloo(多伦多大学)

AI总结 证明在有限度和有限精度约束下,图神经网络能通过训练多层感知机集成学习局部指令,从而在推理时无误差执行完整图算法,并展示了在分布式计算LOCAL模型及多种经典算法上的可学习性。

详情
AI中文摘要

理解图神经网络能学习什么,特别是它们学习执行算法的能力,仍然是一个核心的理论挑战。在这项工作中,我们证明了在有限度和有限精度约束下图算法的精确可学习性结果。我们的方法遵循两步过程。首先,我们训练一个多层感知机(MLP)集成来执行单个节点的局部指令。其次,在推理过程中,我们使用训练好的MLP集成作为图神经网络(GNN)中的更新函数。利用神经正切核(NTK)理论,我们表明局部指令可以从一个小训练集中学习,从而使得完整的图算法在推理过程中能够以高概率无误差地执行。为了说明我们设置的学习能力,我们为分布式计算的LOCAL模型建立了一个严格的可学习性结果。我们进一步展示了广泛研究的算法(如消息洪泛、广度优先搜索、深度优先搜索和贝尔曼-福特算法)的积极可学习性结果。

英文摘要

Understanding what graph neural networks can learn, especially their ability to learn to execute algorithms, remains a central theoretical challenge. In this work, we prove exact learnability results for graph algorithms under bounded-degree and finite-precision constraints. Our approach follows a two-step process. First, we train an ensemble of multi-layer perceptrons (MLPs) to execute the local instructions of a single node. Second, during inference, we use the trained MLP ensemble as the update function within a graph neural network (GNN). Leveraging Neural Tangent Kernel (NTK) theory, we show that local instructions can be learned from a small training set, enabling the complete graph algorithm to be executed during inference without error and with high probability. To illustrate the learning power of our setting, we establish a rigorous learnability result for the LOCAL model of distributed computation. We further demonstrate positive learnability results for widely studied algorithms such as message flooding, breadth-first and depth-first search, and Bellman-Ford.

2601.23204 2026-06-08 cs.AI 版本更新

TSAQA: Time Series Analysis Question And Answering Benchmark

TSAQA:时间序列分析问答基准

Baoyu Jing, Sanhorn Chen, Lecheng Zheng, Boyu Liu, Zihao Li, Jiaru Zou, Tianxin Wei, Zhining Liu, Zhichen Zeng, Ruizhong Qiu, Xiao Lin, Yuchen Yan, Dongqi Fu, Jingchao Ni, Jingrui He, Hanghang Tong

发表机构 * University of Illinois at Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校) Virginia Polytechnic Institute and State University(弗吉尼亚理工学院和州立大学) Amazon(亚马逊) Meta AI University of Houston(休斯顿大学)

AI总结 提出TSAQA基准,涵盖6种时间序列分析任务(含新型PZ格式),评估LLM在13领域21万样本上的表现,最佳模型仅65.08分。

Comments Comments: 35 pages, 7 figures. Accepted to the GEM Workshop at ACL 2026

详情
AI中文摘要

时间序列数据在金融、医疗、交通和环境科学等关键应用中不可或缺。虽然近期工作开始探索多任务时间序列问答(QA),但现有基准仍局限于预测和异常检测任务。我们引入了TSAQA,这是一个新颖的统一基准,旨在拓宽任务覆盖范围并评估多样化的时间分析能力。TSAQA在单一框架下整合了六种不同任务,从常规分析(包括异常检测和分类)到高级分析(如特征描述、比较、数据转换和时间关系分析)。该数据集涵盖13个领域的21万个样本,采用多种格式,包括真/假(TF)、多项选择(MC)和一种新颖的谜题(PZ),以全面评估时间序列分析。零样本评估表明,这些任务对当前大型语言模型(LLM)具有挑战性:表现最好的商业LLM Gemini-2.5-Flash的平均得分仅为65.08。尽管指令调优提升了开源模型的性能:表现最好的开源模型LLaMA-3.1-8B仍有显著改进空间,凸显了LLM进行时间分析的复杂性。

英文摘要

Time series data are integral to critical applications across domains such as finance, healthcare, transportation, and environmental science. While recent work has begun to explore multi-task time series question answering (QA), current benchmarks remain limited to forecasting and anomaly detection tasks. We introduce TSAQA, a novel unified benchmark designed to broaden task coverage and evaluate diverse temporal analysis capabilities. TSAQA integrates six diverse tasks under a single framework ranging from conventional analysis, including anomaly detection and classification, to advanced analysis, such as characterization, comparison, data transformation, and temporal relationship analysis. Spanning 210k samples across 13 domains, the dataset employs diverse formats, including true-or-false (TF), multiple-choice (MC), and a novel puzzling (PZ), to comprehensively assess time series analysis. Zero-shot evaluation demonstrates that these tasks are challenging for current Large Language Models (LLMs): the best-performing commercial LLM, Gemini-2.5-Flash, achieves an average score of only 65.08. Although instruction tuning boosts open-source performance: the best-performing open-source model, LLaMA-3.1-8B, shows significant room for improvement, highlighting the complexity of temporal analysis for LLMs.

2601.22574 2026-06-08 cs.CV cs.AI 版本更新

Enhancing Video Representations with Spatiotemporal-Semantic Residual to Mitigate Hallucinations in Video Large Multimodal Models

增强视频表示中的时空语义残差以缓解视频大型多模态模型中的幻觉

Yuansheng Gao, Jinman Zhao, Tong Zhang, Xingguo Xu, Wenbin Xing, Han Bao, Zonghui Wang, Wenzhi Chen

发表机构 * Zhejiang University(浙江大学) University of Toronto(多伦多大学) Dalian University of Technology(大连理工大学) Sun Yat-sen University(中山大学)

AI总结 提出ViSSRes方法,通过轻量级MLP网络学习视频表示的残差,从时空和语义一致性优化,在推理时仅需单次前向传播,有效降低幻觉率并提升视频理解性能。

Comments Preprint

详情
AI中文摘要

尽管视频大型多模态模型在视频理解方面取得了强劲性能,但它们仍然存在幻觉问题。现有的推理时干预方法通常在对比解码框架下修改视频,但其启发式设计带来的改进有限且增加了推理延迟。为了解决这些问题,我们提出了ViSSRes,一种通过轻量级MLP风格网络增强视频表示的推理时干预方法。具体来说,我们使用对比随机游走方法来表征视频表示的时空一致性,并引入条件互信息将视频表示与模型的语义理解关联起来。在保持模型主干冻结的情况下,ViSSRes学习视频表示的残差,并从时空和语义一致性角度优化它们。在推理时,ViSSRes仅需单次前向传播,且不会引入显著的额外推理成本。实验表明,ViSSRes在EventHallusion上将LLaVA-NeXT-Video的幻觉率降低了40.69%,并在CoT设置下将MMVU上的视频理解提升了18.36%,证明了其在缓解幻觉方面的有效性。

英文摘要

Although Video Large Multimodal Models have achieved strong performance in video understanding, they still suffer from hallucination. Existing inference-time intervention methods usually modify videos under the contrastive decoding framework, but their heuristic designs bring limited improvements and increase inference latency. To address these issues, we propose ViSSRes, an inference-time intervention method that enhances video representations through a lightweight MLP-style network. Specifically, we use a contrastive random walk approach to characterize the spatiotemporal consistency of video representations, and introduce conditional mutual information to associate video representations with the model's semantic understanding. With the model backbone kept frozen, ViSSRes learns residuals for video representations and optimizes them from both spatiotemporal and semantic consistency perspectives. During inference, ViSSRes requires only a single forward pass and introduces no substantial additional inference cost. Experiments show that ViSSRes reduces the hallucination rate of LLaVA-NeXT-Video on EventHallusion by 40.69% and improves video understanding on MMVU by 18.36% under the CoT setting, demonstrating its effectiveness in mitigating hallucinations.

2512.05291 2026-06-08 cs.LG 版本更新

SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning

基于SHAP引导的核化Actor-Critic可解释强化学习

Na Li, Hangguan Shan, Wei Ni, Wenjie Zhang, Xinyu Li

发表机构 * National University of Singapore(新加坡国立大学) University of Science and Technology of China(中国科学技术大学)

AI总结 提出RSA2C算法,利用RKHS-SHAP计算状态属性,通过马氏门控权重调节Actor梯度和Advantage Critic目标,实现高效、稳定且可解释的强化学习。

详情
Journal ref
ICML2026
AI中文摘要

Actor-Critic (AC) 方法是强化学习 (RL) 的基石,但可解释性有限。当前的可解释RL方法很少使用状态属性来辅助训练,而是平等对待所有状态特征,从而忽略了单个状态维度对奖励的异质性影响。我们提出基于RKHS-SHAP的高级Actor-Critic (RSA2C),一种属性感知的、核化的、双时间尺度AC算法,包括Actor、Value Critic和Advantage Critic。Actor实例化在向量值再生核希尔伯特空间 (RKHS) 中,使用马氏加权算子值核,而Value Critic和Advantage Critic位于标量RKHS中。这些RKHS增强组件使用稀疏化字典:Value Critic维护自己的字典,而Actor和Advantage Critic共享一个字典。通过RKHS-SHAP(用于流形上期望的核均值嵌入和流形外期望的条件均值嵌入)从Value Critic计算的状态属性被转换为马氏门控权重,用于调节Actor梯度和Advantage Critic目标。我们推导了在状态扰动下的全局非渐近收敛界,通过扰动误差项显示稳定性,通过收敛误差项显示效率。在三个连续控制环境上的实验结果表明,RSA2C实现了效率、稳定性和可解释性。我们的代码可在 https://github.com/Na-Li66/RSA2C 获取。

英文摘要

Actor-critic (AC) methods are a cornerstone of reinforcement learning (RL) but offer limited interpretability. Current explainable RL methods seldom use state attributions to assist training. Rather, they treat all state features equally, thereby neglecting the heterogeneous impacts of individual state dimensions on the reward. We propose RKHS-SHAP-based Advanced Actor-Critic (RSA2C), an attribution-aware, kernelized, two-timescale AC algorithm, including Actor, Value Critic, and Advantage Critic. The Actor is instantiated in a vector-valued reproducing kernel Hilbert space (RKHS) with a Mahalanobis-weighted operator-valued kernel, while the Value Critic and Advantage Critic reside in scalar RKHSs. These RKHS-enhanced components use sparsified dictionaries: the Value Critic maintains its own dictionary, while the Actor and Advantage Critic share one. State attributions, computed from the Value Critic via RKHS-SHAP (kernel mean embedding for on-manifold and conditional mean embedding for off-manifold expectations), are converted into Mahalanobis-gated weights that modulate Actor gradients and Advantage Critic targets. We derive a global, non-asymptotic convergence bound under state perturbations, showing stability through the perturbation-error term and efficiency through the convergence-error term. Empirical results on three continuous-control environments show that RSA2C achieves efficiency, stability, and interpretability. Our code is available at https://github.com/Na-Li66/RSA2C.

2505.15998 2026-06-08 cs.AI 版本更新

Exploring Flow-Lenia Universes with a Curiosity-driven AI Scientist: Discovering Diverse Ecosystem Dynamics

探索Flow-Lenia宇宙:基于好奇心驱动的AI科学家发现多样生态系统动力学

Thomas Michel, Marko Cvjetko, Gautier Hamon, Pierre-Yves Oudeyer, Clément Moulin-Frier

发表机构 * Univ. Lille, Inria, CNRS, Centrale Lille, CRIStAL, France(里尔大学、法国国家科学研究中心、中央里尔学院、CRIStAL实验室、法国) Inria Center at the University of Bordeaux, France(波尔多大学的Inria研究中心、法国) Inria, INSA Lyon, CITI, UR3720, 69621 Villeurbanne, France(Inria、里昂INSA、CITI、UR3720、法国)

AI总结 提出好奇心驱动的AI科学家方法,通过内在动机目标探索过程(IMGEP)在Flow-Lenia中发现系统级动力学,揭示类似生物现象的自组织行为,并展示大规模多样性搜索作为后续实验设计的框架。

Comments Extended version of the paper first published at ALife 2025. Project webpage: https://developmentalsystems.org/Exploring-Flow-Lenia-Universes/ 24 pages, 16 figures

详情
Journal ref
Proceedings of the Artificial Life Conference 2025, pp. 633-643
AI中文摘要

我们提出了一种好奇心驱动的AI科学家方法,用于发现Flow-Lenia中的系统级动力学。Flow-Lenia是一种具有质量守恒和参数局部化的连续元胞自动机(CA)。基于先前使用Lenia中的多样性搜索来发现个体自组织模式的工作,我们将内在动机目标探索过程(IMGEP)适应于交互模式的大型环境,使用模拟范围的度量,如进化活动、压缩比和多尺度物质分布。我们在两个探索实验中应用IMGEP:一个针对生态系统级动力学,另一个针对通过障碍物环境的物质运动。在这两个实验中,IMGEP比随机搜索照亮了更多的度量空间,并揭示了定性上类似于许多生物现象的自组织行为。利用生成的档案,我们随后在六个空间尺度和七个时间跨度上进行了缩放研究,揭示了在基础尺度上没有类似物的宏观尺度组织,并表征了目标空间度量在尺度上的行为。这说明了我们方法的一个优势:相对廉价的大规模多样性搜索可以作为设计后续更昂贵实验的原则性框架,通过交互式探索工具支持实验设计、检查和重新设计的迭代循环,使科学家保持在循环中。尽管在Flow-Lenia上进行了演示,但这种方法可能适用于其他可参数化的复杂系统,其中研究自下而上的集体行为是有意义的。

英文摘要

We present a curiosity-driven AI scientist method for discovering system-level dynamics in Flow-Lenia, a continuous cellular automaton (CA) with mass conservation and parameter localization. Building on prior work that uses diversity search in Lenia to find individual self-organized patterns, we adapt Intrinsically Motivated Goal Exploration Processes (IMGEPs) to large environments of interacting patterns, using simulation-wide metrics such as evolutionary activity, compression ratio, and multi-scale matter distribution. We apply IMGEP in two exploration experiments: one targeting ecosystem-level dynamics, the other matter movement through obstacle-laden environments. In both, IMGEP illuminates significantly more of the metric space than random search and reveals self-organized behaviors qualitatively resembling many biological phenomena. Leveraging the resulting archive, we then run a scaling study across six spatial scales and seven time horizons, uncovering macro-scale organization with no analogue at the base scale and characterizing how goal-space metrics behave at scale. This illustrates a strength of our approach: a relatively cheap large-scale diversity search can act as a principled scaffold for designing subsequent, more expensive experiments, enabling an iterative loop of experiment design, inspection, and redesign, supported by an interactive exploration tool that keeps scientists in the loop. Though demonstrated with Flow-Lenia, this approach potentially applies to other parameterizable complex systems where studying bottom-up collective behavior is of interest.

2512.09084 2026-06-08 cs.LG 版本更新

GS-KAN: Parameter-Efficient Kolmogorov-Arnold Networks via Sprecher-Type Shared Basis Functions

GS-KAN: 通过Sprecher型共享基函数的参数高效Kolmogorov-Arnold网络

Oscar Eliasson

发表机构 * Chalmers University of Technology(挑战大学)

AI总结 提出GS-KAN,通过每层共享单一父函数的线性变换构造边函数,在保持参数高效的同时,在函数逼近、表格回归和图像分类任务上优于或媲美现有KAN和MLP。

Comments 6 pages, 2 figures

详情
AI中文摘要

Kolmogorov-Arnold表示定理通过在边上而非节点上放置可学习单变量函数,为多层感知器(MLP)提供了理论替代方案。尽管最近的实现如Kolmogorov-Arnold网络(KAN)展示了高逼近能力,但由于需要为每个网络边维护唯一参数化,它们存在显著的参数低效问题。在这项工作中,我们提出GS-KAN(广义Sprecher-KAN),一种受David Sprecher对叠加定理的改进启发的轻量级架构。GS-KAN通过对每层单个可学习的共享父函数应用可学习线性变换来构造唯一的边函数。我们在合成函数逼近、表格数据回归和图像分类任务上评估了GS-KAN与现有KAN架构和MLP的性能。结果表明,GS-KAN在连续函数逼近任务上优于MLP和标准KAN基线,同时保持优越的参数效率。此外,GS-KAN在表格回归上与现有KAN架构性能相当,在高维分类任务上优于MLP。关键的是,所提出的架构使得在严格参数约束下的高维场景中部署基于KAN的架构成为可能,而标准实现由于参数爆炸通常不可行。源代码可在https://github.com/rambamn48/gs-impl获取。

英文摘要

The Kolmogorov-Arnold representation theorem offers a theoretical alternative to Multi-Layer Perceptrons (MLPs) by placing learnable univariate functions on edges rather than nodes. While recent implementations such as Kolmogorov-Arnold Networks (KANs) demonstrate high approximation capabilities, they suffer from significant parameter inefficiency due to the requirement of maintaining unique parameterizations for every network edge. In this work, we propose GS-KAN (Generalized Sprecher-KAN), a lightweight architecture inspired by David Sprecher's refinement of the superposition theorem. GS-KAN constructs unique edge functions by applying learnable linear transformations to a single learnable, shared parent function per layer. We evaluate GS-KAN against existing KAN architectures and MLPs across synthetic function approximation, tabular data regression and image classification tasks. Our results demonstrate that GS-KAN outperforms both MLPs and standard KAN baselines on continuous function approximation tasks while maintaining superior parameter efficiency. Additionally, GS-KAN achieves competitive performance with existing KAN architectures on tabular regression and outperforms MLPs on high-dimensional classification tasks. Crucially, the proposed architecture enables the deployment of KAN-based architectures in high-dimensional regimes under strict parameter constraints, a setting where standard implementations are typically infeasible due to parameter explosion. The source code is available at https://github.com/rambamn48/gs-impl.

2601.16622 2026-06-08 cs.LG cs.AI 版本更新

E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

E2Former-V2:具有线性激活内存的即时等变注意力

Lin Huang, Chengxiang Huang, Ziang Wang, Yiyue Du, Chu Wang, Haocheng Lu, Yunyang Li, Xiaoli Liu, Arthur Jiang, Jia Zhang

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 提出E2Former-V2架构,通过等变轴对齐稀疏化(EAAS)和即时等变注意力机制,利用SO(3)到SO(2)基变换和自定义Triton内核,实现线性激活内存和20倍TFLOPS提升,在SPICE和OMol25数据集上加速推理并保持预测性能。

详情
AI中文摘要

等变图神经网络(EGNN)已成为建模3D原子系统的广泛使用的方法。然而,主流架构由于在每条边上显式构造几何特征或密集张量积而面临关键的可扩展性瓶颈。为克服这一问题,我们引入了**E2Former-V2**,一种将代数稀疏性与硬件感知执行相结合的可扩展架构。我们首先提出**等变轴对齐稀疏化(EAAS)**。EAAS基于Wigner-$6j$卷积,利用$\mathrm{SO}(3) \rightarrow \mathrm{SO}(2)$基变换,将计算昂贵的密集张量收缩转化为高效的稀疏奇偶重索引操作。基于这种表示,我们引入了**即时等变注意力**,一种通过自定义融合Triton内核实现的完全节点中心机制。通过消除物化的边张量并最大化SRAM利用率,我们的内核相比标准实现实现了**20倍的TFLOPS提升**。在SPICE和OMol25数据集上的大量实验表明,E2Former-V2在保持相当预测性能的同时显著加速推理。这项工作表明,大型等变Transformer可以使用广泛可用的GPU平台高效训练。代码可在https://github.com/IQuestLab/UBio-MolFM/tree/e2formerv2获取。

英文摘要

Equivariant Graph Neural Networks (EGNNs) have become a widely used approach for modeling 3D atomistic systems. However, mainstream architectures face critical scalability bottlenecks due to the explicit construction of geometric features or dense tensor products on \textit{every} edge. To overcome this, we introduce \textbf{E2Former-V2}, a scalable architecture that integrates algebraic sparsity with hardware-aware execution. We first propose \textbf{E}quivariant \textbf{A}xis-\textbf{A}ligned \textbf{S}parsification (EAAS). EAAS builds on Wigner-$6j$ convolution by exploiting an $\mathrm{SO}(3) \rightarrow \mathrm{SO}(2)$ change of basis to transform computationally expensive dense tensor contractions into efficient, sparse parity re-indexing operations. Building on this representation, we introduce \textbf{On-the-Fly Equivariant Attention}, a fully node-centric mechanism implemented via a custom fused Triton kernel. By eliminating materialized edge tensors and maximizing SRAM utilization, our kernel achieves a \textbf{20$\times$ improvement in TFLOPS} compared to standard implementations. Extensive experiments on the SPICE and OMol25 datasets demonstrate that E2Former-V2 maintains comparable predictive performance while notably accelerating inference. This work demonstrates that large equivariant transformers can be trained efficiently using widely accessible GPU platforms. The code is avalible at https://github.com/IQuestLab/UBio-MolFM/tree/e2formerv2.

2601.10930 2026-06-08 cs.RO 版本更新

Where to Touch, How to Contact: Hierarchical RL-MPC Framework for Geometry-Aware Long-Horizon Dexterous Manipulation

何处触碰,如何接触:面向几何感知的长时间灵巧操作的分层RL-MPC框架

Zhixian Xie, Yu Xiang, Michael Posa, Wanxin Jin

发表机构 * Arizona State University(亚利桑那州立大学) University of Texas at Dallas(德克萨斯大学达拉斯分校) University of Pennsylvania(宾夕法尼亚大学)

AI总结 提出分层RL-MPC框架,高层RL策略预测接触意图(接触位置和子目标位姿),低层接触隐式MPC优化局部接触模式并实时重规划,实现几何泛化的非抓取操作,数据效率提升10倍且零样本迁移到真实环境。

详情
AI中文摘要

接触丰富的灵巧操作中的一个关键挑战是需要共同推理全局几何和非光滑接触动力学。端到端策略绕过了这一复杂性,但通常需要大量数据,并且从仿真到现实的迁移效果差。我们通过一个简单的见解来解决这些局限性:灵巧操作本质上是分层的——在高层次上,机器人决定在哪里触碰(几何);在低层次上,它确定如何通过接触动力学移动物体。基于这一见解,我们提出了一个分层RL-MPC框架,其中高层强化学习(RL)策略预测接触意图,这是一种新颖的以物体为中心的接口,指定了(i)物体表面接触位置和(ii)接触后的物体子目标位姿。在接触意图的条件下,低层接触隐式模型预测控制(MPC)优化局部接触模式,并通过接触动力学进行实时(重新)规划,以生成稳健地将物体移向每个子目标的机器人动作。我们在非抓取任务上评估该框架,包括跨不同物体形状的几何泛化推、基于翻转/旋转的物体重新定向以及环境辅助的物体重新定位。它实现了高成功率,数据量大幅减少(比端到端基线少10倍),高度稳健的性能,以及零样本从仿真到现实的迁移。

英文摘要

A key challenge in contact-rich dexterous manipulation is the need to jointly reason over global geometry and nonsmooth contact dynamics. End-to-end policies bypass this complexity, but often require large amounts of data and transfer poorly from simulation to reality. We address the limitations with a simple insight: dexterous manipulation is inherently hierarchical--at a high level, a robot decides where to touch (geometry); at a low level it determines how to move the object through contact dynamics. Building on this insight, we propose a hierarchical RL--MPC framework in which a high-level reinforcement learning (RL) policy predicts a contact intention, a novel object-centric interface that specifies (i) an object-surface contact location and (ii) a post-contact object subgoal pose. Conditioned on the contact intention, a low-level contact-implicit model predictive control (MPC) optimizes local contact modes and real-time (re)plans through contact dynamics to generate robot actions that robustly move the object toward each subgoal. We evaluate the framework on non-prehensile tasks, including geometry-generalized pushing across diverse object shapes, pivoting/flipping-based object reorientation, and environment-assisted object repositioning. It achieves high success rate with substantially reduced data (10 times less than end-to-end baselines), highly robust performance, and zero-shot sim-to-real transfer.

2408.08973 2026-06-08 cs.CV 版本更新

Image class translation: visual inspection of class-specific hypotheticals and classification based on translation distance

图像类别翻译:类别特定假设的视觉检查与基于翻译距离的分类

Mikyla K. Bowen, Jesse W. Wilson

发表机构 * College of Natural Sciences, Colorado State University, Colorado, United States of America(科罗拉多州立大学自然科学院) School of Biomedical and Chemical Engineering, Colorado State University, Colorado, United States of America(科罗拉多州立大学生物医学与化学工程学院) Department of Electrical and Computer Engineering, Colorado State University, Colorado, United States of America(科罗拉多州立大学电气与计算机工程学院)

AI总结 提出图像翻译网络用于分类,通过翻译距离作为低维特征进行分类,在皮肤镜和骨髓细胞图像上验证,可解释性优于传统CNN。

Comments 47 pages, 20 figures, submitted revision to SPIE J. Medical Imaging

详情
AI中文摘要

目的:人工智能在医学应用中的主要障碍是自动CNN缺乏可解释性,并且对错误决策(尤其是域外样本)有高置信度。我们提出图像翻译网络用于图像分类的泛化,并展示翻译网络作为传统黑盒分类器更可解释的替代方案的潜力。\n方法:我们训练一个图像到图像网络,将输入图像翻译为类别特定的假设,然后通过视觉和定量方式将这些假设与输入进行比较。翻译距离(即为了符合某一类别所需的改变程度)被检查其聚类和趋势,并用作分类的简单低维特征向量。\n结果:在黑色素瘤/良性皮肤镜图像上,翻译距离分类器仅使用2维特征空间就达到了80%的准确率(而传统CNN使用约62,000维特征空间达到85%)。对渲染图像的视觉检查揭示了数据集偏差,例如黑色素瘤照片中比良性病变有更多的比例尺。翻译距离空间中的图像分布揭示了沿着皮肤科医生活检决策的自然分离,而不是恶性与良性之间的分离。在骨髓细胞学图像上,翻译距离分类器在3类(92%准确率对比CNN的89%)和6类(90%对比86%)场景中均优于传统CNN。\n结论:这一概念验证表明,图像到图像翻译有潜力超越艺术/风格变化,揭示数据集偏差,进行降维和数据集可视化,并且在某些情况下可能优于传统的端到端CNN分类器。

英文摘要

Purpose: A major barrier to the implementation of artificial intelligence for medical applications is automated CNNs' lack of explainability and high confidence for incorrect decisions, specifically with out-of-domain samples. We propose a generalization of image translation networks for image classification and demonstrate translation networks' potential as a more interpretable alternative to conventional black-box classifiers. Approach: We train an image-to-image network to translate an input image to class-specific hypotheticals, and then compare these with the input, both visually and quantitatively. Translation distances, the degree of alteration needed to conform to one class or another, are examined for clusters and trends, and used as a simple low-dimensional feature vector for classification. Results: On melanoma/benign dermoscopy images, a translation distance classifier achieved 80% accuracy using only a 2-dimensional feature space (versus 85% for a conventional CNN using a ~62,000-dimensional feature space). Visual inspection of rendered images revealed dataset biases, like more scalebars in melanoma photographs than in benign lesions. Image distributions in translation distance space revealed a natural separation along the lines of dermatologist decision to biopsy, rather than between malignant and benign. On bone marrow cytology images, translation distance classifiers outperformed a conventional CNN in both 3-class (92% accuracy vs 89% for CNN) and 6-class (90% vs 86% for CNN) scenarios. Conclusions: This proof-of-concept shows the potential for image-to-image translation to go beyond artistic/stylistic changes and to expose dataset biases, perform dimension reduction and dataset visualization, and in some cases, potentially outperform conventional end-to-end CNN classifiers.

2601.10896 2026-06-08 cs.CL 版本更新

DialDefer: A Framework for Detecting and Mitigating LLM Dialogic Deference

DialDefer: 检测和缓解LLM对话性遵从的框架

Parisa Rabbani, Priyam Sahoo, Ruben Mathew, Aishee Mondal, Harshita Ketharaman, Nimet Beyza Bozdag, Dilek Hakkani-Tür

发表机构 * University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校)

AI总结 提出DialDefer框架,通过对话性遵从分数检测和缓解LLM在对话评估中因提问框架导致的判断偏移,发现框架效应显著但准确率稳定,且模型对人类与AI的不同归因产生最大偏移。

Comments 10 pages main content, 7 figures, 35 pages total with appendix

详情
AI中文摘要

LLM越来越多地被用作第三方评判者,但它们在评估对话中的说话者时的可靠性仍知之甚少。我们证明,LLM对相同主张的判断因框架而异:相同内容在作为陈述验证(“这个陈述正确吗?”)与归因于说话者(“这个说话者正确吗?”)时得到不同裁决。我们称此为对话性遵从,并引入DialDefer,一个用于检测和缓解这些框架诱导的判断偏移的框架。我们的对话性遵从分数(DDS)捕捉了聚合准确性所掩盖的方向性偏移。在十个领域、3000多个实例和五个模型上,对话框架诱导了大幅偏移(模型间平均|DDS|=15.9个百分点,p<0.0001),而准确性保持稳定(<2个百分点),在自然Reddit对话中效应放大2-5倍。这种效应是领域依赖的:单个模型可以在研究生级别的科学上转向不同意(怀疑),在社会判断上转向同意(遵从)。消融实验揭示,人类与LLM的归因导致最大偏移(17.7个百分点的摆动),表明模型认为与人类的分歧比与AI的分歧代价更高。缓解尝试可以减少遵从,但过度校正为怀疑,揭示了超出准确性优化的校准问题。

英文摘要

LLMs are increasingly used as third-party judges, yet their reliability when evaluating speakers in dialogue remains poorly understood. We show that LLMs judge identical claims differently depending on framing: the same content receives different verdicts when presented as a statement to verify ("Is this statement correct?") versus attributed to a speaker ("Is this speaker correct?"). We call this dialogic deference and introduce DialDefer, a framework for detecting and mitigating these framing-induced judgment shifts. Our Dialogic Deference Score (DDS) captures directional shifts that aggregate accuracy obscures. Across ten domains, 3k+ instances, and five models, conversational framing induces large shifts (mean|DDS|=15.9 percentage points (pp) across models, p < .0001) while accuracy remains stable (<2 pp), with effects amplifying 2--5x on naturalistic Reddit conversations. This effect is domain-dependent: a single model can shift toward disagreement (skepticism) on graduate-level science and toward agreement (deference) on social judgment. Ablations reveal that human-vs-LLM attribution drives the largest shifts (17.7 pp swing), suggesting models treat disagreement with humans as more costly than with AI. Mitigation attempts can reduce deference but over-correct into skepticism, revealing a calibration problem beyond accuracy optimization.

2601.09698 2026-06-08 cs.CV 版本更新

COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation

COMPOSE:用于多视角三维人体姿态估计的超图覆盖优化

Tony Danjun Wang, Tolga Birdal, Nassir Navab, Lennart Bastian

发表机构 * School of Computation, Information, and Technology, Technical University of Munich(技术大学慕尼黑计算、信息与技术学院) Munich Center for Machine Learning(慕尼黑机器学习中心) Department of Computing, Imperial College London(伦敦帝国学院计算机系)

AI总结 提出COMPOSE方法,将多视角三维人体姿态估计重构为超图上的加权精确覆盖优化,通过全局组合目标替代局部配对关联,结合几何剪枝与整数线性规划或信念传播求解器,无监督下精度提升显著。

详情
AI中文摘要

从稀疏多视角相机装置中进行三维人体姿态估计是众多应用(包括动作识别、体育分析和人机交互)的基本任务。尽管学习方法在基准测试中占据主导地位,但它们需要大量标注数据集;无训练的基于优化的方法仍然有前景,因为它们通过解决来自二维检测的跨视角对应问题来规避三维监督。现有的组合公式依赖配对关联来建模这一对应问题,并将跨视角的全局一致性仅作为下游约束来强制执行。然而,在遮挡和噪声检测下,调和局部合理的配对匹配变得脆弱,局部错误会全局传播。我们提出COMPOSE,它将多视角三维人体姿态估计重新定义为对人物假设超图上的加权精确覆盖优化。我们的公式用单个全局组合目标替代了配对关联和事后一致性强制执行。为了应对指数级大的候选空间,我们引入了一种几何剪枝策略以及两种互补的求解器:精确整数线性规划公式和通过信念传播的可扩展松弛。在没有任何三维监督的情况下,COMPOSE在平均精度上比最佳基于优化的方法提高了31个百分点,比自监督学习方法提高了13个百分点,证明了高阶组合关联在无训练的多视角三维人体姿态估计中的有效性。

英文摘要

3D human pose estimation from sparse multi-view camera rigs is an essential task for numerous applications, including action recognition, sports analysis, and human-robot interaction. While learned methods dominate the field on benchmarks, they require large annotated datasets; training-free optimization-based methods remain promising as they circumvent 3D supervision by solving a correspondence problem across views from 2D detections. Existing combinatorial formulations rely on pairwise associations to model this correspondence problem and enforce global consistency across views only as a downstream constraint. However, reconciling locally plausible pairwise matches becomes brittle under occlusion and noisy detections, where local errors propagate globally. We propose COMPOSE, which recasts multi-view 3D human pose estimation as a weighted exact-cover optimization over a hypergraph of person hypotheses. Our formulation replaces pairwise association and post-hoc consistency enforcement with a single global combinatorial objective. To address the exponentially large candidate space, we introduce a geometric pruning strategy alongside two complementary solvers: an exact Integer Linear Programming formulation and a scalable relaxation via Belief Propagation. Without any 3D supervision, COMPOSE improves average precision by up to 31 points over the best optimization-based method and 13 points over self-supervised learned methods, demonstrating the effectiveness of higher-order combinatorial association for training-free multi-view 3D human pose estimation.

2601.09402 2026-06-08 cs.CL 版本更新

SEEK: Steering LLM Reasoning for RAG via Internal Reasoning Sketches

SEEK: 通过内部推理草图引导LLM推理用于RAG

Xinze Li, Yuqing Lan, Zhenghao Liu, Haidong Xin, Yukun Yan, Shuo Wang, Zheni Zeng, Sen Mei, Ge Yu, Maosong Sun

发表机构 * School of Computer Science and Engineering, Northeastern University, China(东北大学计算机科学与工程学院) Department of Computer Science and Technology, Institute for AI, Tsinghua University, China(清华大学计算机科学与技术系,人工智能研究院) School of Intelligent Science and Technology, Nanjing University, China(南京大学智能科学与技术学院)

AI总结 提出SEEK框架,通过构建结构化引导草图,迭代检索和填充知识槽,减少冗余检索,提升RAG性能。

详情
AI中文摘要

检索增强生成(RAG)通过将外部知识融入生成过程来增强大型语言模型(LLM)。借助LLM的推理能力,现有方法利用这种能力实现迭代知识获取和积累,从而更好地支持答案生成。然而,随着推理轨迹的增长,积累的知识和先前生成的查询可能会干扰后续检索决策,导致子查询意图重复和知识获取冗余。为了解决这个问题,我们提出了SEEK,一种用于RAG的草图引导知识获取框架。SEEK首先提示LLM为给定问题构建一个结构化的引导草图。它由多组引导要点组成,每个要点后跟一个用于知识填充的槽位。在这些引导要点的指导下,SEEK迭代地检索和精炼知识,并填充相应的槽位以完成草图。然后,完成的草图作为上下文输入用于最终答案生成。实验结果表明,SEEK在多个任务上取得了比基线模型更好的性能。进一步分析表明,SEEK可以生成更多样化的子查询,减少冗余检索,并在外部知识利用和内部知识冲突缓解之间实现更好的平衡。所有代码可在 https://github.com/OpenBMB/PAGER 获取。

英文摘要

Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by incorporating external knowledge into the generation process. Benefiting from the reasoning capabilities of LLMs, existing methods have leveraged such capabilities to enable iterative knowledge acquisition and accumulation, thereby better supporting answer generation. However, as the reasoning trajectory grows, the accumulated knowledge and previously generated queries may interfere with subsequent retrieval decisions, resulting in sub-queries with repetitive intents and redundant knowledge acquisition. To address this issue, we propose SEEK, a sketch-guided knowledge acquisition framework for RAG. SEEK first prompts the LLM to construct a structured steering sketch for the given question. It consists of multiple groups of steering gists, with each gist followed by a slot for knowledge filling. Guided by these steering gists, SEEK iteratively retrieves and refines knowledge, and fills the corresponding slots to complete the sketch. The completed sketch is then used as contextual input for final answer generation. Experimental results show that SEEK achieves better performance than baseline models across multiple tasks. Further analyses demonstrate that SEEK can generate more diverse sub-queries, reduce redundant retrieval, and achieve a better balance between external knowledge utilization and internal knowledge conflict mitigation. All codes are available at https://github.com/OpenBMB/PAGER.

2601.08097 2026-06-08 cs.CL cs.LG 版本更新

AdaJudge: Adaptive Multi-Perspective Judging for Reward Modeling

AdaJudge: 自适应多视角评判用于奖励建模

Yongliang Miao, Yangyang Liang, Mengnan Du

发表机构 * Emory University(埃默里大学) The Chinese University of Hong Kong, Shenzhen(香港中文大学(深圳))

AI总结 提出AdaJudge框架,通过门控精化块和自适应多视角池化模块,联合优化表示与聚合,解决奖励建模中静态归纳偏差和表示不匹配问题,在RM-Bench和JudgeBench上超越现有模型。

Comments ACL 2026

详情
AI中文摘要

奖励建模对于将大型语言模型与人类偏好对齐至关重要,但主流架构依赖静态池化策略将序列压缩为标量分数。然而,这种范式存在两个关键限制:静态归纳偏差与任务相关的偏好信号不匹配,以及表示不匹配,因为骨干网络针对生成的优化使其表示不适用于细粒度判别。为解决这一问题,我们提出AdaJudge,一个统一框架,联合调整表示和聚合。AdaJudge首先通过门控精化块将骨干网络表示改进到判别导向的空间。然后,它用自适应多视角池化模块替换静态读出,该模块动态路由并组合证据。在RM-Bench和JudgeBench上的大量实验表明,AdaJudge优于强大的现成奖励模型和传统池化基线。

英文摘要

Reward modeling is essential for aligning large language models with human preferences, yet predominant architectures rely on a static pooling strategy to condense sequences into scalar scores. This paradigm, however, suffers from two key limitations: a static inductive bias that misaligns with task-dependent preference signals, and a representational mismatch, as the backbone's optimization for generation leaves its representations ill-suited to fine-grained discrimination. To address this, we propose AdaJudge, a unified framework that jointly adapts representation and aggregation. AdaJudge first improves backbone representations into a discrimination-oriented space via gated refinement blocks. It then replaces the static readout with an adaptive multi-view pooling module, which dynamically routes and combines evidence. Extensive experiments on RM-Bench and JudgeBench show that AdaJudge outperforms strong off-the-shelf reward models and traditional pooling baselines.

2601.05751 2026-06-08 cs.CL cs.AI 版本更新

Analysing Differences in Persuasive Language in LLM-Generated Text: Uncovering Stereotypical Gender Patterns

分析LLM生成文本中说服性语言的差异:揭示刻板的性别模式

Amalie Brogaard Pauli, Maria Barrett, Max Müller-Eberstein, Isabelle Augenstein, Ira Assent

发表机构 * Department of Computer Science, Aarhus University(阿arhus大学计算机科学系) AMD Silo AI University of Tokyo(东京大学) IT University of Copenhagen(哥本哈根IT大学) Department of Computer Science, University of Copenhagen(哥本哈根大学计算机科学系)

AI总结 提出框架评估LLM生成说服性语言时受接收者性别、发送者意图和输出语言的影响,发现所有模型均存在显著的性别差异,反映性别刻板印象的语言倾向。

Comments Accepted at ACL Findings 2026

详情
AI中文摘要

大型语言模型(LLMs)越来越多地用于日常交流任务,包括起草旨在影响和说服的人际信息。先前研究表明,LLMs能够成功说服人类并放大说服性语言。因此,理解用户指令如何影响说服性语言的生成,以及生成的说服性语言是否因目标群体不同而有所差异至关重要。在这项工作中,我们提出了一个框架,用于评估说服性语言生成如何受接收者性别、发送者意图或输出语言的影响。我们使用成对提示指令评估了13个LLMs和16种语言。我们采用基于社会心理学和传播科学的LLM-as-judge设置,在19个说服性语言类别上评估模型响应。我们的结果揭示了所有模型生成的说服性语言中存在显著的性别差异。这些模式反映了与社会心理学和社会语言学中记录的性别刻板语言倾向一致的偏见。

英文摘要

Large language models (LLMs) are increasingly used for everyday communication tasks, including drafting interpersonal messages intended to influence and persuade. Prior work has shown that LLMs can successfully persuade humans and amplify persuasive language. It is therefore essential to understand how user instructions affect the generation of persuasive language, and to understand whether the generated persuasive language differs, for example, when targeting different groups. In this work, we propose a framework for evaluating how persuasive language generation is affected by recipient gender, sender intent, or output language. We evaluate 13 LLMs and 16 languages using pairwise prompt instructions. We evaluate model responses on 19 categories of persuasive language using an LLM-as-judge setup grounded in social psychology and communication science. Our results reveal significant gender differences in the persuasive language generated across all models. These patterns reflect biases consistent with gender-stereotypical linguistic tendencies documented in social psychology and sociolinguistics.

2601.05675 2026-06-08 cs.AI 版本更新

CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space

CHDP:参数化动作空间中强化学习的协同混合扩散策略

Bingyi Liu, Jinbo He, Haiyong Shi, Enshu Wang, Weizhen Han, Jingxiang Hao, Peixi Wang, Zhuangzhuang Zhang

发表机构 * National University of Singapore(新加坡国立大学) University of Science and Technology of China(中国科学技术大学)

AI总结 针对混合动作空间中的策略表达力不足和高维扩展性差问题,提出协同混合扩散策略框架,通过离散和连续扩散策略的协作与顺序更新,结合码本嵌入和Q函数引导,在基准测试中成功率提升高达19.3%。

Comments Accepted by AAAI 2026

详情
AI中文摘要

混合动作空间结合了离散选择和连续参数,在机器人控制和游戏AI等领域普遍存在。然而,高效建模和优化离散-连续混合动作空间仍然是一个基本挑战,主要由于策略表达力有限和高维设置下的可扩展性差。为应对这一挑战,我们将混合动作空间问题视为完全合作博弈,并提出\textbf{协同混合扩散策略(CHDP)}框架来解决。CHDP采用两个协作智能体,分别利用离散和连续扩散策略。连续策略以离散动作的表示为条件,显式建模它们之间的依赖关系。这种协作设计使扩散策略能够利用其表达力捕获各自动作空间中的复杂分布。为缓解协作设置中同时更新策略导致的更新冲突,我们采用顺序更新方案以促进协同适应。此外,为提高在高维离散动作空间中学习时的可扩展性,我们构建了一个将动作空间嵌入低维潜在空间的码本。该映射使离散策略能够在紧凑、结构化的空间中学习。最后,我们设计了一种基于Q函数的引导机制,在训练过程中对齐码本的嵌入与离散策略的表示。在具有挑战性的混合动作基准测试中,CHDP的成功率比最先进方法高出高达19.3%。

英文摘要

Hybrid action space, which combines discrete choices and continuous parameters, is prevalent in domains such as robot control and game AI. However, efficiently modeling and optimizing hybrid discrete-continuous action space remains a fundamental challenge, mainly due to limited policy expressiveness and poor scalability in high-dimensional settings. To address this challenge, we view the hybrid action space problem as a fully cooperative game and propose a \textbf{Cooperative Hybrid Diffusion Policies (CHDP)} framework to solve it. CHDP employs two cooperative agents that leverage a discrete and a continuous diffusion policy, respectively. The continuous policy is conditioned on the discrete action's representation, explicitly modeling the dependency between them. This cooperative design allows the diffusion policies to leverage their expressiveness to capture complex distributions in their respective action spaces. To mitigate the update conflicts arising from simultaneous policy updates in this cooperative setting, we employ a sequential update scheme that fosters co-adaptation. Moreover, to improve scalability when learning in high-dimensional discrete action space, we construct a codebook that embeds the action space into a low-dimensional latent space. This mapping enables the discrete policy to learn in a compact, structured space. Finally, we design a Q-function-based guidance mechanism to align the codebook's embeddings with the discrete policy's representation during training. On challenging hybrid action benchmarks, CHDP outperforms the state-of-the-art method by up to $19.3\%$ in success rate.

2505.11470 2026-06-08 cs.CL 版本更新

Reference-Free Evaluation of Taxonomies

无参考评价的层次分类体系

Pascal Wullschleger, Majid Zarharan, Donnacha Daly, Marc Pouly, Jennifer Foster

发表机构 * Hamilton Institute, Maynooth University, Ireland(爱尔兰梅诺特大学哈密尔顿研究所) School of Computing, Dublin City University, Ireland(爱尔兰都柏林城市大学计算学院) Lucerne School of Computer Science and IT, Switzerland(瑞士卢塞恩计算机科学与信息技术学院)

AI总结 提出两种无参考指标评估层次分类体系质量:基于语义与分类相似性相关性的鲁棒性指标,以及基于自然语言推理的逻辑充分性指标,在五个层次分类体系上验证与真实F1值高度相关,并能预测下游层次分类性能。

详情
AI中文摘要

我们引入了两种无参考指标,用于在缺乏标签的情况下评估层次分类体系的质量。第一个指标通过计算语义相似性与分类相似性之间的相关性来评估鲁棒性,解决了现有指标未考虑的错误类型。第二个指标使用自然语言推理来评估逻辑充分性。这两个指标在五个层次分类体系上进行了测试,结果显示它们与真实层次分类体系的F1值高度相关。我们进一步证明,当与标签层次结构一起使用时,我们的指标可以预测层次分类中的下游性能。

英文摘要

We introduce two reference-free metrics for quality evaluation of taxonomies in the absence of labels. The first metric evaluates robustness by calculating the correlation between semantic and taxonomic similarity, addressing error types not considered by existing metrics. The second uses Natural Language Inference to assess logical adequacy. Both metrics are tested on five taxonomies and are shown to correlate well with F1 against ground truth taxonomies. We further demonstrate that our metrics can predict downstream performance in hierarchical classification when used with label hierarchies.

2510.26714 2026-06-08 cs.LG cs.AI 版本更新

On the importance of multiple training seeds for evaluating machine unlearning

关于多个训练种子在评估机器遗忘中的重要性

Jamie Lanyon, Axel Finke, Petros Andreou, Georgina Cosma

发表机构 * Department of Computer Science(计算机科学系) School of Mathematics(数学学院) School of Science(科学学院) Statistics and Physics(统计学与物理学) Loughborough University(洛桑大学) Newcastle University(新castle大学)

AI总结 本文指出评估机器遗忘算法时仅使用单个训练种子可能导致结果不具代表性,并通过图像分类、联邦学习排序和大语言模型实验验证了问题普遍性,最后给出选择训练和遗忘种子数量的指导。

Comments mini paper, 5 figures

详情
AI中文摘要

机器遗忘旨在从训练好的模型中移除某些数据点的影响,而无需昂贵的重新训练。大多数实用的遗忘算法只是近似,其性能只能通过经验评估。常见做法是从同一个训练好的模型(即仅使用单个训练种子)开始,多次独立运行遗忘算法(即使用多个遗忘种子)。在图像分类实验中,这种做法可能给出不具代表性的结果,因为遗忘性能可能对训练种子的选择敏感。这对于确定性遗忘方法尤其相关,这些方法从同一个训练好的模型开始时总是产生相同的结果。在联邦学习排序和大语言模型上的进一步实验证实,这个问题不仅限于图像分类。我们还解释了为什么增加遗忘种子的数量通常无法弥补多个训练种子的缺失。最后,我们给出了如何选择训练和遗忘种子数量的指导。

英文摘要

Machine unlearning aims to remove the influence of certain data points from a trained model without costly retraining. Most practical unlearning algorithms are only approximate and their performance can only be assessed empirically. Common practice is to run unlearning algorithms multiple times independently (i.e., using multiple unlearning seeds) starting from the same trained model (i.e., using only a single training seed ). In image-classification experiments, this practice can give non-representative results as unlearning performance can be sensitive to the choice of training seed. This is particularly relevant for deterministic unlearning methods which always produce the same result when started from the same trained model. Further experiments on federated learning-to-rank, and large language models confirm that this issue extends beyond image classification. We also explain why increasing the number of unlearning seeds cannot generally compensate for the lack of multiple training seeds. Finally, we give guidance on how to select the number of training and unlearning seeds.

2505.21423 2026-06-08 cs.LG stat.ML 版本更新

Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization

稳定性边缘的冲突偏差:范数与锐度正则化

Maria Matveev, Vit Fojtik, Hung-Hsu Chou, Gitta Kutyniok, Johannes Maly

发表机构 * Munich Center for Machine Learning (MCML)(慕尼黑机器学习中心) Prusa Research(普拉萨研究公司) Institute for Robotics and Mechatronics, DLR-German Aerospace Center(德国航空航天中心机器人与机电研究所)

AI总结 本文研究过参数化网络中梯度下降的隐式正则化,证明学习率在低范数与低锐度之间插值,且单一偏差不足以解释泛化,需考虑动态权衡。

Comments Accepted at ICML 2026

详情
AI中文摘要

过参数化网络显著的泛化性能通常归因于隐式偏差,例如小学习率下的范数最小化和稳定性边缘(Edge-of-Stability)状态下的低锐度。在这项工作中,我们认为全面理解梯度下降的泛化性能需要分析这些不同形式的隐式正则化之间的相互作用。我们通过实验证明,学习率在训练模型的低参数范数和低锐度之间插值。此外,我们证明对于在简单回归任务上训练的对角线性网络,单独的隐式偏差都不能最小化泛化误差。这些发现表明,仅关注单一隐式偏差不足以解释良好的泛化,并促使我们采用更广阔的隐式正则化视角,捕捉由不可忽略的学习率引起的范数与锐度之间的动态权衡。

英文摘要

The remarkable generalization properties of overparameterized networks are often attributed to implicit biases, such as norm minimization at small learning rates and low sharpness in the Edge-of-Stability regime. In this work, we argue that a comprehensive understanding of the generalization performance of gradient descent requires analyzing the interaction between these various forms of implicit regularization. We empirically demonstrate that the learning rate interpolates between low parameter norm and low sharpness of the trained model. We furthermore prove that neither implicit bias alone minimizes the generalization error for diagonal linear networks trained on a simple regression task. These findings demonstrate that focusing on a single implicit bias is insufficient to explain good generalization, and they motivate a broader view of implicit regularization that captures the dynamic trade-off between norm and sharpness induced by non-negligible learning rates.