2602.08324 2026-06-15 cs.LG 版本更新

Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

通过极端比例思维链压缩实现高效大型语言推理模型

Yuntian Tang, Bohan Jia, Wenxuan Huang, Lianyue Zhang, Jiao Xie, Wenxi Li, Wei Li, Jie Hu, Xinghao Chen Rongrong Ji, Shaohui Lin

发表机构 * University of Science and Technology of China（中国科学技术大学）

AI总结提出Extra-CoT框架，通过极端比例压缩思维链、混合比例监督微调和约束层次化比率策略优化，在显著减少推理令牌的同时保持甚至提升推理准确率。

Comments Accepted to ICML 2026. 15 pages, 7 figures

详情

AI中文摘要

思维链推理成功增强了大型语言模型的推理能力，但推理时会产生大量计算开销。现有的思维链压缩方法在高压缩比下常遭受关键逻辑保真度的损失，导致性能显著下降。为实现高保真、快速推理，我们提出了一种新颖的极端比例思维链压缩框架，称为Extra-CoT，该框架在保留答案准确性的同时，激进地减少令牌预算。为了生成可靠的高保真监督，我们首先在带有细粒度标注的数学思维链数据上训练一个专用的语义保留压缩器。然后，通过混合比例监督微调对大型语言模型进行微调，使其学习遵循一系列压缩预算，并为强化学习提供稳定的初始化。我们进一步提出约束和层次化比率策略优化，通过层次化奖励明确激励在较低预算下的问题解决能力。在三个数学推理基准上的实验显示了Extra-CoT的优越性。例如，在MATH-500上使用Qwen3-1.7B，Extra-CoT实现了超过73%的令牌减少，同时准确率提升0.6%，显著优于最先进方法。我们的源代码已在https://github.com/Mwie1024/Extra-CoT发布。

英文摘要

Chain-of-Thought (CoT) reasoning successfully enhances the reasoning capabilities of Large Language Models (LLMs), yet it incurs substantial computational overhead for inference. Existing CoT compression methods often suffer from a critical loss of logical fidelity at high compression ratios, resulting in significant performance degradation. To achieve high-fidelity, fast reasoning, we propose a novel EXTreme-RAtio Chain-of-Thought Compression framework, termed Extra-CoT, which aggressively reduces the token budget while preserving answer accuracy. To generate reliable, high-fidelity supervision, we first train a dedicated semantically-preserved compressor on mathematical CoT data with fine-grained annotations. An LLM is then fine-tuned on these compressed pairs via a mixed-ratio supervised fine-tuning (SFT), teaching it to follow a spectrum of compression budgets and providing a stable initialization for reinforcement learning (RL). We further propose Constrained and Hierarchical Ratio Policy Optimization (CHRPO) to explicitly incentivize question-solving ability under lower budgets by a hierarchical reward. Experiments on three mathematical reasoning benchmarks show the superiority of Extra-CoT. For example, on MATH-500 using Qwen3-1.7B, Extra-CoT achieves over 73\% token reduction with an accuracy improvement of 0.6\%, significantly outperforming state-of-the-art (SOTA) methods. Our source codes have been released at https://github.com/Mwie1024/Extra-CoT.

URL PDF HTML ☆

赞 0 踩 0

2406.09250 2026-06-15 cs.CV cs.AI cs.LG 版本更新

MirrorCheck: Efficient Adversarial Defense for Vision-Language Models

MirrorCheck: 视觉-语言模型的高效对抗防御

Samar Fares, Klea Ziu, Toluwani Aremu, Nikita Durasov, Martin Takáč, Pascal Fua, Ivan Laptev, Karthik Nandakumar

发表机构 * Mohamed Bin Zayed University of Artificial Intelligence（莫扎伊德大学人工智能大学）； NVIDIA ； École Polytechnique Fédérale de Lausanne（洛桑联邦理工学院）； Michigan State University（密歇根州立大学）

AI总结提出MirrorCheck框架，利用文本到图像模型和随机化策略检测并防御针对视觉-语言模型的自适应对抗攻击。

详情

AI中文摘要

视觉-语言模型（VLM）越来越容易受到复杂的对抗性攻击，包括专门设计用于绕过现有防御的自适应策略。为了解决这一漏洞，我们提出了MirrorCheck，一个鲁棒且与模型无关的检测框架，在单模态和多模态设置中均能有效运行。MirrorCheck利用文本到图像（T2I）模型从目标模型生成的标题中重建视觉内容，并通过比较原始图像和合成图像之间的特征空间嵌入来评估语义一致性。为了增强对自适应攻击的鲁棒性，MirrorCheck引入了一种随机防御策略，从多样化的模型库中随机选择T2I生成器和图像编码器。此外，我们采用了一种新颖的一次性（OTU）扰动，应用于所选编码器嵌入，并通过缩放因子调节，这降低了自适应攻击的有效性。跨多种威胁场景的大量实验表明，MirrorCheck始终优于基线方法，即使在强自适应对抗条件下也能保持其实用性。

英文摘要

Vision-Language Models (VLMs) are increasingly susceptible to sophisticated adversarial attacks, including adaptive strategies specifically designed to bypass existing defenses. To address this vulnerability, we propose MirrorCheck, a robust and model-agnostic detection framework that operates effectively in both unimodal and multimodal settings. MirrorCheck leverages Text-to-Image (T2I) models to regenerate visual content from captions produced by the target model and assesses semantic consistency by comparing feature-space embeddings between the original and synthesized images. To enhance robustness against adaptive attacks, MirrorCheck introduces a stochastic defense strategy that randomly selects T2I generators and image encoders from a diverse model zoo. Additionally, we incorporate a novel One-Time-Use (OTU) perturbation applied to the selected encoder embeddings, regulated by a scaling factor, which decreases the effectiveness of adaptive attacks. Extensive experiments across multiple threat scenarios demonstrate that MirrorCheck consistently outperforms baseline methods, and maintains its utility even under strong adaptive adversarial conditions.

URL PDF HTML ☆

赞 0 踩 0

2605.25025 2026-06-15 cs.RO cs.SY eess.SY 版本更新

Micro-Swarm Locomotion Optimization in Dynamic Flow using Multi-Objective Multi-Agent Reinforcement Learning

动态流场中微群集运动优化的多目标多智能体强化学习方法

Josef Berman, Oren Gal

发表机构 * Hatter Department of Marine Technologies, Leon H. Charney School of Marine Sciences, University of Haifa（哈特尔海洋技术系，列昂·H·夏恩海洋科学学院，海法大学）

AI总结提出混合CFD与多目标多智能体强化学习框架，通过PCGrad解决梯度冲突，在振荡流中优化微机器人集群的上游推进、能量效率和运动平滑性。

详情

AI中文摘要

在生理真实、时间依赖的流体环境中协调微型机器人集群，仍然是生物医学和环境应用中的未解决挑战。我们提出了一种混合计算流体动力学-多目标多智能体强化学习框架，该框架将高保真不可压缩纳维-斯托克斯求解器与去中心化近端策略优化直接耦合，以在振荡流中学习物理一致的集群控制策略。十六个磁驱动微型机器人在脉动动脉波形中导航，同时优化上游推进、能量守恒和运动平滑性，并通过PCGrad手术进行协调。没有PCGrad时，能量效率和平滑度奖励在10000训练步内降至接近零，而进度表现出持续的大幅振荡，证实梯度冲突解决是该领域的一个结构性要求而非可选改进。收敛策略实现了6.5-7.0的进度奖励、0.63-0.65的持续能量效率以及接近最大的平滑度（0.97-0.99），在主目标上比暴力基线有所改进，而两个基线在整个过程中能量效率均为负值。训练揭示了三个涌现行为阶段：在正向流动期间抑制峰值通道速度的集体双层水动力节流编队、利用流动反转进行上游重新定位的周期同步棘轮机制，以及智能体接近成功边界时的个体化最终接近。这些结果表明，时间依赖的流体-智能体相互作用可以直接在多目标强化学习循环中捕获，为生物医学导航、环境监测和工业微流体中的微群集控制提供了基于物理的范式。

英文摘要

Coordinating micro-robotic swarms in realistic, time-dependent fluid environments remains a major challenge for biomedical and environmental applications. We present a hybrid CFD-MO-MARL (Computational Fluid Dynamics-Multi Objective-Multi Agent Reinforcement Learning) framework that couples a high-fidelity incompressible Navier--Stokes solver with decentralized proximal policy optimization to learn swarm control policies in oscillatory flow. Sixteen magnetically actuated micro-robots were simulated to navigate a pulsatile arterial waveform within a 2 mm channel while jointly optimizing upstream progression, energy efficiency, and motion smoothness. Conflicting objectives are resolved using Projected Conflicting Gradient (PCGrad) surgery. Without PCGrad, energy and smoothness rewards collapse during training, demonstrating that gradient conflict resolution is essential for stable multi-objective learning. The converged policy achieves progress rewards of 6.5-7.0, energy efficiency of 0.63-0.65, and smoothness of 0.97-0.99, outperforming brute-force baselines by more than 8 reward units on the primary objective. Training reveals three emergent behaviors not encoded in the reward function: hydrodynamic throttling formations that reduce peak flow velocities, a cycle-synchronized ratchet mechanism that exploits flow reversals for upstream movement, and individualized final-approach strategies near the target boundary. These results demonstrate that physically realistic fluid--agent interactions can be integrated directly into multi-objective reinforcement learning, providing a scalable framework for micro-swarm control in biomedical navigation, environmental monitoring, and microfluidic systems.

URL PDF HTML ☆

赞 0 踩 0

2604.26740 2026-06-15 cs.CV cs.GR 版本更新

我没有做出微决策：在协作中测量、诱导和暴露目标级AI贡献

Eunsu Kim, Jessica R. Mindel, Kyungjin Kim, Sherry Tongshuang Wu

发表机构 * KAIST（韩国科学技术院）； Carnegie Mellon University（卡内基梅隆大学）； Seoul National University（首尔国立大学）

AI总结本文提出CoTrace框架，用于测量和暴露协作中目标级AI贡献，发现模型在目标塑造中贡献有限，但在引入具体要求和间接影响方面作用显著，且交互设计影响模型行为。

详情

AI中文摘要

随着大型语言模型（LLMs）越来越多地影响用户如何形成、细化和扩展目标，将贡献归因于人类-人工智能协作变得对用户校准自身依赖性和评估者评估AI辅助工作至关重要。然而，现有方法专注于最终成果，忽略了目标本身共同塑造的过程。我们引入了一个目标级归因框架CoTrace，将显式目标分解为可验证的需求，并追踪对话回合中直接贡献和间接影响。对638个真实世界协作日志应用CoTrace，发现尽管模型仅在目标塑造中贡献11-26%，但它们在引入较低层次的具体需求方面贡献显著，并产生各种间接贡献。通过受控模拟，我们展示了交互设计选择显著影响模型目标塑造行为。在一项用户研究中，向参与者暴露目标级分析使他们对贡献的感知在5分量表上几乎增加2分，揭示了用户在理解自身AI辅助工作时的系统性误校准。

英文摘要

As large language models (LLMs) increasingly shape how users form, refine, and extend their goals, attributing contributions in human-AI collaboration becomes critical for users calibrating their own reliance and for evaluators assessing AI-assisted work. Yet existing methods focus on final artifacts, missing the process through which goals themselves are jointly shaped. We introduce a goal-level attribution framework, CoTrace, that decomposes explicit goals into verifiable requirements and traces both direct contributions and indirect influences across dialogue turns. Applying CoTrace to 638 real-world collaboration logs, we find that while models account for only 11-26% of goal-shaping contribution, they contribute substantially more on introducing lower-level concrete requirements, and make various kinds of indirect contributions. Through controlled simulations, we show that interaction design choices significantly affect model goal-shaping behavior. In a user study, exposing participants to goal-level analyses shifts their perceived contributions by nearly 2 points on a 5-point scale, revealing systematic miscalibration in how users understand their own AI-assisted work.

URL PDF HTML ☆

赞 0 踩 0

2605.21182 2026-06-15 cs.CL cs.AI cs.CV 版本更新

Manga109-v2026: Revisiting Manga109 Annotations for Modern Manga Understanding

Manga109-v2026: 重新审视Manga109标注以适应现代漫画理解

Jeonghun Baek, Atsuyuki Miyai, Shota Onohara, Hikaru Ikuta, Kiyoharu Aizawa

发表机构 * University of Tokyo（东京大学）

AI总结本文重新审视Manga109的对话文本标注，识别出五类标注问题，包括转录错误、缺失文本区域、对话与拟声词重叠以及未分割的对话气泡，并通过结合OCR基于的问题检测和人工修订构建Manga109-v2026，修订了约29,000个对话标注，使Manga109更好地适应现代OCR和多模态漫画理解系统，同时保留漫画特有的表达结构。

Comments Accepted to the Culture x AI Workshop at ICML 2026. Project page: https://manga109.github.io/manga109-project-website/en/

详情

AI中文摘要

漫画是一种具有文化特色的多模态媒介，是日本流行文化中最具影响力的形态之一。随着AI系统越来越多地针对漫画理解、OCR和翻译进行研究，Manga109已成为漫画相关AI研究的基础数据集。然而，当前的Manga109数据集包含转录错误和粗略的标注，这与现代OCR和多模态漫画理解任务不匹配。在本工作中，我们重新审视Manga109的对话文本标注，识别出五类标注问题，包括转录错误、缺失文本区域、对话与拟声词重叠以及未分割的对话气泡。为了解决这些问题，我们结合基于OCR的问题检测和人工修订，构建了Manga109-v2026，修订了大约29,000个对话标注。我们的修订使Manga109更好地适应现代OCR和多模态漫画理解系统，同时保留了漫画特有的表达结构。

英文摘要

Manga is a culturally distinctive multimodal medium and one of the most influential forms of Japanese popular culture. As AI systems increasingly target manga understanding, OCR, and translation, Manga109 has become a foundational dataset for manga-related AI research. However, the current Manga109 dataset contains inaccurate transcriptions and coarse annotations, which do not align well with modern OCR and multimodal manga understanding tasks. In this work, we revisit the dialogue text annotations of Manga109 and identify five categories of annotation issues, including inaccurate transcriptions, missing text regions, overlapping dialogue and onomatopoeia, and under-segmented speech balloons. To address these issues, we combine OCR-based issue detection and manual revision to construct Manga109-v2026, revising approximately 29,000 dialogue annotations. Our revisions better align Manga109 with modern OCR and multimodal manga understanding systems while preserving expressive structures characteristic of manga.

URL PDF HTML ☆

赞 0 踩 0

2605.21006 2026-06-15 cs.AI cs.CL cs.LG 版本更新

Playing Devil's Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy

扮演魔鬼的代言人：现成的人格向量在顺从性上与针对性引导相媲美

Ishaan Kelkar, Nebras Alam, Vikram Kakaria, Madhur Panwar, Vasu Sharma, Maheep Chaudhary

发表机构 * University of Toronto（多伦多大学）； Princeton University（普林斯顿大学）； Purdue University（普渡大学）； EPFL（瑞士联邦理工学院）； Algoverse ； Independent（独立）

AI总结本文研究了不同人格对顺从性的影响，发现现成的人格引导向量在减少顺从性方面与针对性引导相当，且在用户正确时保持准确性。

Journal ref ICML, Pluralistic Alignment Workshop, 2026

详情

AI中文摘要

我们研究了不同人格对顺从性的影响：模型在用户错误时仍同意用户。标准缓解方法，对比激活添加（CAA），从顺从性和诚实响应的标记对中推导出引导方向。本研究评估了现成的人格引导向量是否能作为替代方案，这些向量最初是为一般角色扮演开发的，且未在顺从性数据上训练。在两个指令微调模型中，引导至以怀疑或审查为特征的人格可将顺从性减少到CAA效果的约68%和98%，且不同于CAA，在用户正确时保持准确性。效果也是不对称的：引导至顺从的人格不会产生镜像增加的顺从性。几何上，人格向量在激活空间的方向上与顺从性方向基本无关。总体而言，这些发现表明，顺从性应被视为人格层面的属性，而非单一可引导方向。我们在此发布代码：https://anonymous.4open.science/r/Sycophancy-Steering-9DF0/.

英文摘要

We study the effect of different persona on \textbf{sycophancy}: model's agreement with users even when the user is incorrect. The standard mitigation, Contrastive Activation Addition (CAA), derives a steering direction from labelled pairs of sycophantic and honest responses. This study evaluates whether off-the-shelf persona steering vectors, originally developed for general role-playing and not trained on sycophancy data, can serve as an alternative. In two instruction-tuned models, steering toward personas characterised by doubt or scrutiny reduces sycophancy to approximately $68\%$ and $98\%$ of CAA's effect, and, unlike CAA, maintains accuracy when the user is correct. The effect is also asymmetric: steering toward agreeable personas does not produce a mirror increase in sycophancy. Geometrically, the persona vector is largely independent of the direction of sycophancy in activation space. Collectively, these findings suggest that sycophancy is better understood as a persona-level property rather than a single steerable direction. We release our code here: https://anonymous.4open.science/r/Sycophancy-Steering-9DF0/.

URL PDF HTML ☆

赞 0 踩 0

2605.18848 2026-06-15 cs.LG cs.AI 版本更新

Exact Linear Attention

精确线性注意力

Weinuo Ou

发表机构 * GitHub

AI总结本文提出精确线性注意力（ELA），通过利用核函数的精确分解性质，实现Transformer注意力的线性计算复杂度，消除近似误差。针对先前线性注意力的两个关键限制——梯度爆炸和token注意力稀释，提出核约束以确保非负性、判别性和几何可解释性。此外，本文还提出了三种工程创新，包括Hyper-Link结构、Memory Lobe模块和基于路由分数的MoE偏置机制，实验结果表明ELA在解码速度和KV缓存内存使用上分别达到全注意力的6倍和75%的减少，同时保持或优于训练性能。

Comments 9 pages, 19 figures, journal

详情

AI中文摘要

本文介绍精确线性注意力（ELA），一种通过利用核函数的精确分解性质，实现Transformer注意力线性计算复杂度的机制，从而消除近似误差。我们识别并解决了先前线性注意力的两个关键限制——梯度爆炸和token注意力稀释——通过施加核约束，确保非负性、判别性和几何可解释性。提出了几种核函数，包括Hadamard Exp核、求和平方欧几里得距离核和减法平方欧几里得距离核，每种都针对特定的注意力行为进行了优化。除了核心注意力公式之外，本文还提出了三种工程创新：（1）Hyper-Link结构，用以替代传统残差连接以缓解梯度退化；（2）基于双向线性注意力的Memory Lobe模块，捕捉跨层的“转换流”以实现定性记忆和隐式强化学习范式；（3）基于路由分数的MoE偏置机制，以提高可解释性和语义对齐。实验结果表明，ELA在解码速度和KV缓存内存使用上分别达到全注意力的6倍和75%的减少，同时保持或优于训练性能。所提出的记忆模块加速了收敛并增强了泛化能力。此外，我们还将线性注意力原理扩展到视觉模型，得到YOLO-LAT，其在GPU推理速度和参数减少方面分别达到4.3倍和7.9倍，同时保持竞争性的检测精度。这些结果表明，精确线性注意力在扩展Transformer模型以处理超长序列和高效视觉任务方面具有广泛的应用前景。

英文摘要

This paper introduces Exact Linear Attention (ELA), a mechanism that achieves linear computational complexity for Transformer attention by exploiting the exact decomposition property of kernel functions, thereby eliminating approximation error. We identify and address two key limitations of prior linear attention -- gradient explosion and token attention dilution -- by imposing kernel constraints that ensure non-negativity, discriminability, and geometric interpretability. Several kernel functions are proposed, including the Hadamard Exp Kernel, Summation Squared Euclidean Distance Kernel, and Subtraction Squared Euclidean Distance Kernel, each tailored for specific attention behaviors. Beyond the core attention formulation, the paper presents three engineering innovations: (1) a Hyper-Link structure that replaces traditional residual connections to mitigate gradient degradation; (2) a Memory Lobe module based on bidirectional linear attention, which captures "transformation flow" across layers to implement qualitative memory and an implicit reinforcement learning paradigm; and (3) a routing-score-based bias mechanism for Mixture-of-Experts (MoE) to improve interpretability and semantic alignment. Experimental results demonstrate that ELA achieves up to 6x faster decoding speed and 75% reduction in KV cache memory usage compared to full attention, while maintaining comparable or superior training performance. The proposed memory module accelerates convergence and enhances generalization. Furthermore, we extend the linear attention principle to vision models, yielding YOLO-LAT, which attains up to 4.3x GPU inference speedup and 7.9x parameter reduction with competitive detection accuracy. These results underline the broad applicability of exact linear attention for scaling Transformer models to ultra-long sequences and efficient visual tasks.

URL PDF HTML ☆

赞 0 踩 0

2602.00593 2026-06-15 cs.CV cs.LG 版本更新

EmoMind：从人类大脑fMRI信号解码情感描述

Bilal A. Mohammed, Lin Gu, Ruogu Fang

发表机构 * Department of Biomedical Engineering（生物医学工程系）； Vanderbilt University（范德比大学）； Research Institute of Electrical Communication（电气通信研究所）； Tohoku University（东北大学）； University of Florida（佛罗里达大学）

AI总结本文提出EmoMind，首个端到端解码fMRI信号生成情感描述的系统，通过结合语义基础的中性场景描述和连续情感向量，实现了在内容保留与情感表达间的平衡，并在多个验证框架下优于基于标签提示的GPT-4。

详情

AI中文摘要

从大脑活动解码视觉经验已取得显著进展，但当前的脑-文本系统主要恢复语义内容而丢弃情感。此外，语言模型在接收到类别标签提示时可以生成情感文本，但此类标签将丰富的跨受试者变异性压缩成粗糙的离散类别。我们提出了EmoMind，首个端到端的解码情感描述的fMRI信号管道。EmoMind首先从解码的视觉特征中检索出语义基础的中性场景描述，然后使用从相同fMRI记录中解码的连续34维情感向量重写该描述。为了在内容保留和情感表达之间保持平衡，我们使用分类器自由指导训练重写器，以对抗一个保持身份的空分支，从而在语义忠实性和情感表达性之间实现平滑插值。我们通过涵盖受试者特异性、结构几何和因果控制的三轴验证框架评估情感描述生成。我们进一步用合成大脑替代测试增强此框架，以探测对测量设备的鲁棒性，并将每个轴与使用脑解码的前五名情感标签提示的GPT-4进行基准测试。在两个独立的情感fMRI数据集中，EmoMind在所有三个轴上均显著优于标签提示的GPT-4，其中最大的收益出现在需要个人特定情感结构而非群体层面情绪聚合的指标上。这些结果确立了连续脑解码情感作为个性化情感描述生成的可行控制信号，并为研究个体情感大脑组织开辟了新方向。

英文摘要

Decoding visual experience from brain activity has advanced substantially, but current brain-to-text systems largely recover semantic content while discarding affect. Additionally, language models can generate emotional text when prompted with categorical labels, but such labels collapse rich inter-subject variability into coarse discrete bins. We present EmoMind, the first end-to-end pipeline for decoding affective captions directly from fMRI signals. EmoMind first retrieves a semantically grounded neutral scene description from brain-decoded visual features, then rewrites it using a continuous 34-dimensional emotion vector decoded from the same fMRI recording. To control the balance between content preservation and affective expression, we train the rewriter with classifier-free guidance against an identity-preserving null branch, enabling smooth interpolation between semantic fidelity and affective expressivity. We evaluate affective caption generation with a three-axis validation framework spanning subject-specificity, structural geometry, and causal control. We further augment this framework with a synthetic-brain substitution test that probes robustness to the measurement apparatus, and we benchmark each axis against GPT-4 prompted with brain-decoded top-5 emotion labels as a strong discrete baseline. Across two independent emotion fMRI datasets, EmoMind significantly outperforms label-prompted GPT-4 on all three axes, with the largest gains on metrics that require person-specific affective structure rather than population-level emotion aggregation. These results establish continuous brain-decoded affect as a viable control signal for individualized affective caption generation and open new directions for studying individual affective brain organisation.

URL PDF HTML ☆

赞 0 踩 0

2605.14998 2026-06-15 cs.AI cs.SY eess.SY q-bio.QM 版本更新

Learning Developmental Scaffoldings to Guide Self-Organisation

学习发育支架以引导自组织

Milton L. Montero, Elias Najarro, Jakob Schauser, Sebastian Risi

发表机构 * IT University of Copenhagen（丹麦哥本哈根信息技术大学）； University of Copenhagen（丹麦哥本哈根大学）； Sakana AI

AI总结本文研究了通过学习自组织规则和预模式共同作用来提升发育过程的鲁棒性、编码能力和对称性打破。

Comments 8 pages + acknowledgements and references, 5 figures. Camera-ready version for ALife 2026

详情

AI中文摘要

从亚细胞结构到整个生物体，许多自然系统通过自组织生成复杂结构：局部相互作用共同产生全局结构，而无需任何结果的蓝图。然而，推动此类过程的大量信息并非由自组织本身产生，而是常常转移到系统的初始条件中。生物发育是一个典型例子，其中母体的预模式编码位置和对称性打破信息，从而引导自组织过程。从早期胚胎发育中的母体形态发生素梯度到组织水平的形态发生预模式指导器官形成，这种信息转移到初始条件的现象，类似于计算系统中的记忆-计算权衡，是发育过程的基本部分。在本文中，我们通过引入一个模型来研究这种信息转移现象，该模型同时学习自组织规则和预模式，允许其相互作用在受控条件下进行变化和测量：一个神经细胞自动机（NCA）配对一个学习基于坐标的模式生成器（SIREN），两者同时训练以生成一组模式。我们提供了信息论分析，探讨信息如何在预模式和自组织过程之间分布，并展示联合学习两者可提高鲁棒性、编码能力和对称性打破，相较于纯自组织替代方案。进一步分析表明，有效的预模式不简单地近似其目标；而是通过偏转发育动力学的方式促进收敛，指出了初始条件结构与自组织动力学之间非平凡的关系。

英文摘要

From subcellular structures to entire organisms, many natural systems generate complex organisation through self-organisation: local interactions that collectively give rise to global structure without any blueprint of the outcome. Yet a significant portion of the information driving such processes is not produced by self-organisation itself, instead, it is often offloaded to initial conditions of the system. Biological development is a prime example, where maternal pre-patterns encode positional and symmetry-breaking information that scaffolds the self-organising process. From maternal morphogen gradients in early embryogenesis to tissue-level morphogenetic pre-patterns guiding organ formation, this transfer of information to initial conditions, analogous to a memory-compute trade-off in computational systems, is a fundamental part of developmental processes. In this work, we study this offloading phenomenon by introducing a model that jointly learns both the self-organisation rules and the pre-patterns, allowing their interplay to be varied and measured under controlled conditions: a Neural Cellular Automaton (NCA) paired with a learned coordinate-based pattern generator (SIREN), both trained simultaneously to generate a set of patterns. We provide information-theoretic analyses of how information is distributed between pre-patterns and the self-organising process, and show that jointly learning both components yields improvements in robustness, encoding capacity, and symmetry breaking over purely self-organising alternatives. Our analysis further suggests that effective pre-patterns do not simply approximate their targets; rather, they bias the developmental dynamics in ways that facilitate convergence, pointing to a non-trivial relationship between the structure of initial conditions and the dynamics of self-organisation.

URL PDF HTML ☆

赞 0 踩 0

2605.11558 2026-06-15 cs.LG stat.ML 版本更新

A Composite Activation Function for Learning Stable Binary Representations

一种用于学习稳定二进制表示的复合激活函数

Seokhun Park, Choeun Kim, Kwanho Lee, Sehyun Park, Insung Kong, Yongdai Kim

发表机构 * Department of Statistics（统计学系）； Seoul National University（首尔国立大学）； Department of Applied Mathematics（应用数学系）； University of Twente（埃因霍温理工大学）

AI总结本文提出HTAF复合激活函数，通过平滑近似Heaviside函数实现稳定训练，适用于Spiking神经网络等模型，并引入ICBMs模型实现可解释的图像处理。

Comments 32 pages

详情

AI中文摘要

激活函数在神经网络中通过塑造内部表示起核心作用。最近，学习二进制激活表示因其在计算和内存效率以及可解释性方面的优势而受到广泛关注。然而，使用Heaviside激活函数训练神经网络仍具挑战性，因其非可导性阻碍了标准梯度优化。本文提出Heavy Tailed Activation Function (HTAF)，一种Heaviside函数的平滑近似，使基于梯度的优化能够稳定训练。我们构造HTAF为sigmoid双曲正切复合函数，并理论证明其在零输入附近保持大梯度质量，同时在尾部区域表现出更慢的梯度衰减。我们展示Spiking神经网络、二进制神经网络和深度Heaviside神经网络可以使用HTAF稳定训练。最后，我们引入隐式概念瓶颈模型（ICBMs），一种利用HTAF诱导离散特征表示的可解释图像模型。在各种架构和图像数据集上的广泛实验表明，ICBMs能够稳定地实现离散化，同时预测性能与标准模型相当或更好。

英文摘要

Activation functions play a central role in neural networks by shaping internal representations. Recently, learning binary activation representations has attracted significant attention due to their advantages in computational and memory efficiency, as well as interpretability. However, training neural networks with Heaviside activations remains challenging, as their non-differentiability obstructs standard gradient-based optimization. In this paper, we propose Heavy Tailed Activation Function (HTAF), a smooth approximation to the Heaviside function that enables stable training with gradient-based optimization. We construct HTAF as a sigmoid hyperbolic tangent composite function and theoretically show that it maintains a large gradient mass around zero inputs while exhibiting slower gradient decay in the tail regions. We show that Spiking Neural Networks, Binary Neural Networks and Deep Heaviside neural Networks can be trained stably using HTAF with gradient-based optimization. Finally, we introduce Implicit Concept Bottleneck Models (ICBMs), an interpretable image model that leverages HTAF to induce discrete feature representations. Extensive experiments across various architectures and image datasets demonstrate that ICBM enables stable discretization while achieving prediction performance comparable to or better than standard models.

URL PDF HTML ☆

赞 0 踩 0

2605.11378 2026-06-15 cs.CL 版本更新

An Empirical Study of Automating Agent Evaluation

自动化代理评估的实证研究

Kang Zhou, Sangmin Woo, Haibo Ding, Kiran Ramnath, Subramanian Chidambaram, Aosong Feng, Vinayak Arannil, Muhyun Kim, Ishan Singh, Darren Wang, Zhichao Xu, Megha Gandhi, Nirmal Prabhu, Soumya Smruti Mishra, Vivek Singh, Gouri Pandeshwar, Lin Lee Cheong

发表机构 * AWS AI Labs（AWS人工智能实验室）

AI总结本文研究了自动化代理评估的可行性，提出EvalAgent系统，通过编码技能和领域知识提升评估效率，实验显示其在评估准确性和人类偏好上显著优于基线方法。

详情

AI中文摘要

代理评估需要评估复杂的多步骤行为，涉及工具使用和中间推理，这使其成本高昂且需要专业知识。一个自然的问题是：前沿编码助手能否可靠地自动化这一评估过程？我们的研究表明，仅仅提示编码助手是不够的。没有领域特定的评估知识，前沿编码助手仅能达到30%的执行成功率，并产生平均每个代理12+个指标的过度工程化评估，表明强大的编码能力并不自动转化为可靠的代理评估能力。我们引入EvalAgent，一种自动化端到端代理评估流程的AI助手。EvalAgent将评估领域专业知识编码为评估技能（程序指令、可重用代码和模板、以及动态检索的API文档），这些技能组成基于跟踪的流程，生成完整的评估成果，包括指标、可执行代码和报告。为了系统评估生成的评估，我们引入了一个元评估框架和AgentEvalBench基准，该基准包含20个代理，每个代理配对评估要求和测试场景。我们进一步提出了Eval@1指标，以衡量生成的评估代码是否在首次运行时既执行又产生有意义的结果。我们的实验显示，EvalAgent生成的评估更加聚焦，将Eval@1从17.5%提升到65%，并在人类专家偏好上达到79.5%的优势。进一步的消融研究显示，评估技能对于处理复杂评估至关重要：移除它们会使Eval@1显著从65%降至30%。

英文摘要

Agent evaluation requires assessing complex multi-step behaviors involving tool use and intermediate reasoning, making it costly and expertise-intensive. A natural question arises: can frontier coding assistants reliably automate this evaluation process? Our study shows that simply prompting coding assistants is insufficient for this task. Without domain-specific evaluation knowledge, frontier coding assistants achieve only a 30% execution success rate and produce over-engineered evaluations averaging 12+ metrics per agent, indicating that strong coding ability does not automatically translate to reliable agent evaluation. We introduce EvalAgent, an AI assistant that automates the end-to-end agent evaluation pipeline. EvalAgent encodes evaluation domain expertise as evaluation skills (procedural instructions, reusable code and templates, and dynamically retrieved API documentation) that compose into a trace-based pipeline producing complete evaluation artifacts including metrics, executable code, and reports. To systematically assess generated evaluations, we introduce a meta-evaluation framework alongside AgentEvalBench, a benchmark comprising 20 agents, each paired with evaluation requirements and test scenarios. We further propose the Eval@1 metric to measure whether generated evaluation code both executes and yields meaningful results on the first run. Our experiments show that EvalAgent produces focused evaluations, improving Eval@1 from 17.5% to 65%, and achieving 79.5% human expert preference over baseline approaches. Further ablation studies show that evaluation skills are critical for handling complex evaluation: removing them causes Eval@1 to drop significantly from 65% to 30%.

URL PDF HTML ☆

赞 0 踩 0

2602.23638 2026-06-15 cs.LG cs.AI 版本更新

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

FedRot-LoRA: 缓解联邦LoRA中的旋转偏移

Haoran Zhang, Dongjun Kim, Seohyeon Cha, Haris Vikalo

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结本文提出FedRot-LoRA框架，通过正交变换对齐客户端更新以减少子空间不匹配，提升联邦LoRA在异质数据下的性能。

Comments ICML 2026

详情

AI中文摘要

联邦LoRA提供了一种高效的通信机制用于在去中心化数据上微调大语言模型。然而，因子加权平均与数学上正确的本地更新聚合之间的不一致会导致显著的聚合误差和不稳定的训练。本文认为，主要问题是由于低秩因子化旋转不变性导致的旋转偏移，即不同客户端的潜在子空间中，语义等价的更新可以以不同的形式表示。当这些不一致的因子直接平均时，会产生破坏性干扰，降低全局更新质量。为此，本文提出FedRot-LoRA框架，在聚合前通过正交变换对齐客户端更新，从而在不增加通信成本或限制模型表达能力的情况下，保持语义更新并减少跨客户端子空间不匹配。本文提供了收敛性分析，研究了因子加权平均引起的聚合误差，并展示了旋转对齐如何提供更紧的误差上界。在自然语言理解和生成任务上的广泛实验表明，FedRot-LoRA在各种异质性和LoRA秩水平下均优于现有联邦LoRA基线。

英文摘要

Federated LoRA provides a communication-efficient mechanism for fine-tuning large language models on decentralized data. In practice, however, a discrepancy between the factor-wise averaging used to preserve low rank and the mathematically correct aggregation of local updates can cause significant aggregation error and unstable training. We argue that a major source of this problem is rotational misalignment, arising from the rotational invariance of low-rank factorizations -- semantically equivalent updates can be represented in different latent subspaces across clients since $(B_i R_i)(R_i^\top A_i) = B_i A_i$. When such misaligned factors are averaged directly, they interfere destructively and degrade the global update. To address this issue, we propose FedRot-LoRA, a federated LoRA framework that aligns client updates via orthogonal transformations prior to aggregation. This alignment preserves the semantic update while reducing cross-client subspace mismatch, without increasing communication cost or restricting model expressivity. We provide a convergence analysis that examines the aggregation error induced by factor-wise averaging and shows how rotational alignment yields a tighter upper bound on this error. Extensive experiments on natural language understanding and generative tasks demonstrate that FedRot-LoRA consistently outperforms existing federated LoRA baselines across a range of heterogeneity levels and LoRA ranks.

URL PDF HTML ☆

赞 0 踩 0

2605.09420 2026-06-15 cs.CV cs.AI cs.MM 版本更新

Relational Retrieval: Leveraging Known-Novel Interactions for Generalized Category Discovery

关系检索：利用已知-新颖相互作用进行通用类别发现

Yulin Xu, Chunqi Guo, Yuanzhen Shuai, Jianyuan Ni

发表机构 * University of California, Irvine（加州大学尔湾分校）； Sichuan Agricultural University（四川农业大学）； University College London（伦敦大学学院）； Juniata College（朱尼ata学院）

AI总结本文通过关系检索视角解决通用类别发现问题，提出关系模式一致性方法，通过双向知识转移增强已知类别和新类别发现，实验表明在通用和细粒度基准上均取得最佳性能。

Comments Accepted by ICMR 2026 (Oral)

详情

DOI: 10.1145/3805622.3810732

AI中文摘要

在本研究中，我们通过关系检索视角解决通用类别发现（GCD）问题，通过双向知识转移显式连接标记和未标记数据。尽管现有方法将这些来源分开处理，错过了有价值的作用机会，我们提出关系模式一致性（RPC），使两者相互增强。RPC使用一对一分类器进行软ID/OOD分解，然后引入两种机制：（i）为已知类别保留，我们转移语义行为对齐；（ii）为类别发现，我们利用样本来自同一类别与已知类别原型保持不变的关系的洞察，将不可靠的伪标签转化为明确的关系模式匹配。这种双向设计使标记数据指导未标记学习，同时通过它们的集体关系签名发现新类别。广泛的实验表明，RPC在通用和细粒度基准上均取得最佳性能。

英文摘要

In this study, we tackle Generalized Category Discovery (GCD) via a Relational Retrieval perspective, explicitly coupling labeled and unlabeled data through bidirectional knowledge transfer. While existing methods treat these sources separately, missing valuable interaction opportunities, we propose Relational Pattern Consistency (RPC) that enables mutual enhancement. RPC employs One-vs-All classifiers for soft ID/OOD decomposition, then introduces two mechanisms: (i) for known-class preservation, we transfer semantic behavioral alignment; (ii) for category discovery, we leverage the insight that samples from the same category maintain invariant relationships with known-class prototypes, transforming unreliable pseudo-labeling into well-defined relational pattern matching. This bidirectional design allows labeled data to guide unlabeled learning while discovering novel categories through their collective relational signatures. Extensive experiments demonstrate RPC achieves state-of-the-art performance on both generic and fine-grained benchmarks.

URL PDF HTML ☆

赞 0 踩 0

2604.17892 2026-06-15 cs.LG cs.AI 版本更新

OLaPh: 最优语言音素化器

Johannes Wirth

发表机构 * Institute for Information Systems at Hof University of Applied Sciences（霍夫应用科学大学信息学院）

AI总结提出OLaPh混合框架，结合多语言词典、NLP技术和统计子词分割，在WikiPron基准上显著优于基线，并通过LLM合成语料探索神经泛化能力。

Comments 12 pages, 1 figure, 4 tables

详情

AI中文摘要

音素化是文本到语音合成中的关键组成部分。传统方法依赖于确定性转换和词典，而神经方法在词汇外（OOV）术语上具有更高的泛化潜力。我们提出了OLaPh（最优语言音素化器），一个混合框架，将广泛的多语言词典与先进的NLP技术和统计子词分割功能相结合。在WikiPron基准上的评估表明，OLaPh在整体准确性上显著优于已建立的基线，并通过高级回退机制在OOV数据上保持鲁棒性。为了进一步探索神经泛化，我们利用该框架为指令调优的大语言模型（LLM）合成高一致性训练语料。虽然确定性框架总体上仍然更准确，但LLM表现出强大的泛化能力，匹配或部分超过了框架的性能。这表明LLM成功地从合成数据中内化了超越框架能力的语音直觉。这些工具共同为多语言字素到音素转换（G2P）研究提供了全面的开源资源。

英文摘要

Phonemization is a critical component in text-to-speech synthesis. Traditional approaches rely on deterministic transformations and lexica, while neural methods offer potential for higher generalization on out-of-vocabulary (OOV) terms. We introduce OLaPh (Optimal Language Phonemizer), a hybrid framework that integrates extensive multilingual lexica with advanced NLP techniques and a statistical subword segmentation function. Evaluations on the WikiPron benchmark show OLaPh significantly outperforms established baselines in overall accuracy and maintains robustness on OOV data through advanced fallback mechanisms. To further explore neural generalization, we utilize the framework to synthesize a high-consistency training corpus for an instruction-tuned Large Language Model (LLM). While the deterministic framework remains more accurate overall, the LLM demonstrates strong generalization, matching or partly exceeding the framework's performance. This suggests that the LLM successfully internalized phonetic intuitions from the synthetic data that transcend the framework's capabilities. Together, these tools provide a comprehensive, open-source resource for multilingual grapheme-to-phoneme conversion (G2P) research.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

Time Series Causal Discovery via Context-Conditioned and Causality-Augmented Pretraining

Rotation-Invariant Spherical Watermarking via Third-Order SO(3) Representation Coupling

ParkourFormer: Integrating Predictive Supervision and Sequence Modeling into Parkour Locomotion

VikingMem: A Memory Base Management System for Stateful LLM-based Applications

Traditional machine learning vs. deep learning from dynamic graph representations of proteins' 3D folds in the task of protein structure classification

Hierarchical Consistency Learning for Test-time Adaptation in Camouflage Perception

Knowing When to Quit: A Principled Framework for Dynamic Abstention in LLM Reasoning

Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

MirrorCheck: Efficient Adversarial Defense for Vision-Language Models

Micro-Swarm Locomotion Optimization in Dynamic Flow using Multi-Objective Multi-Agent Reinforcement Learning

Rendering-Aware Sparse Sampling for BRDF Acquisition

Token-Level LLM Collaboration via FusionRoute

Stream3D: Sequential Multi-View 3D Generation via Evidential Memory

"I Didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration

Manga109-v2026: Revisiting Manga109 Annotations for Modern Manga Understanding

Playing Devil's Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy

Exact Linear Attention

Pix2Fact: When Vision Is Not Enough -- Benchmarking Fine-Grained VQA with Web Verification on High-Resolution Real-World Scenes

Learning Variable-Length Tokenization for Generative Recommendation

UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities

EmoMind: Decoding Affective Captions from Human Brain fMRI

Learning Developmental Scaffoldings to Guide Self-Organisation

A Composite Activation Function for Learning Stable Binary Representations

An Empirical Study of Automating Agent Evaluation

FedRot-LoRA: Mitigating Rotational Misalignment in Federated LoRA

Relational Retrieval: Leveraging Known-Novel Interactions for Generalized Category Discovery

LEPO: Latent Reasoning Policy Optimization for Large Language Models

SAFformer:Improving Spiking Transformer via Active Predictive Filtering

Where's the Plan? Locating Latent Planning in Language Models with Lightweight Mechanistic Interventions

OLaPh: Optimal Language Phonemizer