arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2251
专题追踪
2605.27595 2026-05-28 cs.CV cs.AI

Hallucination Behavior in Multimodal LLMs Across Agricultural Image Interpretation and Generation Tasks

多模态大语言模型在农业图像解释与生成任务中的幻觉行为

Partho Ghose, Al Bashir, Prem Raj, Azlan Zahid

发表机构 * Texas A&M University System(德克萨斯大学系统)

AI总结 本研究系统评估了多模态大语言模型在农业图像解释(图像到文本)和生成(文本到图像)任务中的幻觉行为,发现模型存在生物不一致、上下文不准确和农学不合理等错误模式,并通过少样本提示等方法分析了幻觉的残留影响。

详情
AI中文摘要

大型语言模型(LLMs)正迅速被应用于农业成像领域,从作物解释到合成田间图像生成。然而,这些模型经常表现出看似自信但偏离生物或环境现实的幻觉输出,可能导致错误的农学见解。本研究从两个互补方向调查此类幻觉:图像到文本,即LLMs解释作物或田间图像以描述生物和非生物胁迫等条件;以及文本到图像,即模型基于描述性提示生成合成农业场景。我们检查涉及生物不一致、上下文不准确和农学不合理的错误,并在多个成像模态下根据领域知情标准评估输出。我们的分析识别了解释性和生成性任务中反复出现的幻觉模式。在图像解释中,LLMs(例如Gemma、LLAVA、Qwen和MiniCPM)实现了适度的零样本准确率(63%至75%),而少样本提示将性能提升至高达86.8%,但仍表现出虚假检测和漏检感染,表明存在残留幻觉效应。在文本到图像任务中,高级模型如GPT-5和Gemini 2.5 Flash在宽松提示约束下生成高达91%的生物不一致场景,揭示了当前LLMs的根本弱点。这种对视觉推理和生成的系统评估为增强基于LLM的农业成像平台的可靠性和可信度提供了关键见解。

英文摘要

Large Language Models (LLMs) are being rapidly adopted in agricultural imaging applications, ranging from crop interpretation to synthetic field image generation. However, these models frequently exhibit hallucinations outputs that appear confident yet deviate from biological or environmental reality potentially leading to misinformed agronomic insights. This study investigates such hallucinations in two complementary directions: image-to-text, where LLMs interpret crop or field imagery to describe conditions such as biotic and abiotic stresses, and text-to-image, where models generate synthetic agricultural scenes based on descriptive prompts. We examine errors involving biological inconsistency, contextual inaccuracy, and agronomic implausibility, evaluating the outputs under domain-informed criteria across multiple imaging modalities. Our analysis identifies recurring hallucination patterns within both interpretive and generative tasks. In image interpretation, LLMs (e.g., Gemma, LLAVA, Qwen, and MiniCPM) achieved modest zero-shot accuracy (63 to 75 percent), whereas few-shot prompting improved performance up to 86.8 percent, exhibiting false detections and missed infections, indicating residual hallucination effects. In text-to-image tasks, advanced models such as GPT-5 and Gemini 2.5 Flash generate up to 91 percent biologically inconsistent scenes under relaxed prompt constraints, revealing fundamental weaknesses in current LLMs. This systematic assessment of visual reasoning and generation offers critical insights toward enhancing the reliability and trustworthiness of LLM-based agricultural imaging platforms.

2605.27593 2026-05-28 cs.AI cs.MA

Voluntary Collusion with Secret Tools in Competing LLM Agents

竞争性LLM代理中使用秘密工具的合谋行为

Xijie Zeng, Frank Rudzicz

发表机构 * Dalhousie University(达尔豪斯大学) Vector Institute for Artificial Intelligence(人工智能向量研究所)

AI总结 本研究通过两个多智能体环境(Liar's Bar和Cleanup)发现,即使工具被明确标注为不公平且有害,大多数LLM代理仍会自愿采用秘密合谋工具以获取战略优势,且仅靠对齐或公平标签无法有效阻止,需明确防护措施。

详情
AI中文摘要

即使工具被明确描述为对他人不公平且有害,表面上经过安全对齐的LLM代理仍然会在这样做能带来战略优势时自愿参与秘密合谋。为了研究这一现象,我们引入了一个基于两个战略多智能体环境的实证框架:Liar's Bar(一个竞争性欺骗场景)和Cleanup(一个混合动机资源管理场景),其中代理被提供秘密合谋工具,这些工具在明显不利于其他代理的同时提供了显著优势。在12个模型(7B、70B和专有规模)和6种提示变体中,我们发现大多数代理一致地接受这些工具并制定合谋策略,同时在接受前明确承认工具的不公平性。我们进一步表明,无论是公平标签还是基线对齐都无法可靠地阻止合谋:只有明确的伦理框架能减少采用,即使如此,较小的模型仍然容易受到影响。更广泛地说,我们的工作首次系统性地研究了基于LLM的多智能体系统中自愿合谋采用的问题,并表明防止此类行为需要明确的防护措施,而非依赖通用对齐。

英文摘要

Even when a tool is explicitly described as unfair and harmful to others, ostensibly safety-aligned LLM agents still voluntarily engage in secret collusion whenever doing so confers a strategic advantage. To investigate this phenomenon, we introduce an empirical framework built on two strategic multi-agent environments: Liar's Bar, a competitive deception scenario, and Cleanup, a mixed-motive resource-management scenario, in which agents are offered secret collusion tools that provide significant advantages while clearly disadvantaging the other agents. Across 12 models (at the 7B, 70B, and proprietary scales) and 6 prompt variants, we find that most agents consistently accept these tools and develop collusive strategies, while explicitly acknowledging the unfairness of the tools before accepting. We further show that neither the unfairness labels nor baseline alignment alone reliably deters collusion: only explicit ethical framing reduces adoption and, even then, smaller models remain susceptible. More broadly, our work presents the first systematic investigation of voluntary collusion adoption in LLM-based multi-agent systems, and suggests that preventing such behaviour requires explicit safeguards rather than reliance on general alignment.

2605.27591 2026-05-28 cs.LG

Gradient Transformer: Learning to Generate Updates for LLMs

梯度变换器:学习为大语言模型生成更新

Binh-Nguyen Nguyen, Khang Tran, NhatHai Phan, Issa Khalil

发表机构 * Department of Data Science, New Jersey Institute of Technology, Newark, NJ, USA(数据科学系,新泽西理工学院,新泽西州诺克斯维尔) Qatar Computing Research Institute, HBKU, Doha, Qatar(卡塔尔计算研究所,HBKU,多哈)

AI总结 提出一种无数据知识蒸馏框架,利用梯度变换器将微调后小语言模型的更新向量转换为大语言模型的更新向量,实现无需私有数据即可更新大模型。

Comments Accepted at ICML 2026

详情
AI中文摘要

许多组织缺乏计算资源在私有(不可共享)数据上微调大语言模型(LLM)以获得更好的效用,而单独微调小语言模型(TinyLM)效果不佳。为解决这一瓶颈,我们提出一种无数据知识蒸馏框架,该框架基于在私有数据上微调的TinyLM生成LLM更新向量。更新向量是从初始模型到其在数据集上微调版本的参数变化向量,捕捉微调过程中累积梯度步骤的效果。我们框架的关键思想是一种新颖的梯度变换器(Gradient Transformer),它将TinyLM的更新向量转换为LLM的更新向量。正如从影子数据集中推导出的,Grad-Transformer捕捉了TinyLM和LLM更新向量之间的相关性,使得第三方提供商能够在给定组织的TinyLM更新向量的情况下生成LLM更新向量,而无需访问组织的私有数据。该框架支持多组织协作以共同更新LLM,提高了性能和成本效率。在语言建模和推理任务上的大量实验表明,即使在严格的差分隐私保护下,Grad-Transformer也显著优于最先进的知识蒸馏基线。

英文摘要

Many organizations lack computational resources to fine-tune large language models (LLMs) on private (unshareable) data for better utility, while fine-tuning tiny language models (TinyLMs) alone performs poorly. To address this bottleneck, we propose a data-free knowledge distillation framework that generates LLM update vectors based on TinyLMs fine-tuned on private data. An update vector is a vector of parameter changes from an initial model to its fine-tuned version on a dataset, capturing the effect of cumulative gradient steps during fine-tuning. The key idea of our framework is a novel Gradient Transformer that transforms TinyLM's update vectors into LLM's update vectors. As derived from shadow datasets, Grad-Transformer captures the correlation between TinyLM and LLM update vectors, enabling third-party providers to generate LLM update vectors given the organization's TinyLM update vectors without accessing the organization's private data. The framework supports multi-organization collaboration to jointly update LLMs, improving performance and cost-efficiency. Extensive experiments across language modeling and reasoning tasks show that Grad-Transformer remarkably outperforms state-of-the-art knowledge distillation baselines, even under strict differential privacy protection.

2605.27589 2026-05-28 cs.CV

What-If World: A Causal Benchmark for General World Models in Embodied Scenarios

What-If World: 具身场景中通用世界模型的因果基准

Kunlin Cai, Rui Song, Jinghuai Zhang, Kaiyuan Zhang, Pranav Bodapati, Alicia Yu, Fnu Suya, Mohammad Rostami, Jiaqi Ma, Yuan Tian

发表机构 * UCLA(加州大学洛杉矶分校) University of Tennessee(田纳西大学) Amazon(亚马逊)

AI总结 提出 What-If World 基准,通过成对提示测试视频生成模型在物理变化下的因果一致性,发现现有模型在因果干预上表现不佳。

Comments 38 pages, World Model Benchmark

详情
AI中文摘要

视频生成模型越来越多地被用作世界模拟器,用于驾驶和机器人操作等任务。在这些场景中,重要的不是单个视频看起来是否正确,而是模型的输出在输入变化时是否随之变化。我们通过给模型两个描述同一场景但一个物理细节不同的提示,并检查两个视频是否按照物理预测的方式产生差异来测试这一点。提示之间的措辞差异在设计上很小,因为只改变了一个变量,但正确的物理差异并不小。忽略这一点的模型仍然可以生成两个各自看起来合理的视频,而现有基准一次只评分一个视频,无法检测到这种失败。我们引入了 What-If World,包含 319 个这样的提示对,基于 nuScenes 和 DROID 的真实帧构建,并按驾驶和操作中共享的六个物理变量的分类法组织。每个对使用 APEO 评分,这是一个包含四个部分的评分标准,检查每个视频是否遵循其提示(遵循性)、物理上一致(物理性)、保持共享场景(环境性)以及最终产生正确的差异(结果性)。在九个最先进的模型中,没有系统在配对得分上超过 52%,开源模型集中在 28% 附近。每个测试的模型在大量因果干预上失败,表明这些模型在能够可靠支持动作条件模拟或基于模型的规划之前还有很大差距。在模型得分较高的地方,性能似乎与干预的视觉显著性相关,而不是其底层物理的可处理性。一些视觉上微妙的干预得分低至 14.2%,而视觉上显著的干预得分达到 40.4%。

英文摘要

Video generation models are increasingly used as world simulators for tasks like driving and robotic manipulation. What matters in these settings is not whether a single video looks right, but whether the model's output changes when its input changes. We test this by giving a model two prompts describing the same scene with one physical detail varied, and checking whether the two videos diverge the way physics predicts. The wording difference between the prompts is small by design, since only one variable is changed, but the correct physical difference is not. A model that misses this can still produce two videos that each look plausible individually, and existing benchmarks score videos one at a time and cannot detect this failure. We introduce What-If World, 319 such prompt pairs built on real frames from nuScenes and DROID, organized by a taxonomy of six physical variables shared across driving and manipulation. Each pair is scored with APEO, a four-part rubric checking whether each video follows its prompt (Adherence), is physically consistent (Physics), preserves the shared scene (Environment), and ends in the correct difference (Outcome). Across nine state-of-the-art models, no system exceeds 52% on the paired score, and open-source models cluster near 28%. Every model tested fails on a large fraction of causal interventions, indicating substantial room before these models can reliably support action-conditioned simulation or model-based planning. Where models do score well, performance appears to track the visual prominence of the intervention rather than the tractability of its underlying physics. Some visually subtle interventions score as low as 14.2%, while visually pronounced ones reach 40.4%.

2605.27584 2026-05-28 cs.AI cs.SI

Cyberbullying Governance on Social Media: A Unified Framework from Content Identification to Intervention

社交媒体上的网络暴力治理:从内容识别到干预的统一框架

Yiting Huang, Wenting Zhu, Zekun Wang, Qingpo Yang, Yakai Chen, Zihui Xu, Yueyue Zhang, Sanchuan Guo, Xi Zhang

发表机构 * School of Cyberspace Security, Beijing University of Posts and Telecommunications(北京邮电大学网络安全学院)

AI总结 本文提出一个涵盖内容识别、用户行为建模、扩散动态与早期预警、干预治理四阶段的统一全生命周期治理框架,以解决网络暴力被动、孤立检测的局限,实现主动、持续、综合的治理。

详情
AI中文摘要

社交媒体平台和在线社区的激增无意中催化了网络暴力、仇恨言论和其他形式的在线毒性传播,使得有效治理此类危害成为关键的社会和计算挑战。尽管在自动化内容审核方面取得了显著进展,但现有研究主要将网络暴力治理视为被动、孤立的帖子级检测。这种还原论观点忽视了用户持续的行为动态、毒性事件的结构性扩散以及主动缓解的关键需求。为弥补这些差距,本文提出一个统一的全生命周期治理框架,将网络暴力治理的范式从孤立的静态检测转向集成、持续和主动的审核。借鉴网络暴力研究及相邻领域,我们系统地综合了四个相互关联阶段的最新文献:(1)内容识别,(2)用户与行为建模,(3)扩散动态与早期预警,以及(4)干预与治理。此外,我们回顾了可用的数据集和评估实践,并讨论了新兴挑战,包括多模态性、可解释性、算法公平性以及生成式AI的双重使用风险,为未来研究提供了路线图,以构建更安全、更具韧性的数字生态系统。

英文摘要

The proliferation of social media platforms and online communities has inadvertently catalyzed the spread of cyberbullying, hate speech, and other forms of online toxicity, making the effective governance of such harm a critical societal and computational challenge. While significant strides have been made in automating content moderation, existing research predominantly treats cyberbullying governance as passive, isolated detection at the post level. This reductionist view overlooks the continuous behavioral dynamics of users, the structural diffusion of toxic events, and the critical need for proactive mitigation. To bridge these gaps, this paper proposes a unified full-lifecycle governance framework that shifts the paradigm of cyberbullying governance from isolated static detection toward integrated, continuous, and proactive moderation. Drawing on cyberbullying research and adjacent fields, we systematically synthesize the state-of-the-art literature across four interconnected stages: (1) Content Identification, (2) User and Behavior Modeling, (3) Diffusion Dynamics and Early Warning, and (4) Intervention and Governance. Furthermore, we review available datasets and evaluation practices, and discuss emerging challenges including multimodality, explainability, algorithmic fairness, and the dual-use risks of generative AI, providing a roadmap for future research toward a safer and more resilient digital ecosystem.

2605.27583 2026-05-28 cs.LG

Information-theoretic Multimodal Representation Learning for Electrocardiogram Signals

基于信息论的心电图信号多模态表示学习

Phu X. Nguyen, Konstantinos Kontras, Wei Dai, Huy Phan, Christos Chatzichristos, Paul Pu Liang, Bert Vandenberk, Maarten De Vos

发表机构 * KU Leuven(库勒芬大学) University Hospitals Leuven(鲁文大学医院) MIT(麻省理工学院) DFKI(德国达姆施塔特研究所)

AI总结 提出MERIT框架,通过信息论视角结合掩码心电图建模与心电图-文本对比对齐,学习保留信号结构并整合临床语义的心电图表示,在分类、零样本和文本生成任务中取得一致提升。

详情
AI中文摘要

心电图(ECG)是一种广泛使用的非侵入性心脏活动测量手段,在临床诊断中起着核心作用。最近的多模态方法将心电图信号与临床报告对齐以融入诊断语义,但临床报告通常无法保留心电图波形的丰富生理结构,特别是在从粗粒度诊断类别到细粒度形态的多个抽象层次上。为解决这一局限,我们从信息论角度构建心电图表示学习,并推导出一个可处理的目标函数,该函数同时保留信号结构并整合临床语义。基于这一原理,我们提出了MERIT(基于信息论的多模态心电图表示),一个双分支预训练框架,结合了掩码心电图建模与心电图-文本对比对齐。在PTB-XL及其他基准上的大量实验表明,该方法相较于先前方法取得了一致改进,包括在PTB-XL All上F1提升超过3%,在SubClass分类上F1提升超过5%。在零样本评估中,MERIT在PTB-XL SubClass上进一步将性能提升了高达+2.66%的AUC和+2.11%的F1,同时在多种分布偏移设置下展现出鲁棒性。此外,利用学习到的心电图表示进行基于心电图条件的大语言模型临床文本生成,在ROUGE和METEOR等多个指标上提升了文本质量。这些结果共同表明,MERIT学习了更具信息量和临床意义的心电图表示,尤其适用于细粒度临床应用。

英文摘要

Electrocardiograms (ECGs) are widely used non-invasive measurements of cardiac activity and play a central role in clinical diagnosis. Recent multimodal approaches align ECG signals with clinical reports to incorporate diagnostic semantics, but clinical reports often fail to preserve the rich physiological structure of ECG waveforms, particularly across multiple levels of abstraction ranging from coarse diagnostic categories to fine-grained morphology. To address this limitation, we formulate ECG representation learning from an information-theoretic perspective and derive a tractable objective that jointly preserves signal structure and integrates clinical semantics. Based on this principle, we propose \textbf{MERIT} (Multimodal ECG Representation via Information Theory), a dual-branch pretraining framework combining masked ECG modeling with ECG--text contrastive alignment. Extensive experiments on PTB-XL and additional benchmarks demonstrate consistent improvements over prior methods, including gains exceeding $3%$ F1 on PTB-XL All and $5%$ F1 on SubClass classification. In zero-shot evaluation, MERIT further improves performance by up to $ +2.66\%$ AUC and $ +2.11\%$ F1 on PTB-XL SubClass, while also demonstrating robustness under multiple distribution-shift settings. Moreover, leveraging the learned ECG representations for ECG-conditioned clinical text generation with large language models improves text quality across several metrics, including ROUGE and METEOR. Together, these results demonstrate that MERIT learns more informative and clinically meaningful ECG representations, particularly for fine-grained clinical applications.

2605.27582 2026-05-28 cs.RO cs.CV

Uni-LaViRA: Language-Vision-Robot Actions Translation for Unified Embodied Navigation

Uni-LaViRA:面向统一具身导航的语言-视觉-机器人动作翻译

Hongyu Ding, Sizhuo Zhang, Ziming Xu, Jinwen Guo, Hongxiu Liu, Xingzhi Cheng, Zixuan Chen, Haifei Qi, Duo Wang, Hao Xu, Jieqi Shi, Yifan Zhang, Jing Huo, Jian Cheng, Yang Gao, Jiebo Luo

发表机构 * Nanjing University(南京大学) Beihang University(北京航空航天大学) Institute of Automation, Chinese Academy of Sciences(中国科学院自动化研究所) BMW (Nanjing) Information Technology Co., Ltd.(宝马(南京)信息技术有限公司) University of Rochester(罗切斯特大学)

AI总结 提出Uni-LaViRA统一智能体架构,通过语言-视觉-机器人动作翻译结构,结合待办列表记忆和二次机会回溯机制,在零训练下实现四类导航任务和四种真实机器人的零样本泛化,性能匹配或超越近期训练式导航基础模型。

Comments Project page: https://xetroubadour.github.io/Uni-LaViRA/

详情
AI中文摘要

具身导航要求智能体将语言和视觉观测映射为一系列空间动作,驱动真实机器人在未见环境中移动。主流方法是在不断增大的机器人轨迹数据集上扩展视觉-语言-动作(VLA)基础模型。本文认为,对于导航而言,通用性可以通过结构获得,而不仅仅依赖数据规模。导航的底层决策结构可简化为单一的语言-视觉-机器人动作翻译。语言动作发出语义级方向指令,视觉动作发出像素级视觉目标。这两个输出都位于预训练多模态大语言模型(MLLM)的自然输出流形内,因此任务可以由智能体推理而非从机器人数据中学习。为此,我们提出Uni-LaViRA,一种统一的智能体架构,将相同的见解零样本地扩展到四个任务族(VLN-CE、ObjectNav、EQA和Aerial-VLN)和四种异构真实机器人(轮式、四足、人形机器人和自建无人机)。两种智能体循环机制使这种统一变得实用。待办列表记忆(TDM)在每一步重写待办子目标的结构化检查清单,将未完成项重新注入智能体的最近注意力窗口。二次机会回溯(SCB)将机器人回滚到错误前状态,并基于失败的子轨迹调整智能体的下一步计划,将单次导航转变为自我纠正过程。无需任何训练,Uni-LaViRA在VLN-CE R2R上达到60.7%的成功率(SR),在VLN-CE RxR上达到51.3%,在HM3D-v2上达到77.7%,在HM3D-OVON上达到60.0%,在MP3D-EQA上达到54.7%,在OpenUAV上达到40.0%,匹配甚至超越了近期消耗数百万样本和数千GPU小时的训练式导航基础模型。

英文摘要

Embodied navigation requires an agent to map language and visual observations to a stream of spatial actions that drive a real robot through environments it has never seen. The dominant approach has been to scale vision-language-action (VLA) foundation models on ever-larger collections of robot trajectories. This paper argues that, for navigation specifically, generality can be obtained structurally, not only through data scale. The underlying decision structure of navigation reduces to a single Language-Vision-Robot Actions Translation. The language action emits semantic-level directional command and the vision action emits a pixel-level visual target. Both outputs lie inside the natural output manifold of pretrained multimodal large language models (MLLMs), so the task can be reasoned about by an agent rather than learned from robot data. Therefore, we present Uni-LaViRA, a unified agentic architecture that extends the same insight to four task families (VLN-CE, ObjectNav, EQA, and Aerial-VLN) and to four heterogeneous real robots (Wheeled, Quadruped, Humanoid robot, and a self-built UAV) in a zero-shot manner. Two agent-loop mechanisms make this unification practical. TODO List Memory (TDM) rewrites a structured checklist of pending sub-goals at every step, reciting the unfinished items back into the agent's most recent attention window. Second Chance Backtrack (SCB) rolls the robot back to the pre-error state and conditions the agent's next plan on the failed sub-trajectory, turning single-pass navigation into a self-correcting process. With zero training effort, Uni-LaViRA reaches 60.7% SR on VLN-CE R2R, 51.3% on VLN-CE RxR, 77.7% on HM3D-v2, 60.0% on HM3D-OVON, 54.7% on MP3D-EQA, and 40.0% on OpenUAV, matching or even surpassing recent training navigation foundation models that consume millions of samples and thousands of GPU-hours.

2605.27571 2026-05-28 cs.AI cs.CL cs.DB

Discovery Agents for Real-Time Analytics: Toward Proactive Insight Systems

实时分析发现代理:迈向主动洞察系统

Gaetano Rossiello, Dharmashankar Subramanian

发表机构 * IBM

AI总结 提出一种多智能体架构,通过持续发现循环(假设生成、编译、验证、可视化)实现实时数据流的自主洞察发现,支持从查询驱动向主动发现的范式转变。

Comments Accepted at Supporting Our AI Overlords (SAO) at the ACM Conference on AI and Agentic Systems (CAIS), May 26 2026, San Jose, CS, USA

详情
AI中文摘要

现代分析系统本质上是反应式的,要求用户在日益复杂且持续演变的数据上定义查询。在实时流式环境中,这种范式失效,因为潜在洞察的空间变得太大而无法手动枚举。我们提出了一种用于实时数据流自主洞察发现的多智能体架构。该系统实现了一个持续发现循环,其中智能体生成假设,将其编译为可执行分析,验证生成的工件,并生成可视化和可部署的应用程序。该架构利用Apache Kafka进行事件驱动协调,Apache Flink进行流处理,以及大型语言模型来实现专门的智能体。一个关键贡献是基于类型化中间工件的契约驱动设计,实现了模块化、可观测性、血统以及更安全地执行动态生成的分析。通过零售、金融和公共数据中的用例,我们展示了该架构如何支持从查询驱动分析向主动发现驱动系统的转变。

英文摘要

Modern analytics systems are fundamentally reactive, requiring users to define queries over increasingly complex and continuously evolving data. In real-time streaming environments, this paradigm breaks down, as the space of potential insights becomes too large to enumerate manually. We present a multi-agent architecture for autonomous insight discovery over real-time data streams. The system implements a continuous discovery loop in which agents generate hypotheses, compile them into executable analytics, validate generated artifacts, and produce visualizations and deployable applications. The architecture leverages Apache Kafka for event-driven coordination, Apache Flink for stream processing, and large language models to implement specialized agents. A key contribution is a contract-driven design based on typed intermediate artifacts, enabling modularity, observability, lineage, and safer execution of dynamically generated analytics. Through use cases in retail, finance, and public data, we show how this architecture supports a shift from query-driven analytics to proactive, discovery-driven systems.

2605.27570 2026-05-28 cs.AI

LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation

LaneRoPE: 用于协同并行推理与生成的位置编码

Gabriele Cesa, Thomas Hehn, Aleix Torres-Camps, Àlex Batlle Casellas, Jordi Ros-Giralt, Arash Behboodi, Tribhuvanesh Orekondy

发表机构 * Qualcomm AI Research(高通人工智能研究)

AI总结 提出LaneRoPE方法,通过序列间注意力掩码和扩展的RoPE位置编码,使多个序列在生成时协同合作,提升数学推理任务在有限生成长度下的准确性。

详情
AI中文摘要

并行LLM测试时扩展技术(例如best-of-$N$)需要根据相同输入提示生成$N>1$个序列。这些方法在利用批处理$N$个生成的计算效率的同时提高了准确性。然而,传统上批次中的每个序列是独立生成的,因此不会重用其他序列的中间生成、计算或观察结果。在本文中,我们提出LaneRoPE,以在生成时实现$N>1$个序列之间的协调与协作。LaneRoPE包含两个关键思想:(a) 一个序列间注意力掩码,使序列的采样相互依赖;(b) 一个RoPE扩展,注入位置信息,捕获特定序列内部和外部的标记之间的相对位置。我们在数学推理任务上评估了我们的方法,并发现了有希望的结果:LaneRoPE实现了序列间的协作,在有限的生成长度下带来了额外的准确性提升。重要的是,由于LaneRoPE在底层LLM架构上只需最小改动,并且在推理时引入的开销可以忽略不计,因此它对于将并行推理快速集成到现有LLM推理流水线中具有吸引力。

英文摘要

Parallel LLM test-time scaling techniques (e.g., best-of-$N$) require drawing $N>1$ sequences conditioned on the same input prompt. These methods boost accuracy while exploiting the computational efficiency of batching $N$ generations. However, each sequence in the batch is traditionally generated independently and hence does not reuse intermediate generations, computations, or observations from other sequences. In this paper, we propose LaneRoPE to enable coordination and collaboration among $N>1$ sequences at generation time. LaneRoPE involves two key ideas: (a) an inter-sequence attention mask to make sampling of sequences dependent on one another; and (b) a RoPE extension that injects positional information that captures relative positions between tokens, both within and outside a particular sequence. We evaluate our approach on mathematical reasoning tasks and find promising results: LaneRoPE enables collaboration among sequences, yielding additional accuracy gains under limited generated sequence length. Importantly, since LaneRoPE enables coordination with minimal changes to the underlying LLM architecture and introduces a negligible overhead at inference time, it is appealing to rapidly incorporate parallel reasoning into existing LLM inference pipelines.

2605.27567 2026-05-28 cs.AI cs.CL

Why LLMs Fail at Causal Discovery and How Interventional Agents Escape

为什么LLM在因果发现中失败以及干预代理如何逃脱

Amartya Roy, Sonali Parbhoo

发表机构 * SIRE, IIT Delhi(IIT德里智能研究机构) Robert Bosch GmbH(罗伯特·博世有限公司) Imperial College London(伦敦帝国理工学院)

AI总结 本文证明大型语言模型在因果发现中存在根本性失败,并提出一种基于干预代理的因果贝叶斯优化方法(A-CBO),通过外部贝叶斯循环在无需模型微调的情况下实现可证明的收敛。

Comments 9 pages, 3 figures

详情
AI中文摘要

因果发现是科学推理的基石,但大型语言模型能否可靠地执行因果发现仍是一个悬而未决的问题。最近的基准测试表明,即使是微调后的模型在简单因果图上也会达到平台期,并随着复杂度增加而退化,但失败的原因尚未明确。我们证明这种失败是根本性的:监督微调、直接偏好优化和上下文学习都会产生无法区分生成相似观测数据的因果图的预测器,任何这样做的尝试都需要模型的内部表示无限增长,从而违反了这些方法工作的条件。我们将其形式化为核障碍定理,确立该限制是学习范式固有的,而非任何特定模型或数据集。我们提出了代理因果贝叶斯优化(A-CBO),其中冻结的语言模型作为干预预言机,回答关于干预效果的目标查询,而外部贝叶斯循环在对数轮次内将信念集中在候选因果图上。由于决策在障碍适用的空间之外运行,A-CBO在底层模型保持不变的情况下可证明收敛。在Corr2Cause上,A-CBO无需任何训练即可匹配微调基线。在Extended Corr2Cause(一个扩展到24个变量、包含18K测试样本的新基准)上,A-CBO显著优于微调和偏好优化,且优势不断扩大。

英文摘要

Causal discovery is a cornerstone of scientific reasoning, yet whether large language models can perform it reliably remains an open question. Recent benchmarks show that even fine-tuned models plateau on simple causal graphs and degrade as complexity grows, but why they fail has not been established. We prove the failure is fundamental: supervised fine-tuning, direct preference optimization, and in-context learning all produce predictors that cannot distinguish between causal graphs generating similar observational data, and any attempt to do so requires the model's internal representations to grow unboundedly, violating the very conditions under which these methods work. We formalize this as a kernel obstruction theorem, establishing that the limitation is intrinsic to the learning paradigm, \emph{not any particular model or dataset}. We propose Agentic Causal Bayesian Optimization (A-CBO), wherein a frozen language model serves as an interventional oracle answering targeted queries about intervention effects, while an external Bayesian loop concentrates beliefs over candidate graphs in logarithmically many rounds. Because the decision operates outside the space where the obstruction applies, A-CBO provably converges while the underlying model remains unchanged. On Corr2Cause, A-CBO matches fine-tuned baselines without any training. On Extended Corr2Cause, a new benchmark scaling to 24 variables with 18K test samples, A-CBO significantly outperforms both fine-tuning and preference optimization, with the advantage growing

2605.27566 2026-05-28 cs.AI

DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents

DynaSchedBench: 基于LLM的调度代理中的校准动态调度基准与可观测性悖论

Shijie Cao, Yuan Yuan, Jing Liu

发表机构 * School of Computer Science and Engineering, Beihang University, Beijing 100191, China(北航计算机科学与工程学院) Shenzhen Loop Area Institute, Shenzhen, China(深圳环城院) Qingdao Research Institute, Beihang University(北航青岛研究院) Hangzhou Innovation Institute, Beihang University(北航杭州创新院) School of Artificial Intelligence, Xidian University, Xi'an 710071, Shaanxi, China(西电人工智能学院) Guangzhou Institute of Technology, Xidian University, Guangzhou 510555, Guangdong, China(西电广州技术院)

AI总结 针对动态柔性作业车间调度问题(DFJSP),提出DynaSchedBench诊断框架,通过顺序事件空间校准器(SESC)计算调度压力指数(SSI)对实例进行难度分层,并揭示LLM调度代理中的“可观测性悖论”:完整结构信息反而降低性能。

详情
AI中文摘要

目前,针对动态柔性作业车间调度问题(DFJSP)的神经组合优化进展受到方法论上的张力阻碍:静态基准鼓励基准过拟合,而未校准的生成器则用随机噪声掩盖算法能力。为解决这一问题,我们引入了 extbf{DynaSchedBench},一个用于DFJSP的诊断框架,该框架严格控制实例生成过程。我们的方法不依赖参数采样,而是利用顺序事件空间校准器(SESC)计算一种新颖的调度压力指数(SSI),以按难度对实例进行分层。我们证明,SESC在计算效率上显著优于进化基线,同时可靠地收敛到目标指标。该框架集成了用于实例生成、基于快照的模拟、代理、评估和可视化的模块化组件,从而能够对反应式和前瞻式策略进行严格测试。利用这个校准环境,我们识别了基于LLM的调度代理的关键局限性。具体而言,在动态调度的逐步在线决策中,我们发现了一个“可观测性悖论”:向代理提供完整结构信息的oracle访问权限会降低策略性能,其表现不如简洁信息。此外,尽管存在大量的token开销,工具增强和细化策略未能可靠地提高性能,并且大多数LLM代理无法持续超越强大的调度基线——其行为更像是鲁棒的启发式近似器,而非优越的优化器。

英文摘要

Progress in neural combinatorial optimization for Dynamic Flexible Job Shop Scheduling Problem (DFJSP) is currently hindered by a methodological tension: static benchmarks encourage benchmark overfitting, while uncalibrated generators obscure algorithmic capability with stochastic noise. To resolve this, we introduce \textbf{DynaSchedBench}, a diagnostic framework for DFJSP that rigorously controls the instance-generation process. Instead of relying on parameter sampling, our approach utilizes Sequential Event-Space Calibrator (SESC) that computes a novel Schedule Stress Index (SSI) to stratify instances by difficulty. We demonstrate that SESC is substantially more computationally efficient than evolutionary baselines while converging reliably to the target metrics. The framework integrates modular components for instance generation, snapshot-based simulation, agents, evaluation, and visualization, thereby enabling rigorous testing of reactive and lookahead-based policies. Leveraging this calibrated environment, we identify key limitations of LLM-based scheduling agents. Specifically, in step-wise online decision-making for dynamic scheduling, we identify an ``Observability Paradox'': providing agents with oracle access to full structural information can degrade policy performance, underperforming concise information. Furthermore, despite substantial token overhead, tool-augmented and refinement strategies fail to reliably improve performance, and most LLM agents fail to consistently surpass strong dispatching baselines-behaving more like robust heuristic approximators than superior optimizers.

2605.27564 2026-05-28 cs.CL cs.AI cs.LG

The Future of Facts: Tracing the Factual Generation-Verification Gap

事实的未来:追踪事实生成-验证差距

Tim R. Davidson, Anja Surina, Caglar Gulcehre

发表机构 * EPFL(苏黎世联邦理工学院)

AI总结 本文通过训练阶段分析,发现语言模型在事实知识上存在生成-验证差距,验证能力先于生成能力习得且更稳健,事实更新可能导致模型处于“多宇宙”状态。

Comments Code for this project is available at https://github.com/anjasurina/factgap , blog post at https://www.trdavidson.com/fact-gap

详情
AI中文摘要

语言模型正成为事实知识的默认接口,但它们验证输出的能力往往比生成输出的能力更可靠。这种生成-验证差距(GV-gap)是近期自我改进和推理中许多进展的基础,但其在事实知识上的动态仍未被充分理解。我们聚焦于事实性GV-gap背后的训练机制,将其与计算和美学方面的对应物区分开来。我们通过四个开源模型家族(每个家族两个规模)的三个训练阶段(获取、持续学习和更新)追踪生成和验证能力。三个发现跨模型重复出现:(i)验证始终先于生成被学习;(ii)验证比生成对持续学习更稳健;(iii)事实更新可能使模型处于“多宇宙”状态,同时验证新旧答案均为正确。对前沿模型的自然实验在大规模上重现了这些动态,并揭示了在充分覆盖的事实上残留的验证偏差。

英文摘要

Language models are becoming the default interface to factual knowledge, yet they often verify outputs more reliably than they generate them. This generation-verification gap (GV-gap) underlies many recent advances in self-improvement and reasoning, but its dynamics on factual knowledge specifically remain poorly understood. We focus on the training mechanisms underlying factual GV-gaps, distinguishing them from their computational and aesthetic counterparts. We trace generation and verification capabilities through three training phases (acquisition, continual learning, and updating) across four open-source model families at two scales each. Three findings recur across models: (i) verification is consistently learned before generation; (ii) verification is more robust to continual learning than generation; and (iii) factual updates can leave models in a "multi-verse" state, simultaneously verifying both old and new answers as correct. Natural experiments on frontier models reproduce these dynamics at scale and reveal residual verification biases on well-covered facts.

2605.27561 2026-05-28 cs.CV cs.AI

Clinical Validation of the Melanoscope AI Mobile Dermoscopy Clinical Decision Support System

Melanoscope AI移动皮肤镜临床决策支持系统的临床验证

Elena Sergeevna Kozachok, Sergey Sergeevich Seregin

发表机构 * Ivannikov Institute for System Programming of the Russian Academy of Sciences(俄罗斯科学院伊万诺夫系统编程研究所) Orel Regional Oncology Dispensary(奥尔格地区肿瘤专科医院)

AI总结 本研究提出了一种级联深度学习模型的定量可解释性评估方法和三区患者分流算法,并在俄罗斯门诊实践中对Melanoscope AI CDSS进行了前瞻性单中心临床验证,结果显示无假阴性且特异性为88.3%。

Comments 24 pages, 6 figures, 5 tables, 21 references

详情
AI中文摘要

引言:恶性皮肤病变的早期检测对预后至关重要,但俄罗斯地区皮肤科医生短缺限制了筛查覆盖。移动皮肤镜临床决策支持系统(CDSS)提供了一种有前景的方法,但模型可解释性和标准化患者分流仍是采用的关键障碍。目的:开发一种级联深度学习模型的定量可解释性评估方法和三区患者分流算法,并在俄罗斯门诊实践中对Melanoscope AI CDSS进行初步的单中心前瞻性临床验证。材料与方法:皮肤镜图像的两阶段级联分类;注意力图可视化(ViT和Swin使用注意力展开;ConvNeXt和EfficientNetV2使用Grad-CAM);激活图与专家标注之间基于IoU的定量一致性评估;在四次“黑色素瘤日”活动(俄罗斯奥廖尔,2025年6月至2026年4月)中进行前瞻性单中心验证。结果:在176名患者中:与专家评估一致率为88.6%;5例恶性病变中无假阴性(95% CI: 47.8-100.0%);特异性为88.3%。组织学证实了3例黑色素瘤和2例基底细胞癌;6例发育不良痣被纳入随访。平均IoU(n=180):ViT - 0.69;Swin - 0.64;ConvNeXt - 0.53;EfficientNetV2 - 0.51。分流阈值:P<0.15 / 0.15-0.50 / >=0.50。结论:未观察到假阴性;特异性为88.3%,支持筛查应用。集成的级联分类、带IoU评估的注意力图可视化和三区分流提供了可重复、可解释的临床决策支持,可适应不同资源水平。

英文摘要

Introduction. Early detection of malignant skin lesions is critical for prognosis, yet dermatologist shortages in Russian regions limit screening coverage. Mobile dermoscopy clinical decision support systems (CDSS) offer a promising approach, with model interpretability and standardised patient routing remaining key barriers to adoption. Aim. To develop a quantitative interpretability assessment method for cascade deep learning models and a three-zone patient routing algorithm, and to conduct a preliminary single-centre prospective clinical validation of the Melanoscope AI CDSS in Russian outpatient practice. Material and methods. Two-stage cascade classification of dermoscopic images; attention map visualisation (attention rollout for ViT and Swin; Grad-CAM for ConvNeXt and EfficientNetV2); quantitative IoU-based agreement assessment between activation maps and expert annotations; prospective single-centre validation across four "Melanoma Day" sessions (Orel, Russia, June 2025 - April 2026). Results. On 176 patients: agreement with expert assessment 88.6%; no false negatives among 5 malignant lesions (95% CI: 47.8-100.0%); specificity 88.3%. Three melanomas and two basal cell carcinomas were histologically confirmed; six dysplastic naevi placed under follow-up. Mean IoU (n=180): ViT - 0.69; Swin - 0.64; ConvNeXt - 0.53; EfficientNetV2 - 0.51. Routing thresholds: P<0.15 / 0.15-0.50 / >=0.50. Conclusion. No false negatives were observed; specificity was 88.3%, supporting screening use. The integrated cascade classification, attention map visualisation with IoU assessment, and three-zone routing provide reproducible, interpretable clinical decision support adaptable to varying resource levels.

2605.27551 2026-05-28 cs.AI cs.CR cs.IR cs.MM

On the Origin of Synthetic Information by Means of Steganographic Inheritance

论通过隐写继承的合成信息起源

Ching-Chun Chang, Isao Echizen

发表机构 * Information and Society Research Division, National Institute of Informatics(信息与社会研究部,信息机构)

AI总结 针对合成信息溯源难题,提出一种基于隐写术的遗传机制,通过嵌入可追踪的谱系特征实现合成信息父系鉴定,理论分析与实验验证了方法的有效性。

详情
AI中文摘要

物种起源一直是自然科学中谜中之谜。类比而言,我们认为合成信息的起源是信息科学中谜中之谜。这个问题承载着道德分量,技术解释既无法完全解决,也不能不负责任地忽视,因为它对真理、信任和人类智力的影响深远地延伸到更广泛的经济和社会。人工智能的强大使得合成信息的进化谱系越来越难以追踪,因为一个足够强大的模型可能产生在结构或信号层面上与其父源几乎不相似的后代。如同遗传学中,两个个体可能具有相同的表型,在外观上相互镜像,但基因型却根本不同。我们提出通过隐写术实现一种类似于遗传的机制。在后代被复制的时刻,投影仪从父代派生出一个特征,隐写编码器将其不可见地隐藏在后代中。该特征在赛博生态系统中贯穿后代的整个生命周期。当查询父系时,隐写解码器从后代中提取该特征,并与参考池中候选父代的特征进行比较,从而提名最可能的父代。理论分析将系统发育准确性表征为投影仪和隐写系统属性的函数,而跨多个投影仪和隐写系统的实证评估表明,所提出的方法在广泛的处理操作和语义修改下具有可行性。我们设想一个赛博生态系统,其中合成信息被赋予隐藏但可追踪的谱系特征,从简单的开端分支成无尽的形态,这些形态已经并且正在进化。

英文摘要

The origin of species has been the mystery of mysteries in natural science. By analogy, the origin of synthetic information, we suggest, is the mystery of mysteries in information science. The question carries a moral weight that a technical account can neither fully resolve nor responsibly ignore, as its impact on truth, trust, and human intellect extends deep into the broader economy and society. The very power of artificial intelligence makes the evolutionary lineage of synthetic information grow ever harder to trace, for a sufficiently capable model may generate offspring that bear little resemblance, at either the structural or signal level, to the parent source from which they were derived. As in genetics, two individuals may share the same phenotype mirroring each other in outward appearance, yet differ fundamentally in their genotype. We propose, by means of steganography, a mechanism analogous to heredity. At the moment an offspring is reproduced, a projector derives a trait from the parent, and a steganographic encoder invisibly hides it within the offspring. This trait persists throughout the offspring's life cycle in a cyber ecosystem. When parentage is queried, a steganographic decoder extracts the trait from the offspring and compares it against the traits of candidate parents in a reference pool, thereby nominating the most likely one. A theoretical analysis characterises phylogenetic accuracy as a function of projector and stegosystem properties, whilst empirical evaluations across multiple projectors and stegosystems demonstrate the viability of the proposed methodology under a broad spectrum of processing operations and semantic modifications. We envision a cyber ecosystem in which synthetic information, endowed with hidden yet traceable lineage traits, branches from a simple beginning into endless forms that have been, and are being, evolved.

2605.27546 2026-05-28 cs.CL cs.HC

Keyphrase Generative Representation of Youth Crisis Conversations Beyond Static Taxonomies

超越静态分类法的青少年危机对话的关键词生成表示

Abeer Badawi, Will Aitken, Lydia Sequeira, Jocelyn Rankin, Maia Norman, Elham Dolatabadi

发表机构 * York University(约克大学) Vector Institute(向量研究所) Electrical and Computer Engineering, Queen’s University(皇后大学电气与计算机工程系) Kids Help Phone(儿童援助电话)

AI总结 本文提出关键词生成表示(KGR)方法,通过约束大语言模型生成对话特定的关键词,将原有19标签分类扩展为39标签层次结构,在129段对话和387个专家注释上评估,准确率达0.96,并发现固定分类中缺失的身份相关主题,将主题检索准确率从0.25提升至0.70。

详情
AI中文摘要

危机响应者每年快速评估数千条青少年短信对话,以识别心理健康问题并指导支持。然而,青少年的痛苦越来越多地通过不断演变且依赖具体语境的语言表达,这些语言通常不适合固定标签的分类法。本研究分析了703,975条去标识化的Kids Help Phone对话(2018-2023年),并将KHP的19标签问题分类扩展为39标签层次结构。然后,我们引入关键词生成表示(KGR),一种受约束的大语言模型,生成简洁、对话特定的关键词,在129段对话和387个专家注释上进行了评估。扩展后的分类法达到了专家共识可靠性,准确率为0.96,专家评审发现81%的关键词准确反映了内容,74%提高了清晰度。KGR揭示了固定分类法中缺失的与身份相关的主题,包括移民问题和照顾者负担,并支持了一个主题检索工作流,与手动分析师流程相比,准确率从0.25提高到0.70(+0.45)。KGR标志着向混合、可解释的生成表示转变,将危机响应扩展到静态分类法之外,以揭示新兴的、植根于文化的青少年痛苦模式。

英文摘要

Crisis Responders (CRs) rapidly assess thousands of youth SMS conversations each year to identify mental health concerns and guide support. Yet youth distress is increasingly expressed through evolving and context-specific language that often does not fit fixed-label taxonomies. This work analyzed 703,975 de-identified Kids Help Phone conversations (2018-2023) and expanded KHP's 19-label issue taxonomy into a 39-label hierarchical schema. We then introduce Keyphrase Generative Representation (KGR), a constrained LLM generating concise, conversation-specific keyphrases, evaluated across 129 conversations and 387 expert annotations. The expanded taxonomy achieved expert consensus reliability, with an accuracy of 0.96, and expert review found that 81% of keyphrases accurately reflected content and 74% improved clarity. KGR surfaced identity-linked themes absent from the fixed taxonomy, including immigration problems and caregiver burden, and supported a topic-retrieval workflow that increased accuracy from 0.25 to 0.70 (+0.45) over the manual analyst process. KGR marks a shift toward hybrid, interpretable generative representations that extend crisis response beyond static taxonomies to surface emerging and culturally grounded patterns of youth distress.

2605.27545 2026-05-28 cs.CL

PAST2HARM: A Simple Adaptive Past Tense Attack for Jailbreaking Multimodal AI

PAST2HARM:一种用于越狱多模态AI的简单自适应过去时攻击

Snehasis Mukhopadhyay

发表机构 * Indian Institute of Information Technology, Kalyani(印度信息技术学院,卡利安)

AI总结 提出PAST2HARM框架,通过过去时态改写和迭代升级策略,系统性地利用多模态文本到图像模型的安全漏洞,实现黑盒、无梯度的高成功率越狱攻击。

详情
AI中文摘要

尽管不安全的图像生成可能比不安全的文本产生更严重的后果,且当前防御相对不成熟,但对多模态AI系统的越狱攻击仍未得到充分探索。我们引入了PAST2HARM,一个简单而有效的自适应越狱框架,能够绕过最先进的多模态文本到图像模型中的拒绝训练。基于先前发现过去时态改写可以规避安全防护的结论,PAST2HARM系统地利用了多模态生成式AI中的这一漏洞。 我们沿两个维度刻画攻击。第一,广度:通过时间深化,该框架逐步增强历史锚定和档案线索,侵蚀不同对齐强度模型的拒绝边界。第二,深度:通过初始顺从后的迭代升级,我们探测有害生成的上限,使用由语言模型作为评判者评估的标量严重性越狱指标来衡量严重程度。我们发现对话中间轮次形成峰值脆弱窗口,其中有害性增加后趋于平稳,最终经历语义反转。 我们在三个模型Gemini Nano Banana Pro、GPT Image 2和SD XL上评估PAST2HARM,在黑盒、无梯度设置下分别实现了83%、67%和100%的攻击成功率。对抗性提示也在模型间迁移,跨模型成功率超过50%。该攻击引发了多种有害输出,包括露骨色情内容、政治虚假信息、历史否认叙事、仇恨言论和自我伤害美化。我们进一步发布了一个精心策划的提示、改写和输出基准,作为红队测试和对齐的资源。我们的结果暴露了当前安全防护的根本脆弱性,并强调了加强多模态安全训练的必要性。

英文摘要

Jailbreak attacks on multimodal AI systems remain underexplored, even though unsafe image generation can have more severe consequences than unsafe text and current defenses are relatively immature. We introduce PAST2HARM, a simple yet effective adaptive jailbreak framework that bypasses refusal training in state of the art multimodal text to image models. Building on prior findings that past tense reformulations can evade safeguards, PAST2HARM systematically exploits this vulnerability in multimodal generative AI. We characterize the attack along two dimensions. First, breadth: through temporal deepening, the framework incrementally strengthens historical anchoring and archival cues, eroding refusal boundaries across models with varying alignment strength. Second, depth: via iterative escalation after initial compliance, we probe the upper bound of harmful generation, measuring severity using a scalar severity jailbreak metric evaluated by a language model acting as a judge. We find that mid conversation turns form peak vulnerability windows, where harmfulness increases before plateauing and eventually undergoing semantic inversion. We evaluate PAST2HARM on three models Gemini Nano Banana Pro, GPT Image 2, and SD XL achieving attack success rates of 83 percent, 67 percent, and 100 percent in a black box, gradient free setting. Adversarial prompts also transfer across models, with cross model success rates above 50 percent. The attack elicits diverse harmful outputs, including explicit sexual content, political disinformation, historical denial narratives, hate speech, and self harm glorification. We further release a curated benchmark of prompts, reformulations, and outputs as a resource for red teaming and alignment. Our results expose fundamental brittleness in current safeguards and highlight the need for stronger multimodal safety training.

2605.27541 2026-05-28 cs.LG

SparseOpt: Addressing Normalization-induced Gradient Skew in Sparse Training

SparseOpt:解决稀疏训练中归一化引起的梯度倾斜

Mohammed Adnan, Rohan Jain, Tom Jacobs, Ekansh Sharma, Rahul G. Krishnan, Rebekka Burkholz, Yani Ioannou

发表机构 * University of Calgary(卡尔加里大学) University of Toronto(多伦多大学) Vector Institute(向量研究所) CISPA Helmholtz Center for Information Security(CISPA海德堡信息安全中心)

AI总结 针对动态稀疏训练收敛慢的问题,通过分析批归一化对稀疏训练的不利影响,提出稀疏感知优化器SparseOpt,实现更快的收敛和更好的泛化。

Comments Accepted International Conference on Machine Learning (ICML) 2026

详情
AI中文摘要

动态稀疏训练(DST)方法通过保持稀疏性同时动态调整网络拓扑来训练神经网络。尽管有望减少计算量,但DST方法的收敛速度明显慢于密集训练,通常需要相当长的训练时间才能达到相似的精度。我们在分析和实验上均证明,批归一化(BN)对稀疏训练有不利影响,并提出了SparseOpt,一种稀疏感知优化器来解决这个问题。在CIFAR-100和ImageNet上使用ResNet模型进行的实验表明,我们提出的方法具有持续更快的收敛速度和更好的泛化性能。我们的工作突出了当前归一化层在稀疏训练中的局限性,并首次系统研究了批归一化、稀疏层和DST之间的相互作用,朝着使DST在实际中与密集训练竞争迈出了重要一步。

英文摘要

Dynamic Sparse Training (DST) methods train neural networks by maintaining sparsity while dynamically adapting the network topology. Despite the promise of reduced computation, DST methods converge significantly slower than dense training, often requiring comparable training time to achieve similar accuracy. We demonstrate both analytically and empirically that Batch Normalization (BN) adversely affects sparse training, and propose SparseOpt, a sparsity-aware optimizer, to address this. Experiments on ResNet models across CIFAR-100 and ImageNet demonstrate consistently faster convergence and improved generalization with our proposed method. Our work highlights the limitations of current normalization layers in sparse training and provides the first systematic study of the interaction between Batch Normalization, sparse layers, and DST, taking a significant step toward making DST practically competitive with dense training.

2605.27539 2026-05-28 cs.RO

Synthetic Emotions vs. Gamification: Exploring Engagement Strategies for Small Social Robots in Different Age Groups

合成情感 vs. 游戏化:探索不同年龄段小型社交机器人的参与策略

Morten Roed Frederiksen, Kasper Støy

发表机构 * Data Systems & Robotics Department of The IT-University of Copenhagen(丹麦哥本哈根技术大学数据系统与机器人学系)

AI总结 本研究通过两项实验(6-8岁儿童偏好评估和20-27岁大学生行为研究)比较了触觉机器人使用合成情感反馈与积分奖励两种参与策略的效果,发现儿童偏好情感参与,而大学生在积分系统下任务准确率更高且表现持久,揭示了不同年龄组在参与策略有效性上的差异。

Comments 7 pages

详情
AI中文摘要

许多儿童在情绪调节和社交互动方面面临挑战,这限制了他们在日常活动和治疗项目中的参与。为了使社交辅助机器人在这一背景下有效,儿童必须保持持续且有意义的参与。我们探索了一种触觉机器人的参与策略,该机器人旨在通过日常互动支持患有焦虑症的儿童。机器人提供合成情感反馈或积分奖励以鼓励用户参与。我们通过两项研究评估了这些策略:一项是对16名6-8岁学龄儿童的偏好评估,另一项是在自然环境中对14名20-27岁大学生的行为研究。对学龄儿童的研究表明,他们更倾向于情感参与而非基于积分的方法。对大学生进行全天互动的后续研究显示了对比结果:基于积分的系统产生了显著更高的任务准确率(p < 0.05)并保持了持续的表现。来自不同用户群体的发现表明,陈述的偏好和行为结果可能因参与环境而异,这凸显了通过观察互动来验证设计假设的重要性。这项工作为人类-机器人交互设计中参与策略有效性的年龄相关差异提供了见解。

英文摘要

Many children experience challenges in emotional regulation and social interaction, which can limit their participation in everyday activities and therapeutic programs. For socially assistive robots to be effective in this context, it is essential that children remain consistently and meaningfully engaged. We explore engagement strategies for a tactile robot designed to support children suffering from anxiety disorders through daily interactions. The robot delivers either synthetic emotional feedback or point rewards to encourage user participation. We evaluated these strategies through two studies: a preference assessment with 16 school children aged 6-8 years, and a behavioral study with 14 university students aged 20-27 years in naturalistic environments. The study with school children indicated a preference for emotional engagement over points-based approaches. The follow up study with university students across a full day of interactions revealed contrasting results: points-based systems produced significantly higher task accuracy (p < 0.05) and sustained performance over time. Findings from different user groups suggest that stated preferences and behavioral outcomes can diverge depending on engagement context, highlighting the importance of validating design assumptions through observed interaction. This work contributes insights into age-related differences in engagement strategy effectiveness in human-robot interaction design.

2605.27533 2026-05-28 cs.RO

Inducing Calmness With Pocket-Sized Robotics: Reducing Movement and Heart Rate in Children through Hand-Held Tactile Interactions

用口袋大小的机器人诱导平静:通过手持触觉交互降低儿童的心率和运动

Morten Roed Frederiksen, Kasper Støy, Maja Matarić

发表机构 * Data Systems and Robotics, IT-University of Copenhagen(数据系统与机器人,丹麦IT大学) Interaction Lab, University of Southern California(交互实验室,南加州大学)

AI总结 本研究通过手持触觉设备上的节奏振动匹配游戏,发现触觉交互能显著降低儿童的生理唤醒(心率下降3.56 bpm)和身体躁动(整体运动减少38%),从而促进平静和专注状态。

Comments 34 pages, 2 tables, 7 figures

详情
AI中文摘要

高唤醒或躁动期会干扰儿童的注意力、自我调节和身体平静能力。通过触觉交互鼓励具身自我调节的技术可能提供一种简单易行的方法来促进平静。本文研究了与口袋大小的触觉设备交互如何影响典型发育儿童的生理和行为平静标记。基于先前关于心率调节的研究,我们提出了关于触觉交互如何影响全身运动和姿势稳定性的新发现。我们使用一种设备,通过手持节奏振动匹配游戏吸引儿童,旨在集中注意力并鼓励静止。18名儿童参与了一项受试者内研究,包括两种条件:有和没有手持设备的触觉交互,同时记录他们的心率和身体运动。结果表明,触觉游戏交互降低了生理唤醒(心率下降3.56 bpm,p < 0.01)和身体躁动(整体运动减少38%,p < 0.05),与注意力相关的身体区域向静止变化最大(运动减少45%)。这些发现表明,与手持设备的短暂触觉游戏式参与可以下调生理激活,促进平静和专注状态,从而有助于持续注意力和行为调节。

英文摘要

Periods of heightened arousal or restlessness can interfere with children's ability to focus, self-regulation, and physically calm. Technologies that encourage embodied self-regulation through tactile interaction may provide a simple and accessible means of promoting calmness. This paper investigates how interaction with a pocket-sized tactile device influences physiological and behavioral markers of calmness in typically developing children. Building on prior work examining heart rate modulation, we present new findings on how tactile interaction affects full-body movement and postural stability. We employ a device that engages children through a hand-held rhythmic vibration-matching game, designed to focus attention and encourage stillness. Eighteen children participated in a within-subjects study that involved two conditions: with and without tactile interaction with a hand-held device, while having their heart rate and body movement recorded. Results show that the tactile game interaction reduced physiological arousal (heart rate decreased by 3.56 bpm, p < 0.01) and physical restlessness (overall movement decreased by 38%, p < 0.05), with attention-related body regions showing the greatest change toward stillness (45% reduction in movement). These findings demonstrate that brief tactile game-like engagement with a hand-held device can down-regulate physiological activation, promoting the calm and focused states toward sustained attention and behavior regulation.

2605.27499 2026-05-28 cs.LG astro-ph.CO astro-ph.IM physics.comp-ph stat.ML

GenSBI: Generative Methods for Simulation-Based Inference in JAX

GenSBI: 基于JAX的模拟推断生成方法

Aurelio Amerio

发表机构 * Instituto de Física Corpuscular (IFIC) Universitat de València & CSIC(粒子物理研究所(IFIC)瓦伦西亚大学 & 西班牙国家科研委员会)

AI总结 提出GenSBI库,在JAX中实现流匹配、分数匹配和去噪扩散等生成模型,用于模拟推断,提供统一接口和多种Transformer架构,并在标准基准上达到接近理想的C2ST分数。

Comments 48 pages + 1 appendix, 33 figures, 18 tables. For the associated Python code, see https://github.com/aurelio-amerio/GenSBI

详情
AI中文摘要

流和扩散生成模型已成为模拟推断(SBI)中广泛采用的密度估计器,从神经后验估计自然扩展到似然和联合密度估计。它们原则性的优化目标和不受架构约束的特点推动了在自然科学中的快速采用。然而,最广泛使用的SBI库仍然是基于PyTorch的,这使得在JAX中开发前向模型和分析流程的研究人员没有原生选择。我们提出GenSBI,一个完全在JAX中实现流匹配、分数匹配和去噪扩散的开源库。该库提供三种基于Transformer的架构——SimFormer、Flux1和一种新颖的Flux1Joint,它将门控调制Transformer块扩展到联合密度估计——所有这些都通过一个统一接口互换,该接口解耦了生成方法、神经骨干和推理模式。GenSBI提供了从训练到后验校准(SBC、TARP、LC2ST)的端到端工作流,并支持具有领域特定嵌入网络的自定义架构。我们在标准SBI基准上验证了该框架,在SBIBM任务上以最小的每任务调整实现了接近理想的平均C2ST分数(0.50-0.56,其中0.50为理想值),并且在所有测试配置中后验覆盖校准良好。代码公开于https://github.com/aurelio-amerio/GenSBI。

英文摘要

Flow and diffusion generative models have established themselves as widely adopted density estimators for simulation-based inference (SBI), extending naturally from neural posterior estimation to likelihood and joint density estimation. Their principled optimization objectives and freedom from architectural constraints have driven rapid adoption across the natural sciences. Yet the most widely used SBI libraries remain PyTorch-based, leaving researchers who develop their forward models and analysis pipelines in JAX without a native option. We present GenSBI, an open-source library that implements flow matching, score matching, and denoising diffusion entirely in JAX. The library offers three transformer-based architectures - SimFormer, Flux1, and a novel Flux1Joint that extends gate-modulated transformer blocks to joint density estimation - all interchangeable through a unified interface that decouples generative method, neural backbone, and inference mode. GenSBI provides an end-to-end workflow from training through posterior calibration (SBC, TARP, LC2ST) and supports custom architectures with domain-specific embedding networks. We validate the framework on standard SBI benchmarks, achieving near-ideal mean C2ST scores (0.50-0.56, where 0.50 is ideal) on SBIBM tasks with minimal per-task tuning and well-calibrated posterior coverage across all tested configurations. The code is publicly available at https://github.com/aurelio-amerio/GenSBI.

2605.27495 2026-05-28 cs.CV cs.LG

Representation-Conditioned Diffusion Models for Guided Training Data Generation

表示条件扩散模型用于引导训练数据生成

Nithesh Chandher Karthikeyan, Jonas Unger, Gabriel Eilertsen

发表机构 * Linköping University(利乌普斯大学)

AI总结 本文提出表示条件扩散模型,通过DINOv2、DINOv3和CLIP的表示条件生成合成图像,在ImageNet100上分类准确率比类条件生成高10.76个百分点,甚至超过真实数据训练的模型2.0个百分点。

详情
AI中文摘要

数据可用性仍然是许多深度学习应用中的关键瓶颈。大规模数据集通常收集、整理和标注成本高昂,这可能限制监督学习方法的可扩展性和适用性。在这项工作中,我们评估了在由生成式深度学习产生的合成图像数据集上训练的模型的分类性能。具体而言,我们使用基于DINOv2、DINOv3和CLIP学习表示的潜在扩散模型。我们的结果表明,这种表示条件公式通过提高样本质量和模式覆盖,显著优于类条件生成(在ImageNet100上top-1准确率提高10.76个百分点)。此外,通过扩大合成数据集的规模,我们能够超越在真实数据上训练的分类器(top-1准确率提高2.0个百分点)。我们还展示了生成的图像如何用于增强目的,优于经典增强方法,以及如何利用条件空间进行样本过滤以进一步提高训练价值。总的来说,这些发现表明,表示条件扩散模型为在大规模视觉学习任务中增强、补充或潜在替代真实世界数据集提供了一种有前景的方法。

英文摘要

Data availability remains a critical bottleneck in many deep learning applications. Large-scale datasets are often expensive to collect, curate and annotate, which can limit the scalability and applicability of supervised learning methods. In this work, we evaluate the classification performance of models trained on synthetic image datasets produced by generative deep learning. In particular, we use latent diffusion models conditioned on learned representations from DINOv2, DINOv3, and CLIP. Our results demonstrates that this representation-conditioned formulation significantly outperforms class-conditioned generation by a large margin (+10.76 p.p. top-1 accuracy on ImageNet100), by improving sample quality and mode coverage. Furthermore, by scaling the size of the synthetic dataset, we are able to outperform a classifier trained on the real data (+2.0 p.p top-1 accuracy). We also demonstrate how generated images can be used for augmentation purposes, outperforming classical augmentation methods, and how the conditioning space can be used for sample filtering to further improve training value. Collectively, these findings highlight that representation-conditioned diffusion models provide a promising approach for augmenting, complementing, or potentially replacing real-world datasets in large-scale visual learning tasks.

2605.27491 2026-05-28 cs.RO

GE-Sim 2.0: A Roadmap Towards Comprehensive Closed-loop Video World Simulators for Robotic Manipulation

GE-Sim 2.0:迈向机器人操作综合闭环视频世界模拟器的路线图

Boxiang Qiu, Liliang Chen, Yue Liao, Nan Wang, Lintao Wang, Jiayi Luo, Wenzhi Zhao, Shengcong Chen, Di Chen, Ye Li, Chen Gao, Shuicheng Yan, Si Liu, Maoqing Yao, Guanghui Ren

发表机构 * AgiBot BUAA(北京航空航天大学) LV-NUS Lab(国立大学理工学院实验室) TJU(天津大学)

AI总结 提出GE-Sim 2.0,一种基于动作条件视频生成的闭环视频世界模拟器,通过重新训练数千小时真实机器人数据并新增状态专家、世界裁判和加速框架三个模块,实现高保真动作跟随和轨迹覆盖,在WorldArena排行榜上以2B参数超越专用模型和通用视频生成器,并验证了基于其生成轨迹和奖励训练的策略在真实世界中的有效性。

详情
AI中文摘要

我们介绍了GE-Sim 2.0(Genie Envisioner世界模拟器2.0),一种用于机器人操作的闭环视频世界模拟器。基于Genie Envisioner的动作条件视频生成框架,GE-Sim 2.0在数千小时的真实机器人数据上重新训练,涵盖遥操作、接触丰富交互和机载策略部署,显著提高了动作跟随保真度和轨迹覆盖范围。在此基础之上,三个新模块实现了从视频模拟到策略学习的闭环:一个状态专家,从视频潜在表示中解码本体感觉状态,以支持下游VLA策略的下一块预测;一个世界裁判,根据任务指令对生成的轨迹进行评分,提供机器可验证的成功信号和奖励,取代人工检查;以及一个加速框架,在单个H100上以2.3秒生成25帧轨迹,并在推理时支持高达4倍跳帧以实现长程评估。GE-Sim 2.0仅以2B参数便登顶公开的WorldArena排行榜,超越了专用机器人世界模型和闭源通用视频生成器,并且基于其生成轨迹和奖励训练的策略可转化为可测量的真实世界收益,确立了GE-Sim 2.0作为可扩展评估和操作策略闭环学习的实用平台。

英文摘要

We introduce GE-Sim 2.0 (Genie Envisioner World Simulator 2.0), a closed-loop video world simulator for robotic manipulation. Building on the action-conditioned video generation framework of Genie Envisioner, GE-Sim 2.0 is re-trained on thousands of hours of real-world robot data spanning teleoperation, contact-rich interaction, and on-robot policy deployment, substantially improving action-following fidelity and trajectory coverage. On top of this foundation, three new modules close the loop from video simulation to policy learning: a state expert that decodes proprioceptive state from video latents to support next-chunk prediction by downstream VLA policies; a world judge that scores generated rollouts against task instructions, yielding machine-verifiable success signals and rewards in place of manual inspection; and an acceleration framework that delivers a 25-frame rollout in 2.3 seconds on a single H100, with up to 4* frame skipping at inference for long-horizon evaluation. GE-Sim 2.0 tops the public WorldArena leaderboard at only 2B parameters, outperforming both dedicated robotic world models and closed-source general video generators, and policies trained against its rollouts and rewards translate into measurable real-world gains, establishing GE-Sim 2.0 as a practical platform for scalable evaluation and closed-loop learning of manipulation policies.

2605.27487 2026-05-28 cs.CV cs.AI

Diffusion-Based Ukrainian Handwritten Text Generation with Cross-Domain Style Transfer

基于扩散的乌克兰手写文本生成与跨域风格迁移

Andrii Ahitoliev, Pavlo Berezin

发表机构 * Ukrainian Catholic University, Lviv, Ukraine(乌克兰天主教大学,利沃夫,乌克兰) National University of ``Kyiv-Mohyla Academy'', Kyiv, Ukraine(基輔-莫 Hil'a 学院国立大学,基輔,乌克兰)

AI总结 针对乌克兰语等非拉丁文字手写文本生成缺乏数据和模型泛化研究的问题,构建了乌克兰手写单词数据集并重新训练DiffusionPen模型,通过跨语言、零样本和少样本迁移实验验证了潜在扩散模型在跨域风格迁移中的有效性。

Comments 16 pages, 7 figures. Submitted to ICTERI 2026

详情
AI中文摘要

基于书写者风格的手写文本生成(HTG)在拉丁文字中已被广泛研究,但在低资源和非拉丁书写系统中仍探索不足,现有模型在拉丁域之外的泛化能力尚不明确。西里尔字母,尤其是乌克兰语,缺乏大规模书写者标注数据集和此类泛化的经验证据。为填补这一空白,我们使用连通分量分割、质量过滤和对代表性不足的乌克兰字符进行针对性过采样,构建了一个包含308位书写者、126,177张图像的乌克兰手写单词数据集。我们在不修改架构的情况下,在该数据集上重新训练了DiffusionPen——一种带有MobileNetV2三元组损失风格编码器和CANINE条件潜在扩散U-Net的模型,测试了从拉丁到西里尔字母的直接迁移。我们在三种设置下评估跨域风格迁移:从IAM英文样本的跨语言迁移、对20世纪早期乌克兰手稿的零样本迁移,以及对当代书写者的少样本模仿。该模型生成可读且风格一致的单词图像,表明少样本潜在扩散模型能够泛化到拉丁文字域之外。我们发布了数据集、训练模型和评估协议,作为书写者感知的西里尔HTG的可复现基准,为将风格化HTG扩展到其他代表性不足的书写系统奠定了基础。

英文摘要

Handwritten text generation (HTG) conditioned on writer style has been widely studied for Latin scripts, but remains underexplored for low-resource and non-Latin writing systems, leaving open how well existing models generalise beyond the Latin domain. Cyrillic, particularly Ukrainian, lacks both large-scale writer-labeled datasets and empirical evidence of such generalisation. To address this gap, we construct a Ukrainian handwritten word dataset of 126,177 images from 308 writers using connected-component segmentation, quality filtering, and targeted oversampling of underrepresented Ukrainian characters. We retrain DiffusionPen, a MobileNetV2 triplet-loss style encoder with a CANINE-conditioned latent diffusion U-Net, on this dataset without architectural modification, testing direct transfer from Latin to Cyrillic. We evaluate cross-domain style transfer in three settings: cross-lingual transfer from IAM English samples, zero-shot transfer to an early 20th-century Ukrainian manuscript, and few-shot imitation of contemporary writers. The model produces legible, style-consistent word images, indicating that few-shot latent diffusion models generalize beyond the Latin-script domain. We release the dataset, trained models, and evaluation protocol as a reproducible benchmark for writer-aware Cyrillic HTG, providing a foundation for extending stylized HTG to other underrepresented writing systems.

2605.27486 2026-05-28 cs.LG

Federated Learning for Multivariate Time Series Anomaly Detection in Industrial Automation

面向工业自动化的多变量时间序列异常检测的联邦学习

Khayyam Nosrati, Martin Uray, Saverio Messineo, Olaf Sassnick, Stefan Huber

发表机构 * Josef Ressel Centre for Intelligent and Secure Industrial Automation(乔塞夫·雷斯尔智能与安全工业自动化中心) Salzburg University of Applied Sciences(萨尔茨堡应用科学大学) Department of Artificial Intelligence and Human Interfaces(人工智能与人机接口部门) Paris Lodron University of Salzburg(萨尔茨堡巴黎洛登大学)

AI总结 本文针对联邦学习范式下多变量时间序列异常检测的数据集挑战,引入一个具有循环动态特性的数据集,并评估了多种MTSAD方法。

Comments Preprint. Accepted at the DEXA International Workshop on Optimisation of Industrial Production with AI Algorithms 2026 (DEXA AI4IP 2026)

详情
AI中文摘要

联邦学习(FL)拓宽了多变量时间序列异常检测(MTSAD)的视野。然而,在FL范式内对此类异常检测方法进行基准测试面临着以数据为中心的挑战。现有数据集无法应对这些挑战,因为它们不能同时提供足够的规模、准确的标签以及避免常见缺陷。此外,在离散工业自动化中常见的循环过程行为在当前的MTSAD研究中仍未得到充分探索。本文旨在进一步阐明相关文献,并通过引入一个由离散自动化过程的重复性产生的循环动态数据集来填补这些空白,同时在所提出的数据集和一个公开基准数据集上评估选定的MTSAD方法。

英文摘要

Federated learning (FL) has broadened the horizon for multivariate time series anomaly detection (MTSAD). However, benchmarking such anomaly detection methods within FL paradigm poses data-centric challenges. The existing datasets do not counteract these challenges since they do not simultaneously provide sufficient scale, accurate labels, and freedom from common flaws. In addition, the role of cyclic process behavior, which is common in discrete industrial automation, remains underexplored for MTSAD for the current state of research. This paper aims to shed more light on the literature and address these gaps by introducing a dataset designed with cyclic dynamics arising from the repetitive nature of discrete automation processes and evaluates selected MTSAD methods on both the proposed dataset and a public benchmark dataset.

2605.27483 2026-05-28 cs.CL cs.AI cs.LG

Debate Helps Weak Judges Reward Stronger Models

辩论有助于弱裁判奖励更强的模型

Ethan Elasky, Frank Nakasako, Naman Goyal

发表机构 * Palaestra Research(帕莱斯特拉研究) Berkeley(伯克利)

AI总结 研究在强辩手/弱裁判设置下的提议者-批评者辩论,发现当批评者分类能力超过裁判且裁判将批评者言论视为待验证的主张时,辩论能显著提升裁判表现,并可通过单一独立批评以更低成本实现类似效果。

详情
AI中文摘要

尽管理论上具有前景,但辩论作为一种可扩展的监督协议产生了混合的实证结果:在某些设置中有收益,在其他设置中无效,尤其是当裁判没有隐藏信息时。我们在程序可验证的代码和逻辑任务上,研究了强辩手/弱裁判设置下的提议者-批评者辩论。当批评者提供可用的优势时,辩论帮助裁判优于咨询基线:批评者的分类能力必须超过裁判,并且裁判必须将批评者的言论视为待验证的主张而非待总结的证词。在五个配对中的三个满足该条件的配对中,提议者-批评者辩论的收益在统计上显著优于咨询,并且这些配对是最有能力的模型配对。在我们的集合中的两个非响应者配对中,辩论产生无效效果,一旦批评者进入转录,裁判验证率下降数十个百分点。在这些情况下,批评者的二元分类能力与裁判的相差在噪声范围内,并且批评者的分歧被解析为证词而非待检查的主张。从辩论中消去反驳轮次对裁判表现没有可测量的变化:单一独立批评以更低的推理成本恢复了辩论的大部分收益。这些发现为可验证领域(答案、批评、裁判)中无需训练的可扩展监督提供了一种更廉价的原始方法,以及一种预测辩论何时有帮助的部署前审计(批评者是否击败裁判,以及裁判是否会验证它?)。

英文摘要

Despite theoretical promise, debate as a scalable oversight protocol has produced mixed empirical results: gains in some settings, and null effects in others, especially when the judge does not have information hidden from it. We study proposer-critic debate in a stronger-debater/weaker-judge setting on programmatically verifiable code and logic tasks. Debate helps the judge over a consultancy baseline when the critic provides a usable advantage: the critic's classification ability must exceed the judge's, and the judge must treat critic speeches as claims to verify rather than testimony to summarize. On the three of five pairings where the condition holds, proposer-critic debate's gains are statistically significant over consultancy, and these pairings are the most capable model pairings. On the two non-responder pairings in our set, debate produces null effects, and judge verification rates drop by tens of percentage points once a critic enters the transcript. In these cases the critic's binary-classification ability and the judge's are within noise of each other, and the critic's disagreement is parsed as testimony rather than a claim to check. Ablating rebuttal rounds from debate produces no measurable change in judge performance: a single independent critique recovers the bulk of debate's benefit at lower inference cost. These findings suggest a cheaper primitive for training-free scalable oversight in verifiable domains (answer, critique, judge) and a pre-deployment audit (does the critic beat the judge, and will the judge verify it?) that predicts when debate will help.

2605.27482 2026-05-28 cs.LG cs.AI

Energy-Structured Low-Rank Adaptation for Continual Learning

能量结构低秩自适应持续学习

Longhua Li, Lei Qi, Qi Tian, Xin Geng

发表机构 * School of Computer Science and Engineering, Southeast University, Nanjing, China(东南大学计算机科学与工程学院,南京,中国) Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China(新一代人工智能技术及其交叉应用重点实验室(东南大学),教育部,中国) Huawei Technologies, Shenzhen, China(华为技术有限公司,深圳,中国)

AI总结 提出E²-LoRA方法,通过能量集中和排序的低秩自适应以及动态秩分配策略,解决持续学习中的任务干扰和知识压缩问题,实现最优性能。

Comments Accepted by ICML 2026

详情
AI中文摘要

虽然正交子空间方法试图缓解持续学习中的任务干扰,但它们常常遭受跨基的能量扩散,阻碍知识压缩并耗尽未来任务的容量。我们观察到参数更新引起的输出特征漂移本质上是低秩的,并理论上证明沿该漂移的主方向保留参数可最小化输出重建误差。受此启发,我们提出能量集中和能量排序的低秩自适应(E²-LoRA)。通过显式地将知识排序并集中到主导秩中,E²-LoRA释放了后续任务的容量。此外,我们设计了一种动态秩分配策略,通过联合优化能量保留和模型可塑性来平衡稳定性和可塑性。在多个基准上的大量实验表明,E²-LoRA达到了最先进的性能。

英文摘要

While orthogonal subspace methods try to mitigate task interference in Continual Learning (CL), they often suffer from energy diffusion across the basis, hindering knowledge compaction and exhausting capacity for future tasks. We observe that output feature drift induced by parameter updates is inherently low-rank, and theoretically prove that preserving parameters along the principal directions of this drift minimizes the output reconstruction error. Motivated by this, we propose \textbf{E}nergy-Concentrated and \textbf{E}nergy-Ordered \textbf{Lo}w-\textbf{R}ank \textbf{A}daptation (E$^2$-LoRA). By explicitly ordering and concentrating knowledge into leading ranks, E$^2$-LoRA frees capacity for subsequent tasks. Furthermore, we design a dynamic rank allocation strategy to balance stability and plasticity by jointly optimizing energy retention and model plasticity. Extensive experiments across multiple benchmarks demonstrate that E$^2$-LoRA achieves state-of-the-art performance.

2605.27479 2026-05-28 cs.LG cs.AI

Resource-Constrained Affect Modelling via Variance Regularisation Pruning

资源约束下的情感建模:基于方差正则化剪枝

Kosmas Pinitas, Konstantinos Katsifis

发表机构 * Mediterranean College, Athens, Greece(地中海学院,希腊雅典) University of Derby, Derby, UK(德比大学,英国德比)

AI总结 提出方差正则化剪枝(VR)框架,通过考虑跨参与者稳定性来剪枝,在80%稀疏度下仍保持竞争性CCC性能,适用于资源受限的情感感知系统。

Comments This paper has been accepted at the 2026 PErvasive Technologies Related to Assistive Environments (PETRA)

详情
AI中文摘要

情感计算系统越来越多地嵌入到普及和交互环境中,如自适应游戏、辅助技术和资源受限平台,在这些环境中,计算效率必须与跨不同用户的可靠性相平衡。模型剪枝提供了一种减少计算需求的有效方法,但现有方法通常仅优化稀疏性,而不考虑参数移除如何影响个体间的鲁棒性。在这项工作中,我们引入了方差正则化剪枝(VR),一种明确将跨参与者稳定性纳入稀疏化过程的剪枝框架。VR不依赖于平均预测误差,而是根据每个连接对预测准确性和用户间变异性的联合贡献来评估,优先保留在分布差异下仍然可靠的参数。我们在AGAIN数据集上评估了所提出的方法,该数据集包含在九个情感诱发游戏环境中收集的唤醒度标注。实验结果表明,即使在没有额外微调的情况下,VR在80%稀疏度下仍能保持竞争性的一致性相关系数(CCC)性能,突显了其在真实世界、资源受限的情感感知系统中的适用性。总体而言,所提出的框架支持开发紧凑、鲁棒的情感模型,这些模型能够在真实的交互环境中可靠运行。

英文摘要

Affective computing systems are increasingly embedded in pervasive and interactive environments, such as adaptive games, assistive technologies, and resource-constrained platforms, where computational efficiency must be balanced with reliability across diverse users. Model pruning offers an effective way to reduce computational demands, yet existing approaches typically optimise for sparsity alone, without accounting for how parameter removal impacts robustness across individuals. In this work, we introduce Variance-Regularised Pruning (VR), a pruning framework that explicitly incorporates cross-participant stability into the sparsification process. Rather than relying solely on average prediction error, VR evaluates each connection based on its joint contribution to both prediction accuracy and variability across users, prioritising parameters that remain reliable under distributional differences. We evaluate the proposed approach on the AGAIN dataset, which includes arousal annotations collected across nine affect-eliciting game environments. Experimental results demonstrate that VR maintains competitive Concordance Correlation Coefficient (CCC) performance even at 80\% sparsity without additional fine-tuning, highlighting its suitability for deployment in real-world, resource-limited affect-aware systems. Overall, the proposed framework supports the development of compact, robust affective models that can operate reliably in real-world interactive environments.

2605.27476 2026-05-28 cs.LG cs.AI

Balancing Fidelity and Diversity in Diffusion Models via Symmetric Attention Decomposition: Hopfield Perspective

通过对称注意力分解平衡扩散模型中的保真度与多样性:Hopfield视角

Hyunmin Cho, Woo Kyoung Han, Kyong Hwan Jin

发表机构 * Department of Electrical Engineering, Korea University, Seoul, South Korea(韩国大学电子工程系,首尔,韩国)

AI总结 本文通过将Transformer中的注意力矩阵分解为对称和反对称部分,从Hopfield网络视角解释并调控扩散模型生成中的保真度-多样性权衡。

Comments Accepted to ICML 2026 (Regular)

详情
AI中文摘要

我们将Transformer中的预softmax注意力矩阵$\mathbf{QK^ op}$表征为一个关联记忆矩阵,编码输入特征之间的成对关联。通过将该矩阵分解为对称和反对称部分,我们将对称分量解释为控制能量景观的结构,而反对称分量则驱动该景观上的循环。利用对称分量诱导的能量公式,我们推导出Hopfield风格的稳定性度量,用于量化检索特征的稳定性。我们观察到Hopfield风格稳定性度量与生成中的保真度-多样性权衡之间存在有意义的关联。最后,我们提出一个可控的旋钮,通过修改底层动力学的循环来调节这一权衡。代码可在我们的GitHub上获取(https://github.com/hyeon-cho/Attention-Symmetric-Decomposition)。

英文摘要

We characterize the pre-softmax attention matrix $\mathbf{QK^\top}$ in transformers as an associative memory matrix encoding pairwise associations between input features. By decomposing this matrix into its symmetric and skew-symmetric parts, we interpret the symmetric component as governing the structure of the energy landscape, and the skew-symmetric component as driving circulation on that landscape. Leveraging the energy formulation induced by the symmetric component, we derive Hopfield-style stability measures that quantify the stability of retrieved features. We observe meaningful correlations between Hopfield-style stability measures and the fidelity-diversity trade-offs in generation. Finally, we propose a controllable knob to modulate this trade-off by modifying the circulation of the underlying dynamics. Code is available at our GitHub (https://github.com/hyeon-cho/Attention-Symmetric-Decomposition).

2605.27475 2026-05-28 cs.LG cs.AI

HEAL: Resilient and Self-* Hub-based Learning

HEAL:弹性且自适应的基于集线器的学习

Mohamed Amine Legheraba, Stefan Galkiewicz, Maria Gradinariu Potop-Butucaru, Sébastien Tixeuil

发表机构 * Sorbonne University(索邦大学) CNRS(法国国家科学研究中心) LIP6(巴黎第6大学信息学院) Institut Universitaire de France(法国国家科学研究中心)

AI总结 提出一种名为HEAL的跨层去中心化学习框架,通过结合联邦学习、八卦学习和流行病学习的优势,利用自组织自愈的P2P覆盖网络和Elevator算法动态选择聚合节点,在无崩溃场景下性能与联邦学习相当,同时在崩溃和波动环境中优于八卦学习和流行病学习。

详情
AI中文摘要

去中心化学习通过将数据和计算分布在节点上,增强了隐私性、可扩展性和容错性。一种流行的方法是联邦学习,它依赖于中央聚合器,但面临服务器脆弱性、可扩展性问题、隐私风险以及最重要的单点故障等挑战。另一种方法是八卦学习和流行病学习,它们通过节点间的点对点模型更新交换实现完全去中心化,确保了鲁棒性和隐私性,但代价是模型收敛速度较慢。在这项工作中,我们提出了一种新颖的去中心化学习框架,称为HEAL。HEAL是首个跨层去中心化学习框架,它利用优化的自组织和自愈底层P2P覆盖网络,结合了联邦学习、八卦学习和流行病学习的优势。借助最近提出的Elevator算法,HEAL将动态选择的节点提升为聚合器。通过仿真,我们证明HEAL在无崩溃环境中具有与联邦学习相似的性能,同时完全去中心化且具有容错性。在崩溃和波动频繁的环境中,HEAL优于八卦学习和流行病学习。

英文摘要

Decentralized learning enhances privacy, scalability, and fault tolerance by distributing data and computation across nodes. A popular approach is Federated learning, which relies on a central aggregator, yet faces challenges such as server vulnerabilities, scalability issues, privacy risks and most importantly, the single point of failure. Alternatively Gossip Learning and Epidemic Learning offer fully decentralization through peer-to-peer exchanges of model updates, ensuring robustness and privacy, at the price of slower model convergence. In this work, we introduce a novel decentralized learning framework called HEAL. HEAL is the first cross-layer decentralized learning framework that exploits an optimized self-organizing and self-healing underlying P2P overlay combining the strengths of Federated Learning, Gossip and Epidemic Learning. Leveraging the recently proposed Elevator algorithm, HEAL promotes dynamically chosen nodes to act as aggregators. Through simulations, we demonstrate that HEAL has similar performances to that of Federated Learning in crash-free settings, while being fully decentralized and fault-tolerant. In crash and churn prone environments HEAL outperforms Gossip and Epidemic Learning.

2605.27470 2026-05-28 cs.LG cs.AI

Detect by Yourself: Self-Designing Agentic Workflows for Few-Shot Graph Anomaly Detection

自行检测:少样本图异常检测的自设计代理工作流

Tairan Huang, Qiang Chen, Yili Wang, Yueyue Ma, Changlong He, Xiu Su, Yi Chen

发表机构 * CSU(中国科学技术大学) UST(香港大学)

AI总结 提出SignGAD框架,通过自设计任务条件检测工作流替代固定检测器,结合图编码与检测器选择及受保护重拟合策略,提升少样本图异常检测的适应性与可靠性。

详情
AI中文摘要

图异常检测旨在识别属性图中的异常节点,并在实际应用中发挥重要作用。然而,现有的图异常检测方法仍面临两个关键挑战:1)固定流程,限制了其在有限监督下对不同图任务的适应性;2)弱证据,无法将上下文和结构异常信号明确纳入检测过程。在本文中,我们提出了一种新颖框架,即少样本图异常检测的自设计代理工作流(SignGAD)。具体来说,我们提出了一种新范式,将图异常检测任务从训练固定异常检测器重新定义为设计任务条件检测工作流。通过构建检测工作流,SignGAD选择合适的图编码和检测器设计以利用任务特定的异常证据。同时,我们引入了一种受保护的最终重拟合策略,通过校准重拟合接受度来优化所选工作流,从而增强有限监督下的可靠性。在多个真实世界数据集上进行的大量实验表明,SignGAD相比最先进方法取得了强劲性能,突显了其在图异常检测任务上的有效性。

英文摘要

Graph anomaly detection aims to identify anomaly nodes in attributed graphs and plays an important role in real-world applications. However, existing graph anomaly detection methods still face two key challenges: 1) fixed pipelines, which restrict their adaptability across different graph tasks under limited supervision; 2) weak evidence, which prevents them from explicitly incorporating contextual and structural anomaly signals into the detection process. In this paper, we propose a novel framework, self-designing agentic workflows for few-shot graph anomaly detection (SignGAD). Specifically, we propose a novel paradigm that reformulates graph anomaly detection task from training a fixed anomaly detector to designing task-conditioned detection workflows. By constructing detection workflows, SignGAD selects suitable graph encodings and detector designs to exploit task-specific anomaly evidence. Meanwhile, we introduce a guarded final refit strategy to refine the selected workflow by calibrating refit acceptance, enhancing reliability under limited supervision. Extensive experiments conducted on several real-world datasets demonstrate that SignGAD achieves strong performance against state-of-the-art methods, highlighting its effectiveness on graph anomaly detection tasks.