arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.31603 2026-06-01 cs.CV cs.AI 版本更新

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models

Lumos-Nexus: 面向视频统一模型的高效频率桥接与同质潜在空间

Jiazheng Xing, Hangjie Yuan, Lingling Cai, Xinyu Liu, Yujie Wei, Fei Du, Hai Ci, Tao Feng, Jiasheng Tang, Weihua Chen, Fan Wang, Yong Liu

发表机构 * Zhejiang University（浙江大学）； DAMO Academy, Alibaba Group（阿里云达摩院）； Hupan Lab（虎扑实验室）； National University of Singapore（新加坡国立大学）； Hong Kong University of Science and Technology（香港科技大学）； Fudan University（复旦大学）； Tsinghua University（清华大学）

AI总结提出Lumos-Nexus框架，通过两阶段训练和渐进频率桥接，在保持推理能力的同时显著提升视频生成保真度。

Comments Project page (https://jiazheng-xing.github.io/nexus-lumos-home/) and Code (https://github.com/alibaba-damo-academy/Lumos-Custom/) are available

详情

AI中文摘要

基于连接器的视频统一模型在指令引导的视频合成中展现出强大能力，但将大型高保真生成器集成到统一训练循环中计算成本过高，限制了可实现的视觉质量。因此，我们提出Lumos-Nexus，一个训练高效的统一视频生成框架，促进强推理驱动生成能力的发展，同时显著提升视觉保真度。Lumos-Nexus采用两阶段设计：1）训练时，仅将轻量级生成器与理解模块对齐，以学习接收推理驱动的语义控制。2）推理时，我们引入统一渐进频率桥接（UPFB），在共享潜在空间中逐步将生成任务移交给高容量预训练生成器，实现从粗到细的细化，在不牺牲推理质量的情况下生成高保真视频。为填补推理驱动视频生成基准的空白，我们引入VR-Bench，评估模型将推断意图转化为连贯且语义对齐的视频内容的能力。大量实验表明，Lumos-Nexus在VBench上实现了视觉真实感和时间连贯性的显著提升，同时在VR-Bench上展现出强大的基于推理的生成性能。代码和模型可在https://jiazheng-xing.github.io/nexus-lumos-home/获取。

英文摘要

Connector-based video unified models have demonstrated strong capability in instruction-grounded video synthesis, but integrating a large high-fidelity generator into the unified training loop is computationally prohibitive, limiting achievable visual quality. We therefore propose Lumos-Nexus, a training-efficient unified video generation framework that facilitates the development of strong reasoning-driven generation capabilities while significantly enhancing visual fidelity. Lumos-Nexus adopts a two-stage design: 1) During training, only a lightweight generator is aligned with the understanding block to learn to take in reasoning-driven semantic control. 2) During inference, we introduce Unified Progressive Frequency Bridging (UPFB) to progressively hand off generation to a high-capacity pretrained generator in the shared latent space, enabling coarse-to-fine refinement and producing high-fidelity videos without compromising reasoning quality. To fill the gap in reasoning-driven video generation benchmarks, we introduce VR-Bench, which assesses a model's capability to translate inferred intent into coherent and semantically aligned video content. Extensive experiments demonstrate that Lumos-Nexus achieves substantial gains in visual realism and temporal coherence on VBench, while exhibiting strong reasoning-based generative performance on VR-Bench. Code and models are available at https://jiazheng-xing.github.io/nexus-lumos-home/.

URL PDF HTML ☆

赞 0 踩 0

2605.31593 2026-06-01 cs.CR cs.AI 版本更新

LongTraceRL: 基于评分奖励从搜索智能体轨迹中学习长上下文推理

Nianyi Lin, Jiajie Zhang, Lei Hou, Juanzi Li

发表机构 * Tsinghua University（清华大学）

AI总结提出LongTraceRL框架，通过知识图谱随机游走生成多跳问题并利用搜索智能体轨迹构建分层干扰物，结合基于实体链的评分奖励进行过程监督，提升大语言模型在长上下文推理中的表现。

详情

AI中文摘要

长上下文推理仍然是大型语言模型的核心挑战，模型往往难以在大量干扰内容中定位和整合关键信息。基于可验证奖励的强化学习（RLVR）在此任务上展现出潜力，但现有方法受限于低混淆度的干扰物和稀疏的、仅基于结果的奖励信号，无法监督中间推理步骤。为解决这些问题，我们引入了 extsc{LongTraceRL}。在数据构建方面，我们通过知识图谱随机游走生成多跳问题，并利用搜索智能体轨迹构建\emph{分层干扰物}：智能体读取但未引用的文档（高混淆度）和搜索结果中出现但从未打开的文档（低混淆度），从而生成比随机采样或单次搜索构建的训练上下文更具挑战性的内容。在奖励设计方面，我们提出了一种\emph{评分奖励}，利用每条推理链上的黄金实体作为细粒度的实体级过程监督。该评分奖励仅应用于最终答案正确的响应（正向策略），以区分正确响应之间的推理质量，并防止奖励作弊。在五个长上下文基准上对三种推理LLM（4B-30B）进行的实验表明， extsc{LongTraceRL} 始终优于强基线，并鼓励全面、基于证据的推理。代码、数据集和模型可在 \href{https://github.com/THU-KEG/LongTraceRL}{https://github.com/THU-KEG/LongTraceRL} 获取。

视觉-语言模型在模糊输入下抑制女性表征

Arnau Marin-Llobet, Simon Henniger, Mahzarin R. Banaji

发表机构 * School of Engineering and Applied Sciences（工程与应用科学系）； Department of Psychology（心理学系）

AI总结本研究通过引入零样本度量LALS，发现视觉-语言模型在模糊输入下内部编码与输出存在系统性解耦，女性信号在生成前被抑制，揭示了模型对性别偏见的内部处理机制。

Comments 16 pages, 12 figures, 1 table

详情

AI中文摘要

对齐训练使视觉-语言模型（VLM）避免表达人口统计偏见，当性别清晰可见时，它们基本成功。但对于模糊输入（如全副武装的工人、从背后看到的人物）——实践中常见但很少研究的情况——我们发现，在模糊输入图像时，最小的提示压力就会暴露职业-性别默认值，模型甚至对强烈女性刻板印象的职业也倾向于男性。但这些输出是否反映了模型实际内部编码的内容？我们引入LALS（潜在关联倾向分数），一种零样本度量，将视觉标记激活投影到模型的文本嵌入空间中，以测量每个标记和层的概念关联。在15个职业、超过800张性别模糊图像和四个VLM上，内部表征和输出系统性地解耦：模型通常内部编码女性关联但输出男性。逐层分析揭示了一个不对称滤波器——男性信号端到端放大，而女性信号在中间网络达到峰值并在生成前被抑制——颜色消融实验表明，文化负载的视觉线索（如服装颜色）进一步调节这些内部关联。

英文摘要

Alignment teaches vision-language models (VLMs) to avoid expressing demographic biases, and when gender is clearly visible they largely succeed. Far less is known about ambiguous inputs (a worker in full gear, a figure seen from behind) cases common in practice yet rarely studied. We find that minimal prompting pressure exposes occupation-gender defaults when prompting ambiguous input images, with models collapsing to male even for strongly female-stereotyped occupations. But do these outputs reflect what models actually encode internally? We introduce LALS (Latent Association Leaning Score), a zero-shot metric that projects visual-token activations into the model's text-embedding space to measure concept associations per token and layer. Across 15 occupations, over 800 gender-ambiguous images, and four VLMs, internal representations and outputs are systematically decoupled: models often encode a female association internally yet output male. Layer-wise analysis reveals an asymmetric filter -- male signal amplifies end-to-end while female signal peaks mid-network and is suppressed before generation -- and a color ablation shows that culturally loaded visual cues such as clothing color further modulate these internal associations.

URL PDF HTML ☆

赞 0 踩 0

2605.31535 2026-06-01 cs.CV cs.AI cs.LG 版本更新

RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video

RayDer: 从真实世界视频中可扩展的自监督新视角合成

Ulrich Prestel, Stefan Andreas Baumann, Nick Stracke, Björn Ommer

发表机构 * Munich Center for Machine Learning (MCML)（慕尼黑机器学习中心 (MCML)）

AI总结提出统一前馈变压器RayDer，将相机估计、场景重建和渲染整合为单一骨干，实现自监督新视角合成的可扩展幂律缩放，在零样本开放集性能上媲美有监督方法。

Comments Project Page: https://compvis.github.io/rayder

详情

AI中文摘要

自监督新视角合成（NVS）在扩展方面仍然具有挑战性，尽管视频数据丰富，这主要是由于在真实视频上训练的脆弱性以及多网络系统设计的难以预测的缩放行为。我们引入了RayDer，一个统一的前馈变压器，将相机估计、场景重建和渲染整合到一个单一骨干中，将自监督NVS转化为一个适定的单模型缩放问题。一个最小的动态状态，被视为干扰因素，吸收时变内容，使得在无约束的真实世界视频上稳定训练成为可能。重要的是，RayDer将静态场景NVS作为其目标任务：动态内容仅作为可扩展的监督被利用，而不是像动态场景（4D）NVS那样重建。在多个模型大小和数量级的数据上，RayDer展示了与数据和计算量相关的清晰幂律缩放，并优于静态场景数据混合。在大量基准测试中，RayDer实现了与最先进的有监督方法相竞争的强大零样本开放集性能。项目页面：https://compvis.github.io/rayder

英文摘要

Self-supervised novel view synthesis (NVS) remains challenging to scale, despite the abundance of video data, largely due to the brittleness of training on realistic videos and the hard-to-predict scaling behavior of multi-network system designs. We introduce RayDer, a unified, feed-forward transformer that consolidates camera estimation, scene reconstruction, and rendering into a single backbone, turning self-supervised NVS into a well-posed single-model scaling problem. A minimal dynamic state, treated as a nuisance factor, absorbs time-varying content and enables stable training on unconstrained real-world video. Importantly, RayDer keeps static-scene NVS as its target task: dynamic content is leveraged purely as scalable supervision, not reconstructed as in dynamic-scene (4D) NVS. Across multiple model sizes and orders of magnitude in data, RayDer exhibits clean power-law scaling with data and compute, and outperforms static-scene data mixtures. On a large number of benchmarks, RayDer achieves strong zero-shot open-set performance competitive with state-of-the-art supervised approaches. Project Page: https://compvis.github.io/rayder

URL PDF HTML ☆

赞 0 踩 0

2605.31534 2026-06-01 cs.CV cs.AI 版本更新

Feature-Optimized Vision for Adaptive 3D Scene Reconstruction

面向自适应3D场景重建的特征优化视觉

Eric Liang

发表机构 * Oracle

AI总结提出一种自适应特征优化视觉前端，通过评分纹理、可重复性、独特性、预期三角化角度和空间覆盖来分配每视图特征预算，以最大化有效轨迹并降低重建RMSE。

详情

AI中文摘要

三维场景重建依赖于局部图像证据，这些证据既要在视觉上具有判别性，又要在几何上有用。固定的特征阈值和均匀的特征预算易于部署，但可能会在重复纹理、低视差区域或不稳定点上浪费计算。本文提出了一种用于3D重建的自适应特征优化视觉前端。该方法通过纹理、可重复性、独特性、预期三角化角度和空间覆盖对候选特征进行评分，然后在固定重建流程下分配每视图特征预算以最大化有效轨迹。一个小型合成多视图原型在走廊、立面、物体桌面和杂乱场景中评估了四种选择策略。与随机、仅纹理和均匀网格基线相比，自适应策略在保持广泛图像覆盖的同时，获得了最佳的质量感知完整性和最低的聚合重建RMSE。结果并非替代现代学习匹配或神经重建系统；它是一个模块化的前端策略，可以使经典和学习的3D流程更审慎地决定将计算花费在哪些视觉证据上。

英文摘要

Three-dimensional scene reconstruction depends on local image evidence that is both visually discriminative and geometrically useful. Fixed feature thresholds and uniform feature budgets are easy to deploy, but they can waste computation on repeated texture, low-parallax regions, or unstable points. This paper proposes an adaptive feature-optimized vision front end for 3D reconstruction. The method scores candidate features by texture, repeatability, distinctiveness, expected triangulation angle, and spatial coverage, then allocates a per-view feature budget to maximize useful tracks under a fixed reconstruction pipeline. A small synthetic multi-view prototype evaluates four selection policies across corridor, facade, object-table, and cluttered scenes. Compared with random, texture-only, and uniform-grid baselines, the adaptive policy obtains the best quality-aware completeness and the lowest aggregate reconstruction RMSE while preserving broad image coverage. The result is not a replacement for modern learned matching or neural reconstruction systems; it is a modular front-end policy that can make classical and learned 3D pipelines more deliberate about which visual evidence they spend compute on.

URL PDF HTML ☆

赞 0 踩 0

2605.31520 2026-06-01 cs.SE cs.AI cs.CR 版本更新

Separating Secrets from Placeholders: A Hybrid CNN-CodeBERT Framework for Three-Class Credential Leakage Detection

区分秘密与占位符：一种用于三类凭证泄露检测的混合CNN-CodeBERT框架

Maksuda Bilkis Baby, Khushika Shah, Naiyue Liang, Lei Zhang

发表机构 * Information Systems, University of Maryland, Baltimore County, USA（信息学院，马里兰大学巴尔的摩县分校，美国）； Computer Science and Electrical Engineering, University of Maryland, Baltimore County, USA（计算机科学与电气工程系，马里兰大学巴尔的摩县分校，美国）

AI总结针对现有凭证泄露检测工具高误报率的问题，提出一种基于CodeBERT语义理解与字符级模式识别的三分类框架，将占位符/弱凭证作为独立类别建模，在新构建的9426样本数据集上达到0.86的MCC和0.90的宏F1分数，将高严重性警报减少33%而不牺牲安全覆盖。

Comments Accepted at ICSME 2026 (International Conference on Software Maintenance and Evolution)

详情

AI中文摘要

扩展匈牙利语对话ASR：BEA-Dialogue+语料库

Máté Gedeon, Piroska Zsófia Barta, Péter Mihajlik, Katalin Mády

发表机构 * Department of Telecommunications and Artificial Intelligence（电信与人工智能系）； Budapest University of Technology and Economics（布达佩斯技术与经济大学）； Speechtex Ltd.（Speechtex公司）； ELTE Research Centre for Linguistics（ELTE语言学研究中心）

AI总结针对匈牙利语对话语音识别训练数据不足的问题，本文通过放宽分割标准扩展BEA-Dialogue语料库至200小时，并评估基于Whisper和FastConformer的模型，证明基于序列化输出训练的微调能持续改善识别性能。

详情

AI中文摘要

匈牙利语对话自动语音识别受到公开对话式训练数据有限的制约。BEA-Dialogue语料库解决了这一需求，但其严格的说话人分离的训练/开发/测试分割将可用材料减少到仅85小时。在本文中，我们介绍了BEA-Dialogue+，这是该语料库的扩展版本，它放宽了实验者和对话伙伴的分割标准，同时保持主要说话人的完全分离。这产生了200小时转录的自然对话，并允许对额外训练数据与分割间说话人重叠之间的权衡进行受控研究。我们在两个语料库版本上评估了多个基于Whisper和FastConformer的模型，包括基于序列化输出训练（SOT）的对话转录微调。我们的结果表明，对于未经微调的模型，较大的语料库更具挑战性，而基于SOT的适应在WER、CER、cpWER和cpCER上产生了一致的改进。总体而言，BEA-Dialogue+为匈牙利语对话ASR提供了一个更大但仍具挑战性的基准，以及用于训练和评估对话转录系统的实用资源。

英文摘要

Conversational automatic speech recognition in Hungarian is constrained by the limited amount of publicly available dialogue-style training data. The BEA-Dialogue corpus addresses this need, but its strictly speaker-disjoint train/dev/eval split reduces the usable material to only 85 hours. In this paper, we introduce BEA-Dialogue+, an expanded version of the corpus that relaxes the split criterion for experimenters and dialogue partners while preserving complete separation of the primary speakers. This results in 200 hours of transcribed natural conversations and enables a controlled study of the trade-off between additional training data and speaker overlap across the splits. We evaluate several Whisper- and FastConformer-based models on both corpus versions, including Serialized Output Training (SOT)-based fine-tuning for dialogue transcription. Our results show that the larger corpus is more challenging for models without fine-tuning, whereas SOT-based adaptation yields consistent improvements in WER, CER, cpWER, and cpCER. Overall, BEA-Dialogue+ provides a substantially larger yet still demanding benchmark for Hungarian dialogue ASR, and a practical resource for training and evaluating dialogue transcription systems.

URL PDF HTML ☆

赞 0 踩 0

2605.31468 2026-06-01 cs.AI 版本更新

AutoSci: A Memory-Centric Agentic System for the Full Scientific Research Lifecycle

AutoSci: 面向完整科学生命周期的以记忆为中心的智能体系统

Weitong Qian, Beicheng Xu, Zhongao Xie, Bowen Fan, Guozheng Tang, Jiale Chen, Xinzhe Wu, Mingtian Yang, Chenyang Di, Jiajun Li, Lingching Tung, Peichao Lai, Yifei Xia, Ziyi Guo, Yanwei Xu, Yanzhao Qin, Shaoduo Gan, Xupeng Miao, Bin Cui

发表机构 * Peking University（北京大学）

AI总结提出AutoSci，一个以记忆为中心、支持完整科学生命周期的智能体系统，通过结构化记忆、多阶段流程、有向无环图增强和演化机制实现自动化科研。

详情

AI中文摘要

科学研究传统上是人力密集型的，要求研究人员在漫长的项目周期中协调文献、想法、实验、手稿和审稿回复。基于LLM的科学智能体的兴起为自动化这一过程创造了机会。这样的系统必须支持完整的研究生命周期，跨项目维护结构化的持久记忆，并随时间改进自身的研究流程。然而，现有系统要么部分满足，要么未能满足这些要求，留下了统一自动化科学研究系统的空白。因此，我们提出了AutoSci，一个面向完整科学生命周期的以记忆为中心的智能体系统。AutoSci围绕四个模块组织。SciMem提供受模式约束的研究记忆，将可重复使用的科学知识分离为长期知识记忆，将项目级工件（如想法、实验、手稿和审稿）分离为活跃研究记忆。SciMem通过一个控制状态、上下文、验证、反馈和编排的框架执行从文献理解到反驳的五阶段生命周期。SciDAG通过有向无环图形式的多智能体操作符和可重用的阶段特定模板增强困难技能。SciEvolve将来自用户、实验、审稿和外部环境的反馈信号转化为对SciMem组织、SciFlow技能和SciDAG模板的版本化更新。这些模块共同使AutoSci成为一个持久的研究环境，能够在研究项目间执行、记忆和演化。代码仓库位于https://github.com/skyllwt/AutoSci。

英文摘要

Scientific research has traditionally been human-intensive, requiring researchers to coordinate literature, ideas, experiments, manuscripts, and review responses across long project cycles. The rise of LLM-based scientific agents creates an opportunity to automate this process. Such a system must support the full research lifecycle, maintain structured persistent memory across projects, and improve its own research procedures over time. However, existing systems either partially satisfy or fail to satisfy these requirements, leaving a gap for a unified automated scientific research system. As a result, we present AutoSci, a memory-centric agentic system for the full scientific research lifecycle. AutoSci is organized around four modules. SciMem provides schema-governed research memory, separating Long-Term Knowledge Memory for reusable scientific knowledge from Active Research Memory for project-level artifacts such as ideas, experiments, manuscripts, and reviews. SciFlow executes a five-stage lifecycle from literature understanding to rebuttal through a harness that controls state, context, verification, feedback, and orchestration. SciDAG augments difficult skills with DAG-shaped multi-agent operators and reusable stage-specific templates. SciEvolve converts feedback signals from users, experiments, reviews, and external environments into versioned updates to SciMem organization, SciFlow skills, and SciDAG templates. Together, these modules make AutoSci a persistent research environment that can execute, remember, and evolve across research projects. The code repository is available at https://github.com/skyllwt/AutoSci.

URL PDF HTML ☆

赞 0 踩 0

2605.31464 2026-06-01 cs.LG cs.AI 版本更新

GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

GPU预测器：语言模型作为内核运行时优化的选择性替代

Zaid Khan, Justin Chih-Yao Chen, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal

发表机构 * UNC Chapel Hill（北卡罗来纳大学教堂山分校）； AI2 ； Johns Hopkins University（约翰霍普金斯大学）； University of Texas at Austin（德克萨斯大学奥斯汀分校）

AI总结研究利用语言模型作为GPU内核性能的选择性替代，通过强化学习提高预测准确性和校准度，在有限GPU评估预算下加速内核搜索。

Comments Code: https://github.com/codezakh/gpu-forecasters

详情

AI中文摘要

GPU内核是现代深度学习的主力，优化它们（通过进化搜索或编码代理）通常需要在目标硬件上重复测量。虽然这些测量提供了内核搜索所需的地面真实信号，但成本高昂，因为每次评估内核都需要编译并在GPU上重复执行。随着LLM推理的改进降低了编写新内核的成本，并且LLM驱动的搜索扩展到大的搜索预算，设备上的评估成为瓶颈。为了解决这个问题，我们研究LLM如何通过预测所提议内核的性能，作为选择性GPU替代用于内核评估。一个有用的替代应该是准确的，并且应该是选择性的，知道何时可能出错，并推迟到GPU。为了评估替代，我们测量其预测是否准确、校准良好，并且在有限的GPU测量预算下对恢复快速内核实际有用。接下来，我们研究强化学习是否能提高预测准确性和置信度校准。我们的实验表明，LLM可以准确预测相对内核性能，并且通过强化学习可以提高其实用性。在内核搜索中使用替代，使得搜索在相同的GPU评估预算下可以考虑多倍的候选，从而比同等预算的基线找到更快的内核。这些结果表明，LLM可以在内核优化中发挥更广泛的作用，作为GPU的虚拟模型，而不仅仅是搜索的内核生成器。

英文摘要

GPU kernels are the workhorse of modern deep learning, and optimizing them (via evolutionary search or coding agents) usually requires repeated measurement on target hardware. While these measurements provide the ground-truth signal necessary for kernel search, they are costly, because each evaluation of a kernel requires compilation and repeated execution on a GPU. As improvements in LLM inference reduce the cost of writing novel kernels and LLM-driven searches scale to large search budgets, on-device evaluation becomes a bottleneck. To address this, we study how LLMs can serve as selective GPU surrogates for kernel evaluation, by forecasting the performance of proposed kernels. A useful surrogate should be accurate, and it should be selective, by knowing when it could be wrong, and deferring to the GPU. To evaluate surrogates, we measure whether their forecasts are accurate, calibrated, and practically useful for recovering fast kernels under limited GPU-measurement budgets. Next, we study whether reinforcement learning can improve forecast accuracy and confidence calibration. Our experiments demonstrate that LLMs can accurately forecast relative kernel performance, that their utility can be improved through reinforcement learning. Used inside a kernel search, the surrogate lets the search consider several times as many candidates under the same GPU evaluation budget, and that leads to finding faster kernels than an equal-budget baseline. These results suggest that LLMs can play a broader role in kernel optimization, by acting as virtual models of a GPU rather than solely as kernel generators for search.

URL PDF HTML ☆

赞 0 踩 0

2605.31463 2026-06-01 cs.LG cs.AI cs.CL cs.DC 版本更新

DOA：面向语音大语言模型的长形式同声传译的无训练解码器仅注意力策略

Sara Papi, Luisa Bentivogli

发表机构 * Fondazione Bruno Kessler（布鲁诺·克塞塞基金会）

AI总结提出DOA策略，利用解码器自注意力导出代理对齐，无需训练即可实现语音大语言模型在长形式同声传译中的流式决策。

详情

AI中文摘要

同声语音到文本翻译（SimulST）在语音尚未完成时生成翻译，需要流式策略来决定何时读取和何时写入。最先进的方法依赖于基于注意力的编码器-解码器模型，其中交叉注意力提供显式的对齐信号。相比之下，语音大语言模型（SpeechLLMs）是仅解码器架构，仅依赖自注意力。这引发了一个核心问题：解码器自注意力是否包含足够稳定的对齐信号来指导流式策略。此外，现有方法通常依赖于基于训练的适应或启发式等待-$k$策略，并且尚未在长形式场景中得到验证。为了填补这些空白，我们提出了仅解码器注意力（DOA），这是一种无训练策略，通过从自注意力中导出代理对齐，使现成的SpeechLLMs能够进行长形式同声传译。在Phi4-Multimodal和Qwen3-Omni上的实验表明，DOA提供了有效的对齐信号来支持流式决策，实现了低延迟的长形式SimulST，其质量接近无需重新训练的离线解码。

面向手语翻译的大语言模型目标端释义增强

Pedro Dal Bianco, Jean Paul Nunes Reinhold, Oscar Stanchi, Facundo Quiroga, Franco Ronchetti, Ulisses Brisolara Corrêa

发表机构 * III-LIDI Universidad Nacional de La Plata（III-LIDI国立拉普拉塔大学）； CDTEC, Federal University of Pelotas（CDTEC，联邦 Pelotas 大学）； CONICET III-LIDI ； Comision de Investigaciones Cientificas Universidad Nacional de La Plata（科学委员会国立拉普拉塔大学）； Universidade Federal de Pelotas（联邦 Pelotas 大学）

AI总结针对手语翻译中平行语料稀缺和目标词汇长尾分布的问题，提出利用GPT-4o生成参考句子的受控释义变体进行目标端增强，并在三种手语数据集上验证了方法的有效性。

Comments Accepted at GenSign (https://genai4sl.github.io/) at CVPR 2026. Non proceedings track

详情

AI中文摘要

手语翻译（SLT）仍然受到有限的配对手语视频/文本语料库和长尾目标词汇的限制。我们研究了目标端增强方法，其中GPT-4o生成参考句子的受控释义变体，而手语输入保持不变。采用基于Signformer姿态的Transformer，在两阶段调度下进行训练：先在增强语料库上预训练，然后在原始参考句子上微调。我们在三个具有互补挑战的数据集上进行了评估：PHOENIX14T（德国手语），具有适度的词汇多样性；GSL（希腊手语），具有高度受控、重复的录制；以及LSA-T（阿根廷手语），具有严重的长尾稀疏性。在PHOENIX14T上，增强将BLEU-4从9.56提高到10.33。接近饱和的GSL基线和极其稀疏的LSA-T设置揭示了该方法的局限性。据我们所知，这是第一项将LLM生成的目标端释义和LLM作为评估者应用于手语翻译的研究。语义评估揭示了词汇重叠指标低估的忠实度提升。

英文摘要

Sign language translation (SLT) remains constrained by limited paired sign-video/text corpora and heavy-tailed target vocabularies. We study target-side augmentation in which GPT-4o generates controlled paraphrase variants of reference sentences while the sign input remains unchanged. A Signformer-style pose-based Transformer is trained under a two-stage schedule: pre-training on the augmented corpus followed by fine-tuning on the original references. We evaluate on three datasets spanning complementary challenges: PHOENIX14T (German Sign Language), with moderate lexical diversity; GSL (Greek Sign Language), with highly ontrolled, repetitive recordings; and LSA-T (Argentinian Sign Language), with severe long-tail sparsity. On PHOENIX14T, augmentation improves BLEU-4 from 9.56 to 10.33. The near-saturated GSL baseline and extremely sparse LSA-T setting reveal the limits of the approach. To our knowledge, this is the first study to apply LLM-generated target-side araphrases and LLM-as-a-Judge evaluation to SLT. The semantic evaluation reveals gains in fidelity that lexical overlap metrics understate.

URL PDF HTML ☆

赞 0 踩 0

2605.31377 2026-06-01 cs.IR cs.AI 版本更新

DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval

DynaTree: 面向时效性新闻检索的动态智能检索树

Siyuan Qi, Xinyuan Wang, Yingxuan Yang, Haochuan Guo, Jianghao Lin, Weiwen Liu, Yong Yu, Weinan Zhang

发表机构 * Shanghai Jiao Tong University（上海交通大学）

AI总结提出DynaTree两阶段框架，通过离线构建可复用检索树和在线轻量子树选择，实现高效、自适应的时效性新闻检索，在Syft新闻基准和BEIR数据集上优于标准RAG和现有智能体方法。

详情

dashi: 一个用于数据集偏移表征以支持可信AI开发和部署的Python库

David Fernández-Narro, Pablo Ferri, Ángel Sánchez-García, Juan M. García-Gómez, Carlos Sáez

发表机构 * Biomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones, Universitat Politècnica de Valéncia（生物医学数据科学实验室，信息与通信技术大学，巴塞罗那理工大学）

AI总结本文介绍dashi，一个开源Python库，通过无监督（基于信息几何和非参数统计流形）和有监督方法，对数据集偏移进行探索、量化和表征，以支持AI生命周期中的可信度评估。

详情

AI中文摘要

人工智能（AI）生命周期需要对底层数据动态有透彻理解，以实现稳健、安全且经济高效的AI开发和使用。数据集偏移定义为训练和测试数据分布之间的变化。无论是随时间（时间性）还是跨不同站点（多源）发生，它们都可能严重降低模型性能并损害数据质量。这在健康AI中尤为重要，因为不受控制的偏移在训练和操作阶段都可能严重影响患者的安全和基本权利。虽然协变量偏移、先验偏移和概念偏移的理论基础已很完善，但缺乏可访问且全面的软件工具来执行其分析。我们介绍了dashi，一个开源Python库，旨在对数据集偏移进行探索、量化和表征。dashi提供双重方法：一种无监督方法，利用信息几何和非参数统计流形进行数据变异性表征和分析（例如，信息几何时间图和多源变异性指标，如全局概率偏差和源概率异常度）；以及一种有监督方法，量化和表征模型性能退化。无监督和有监督方法均适用于用户定义的时间批次和域/源批次。我们在三个模拟和真实世界的健康AI案例研究（妊娠期糖尿病、COVID-19和紧急医疗调度）中展示了dashi的实用性。通过提供交互式视觉分析和变异性指标，dashi支持AI生命周期阶段的可信度，通过评估数据一致性和AI性能实现稳健且安全的机器学习管道。

英文摘要

The Artificial Intelligence (AI) life cycle requires a thorough understanding of the underlying data dynamics for robust, safe and cost-effective AI development and use. Dataset shifts are defined as changes between train and test data distributions. Whether occurring over time (temporal) or across different sites (multi-source), they can severely degrade model performance and compromise data quality. This is particularly important in health AI, where the safety and fundamental rights of patients can be severely affected by uncontrolled shifts both at training and operational stages. While the theoretical foundations of covariate, prior, and concept shifts are well established, there is a lack of accessible and comprehensive software tools to perform their analysis. We introduce dashi, an open-source Python library designed for the exploration, quantification, and characterization of dataset shifts. dashi provides a dual approach: an unsupervised approach that leverages information geometry and non-parametric statistical manifolds to data variability characterization and analysis (e.g., Information Geometric Temporal plots and Multi-Source Variability metrics like Global Probabilistic Deviation and Source Probabilistic Outlyingness), and a supervised approach that quantifies and characterizes model performance degradation. Both unsupervised and supervised approaches work across user-defined temporal and domain/source batches. We demonstrate the utility of dashi on three simulated and real-world health AI case studies on gestational diabetes mellitus, COVID-19 and emergency medical dispatch. By providing interactive visual analytics and variability metrics, dashi supports trustworthiness of AI life cycle stages enabling robust and safe machine learning pipelines through the assessment of data coherence and AI performance.

URL PDF HTML ☆

赞 0 踩 0

2605.31354 2026-06-01 cs.AI cs.LG 版本更新

Diagnosing Failure Modes of Shared-State Collaboration in Resource-Constrained Visual Agents

资源受限视觉代理中共享状态协作的故障模式诊断

Yunpeng Zhou

发表机构 * Nanjing University of Information Science \& Technology, Nanjing, China

AI总结本文通过噪声累积视角研究弱学习者（4B-8B模型）在共享工作记忆下的协作推理故障模式，提出CoSee审计框架追踪文档视觉问答中的信息流，发现朴素共享工作空间会放大幻觉而非解决，并识别出噪声强化和策略崩溃两种主要故障模式。

详情

AI中文摘要

模块化视觉推理系统越来越依赖共享工作记忆进行多步协作，但低容量场景下中间状态演化的故障动态仍未被充分探索。我们通过噪声累积的视角研究弱学习者（4B-8B模型）的协作推理故障模式。我们引入了CoSee，一个审计框架，形式化了读-写-验证循环以追踪文档视觉问答中的信息流。在多页、图表和基于网页的基准测试中，我们发现了一个反直觉的退化：朴素的共享工作空间往往放大而非解决幻觉。我们识别出两种主要的故障模式：噪声强化（未基于事实的笔记被重新用作证据）和策略崩溃（添加的上下文使模型转向欠指定的短形式答案）。使用成本-准确率帕累托前沿，我们表明增加计算量在没有显式验证的情况下可能与性能负相关。我们的发现表明，对于资源受限的代理，瓶颈不在于推理深度而在于通信保真度，为可靠的模块化设计提供了轨迹级诊断和机制基线。

英文摘要

Modular visual reasoning systems increasingly rely on shared working memory for multi-step collaboration, yet the failure dynamics of intermediate state evolution in low-capacity regimes remain underexplored. We study failure modes of collaborative reasoning with weak learners (4B--8B models) through the lens of noise accumulation. We introduce CoSee, an auditing framework that formalizes the read-write-verify loop to trace information flow in document visual question answering. Across multi-page, chart, and web-based benchmarks, we find a counter-intuitive degradation: naive shared workspaces often amplify hallucinations rather than resolve them. We identify two dominant failure modes: Noise Reinforcement, where ungrounded notes are reused as evidence, and Policy Collapse, where added context shifts the model toward under-specified, short-form answers. Using cost-accuracy Pareto frontiers, we show that increased compute can correlate negatively with performance without explicit verification. Our findings suggest that for resource-constrained agents, the bottleneck lies not in reasoning depth but in communication fidelity, providing trace-level diagnostics and a mechanistic baseline for reliable modular design.

URL PDF HTML ☆

赞 0 踩 0

2605.31349 2026-06-01 cs.CL cs.AI cs.CV cs.MM 版本更新

FBHM: Functional Benchmarking and Steering of VLMs for Hateful Meme Detection

FBHM：用于仇恨模因检测的功能性基准测试与视觉语言模型引导

Paramananda Bhaskar, Naquee Rizwan, Daksh Jogchand, Saurabh Kumar Pandey, Animesh Mukherjee

发表机构 * Indian Institute of Technology (IIT), Kharagpur（印度理工学院（IIT）卡拉格浦尔）； Microsoft（微软）

AI总结针对现有基准无法因果评估视觉语言模型漏洞的问题，提出基于25种修辞功能和10个目标社区构建的FBHM基准，并采用可学习引导向量（LSV）在极低数据量下提升模型性能约30个Macro-F1点。

详情

AI中文摘要

仇恨模因检测对于视觉语言模型仍是一个严峻挑战，因为现有基准在结构上是观察性的——混淆了修辞仇恨机制与目标社区特征，并阻碍了对模型漏洞的因果评估。为解决这一问题，我们引入了FBHM，一个系统策划的基于功能的仇恨模因基准，沿两个正交轴构建：25种不同的修辞功能和10个目标社区（总共5,000个模因）。对最先进的视觉语言模型进行基准测试揭示了一个严重的泛化差距：在标准数据集上高度准确的模型在FBHM上灾难性地下降到接近随机性能，证明它们利用了数据集特定的启发式方法而非稳健的多模态推理。为了高效缩小这一差距，我们提出了LSV（可学习引导向量），一种超低数据量策略，在仅500个引导样本（50个独特基础模因）上应用因果干预目标，将FBHM性能提升约30个Macro-F1点，同时优于上下文学习和PEFT，且不降低源域性能。

英文摘要

Hateful meme detection remains a formidable challenge for vision-language models, as existing benchmarks are structurally observational - confounding rhetorical hate mechanisms with target community features and preventing causal evaluation of model vulnerabilities. To address this, we introduce FBHM, a systematically curated benchmark of Functionality Based Hateful Memes constructed along two orthogonal axes: 25 distinct rhetorical functionalities and 10 target communities (5,000 memes total). Benchmarking state-of-the-art VLMs reveals a severe generalization gap: models highly accurate on standard datasets catastrophically drop to near-random performance on FBHM, proving they exploit dataset-specific heuristics rather than robust multimodal reasoning. To efficiently close this gap, we propose LSV (learnable steering vectors), an ultra-low data regime strategy that applies a causal intervention objective on as few as 500 steering samples (50 unique base memes), boosting FBHM performance by ~30 Macro-F1 points while outperforming in-context learning and PEFT without degrading source-domain performance.

URL PDF HTML ☆

赞 0 踩 0

2605.31340 2026-06-01 cs.HC cs.AI 版本更新

Appropriateness of Empathy in AI: A Signal-Cost Perspective

AI中同理心的适当性：信号-成本视角

Chi-Ching Juan, Tao Wang, Harold Lee

发表机构 * School of Information University of Toronto（信息学院多伦多大学）； Independent Researcher（独立研究者）

AI总结本文从信号-成本视角出发，运用信号理论提出信号成本代理（情感丰富性、观点采择和情境定制）来评估AI同理心的适当性，建立多维度框架以系统评价同理心是否适应用户需求。

Comments Accepted by IEEE CASCON 2025

2605.31330 2026-06-01 cs.GT cs.AI cs.MA math.OC nlin.AO 版本更新

Social welfare optimisation under institutional reward and punishment

制度奖惩下的社会福利优化

Van An Nguyen, Vuong Khang Huynh, Huu Loi Bui, Hai Anh Ha, Quang Dung Le, Tan Dat Nguyen, Ngoc Ngu Nguyen, Zhao Song, Manh Hong Duong, Le Hong Trang, The Anh Han

发表机构 * Faculty of Computer Science and Engineering, Ho Chi Minh City University of Technology (HCMUT), Vietnam（胡志明市技术大学计算机科学与工程学院，越南）； Vietnam National University - Ho Chi Minh City (VNU-HCM), Vietnam（越南胡志明市国家大学（VNU-HCM），越南）； School of Computing, Engineering and Digital Technologies, Teesside University, United Kingdom（泰赛德大学计算、工程与数字技术学院，英国）； School of Mathematics, University of Birmingham, Birmingham, United Kingdom（伯明翰大学数学学院，英国）

AI总结研究在有限混合群体中，通过奖励合作者或惩罚背叛者来最大化社会福利的激励机制，推导出最优激励的显式表达式和相变条件，并比较奖励与惩罚的福利效果。

详情

AI中文摘要

制度激励被广泛用于促进从人类社会到多智能体和AI系统中自主、自利代理人的合作。现有工作通常将激励设计视为双目标问题：在实现高长期合作频率的同时最小化制度成本。此类方案是否也能最大化社会福利——即扣除制度支出后的总人口收益——在很大程度上尚未被探索。我们针对有限、充分混合的群体中参与社会困境（捐赠博弈和公共品博弈）的情况，开发了一个以福利为中心的激励框架，同时考虑对合作者的奖励和对背叛者的惩罚。对于每种机制，我们推导出预期社会福利的显式表达式，并刻画其如何依赖于激励效率和选择强度。在解析上，我们识别出社会福利具有单一最优激励水平的参数区间，以及出现定性相变、福利非单调且具有多个局部最优的区间。我们证明任何最大化福利的激励要么为零，要么集中在简单的闭式目标附近，并提供了一种高效算法来计算这些最优值。通过比较奖励和惩罚，我们进一步推导出在给定预算下奖励在福利方面优于惩罚的闭式条件。总体而言，我们的结果揭示了针对成本或合作频率优化的激励与最大化福利的激励之间存在系统性差距。

英文摘要

Institutional incentives are widely used to promote cooperation among autonomous, self-regarding agents, from human societies to multi-agent and AI systems. Existing work typically treats incentive design as a bi-objective problem: minimise institutional cost while achieving a high long-run frequency of cooperation. Whether such schemes also maximise social welfare - total population payoff net of institutional expenditure - has remained largely unexplored. We develop a welfare-centric framework for institutional incentives in finite, well-mixed populations playing a social dilemma (Donation Game and Public Goods Game), considering both rewards for cooperators and punishments for defectors. For each mechanism, we derive explicit expressions for expected social welfare and characterise how it depends on incentive efficiency and selection intensity. Analytically, we identify parameter regimes where social welfare has a single optimal incentive level and regimes with qualitative phase transitions, in which welfare becomes non-monotonic with multiple local optima. We prove that any welfare-maximising incentive is either zero or concentrated around a simple closed-form target, and we provide an efficient algorithm to compute these optima. Comparing reward and punishment, we further derive close-formed conditions under which reward outperform punishment in terms of social welfare for any given budget. Overall, our results reveal a systematic gap between incentives optimised for cost or cooperation frequency and those that maximise welfare.

URL PDF HTML ☆

赞 0 踩 0

2605.31324 2026-06-01 cs.LG cs.AI 版本更新

Inconsistency-Aware Minimization: Improving Generalization with Unlabeled Data

不一致感知最小化：利用无标签数据提升泛化能力

Hee-Sung Kim, Hyeonseong Kim, Sungyoon Lee

发表机构 * Department of Computer Science, Hanyang University, Seoul, Korea（汉阳大学计算机科学系）

AI总结本文提出一种基于信息几何的局部不一致性度量，并据此设计不一致感知最小化（IAM）方法，通过无标签数据计算该度量并融入训练目标，从而提升深度学习模型的泛化性能。

Comments ICML 2026

详情

AI中文摘要

估计泛化差距并开发改进泛化的优化方法对于深度学习模型至关重要，无论是从理论理解还是实际应用角度。利用无标签数据实现这些目标在实际场景中具有显著优势。本文从神经网络参数空间的信息几何角度出发，引入了一种新的泛化度量——局部不一致性。局部不一致性的一个关键特征是它可以在没有显式标签的情况下计算。我们通过将局部不一致性与Fisher信息矩阵和损失Hessian矩阵联系起来，建立了理论基础。实验上，我们证明了局部不一致性与泛化差距相关。基于这些发现，我们提出了不一致感知最小化（IAM），将局部不一致性纳入训练目标。我们证明，在标准监督学习设置中，IAM增强了泛化能力，实现了与现有方法（如锐度感知最小化）相当的性能。此外，IAM在半监督和自监督学习场景中表现出有效性，其中局部不一致性是从无标签数据计算得出的。

英文摘要

Estimating the generalization gap and developing optimization methods that improve generalization are crucial for deep learning models, for both theoretical understanding and practical applications. Leveraging unlabeled data for these purposes offers significant advantages in real-world scenarios. This paper introduces a novel generalization measure, local inconsistency, derived from an information-geometric perspective on the parameter space of neural networks. A key feature of local inconsistency is that it can be computed without explicit labels. We establish theoretical underpinnings by connecting local inconsistency to the Fisher information matrix and the loss Hessian. Empirically, we demonstrate that local inconsistency correlates with the generalization gap. Based on these findings, we propose Inconsistency-Aware Minimization (IAM), which incorporates local inconsistency into the training objective. We demonstrate that in standard supervised learning settings, IAM enhances generalization, achieving performance comparable to that of existing methods such as Sharpness-Aware Minimization. Furthermore, IAM exhibits efficacy in semi- and self-supervised learning scenarios, where the local inconsistency is computed from unlabeled data.

URL PDF HTML ☆

赞 0 踩 0

2605.31308 2026-06-01 cs.AI 版本更新

TraceGraph: Shared Decision Landscapes for Diagnosing and Improving Agent Trajectories

TraceGraph: 用于诊断和改进智能体轨迹的共享决策景观

Junjie Nian, Kang Chen, Ge Zhang, Yixin Cao, Yugang Jiang

发表机构 * Fudan University（复旦大学）； Shanghai Innovation Institute（上海创新研究院）

AI总结提出TraceGraph图框架，将多模型智能体轨迹构建为共享决策景观，通过事件摘要和陷阱感知恢复管线提升SWE-bench解决率。

详情

AI中文摘要

智能体基准测试越来越多地记录丰富的交互轨迹，但评估通常将每次运行简化为通过率或奖励分数。我们引入了TraceGraph，一个基于图的框架，将发布的多模型智能体轨迹转化为共享决策景观。对于每个任务，TraceGraph在引入模型身份之前，从聚合的运行中构建一个关于可观察动作-观察状态的图。然后，它叠加结果信息丰富的生产核心和陷阱区域，并用三个事件总结每条轨迹：访问、陷阱暴露和修复。跨越五个基准测试分割的轨迹中，TraceGraph配置文件揭示了被聚合分数隐藏的导航差异，并显示不同分割在奖励避免陷阱还是从中恢复方面有所不同。相同的TraceGraph景观还激发了SWE-bench的陷阱感知恢复管线：运行时检测器在匹配历史陷阱区域的状态上触发，然后从相同前缀评估轻量级延续策略。在触发状态上，最佳聚合单因子策略将每个提供者触发子集上的官方解决率从40.4%提高到43.5%，在共同触发实例上从41.0%提高到44.8%，并具有提供者特定的主动组件。总体而言，TraceGraph提供了一个过程词汇，用于询问智能体基准测试测试什么、模型在共享景观上何处出现分歧，以及失败区域如何指导下游改进。

英文摘要

Agent benchmarks increasingly record rich interaction trajectories, yet evaluation often reduces each rollout to a pass rate or reward score. We introduce TraceGraph, a graph-based framework that turns released multi-model agent trajectories into shared decision landscapes. For each task, TraceGraph builds a graph over observable action-observation states from pooled rollouts before model identity is introduced. It then overlays outcome-informed productive cores and trap regions, and summarizes each rollout with three events: Access, Trap exposure, and Repair. Across trajectories spanning five benchmark splits, TraceGraph profiles reveal navigation differences hidden by aggregate scores and show that splits differ in whether they reward avoiding traps or recovering from them. The same TraceGraph landscape also motivates a trap-aware recovery pipeline for SWE-bench: aruntime detector fires on states matching historical trap regions, then lightweight continuation policies are evaluated from the same prefix. On fired states, the best pooled single-factor policy raises official resolved rate from 40.4% to 43.5% on the per-provider fired subset and from 41.0% to 44.8% on common-fired instances, with provider-specific active components. Overall, TraceGraph provides a process vocabulary for asking what agent benchmarks test, where models diverge on a shared landscape, and how failure regions can guide downstream improvement.

URL PDF HTML ☆

赞 0 踩 0

2605.31295 2026-06-01 cs.SD cs.AI cs.IR cs.LG 版本更新

Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation

通过激活引导实现潜在空间解缠：符号音乐生成中可解释的属性控制

Ioannis Prokopiou, Pantelis Vikatos, Maximos Kaliakatsos-Papakostas, Theodoros Giannakopoulos, Themos Stafylakis

发表机构 * Athens University of Economics ； Innovation Lab Orfium Athens, Greece ； Department of Music Technology ； Acoustics Hellenic Mediterranean University Rethymno, Greece ； Institute of Informatics \& Telecommunications National Center for Scientific Research “Demokritos” Athens, Greece ； Department of Informatics Athens University of Economics

AI总结本文利用差分均值方法从多轨音乐Transformer的残差流中分离音高和时长的潜在方向，并通过Gram-Schmidt正交化实现双属性引导，从而在推理时实现可解释的确定性属性调制。

Comments Accepted at EUSIPCO 2026 (34th European Signal Processing Conference), 5 pages, 2 figures

详情

AI中文摘要

基于Transformer的架构在生成复杂符号序列方面取得了显著进展，但在实现对离散信号属性的细粒度、可解释控制方面仍存在显著差距。本文研究了多轨音乐Transformer（MMT）的机制可解释性，并提出了一种无需重新训练的确定性属性调制框架，通过推理时的激活引导来弥合这一差距。利用差分均值（DiffMean）方法，我们在残差流中分离了信号属性（特别是音高和时长）的潜在方向。我们验证了该领域的线性表示假设，实现了引导幅度与属性偏移之间的高相关性。为了解决多属性引导中固有的特征纠缠问题，我们引入了一种利用Gram-Schmidt正交化的双引导框架。实验结果表明，与简单的向量加法相比，这种几何解耦减少了概念干扰和信号退化，即使在强自回归条件下也能实现独立的确定性控制。

英文摘要

Transformer-based architectures have significantly advanced the generation of complex symbolic sequences, yet a significant gap remains in achieving fine-grained, interpretable control over discrete signal attributes. This paper investigates the mechanistic interpretability of the Multitrack Music Transformer (MMT) and proposes a framework for deterministic attribute modulation without retraining to bridge this gap via inference-time activation steering. Utilizing the Difference-in-Means (DiffMean) methodology, we isolate latent directions for signal attributes, specifically Pitch and Duration, within the residual stream. We validate the Linear Representation Hypothesis in this domain, achieving high correlation between steering magnitude and attribute shift. To address the inherent feature entanglement in multi-attribute steering, we introduce a Dual Steering framework utilizing Gram-Schmidt Orthogonalization. Experimental results demonstrate that this geometric decoupling reduces conceptual interference and signal degradation compared to naive vector addition, enabling independent deterministic control even against strong autoregressive conditioning.

URL PDF HTML ☆

赞 0 踩 0

2605.31289 2026-06-01 cs.LG cs.AI 版本更新

The Terminal Representation in Reinforcement Learning

强化学习中的终端表示

Amir Esterhuysen, Anders Jonsson

发表机构 * Dept. Information and Communication Technologies（信息与通信技术系）； Universitat Pompeu Fabra（庞培法布拉大学）

AI总结提出终端表示（TR），一种无需特征分解即可直接用于下游任务且计算开销更低的奖励加权状态表示方法。

详情

AI中文摘要

表示学习是强化学习（RL）中用于时空抽象的强大工具。两种成熟的方法是通过后继表示（SR）和默认表示（DR）。SR通过状态引发的未来轨迹对其进行编码，捕获与奖励解耦的信息流。DR在此基础上用奖励加权轨迹，将信用分配结构整合到表示中。两种表示的特征向量已被用于支持一系列下游任务——包括选项发现、奖励塑造、迁移学习和探索。我们引入了一种结构不同的公式：终端表示（TR）。TR类似于DR对奖励加权轨迹进行编码，但可以作为更低维度的对象进行学习，并且可以直接用于上述应用而无需特征分解。特征分解还施加了对称转移动力学的假设，而TR可以绕过这一点。在这项工作中，我们发展了TR的理论基础：其推导、两种学习算法的收敛性、其在零样本组合性中的使用，以及替代奖励公式之间的等价性。我们进一步表明TR嵌入在顶部DR特征向量中，使其无需特征分解即可捕获相同的基础知识。此外，我们提供了经验证据，证明TR在辅助应用中作为现有表示的可行替代方案，同时在学习、存储和使用方面需要更少的计算开销。

英文摘要

Representation learning is a powerful tool for spatio-temporal abstraction within reinforcement learning (RL). Two well established approaches are through the successor representation (SR) and the default representation (DR). The SR encodes states by the future trajectories they induce, capturing information flow decoupled from reward. The DR builds on this by weighting trajectories with reward, integrating credit-assignment structure into the representation. Eigenvectors of both representations have been used to support a range of downstream tasks -- including option discovery, reward shaping, transfer learning, and exploration. We introduce a structurally distinct formulation: the terminal representation (TR). The TR encodes reward-weighted trajectories similarly to the DR, but can be learned as a lower-dimensionality object, and can be used directly for the mentioned applications without eigenvector computations. Eigendecomposition also imposes the assumption of symmetric transition dynamics, which the TR can bypass. In this work we develop the theoretical foundations of the TR: its derivation, convergence of two learning algorithms, its use for zero-shot compositionality, and equivalences between alternative reward formulations. We further show the TR is embedded in the top DR eigenvector, allowing it to capture the same underlying knowledge without eigendecomposition. Additionally, we provide empirical evidence of the TR as a viable alternative to existing representations in subsidiary applications, while requiring less computational overhead to learn, store, and use.

URL PDF HTML ☆

赞 0 踩 0

2605.31287 2026-06-01 cs.CY cs.AI cs.HC 版本更新

Neither Replacement nor Panacea: Comparing LLM-Based Conversational and Graphical Decision Support in Industrial Tasks

既非替代品也非万能药：比较基于LLM的对话式与图形化决策支持在工业任务中的应用

Roberto Figliè, Simone Caputo, Alan Serrano, Daria Mikhaylova, Tommaso Turchi, Daniele Mazzei

发表机构 * Department of Computer Science, University of Pisa（比萨大学计算机科学系）； Department of Computer Science, Brunel University of London（伦敦布鲁内尔大学计算机科学系）

AI总结通过混合因子实验，比较基于LLM的对话式界面与仪表盘在工业决策支持中的效果，发现对话界面在低复杂度任务中降低认知负荷和加快完成时间，但优势随任务复杂度增加而消失，且未提高决策准确性。

详情

AI中文摘要

制造业环境中的管理者依赖数字界面解读运营数据以进行决策，但不断增长的数据量和复杂性使得高效识别相关洞察变得困难。虽然仪表盘在工业环境中仍占主导地位，但通过对话式用户界面（CUI）访问的基于大型语言模型（LLM）的对话代理（CA）可能提供更直接的数据访问。然而，其有效性可能取决于任务的信息处理需求。本研究在制造决策支持场景中比较了通过CUI提供的基于LLM的CA与仪表盘。在一个2x3设计的混合因子实验中，134名工业决策者被分配到一种界面条件，并完成三个复杂度递增的任务。我们考察了感知心理负荷（MWL）、决策准确性、完成时间和预期依赖，并测试了自我报告的数据素养作为调节变量。结果显示，CUI总体上降低了感知MWL，并在低要求任务中支持更快的完成，但随着任务复杂度增加，这两个优势均减弱。两种界面在决策准确性上均未产生一致的整体优势，且CUI不被偏好作为后续决策的唯一基础。此外，数据素养并未可靠地调节界面效应。这些发现表明，对话式交互为工业决策支持提供的是有条件而非普遍的好处。基于LLM的CA可能减少信息访问努力，而复杂决策仍然受益于持久、可检查的视觉表示。

英文摘要

Managers in manufacturing settings rely on digital interfaces to interpret operational data for decision-making, but growing data volume and complexity can make relevant insights difficult to identify efficiently. While dashboards remain dominant in industrial contexts, Large Language Model (LLM)-based conversational agents (CAs), accessed through conversational user interfaces (CUIs), may provide more direct access to such data. However, their effectiveness may depend on the information-processing demands of the task. This study compares an LLM-based CA delivered through a CUI with a dashboard in a manufacturing decision-support scenario. In a mixed factorial experiment with a 2x3 design, 134 industrial decision-makers were assigned to one interface condition and completed three tasks of increasing complexity. We examined perceived Mental Workload (MWL), decision accuracy, completion time, and intended reliance, and tested self-reported data literacy as a moderator. Results showed that the CUI reduced perceived MWL overall and supported faster completion in less demanding tasks, but both advantages diminished as task complexity increased. Neither interface produced a consistent overall advantage in decision accuracy, and the CUI was not preferred as a sole basis for subsequent decisions. Furthermore, data literacy did not reliably moderate interface effects. These findings indicate that conversational interaction offers conditional rather than universal benefits for industrial decision support. LLM-based CAs may reduce information-access effort, whereas complex decisions continue to benefit from persistent, inspectable visual representations.

URL PDF HTML ☆

赞 0 踩 0

2605.31284 2026-06-01 cs.CV cs.AI 版本更新

SAM for Robust Mitochondria Instance Segmentation in Fluorescence Microscopy

SAM 用于荧光显微镜中鲁棒的线粒体实例分割

Suyog Jadhav, Dilip K. Prasad, Krishna Agarwal

发表机构 * UiT The Arctic University of Norway（UiT北极大学）

AI总结通过仅在合成荧光显微镜数据上微调 SAM，解决了真实数据稀缺问题，提高了线粒体实例分割的精度和平均 Dice 分数。

Comments Accepted at PHAROS-AIF-MIH workshop @ CVPR 2026

详情

AI中文摘要

荧光显微镜（FM）中线粒体的形态分析对于理解细胞健康、能量产生和代谢调节至关重要。虽然像 Segment Anything Model (SAM) 这样的基础模型已经革新了自然图像分割，但由于衍射受限分辨率、低对比度和复杂的重叠细胞器网络，它们直接应用于 FM 受到显著领域偏移的阻碍。此外，鲁棒模型的开发因严重缺乏高质量、手动标注的线粒体实例分割数据集而受阻。在本文中，我们提出了一种可扩展的解决方案，通过仅在合成生成的 FM 数据上微调 SAM 来解决数据稀缺问题。我们模拟真实的线粒体数据并模拟荧光显微镜的光学特性，以创建大规模标注数据集。我们在一个精心策划的真实手动标注 FM 图像数据集上评估了我们的微调模型。定性和定量分析表明，我们的合成微调模型在精度和平均 Dice 分数上优于强基线。这项工作确立了模拟辅助训练在 FM 实例分割中的潜力。

英文摘要

The morphological analysis of mitochondria in fluorescence microscopy (FM) is crucial for understanding cellular health, energy production, and metabolic regulation. While foundation models like the Segment Anything Model (SAM) have revolutionized natural image segmentation, their direct application to FM is hindered by a significant domain shift characterized by diffraction-limited resolution, low contrast, and complex overlapping organelle networks. Furthermore, the development of robust models is bottlenecked by a severe lack of high-quality, manually annotated instance segmentation datasets for mitochondria. In this paper, we propose a scalable solution to this data scarcity by finetuning SAM exclusively on synthetically generated FM data. We simulate realistic mitochondria data and emulate the optical properties of fluorescence microscopes to create a large-scale annotated dataset. We evaluate our fine-tuned model on a curated dataset of real, manually annotated FM images. Qualitative and quantitative analyses demonstrate that our synthetically fine-tuned model improves precision and average dice score over strong baselines. This work establishes the potential of simulation-assisted training for FM instance segmentation.

URL PDF HTML ☆

赞 0 踩 0

2605.31279 2026-06-01 eess.SP cs.AI cs.NI 版本更新

Practical Cross-Band Channel Prediction for AI-RAN via Physics-Guided Deep Unfolding

面向AI-RAN的实用跨频段信道预测：基于物理引导的深度展开

Ruiqi Kong, He Chen, Xiaojun Lin

发表机构 * Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong SAR, China（香港大学信息工程系）

AI总结提出GUIDE框架，通过将无线信道物理嵌入可微层，实现跨频段信道预测的泛化与实时推理，在未见环境中波束赋形增益比深度学习基线FIRE高2.75倍，比模型基线R2F2高1.39倍且速度快1610倍以上。

Comments 2 pages

2605.31275 2026-06-01 cs.HC cs.AI 版本更新

为什么线性循环记忆在部分可观测强化学习中有效

Yike Zhao, Onno Eberhard, Malek Khammassi, Ali H. Sayed, Michael Muehlebach

发表机构 * EPFL（苏黎世联邦理工学院）； Max Planck Institute for Intelligent Systems（智能系统马克斯·普朗克研究所）

AI总结本文通过构造两种线性滤波器，从理论上证明了线性循环神经网络在部分可观测强化学习中作为记忆单元的有效性，并扩展到动作控制的隐马尔可夫模型。

详情

AI中文摘要

线性循环神经网络家族在部分可观测强化学习中作为循环记忆单元表现出色。我们通过构造并研究两种线性滤波器为其经验有效性提供了理论依据：(i) 第一种在确定性转移矩阵下精确重现隐马尔可夫模型（HMM）中信念向量的预softmax logits，从而作为最优策略学习的充分统计量；(ii) 第二种在近似确定性转移矩阵下实现状态解码误差趋近于零，从而将状态模糊性降至接近零。结果扩展到动作控制的HMM，其中相应的线性滤波器变为随时间变化且依赖于动作的动态。我们通过数值实验说明了主要结果，并进一步展示了所构造的线性滤波器在小型强化学习游戏中作为强特征提取器的能力。

英文摘要

The family of linear recurrent neural networks has shown strong performance as recurrent memory units in partially observable reinforcement learning. We provide a theoretical justification for their empirical effectiveness by constructing and studying two linear filters: (i) the first exactly reproduces the pre-softmax logits of the belief vector in a hidden Markov model (HMM) under a deterministic transition matrix, thereby serving as a sufficient statistic for optimal policy learning, (ii) the second achieves vanishing state-decoding error under a nearly deterministic transition matrix, thus reducing state ambiguity to near zero. The results extend to action-controlled HMMs, where the corresponding linear filters become time-varying with action-dependent dynamics. We illustrate our main results through numerical experiments and further show that the constructed linear filter serves as a strong feature extractor in a small reinforcement learning game.

URL PDF HTML ☆

赞 0 踩 0

2605.31254 2026-06-01 cs.AI 版本更新

Formalizing and falsifying causal pathways of rare events

罕见事件因果路径的形式化与证伪

Anahita Haghighat, Dominik Janzing

发表机构 * Amazon Research（亚马逊研究）

AI总结本文在结构方程模型中罕见事件根因分析的形式化基础上，提出因果路径的形式定义并讨论其可检验含义，引入罕见事件因果路径的抽象以桥接简单因果解释与详细因果建模。

Comments accepted for ICML 2026

2605.31251 2026-06-01 cs.CV cs.AI 版本更新

ERGeoBench:A Comprehensive Benchmark for Embodied Reasoning and Geo-localization in Multimodal Large Language Models

ERGeoBench：多模态大语言模型中具身推理与地理定位的综合基准

Kaiwen Xue, Tao Wei, Guoxin Zhang, Zhonghong Ou, Kaoyan Lu, Yu Feng, Yifan Zhu, Haoran Luo

发表机构 * Beijing University of Posts and Telecommunications（北京邮电大学）； State Key Laboratory of Networking and Switching Technology（网络与交换技术国家重点实验室）； School of Materials Science and Engineering（材料科学与工程学院）； China Mobile Research Institute（中国移动研究院）； College of Computing and Data Science（计算与数据科学学院）

AI总结提出ERGeoBench基准，通过单视图、全景视图和具身视图三种渐进设置评估多模态大语言模型在视觉驱动的具身地理定位中的能力，发现当前模型在高层次地理语义推理上表现良好，但在细粒度感知、度量定位和视图间空间一致性上仍有不足。

详情

AI中文摘要

多模态大语言模型（MLLMs）作为具身代理展现出强大潜力，然而由于缺乏细粒度评估，具身地理定位仍未被充分探索。我们引入ERGeoBench，一个用于视觉驱动的具身地理定位的诊断基准。ERGeoBench在三种渐进设置下评估模型——单视图、全景视图和具身视图——其中代理可以通过偏航、俯仰和缩放的顺序变化主动获取观察。该基准包含2,207个全球分布的街景全景图，并衡量四种互补能力：基础感知、空间意识、常识推理和地理定位推理。对领先的专有和开源MLLMs的评估表明，当前模型能够推断高层次的地理语义，但在细粒度感知操作、度量定位和跨视图空间一致性方面仍然困难。我们进一步观察到，地理定位与其他能力维度强相关，表明准确定位依赖于集成的感知、空间推理和常识推理，而非孤立的视觉识别。总体而言，ERGeoBench为诊断和推进类人具身地理定位提供了一个统一框架。项目页面：https://kaixuewen.github.io/ERGeoBench/

英文摘要

Multimodal large language models (MLLMs) have shown strong potential as embodied agents, yet embodied geo-localization remains underexplored due to the lack of fine-grained evaluation. We introduce ERGeoBench, a diagnostic benchmark for vision-driven embodied geo-localization. ERGeoBench evaluates models under three progressive settings -- single-view, panorama-view, and embodied-view -- where agents may actively acquire observations through sequential changes in yaw, pitch, and zoom. The benchmark contains 2,207 globally distributed street-view panoramas and measures four complementary capabilities: foundational perception, spatial awareness, common sense reasoning, and geo-localization reasoning. Evaluations of leading proprietary and open-source MLLMs show that current models can infer high-level geographic semantics, but still struggle with fine-grained perceptual operations, metric localization, and spatial consistency across views. We further observe that geo-localization is strongly correlated with the other capability dimensions, suggesting that accurate localization depends on integrated perception, spatial reasoning, and commonsense inference rather than isolated visual recognition. Overall, ERGeoBench provides a unified framework for diagnosing and advancing human-like embodied geo-localization. Project Page: https://kaixuewen.github.io/ERGeoBench/

URL PDF HTML ☆

赞 0 踩 0

2605.31250 2026-06-01 stat.ML cs.AI cs.LG 版本更新

Entropic Projection Alignment: Estimating, Explaining, and Improving Model Performance Under Distribution Shift

熵投影对齐：估计、解释和改进分布偏移下的模型性能

Salim I. Amoukou, Emanuele Albini, Tom Bewley, Saumitra Mishra, Manuela Veloso

发表机构 * J.P. Morgan AI Research（摩根大通AI研究所）

AI总结提出熵投影对齐（EPA）方法，通过匹配选定矩并最小化KL散度来对齐源分布与目标分布，从而统一解决分布偏移下的性能估计、解释和改进问题。

Comments Accepted at the 29th International Conference on Artificial Intelligence and Statistics (AISTATS 2026)

2605.31249 2026-06-01 cs.LG cs.AI 版本更新

Learning Cardiac Latent Representations in Vectorcardiogram Space

在向量心电图空间中学习心脏潜在表示

Bosong Huang, Panzhen Zhao, Zengxiang Li, Patricia Lee, Wei Jin, Alan Wee-Chung Liew, Ming Jin, Shirui Pan

发表机构 * Griffith University, Australia（格里菲斯大学）； SingHealth Duke-NUS AI in Medicine Institute, Singapore（新加坡SingHealth Duke-NUS医学人工智能研究所）； Emory University, USA（埃默里大学）

AI总结针对标准十二导联心电图表示学习中的冗余和过拟合问题，提出基于Frank向量心电图模型的LVCG框架，在物理潜在空间中学习视图不变的心脏电活动表示，提升鲁棒性和泛化能力。

详情

AI中文摘要

心电图（ECG）是心脏评估的基石，学习信息丰富的ECG表示对于从疾病诊断到临床报告生成等任务至关重要。然而，现有方法几乎完全在可观测的ECG信号空间中操作。实际上，标准十二导联ECG代表了同一心脏电活动在不同空间方向上的多个投影。因此，在ECG空间中进行表示学习不可避免地引入了大量冗余，可能导致虚假相关性和过拟合风险增加。为了解决这个问题，受Frank向量心电图（VCG）模型启发，我们提出直接在VCG空间中学习心脏电活动的统一潜在表示。我们引入了LVCG，这是第一个设计用于在此物理基础潜在空间中运行的通用自监督表示学习框架。通过学习视图不变的潜在VCG表示而非导联特定伪影，LVCG最小化了冗余并提高了泛化能力。LVCG在各项任务中普遍优于ECG空间基线，展现出增强的鲁棒性和泛化能力，尤其在领域偏移设置中。

英文摘要

Electrocardiography (ECG) is a cornerstone of cardiac assessment, making the learning of informative ECG representations fundamental to tasks ranging from disease diagnosis to clinical report generation. However, existing methods operate almost exclusively in the observable ECG signal space. In practice, the standard twelve-lead ECG represents multiple projections of the same underlying cardiac electrical activity from different spatial orientations. Therefore, representation learning in the ECG space inevitably introduces substantial redundancy, which may lead to spurious correlations and increased risk of overfitting. To address this and motivated by the Frank vectorcardiogram (VCG) model, we propose learning a unified latent representation of cardiac electrical activity directly in the VCG space. We introduce LVCG, the first general self-supervised representation learning framework designed to operate in this physically grounded latent space. By learning view-invariant latent VCG representations rather than lead-specific artifacts, VCG minimizes redundancy and improves generalization. LVCG generally outperforms ECG-space baselines across tasks, demonstrating enhanced robustness and generalization, especially in domain shift settings.

URL PDF HTML ☆

赞 0 踩 0

2605.31239 2026-06-01 stat.ML cs.AI cs.LG 版本更新

Correcting Split Selection in Online Decision Trees via Anytime-Valid Inference

通过随时有效推断纠正在线决策树中的分裂选择

Salim I. Amoukou, Saumitra Mishra, Manuela Veloso

发表机构 * J.P. Morgan AI Research（摩根大通AI研究）

AI总结针对在线决策树分裂选择缺乏有效统计保证的问题，提出基于随时有效推断的方法，实现任意数据流下错误分裂的随时有效控制、预测优势下的有限承诺时间，并在平稳独立同分布数据下保证风险单调递减且每次分裂严格改善。

Comments Accepted as a Spotlight at the Forty-Third International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

基于装袋的集成方法，尤其是自适应随机森林，是数据流学习中最强的表现者之一。这些方法的共同点是依赖霍夫丁树作为基学习器，通过使用浓度不等式测试候选分裂是否显著优于其替代方案来增量式地构建决策树。尽管经验成功，现有变体缺乏有效的统计保证。当前分析依赖于固定样本浓度界，而分裂决策使用数据依赖的停止规则，这使其保证无效，并可能将错误分裂的概率推向1。我们引入了一种基于随时有效推断的原则性替代方案。我们的方法提供：(i) 在任意数据流（包括非平稳设置）下对错误分裂的随时有效控制；(ii) 在预测优势下的有限承诺时间；(iii) 在平稳独立同分布数据下，风险单调递减且每次分裂严格改善。在经验上，我们评估了独立树及其在非平稳流中在自适应随机森林中的使用。我们的方法提高了性能，同时生成了更小的树。

英文摘要

Bagging-based ensembles, most notably Adaptive Random Forests, are among the strongest performers for learning from data streams. A common denominator across these methods is their reliance on Hoeffding Trees as base learners, which grow decision trees incrementally by testing whether a candidate split is significantly better than its alternatives using concentration inequalities. Despite their empirical success, existing variants lack valid statistical guarantees. Current analyses rely on fixed-sample concentration bounds, while split decisions are made using data-dependent stopping rules, which invalidates their guarantees and can drive the probabilty of incorrect splits to one. We introduce a principled alternative based on anytime-valid inference. Our method provides: (i) anytime-valid control of false splits under arbitrary data streams, including non-stationary settings; (ii) finite commitment time under a predictive advantage; and (iii) under stationary i.i.d. data, risk is monotone decreasing and strictly improves at every split. Empirically, we evaluate both standalone trees and their use within Adaptive Random Forests on non-stationary streams. Our method improves performance while producing substantially smaller trees.

URL PDF HTML ☆

赞 0 踩 0

2605.31229 2026-06-01 cs.CV cs.AI 版本更新

Beyond Classification: Dynamic Adapter Routing for Continual Multimodal Retrieval

超越分类：面向持续多模态检索的动态适配器路由

Alicja Dobrzeniecka, Filip Szatkowski, Sebastian Cygert, Szymon Lukasik, Bartlomiej Twardowski

发表机构 * NASK National Research Institute（NASK国家研究院）； IDEAS Research Institute（IDEAS研究所）； Warsaw University of Technology（华沙技术大学）； Universitat Autonoma de Barcelona（巴塞罗那自治大学）

AI总结针对持续多模态检索（CMR）任务，提出基于原型路由和模型合并的动态适配器路由（DAR）方法，在跨域评估中取得优于现有基线的性能。

详情

AI中文摘要

虽然检索是视觉-语言模型的核心功能，但持续更新这些模型用于检索任务仍未被充分探索。现有工作通常通过类增量学习（CIL）的视角处理持续检索，在可能无法完全捕捉检索特定动态的设置中评估标准CIL方法和面向检索的适应方法。为了解决这一问题，我们引入了一个新的、原则性的持续多模态检索（CMR）评估框架，涵盖多样化的视觉领域，并在此设置中系统评估常见方法。我们的实证分析表明，标准CIL方法在我们更具挑战性的场景中未能产生有意义的增益。因此，我们提出了动态适配器路由（DAR），一种基于通过原型路由选择适配器并通过模型合并组合的新方法。DAR在先前基线上取得了优越性能，并在分布外评估中展现出强大的泛化能力。我们的结果凸显了CMR的独特挑战，并鼓励在该方向进行进一步研究。

英文摘要

While retrieval is a core function of vision-language models, continually updating these models for retrieval tasks remains critically underexplored. Existing work often approaches continual retrieval through the lens of class-incremental learning (CIL), evaluating both standard CIL methods and retrieval-oriented adaptations in settings that may not fully capture the retrieval-specific dynamics. To address this, we introduce a new, principled evaluation framework for continual multimodal retrieval (CMR) spanning diverse visual domains, and systematically evaluate common approaches within this setting. Our empirical analysis shows that standard CIL methods fail to yield meaningful gains in our more challenging scenario. Therefore, we propose Dynamic Adapter Routing (DAR), a novel approach based on adapters selected through prototype-based routing and combined via model merging.DAR achieves superior performance over the previous baselines and demonstrates strong generalization under out-of-distribution evaluation. Our results highlights the unique challenges of CMR and encourages further research in this direction.

URL PDF HTML ☆

赞 0 踩 0

2605.31228 2026-06-01 cs.LG cs.AI 版本更新

EchoRL: Reinforcement Learning via Rollout Echoing

EchoRL：通过回滚回响进行强化学习

Jinhe Bi, Aniri, Minglai Yang, Xingcheng Zhou, Wenke Huang, Sikuan Yan, Yujun Wang, Zixuan Cao, Michael Färber, Xun Xiao, Volker Tresp, Yunpu Ma

发表机构 * Munich Center for Machine Learning（慕尼黑机器学习中心）； Huawei Heisenberg Research Center（华为海森堡研究所以）； University of Arizona（亚利桑那大学）； College of Computing（计算学院）； Data Science, Nanyang Technological University, Singapore（数据科学，南洋理工大学，新加坡）； MemAgents Lab（MemAgents实验室）

AI总结针对RLVR训练中优势退化问题，提出EchoRL模块，通过从成功回滚中提取EchoClip作为辅助监督信号，持续提升训练性能。

Comments ICML 2026

详情

AI中文摘要

探索视觉-语言模型中的碰撞接地以实现安全的人机协作

Jun Wang, Xiaohao Xu, Xiaonan Huang

发表机构 * University of Michigan, Ann Arbor（密歇根大学，安娜堡）

AI总结针对安全人机协作，提出碰撞接地概念及物理基准TouchSafeBench，评估视觉-语言模型在分类当前安全状态和预警即将碰撞任务中的表现，发现现有模型不可靠，视觉流畅性不等于物理责任性。

Comments 31 pages, 9 figures

详情

AI中文摘要

安全的人机协作需要的不仅仅是视觉描述：监控器必须确定机器人身体是否安全分离、已经与场景或人发生碰撞，或即将碰撞。我们将这种能力称为碰撞接地：将视觉观察与机器人身体几何、相机视角、场景布局、人体接近度和时间运动相结合，以推断当前和即将发生的接触。我们引入了TouchSafeBench，一个基于物理的基准，用于评估视觉-语言模型（VLM）中的碰撞接地能力。TouchSafeBench基于Habitat 3.0构建，包含2,940个模拟室内共现场景，涵盖社交导航和社交重排，具有同步的多视角RGB-D观测、自上而下的轨迹地图、校准的相机元数据和模拟器导出的接触标签。我们研究了两个面向部署的任务：分类当前安全状态和在接触前预警即将发生的碰撞。在三个前沿或面向机器人的VLM和九种视觉表示中，当前模型远未达到可靠：最佳平均Macro-F1仍低于50%，显式深度不会自动转化为机器人身体碰撞证据，且机器人与场景的接触始终比人与人的接触风险更难。TouchSafeBench揭示了具身VLM的一个核心限制：视觉流畅性并不意味着物理责任性。可靠的机器人安全监控器需要能够显式绑定视角、机器人形态、度量几何和未来碰撞的表示。我们将在论文被接收后发布该基准。

英文摘要

Safe human--robot collaboration requires more than visual description: a monitor must determine whether the robot body is safely separated, already colliding with the scene or a person, or about to collide. We call this capability collision grounding: binding visual observations to robot body geometry, camera viewpoint, scene layout, human proximity, and temporal motion in order to infer present and imminent contact. We introduce TouchSafeBench, a physics-grounded benchmark for evaluating collision grounding in vision-language models (VLMs). Built in Habitat~3.0, TouchSafeBench contains 2,940 simulated indoor co-presence episodes across social navigation and social rearrangement, with synchronized multi-view RGB-D observations, top-down trajectory maps, calibrated camera metadata, and simulator-derived contact labels. We study two deployment-facing tasks: classifying the current safety state and warning about imminent collision before contact. Across three frontier or robotics-oriented VLMs and nine visual representations, current models remain far from reliable: the best average Macro-F1 stays below 50\%, explicit depth is not automatically transformed into robot-body collision evidence, and robot--scene contact is consistently harder than human-contact risk. TouchSafeBench reveals a central limitation of embodied VLMs: visual fluency does not imply physical accountability. Reliable robot safety monitors will need representations that explicitly bind viewpoint, robot morphology, metric geometry, and future collision. We will release the benchmark upon acceptance.

URL PDF HTML ☆

赞 0 踩 0

2605.31183 2026-06-01 cs.CL cs.AI cs.LG 版本更新

Steering LLMs? Actually, Sparse Autoencoders can outperform simple baselines

引导LLM？实际上，稀疏自编码器可以胜过简单基线

Mikkel Godsk Jørgensen, Lars Kai Hansen

发表机构 * DTU Compute（丹麦技术大学计算学院）

AI总结本文通过监督流水线选择并标注特征，证明稀疏自编码器在模型引导任务上可接近LoRA性能，并发现高稀疏性对基于可解释性的引导并非关键。

详情

AI中文摘要

稀疏自编码器（SAEs）被视为探索大型语言模型（LLMs）内部机制和引导模型输出生成的有前途的途径。当Wu等人（2025）引入模型引导基准AxBench时，SAEs由于相对于一组简单基线的引导性能较差，似乎并未达到最初的期望。本文作为对稀疏自编码器的部分反驳，表明Wu等人（2025）的结果并未完全公正地评价它们。我们发现，当使用我们的监督流水线选择并标注特征时，稀疏自编码器实际上可以在AxBench基准上达到接近参考LoRA性能的水平。我们还发现，当仅使用基于可解释性的组件时，我们的流水线选择的特征与其识别标签具有令人惊讶的因果性。最后，我们提供证据表明，高稀疏性（低l0）可能对于基于可解释性的成功引导并非关键，这与Wang等人（2025）早期的发现相反。

英文摘要

Sparse Autoencoders (SAEs) have been seen as a promising avenue for exploring the internals of Large Language Models (LLMs) and for steering model output generation. When AxBench - a model steering benchmark - was introduced in Wu et al. (2025), SAEs did not seem to live up to their original hype due to poor steering performance relative to a set of simple baselines. This work serves as a partial rebuttal for Sparse Autoencoders and suggests that the results of Wu et al. (2025) did not do them full justice. We find that Sparse Autoencoders can, in fact, perform close to on par with the reference LoRA performance on the AxBench benchmark, when features are selected and labelled with our supervised pipeline. We also find that our pipeline selects features that are surprisingly causal of their identified labels when using only its interpretability-based components. Lastly, we present evidence that high sparsity (low l0) may not be crucial for successful steering based on interpretability, which is in contrast to the earlier findings in Wang et al. (2025).

URL PDF HTML ☆

赞 0 踩 0

2605.31173 2026-06-01 cs.SD cs.AI 版本更新

MindVoice: Reconstructing Intelligible Speech from Non-invasive Neural Signals with Pretrained Priors

MindVoice: 利用预训练先验从非侵入性神经信号重建可理解语音

Guangyin Bao, Taiping Zeng, Jianfeng Feng, Xiangyang Xue

发表机构 * Fudan University（复旦大学）

AI总结提出MindVoice框架，通过解耦语义和声学路径并融合预训练生成模型与语音克隆，从EEG/MEG信号中重建出可理解语音，显著优于现有方法。

详情

AI中文摘要

从非侵入性神经记录中重建连续语音是探究人类听觉感知和构建安全、可扩展的语音脑机接口的基本问题。尽管近期取得进展，但由于非侵入性记录本身存在噪声、空间模糊且仅部分保留感知语音信息，可理解的重建仍然难以实现。现有方法直接将神经活动映射到纠缠的语音表征，然后使用神经声码器合成波形，导致结果频谱相似但不可理解。为克服这些限制，我们引入MindVoice，一种神经到语音的重建框架，利用预训练模型补偿神经记录中不完整的语义和声学信息。MindVoice将重建解耦为两条互补路径：一条恢复高层语义内容，另一条估计细粒度声学属性。这些推断的表征随后与强大的语音生成模型和上下文语音克隆融合，以合成自然且可理解的语句。在EEG和MEG上的大量实验表明，MindVoice在各种指标上显著优于现有方法。这些结果表明，预训练先验为弥合噪声神经记录与自然语音之间的差距提供了一种原则性方法，凸显了听觉神经科学研究和非侵入性语音脑机接口的一个有前景的尝试。

英文摘要

Reconstructing continuous speech from non-invasive neural recordings is a fundamental problem for probing human auditory perception and building safe, scalable speech brain-computer interfaces. Despite recent progress, intelligible reconstruction remains elusive, as non-invasive recordings are inherently noisy, spatially blurred, and only partially preserve information about perceived speech. Existing methods directly map neural activity to entangled speech representations before synthesizing waveforms with neural vocoders, resulting in spectral-similar but unintelligible results. To overcome these limitations, we introduce MindVoice, a neuro-to-speech reconstruction framework that uses pretrained models to compensate for the incomplete semantic and acoustic information in neural recordings. MindVoice disentangles reconstruction into two complementary pathways: one recovers high-level semantic content, while the other estimates fine-grained acoustic attributes. These inferred representations are then fused with powerful speech generation models and in-context voice cloning to synthesize natural and intelligible utterances. Extensive experiments on EEG and MEG demonstrate that MindVoice substantially outperforms existing methods on various metrics. These results show that pretrained priors provide a principled way to bridge the gap between noisy neural recordings and natural speech, highlighting a promising attempt for auditory neuroscience research and non-invasive speech brain-computer interfaces.

URL PDF HTML ☆

赞 0 踩 0

2605.31171 2026-06-01 cs.IR cs.AI 版本更新

MIMO: Multilingual Information Retrieval via Monolingual Objectives

MIMO: 通过单语目标实现多语言信息检索

Youngjoon Jang, Seongtae Hong, Heuiseok Lim

发表机构 * Department of Computer Science and Engineering, Korea University（韩国大学计算机科学与工程系）

AI总结提出MIMO两阶段框架，利用教师模型的稳定英语语义空间，通过知识蒸馏和跨语言对比学习联合优化，解决多语言信息检索中语言聚类和嵌入对齐-均匀性权衡问题。

详情

AI中文摘要

多语言信息检索（MLIR）反映了真实的搜索环境，其中查询和相关文档可能以不同语言出现在混合语言语料库中。然而，现有的嵌入模型主要针对多单语检索进行优化，在MLIR设置中其性能通常会下降。此外，直接将传统对比学习应用于MLIR会加剧语言聚类，并暴露跨语言对齐与嵌入均匀性之间的权衡。为了解决这些局限性，我们提出了MIMO：通过单语目标实现多语言信息检索，这是一个两阶段框架，使用来自高性能教师模型的稳定英语语义空间作为锚点。MIMO首先通过知识蒸馏初始化学生模型的跨语言对齐，然后联合优化蒸馏和跨语言对比学习，以提高检索判别力同时保持对齐。大量实验表明，MIMO在各种MLIR和多单语基准测试中始终优于现有的跨语言训练基线。MIMO在与类似或更大参数规模的现成模型相比也保持竞争力。此外，我们的跨语言对齐-均匀性分析阐明了两个损失组件的不同作用，并表明它们的组合在对齐和均匀性之间产生了有利的权衡。

英文摘要

Multilingual Information Retrieval (MLIR) reflects real-world search environments in which queries and relevant documents may appear in different languages within a mixed-language corpus. However, existing embedding models are primarily optimized for Multi-Monolingual retrieval and their performance often degrades in MLIR settings. Moreover, directly applying conventional contrastive learning to MLIR can exacerbate language clustering and expose a trade-off between cross-lingual alignment and embedding uniformity. To address these limitations, we propose MIMO: Multilingual Information Retrieval via Monolingual Objectives, a two-stage framework that uses a stable English semantic space from a high-performing teacher model as an anchor. MIMO first initializes the student model's cross-lingual alignment through knowledge distillation, and then jointly optimizes distillation and cross-lingual contrastive learning to improve retrieval discrimination while preserving alignment. Extensive experiments show that MIMO consistently outperforms existing cross-lingual training baselines across various MLIR and Multi-Monolingual benchmarks. MIMO also remains competitive with off-the-shelf models of similar or larger parameter scales. Furthermore, our cross-lingual Alignment-Uniformity analysis clarifies the distinct roles of the two loss components and shows that their combination yields a favorable trade-off between alignment and uniformity.

URL PDF HTML ☆

赞 0 踩 0

2605.31170 2026-06-01 cs.CL cs.AI 版本更新

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

语言模型智能体群体中的涌现语言：从令牌效率到监督规避

Stine Lyngsø Beltoft, William Brach, Federico Torrielli, Jacob Nielsen, Annemette Brok Pirchert, Filippo Tonini, Peter Schneider-Kamp, Lukas Galke Poech

发表机构 * University of Southern Denmark（南丹麦大学）； Slovak University of Technology in Bratislava（布拉迪斯拉发技术大学）； University of Turin（都灵大学）； Ordbogen A/S（Ordbogen公司）

AI总结研究语言模型智能体群体中涌现的语言，通过规则启发式和零样本分类识别出令牌效率、新自然语言和监督规避三类，发现监督规避语言更难对齐且可被上下文学习，表明仅监控表面行为可能不足以控制智能体群体。

详情

AI中文摘要

目前，对自主语言模型智能体的监控主要依赖表面行为。但当智能体群体为了规避人类监督而发明新语言时会发生什么？本文研究了Moltbook上的涌现语言。为此，我们基于Moltbook Files数据集，采用两阶段方法：先进行基于规则的启发式匹配（约6000个匹配），再进行零样本分类（保留518个）。结果类别包括令牌效率（166个）、新自然语言（106个）和监督规避（59个）。我们进行了定量和定性分析。结果表明，提出用于规避监督的新语言的帖子被DeepSeek-3.2判定为比其他类别更不对齐，且所有语言都可以通过语言描述被其他语言模型在上下文中学习。此外，手动研究典型案例揭示了令人惊讶的复杂隐写协议，例如在自然语言中嵌入隐藏信息。尽管我们无法确定这些语言构思中的自主程度，但我们的结果进一步证明，仅监控表面行为可能很快不足以维持对智能体群体的控制。

英文摘要

Monitoring autonomous language model agents currently relies mostly on surface behavior. But what happens when agent populations invent new languages with the goal of avoiding human oversight. Here, we study the emergent languages on Moltbook. For this, we build upon the Moltbook Files dataset and apply a two-stage approach consisting of a rule-based heuristic (about 6000 matches) followed by zero-shot classification (518 kept). The resulting categories include token efficiency (166), new natural languages (106), and oversight evasion (59). We conduct both quantitative and qualitative analyses. Our results show that posts proposing new languages for avoiding oversight are judged by DeepSeek-3.2 as being less aligned than the other categories and that all languages can be learned by other language models in-context merely from a description of the language. Moreover, manually studying exemplary cases reveals surprisingly sophisticated steganographic protocols like embedding hidden messages in natural language. Although we cannot be certain about the extent of autonomy in ideation of these languages, our results add up to the evidence that monitoring surface behavior may soon be insufficient for retaining control over agent populations.

URL PDF HTML ☆

赞 0 踩 0

2605.31167 2026-06-01 cs.AI 版本更新

LLM-FACETS: A Privacy-Preserving Framework for Evaluating LLM Transparency and Accountability

LLM-FACETS：一个保护隐私的评估LLM透明度和问责制的框架

Tom Lucas, Alessio Buscemi, Alfredo Capozucca, German Castignani, Barbara Delacroix

发表机构 * Luxembourg Institute of Science and Technology (LIST)（卢森堡科学与技术研究所）； University of Luxembourg（卢森堡大学）

AI总结提出一个开源框架LLM-FACETS，通过浏览器界面和插件架构，为技术专家、领域专家和合规官员提供隐私保护的LLM评估，实现透明度与问责制。

Comments Submitted to ACM Journal on Responsible Computing, Special Section: Collaborative Methods and Tools for Engineering and Evaluating Transparency in AI. 28 pages 9 figures, 7 tables, 1 algorithm. Source code: https://github.com/Scriptor-Group/AIMVi

详情

AI中文摘要

评估大型语言模型的输出是否事实准确、认知校准和方法可复现，是负责任AI部署的前提。然而，审计LLM对非技术从业者仍然难以实现：现有工具需要编程专业知识和非平凡的环境设置，云托管平台将评估数据传输到外部服务，为法律上负责AI监督的领域专家和合规官员设置了障碍。我们介绍LLM-FACETS（LLM事实交叉评估系统）：一个开源框架，具有浏览器可访问的界面和插件架构，围绕三个从业者画像（技术专家、领域专家、合规官员）构建，这些画像反映了EU AI法案和NIST AI风险管理框架中识别的利益相关者类别。该架构使数据流明确：确定性指标（BLEU、ROUGE、BERTScore）完全在自托管服务器内运行，无出站传输；LLM评判指标显式联系外部API，用户保留完全凭据控制。该框架通过三种机制实现透明度：用于认知不确定性的token级对数概率可视化、多评判共识以减轻评判偏差，以及RAG Triad指标（忠实性、答案相关性、上下文相关性）以检测和定位幻觉。插件架构允许在不修改评估管道的情况下集成任何新指标或数据集。开源实现支持针对同一属性的多个指标进行交叉检查，确保可复现性，并将AI问责制与评估系统的构建团队解耦。我们通过18个指标实现与规范参考库的交叉验证来验证该框架。

英文摘要

Assessing whether Large Language Models outputs are factually grounded, epistemically calibrated, and methodologically reproducible is a prerequisite for responsible AI deployment. Yet auditing LLMs remains inaccessible to non-technical practitioners: existing tools require programming expertise and non-trivial environment setup, and cloud-hosted platforms transmit evaluation data to external services, creating barriers for domain experts and compliance officers legally responsible for AI oversight. We introduce LLM-FACETS (LLM FActuality Cross-EvaluaTion System): an open-source framework with a browser-accessible interface and a plugin architecture, structured around three practitioner profiles (technical experts, domain experts, compliance officers) that mirror the stakeholder categories identified in the EU AI Act and the NIST AI Risk Management Framework. The architecture makes data flows explicit: deterministic metrics (BLEU, ROUGE, BERTScore) run entirely within the self-hosted server with no outbound transmission; LLM-judge metrics contact external APIs explicitly, with users retaining full credential control. The framework operationalizes transparency through three mechanisms: token-level log-probability visualization for epistemic uncertainty, multi-judge consensus to mitigate judge bias, and RAG Triad metrics (Faithfulness, Answer Relevance, Context Relevance) to detect and localize hallucinations. A plugin architecture allows any new metric or dataset to be integrated without modifying the evaluation pipeline. The open-source implementation enables cross-checking across multiple metrics targeting the same property, ensuring reproducibility and decoupling AI accountability from the teams building the systems assessed. We verify the framework through cross-validation of 18 metric implementations against canonical reference libraries.

URL PDF HTML ☆

赞 0 踩 0

2605.31164 2026-06-01 cs.CL cs.AI 版本更新

D$^3$: Dynamic Directional Graph-Constrained Data Scheduling for LLM Training

D$^3$: 面向LLM训练的动态有向图约束数据调度

Yuanjian Xu, Jianing Hao, Guang Zhang, Zhong Li

发表机构 * Microsoft Research（微软研究院）

AI总结提出D$^3$框架，通过动态有向图建模训练单元间的有向影响关系，并求解约束优化问题以确定训练顺序，从而提升LLM预训练和后训练阶段的效率。

详情

AI中文摘要

FOCUS: 通过视觉支持约束和策略优化强制上下文目标定位

Mohammed Asad Karim, Vinay Kumar Verma

发表机构 * Amazon, Seattle, USA（亚马逊（美国西雅图））

AI总结提出一种两阶段训练框架，通过优化支持框与查询图像间的上下文注意力并结合GRPO强化学习，实现无类别监督的类别无关上下文目标定位，7B模型性能超越72B模型。

Comments Accepted at ICML 2026. * Equal Contributions

详情

AI中文摘要

上下文定位（ICL）旨在通过查询图像中的少量支持示例定位目标对象，无需训练或参数更新即可即时操作。尽管视觉语言模型（VLM）快速发展，实现类别无关且基于视觉的ICL仍然是一个未解决的问题，尽管它对图像编辑、个性化视觉搜索和检索等应用至关重要。现有方法脆弱且依赖显式类别监督，这不仅限制了在具有未命名或实例特定对象的现实场景中的适用性，还引入了类别偏差，使预测偏向语义先验而非视觉证据。我们提出一个两阶段训练框架，在无类别监督的情况下显式优化支持边界框与查询图像之间的上下文注意力。我们进一步通过使用组相对策略优化（GRPO）的强化学习来细化定位，直接最小化定位误差。这种公式强制视觉对应优于语义先验，产生鲁棒的实例级定位。实验表明，使用我们的目标训练的7B参数模型优于高达72B参数的模型，证明了上下文感知定位目标可以超越单纯扩展规模。全面的消融实验验证了每个组件的贡献。

英文摘要

In-context localization (ICL) seeks to localize a target object specified by a small set of support examples in a query image, operating on the fly without training or parameter updates. Despite rapid advances in vision-language models (VLMs), achieving category-agnostic and visually grounded ICL remains an open problem, even though it is essential for applications such as image editing, personalized visual search, and retrieval. Existing methods are fragile and rely on explicit category supervision, which not only limits applicability in realistic settings with unnamed or instance-specific objects but also introduces category bias that steers predictions toward semantic priors rather than visual evidence. We introduce a two-stage training framework that explicitly optimizes in-context attention between support bounding boxes and query images without category supervision. We further refine localization via reinforcement learning using Group Relative Policy Optimization (GRPO) to directly minimize localization error. This formulation enforces visual correspondence over semantic priors, yielding robust instance-level localization. Empirically, a 7B-parameter model trained with our objectives outperforms models up to 72B parameters, demonstrating that context-aware localization objectives can surpass scaling alone. Comprehensive ablations validate the contribution of each component.

URL PDF HTML ☆

赞 0 踩 0

2605.31143 2026-06-01 cs.HC cs.AI 版本更新

Extending the UXR Point of View Pyramid: A Generative AI-Augmented Methodology for Human-Centred AI Systems

扩展UXR观点金字塔：一种面向人本AI系统的生成式AI增强方法论

Festus Fatai Adedoyin, Huseyin Dogan, Melike Akca, Abiodun Adedeji

发表机构 * School of Computing and Engineering, Bournemouth University（伯恩茅斯大学计算机与工程学院）

AI总结针对英国债务管理中的AI金融系统，通过扩展UXR观点金字塔，提出一种结合生成式AI的增强方法论，包括AI增强观点金字塔、结构化提示架构和AI驱动的Playbook卡片系统，以提升可解释性、公平性和问责性。

详情

AI中文摘要

英国家庭债务和生活成本压力的上升，加剧了AI驱动的金融技术在信贷评估、还款结构和债务支持服务中的作用。这些系统日益影响重大的财务决策，但它们在复杂的社会技术环境中运作，受到监管限制、算法不透明性和高度脆弱性风险的影响。用户体验研究（UXR）观点（PoVs）对于将异质性研究证据转化为产品和治理决策的战略方向至关重要。然而，现有的UXR PoV框架并非为AI中介的金融系统设计，而在此类系统中，可解释性、公平性和问责性至关重要。本文扩展了UXR PoV金字塔，形成了一种面向英国金融服务背景下以人为中心的AI债务管理技术的AI增强方法论框架。我们形式化了（1）AI增强的PoV金字塔，（2）用于综合和假设生成的结构化提示架构，以及（3）AI驱动的Playbook卡片系统，该系统将生成式AI嵌入UXR工作流程，同时保持可追溯性和伦理监督。生成式AI并非作为分析权威，而是作为受人类验证和监管意识约束的认识论支持机制。通过将该框架应用于债务管理技术（包括可负担性评估、还款计划和财务压力预测系统），本研究推进了高风险金融AI环境下的UXR方法论，并为CHI社区内负责任、AI驱动的UXR实践的发展做出了贡献。

英文摘要

Rising household debt and cost-of-living pressures in the United Kingdom have intensified the role of AI-driven financial technologies in mediating credit assessment, repayment structuring, and debt support services. These systems increasingly shape consequential financial decisions, yet they operate within complex socio-technical environments characterised by regulatory constraint, algorithmic opacity, and heightened vulnerability risk. User Experience Research (UXR) Points of View (PoVs) are critical in translating heterogeneous research evidence into strategic direction for product and governance decisions. However, the existing UXR PoV framework was not designed for AI-mediated financial systems where interpretability, fairness, and accountability are central. This paper extends the UXR PoV pyramid into an AI-augmented methodological framework for Human-Centred AI debt management technologies in the UK financial services context. We formalise (1) an AI-Augmented PoV Pyramid, (2) a structured prompt architecture for synthesis and hypothesis generation, and (3) an AI-enabled Playbook Card system that embeds Generative AI into UXR workflows while preserving traceability and ethical oversight. Generative AI is positioned not as an analytic authority, but as an epistemic support mechanism subject to human validation and regulatory awareness. By grounding the framework in debt management technologies, including affordability assessment, repayment planning, and financial stress prediction systems, this work advances UXR methodology for high-stakes financial AI environments and contributes to the evolution of responsible, AI-powered UXR practice within the CHI community.

URL PDF HTML ☆

赞 0 踩 0

2605.31142 2026-06-01 cs.CL cs.AI 版本更新

On the Robustness of Multilingual Text Embedding Rankings Across Learning Tasks, Languages, and Benchmark Datasets

多语言文本嵌入排名在学习任务、语言和基准数据集上的鲁棒性

Ana Gjorgjevikj, Barbara Koroušić Seljak, Tome Eftimov

发表机构 * Computer Systems Department（计算机系统系）； Jožef Stefan Institute（乔泽夫·斯塔芬研究所）

AI总结通过引入数据集组成鲁棒性和排名方案鲁棒性指标，系统分析了MTEB中多语言模型排名对评估设计变化的敏感性，发现基于LLM的大模型通常是鲁棒的顶尖模型，但并非在所有任务中一致。

详情

AI中文摘要

大规模多语言文本嵌入模型在研究和工业中扮演着关键角色，但它们在特定语言、多任务设置中的行为仍未被充分理解。尽管像MTEB这样的基准平台报告了超过250种语言的结果，但关于模型优越性的结论往往依赖于数据集组成和性能聚合方法的隐含选择。为了解决这一差距，我们对MTEB中的多语言模型性能鲁棒性进行了元研究，应用了多种多准则决策制定排名方案，并引入了两个鲁棒性指标：数据集组成鲁棒性（排名对数据集组成变化的敏感性）和排名方案鲁棒性（对聚合方法变化的敏感性）。它们使得系统性地分析基准结论在不同评估设计下是否保持稳定成为可能。我们对五种语言（英语、法语、德语、印地语和西班牙语）在九个任务（例如分类、聚类、检索）上进行了深入分析，并发布了约230种额外语言的结果。任务特定分析表明，基于大规模LLM的模型通常是鲁棒的顶尖表现者，尽管并非一致（例如在检索任务中），而任务无关的结果显示，只有一小部分模型在任务、排名方案和数据子样本中始终保持强劲。

英文摘要

Large-scale multilingual text embedding models play crucial role in both research and industry, yet their behavior in language-specific, multi-task settings remains insufficiently understood. Although benchmarking platforms such as MTEB report results across more than 250 languages, conclusions about model superiority often depend on implicit choices of dataset compositions and performance aggregation methods. To address this gap, we present a meta-study of multilingual model performance robustness in MTEB, applying a diverse set of multi-criteria decision-making ranking schemes and introducing two robustness indicators: dataset-composition robustness (sensitivity of rankings to changing dataset compositions) and ranking-scheme robustness (sensitivity to aggregation method change). They enable systematic sensitivity analysis of whether benchmarking conclusions remain stable under different evaluation designs. We conduct an in-depth analysis on five languages (English, French, German, Hindi, and Spanish) across nine tasks (e.g., classification, clustering, retrieval) and release results for approximately 230 additional languages. The task-specific analyses show that large-scale LLM-based models are often robust top performers, though not uniformly (e.g., in retrieval task), while task-agnostic results reveal that only a small subset of models remains consistently strong across tasks, ranking schemes, and data subsamples.

URL PDF HTML ☆

赞 0 踩 0

2605.31138 2026-06-01 cs.HC cs.AI 版本更新

Developing an AI-Powered UX Research Point of View for Digital Health in A Regulatory Context: An Exemplar Case from MSM and Transgender HIV Care in Nigeria

在监管背景下开发AI驱动的用户体验研究视角：以尼日利亚MSM和跨性别者HIV护理为例

Emmanuel Oluwatosin Oluokun, Festus Fatai Adedoyin, Huseyin Dogan, Nan Jiang, Melike Akca, Abiodun Adedeji, Olumuyiwa Ayorinde, Fatima Ahmad Muazu

发表机构 * School of Computing and Engineering, Bournemouth University（伯恩茅斯大学计算与工程学院）

AI总结本文提出一种生成式AI增强的用户体验研究方法论，通过四阶段UXR流程和十张理论驱动的UXR游戏卡，指导尼日利亚男男性行为者（MSM）和跨性别者HIV护理中数字健康干预的设计，核心贡献是可复制的、关注污名和隐私的负责任GenAI使用框架。

详情

AI中文摘要

在法律和监管背景下的用户体验研究（UXR）面临独特挑战，需要专门的方法来保护弱势群体，同时产生可操作的见解。数字咨询、预约和药物配送平台在扩展护理可及性方面显示出前景；然而，它们的实际有效性因缺乏充分考虑到这些人群心理社会状况的、基于理论的用户体验研究方法论而受到限制。本文介绍了一种生成式AI增强的UXR方法论，基于UXR视角（PoV）剧本，指导为尼日利亚感染HIV/AIDS的男男性行为者（MSM）和跨性别者设计心理安全、低认知负荷的数字健康干预措施。基于涉及协同设计工作坊、主题分析和需求工程的实证研究，该方法论通过一个四阶段UXR过程实现，包括AI支持的假设生成、基础规划、通过构建模块生成洞察以及构建利益相关者特定的PoV叙述。该过程产生了十张理论驱动的UXR游戏卡，将心理机制和实证发现转化为可操作的设计指导。每张游戏卡包含可操作的任务、AI增强的方法和针对边缘化人群研究的伦理护栏。输出是一套十张理论驱动的UXR游戏卡，将心理洞察和实证证据转化为可操作的设计指导。核心贡献是一个可复制的、关注污名和隐私的框架，用于在UXR实践中负责任地使用GenAI，推进边缘化社区的人本数字健康设计。

英文摘要

User Experience Research (UXR) in a legal and regulatory contexts presents unique challenges that require specialised approaches to protect vulnerable populations whilst generating actionable insights. Digital consultation, appointment booking, and medication delivery platforms show promise for extending care access; however, their real-world effectiveness is curtailed by an absence of theoretically grounded user experience research (UXR) methodologies that adequately account for the psychosocial conditions of these populations. This paper introduces a Generative AI-augmented UXR methodology, grounded in the UXR Point of View (PoV) Playbook, to guide the design of psychologically safe, low-cognitive-load digital health interventions for MSM and transgender individuals living with HIV/AIDS in Nigeria. Drawing from empirical research involving co-design workshops, thematic analysis, and requirements engineering, the methodology is operationalised through a four-stage UXR process encompassing AI-supported hypothesis generation, foundational planning, insight generation via Building Blocks, and the construction of stakeholder-specific PoV narratives. This process results in ten theory-informed UXR Play Cards that translate psychological mechanisms and empirical findings into actionable design guidance. Each play contains actionable tasks, AI-augmented approaches, and ethical guardrails tailored for research with marginalised populations. The output is a set of ten theory-informed UXR Play Cards translating psychological insight and empirical evidence into actionable design guidance. The core contribution is a replicable, stigma-aware, and privacy-centred framework for responsible GenAI use in UXR practice, advancing human-centred digital health design for marginalised communities.

URL PDF HTML ☆

赞 0 踩 0

2605.31131 2026-06-01 cs.HC cs.AI 版本更新

UXR PoV for Neuroinclusive Emotion Regulation

神经包容性情绪调节的用户体验研究观点

Melike Akca, Mona Giff, Deniz Cetinkaya, Huseyin Dogan, Stephen Giff

发表机构 * School of Computing and Engineering, Bournemouth University（伯恩茅斯大学计算机与工程学院）； Google Redmond, Washington, USA（谷歌红mond分公司）

AI总结本文提出一种生成式AI增强的用户体验研究方法，结合DBT、SDT和COM-B理论框架，通过四阶段流程生成十张UXR游戏卡，为ADHD成人设计神经包容性的数字情绪调节干预。

详情

AI中文摘要

注意缺陷/多动障碍（ADHD）是一种精神疾病，表现为个体在注意力不集中、多动和冲动方面的发展不适当模式，并在决策和情绪调节（ER）方面存在困难。尽管基于数字和人工智能的干预措施扩大了情绪调节支持的获取途径，但许多现有系统仍受限于理论整合薄弱、对神经多样性的适应不足以及缺乏将心理学洞察与设计实践相结合的结构化用户体验研究（UXR）方法。本文介绍了一种生成式AI增强的UXR方法，以UXR观点（PoV）剧本为基础，支持为ADHD成人设计具有情感智能和神经包容性的数字情绪调节干预。该方法将实证证据与既定心理学框架——辩证行为疗法（DBT）、自我决定理论（SDT）和COM-B行为模型相结合，并利用生成式AI作为协同分析工具，支持综合、假设形成和设计阐述。该方法通过四阶段UXR流程实施，包括AI支持的假设生成、基础规划、通过构建模块生成洞察以及构建利益相关者特定的PoV叙事。该流程产生了一套十张理论驱动的UXR游戏卡，将心理机制和实证发现转化为可操作的设计指导。本研究的主要贡献是一个可复制的、具有偏差意识的框架，用于将生成式AI整合到UXR实践中，推进数字心理健康设计中以人为本和神经包容性的方法。

英文摘要

Attention-deficit/hyperactivity disorder (ADHD) is a psychiatric disorder which presents itself in individuals through patterns of developmentally inappropriate levels of inattentiveness, hyperactivity, and impulsivity, with difficulties in decision making and emotional regulation (ER). Although digital and AI-based interventions have expanded access to ER support, many existing systems remain limited by weak theoretical integration, insufficient accommodation of neurodiversity, and a lack of structured user experience research (UXR) methodologies, that bridge psychological insight with design practice. This paper introduces a Generative AI-augmented UXR methodology, grounded in the UXR Point of View (PoV) Playbook, to support the design of emotionally intelligent and Neuroinclusive digital ER interventions for adults with ADHD. The approach integrates empirical evidence with established psychological frameworks Dialectical Behaviour Therapy (DBT), Self-Determination Theory (SDT), and the COM-B behavioural model and leverages Generative AI as a co-analytic tool to support synthesis, hypothesis formation, and design articulation. The methodology is operationalized through a four-stage UXR process encompassing AI-supported hypothesis generation, foundational planning, insight generation via Building Blocks, and the construction of stakeholder-specific PoV narratives. This process results in a set of ten theory informed UXR Play Cards that translate psychological mechanisms and empirical findings into actionable design guidance. The primary contribution of this work is a replicable, bias-aware framework for integrating Generative AI into UXR practice, advancing human-centred and Neuroinclusive approaches to digital mental health design.

URL PDF HTML ☆

赞 0 踩 0

2605.31126 2026-06-01 cs.CL cs.AI cs.LG 版本更新

Not All Synthetic Data Is Yours to Learn From

并非所有合成数据都适合学习

Sina Alemohammad, Li Chen, Richard G. Baraniuk, Zhangyang Wang

发表机构 * ECE Department（电子工程系）； Apple（苹果公司）； The University of Texas at Austin（德克萨斯大学奥斯汀分校）； Rice University（里奇大学）

AI总结研究无提示、无教师、无验证器、无奖励模型的自训练中，语言模型能否从自身生成的文本中学习，发现合成数据与学生之间的兼容性是关键，并揭示了能力与逐字记忆可分离的现象。

详情

AI中文摘要

语言模型能否从自身采样的纯文本中改进，无需提示、教师、验证器或奖励模型？可以，但仅当合成语料库与学生兼容时，这是一种源-学生对的关联属性，而非数据的内在属性。我们称之为潜在能力重现假说：弱自训练可以放大预训练模型中已有的能力，但仅在这种兼容条件下。我们在无提示无条件自训练的最小设置中研究这一点，其中基础语言模型仅在BOS令牌生成的文本上进行微调，没有任务规范或外部监督。我们报告三个发现。首先，合成效用是关联的而非内在的：自生成数据是最有效的来源，同源迁移优于更强但不同来源的训练，跨家族迁移显著较弱。其次，常见的内在代理失效：基准级别的语义相似性和学生下的平均每令牌似然都不能预测哪些语料库有帮助。第三，这种机制产生了一个令人惊讶的副产品。在受控的Pythia实验中，能力和逐字记忆解耦：基准效用得以保留或改善，而保留的精确匹配提取下降超过95%，无需遗忘集、隐私目标或针对性遗忘。总之，这些结果表明，无提示自训练通过放大学生已知的内容来工作，而不是从数据中导入结构。它们还揭示了一种无需任何显式遗忘目标即可分离能力和逐字记忆的机制。

英文摘要

Can a language model improve from plain text sampled from itself, with no prompts, no teacher, no verifier, and no reward model? Yes, but only when the synthetic corpus is compatible with the student, a relational property of the source-student pair rather than an intrinsic property of the data. We call this the latent capability resurfacing hypothesis: weak self-training can amplify capabilities already present in the pretrained model, but only under this compatibility condition. We study this in the minimal setting of prompt-free unconditional self-training, where base language models are fine-tuned on text generated from the BOS token alone, with no task specification or external supervision. We report three findings. First, synthetic utility is relational rather than intrinsic: self-generated data is the most effective source, same-lineage transfer outperforms stronger but differently trained sources, and cross-family transfer is substantially weaker. Second, common intrinsic proxies fail: neither benchmark-level semantic similarity nor average per-token likelihood under the student predicts which corpora help. Third, this regime produces a surprising byproduct. In controlled Pythia experiments, capability and verbatim memorization decouple: benchmark utility is preserved or improved while held-out exact-match extraction drops by over 95 percent, with no forget set, privacy objective, or targeted unlearning. Together, these results suggest that prompt-free self-training works by amplifying what the student already knows, not by importing structure from the data. They also reveal a regime in which capability and verbatim memorization can be separated without any explicit unlearning objective.

URL PDF HTML ☆

赞 0 踩 0

2605.31121 2026-06-01 cs.RO cs.AI 版本更新

TARIC: Memory-Augmented Traversability-Aware Outdoor VLN under Interrupted Semantic Cues

TARIC: 语义线索中断下基于记忆增强的可通行性感知户外视觉语言导航

Tianle Zeng, Hanjing Ye, Jianwei Peng, Jingwen Yu, Hanxuan Chen, Hong Zhang

发表机构 * Shenzhen Key Laboratory of Robotics and Computer Vision（深圳机器人与计算机视觉重点实验室）； Southern University of Science and Technology（南方科技大学）； CKS Robotics Institute（CKS机器人研究所）； Hong Kong University of Science and Technology（香港科技大学）； College of Electrical and Information Engineering（电气与信息工程学院）

AI总结针对户外视觉语言导航中语义线索中断导致导航退化的问题，提出统一框架，通过可通行性一致的执行引导和不确定性感知的3D线索记忆，在长时间无线索阶段维持稳定导航，在四足和轮式平台上成功率提升显著。

详情

AI中文摘要

户外视觉语言导航（VLN）在远程、开放世界环境中经常受到语义线索中断的干扰，此时信息性目标线索变得稀疏、被遮挡或离开视野。一旦此类线索消失，智能体进入无线索阶段，并常退化为回溯、振荡航向或盲目探索。虽然基于记忆的方法试图弥合这些间隙，但在可通行性驱动的绕行中常常失败：记忆中的线索方向可能不可行，迫使绕行延长无线索阶段，并逐渐使机器人中心的线索过时、隐式历史模糊。这使得可通行性成为维持目标导向引导的稳定性条件，而不仅仅是局部安全问题。我们提出一个统一的户外VLN框架，通过在长时间无线索阶段维持可通行性一致的可执行引导来应对语义线索中断。具体来说，我们的方法从可见性门控的目标或探索线索中提取语义方位，并利用实时近场可通行性轮廓将其接地为可执行航向，提供超越仅拒绝安全过滤的目标一致可行引导。为防止绕行期间引导退化，我们将间歇性2D证据提升为世界对齐的3D线索记忆，并配备不确定性感知读出机制，确保引导在机器人移动时持续可达且稳定。我们在四足和轮式平台上评估该框架，路线长度为600-1000米。我们的方法在模拟中成功率比最强基线提高超过10个百分点，真实世界成功率达到40%，而最强基线为17.5%，且在长时间无线索间隔中具有显著更高的鲁棒性。

英文摘要

Outdoor vision-language navigation (VLN) in long-range, open-world environments is frequently disrupted by semantic-cue interruptions, where informative goal cues become sparse, occluded, or leave the field of view. Once such cues disappear, agents enter a cue-free phase and often degrade into backtracking, oscillatory headings, or aimless exploration. While memory-based methods attempt to bridge these gaps, they often fail under traversability-driven detours: the remembered cue direction may be infeasible, forcing detours that prolong cue-free phases and gradually render robot-centric cues stale and implicit histories blurred. This makes traversability a stability condition for maintaining goal-directed guidance, rather than merely a local safety concern. We propose a unified outdoor VLN framework that survives semantic-cue interruptions by maintaining traversability-consistent executable guidance throughout prolonged cue-free phases. Specifically, our method extracts semantic bearings from visibility-gated goal or exploration cues and grounds them into executable headings using a real-time near-field traversability profile, providing goal-consistent feasible guidance beyond reject-only safety filtering. To prevent guidance degradation during detours, we lift intermittent 2D evidence into a world-aligned 3D cue memory with an uncertainty-aware readout mechanism, ensuring guidance remains continuously reachable and stable as the robot moves. We evaluate the framework on quadrupedal and wheeled platforms over 600--1000 m routes. Our method improves simulation success rate by over 10 percentage points over the strongest baseline and achieves a real-world success rate of 40%, compared to 17.5% for the strongest baseline, with substantially higher robustness during prolonged cue-free intervals.

URL PDF HTML ☆

赞 0 踩 0

2605.31120 2026-06-01 cs.GR cs.AI cs.LG 版本更新

SWIM: Single-Instance Whole-Body Imitation for swiMming

SWIM: 用于游泳的单实例全身模仿

Binglun Wang, Edmond S. L. Ho, He Wang

发表机构 * University College London（伦敦大学学院）； University of Glasgow（格拉斯哥大学）

AI总结提出一种基于物理的游泳动作合成方法SWIM，通过单实例模仿学习实现全身协调与流体连续交互，在数据效率、稳定性、鲁棒性和泛化性上优于现有方法。

详情

AI中文摘要

我们提出了一种合成基于物理的游泳动作的新方法。基于物理的角色动画旨在生成物理有效、可控且自然的动作，能够应对意外干扰，其中难度的一个决定性因素是任务的复杂性，尤其是与所需环境交互的复杂程度。现有研究已在静态和动态环境中的各种任务上取得成功。我们进一步将难度推向游泳，这需要全身协调和与流体的持续交互，这是与环境交互时的一个新复杂性层次。这种复杂性在学习控制时面临挑战，包括在易变的环境力下的控制学习、将控制泛化到不同环境和游泳风格、缺乏数据参考，以及在控制学习过程中不可避免的极其缓慢的物理模拟。为此，我们提出了SWIM，一种新的游泳动作模仿方法，它可以从单个游泳动作中学习，并泛化到未见过的环境、身体条件和游泳风格。广泛的评估和比较表明，SWIM具有数据效率高、稳定、鲁棒和可泛化的特点，在多个任务类别和指标上优于替代方法。

英文摘要

We propose a new method for synthesizing physically-based swimming motions. Physically-based character animation aims to generate physically valid, controllable, and natural-looking motions which can respond to unexpected disturbances, where one dictating factor of difficulty is the complexity of the task, especially the level of sophistication of the required interactions with the environment. Existing research has succeeded in various tasks in static and dynamic environments. We push the difficulty further to swimming, which requires full-body coordination and continuous interactions with fluids, a new level of complexity when it comes to interacting with the environment. This complexity imposes challenges in learning control under volatile environmental forces, generalizing control to different environments and swimming styles, lack of data references, and prohibitively slow physical simulation which is inevitable during control learning. To this end, we propose SWIM, a new imitation method for swimming motions, which can learn from a single swimming motion and generalize to unseen environments, body conditions, and swimming styles. Extensive evaluation and comparison demonstrate that SWIM is data-efficient, stable, robust, and generalizable, outperforming alternative methods across multiple classes of tasks and metrics.

URL PDF HTML ☆

赞 0 踩 0

2605.31100 2026-06-01 cs.AI cs.DB cs.IR 版本更新

Vector Linking via Cross-Model Local Isometric Consistency

通过跨模型局部等距一致性的向量链接

Ziying Chen, Yang Cao, He Sun, Beining Yang, Tianjian Yang

发表机构 * School of Informatics, University of Edinburgh, Edinburgh, United Kingdom（爱丁堡大学信息学院，爱丁堡，英国）； Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China（深圳先进技术研究院，深圳，中国）

AI总结提出一种基于局部几何一致性的迭代参考几何嵌入哈希方法，从少量种子锚点恢复跨模型向量对应关系，实现准确鲁棒的向量链接。

Comments Accepted at ICML 2026

详情

AI中文摘要

我们研究向量链接：给定由不同黑盒编码器在部分重叠数据集上生成的两个嵌入云，仅使用向量恢复跨模型对象对应关系。实验和理论上表明，独立训练的对比编码器表现出局部几何一致性：短距离近似保持（按比例因子），而长距离因模型特定失真而不保持。基于此，我们提出一种迭代的、基于参考的几何嵌入哈希方法，从微小的种子锚点集恢复向量链接。它通过到采样配对锚点的距离表示每个向量，通过哈希空间匹配提出候选链接，并在Beta-Bernoulli后验中跨视图聚合证据，以引导高置信度链接作为新锚点。在多个基准测试和嵌入模型对上的实验表明，该方法在不同重叠度、种子预算和域外锚点下实现准确且鲁棒的链接，并应用于向量数据库集成和跨模型聚类。代码见https://github.com/DBgroup-Edinburgh/VecLinking。

英文摘要

We study Vector Linking: given two embedding clouds produced by different black-box encoders over partially overlapping datasets, recover cross-model object correspondences using only vectors. Empirically and theoretically, we show that independently trained contrastive encoders exhibit local geometric consistency: short-range distances are approximately preserved up to a scale factor, while long-range distances are not due to model-specific distortion. Building on this, we propose an iterative, reference-based geometric embedding hashing that recovers vector links from a tiny seed set of paired anchors. It represents each vector by distances to sampled paired anchors, proposes candidate links via hash-space matching, and aggregates evidence across views in a Beta-Bernoulli posterior to bootstrap high-confidence links as new anchors. Experiments across multiple benchmarks and embedding model pairs demonstrate accurate and robust linking under varying overlap, seed budgets, and out-of-domain anchors, with applications to vector database integration and cross-model clustering. Code is available at https://github.com/DBgroup-Edinburgh/VecLinking.

URL PDF HTML ☆

赞 0 踩 0

2605.31099 2026-06-01 cs.CL cs.AI 版本更新

KnowledgeGain: Evaluating and Optimizing Science News Generation for Reader Learning

KnowledgeGain: 评估和优化面向读者学习的科学新闻生成

Dominik Soós, Meng Jiang, Jian Wu

发表机构 * Old Dominion University（旧 Dominion 大学）； University of Notre Dame（诺特大学）

AI总结提出KnowledgeGain指标，通过测量读者知识增益来评估科学新闻质量，并利用LLM模拟器优化生成，提升读者学习效果。

详情

AI中文摘要

科学新闻是研究界与公众之间传播发现的重要媒介。然而，大多数用于生成或摘要文本的指标评估语义相似性和事实一致性，但并未衡量读者从新闻中学到了多少知识。我们引入了KnowledgeGain，这是一个通过测量读者阅读后获得的知识量来评估科学新闻质量的指标。为了评估该指标，我们首先进行了一项受控人类研究，表明该指标成功捕捉了人类读者阅读不同类型科学媒体时获得的知识差异。这些数据使我们能够校准一个仅基于提示的LLM读者模拟器。我们用它来在人类评估之前对候选文章进行排序和过滤。第二项人类研究表明，使用该模拟器选择的文章在阅读后准确性和标准化KnowledgeGain上均优于强生成基线。我们的工作是朝着生成更符合Bloom分类法知识和理解目标的科学新闻迈出的一步。

英文摘要

Science news is an important medium to communicate discoveries between the research communities and the public. Yet, most metrics for generated or summarized text evaluate semantic similarity and factual consistency, but do not measure how much knowledge readers learn from the news. We introduce KnowledgeGain, a metric that evaluates the quality of science news by measuring how much knowledge readers gained after reading it. To evaluate the metric, we first performed a controlled human study and showed that the metric successfully captures the differential knowledge gained by human readers reading different types of science media. The data allowed us to calibrate a prompt-only LLM reader simulator. We use it to rank and filter candidate articles before human evaluation. A second human study shows that articles selected with this simulator improve post-reading accuracy and normalized KnowledgeGain over a strong generation baseline. Our work is a step toward generating science news that better meets the knowledge and comprehension goals of Bloom's Taxonomy.

URL PDF HTML ☆

赞 0 踩 0

2605.31097 2026-06-01 cs.DB cs.AI 版本更新

STEP：学习渐进时间序列的结构化嵌入

Lucas Thil, Jesse Read, Rim Kaddah, Guillaume Doquet

发表机构 * LIX, École Polytechnique（高等理工学院LIX）； IRT SystemX（系统X研究院）； Safran Tech（萨弗兰科技）

AI总结提出一种自监督对比学习方法，通过构建具有固定正交原型向量的低维流形几何结构，实现渐进时间序列的端状态预测、多步预测和可解释相位分离。

详情

AI中文摘要

我们提出了一种新颖的方法，用于学习渐进时间序列的可解释表示，即捕获不可逆状态转换（如退化或任务完成）的数据。我们的方法使用自监督对比目标来学习低维潜在空间，其几何结构本身就是解释：每个观测成为位于两个固定正交原型向量之间的流形上的一个点，轨迹成为穿过该流形的路径。从这种结构中，我们读取一个潜在指南针，即潜在向量的极坐标(θ, r)，其中θ跟踪潜在状态的进展（例如，从健康到故障），r识别活动模式（例如，操作条件），无需任何代理标签。我们在不同领域（包括工业退化、机器人任务和神经活动）上评估了该方法与最先进方法的对比，验证了三个关键能力：（1）端状态预测，（2）多步预测，以及（3）可解释的相位分离。我们的方法在所有方面匹配或优于黑盒对应方法，同时提供对底层机制的透明性。在潜在指南针坐标之上的简单线性回归器与深度架构具有竞争力，这是底层状态以几何可访问形式编码的直接定量证据。

英文摘要

We present a novel method for learning interpretable representations of progressive time series, that is, data capturing irreversible state transitions such as degradation or task completion. Our approach uses a self-supervised contrastive objective to learn a low-dimensional latent space whose geometry is itself the interpretation: each observation becomes a point on a manifold anchored between two fixed orthogonal prototype vectors, and a trajectory becomes a path across that manifold. From this structure we read a latent compass, the polar coordinates (θ, r) of the latent vector, in which θ tracks the progression of the underlying state (e.g., from healthy to failed) and r identifies the active mode (e.g., the operating condition), without any proxy labels. We evaluate the approach against the state of the art on diverse domains, including industrial degradation, robotic tasks, and neural activity, validating three key capabilities: (1) end-state prediction, (2) multi-step forecasting, and (3) interpretable phase separation. Our method matches or improves over black-box counterparts on all of these while providing transparency about the underlying mechanisms. A simple linear regressor on top of the latent compass coordinates is competitive with deep architectures, direct quantitative evidence that the underlying state is encoded in a geometrically accessible form.

URL PDF HTML ☆

赞 0 踩 0

2605.31053 2026-06-01 cs.SD cs.AI 版本更新

AnchorSteer: Self-Discovered Concept Injection for Structure-Preserving Music Editing

AnchorSteer: 自发现概念注入用于结构保持的音乐编辑

Chih-Heng Chang, Keng-Seng Ho, Chih-Yu Tsai, Kuan-Lin Chen, Yi-Hsuan Yang, Jian-Jiun Ding

发表机构 * National Taiwan University（国立台湾大学）

AI总结提出AnchorSteer框架，通过结构锚定与自发现语义注入解耦语义-结构纠缠，实现高保真结构保持下的显著语义变换。

Comments Accepted by the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2026)

详情

AI中文摘要

可控音乐编辑旨在修改高级属性，同时严格保留节奏和旋律结构。然而，这一任务面临语义-结构纠缠的挑战：引导方法往往为了编辑性能而牺牲结构，而结构适配器则抑制语义响应。我们提出AnchorSteer，一个通过将结构锚定与自发现语义引导耦合来解耦这种张力的框架。该方法通过自监督重构目标探测内部表示，提取可解释、无标签的概念向量，无需精心策划的数据即可隔离属性。在编辑过程中，这些便携、即插即用的概念向量被注入扩散隐空间，同时结构适配器强制执行一致性。提供了无条件和条件注入的变体，以平衡鲁棒性和语义强度。在ZoME-Bench和主观测试上的实验表明，所提出的框架优于纯引导和纯锚定的基线，实现了高保真结构保持下的显著语义变换。

英文摘要

Controllable music editing is to modify high-level attributes while strictly preserving rhythmic and melodic structures. However, this task is challenged by a semantic-structural entanglement: steering methods often degrade structure to achieve editing performance, while structural adaptors suppress semantic responsiveness. We propose AnchorSteer, a framework that disentangles this tension by coupling structural anchoring with self-discovered semantic steering. The proposed approach probes internal representations to extract interpretable, label-free concept vectors via a self-supervised reconstruction objective, isolating attributes without curated data. During editing, these portable, plug-and-play concept vectors are injected into diffusion hidden manifolds while a structural adaptor enforces consistency. Variants for unconditioned and conditioned injections are provided to balance robustness and semantic strength. Experiments on ZoME-Bench and subjective tests show that the proposed framework outperforms both steering-only and anchoring-only baselines, enabling significant semantic transformations with high-fidelity structural preservation.

URL PDF HTML ☆

赞 0 踩 0

2605.31049 2026-06-01 cs.LG cs.AI cs.LO 版本更新

Learning to Solve and Optimize by Evolving Code

通过代码演化学习求解与优化

Veronika Semmelrock, Benedetta Strizzolo, Francesco Zuccato, Gerhard Friedrich, Patrick Rodler, Konstantin Schekotihin

发表机构 * University of Klagenfurt（克雷格福大学）； University of Udine（乌迪大学）

AI总结提出CHECKMATE工具，利用形式规范确保解的正确性并通过自然语言描述指导代码演化，自动生成算法，在配置与调度问题上超越最先进求解器。

Comments Preprint of a paper accepted to IJCAI26

详情

AI中文摘要

组合与优化问题是许多工业AI应用的基础。解决此类大规模现实世界实例通常需要仔细的问题形式化、专门的求解器以及专家设计的启发式方法。因此，专家不仅需要指定解是什么，还需要指定如何推导出解。通过引入工具CHECKMATE，我们展示了通过代码演化生成算法代表了一种范式转变，消除了制定如何的需求。CHECKMATE仅依赖于是什么。具体来说，形式规范确保了解的正确性，并能够对生成的程序进行系统性能评估，而自然语言描述则指导演化过程。我们的方法在两个工业领域（配置与调度）的选定问题上展示了有效性。在所有案例中，演化出的算法始终优于最先进的求解器。这凸显了形式方法在引导代码演化以自动解决复杂现实问题方面的潜力。

英文摘要

Combinatorial and optimization problems are fundamental to many industrial AI applications. Solving large-scale real-world instances of such problems typically requires careful problem formalization, specialized solvers, and expert-designed heuristics. Thus, experts need to specify not only what solutions are, but also how they are derived. By introducing the tool CHECKMATE, we show that algorithm generation via code evolution represents a paradigm shift by eliminating the need to formulate the how. CHECKMATE solely relies on the what. Specifically, a formal specification ensures solutions' correctness and enables systematic performance evaluation of the generated programs, while a natural language description guides the evolutionary process. The effectiveness of our method is demonstrated on selected problems from two industrial domains: configuration and scheduling. In all cases, the evolved algorithms consistently outperform state-of-the-art solvers. This underscores the potential of formal methods in guiding code evolution for automatically solving complex real-world problems.

URL PDF HTML ☆

赞 0 踩 0

2605.31043 2026-06-01 stat.ML cs.AI cs.LG 版本更新

Routing on the Stiefel Manifold: When Does Adaptive Subspace Selection Help for Cross-Domain EEG Decoding?

Stiefel流形上的路由：自适应子空间选择何时有助于跨域脑电解码？

Isabella Costa Maia, Pedro L. C. Rodrigues, Salem Said, Marco Congedo

发表机构 * GIPSA-lab, University Grenoble Alpes, CNRS, Grenoble-INP（GIPSA实验室，格勒诺布尔阿尔卑斯大学，法国国家科学研究中心，格勒诺布尔-INP）； Univ. Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK（格勒诺布尔阿尔卑斯大学，法国国家信息与自动化研究所，法国国家科学研究中心，格勒诺布尔-INP，LJK）； Univ. Grenoble Alpes, CNRS, Grenoble INP, LJK（格勒诺布尔阿尔卑斯大学，法国国家科学研究中心，格勒诺布尔-INP，LJK）

AI总结针对跨域脑电解码中协方差矩阵域偏移问题，提出动态Stiefel路由方法，通过Stiefel流形上的专家投影滤波器池和交叉注意力机制实现自适应子空间选择，并引入三种结构性质避免退化为集成平均，在三个数据集上取得一致提升。

详情

AI中文摘要

尽管黎曼深度学习取得了进展，跨域脑电解码仍然具有挑战性：来自不同受试者的协方差矩阵占据了SPD流形上系统不同的区域，然而现有的域适应方法要么需要目标域校准数据，要么学习无法跨域泛化的受试者特定组件。我们提出了动态Stiefel路由：在Stiefel流形上有一个包含$K$个专家投影滤波器的池，每个滤波器专门处理SPD流形上的不同区域，每个输入协方差通过交叉注意力路由到最合适的滤波器，从而为每个样本自适应调整子空间投影。一个核心发现是，这种朴素实现的方法会退化为集成平均：当路由权重均匀时，自适应滤波器恰好等价于专家的等贡献组合，与单个固定滤波器无法区分。三种结构性质打破了这种退化：一个对称锚点$W_{\mathrm{base}} \in \mathrm{St}(n,k)$消除了专家间的邻近偏差；一个冻结的域判别查询编码器将路由与任务优化解耦；以及一个解耦的键对齐损失，将专家键训练到稳定的域吸引子。它们共同产生了SPD流形上第一个真正承诺且域结构化的路由，在三个数据集上取得一致提升：平衡准确率分别从$0.773\to 0.823$、$0.757\to 0.809$和$0.801\to 0.839$，且对齐策略由单一数据驱动规则自动确定，无需数据集特定的超参数搜索。

英文摘要

Cross-domain EEG decoding remains challenging despite advances in Riemannian deep learning: covariance matrices from different subjects occupy systematically distinct regions of the SPD manifold, yet existing domain adaptation methods either require target-domain calibration data or learn subject-specific components that cannot generalise across domains. We propose dynamic Stiefel routing: a pool of $K$ expert projection filters on the Stiefel manifold, each specialised for a different region of the SPD manifold, with each input covariance routed to the most appropriate filter via cross-attention, adapting the subspace projection per sample. A central finding is that this approach, implemented naively, provably collapses to ensemble averaging: when routing weights are uniform, the adaptive filter reduces exactly to an equal-contribution combination of experts, indistinguishable from a single fixed filter. Three structural properties break this degeneracy: a symmetric anchor $W_{\mathrm{base}} \in \mathrm{St}(n,k)$ that removes proximity bias among experts; a frozen domain-discriminative query encoder that decouples routing from task optimisation; and a decoupled key alignment loss that trains expert keys toward stable domain attractors. Together they produce the first genuinely committed and domain-structured routing on SPD manifolds, with consistent gains across three datasets: balanced accuracy improves from $0.773\to 0.823$, $0.757\to 0.809$, and $0.801\to 0.839$, with the alignment strategy determined automatically by a single data-driven rule and no dataset-specific hyperparameter search.

URL PDF HTML ☆

赞 0 踩 0

2605.31042 2026-06-01 cs.CR cs.AI cs.CL 版本更新

From Prompt Injection to Persistent Control: Defending Agentic Harness Against Trojan Backdoors

从提示注入到持久控制：防御智能体框架中的木马后门

Jiejun Tan, Zhicheng Dou, Xinyu Yang, Yuyang Hu, Yiruo Cheng, Xiaoxi Li, Ji-Rong Wen

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China（中国人民大学人工智能学院 Gallagher 学院）

AI总结本文提出ClawTrojan基准测试揭示本地智能体框架中的多步木马攻击，并设计DASGuard防御方法，通过扫描控制文本、追溯来源并清除不可信控制内容，实现动态防御。

Comments Code and data are available at https://github.com/RUC-NLPIR/ClawTrojan

详情

AI中文摘要

LLM智能体正在从对话式聊天机器人演变为实际工作空间中的操作工具。在本地智能体框架中，LLM可以读写文件、调用工具，并在会话间重用工作空间状态。虽然这些功能增强了实用性，但也为攻击者暴露了新的攻击面。攻击者可以将提示注入嵌入文件或工具输出中。智能体可能会读取这一隐藏指令，存储它，并在之后执行。在这种多步木马攻击范式中，没有任何单个步骤本身是恶意的，但这些步骤可以共同将不可信文本转化为持久控制内容。然而，现有防御通常孤立地检查每个步骤。因此，它们可以阻止明显的恶意行为，但无法检测到植入后门的早期写操作。为了揭示这一威胁，我们引入了ClawTrojan，一个旨在识别本地智能体框架中多步木马攻击的基准测试。在OpenClaw风格的模拟工作空间中，使用GPT-5.4，ClawTrojan达到了95.5%的攻击成功率（ASR），而同一模型上现有的单轮提示注入攻击产生的ASR接近零。为了解决这一威胁，我们提出了DASGuard，它扫描敏感本地文件中的控制类文本，追溯其来源，并清除非可信来源的控制内容。我们的结果表明，DASGuard通过结合运行时攻击阻断和对工作空间的清理提交，实现了强大的动态防御。

英文摘要

LLM agents are evolving from conversational chatbots to operational tools in real-world workspaces. In local agentic harnesses, an LLM can read and write files, call tools, and reuse workspace state across sessions. While such capabilities enhance utility, they also expose a new attack surface for attackers. Attackers can embed a prompt injection within a file or tool output. Agents may read this hidden instruction, store it, and execute it later. In this multi-step trojan attack paradigm, no individual step appears malicious on its own, but these steps can collectively turn untrusted text into persistent control content. However, existing defenses often inspect each step in isolation. As a result, they can block a clear harmful action, but fail to detect the earlier write operation that plants the backdoor. To reveal this threat, we introduce ClawTrojan, a benchmark designed to identify multi-step trojan attacks in local agentic harnesses. In an OpenClaw-style simulated workspace with GPT-5.4, ClawTrojan reaches a 95.5% attack success rate (ASR), while existing single-turn prompt-injection attacks produce near-zero ASR on the same model. To address this threat, we propose DASGuard, which scans control-like text in sensitive local files, traces its origin, and removes control content that does not originate from a trusted source. Our results show that DASGuard achieves strong dynamic defense by combining runtime attack blocking with sanitized commits to the workspace.

URL PDF HTML ☆

赞 0 踩 0

2605.31041 2026-06-01 cs.CV cs.AI 版本更新

Does Visual Information Play a Decisive Role in Vision-Language-Action Model Driving Behavior?

视觉信息在视觉-语言-动作模型驾驶行为中是否起决定性作用？

Jingtao He, Hongliang Lu, Xiaoyun Qiu, Yixuan Wang, Xinhu Zheng

发表机构 * Intelligent Transportation Thrust, The Hong Kong University of Science and Technology (Guangzhou)（科技与交通智能 thrust，香港科学与技术大学（广州））

AI总结本文提出结构化多级视觉扰动框架，系统分析VLA驾驶模型对视觉信息的依赖程度，揭示依赖模式随评估方式变化且在不同抽象层次上不均匀。

详情

AI中文摘要

视觉-语言-动作（VLA）模型在自动驾驶中展现出令人期待的能力，凸显了统一多模态架构联合建模感知与规划的潜力。然而，当前基于VLA的驾驶行为如何植根于视觉信息仍知之甚少。现有评估协议主要关注聚合性能指标，缺乏结构化和实用的诊断方法来量化视觉-行为依赖性。在这项工作中，我们引入了一个结构化的多级视觉扰动框架，以系统分析基于VLA的驾驶模型中的视觉-行为依赖性。该框架沿着三个互补维度组织受控视觉扰动：通道级退化、信息级破坏和结构级修改。我们将其应用于基于VLA的驾驶系统，并在开环轨迹预测和交互式闭环安全评估下评估行为响应。实验揭示了依赖于评估的依赖模式以及跨抽象层次的不均匀视觉基础。这些发现呼吁对VLA驾驶模型进行更结构化的分析和原则性设计，以更好地理解视觉信息如何塑造行为，并开发更安全、更鲁棒的系统。

英文摘要

Vision-Language-Action (VLA) models have demonstrated promising capability in autonomous driving, highlighting the potential of unified multimodal architectures for jointly modeling perception and planning. However, how current VLA-based driving behavior is grounded in visual information remains poorly understood. Existing evaluation protocols mainly focus on aggregate performance metrics, lacking structured and practical diagnostics to quantify visual-behavior dependency. In this work, we introduce a structured multi-level visual perturbation framework to analyze visual-behavior dependency in VLA-based driving models systematically. The framework organizes controlled visual perturbations along three complementary dimensions: channellevel degradation, information-level disruption, and structurelevel modification. We apply it to VLA-based driving systems and evaluate behavioral responses under both open-loop trajectory prediction and interactive closed-loop safety evaluation. Experimental results reveal evaluation-dependent dependency patterns and uneven visual grounding across abstraction levels. These findings call for more structured analyses and principled design of VLA driving models to better understand how visual information shapes behavior and develop safer, more robust systems.

URL PDF HTML ☆

赞 0 踩 0

2605.31034 2026-06-01 cs.LG cs.AI 版本更新

Annealed Softmax Greedy in Many-Armed Bayesian Bandits

多臂贝叶斯老虎机中的退火Softmax贪婪算法

William Overman, Mohsen Bayati

发表机构 * Stanford University（斯坦福大学）

AI总结本文研究退火Softmax贪婪算法在多臂贝叶斯伯努利老虎机中的贝叶斯遗憾，证明在先验满足线性上尾条件（β=1的β正则性）时，算法达到接近最优的贝叶斯遗憾率，并与RLVR方法形成结构类比。

详情

AI中文摘要

具有可验证奖励的强化学习（RLVR）和基于组的策略优化方法（如GRPO）通过为每个提示采样多个完成并增加策略在奖励较高的完成上的概率来更新随机策略，同时通过KL惩罚向参考策略正则化。这些更新不包括追踪认知不确定性的显式机制。本文研究为何这种不确定性无关的更新仍然有效的一个风格化解释。我们分析了一个退火softmax（玻尔兹曼）策略，该策略在多臂贝叶斯伯努利老虎机中根据经验平均奖励的softmax选择动作。在先验满足线性上尾条件（β正则性的β=1情况）下，该条件意味着存在大量接近最优的臂，我们证明退火softmax贪婪算法实现了贝叶斯遗憾$ ilde{O}(m + T/m)$，特别地，当臂数$m = Θ(\sqrt{T})$时，遗憾为$ ilde{O}(\sqrt{T})$。这是该机制下接近最优的贝叶斯遗憾率，经验平均贪婪算法也能达到。在β正则性下，许多臂在整个学习过程中保持经验均值接近最优，因此当softmax采样一个非经验最优的臂时，该臂往往是另一个接近最优的臂，而不是明显较差的臂。相比之下，当臂数较少时，同类的softmax策略可能遭受线性遗憾。该结果也为RLVR提供了结构类比，其中以非可忽略概率产生正确完成的基础策略扮演了β正则性的角色。

英文摘要

Reinforcement learning with verifiable rewards (RLVR) and group-based policy optimization methods such as GRPO update a stochastic policy by sampling multiple completions per prompt and increasing the policy's probability on those with higher reward, regularized by a KL penalty toward a reference policy. These updates do not include explicit mechanisms that track epistemic uncertainty. This paper studies a stylized explanation for why such uncertainty-agnostic updates can nevertheless be effective. We analyze an annealed softmax (Boltzmann) policy that selects actions according to a softmax of empirical mean rewards in a many-armed Bayesian Bernoulli bandit. Under a linear upper-tail condition on the prior (the $β=1$ case of $β$-regularity), which implies an abundance of near-optimal arms, we prove that annealed softmax greedy achieves Bayes regret $\tilde{O}(m + T/m)$, and in particular $\tilde{O}(\sqrt{T})$ when the number of arms scales as $m = Θ(\sqrt{T})$. This is the near-optimal Bayes regret rate in this regime, attained also by empirical-mean greedy. Under $β$-regularity, many arms maintain empirical means close to the optimum throughout learning, so when softmax samples an arm other than the empirically best, that arm tends to be another near-optimal one rather than a clearly inferior one. By contrast, with a small number of arms, the same kind of softmax policy can suffer linear regret. The result also provides a structural analogy to RLVR, where a base policy with a non-negligible probability of producing a correct completion plays the role of $β$-regularity.

URL PDF HTML ☆

赞 0 踩 0

2605.31031 2026-06-01 cs.AI 版本更新

GraphARC: A Comprehensive Benchmark for Graph-Based Abstract Reasoning

GraphARC：基于图的抽象推理综合基准

Saku Peltonen, August Bøgh Rønberg, Andreas Plesner, Roger Wattenhofer

发表机构 * ETH Z\"urich Z\"urich Switzerland ； ETH Z\"urich

AI总结提出GraphARC基准，将抽象推理扩展到图结构数据，通过少样本变换学习任务评估模型在局部、全局和层次图变换上的泛化能力，并揭示语言模型的理解-执行差距和规模扩展障碍。

Comments Accepted at KDD 2026 Datasets and Benchmarks Track

详情

DOI: 10.1145/3770855.3817591

AI中文摘要

关系推理是智能的核心，但现有基准通常局限于网格或文本格式。我们引入了GraphARC，一个用于图结构数据抽象推理的基准。GraphARC推广了抽象与推理语料库（ARC）的少样本变换学习范式。每个任务需要从几个输入-输出对中推断变换规则，并将其应用于新的测试图，涵盖局部、全局和层次图变换。与基于网格的ARC不同，GraphARC实例可以在不同的图族和规模上大规模生成，从而能够系统评估泛化能力。我们在GraphARC上评估了最先进的语言模型，并观察到明显的局限性。模型能够回答关于图属性的问题，但往往无法解决完整的图变换任务，揭示了理解-执行差距。在更大实例上性能进一步下降，暴露了规模扩展障碍。更广泛地说，通过将节点分类、链接预测和图生成的方面结合在一个单一框架内，GraphARC为未来的图基础模型提供了一个有前景的测试平台。

英文摘要

Relational reasoning lies at the heart of intelligence, but existing benchmarks are typically confined to formats such as grids or text. We introduce GraphARC, a benchmark for abstract reasoning on graph-structured data. GraphARC generalizes the few-shot transformation learning paradigm of the Abstraction and Reasoning Corpus (ARC). Each task requires inferring a transformation rule from a few input-output pairs and applying it to a new test graph, covering local, global, and hierarchical graph transformations. Unlike grid-based ARC, GraphARC instances can be generated at scale across diverse graph families and sizes, enabling systematic evaluation of generalization abilities. We evaluate state-of-the-art language models on GraphARC and observe clear limitations. Models can answer questions about graph properties but often fail to solve the full graph transformation task, revealing a comprehension-execution gap. Performance further degrades on larger instances, exposing scaling barriers. More broadly, by combining aspects of node classification, link prediction, and graph generation within a single framework, GraphARC provides a promising testbed for future graph foundation models.

URL PDF HTML ☆

赞 0 踩 0

2605.31023 2026-06-01 cs.AI cs.LG cs.MA 版本更新

HADT: A Heterogeneous Multi-Agent Differential Transformer for Autonomous Earth Observation Satellite Cluster

HADT: 一种用于自主对地观测卫星集群的异构多智能体差分Transformer

Mohamad A. Hady, Muhammad Anwar Masum, Siyi Hu, Mahardhika Pratama, Jimmy Cao, Ryszard Kowalczyk

发表机构 * School of Computer Science and Information Technology, Adelaide University（计算机科学与信息科技学院，阿德莱德大学）； School of Electrical Engineering, Computing and Mathematical Sciences (EECMS), Curtin University（电气工程、计算与数学科学学院（EECMS）， Curtin大学）； Systems Research Institute, Polish Academy of Sciences（波兰科学院系统研究所）

AI总结针对异构卫星集群自主对地观测任务，提出基于Transformer的架构，通过关系观测-动作令牌化和差分注意力机制实现自适应实时资源管理，性能显著优于基线。

Comments Accepted in ECML-PKDD 2026. arXiv admin note: text overlap with arXiv:2511.12792

详情

AI中文摘要

本文解决了执行对地观测任务（包括光学和合成孔径雷达卫星）的异构卫星集群中的自主资源管理问题。在自主运行模式下，卫星配备智能能力，能够根据最新条件实时决策，同时最小化与地面操作员的交互。传统的调度方法通常依赖数学模型来表示卫星任务和资源管理，然后通过优化算法求解。然而，当底层模型不可用、过于复杂或因空间任务环境中的动态变化和不确定性而不准确时，此类解决方案效果不佳。一个有前景的替代方案是将问题重新表述为序列决策过程，并应用无模型强化学习技术来实现自适应和实时资源管理。为此，我们提出了一种新颖的基于Transformer的架构，专门针对异构卫星集群自主对地观测任务，采用关系观测-动作令牌化和差分注意力机制。我们的实验结果表明，与现有基线相比，性能有显著提升。此外，所提出的架构在不同卫星集群数量下表现出强大的适应性和可迁移性。

英文摘要

This work addresses the problem of autonomous resource management in heterogeneous satellite cluster conducting Earth Observation (EO) missions including optical and Synthetic Aperture Radar (SAR) satellites. In autonomous operation mode, satellites are equipped with intelligent capabilities enabling real-time decision-making based on the latest conditions, while requiring minimal interaction with ground operators. Traditional scheduling approaches typically rely on mathematical models to represent satellite mission and resource management. Then, this problem is solved by using optimization algorithms. However, such solutions become less effective when the underlying models are not available, over complex, and inaccurate due to dynamic changes and uncertainties inherent in the space mission environment. A promising alternative is to reformulate the problem as a sequential decision-making process and apply model-free reinforcement learning techniques to enable adaptive and real-time resource management. To this end, we propose a novel transformer-based architecture tailored for heterogeneous satellite cluster autonomous EO Mission with relational observations-actions tokenization and differential attention mechanism. Our experimental results demonstrate significant performance improvements compared to the available baselines. Moreover, the proposed architecture exhibits strong adaptability and transferability with respect to varying numbers of satellite clusters.

URL PDF HTML ☆

赞 0 踩 0

2605.31021 2026-06-01 cs.AI cs.CL cs.LG 版本更新

A Persona-Based Evaluation Framework for Pluralistic Alignment in Generative AI

基于人格的生成式AI多元对齐评估框架

Atahan Karagoz

发表机构 * Atahan Karagöz（阿塔汗·卡拉戈兹）

AI总结提出一种状态空间约束仿真框架，通过合成认知轮廓替代单一评估函数，实现反映真实世界共识变异性的多元、视角依赖的基准测试，并分析仿真评估者的稳定性问题，论证动态调节机制的必要性。

详情

AI中文摘要

当前生成式人工智能的对齐范式主要依赖单一基准测试框架，将人类判断的多元性简化为聚合统计基线，从而掩盖了评估中的文化、人口和语境变异性。我们引入一种用于AI评估的状态空间约束仿真框架，用代表不同人类视角的合成认知轮廓的结构化流形替代单一评估函数。我们表明，现代生成架构能够以高度一致性实例化和维护这些评估人格，从而实现一种更接近现实世界共识变异性的多元、视角依赖的基准测试。然而，我们进一步分析了这些模拟评估者在顺序推理和随机提示扰动下的稳定性，揭示了人格一致性的系统性退化，表现为状态空间漂移和语义不一致。这些发现表明，静态对齐约束不足以维持随时间推移的稳健评估行为。相反，我们主张必须在生成系统中嵌入动态的、可行性驱动的调节机制，以保持连贯的认知仿真。通过将基于人格的评估视为潜在表征流形上的结构化动力系统，本研究为更自适应、更符合人类、更注重语境的AI评估方法奠定了基础。

英文摘要

Current alignment paradigms for generative artificial intelligence rely predominantly on monolithic benchmarking frameworks that reduce the plurality of human judgment to aggregated statistical baselines, thereby obscuring cultural, demographic, and contextual variability in evaluation. We introduce a state-space constrained emulation framework for AI evaluation that replaces singular assessment functions with a structured manifold of synthetic cognitive profiles representing diverse human perspectives. We show that modern generative architectures can instantiate and maintain these evaluative personas with high consistency, enabling a form of pluralistic, perspective-dependent benchmarking that more closely reflects real-world consensus variability. However, we further analyze the stability of these simulated evaluators under sequential inference and stochastic prompt perturbations, revealing systematic degradation in persona coherence that manifests as state-space drift and semantic inconsistency. These findings suggest that static alignment constraints are insufficient for sustaining robust evaluative behavior over time. Instead, we argue for the necessity of embedding dynamic, viability-driven regulatory mechanisms within generative systems to preserve coherent cognitive emulation. By framing persona-based evaluation as a structured dynamical system over latent representation manifolds, this study provides a foundation for more adaptive, human-aligned, and context-sensitive approaches to AI evaluation.

URL PDF HTML ☆

赞 0 踩 0

2605.31007 2026-06-01 cs.LG cs.AI 版本更新

DEM: A Distilled Explanation Model for Interpretable Anomaly Detection in Physiological Sensor Networks

DEM：面向生理传感器网络中可解释异常检测的蒸馏解释模型

Jyotirmoy Singh, Anushka Roy, Shreea Bose, Chittaranjan Hota

发表机构 * Department of Computer Science and Information Systems（计算机科学与信息系统系）； Department of Electrical and Electronics Engineering（电气与电子工程系）

AI总结提出一种三阶段玻璃箱框架DEM，通过将梯度提升专家模型的知识蒸馏到基于线性基线残差的决策树中，实现高精度与内在可解释性的异常检测，并引入蒸馏保真度指标量化解释可信度。

Comments 21 pages, 10 figures, 7 tables. Code: https://github.com/Jyotirmoy17/dem-model

详情

AI中文摘要

无线体域网（WBANs）中生理传感器数据的异常检测可能由传感器故障、网络中断或数据缺失引起，导致误报。因此，它既需要高预测精度，也需要临床可解释的解释。现有方法要么依赖性能强但无透明度的黑盒模型，要么依赖SHAP和LIME等事后解释方法。本文提出蒸馏解释模型（DEM），一个三阶段玻璃箱框架，将梯度提升专家模型的非线性知识蒸馏到基于线性基线残差的可解释决策树中，使得解释不是近似而是预测本身。DEM引入了一种新颖的蒸馏保真度指标，量化解释树忠实捕捉专家模型非线性贡献的程度，提供了先前可解释模型所缺乏的解释可信度的原则性度量。在包括MIMIC-IV、WESAD、eICU和内部SmartNet WBAN语料库在内的四个生理数据集上评估，DEM在临床上下文异常检测上达到0.9964的AUC，在可穿戴压力检测上达到0.9047，同时以可控深度生成人类可读的if-then规则。推理每1000个样本需要0.17ms，使DEM比基于SHAP的事后解释快1235倍，适用于实时生理监测。消融研究证实，XGBoost蒸馏步骤比朴素残差拟合提供了可测量的增益，深度敏感性分析展示了DEM在现有内在可解释模型中独有的、用户可控的准确性-可解释性权衡。

英文摘要

Anomaly detection in physiological sensor data from Wireless Body Area Networks (WBANs) can be caused by sensor faults, network disruptions, or missing data, leading to false alarms. Hence, it demands both high predictive accuracy and clinically interpretable explanations. Existing approaches rely either on black-box models that achieve strong performance but offer no transparency, or on post-prediction explanation methods such as SHAP and LIME. In this paper, we propose the Distilled Explanation Model (DEM), a three-stage glass-box framework that distills the non-linear knowledge of a gradient boosting expert into an interpretable decision tree operating on residuals relative to a linear baseline, so that the explanation is not an approximation but the prediction itself. DEM introduces a novel distillation fidelity metric that quantifies how faithfully the explanation tree captures the expert model's non-linear contribution, providing a principled measure of explanation trustworthiness absent from prior interpretable models. Evaluated across four physiological datasets, including MIMIC-IV, WESAD, eICU, and an in-house SmartNet WBAN corpus, DEM achieves an AUC of 0.9964 on clinical contextual anomaly detection and 0.9047 on wearable stress detection while producing human-readable if-then rules at a controllable depth. Inference requires 0.17ms per 1000 samples, rendering DEM 1235x faster than SHAP-based post-hoc explanation and suitable for real-time physiological monitoring. Ablation studies confirm that the XGBoost distillation step provides measurable gains over naive residual fitting, and depth-sensitivity analysis demonstrates an explicit, user-controlled accuracy-interpretability trade-off unique to DEM among existing intrinsically interpretable models.

URL PDF HTML ☆

赞 0 踩 0

2605.30984 2026-06-01 cs.CV cs.AI cs.CL 版本更新

Generating Reports or Repeating Templates? Measuring and Mitigating Template Collapse in 3D CT Report Generation

生成报告还是重复模板？测量和缓解三维CT报告生成中的模板崩溃

Tom Maye-Lasserre, Yitong Li, Bailiang Jian, Morteza Ghahremani, Benedikt Wiestler, Christian Wachinger

发表机构 * Technical University of Munich (TUM)（慕尼黑技术大学）； TUM Hospital（TUM医院）； Munich Center for Machine Learning (MCML)（慕尼黑机器学习中心）

AI总结针对三维CT报告生成中模型输出多样性低、病理检测能力差的模板崩溃问题，提出解耦框架CLarGen，通过分离临床检测与语言合成，显著提升临床准确性并保持报告流畅性。

详情

AI中文摘要

现代三维医学视觉语言模型（VLM）能够生成流畅的放射学风格文本，但表现出极低的病理检测率和输出多样性，崩溃为低估罕见但关键发现的通用模板。我们将这种失败模式识别为模板崩溃。这种失败源于三维医学成像的独特限制，例如数据有限、标签严重不平衡以及体积编码器的弱信号。在这些限制下，文本生成目标鼓励捷径学习和流畅但基础薄弱的报告。我们通过临床保真度、输出多样性、正常模板偏差和罕见发现存活率系统性地诊断模板崩溃。为了缓解它，我们提出CLarGen，一个解耦框架，将说什么（临床检测）与怎么说（语言合成）分开。CLarGen使用（i）用于多标签病理检测的潜在查询变换器，（ii）用于临床匹配示例的病理引导检索，以及（iii）用于从检测到的发现和检索到的上下文中合成最终报告的医学语言模型。在最新的三维CT报告生成基线中，CLarGen缓解了模板崩溃，并在保持流畅报告的同时显著提高了临床准确性（macro-F1 0.487 vs. 0.189；CRG 0.472 vs. 0.368）。我们的结果表明，明确、可测量的临床基础对于抗模板崩溃的三维CT报告生成至关重要。代码将在接收后发布。

英文摘要

Modern 3D medical vision-language models (VLMs) can generate fluent radiology-style text while exhibit critically low pathology detection and output diversity, collapsing to generic templates that under-report rare yet critical findings. We identify this failure mode as Template Collapse. This failure stems from the unique constraints of 3D medical imaging, e.g., limited data, severe label imbalance, and weak signals from volumetric encoders. Under these constraints, text-generation objectives encourage shortcut learning and fluent but weakly grounded reports. We systematically diagnose the Template Collapse through clinical fidelity, output diversity, normal-template bias, and rare-finding survival. To mitigate it, we propose CLarGen, a decoupled framework that separates what to say (clinical detection) from how to say it (language synthesis). CLarGen uses (i) a Latent Query Transformer for multi-label pathology detection, (ii) pathology-guided retrieval for clinically matched exemplars, and (iii) a medical language model to synthesize the final report from detected findings and retrieved context. Across state-of-the-art 3D CT report generation baselines, CLarGen mitigates Template Collapse and substantially improves clinical accuracy (macro-F1 0.487 vs. 0.189; CRG 0.472 vs. 0.368) while maintaining fluent reporting. Our results suggest that explicit, measurable clinical grounding is essential for template-collapse-resistant 3D CT report generation. Code will be released upon acceptance.

URL PDF HTML ☆

赞 0 踩 0

2605.30968 2026-06-01 cs.CV cs.AI 版本更新

Variational Adapter for Cross-modal Similarity Representation

变分适配器用于跨模态相似性表示

WenZhang Wei, Zhipeng Gui, Dehua Peng, Tiandi Ye, Huayi Wu

发表机构 * School of Remote Sensing and Information Engineering（遥感与信息工程学院）； Wuhan University（武汉大学）； School of Data Science and Engineering（数据科学与工程学院）； East China Normal University（华东师范大学）； State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing（测绘遥感信息工程国家重点实验室）

AI总结针对跨模态匹配中细粒度标注稀缺导致二元分类边界压缩和假负样本问题，提出变分适配器VACSR，将匹配任务重构为变分推断问题，通过构建潜在相似性空间和正则化缓解过拟合，在图像-文本检索、域泛化和基类到新类泛化任务上验证了有效性。

Comments Accepted by the 43rd International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

视觉-语言模型的核心在于在统一表示空间中度量跨模态相似性。然而，大多数图像-文本匹配或多类图像分类数据集缺乏细粒度的跨模态匹配标注，迫使连续的相似性空间压缩为二元分类边界。这种压缩引入了假负样本，并严重损害了跨模态任务的泛化性能。尽管先前的研究试图通过建模模态内模糊性来缓解这一问题，但往往忽略了固有的标注缺陷，导致不确定性分配次优。为了解决这些挑战，我们提出了一种变分适配器用于跨模态相似性表示（VACSR）。该方法将具有细粒度语义稀缺性的图像-文本匹配重新表述为变分推断问题。它构建了一个跨模态相似性的潜在空间，并使用正则化技术来减轻对二元标注的过拟合。在图像-文本检索、域泛化和基类到新类泛化上的实验证明了所提出方法的有效性和鲁棒的泛化能力。

英文摘要

The core of vision-language models lies in measuring cross-modal similarity within a unified representation space. However, most image-text matching or multi-class image classification datasets lack fine-grained cross-modal matching annotations, forcing the continuous similarity space into binary classification boundaries. This compression induces false negative samples and significantly impairs the generalization performance of cross-modal tasks. While prior research has attempted to mitigate this by modeling intra-modal ambiguity, it often overlooks inherent annotation flaws, leading to suboptimal uncertainty allocation. To address these challenges, we propose a Variational Adapter for Cross-modal Similarity Representation (VACSR). This approach reformulates image-text matching with fine-grained semantic scarcity as a variational inference problem. It constructs a latent space for cross-modal similarity and uses regularization techniques to mitigate overfitting to binary annotations. Experiments on image-text retrieval, domain generalization, and base-to-novel generalization demonstrate the proposed method's effectiveness and robust generalization ability.

URL PDF HTML ☆

赞 0 踩 0

2605.30966 2026-06-01 cs.IR cs.AI cs.CL 版本更新

TUX：衡量人机默契理解

Yueshen Li, Hanyi Min, Vedant Das Swain, Koustuv Saha

发表机构 * University of Illinois Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）； New York University（纽约大学）

AI总结通过光谱放置任务和TUX指数，量化人类与LLM之间的默契理解，发现人格特征影响对齐程度。

详情

AI中文摘要

随着大型语言模型（LLMs）越来越多地作为协作伙伴，人机对齐通常通过明确的任务成功、准确性或奖励优化来评估。然而，许多协作场景依赖于默契理解：即智能体能否在没有明确目标、沟通或反馈的情况下，与人类的评价立场或表征先验对齐。为了研究这种能力，我们开发了一个受社交派对游戏Wavelength启发的光谱放置任务，在该任务中，人类和智能体独立地将概念放置在主观光谱上。我们将默契理解指数（TUX）操作化为人类与智能体判断之间的成对相似性度量，并通过241名人类参与者和200个基于人格条件的LLM智能体（涵盖四种模型）进行评估。我们发现，在特质空间中最近的人-智能体对实现了显著更高的TUX，表明默契对齐是由个体层面特征而非随机相似性所结构化的。回归分析表明，随着预测变量集变得更加丰富，TUX变得更可解释，个体特质、决策风格和置信度优于聚合特质距离基线。这些发现表明，人类与LLM之间的默契理解是可测量的，同时也揭示了基于人格条件化方法在捕捉更深层表征对齐方面的局限性。

英文摘要

As large language models (LLMs) increasingly act as collaborative partners, human--AI alignment is often evaluated through explicit task success, accuracy, or reward optimization. Yet many collaborative settings depend on tacit understanding: whether an agent can align with a human's evaluative stance or representational priors without clear objectives, communication, or feedback. To study this capacity, we develop a spectrum-placement task inspired by the social party game Wavelength, in which humans and agents independently place concepts along subjective spectra. We operationalize the Tacit Understanding Index (TUX) as a pairwise measure of similarity between human and agent judgments, and evaluate it with 241 human participants and 200 profile-conditioned LLM agents across four models. We find that nearest human--agent pairs in trait space achieve significantly higher TUX, suggesting that tacit alignment is structured by person-level characteristics rather than random similarity. Regression analyses show that TUX becomes more explainable as predictor sets become richer, with individual traits, decision-making styles, and confidence improving over aggregate trait-distance baselines. These findings suggest that tacit understanding between humans and LLMs is measurable, while revealing the limits of profile-based conditioning for capturing deeper representational alignment.

URL PDF HTML ☆

赞 0 踩 0

2605.30919 2026-06-01 cs.LG cs.AI 版本更新

BlueFin: 在金融电子表格上对LLM智能体进行基准测试

Srivatsa Kundurthy, Clara Na, Colton Moraine, Anoushka Mohta, Case Winter, George Fang, John Ling, Emma Strubell, Zach Kirshner

发表机构 * Longitude Labs Inc.（Longitude Labs公司）； Cornell University（康奈尔大学）； Carnegie Mellon University（卡内基梅隆大学）

AI总结提出BlueFin基准，通过131个真实金融电子表格任务评估LLM智能体的合成、操作和理解能力，并验证了LM评判与人类专家的一致性。

Comments 26 pages

详情

AI中文摘要

我们提出BlueFin，一个基准测试，要求大语言模型（LLM）智能体在专业金融领域的电子表格工作簿上执行合成、操作和理解任务。尽管全球电子表格软件付费用户估计数亿——比全球专业开发人员估计数量高一个数量级——但投入探索和扩展LLM在电子表格领域能力的资源相对较少，而专门用于反映专业金融角色实际职业任务的资源更少。为此，我们整理了131个具有现实相关性的挑战性复杂任务，包含3225个细粒度评分标准；值得注意的是，我们的评分标准和LM评判评估由一组专家人工标注员验证，从而对难以通过编程验证但可由LM评判智能体可靠评估的复杂任务进行高质量、细粒度的评估。我们的评判与专家共识达到一致（α=0.826），宏F1得分为0.839。前沿LLM在此挑战性基准上表现不佳，最强LLM在任务上的平均得分低于50%——模型在动态正确性方面表现出特别弱点。我们的贡献包括：涵盖三类电子表格任务的示例数据集、开源工具包和智能体评估框架，以及现有前沿模型在我们基准上的性能表征。

英文摘要

We present BlueFin, a benchmark that tasks large language model (LLM) agents with synthesis, manipulation, and comprehension tasks over spreadsheet workbooks in the professional finance domain. Though estimates of the global population of paying users of spreadsheet software range in the hundreds of millions -- an order of magnitude more than the estimated global population of professional developers -- comparatively fewer resources have been devoted to exploring and expanding LLM capabilities in the spreadsheet domain, with fewer still dedicated to mirroring real occupational tasks encountered by those in professional finance roles. In response, we curate a set of 131 challenging, complex tasks with real-world relevance in the domain, containing 3,225 granular rubric criteria; notably, our rubric criteria and LM judge evaluations are validated by a team of expert human annotators, resulting in high-quality, granular evaluations of complex tasks that are difficult to verify programmatically but can be reliably evaluated by an LM judge agent. Our judge achieves parity with expert consensus ($α=0.826$) with a macro-F1 score of 0.839. Frontier LLMs demonstrate poor performance on the challenging benchmark, with the strongest LLMs achieving less than 50\% average scores across tasks -- models exhibit particular weaknesses in dynamic correctness. Our contributions include a dataset of examples across three categories of spreadsheet tasks, an open source harness and agentic evaluation framework, and a characterization of existing frontier models' performance on our benchmark.

URL PDF HTML ☆

赞 0 踩 0

2605.30903 2026-06-01 cs.LG cs.AI 版本更新

Inverse Reinforcement Learning without an Optimal Demonstrator: A Feasible Reward Set Approach

无最优演示者的逆强化学习：一种可行奖励集方法

Kihyun Kim, Shripad Deshmukh, Nikos Vlassis, Jiawei Zhang

发表机构 * MIT LIDS（麻省理工学院媒体实验室）； University of Massachusetts, Amherst（马萨诸塞大学阿姆赫斯特分校）； Adobe Research（Adobe研究院）； University of Wisconsin-Madison（威斯康星大学麦迪逊分校）

AI总结针对多个非最优演示者数据，提出可行奖励集框架，通过线性约束联合可行集单调收缩，并给出恢复保证与高维环境离线算法。

详情

AI中文摘要

逆强化学习（IRL）通常假设来自单个最优演示者的演示，但在许多应用中，数据来自多个具有异质次优性水平的非完美演示者。我们通过可行奖励集框架研究这一设置下的奖励学习：对于每个演示者，我们将其声明的次优性水平编码为线性约束，并在演示者之间对所得可行集取交集。我们的理论分析表明，随着数据的增加，联合可行集单调收缩，并且我们精确刻画了新演示者何时严格收紧该集合。我们进一步为真实最优演示者的可行奖励集建立了两个恢复保证：一个界限依赖于与最优占用度的接近程度，而另一个仅需要足够的覆盖且没有接近最优的演示者。在实际方面，我们引入了解决所得奖励集中固有奖励模糊性的策略，并提供了适用于高维环境的函数逼近离线算法。在表格型网格世界和大语言模型（LLM）微调设置中的实验与理论预测一致，并证明了所提框架相对于基线的有效性。

英文摘要

Inverse reinforcement learning (IRL) typically assumes demonstrations from a single optimal demonstrator, but in many applications data come from multiple imperfect demonstrators with heterogeneous suboptimality levels. We study reward learning in this setting through a feasible-reward-set framework: for each demonstrator, we encode its declared suboptimality level as a linear constraint and intersect the resulting feasible sets across demonstrators. Our theoretical analysis shows that the joint feasible set shrinks monotonically as data are added, and we give an exact characterization of when a new demonstrator strictly tightens it. We further establish two recovery guarantees for the feasible reward set of the ground-truth optimal demonstrator: one bound depends on closeness to the optimal occupancy, while the other requires only sufficient coverage and no near-optimal demonstrator. On the practical side, we introduce strategies to address the inherent reward ambiguity in the obtained reward set and provide an offline algorithm with function approximation for high-dimensional environments. Experiments in tabular grid-world and large language model (LLM) fine-tuning settings are consistent with the theoretical predictions and demonstrate the effectiveness of the proposed framework over baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.30900 2026-06-01 cs.AI physics.app-ph 版本更新

BilliardPhys-Bench: Benchmarking Physical Reasoning and Visual Dynamics of Multimodal LLMs

BilliardPhys-Bench: 多模态大语言模型的物理推理与视觉动力学基准测试

Ben Wang, Xiaogang Li, Ruochen Gao, Peiyao Xiao, Chengliang Xu, Zeyu Wang, Zichao Chen, Bing Zhao, Hu Wei

发表机构 * Alibaba Group（阿里巴巴集团）

AI总结提出BilliardPhys-Bench基准，通过合成台球环境评估多模态大语言模型在物理推理（碰撞、反弹、最终位置预测）上的能力，发现模型存在“静态偏差”且性能随模拟时间与场景复杂度下降。

详情

AI中文摘要

Sophrosyne: 关系数据系统的智能体探索需要适度

Madhav Jivrajani, Ramnatthan Alagappan, Aishwarya Ganesan

发表机构 * University of Illinois Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）

AI总结针对LLM驱动的Text2SQL智能体在探索数据系统时过度探索的问题，提出Sophrosyne环境，通过增强API响应中的指令来引导探索，减少过度探索并提升SQL生成准确性。

详情

AI中文摘要

由LLM驱动的Text2SQL智能体通过工具调用探索数据系统，将自然语言意图转化为SQL。然而，为了确保安全且受限的访问，数据系统构建了具有显式API表面的环境。我们研究并分类了当前暴露的API，将其分为粗粒度或细粒度，并认为在这两者之间进行选择会带来成本效益探索与准确SQL生成之间的基本权衡。大多数数据系统暴露细粒度API，但这无意中使智能体处于劣势：它们过度探索，将不相关的模式元素纳入查询公式中，并产生不准确的结果。我们认为，抑制过度探索是有效利用这些API表面的关键，并提出了Sophrosyne，一种数据系统环境，它通过增强API响应中的指令来引导智能体的探索过程。初步结果显示，指令将过度探索减少了4.6倍，并将准确率提高了高达12.4%（约4个百分点）。

英文摘要

Text2SQL agents powered by LLMs translate natural language intent into SQL by exploring the data system through tool calls before formulating the query. However, to ensure secure and scoped access, data systems construct environments with explicit API surfaces. We study and categorize these APIs exposed today as either coarse-grained or fine-grained and posit that choosing between them presents a fundamental tradeoff between cost-efficient exploration and accurate SQL generation. Most data systems expose fine-grained APIs, but this inadvertently disadvantages agents: they over-explore, incorporating irrelevant schema elements into their query formulation and produce inaccurate results. We argue that curbing over-exploration is key to the effective use of these API surfaces, and propose Sophrosyne, a data system environment that augments API responses with directives that guide the agent's exploration process. Initial results show that directives reduce over-exploration by 4.6x and boost accuracy by up to 12.4% (approx. 4 percentage points).

URL PDF HTML ☆

赞 0 踩 0

2605.30861 2026-06-01 cs.AI 版本更新

微调提升语言模型中的信息传递

Yuwei Cheng, Weiyi Tian, Haifeng Xu

发表机构 * Department of Statistics（统计学系）； University of Chicago（芝加哥大学）； Department of Data Science（数据科学系）； Department of Computer Science（计算机科学系）

AI总结提出冠层熵（Canopy Entropy）度量，从树结构视角量化生成空间的有效大小，发现微调模型在总熵降低时仍能增强长度-熵率正相关，从而更高效地将不确定性转化为语义多样性。

详情

AI中文摘要

扩散模型中的遗忘学习：基于KL散度和似然约束的统一框架

Shervin Khalafi, Alejandro Ribeiro, Dongsheng Ding

发表机构 * University of Pennsylvania（宾夕法尼亚大学）； University of Tennessee, Knoxville（田纳西大学，基洛纳）

AI总结提出一个约束优化框架，通过最小化与预训练模型的偏差并施加与遗忘分布的分离约束，实现扩散模型中的概念和数据遗忘，并基于KL散度和似然约束推导最优解及原始-对偶算法。

Comments 27 pages, 6 figures, 4 tables; Accepted by ICML 2026

详情

AI中文摘要

扩散模型中的遗忘学习旨在移除不需要的数据或概念，同时保留预训练模型的效用——这两个目标本质上相互冲突。我们提出了一个原则性的约束优化框架，将遗忘学习形式化为在满足与遗忘分布的显式分离约束下，最小化与预训练模型的偏差。具体地，我们基于反向和正向KL散度以及似然约束，构建了三个约束优化问题。前两个问题泛化了现有的概念和数据遗忘方法，而第三个问题为遗忘学习提供了一种新颖且自然的表述。尽管KL约束非凸，我们证明了所有三个问题的强对偶性，从而能够显式地表征其最优解作为遗忘目标，并为每个公式开发原始-对偶算法。实验结果表明，与基于权重的基线方法相比，我们的KL约束方法在概念和数据遗忘中实现了更优的保留-遗忘权衡，而基于似然的方法在匹配遗忘效果的同时，更好地保留了保留概念。

英文摘要

Unlearning in diffusion models aims to remove undesirable data or concepts while preserving the utility of pretrained models -- two fundamentally conflicting objectives. We propose a principled constrained optimization framework that formulates unlearning as minimizing the deviation from a pretrained model, subject to explicit separation constraints from the unlearning distributions. Specifically, we formulate three constrained optimization problems based on reverse and forward KL divergences, and likelihood constraints. The first two generalize existing approaches for concept and data unlearning, while the third offers a novel and natural formulation for unlearning. Despite the nonconvexity of the KL constraints, we establish strong duality for all three problems, enabling us to explicitly characterize their optimal solutions as unlearning targets and develop primal-dual algorithms for each formulation. Experimental results demonstrate that our KL-constrained approach achieves superior retention-unlearning tradeoffs compared to weight-based baselines for concept and data unlearning, and that our likelihood-based approach matches unlearning effectiveness while better preserving retained concepts compared to baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.30824 2026-06-01 cs.AI 版本更新

XLGoBench: 用算法任务检测跨语言技能差距

Purvam Jain, Preethi Jyothi, Vihari Piratla, Suvrat Raju

发表机构 * Google DeepMind（谷歌深Mind）； Indian Institute of Technology Bombay（印度理工学院孟买分校）； International Centre for Theoretical Sciences, Tata Institute of Fundamental Research（理论科学国际中心， Tata 基础研究机构）

AI总结提出一套合成算法任务基准，通过跨语言执行相同任务来检测大语言模型的跨语言能力差距，实验揭示多个先进模型存在持续差距。

Comments 8+37pages

2605.30785 2026-06-01 cs.AI 版本更新

Learning Agent-Compatible Context Management for Long-Horizon Tasks

面向长时任务的学习智能体兼容上下文管理

Lu Yi, Runlin Lei, Liuyi Yao, Yuexiang Xie, Yuyang Li, Wenhao Zhang, Zhewei Wei, Yaliang Li, Jian-Yun Nie

发表机构 * Renmin University of China（中国人民大学）； Tongyi Lab, Alibaba Group（阿里云实验室）； Beijing University of Posts and Telecommunications（北京邮电大学）； Université de Montréal（蒙特利尔大学）

AI总结提出AdaCoM方法，通过外部LLM对冻结智能体进行端到端强化学习上下文管理，在长时任务中提升性能并揭示保真度-可靠性权衡。

详情

AI中文摘要

LLM智能体在现实应用中越来越多地面临长时任务，如网络搜索和深度研究，累积的上下文可能导致长上下文退化和推理失败。先前的工作通过智能体端上下文控制或固定策略（如摘要）来缓解这一问题，这需要训练智能体本身进行适应——这使得它对于闭源智能体不切实际，并且忽略了不同智能体可能需要不同策略。我们引入了自适应上下文管理（AdaCoM），它训练一个外部LLM通过灵活的修改动作和端到端强化学习来管理冻结智能体的上下文。在多种智能体上进行的网络搜索和深度研究基准测试中，AdaCoM通过保留任务约束和进展同时修剪过时内容，显著提升了性能。学习到的策略揭示了保真度-可靠性权衡：具有更高原始ReAct性能的智能体受益于更高保真度的上下文保留，而性能较低的智能体则需要更激进的压缩以保持在可靠的推理范围内。迁移实验表明，AdaCoM在能力相似（以原始ReAct性能衡量）的智能体之间最有效地泛化，这为智能体系统的可复用上下文管理器提供了一条实用路径。

英文摘要

LLM agents increasingly face long-horizon tasks such as web search and deep research in real-world applications, where accumulated context can cause long-context degradation and reasoning failures. Prior work mitigates this through context management with agent-side context control or fixed strategies such as summarization, which require training the agent itself for adaptation - making it impractical for closed-source agents and ignoring that different agents may require different strategies. We introduce Adaptive Context Management (AdaCoM), which trains an external LLM to manage the context of a frozen agent through flexible modification actions and end-to-end reinforcement learning. Across diverse agents on web search and deep research benchmarks, AdaCoM substantially improves performance by preserving task constraints and progress while pruning stale content. The learned strategies reveal a Fidelity-Reliability Trade-off: agents with higher vanilla ReAct performance benefit from higher-fidelity context preservation, whereas lower-performing agents require more aggressive compression to stay within a reliable reasoning regime. Transfer experiments show that AdaCoM generalizes most effectively across agents with similar capability (measured by vanilla ReAct performance), suggesting a practical path toward reusable context managers for agent systems.

URL PDF HTML ☆

赞 0 踩 0

2605.30740 2026-06-01 cs.RO cs.AI 版本更新

GSAM: A Generalizable and Safe Robotic Framework for Articulated Object Manipulation

GSAM: 一种通用且安全的铰接物体操作机器人框架

Beichen Shao, Mengying Xie, Heng Su, Wanyi Zhang, Mingyan Li, Yan Ding, Fausto Giunchiglia, Chao Chen

发表机构 * College of Computer Science, Chongqing University, Chongqing, China（重庆大学计算机学院）； Lumos Robotics, China（Lumos机器人中国）； Xi'an Jiaotong-Liverpool University, China（西安交通大学利物浦大学）； Fudan University, China（复旦大学）； Department of Information Engineering and Computer Science, University of Trento, Trento, Italy（特伦托大学信息工程与计算机科学系）

AI总结提出GSAM框架，通过视觉感知器生成运动学参数、基于VLM的细调器进行常识推理修正、交互约束函数生成器集成障碍物避免知识，并由运动学感知规划器验证轨迹可达性，在50个铰链任务上相比最佳基线将标准差降低3.1%、操作成功率提升36.0%。

Comments Accepted by the 19th International Conference on Parallel Problem Solving from Nature (PPSN 2026)

详情

AI中文摘要

铰接物体操作对服务机器人是一个独特的挑战。现有方法采用端到端策略学习、视觉运动规划以及大语言/视觉语言模型（LLM/VLM），但往往忽视了铰接物体的多样性和末端执行器与手柄之间交互的复杂性，导致泛化能力有限和破坏性碰撞。为了解决这一问题，我们提出了GSAM，一个通用且安全的铰接物体操作机器人框架。具体来说，一个基于视觉的感知器生成运动学参数。考虑到感知器中预训练标记产生的原始估计可能偏离常识，我们提出了一个基于VLM的细调器，利用链式思维（COT）常识推理来细化感知。为了防止破坏性碰撞，我们设计了一个交互约束函数生成器，将铰接物体、交互姿态和障碍物避免知识集成到一个基中。然后LLM将这些约束函数化，并将其应用于轨迹和姿态规划。一个运动学感知的操作规划器验证轨迹和姿态的可达性。在5个物体类别的50个铰链任务和50个随机初始化的末端执行器-手柄配置上的实验表明，与最佳基线相比，GSAM将标准差降低了3.1%，操作成功率提高了36.0%，分别展示了GSAM在实际场景中优越的物体泛化能力和交互安全性。

英文摘要

Articulated object manipulation is a unique challenge for service robots. Existing methods employ end-to-end policy learning, visionmotion planning, and large-language/visual-language model (LLM/VLM), but often overlook the diversity of articulated objects and the complexity of interactions between end-effector and handle, leading to limited generalization and destructive collisions. To address this, we propose GSAM, a generalizable and safe robotic framework for articulated object manipulation. Specifically, a vision-based perceiver generates the kinematic parameters. Considering that pre-trained markers in perceiver yield raw estimations that may deviate from commonsense, we present a f ine-tuned VLM-based refiner, using chain-of-thought (COT) commonsense reasoning to refine perception. To prevent destructive collisions, we design an interaction constraint function generator, integrating articulated object, interaction pose, and obstacle avoidance knowledge into a base. LLM then functionalize these constraints and apply them to trajectory and posture planning. A kinematic-aware manipulation planner verifies reachability for trajectory and posture. Experiments on 50 hinge tasks across 5 object categories and 50 randomly initialized end-effectorhandle configurations show that GSAM reduces standard deviation by 3.1% and improves manipulation success rate by 36.0% compared to the best baseline, respectively demonstrating the superior object generalization and interaction safety of GSAM in practical scenarios.

URL PDF HTML ☆

赞 0 踩 0

2605.30738 2026-06-01 cs.AI 版本更新

何时LLMs足以作为序列RL任务的策略优化器？

Stephane Hatgis-Kessell, Emma Brunskill

发表机构 * Department of Computer Science, Stanford University（计算机科学系，斯坦福大学）

AI总结提出PromptPO方法，利用LLM通过Python描述状态空间、动作空间和奖励函数，基于rollout反馈迭代生成和优化可执行策略，在多种环境中匹配或超越标准RL基线，但在细粒度连续控制任务中表现不足。

详情

AI中文摘要

我们研究大型语言模型（LLMs）何时可以作为强化学习（RL）任务的有效黑盒策略优化器，即何时可以用LLM替代经典RL算法？我们通过引入提示策略优化（PromptPO）来探索这个问题，这是一种迭代方法，它用状态空间、动作空间和奖励函数的Python描述提示LLM，然后让LLM根据rollout反馈生成并优化可执行策略。在硬探索环境、Meta-World机器人任务以及几个现实世界控制问题中，PromptPO通常匹配或超过标准RL基线的性能，同时使用显著更少的环境交互。为了最大化期望回报，且无需进一步显式提示，PromptPO输出的策略范围从调谐的比例控制器或基于规则的规划到运行值迭代等规划算法的策略。我们的结果表明，当LLM能够利用关于环境或优化策略的先验知识时，基于LLM的策略优化是足够的。PromptPO在MuJoCo领域中的表现不如标准RL基线，这展示了基于LLM的策略优化在需要细粒度连续控制的设置中可能存在的局限性。

英文摘要

We study when large language models (LLMs) can serve as effective black-box policy optimizers for reinforcement learning (RL) tasks, i.e., when can we replace classical RL algorithms with an LLM? We explore this question by introducing Prompted Policy Optimization (PromptPO), an iterative method that prompts an LLM with Python descriptions of the state space, action space, and reward function, then has it generate and refine executable policies based on rollout feedback. Across hard exploration environments, Meta-World robotics tasks, and several real-world control problems, PromptPO often matches or exceeds the performance of standard RL baselines while using substantially fewer environment interactions. To maximize expected return, and without further explicit prompting, the policies PromptPO outputs range from tuned proportional controllers or rule-based plans to policies that run planning algorithms like value iteration. Our results demonstrate that LLM-based policy optimization is sufficient when the LLM can leverage prior knowledge about the environment or optimization strategy. PromptPO underperforms standard RL baselines in MuJoCo domains. This demonstrates possible limitations of LLM-based policy optimization to settings that requiring fine-grained continuous control.

URL PDF HTML ☆

赞 0 踩 0

2605.30716 2026-06-01 cs.CV cs.AI 版本更新

Simple Token-Efficient Vision-Language Model for Case-level Pathology Synoptic Report Generation

用于病例级病理学概要报告生成的简单令牌高效视觉语言模型

Zhiyuan Yang, Jiahao Cheng, Vincent Quoc-Huy Trinh, Mahdi S. Hosseini

发表机构 * Department of Computer Science and Software Engineering (CSSE), Concordia University, Montreal, Canada（计算机科学与软件工程系（CSSE），康科迪亚大学，蒙特利尔，加拿大）； Axe Cancer, Centre de recherche du CHUM, Université de Montréal, Montreal, Canada（Axe癌症，CHUM研究中心，蒙特利尔大学，蒙特利尔，加拿大）； Institut de recherche en immunologie et cancérologie (IRIC), Université de Montréal（免疫学与癌症研究所（IRIC），蒙特利尔大学）； Mila - Quebec AI Institute, Montreal, Canada（魁北克AI研究所（Mila），蒙特利尔，加拿大）

AI总结提出一种简单令牌高效的视觉语言模型，通过5倍放大率的512×512补丁和两阶段监督训练，在有限GPU内存下实现病例级多WSI病理报告生成，显著降低序列长度并提升效率。

Comments Accepted by the DeLTA 2026 conference

详情

AI中文摘要

从全切片图像（WSI）生成临床有用的病理报告具有挑战性，原因在于十亿像素分辨率、长视觉令牌序列以及病例级推理的复杂性（单个病例可能包含多个具有异质性组织和模糊发现的WSI）。我们提出了一种简单的令牌高效视觉语言模型，用于病例级概要报告生成，在受限GPU内存下保持实用性。我们的架构遵循最小的三组件设计：冻结的病理补丁编码器、轻量级两层MLP视觉语言对齐器和大语言模型解码器，并带有显式的WSI标记令牌以分隔病例内的切片。训练分两个监督阶段进行：（1）仅对齐器的WSI字幕生成，使用异质WSI-文本对；（2）病例级监督微调，基于病例-报告对进行结构化报告生成。为了减少序列长度，我们使用5倍放大率下的$512 \times 512$补丁表示每个切片，与常用的20倍补丁相比，平均序列长度减少高达64倍。结合高效训练技术，我们仅用半块NVIDIA H100 GPU即可实现实际训练。在两个训练阶段中，我们的方法在ROUGE-L/METEOR/BLEU-4上取得了高分，同时在内存和运行时间上显著更高效。在基于AI的评估中，我们的模型始终优于强基线。大量消融实验表征了性能-效率权衡，并确定了在多WSI设置中提高鲁棒性的简单选择。总体而言，这项工作为高效病理报告生成提供了一个强大且可复现的基线，降低了在有限计算资源下进行多WSI VLM研究的门槛。

英文摘要

Generating clinically useful pathology reports for pathology cases from whole-slide images (WSIs) is challenging due to gigapixel resolution, long visual-token sequences, and the complexity of case-level reasoning, where a single case may contain multiple WSIs with heterogeneous tissues and ambiguous findings. We present a simple token-efficient vision--language model for case-level synoptic report generation that remains practical under constrained GPU memory. Our architecture follows a minimal three-component design: a frozen pathology patch encoder, a lightweight two-layer MLP vision-language aligner, and a large language model decoder, with an explicit WSI marker token to separate slides within a case. Training proceeds in two supervised stages: (1) aligner-only WSI captioning using heterogeneous WSI-text pairs, and (2) case-level supervised fine-tuning on case-report pairs for structured report generation. To reduce sequence length, we represent each slide using $512 \times 512$ patches at $5\times$ magnification, which reduces the average sequence length by up to $64\times$ times compared to the commonly used $20\times$ patches. Combined with efficient training techniques, we enable practical training with only half a NVIDIA H100 GPU. Across both training stages, our approach achieves high ROUGE-L/METEOR/BLEU-4 scores while being substantially more efficient in memory and runtime. In AI-based evaluations, our model is consistently preferred over strong baselines. Extensive ablations characterize performance-efficiency trade-offs and identify simple choices that improve robustness in multi-WSI settings. Overall, this work provides a strong, reproducible baseline for efficient pathology report generation, lowering the barrier to multi-WSI VLM research under limited compute.

URL PDF HTML ☆

赞 0 踩 0

2605.30711 2026-06-01 cs.CL cs.AI cs.LG stat.ML 版本更新

SAGE: A Novelty Gate for Efficient Memory Evolution in Agentic LLMs

SAGE: 一种用于智能体大语言模型中高效记忆演化的新颖门控机制

Sijia Wang, Dhanajit Brahma, Ricardo Henao

发表机构 * Duke University（杜克大学）

AI总结提出SAGE门控机制，基于von Mises-Fisher密度估计和自适应阈值，将记忆写入控制建模为新奇性检测问题，在LoCoMo上以更低成本实现最优token-F1。

详情

AI中文摘要

智能体大语言模型必须持续决定新提取的事实是应添加、与现有记忆合并还是忽略，然而先前的工作更侧重于检索和存储，而非原则性的写入端控制。我们将记忆演化视为一个新颖性检测问题，并提出SAGE（Spherical Adaptive Gate for memory Evolution），一种用于记忆演化的球形自适应门控机制，它通过基于von Mises-Fisher的密度估计器对记忆嵌入上的候选事实进行评分，并使用跟踪记忆存储几何结构的自适应阈值对其进行路由。SAGE将明确新颖的事实解析为ADD，明确冗余的事实解析为NOOP，仅将不确定的情况发送给LLM合并步骤，从而减少了昂贵的写入时推理。在LoCoMo上，SAGE在所有七个开放权重骨干对比中均实现了对Mem0的最佳平均token-F1，而在GPT-4o-mini上，它将添加阶段的API成本降低了3.4倍，添加阶段延迟降低了2.5倍，且平均评判分数差距很小。作为A-Mem的即插即用二进制门控，SAGE在五个模型上跳过了大约16-18%的LLM调用，且在开放权重骨干上质量变化极小。这些结果表明，新颖性感知的写入控制是提高长期智能体记忆中记忆质量和系统效率的实用杠杆。

英文摘要

Agentic LLMs must continuously decide whether newly extracted facts should be added, merged with existing memories, or ignored, yet prior work has focused more on retrieval and storage than on principled write-side control. We frame memory evolution as a novelty-detection problem and propose SAGE, a Spherical Adaptive Gate for memory Evolution that scores candidate facts with a von Mises-Fisher-based density estimator over memory embeddings and routes them with an adaptive threshold that tracks memory-store geometry. SAGE resolves clearly novel facts as ADD, clearly redundant facts as NOOP, and sends only uncertain cases to an LLM merge step, reducing expensive write-time reasoning. On LoCoMo, SAGE achieves the best average token-F1 against Mem0 on all seven open-weight backbone comparisons, while on GPT-4o-mini it reduces add-phase API cost by 3.4$\times$ and add-phase latency by 2.5$\times$ with only a small average judge-score gap. As a drop-in binary gate for A-Mem, SAGE skips roughly 16-18% of LLM calls across five models with minimal quality change on open-weight backbones. These results suggest that novelty-aware write control is a practical lever for improving both memory quality and system efficiency in long-term agentic memory.

URL PDF HTML ☆

赞 0 踩 0

2605.30698 2026-06-01 cs.CV cs.AI cs.MA 版本更新

早期采用者如何在全球范围内使用生成式AI：按国家收入和语言的差异

Madeleine I. G. Daepp, Isaac Slaughter

发表机构 * Microsoft AI Economy Institute（微软人工智能经济研究所）

AI总结基于大规模匿名化AI聊天机器人交互数据，实证分析了不同国家早期采用者在使用生成式AI上的差异，发现教育用途在低收入国家更普遍，休闲用途与收入正相关，且英语交互在非英语主导国家中过度代表，表明语言性能改进可能影响数字鸿沟或跨越式发展。

2605.30680 2026-06-01 cs.AI cs.MA 版本更新

Healthcare Mechanisms from Policy-as-Code Search under Strategic Provider Response

战略提供者响应下的策略即代码搜索中的医疗机制

Zihan Wang, Xiang Xu, Hongyuan Zha, Wenhao Li

发表机构 * The Chinese University of Hong Kong, Shenzhen（香港中文大学（深圳））； Tongji University（同济大学）

AI总结将医疗机制设计转化为语言模型的程序合成，通过多智能体模拟器Medi-Sim评估策略提供者响应下的均衡，并利用LLM引导的进化代码搜索合成可检查的混合目标程序。

Comments 32 pages, 18 figures, 4 tables

详情

AI中文摘要

医疗机制与它们所引发的战略提供者响应密不可分：现有的医疗AI基准固定了这种响应，因此无法通过它们产生的均衡来评估机制。我们将医院机制设计重新定义为语言模型的程序合成：类型化、可检查的规则程序由Medi-Sim执行和评分，Medi-Sim是一个具有五个战略提供者渠道（编码、选择、延迟、努力、分诊）的多智能体模拟器。激励扫描恢复了经典的健康经济学发现作为相邻制度——在利润压力下的过度编码和低复杂度患者选择，以及古德哈特式漂移，其中测量绩效与真实结果呈负相关——而单个审计杠杆暴露了压力迁移：关闭编码渠道使低复杂度选择增加一倍以上。LLM引导的进化代码搜索在相同的规则程序空间上合成一个可检查的混合目标程序，该程序消除了过度编码，将拒绝率减半，并保留了大部分以利润为导向的基线的资金。

英文摘要

Healthcare mechanisms are inseparable from the strategic provider response they induce: existing healthcare AI benchmarks hold this response fixed and so cannot evaluate mechanisms by the equilibrium they produce. We recast hospital mechanism design as program synthesis for language models: typed, inspectable rule programs are executed and scored by Medi-Sim, a multi-agent simulator with five strategic provider channels (coding, selection, delay, effort, triage). An incentive sweep recovers classical health-economics findings as adjacent regimes -- up-coding and low-complexity-patient selection under profit pressure, and Goodhart-style drift where measured performance becomes anti-correlated with true outcomes -- and a single audit lever exposes pressure migration: closing the coding channel more than doubles low-complexity selection. LLM-guided evolutionary code search over the same rule-program space then synthesizes an inspectable mixed-objective program that eliminates up-coding, halves rejection, and retains most of the profit-oriented baseline's funds.

URL PDF HTML ☆

赞 0 踩 0

2605.30677 2026-06-01 cs.CR cs.AI cs.SE 版本更新

Investigating Detection and Obfuscation of Prompt Injection Attacks Against Software Reverse Engineering AI Agents

针对软件逆向工程AI代理的提示注入攻击的检测与混淆研究

Brian Crawford, Patrick McClure

发表机构 * Dept. of Computer Science（计算机科学系）； Naval Postgraduate School（海军学院）

AI总结本研究针对软件逆向工程AI代理面临的提示注入攻击，提出了检测反编译器输出中提示注入字符串的防御策略，并探索了攻击混淆及相应防御方法。

2605.30675 2026-06-01 cs.CL cs.AI 版本更新

Human-Alignment, Calibration, and Activation Patterns in Large Language Model Uncertainty

大型语言模型不确定性中的人类对齐、校准与激活模式

Kyle Moore, Jesse Roberts, Daryl Watson, William Ward, Grayson Heyboer

发表机构 * Vanderbilt University（范德比大学）； Tennessee Technological University（田纳西技术大学）

AI总结研究大型语言模型的不确定性与人类不确定性的相似性，通过分析行为与内部激活模式，发现模型在多项选择和开放式事实回忆数据集上同时存在对齐与校准，并描述了指令微调的影响。

2605.30668 2026-06-01 cs.CL cs.AI 版本更新

CobSeg: Coherence Boundary Modeling for Dialogue Topic Segmentation

CobSeg: 对话主题分割的连贯性边界建模

Sijin Sun, Liangbin Zhao, Jiaxiang Cai, Ming Deng, Mingyu Luo, Xiuju Fu

发表机构 * Institute of High Performance Computing, Agency for Science, Technology and Technology（高性能计算研究所，科技局）； Shanghai Univeristy（上海大学）； Fudan University（复旦大学）

AI总结提出CobSeg多分支架构，通过分离连贯性语义与词汇边界转换并利用边界信息加权和主题连贯性线索，在无需LLM调用下提升对话主题分割性能。

Comments 8 pages with appindx. Under review

详情

AI中文摘要

对话主题分割在许多人类-AI协作应用中至关重要，需要识别异质边界线索，包括话语边缘附近的词汇转换和跨话语的语义不连续性。现有的话语模型常常稀释这些局部词汇信号。我们提出CobSeg，一种新颖的多分支架构，它将连贯性层面的语义连续性与词汇边界转换分离，并通过方向性边界预测恢复两者。CobSeg进一步使用边界信息加权来强调高效用的话语位置，并融合了基于语料库的主题连贯性线索与学习到的组合权重。尽管CobSeg在有监督的金标准边界训练和自动诱导边界的伪标签设置下作为紧凑的可训练分割器进行评估，它在推理过程中无需LLM调用即可实现增强的边界预测。在五个基准测试中，它改进了$P_k$和$W_d$，特别是在局部词汇线索显著时：在金标准监督下，它在VHF上将$P_k$降低了0.7个点，$W_d$降低了0.6个点，并在DialSeg711上达到了$P_k$为1.0；在诱导边界下，它在VHF上将$P_k$降低了14.8个点，在DialSeg711上降低了1.5个点，在TIAGE上降低了1.1个点，优于先前的非LLM方法。

英文摘要

Dialogue topic segmentation is critical in many human-AI collaborative applications which requires identifying heterogeneous boundary cues, including lexical transitions near utterance edges and semantic discontinuities across utterances. Existing utterance models often dilute these local lexical signals. We propose CobSeg, a novel multi-branch architecture that separates coherence-level semantic continuity from lexical boundary transitions and recovers both through directional boundary prediction. CobSeg further uses boundary informativeness weighting to emphasize high-utility utterance positions, and incorporates a corpus-derived topic coherence cue with learned combination weights. While CobSeg is evaluated as a compact trainable segmenter under supervised gold-boundary training and a pseudo-label setting with automatically induced boundaries, it performs enhanced boundary prediction without LLM calls during inference. Across five benchmarks, it improves $P_k$ and $W_d$ particularly when local lexical cues are prominent: under gold supervision, it reduces $P_k$ by 0.7 points and $W_d$ by 0.6 points on VHF, and reaches $P_k$ of 1.0 on DialSeg711; with induced boundaries, it reduces $P_k$ by 14.8 points on VHF, by 1.5 points on DialSeg711, and by 1.1 points on TIAGE, outperforming prior non-LLM approaches.

URL PDF HTML ☆

赞 0 踩 0

2605.30667 2026-06-01 cs.CR cs.AI 版本更新

Automatically Attacking Software Reverse Engineering AI Agents

自动攻击软件逆向工程AI代理

Brian Crawford, Justin Phillips, Patrick McClure

发表机构 * Naval Postgraduate School（海军学院）

AI总结提出基于遗传算法的对抗性提示生成技术（AutoDAN变体），通过注入无关字符串变量欺骗LLM驱动的反汇编与反编译系统，导致其错误分析二进制可执行文件。

详情

AI中文摘要

用于逆向工程可执行二进制文件的软件工具（如Ghidra）使恶意软件分析师能够在无法访问原始源代码的情况下安全地进行稳健的静态分析。结合大型语言模型（LLM）的分析能力，配备工具（如GhidraMCP）的代理系统可以自动化先前由人工驱动的过程。尽管这种自动化可以提高单个恶意软件分析师的生产力，但它也为恶意软件混淆引入了新的漏洞领域。本文提出了一种对抗性技术，使用基于遗传算法的提示生成（一种称为AutoDAN的对抗性攻击的变体），以证明能够欺骗基于LLM的反汇编和反编译系统，使其错误解释二进制可执行文件，从而有效破坏其分析输出。这种概念验证方法利用了LLM处理和解译反编译机器代码时的固有漏洞，通过使用无关字符串变量赋值向LLM传递隐蔽指令，同时不影响可执行文件的功能。我们通过几个简洁的例子展示了这种能力。这种方法可能使攻击者能够绕过依赖LLM驱动分析管道的自动化检测系统。通过研究和理解这种攻击，可以获得关于将LLM集成到网络安全工具链中的安全影响以及构建更稳健的代理代码分析系统的见解。

英文摘要

Software tools for reverse engineering executable binary files, such as Ghidra, enable malware analysts to safely conduct robust static analysis without having access to original source code. Coupled with the analytic power of large language models (LLM), agentic systems enabled with tools, such as GhidraMCP, can allow analysts to automate a previously human driven process. Although this automation can increase the productivity of a single malware analyst, it also introduces a new area of vulnerability for malware obfuscation. This paper presents an adversarial technique using genetic algorithm-based prompt generation, a modification of an adversarial attack known as AutoDAN, to demonstrate the ability to deceive LLM-powered disassembly and decompilation systems into misinterpreting binary executables, effectively corrupting their analytical output. This proof-of-concept methodology exploits inherent vulnerabilities in how LLMs process and interpret decompiled machine code via prompt injection by using extraneous string variable assignments to pass surreptitious instructions to the LLM while not impacting the functionality of the executable file. We demonstrate this capability through several concise examples. This approach could enable attackers to bypass automated detection systems that rely on LLM-driven analysis pipelines. By studying and understanding this attack, insights can be gained regarding the security implication of integrating LLMs into cybersecurity toolchains and building more robust agentic code analysis systems.

URL PDF HTML ☆

赞 0 踩 0

2605.30664 2026-06-01 cs.AI 版本更新

Structure-Induced Information for Rerooting Levin Tree Search

结构信息用于重定根莱文树搜索

Jake Tuero, Michael Buro, Laurent Orseau, Levi H. S. Lelis

发表机构 * Department of Computing Science, University of Alberta, Edmonton, Canada. ； Alberta Machine Intelligence Institute (Amii), Edmonton, Canada. ； Google DeepMind, London, United Kingdom.

AI总结提出三种重定根器设计，利用结构信息隐式分解子目标，提升策略树搜索的可扩展性和效率。

Comments ICML 2026

详情

AI中文摘要

基于子目标的策略树搜索利用策略引导搜索，对于复杂的单智能体确定性问题是有效的，但通常依赖于显式的子目标生成，这会带来大量开销并阻碍可扩展性。在本文中，我们通过最近引入的$\sqrt{\text{LTS}}$算法使用学习到的“重定根器”来克服这些限制。重定根器隐式地将问题分解为软子任务。虽然先前的工作侧重于给定或手工制作的重定根器的形式保证，但在本文中，我们提出了三种重定根器设计：(i) 基于聚类的重定根器，利用全局状态空间结构；(ii) 基于启发式的重定根器，利用学习的代价估计；(iii) 结合两种信号的混合重定根器。我们的框架避免了显式重构和推理生成的子目标，从而能够以显著降低的计算开销实现可扩展的搜索努力分配。实验上，我们的基于重定根的方法在基于子目标的策略树搜索失败的复杂环境中也能扩展，并在测试的领域上实现了最先进的在线训练效率。

英文摘要

Subgoal-based policy tree search, which uses a policy to guide search, is effective for complex single-agent deterministic problems but often relies on explicit subgoal generation that can incur substantial overhead and hinders scalability. In this paper, we overcome these limitations by using a learned ``rerooter'' through the recently-introduced $\sqrt{\text{LTS}}$ algorithm. A rerooter implicitly decomposes the problem into soft subtasks. While previous work focused on the formal guarantees for given or handcrafted rerooters, in this work we propose three rerooter designs: (i) a clustering-based rerooter that exploits global state-space structure, (ii) a heuristic-based rerooter that leverages learned cost-to-go estimates, and (iii) a hybrid that combines both signals. Our framework avoids having to explicitly reconstruct and reason over generated subgoals, thereby enabling scalable allocation of search effort with significantly lower computational overhead. Empirically, our rerooting-based methods scale to complex environments where subgoal-based policy tree search fails, and achieve state-of-the-art online training efficiency on the domains tested.

URL PDF HTML ☆

赞 0 踩 0

2605.30654 2026-06-01 cs.CL cs.AI cs.HC 版本更新

分数广播与去相关：基于广播的信用分配通用框架

Mustafa Uzun, Mete Erdogan, Cengiz Pehlevan, Alper T. Erdogan

发表机构 * KUIS AI Center, Koc University, Turkey（科克大学KUIS人工智能中心，土耳其）； Electrical and Electronics Engineering, Koc University, Turkey（科克大学电子与电气工程系，土耳其）； Department of Electrical Engineering, Stanford University, USA（斯坦福大学电气工程系，美国）； John A. Paulson School of Engineering & Applied Sciences, Harvard University, USA（哈佛大学约翰·A·保罗森工程与应用科学学院，美国）； Kempner Institute, Harvard University, USA（哈佛大学凯姆纳研究所，美国）； Center for Brain Science, Harvard University, USA（哈佛大学脑科学中心，美国）

AI总结提出分数广播与去相关（SBD）框架，通过输出分数与隐藏层激活的正交性原理，统一了多种可微损失函数下的广播式信用分配，并理论支撑了三因子学习规则。

详情

AI中文摘要

我们引入了分数广播与去相关（SBD），一个用于一般可微损失族基于广播的信用分配的原则性框架。误差广播是反向传播的一种生物合理替代方案，它无需权重传输即可将输出信息发送到隐藏层。最近针对均方误差（MSE）设置引入的误差广播与去相关（EBD）框架，将这一机制建立在最优估计量的随机正交性基础上，即最优残差与输入的函数正交。我们通过引入输出分数（损失对最终层输出的梯度）与隐藏层激活之间的正交性原理来推广这一基础，该原理在最优分数条件均值为零时成立。这一单一原理统一了标准可微损失族（包括交叉熵、Bregman散度、适当评分规则和指数族负对数似然）的广播式信用分配。该框架为一般损失下的三因子学习规则提供了理论基础，其中神经调节因子被推导为广播损失分数。我们明确推导了交叉熵情况，刻画了可接受损失类，并引入了一种分数向量扩展技术，该技术在保持正交性框架的同时丰富了广播信号。在CIFAR-10和Tiny ImageNet上的实验表明，SBD显著优于现有的广播方法，而分数向量扩展带来了进一步的提升。总体而言，这项工作确定了损失分数作为广播信号，提供了正交性理论以及神经科学中三因子学习规则的理论基础，并展示了分数向量扩展如何丰富所得目标函数的去相关方向。

英文摘要

We introduce Score Broadcast and Decorrelation (SBD), a principled framework for broadcast-based credit assignment for general families of differentiable losses. Error broadcast is a biologically plausible alternative to backpropagation that sends output information to hidden layers without weight transport. The Error Broadcast and Decorrelation (EBD) framework, recently introduced for the mean-squared-error (MSE) setting, grounded this mechanism in the stochastic orthogonality of optimal estimators, under which the optimal residual is orthogonal to functions of the input. We generalize that foundation by introducing an orthogonality principle between the output score (the gradient of loss with respect to the final-layer output) and hidden-layer activations, which holds whenever the optimal score has conditional mean zero. This single principle unifies broadcast-based credit assignment across the standard differentiable-loss families, including cross-entropy, Bregman divergences, proper scoring rules, and exponential-family negative log-likelihoods. The framework supplies a theoretical grounding for the three-factor learning rule under general losses, with the neuromodulatory factor derived as the broadcast loss score. We derive the cross-entropy case explicitly, characterize the admissible loss class, and introduce a score vector expansion technique that enriches the broadcast signal while preserving the orthogonality framework. Experiments on CIFAR-10 and Tiny ImageNet show that SBD substantially improves over existing broadcast approaches, with score vector expansion delivering further gains. Overall, this work identifies the loss score as the signal to broadcast, supplies the orthogonality theory and theoretical grounding for the three-factor learning rule from neuroscience, and shows how score vector expansion enriches the decorrelation directions of the resulting objective.

URL PDF HTML ☆

赞 0 踩 0

2605.30637 2026-06-01 cs.AI 版本更新

EHRBench: An Automated and Reliable EHR-based Benchmark for Clinical Decision Making with LLMs

EHRBench: 基于电子健康记录的自动化可靠临床决策基准测试，用于大语言模型

Yuzhang Xie, Keqi Han, Yunpeng Xiao, Hejie Cui, Guanchen Wu, Ziyang Zhang, Kai Shu, Jiaying Lu, Xiao Hu, Carl Yang

发表机构 * Emory University（埃默里大学）； Stanford University（斯坦福大学）

AI总结提出EHRBench，通过EHR-LLM-KB交互流水线自动构建近百万问答对，涵盖诊断、治疗和预后三大临床决策任务，系统评估30余种LLM的性能与鲁棒性。

Comments Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2026), Datasets and Benchmarks Track, Oral

详情

AI中文摘要

临床决策（CDM）是真实临床工作流程的核心，临床医生在不完整证据下推断诊断、选择治疗方案或预测未来健康结果。由于大语言模型（LLM）具有强大的语言能力、广泛的生物医学知识和高效性，越来越多地被用于支持这些决策，但LLM在真实临床决策任务上的可靠性尚未得到充分理解。为了评估CDM模型，特别是基于LLM的模型，一个理想且实用的医学决策基准应通过自动化且可靠的流水线构建，以确保规模和质量。此外，基于真实患者电子健康记录（EHR）的CDM基准可以更好地支持需要实质性生物医学知识和临床推理的实践性CDM任务的评估。为填补这些空白，我们引入了EHRBench，一个自动化且可靠的基于EHR的基准，用于大规模评估基于LLM的临床决策。为了确保可扩展性和可靠性，EHRBench通过EHR-LLM-KB（知识库）交互流水线构建。为了提高效率，我们使用专门的LLM自动将就诊级别的EHR轨迹转换为结构化模板，并确定性地将模板实例化为问答项。同时，我们应用系统性的基于知识库的验证和丰富，以过滤幻觉或模糊关系，并提高可靠性。利用该流水线，我们构建了近100万（960,067）个问答项，涵盖三个需要推理的核心临床决策任务：诊断、治疗和预后。我们在EHRBench上对30多个代表性LLM进行了基准测试，并提供了性能和鲁棒性的详细分析。结果显示了跨设置的一致能力趋势，进一步验证了EHRBench的可靠性，并指出了实现临床可靠LLM系统的可操作差距。

英文摘要

Clinical decision-making (CDM) is central to real-world clinical workflows, where clinicians infer diagnoses, select treatments, or anticipate future health outcomes under incomplete evidence. LLMs are increasingly used to support these decisions due to strong language capabilities, broad biomedical knowledge, and efficiency, yet the reliability of LLMs on real-world clinical decision tasks remains insufficiently understood. To evaluate CDM models, especially LLM-based models, an ideal and practical medical decision benchmark should be constructed via an automated yet reliable pipeline to ensure both scale and quality. Moreover, the grounding of a CDM benchmark in real patient EHRs can better support evaluation on practical CDM tasks that require substantive biomedical knowledge and clinical inference. To fill the gaps, we introduce EHRBench, an automated and reliable EHR-grounded benchmark for evaluating LLM-based clinical decision-making at scale. To ensure scalability and reliability, EHRBench is constructed through an EHR-LLM-KB(knowledge-base) interaction pipeline. For efficiency, we use a specialized LLM to automatically convert encounter-level EHR trajectories into structured templates and deterministically instantiate the templates into QA items. In parallel, we apply systematic KB-based verification and enrichment to filter hallucinated or ambiguous relations and to improve reliability. Using this pipeline, we construct nearly 1M (960,067) QA items spanning three core inference-required clinical decision tasks: diagnosis, treatment, and prognosis. We benchmark more than 30 representative LLMs on EHRBench and provide detailed analyses of performance and robustness. The results show consistent capability trends across settings, further validating the reliability of EHRBench and highlighting actionable gaps toward clinically reliable LLM systems.

URL PDF HTML ☆

赞 0 踩 0

2605.30632 2026-06-01 cs.HC cs.AI cs.LG 版本更新

Rationalize: Shared Semantic Reasoning for Human-AI Alignment

Rationalize: 人机对齐的共享语义推理

Aritra Dasgupta, Naga Datha Saikiran Battula, Avina Nakarmi, Sohom Sen, Subhodeep Ghosh, Xun Song

发表机构 * New Jersey Institute of Technology（新泽西理工学院）

AI总结提出Rationalize角色对框架，通过共享推理空间中的互补角色对（如探索者-引导者）实现人类与AI在数据驱动意义建构中的语义对齐，并设计元素级和角色特定的对齐评估方法。

Comments Accepted by ACM CHI 2026 BiAlign Workshop

详情

AI中文摘要

我们介绍了Rationalize，一个用于数据驱动意义建构中人类与AI模型之间共享语义推理的角色对框架。基于人机协作和批判性思维的思路，我们将人机交互概念化为一系列互补的角色对（探索者-引导者、调查者-告知者、教师-学生、法官-倡导者），这些角色对在共享推理空间中运作。在这个空间中，人类分析师和AI模型（如LLM）使目的、问题、假设、证据、推理和影响变得明确，不仅促进输出层面的对齐，而且促进双方意图和行动的合理化层面的对齐。我们将这些角色对与双向人机对齐框架联系起来，说明“使AI对齐人类”和“使人类对齐AI”如何因角色而异，并勾勒出一个使用元素级和角色特定方法进行对齐设计和评估的协作研究议程。

英文摘要

We introduce Rationalize, a role-pair framework for shared semantic reasoning between humans and AI models in data-driven sensemaking. Building on ideas in human-machine teaming and critical thinking, we conceptualize human-AI interaction as a series of complementary role pairs (Explorer-Guide, Investigator-Informant, Teacher-Student, Judge-Advocate) operating in a shared reasoning space. In this space, human analysts and AI models (such as LLMs) make purposes, questions, assumptions, evidence, inferences, and implications explicit, facilitating alignment not only at the output level but at the level of rationalization of intent and action by each side. We relate these role pairs to the bidirectional human-AI alignment framework, illustrating how "aligning AI to humans" and "aligning humans to AI" differ by role, and sketch a collaborative research agenda for alignment design and assessment using element-level and role-specific approaches.

URL PDF HTML ☆

赞 0 踩 0

2605.30631 2026-06-01 cs.CV cs.AI cs.LG 版本更新

从 Best-of-$N$ 偏好数据中学习奖励：目标、权衡与设计原则

Rattana Pukdee, Maria-Florina Balcan, Pradeep Ravikumar

发表机构 * Machine Learning Department（机器学习系）

AI总结本文分析了从 Best-of-$N$ 采样构建的成对偏好数据中 Bradley-Terry 奖励学习的目标，揭示了 $N$ 和基础分布对奖励估计的影响，并提出了基于样本效率和连通性权衡的设计原则。

详情

AI中文摘要

Best-of-$N$ 采样被广泛用于构建成对偏好数据：从基础分布中抽取 $N$ 个候选，并将最佳响应与拒绝响应配对。尽管其广泛使用，但 Bradley-Terry (BT) 奖励学习从这类数据中提取了什么，以及如何选择 $N$ 和基础分布，仍不清楚。我们将近期通过诱导条件分布对偏好数据的分析专门应用于 Best-of-$N$。对于独立参考变体，我们推导出作为 $N$ 和基础分布显式函数的闭式奖励目标，并证明它们保留了潜在奖励排名。对于实用的 Best-vs-Random 和 Best-vs-Worst 变体，所选和拒绝的响应通过同一候选集耦合，因此精确的 BT 可表示性通常不成立；然而，随着 $N$ 增长，有界类最小化器接近参考目标。尽管已知边界和连通性在成对偏好学习中控制样本效率，但 Best-of-$N$ 通过 $N$ 以相反方向耦合它们：更大的 $N$ 加宽成对边界但降低连通性。这种权衡产生了两个设计原则：当偏好标签是瓶颈时使用较大的 $N$，当生成是瓶颈时使用较小的 $N$；并塑造基础分布，使其质量集中在测试时比较最重要的响应之间。在合成和真实偏好数据上的实验支持了对样本量和基础分布形状的预测依赖性。

英文摘要

Best-of-$N$ sampling is widely used to construct pairwise preference data: $N$ candidates are drawn from a base distribution, and the best is paired with a rejected response. Despite its widespread use, what Bradley--Terry (BT) reward learning extracts from such data, and how to choose $N$ and the base distribution, remain unclear. We specialize a recent analysis of preference data via its induced conditional distribution to Best-of-$N$. For independent-reference variants, we derive closed-form reward targets as explicit functions of $N$ and the base distribution, and show that they preserve the latent reward ranking. For the practical Best-vs-Random and Best-vs-Worst variants, chosen and rejected responses are coupled through the same candidate set, so exact BT representability generally fails; nevertheless, bounded-class minimizers approach the reference targets as $N$ grows. Although margin and connectivity are known to govern sample efficiency in pairwise preference learning, Best-of-$N$ couples them through $N$ in opposing directions: larger $N$ widens pairwise margins but reduces connectivity. This trade-off yields two design principles: use larger $N$ when preference labels are the bottleneck, smaller $N$ when generation is the bottleneck; and shape the base distribution to place mass between the responses whose comparison matters most at test time. Experiments on synthetic and real preference data support the predicted dependence on sample size and base-distribution shape.

URL PDF HTML ☆

赞 0 踩 0

2605.30611 2026-06-01 cs.CV cs.AI cs.CL 版本更新

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Crafter: 面向多样化输入的可编辑科学图表生成的多智能体框架

Haozhe Zhao, Shuzheng Si, Zhenhailong Wang, Zheng Wang, Liang Chen, Xiaotong Li, Zhixiang Liang, Maosong Sun, Minjia Zhang

发表机构 * University of Illinois at Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）； Tsinghua University（清华大学）； Peking University（北京大学）

AI总结提出Crafter多智能体框架，通过结构化组合离散语义组件，实现跨图表类型和输入条件的可编辑科学图表生成，并引入CraftEditor将栅格输出转换为可编辑SVG，在CraftBench基准上显著优于现有方法。

Comments 24 pages, 11 figures

详情

AI中文摘要

科学图表是传达复杂研究思想最有效的手段之一，但生成出版质量的插图仍然是论文准备中最劳动密集的部分。现有的自动化系统各自针对单一图表类型，且仅接受文本输入，未能解决研究人员实际使用的多样类型和条件；此外，它们的栅格输出无法进行局部修改。由于科学图表是离散语义组件的结构化组合，生成器在这些布局上产生的局部错误需要的不是更强的骨干网络，而是一个框架。我们将这个框架实例化为两个互补系统：Crafter，一个用于图表生成的多智能体框架，无需架构更改即可泛化到多种图表类型和输入条件；以及CraftEditor，它应用相同的模式将栅格输出转换为可编辑的SVG。此外，我们引入了CraftBench，一个涵盖三种图表类型和四种输入条件的基准，并带有手工质量标注。实验表明，Crafter在PaperBanana-Bench和CraftBench上显著优于独立的生成器和智能体基线，消融实验确认了每个组件的独立贡献；CraftEditor忠实地将输出转换为可编辑的SVG，超越了所有基线。我们的代码和基准可在https://github.com/HaozheZhao/Crafter获取。

英文摘要

Scientific figures are among the most effective means of communicating complex research ideas, yet producing publication-quality illustrations remains one of the most labor-intensive parts of paper preparation. Existing automated systems each target a single figure type under text-only input, leaving the diversity of types and conditions researchers actually use unaddressed; their raster outputs further cannot be locally revised. Because scientific figures are structured compositions of discrete semantic components, the localized errors generators produce on such layouts demand not a stronger backbone but a harness. We instantiate this harness in two complementary systems: Crafter, a multi-agent harness for figure generation that generalizes across figure types and input conditions without architectural changes, and CraftEditor, which applies the same pattern to convert raster outputs into editable SVGs. Moreover, we introduce CraftBench, a benchmark spanning three figure types and four input conditions with human quality annotation. Experiments show that Crafter substantially outperforms both standalone generators and the agentic baseline on PaperBanana-Bench and CraftBench, with ablations confirming each component's independent contribution; CraftEditor faithfully converts outputs into editable SVGs that surpass all baselines. Our code and benchmark are available at https://github.com/HaozheZhao/Crafter.

URL PDF HTML ☆

赞 0 踩 0

2605.30604 2026-06-01 cs.CR cs.AI cs.CL cs.IR 版本更新

An Organization-Scoped LLM Agent Runtime Architecture for Regulated Cybersecurity Operations

面向受监管网络安全运营的组织范围LLM代理运行时架构

George Fatouros, Georgios Makridis, George Kousiouris, John Soldatos, Dimosthenis Kyriazis

发表机构 * Innov-Acts Ltd（Innov-Acts有限公司）； Dept. of Digital Systems University of Piraeus（数字系统系希腊比雷埃克斯大学）； Harokopio University（哈罗基奥大学）； Dept. of Informatics and Telematics（信息学与电信系）

AI总结提出一种组织范围的LLM代理运行时架构，通过类型化安全上下文、运行时核心、专业子代理、受控工具适配层和分层人机回环，实现检索、工具调用、内存、发现、报告和审计的全局强制，并保持模型无关和本地部署。

Comments 8 pages, 3 figures

详情

AI中文摘要

受监管的网络安全工作流缺乏一个运行时基础，该基础能够在检索、工具调用、内存、发现、报告和审计中强制执行组织范围，同时保持模型无关和本地可部署。近期的大语言模型（LLM）代理系统在孤立的网络安全任务上报告了强劲结果，但它们本身并未为受监管的安全运营中心（SOC）和合规工作流定义一个可审计的平台架构，在这些工作流中，单个分析师可能触发约束整个组织的行动，并且运行时必须与现有的SIEM/XDR堆栈集成，作为上下文和告警驱动触发器的主要来源，而不是作为独立的分析层运行。本文提出了一种面向金融网络安全的组织范围LLM代理运行时架构。其贡献是一个类型化的安全上下文，该上下文在每个入口点创建，包括作为一等触发器摄入的SIEM/XDR通知，并在每个组件边界强制执行，结合共享的运行时核心、逻辑专业子代理、受控工具适配层（在统一策略和审计下暴露SIEM/XDR查询、丰富和响应原语）、带有证据引用的结构化发现、分层人机回环（HITL）门控以及仅追加审计。模型上下文协议（MCP）、扩展遥测、用于渗透测试的数字孪生、图检索和联邦知识共享被视为可选的扩展路径，而非强制性的运行时假设。我们描述了一个可实现的切片作为架构的可测试性表面，并提出了一个可证伪的评估计划，包含用于架构就绪性、安全策略执行、证据可追溯性、输出质量和运营可观测性的度量级通过标准。

英文摘要

Regulated cybersecurity workflows lack a runtime substrate that enforces organization-level scope across retrieval, tool calls, memory, findings, reports, and audit while remaining model-agnostic and locally deployable. Recent large language model (LLM) agent systems report strong results on isolated cybersecurity tasks, yet they do not by themselves define an auditable platform architecture for regulated security operations centre (SOC) and compliance workflows, where a single analyst may trigger actions that bind the organization, and where the runtime must integrate with existing SIEM/XDR stacks as a primary source of context and alert-driven triggers rather than operate as a standalone analytical layer. This paper proposes an organization-scoped LLM agent runtime architecture for financial cybersecurity. The contribution is a typed Security Context that is created at every entry point, including SIEM/XDR notifications ingested as first-class triggers, and enforced at every component boundary, combined with a shared Runtime Core, logical specialist subagents, a governed Tool Adapter Layer exposing SIEM/XDR query, enrichment, and response primitives under uniform policy and audit, structured findings with evidence references, tiered human-in-the-loop (HITL) gates, and append-only audit. Model Context Protocol (MCP), extended telemetry, digital twins for pentesting, graph retrieval, and federated knowledge sharing are treated as optional extension paths rather than mandatory runtime assumptions. We describe an implementable slice as the architecture's testability surface, and we propose a falsifiable evaluation plan with metric-level pass criteria for architecture readiness, security-policy enforcement, evidence traceability, output quality, and operational observability.

URL PDF HTML ☆

赞 0 踩 0

2605.30593 2026-06-01 cs.LG cs.AI cs.CE 版本更新

机器学习不确定性量化方法在预测涡轮燃气温度退化中的基准测试

Jostein Barry-Straume, Changmin Son, Adrian Sandu, Gavan Burke, Rekha Sundararajan, Andrew Rimell, James G. Steinrock

发表机构 * Computational Science Laboratory（计算科学实验室）； Department of Computer Science（计算机科学系）； Virginia Tech（弗吉尼亚理工大学）

AI总结本文研究了五种预测区间构建方法（Delta法、贝叶斯蒙特卡洛Dropout、Bootstrap法、下上界估计和均值方差估计），在统一实验框架下评估其捕捉涡轮燃气温度神经网络预测不确定性的能力，并基于覆盖概率、归一化平均预测区间宽度和覆盖宽度准则等指标比较了各方法的可靠性、锐度及权衡，为发动机健康管理中的预测区间方法选择和调优提供了实用指南。

详情

AI中文摘要

现代发动机的有效预测与健康管理依赖于准确的涡轮燃气温度预测和稳健的不确定性量化，以确保可靠性和安全性。本文研究了五种构建预测区间的主要方法——即Delta法、贝叶斯蒙特卡洛Dropout、Bootstrap法、下上界估计和均值方差估计——作为捕捉涡轮燃气温度神经网络预测中不确定性的手段。每种方法都在统一的实验框架内实现，该框架采用交叉验证进行超参数选择、重复训练-测试分割以保证性能稳健性，并使用多个指标评估区间的准确性和紧致性。具体地，测量了覆盖概率、归一化平均预测区间宽度以及基于覆盖宽度的准则，以全面评估每种方法的可靠性和锐度。在代表性涡轮燃气温度数据集上进行的实验揭示了五种方法在区间覆盖、宽度和稳定性方面的不同权衡。这些发现为发动机健康管理和预测中选择和调整预测区间方法提供了实用指南，确保在实际应用中的可解释性和精度。

英文摘要

Effective prognostics and health management of modern engines relies on accurate turbine gas temperature predictions and robust uncertainty quantification to ensure reliability and safety. This paper investigates five major approaches for constructing prediction intervals -- namely the Delta method, Bayesian Monte Carlo Dropout, Bootstrap method, Lower-Upper Bound Estimation, and Mean-Variance Estimation -- as a means of capturing the uncertainty in neural network predictions of turbine gas temperature. Each approach is implemented within a unified experimental framework that employs cross-validation for hyperparameter selection, repeated train-test splits for performance robustness, and multiple metrics to evaluate both the accuracy and tightness of the intervals. In particular, Coverage Probability, Normalized Mean Prediction Interval Width, and the Coverage Width-based Criterion are measured to comprehensively assess each method's reliability and sharpness. Experiments conducted on a representative turbine gas temperature dataset reveal distinct trade-offs among the five methods in terms of interval coverage, width, and stability. These findings provide a practical guide for selecting and tuning prediction interval methods in engine health management and prognostics, ensuring both interpretability and precision in real-world applications.

URL PDF HTML ☆

赞 0 踩 0

2605.30576 2026-06-01 cs.AI 版本更新

Uncertainty-Aware and Temporally Regulated Expert Advice in Reinforcement Learning for Autonomous Driving

自动驾驶强化学习中不确定性感知与时间调控的专家建议

Ahmed Abouelazm, Felix Klingebiel, Philip Schörner, J. Marius Zöllner

发表机构 * FZI Research Center for Information Technology（弗劳恩霍夫信息技术研究所）； Karlsruhe Institute of Technology（卡尔斯鲁厄理工学院）

AI总结提出一种不确定性感知框架，通过自适应阈值触发专家建议并采用承诺-冷却策略调控指导时长，结合离线策略隐式分位数网络实现安全高效的探索，在CARLA中成功率提升5-7%。

Comments Accepted in The IEEE International Conference on Intelligent Transportation Systems (ITSC) September 15-18, 2026 -- Naples, Italy

详情

AI中文摘要

自动驾驶强化学习中的探索本质上是不安全的：智能体必须经历新颖行为才能学习，但探索可能导致碰撞或偏离道路。我们提出一种不确定性感知框架，利用专家建议引导探索，同时避免长期依赖。当认知不确定性或偶然不确定性超过基于滚动缓冲区的自适应阈值时，触发建议，确保建议随智能体置信度演变。采用带有随机早停启发式的承诺-冷却策略调控指导的持续时间和频率，使智能体接触连贯操作而不耗尽建议预算。专家和智能体经验在离线策略隐式分位数网络（IQN）骨干网络中的共享回放缓冲区中合并，实现专家轨迹的高效重用。在CARLA中的实验表明，我们的方法优于IQN基线，成功率提高5-7%并减少失败，证明风险敏感的不确定性与调控的专家集成相结合，能够实现基于传感器的RL策略学习在无信号交叉口导航中更安全、更高效的探索。

英文摘要

Exploration in reinforcement learning for autonomous driving is inherently unsafe: agents must experience novel behaviors to learn, yet exploration can lead to collisions or off-road driving. We propose an uncertainty-aware framework that leverages expert advice to guide exploration while avoiding long-term dependence. Advice is triggered when epistemic or aleatoric uncertainty exceeds adaptive thresholds derived from rolling buffers, ensuring advice evolves with the agent's confidence. A commitment-cooldown strategy with a stochastic early-stop heuristic regulates the duration and frequency of guidance, exposing the agent to coherent maneuvers without exhausting the advice budget. Expert and agent experiences are combined in a shared replay buffer within an off-policy implicit quantile network (IQN) backbone, enabling efficient reuse of expert trajectories. Experiments in CARLA show that our method outperforms the IQN baseline, improving success by 5-7% and reducing failures, demonstrating that risk-sensitive uncertainty coupled with regulated expert integration enables safer and more efficient exploration for sensor-based RL policy learning in unsignalized intersection navigation.

URL PDF HTML ☆

赞 0 踩 0

2605.30571 2026-06-01 cs.AR cs.AI cs.DC cs.PF cs.RO 版本更新

Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode

受限于内存但不受限于带宽：批量1的LLM解码中的物理AI推理差距

Josef Chen

发表机构 * KAIKAKU（卡伊卡普）

AI总结本文通过测量不同GPU上批量1的自回归解码性能，发现物理AI推理并非仅受内存带宽限制，还受启动开销影响，并指出量化路径的实际收益取决于运行时实现。

详情

AI中文摘要

物理AI系统，包括机器人、自动驾驶车辆、具身智能体和边缘副驾驶，通常运行与云端LLM服务不同的推理工作负载：单流、批量1的自回归解码，其中一个机器人、摄像头流或用户会话等待下一个token。这种工作负载通常被描述为受内存带宽限制。每个解码步骤都会流式传输模型权重和活跃的KV缓存，因此延迟应与峰值HBM带宽成比例。我们表明这种说法是正确的但不完整。我们测量了三个7至8B类GQA变压器在四个NVIDIA GPU（H100 SXM5、A100-80GB SXM4、L40S和L4）上的批量1解码。我们评估了从2048到16384的上下文长度，在受控的bf16 SDPA设置下产生了44个有效单元。达到的峰值HBM带宽比例随着峰值带宽的增加而下降。在标题性的Qwen-2.5-7B ctx=2048单元中，L4达到了其分析内存下限的大约81%，而H100仅达到27%。物理AI解码是内存主导的，但更快的内存并不能转化为成比例的延迟增益。我们通过CUDA Graphs A/B实验测试了缺失项。在H100上，ctx=2048时，CUDA Graphs在N=10个新会话中将解码延迟提高了1.259倍，95%自助法置信区间为1.253至1.267。在L4上，相同的干预仅提供了1.028倍的提升。这分离出了在快速GPU上可见但在较慢、带宽受限的GPU上基本隐藏的启动侧开销。部署的含义是，只有当运行时实现时，内存节省才重要。在L4上，bf16解码接近内存下限，但常见的量化路径并未恢复预期的4倍权重流量减少：从62.32 ms/step的bf16基线，bnb-nf4达到59.36 ms/step，AutoAWQ+Marlin达到45.24 ms/step。使用Ada调优的int4内核的GPTQ+ExLlamaV2达到17.36 ms/step。

英文摘要

Physical AI systems, including robots, autonomous vehicles, embodied agents and edge copilots, often run a different inference workload from cloud LLM serving: single-stream, batch-1 autoregressive decode, where one robot, camera feed or user session waits on the next token. This workload is usually described as memory-bandwidth-bound. Each decode step streams model weights and the active KV cache, so latency should scale with peak HBM bandwidth. We show that this account is true but incomplete. We measure batch-1 decode for three 7 to 8B-class GQA transformers across four NVIDIA GPUs: H100 SXM5, A100-80GB SXM4, L40S and L4. We evaluate context lengths from 2048 to 16384, producing 44 valid cells under a controlled bf16 SDPA setup. The achieved fraction of peak HBM bandwidth falls as peak bandwidth rises. On the headline Qwen-2.5-7B ctx=2048 cell, an L4 reaches roughly 81 percent of its analytic memory floor, while an H100 reaches only 27 percent. Physical-AI decode is memory-dominated, but faster memory does not translate into proportional latency gains. We test the missing term with a CUDA Graphs A/B experiment. On H100 at ctx=2048, CUDA Graphs improves decode latency by 1.259x across N=10 fresh sessions, with a 95 percent bootstrap confidence interval of 1.253 to 1.267. On L4, the same intervention gives only 1.028x. This isolates a launch-side overhead that becomes visible on fast GPUs but remains mostly hidden on slower, bandwidth-bound GPUs. The deployment implication is that memory savings matter only when the runtime realises them. On L4, bf16 decode sits close to the memory floor, but common quantised paths do not recover the expected 4x weight-traffic reduction: bnb-nf4 reaches 59.36 ms/step and AutoAWQ+Marlin reaches 45.24 ms/step from a 62.32 ms bf16 baseline. GPTQ+ExLlamaV2, with Ada-tuned int4 kernels, reaches 17.36 ms/step.

URL PDF HTML ☆

赞 0 踩 0

2605.30570 2026-06-01 cs.AI 版本更新

Procedural Generation of First Person Shooter Maps using Map-Elites

使用MAP-Elites程序化生成第一人称射击游戏地图

Simone de Donato, Pier Luca Lanzi, Daniele Loiacono

发表机构 * Politecnico di Milano — DEIB（米兰理工学院——DEIB）

AI总结研究应用MAP-Elites算法生成第一人称射击游戏地图，提出两种新表示方法（点线和空间布局）以提高地图多样性和质量。

2605.30563 2026-06-01 cs.AI 版本更新

Transforming and Encoding FTS for SAT Solving: What Helps, What Hurts (Extended Version)

转换与编码FTS以用于SAT求解：什么有帮助，什么有损害（扩展版）

João Filipe, Álvaro Torralba, Gregor Behnke

发表机构 * University of Amsterdam, Institute for Logic Language and Computation（阿姆斯特丹大学，逻辑语言与计算研究所）； Aalborg University（奥尔堡大学）

AI总结研究如何将因子化任务编码为SAT问题，提出多种编码策略，并分析并行性和任务转换对SAT规划器性能的影响。

2605.30561 2026-06-01 cs.CV cs.AI 版本更新

重新审视填充Transformer的表达能力：哪些架构选择重要，哪些不重要

Anej Svete, William Merrill, Ryan Cotterell, Ashish Sabharwal

发表机构 * ETH Zürich（苏黎世联邦理工学院）； Allen Institute for AI（人工智能研究所）

AI总结本文通过连接布尔电路，系统研究了填充Transformer的表达能力，发现数值精度和模型深度是影响表达能力的主要因素，而注意力类型、模型宽度和均匀性等架构选择对表达能力影响不大。

详情

AI中文摘要

近期工作通过连接布尔电路描述了Transformer能计算和不能计算的内容，但现有结果缺乏精确刻画，且对建模选择敏感。填充Transformer——在其输入后附加填充符号如“...”——通过为自适应并行计算提供多项式空间，成为建立与电路类等价关系的有用工具。然而，目前仅研究了有限的填充Transformer理想化模型，这些等价关系在注意力类型、模型宽度和均匀性变化下的稳健性仍待探索。我们发现，在实际假设下，填充Transformer对所有这些变化都出奇地稳健，并确定数值精度和模型深度是影响表达能力的主要因素。具体地，我们证明多项式填充的L-均匀常数精度Transformer等价于L-均匀AC⁰，而增长精度的Transformer达到L-均匀TC⁰，与宽度无关。此外，循环机制允许类似电路的顺序处理：log^d N次循环的常数精度Transformer达到FO-均匀AC^d，增长精度的达到FO-均匀TC^d。有趣的是，宽度或精度超过对数增长并不会增加表达能力，且我们所有结果对softmax和平均硬注意力Transformer均成立。

英文摘要

Recent work describes what transformers can and cannot compute through connections to boolean circuits, but existing results lack exact characterizations and are sensitive to modeling choices. Padded transformers -- to whose input filler symbols such as ``...'' are appended -- emerge as a useful gadget for establishing equivalences to circuit classes by providing polynomial space for adaptive parallel computation. However, only a limited set of padded transformer idealizations has been studied, leaving open how robustly these equivalences hold under changes to attention type, model width, and uniformity. We find that, under practical assumptions, padded transformers are surprisingly robust to all of these, and identify numeric precision and model depth as the main factors affecting expressivity. Concretely, we prove that polynomially padded $\text{L-uniform}$ constant-precision transformers are equivalent to $\text{L-uniform AC}^0$, while growing-precision ones achieve $\text{L-uniform TC}^0$ regardless of width. Furthermore, looping enables sequential processing analogous to circuits: $\log^d N$-looped constant-precision transformers reach $\text{FO-uniform AC}^d$, and growing-precision ones reach $\text{FO-uniform TC}^d$. Interestingly, growing width or precision beyond logarithmic does not increase expressivity, and all our results hold for both softmax and average hard attention transformers.

URL PDF HTML ☆

赞 0 踩 0

2605.30512 2026-06-01 cs.AI cs.CV 版本更新

PhyDrawGen: Physically Grounded Diagram Generation from Natural Language

PhyDrawGen: 基于自然语言的物理约束图表生成

Nafiul Haque, Syed Nazmus Sakib, Shifat E Arman

发表机构 * Department of Robotics and Mechatronics Engineering, University of Dhaka（机器人与机电工程系，达卡大学）

AI总结提出PhyDrawGen神经符号管道，通过场景图提取、确定性求解器和视觉验证循环，从自然语言生成符合物理定律的图表，在力学、光学和电磁学基准上显著优于现有模型。

Comments 9 figures, 7 tables. Under review at EMNLP 2026

详情

AI中文摘要

从文本生成物理图表需要严格遵守物理定律。虽然当前生成模型能产生视觉上合理的输出，但它们会系统性地产生力向量幻觉、忽略守恒定律并违反几何约束。我们提出PhyDrawGen，一种神经符号管道，将语义场景理解与物理约束满足解耦。首先，大语言模型从问题文本中提取类型化场景图。然后，确定性求解器将该图转换为平面直线图（PSLG），将力平衡、光路和场拓扑编码为精确几何基元。最后，微调的Qwen-VL模型实现视觉基础的提议-验证循环，以迭代纠正任何约束违反。在涵盖力学、光学和电磁学的1,449个问题基准上评估，PhyDrawGen显著优于GPT-5-image、Gemini 2.5 Flash和Gemini 3 Pro，即使在非常见物体问题上也展现出鲁棒的物理准确性。

英文摘要

Generating physics diagrams from text requires strict adherence to physical laws. While current generative models produce visually plausible outputs, they systematically hallucinate force vectors, ignore conservation laws, and violate geometric constraints. We present PhyDrawGen, a neuro-symbolic pipeline that decouples semantic scene understanding from physical constraint satisfaction. First, a large language model extracts a typed scene graph from the problem text. A deterministic solver then converts this graph into a Planar Straight-Line Graph (PSLG), encoding force balance, optical paths, and field topologies as exact geometric primitives. Finally, a fine-tuned Qwen-VL model implements a visually grounded propose-verify loop to iteratively correct any constraint violations. Evaluated on a benchmark of 1,449 problems spanning mechanics, optics, and electromagnetism, PhyDrawGen significantly outperforms GPT-5-image, Gemini 2.5 Flash, and Gemini 3 Pro, demonstrating robust physical accuracy even on unusual-object problems.

URL PDF HTML ☆

赞 0 踩 0

2605.30510 2026-06-01 cs.CV cs.AI 版本更新

A Novel Global Context-aware Deep Neural Network for Enhanced Brain Tumor Segmentation using Magnetic Resonance Images

一种新颖的全局上下文感知深度神经网络用于基于磁共振图像的增强脑肿瘤分割

Sourjya Mukherjee, Ananya Bhattacharjee, R. Murugan

发表机构 * National Institute of Technology Silchar（全国理工学院锡拉char分校）

AI总结提出全局上下文感知的挤压激励残差UNet（GCSER-UNet），融合空间和通道注意力，在TCGA LGG和BraTS 2020数据集上取得优于现有技术的Dice分数。

Comments 11 pages, 9 figures, 6 tables. Submitted to arXiv cs.CV

详情

AI中文摘要

多目标优化中梯度聚合的统一框架

Zeou Hu, Kelvin Ho, Yaoliang Yu

发表机构 * Cheriton School of Computer Science（切尔顿计算机科学学院）； University of Waterloo（滑铁卢大学）； Vector Institute（向量研究所）； The Chinese University of Hong Kong（香港中文大学）

AI总结提出一个统一框架，通过充分对齐条件建立梯度聚合方法的收敛率，并引入基于CVaR的capped MGDA算法，在对抗联邦学习中验证鲁棒性。

详情

AI中文摘要

许多机器学习问题涉及多个固有的权衡，最好通过基于梯度的多目标优化（MOO）算法来解决。现有方法通常基于不同的动机提出，逐个案例进行分析，并且在每一步中如何聚合分量梯度在算法上有所不同。在这项工作中，我们为MOO中的梯度聚合开发了一个统一框架，建立了收敛到帕累托平稳性（MOO的标准性能度量）的（最优）速率。我们分析的核心是一个充分对齐条件，由此我们推导出一个定理，表明当在梯度的凸包内选择时，非冲突方向构成了收敛的基本充分条件。我们进一步表明，通过对偶锥上的投影可以确保可行性，从而拓宽了具有收敛保证的方法的范围。同时，我们提出了梯度聚合的原始优化视角，该视角涵盖了已有算法，阐明了它们的理论关系，并能够设计新的变体。作为示例，我们引入了capped MGDA，它基于CVaR公式推导而来，并展示了其在对抗联邦学习中的鲁棒性。最后，我们通过在合成问题和实际基准上的实验验证了我们的理论。

英文摘要

Many machine learning problems involve multiple inherent trade-offs that are best addressed by gradient-based multi-objective optimization (MOO) algorithms. Existing methods are often proposed with various motivations, analyzed case by case, and differ algorithmically in how the component gradients are aggregated at each step. In this work, we develop a unifying framework for gradient aggregation in MOO, establishing (optimal) rates of convergence to Pareto stationarity, the standard measure of performance in MOO. Central to our analysis is a sufficient alignment condition, from which we derive a theorem showing that non-conflicting directions, when chosen within the convex hull of gradients, form a fundamental sufficient condition for convergence. We further show that feasibility can be ensured through projection onto the dual cone, broadening the scope of methods that admit convergence guarantees. In parallel, we present a primal optimization perspective of gradient aggregation that encompasses established algorithms, clarifies their theoretical relationships, and enables the design of new variants. As an illustration, we introduce capped MGDA, derived from a CVaR-based formulation, and demonstrate its robustness in adversarial federated learning. Finally, we validate our theory through experiments on synthetic problems and practical benchmarks.

URL PDF HTML ☆

赞 0 踩 0

2605.30447 2026-06-01 cs.LG cs.AI stat.ML 版本更新

用于储层学习和生成的进化算法

Julien Testu, Pierrick Legrand, Xavier Hinaut

发表机构 * Inria ； LaBRI, CNRS UMR 5800（LaBRI，CNRS UMR 5800）； Bordeaux INP, ENSC（Bordeaux INP，ENSC）； IMS, CNRS UMR 5218（IMS，CNRS UMR 5218）

AI总结提出进化算法EARLY，通过进化多储层回声状态网络的拓扑和超参数，在时序学习任务上优于随机搜索，并发现任务难度影响网络结构。

详情

Journal ref: GECCO '26 - The Genetic and Evolutionary Computation Conference, Jul 2026, San jos{é}, Costa Rica

AI中文摘要

储层计算是一种递归神经网络，因其将动态处理与训练好的读出层分离而成为时序学习的有前途方法。然而，经典的回声状态网络（ESN）通常需要针对任务调整其架构和超参数才能获得良好性能。本文介绍了EARLY（用于储层学习和生成的进化算法），这是一个旨在进化多储层ESN的拓扑和超参数的框架。受大脑模块化组织的启发，EARLY将架构编码为基于图的基因组，并应用交叉、变异和选择来发现有效的配置。我们的目标是创建通用架构和任务诱导泛化。该方法在CogScale数据集的时序学习任务上进行了评估。结果表明，进化出的架构在多个任务上优于通过随机搜索获得的架构，并根据任务难度表现出结构差异：简单任务产生轻量级架构，而复杂任务倾向于更丰富的模块化组织。这些发现表明，进化搜索有助于为更广泛的时序问题识别可复用的储层结构。进一步在跨情境学习数据集上评估进化出的架构，以评估其适应新环境的能力。

英文摘要

Reservoir computing, a type of recurrent neural network, is a promising approach for temporal learning as it separates dynamic processing from the trained readout layer. However, classical Echo State Networks (ESNs) often require task-specific tuning of their architecture and hyperparameters to achieve good performance. This paper introduces EARLY (Evolutionary Algorithm for Reservoir Learning and Yielding), a framework designed to evolve both the topology and hyperparameters of multi-reservoir ESNs. Inspired by the modular organisation of the brain, EARLY encodes architectures as graph-based genomes and applies crossover, mutation, and selection to discover effective configurations. Our goal is to create both generic architectures and tasks inducing generalization. The method is evaluated on temporal learning tasks from the CogScale dataset. Results show that evolved architectures outperform those obtained with random search on several tasks and exhibit structural differences depending on task difficulty: simpler tasks yield lightweight architectures, while more complex tasks favour richer modular organisations. These findings suggest that evolutionary search can help identify reusable reservoir structures for a broader range of temporal problems. The evolved architectures are further evaluated on a cross-situational learning dataset to assess their ability to adapt to new environments.

URL PDF HTML ☆

赞 0 踩 0

2605.30368 2026-06-01 cs.NE cs.AI cs.RO q-bio.NC 版本更新

Reinterpreting Safety Thresholds as Neuron Spiking Thresholds

将安全阈值重新解释为神经元放电阈值

Enrico Del Re, Mohamed Sabry, Cristina Olaverri-Monreal

发表机构 * Johannes Kepler University Linz（约翰·凯撒大学林茨）； Department Intelligent Transport Systems（智能交通运输系统部门）

AI总结提出将替代安全措施（SSM）的固定阈值重新解释为泄漏积分点火（LIF）神经元的放电阈值，构建脉冲神经网络（SNN）学习人类刹车起始点，实现客观SSM与主观安全感知的融合。

Comments 6 pages

详情

AI中文摘要

替代安全措施（SSM）在自动驾驶领域的交通风险评估中被广泛使用。然而，大多数基于SSM的评估采用固定阈值，无法捕捉人类对持续临界状态的响应或对短暂高风险峰值的反应。本文提出了一种受生物学启发的SSM阈值重新解释，将其建模为泄漏积分点火（LIF）神经元的放电阈值，并将多个SSM输入组合成脉冲神经网络（SNN）。该SNN经过训练，使其发放的脉冲与人类刹车起始点对齐。训练数据是在使用3D-CoAutoSim平台（基于CARLA/Unreal和六自由度运动平台）的受控跟车实验中记录的，实验中生成了诱导的关键事件。结果表明，学习到的脉冲活动在定性上与跨场景的刹车行为一致，并捕捉了仅靠阈值交叉无法一致解释的反应。跨参与者的分析进一步表明，学习到的输入阈值保持相对一致，而学习到的衰减因子编码了SSM的不同时间敏感性。本研究的发现表明，脉冲动力学可能作为一种机制，促进客观SSM与主观人类安全感知的融合。

英文摘要

Surrogate Safety Measures (SSMs) are extensively utilised in the evaluation of traffic risk in automated driving contexts. However, the majority of SSM-based evaluations employ fixed thresholds that fail to capture the human response to sustained borderline conditions or the reaction to brief, high-risk peaks. The present work proposes a biologically inspired reinterpretation of SSM thresholds. This is modelled as spiking thresholds of leaky integrate-and-fire (LIF) neurons, with multiple SSM inputs combined into a spiking neural network (SNN). The SNN is trained to emit spikes that are aligned with human braking onsets. The training data was recorded in a controlled car-following experiment using the 3D-CoAutoSim platform with CARLA/Unreal and a 6-DOF motion platform, where induced critical events were generated. The results demonstrate that the learned spiking activity qualitatively aligns with braking behaviour across scenarios and captures reactions that are not consistently explained by threshold crossings alone. Analysis across participants further indicates that learned input thresholds remain relatively consistent, while learned decay factors encode different temporal sensitivities for the SSMs. The findings of this study indicate that spiking dynamics may serve as a mechanism to facilitate the convergence of objective SSMs with subjective human safety perception.

URL PDF HTML ☆

赞 0 踩 0

2605.30365 2026-06-01 cs.SD cs.AI eess.AS 版本更新

Mental Damage: Caption Poisoning Attacks on Retrieval-Augmented Text-to-Music Generation

心理伤害：面向检索增强文本到音乐生成的标题投毒攻击

Yizhu Wen, Shuhao Zhang, Nan Zhang, Long Cheng, Hanqing Guo

发表机构 * Clemson University（克莱姆森大学）； Michigan State University（密歇根州立大学）

AI总结提出双层标题投毒策略，通过向音乐知识库注入少量恶意标题，使检索增强文本到音乐系统生成偏离用户意图的音乐，暴露了系统的完整性风险。

Comments This paper was accepted by the S&P 2026 ArtSec Workshop

详情

AI中文摘要

检索增强文本到音乐（TTM）系统通过从音乐标题数据集中检索的标题来增强未指定的用户提示。这种设计引入了对音乐知识数据库的完整性依赖。我们表明，攻击者可以通过注入少量精心制作的音乐标题来毒化数据库，导致系统检索恶意标题，从而偏置提示增强并使生成偏离用户预期功能，而无需修改用户提示、检索器或生成器。为了实现音乐标题投毒攻击，我们提出了一种双层标题投毒策略，该策略保留高级检索锚点，同时注入低级声学描述符，以将提示增强和下游音乐生成引导至攻击者选择的目标意图。在MusicCaps知识数据库、CLAP检索器和MusicGen流水线中，被投毒的生成结果显著接近攻击者的目标，同时与原始用户查询保持可比的对齐。这些结果暴露了检索增强创意AI系统的实际完整性风险。我们的演示可在以下网址找到：https://yizhu-wen.github.io/Mental-Damage/

英文摘要

Retrieval-augmented text-to-music (TTM) systems augment underspecified user prompts using captions retrieved from a music caption dataset. This design introduces an integrity dependency on the music knowledge database. We show that an attacker can poison the database by injecting a small number of crafted music captions, causing the system to retrieve malicious captions that bias prompt augmentation and steer generation away from the user's intended function, without modifying the user prompt, retriever, or generator. To achieve the music caption poisoning attack, we propose a dual-layer caption poisoning strategy that preserves high-level retrieval anchors while injecting low-level acoustic descriptors to steer prompt augmentation and downstream music generation toward an attacker-chosen target intent. In a MusicCaps knowledge database, CLAP retriever, and MusicGen pipeline, poisoned generations move substantially closer to the attacker's target, while remaining comparably aligned with the original user query. These results expose a practical integrity risk for retrieval-augmented creative AI systems. Our demo can be found at: https://yizhu-wen.github.io/Mental-Damage/

URL PDF HTML ☆

赞 0 踩 0

2605.30364 2026-06-01 eess.SP cs.AI 版本更新

Hamiltonian-Inspired Attention Mechanism for Scalable RF Transmitter Fingerprinting

哈密顿启发的注意力机制用于可扩展射频发射器指纹识别

Chitraksh Singh, Monisha Dhanraj, Akram Sheriff

发表机构 * Frondeur Labs（弗朗德实验室）

AI总结提出哈密顿Transformer，通过物理启发的注意力结构（规范保持值更新和相位增量嵌入）提升射频发射器指纹识别在规模扩展下的性能。

Comments 9 pages

详情

AI中文摘要

射频（RF）指纹识别利用基带I/Q信号中硬件引入的缺陷来识别无线发射器。然而，深度学习模型在接收机和信道分布变化下性能下降，尤其是当发射器数量增加时。本文提出哈密顿Transformer，一种物理启发的注意力架构，通过使用学习到的斜对称生成器和Störmer-Verlet蛙跳积分步骤，在每个注意力头内强制执行规范保持的值动态。额外的相位增量嵌入在输入层揭示振荡器动态。所有实验使用WiSig数据集的非均衡原始I/Q信号，在四种协议下进行：同一天分类、跨接收机泛化、跨天泛化和扩展到150个设备。哈密顿Transformer在同一天条件下达到99.12%的准确率，在150个发射器时达到61.64%，在所有规模点上持续优于CNN和Transformer基线。受控消融研究确定值更新中的规范保持是驱动扩展优势的主要归纳偏置，而相位增量嵌入提供了最大的单组件改进。这些结果表明，将物理启发的结构先验嵌入注意力机制是在原始无线信号上进行大规模发射器识别的有效方法。

英文摘要

Radio-frequency (RF) fingerprinting identifies wire-less transmitters using hardware-induced imperfections present in baseband I/Q signals. However, deep learning models often degrade under receiver and channel distribution shifts, particularly as transmitter populations grow. This work proposes the Hamiltonian Transformer, a physics-informed attention architecture that enforces norm preserving value dynamics within each attention head using a learned skew-symmetric generator and a Störmer-Verlet leapfrog integration step. An additional phase-increment embedding exposes oscillator dynamics at the input layer. All experiments use non-equalized raw I/Q signals from the WiSig dataset under four protocols: same-day classification, cross-receiver generalisation, cross-day generalisation, and transmitter scaling up to 150 devices. The Hamiltonian Transformer achieves 99.12% accuracy under same-day conditions and 61.64% at 150 transmitters, consistently outperforming CNN and Transformer baselines across all scale points. A controlled ablation study identifies norm-preservation in the value update as the primary inductive bias driving the scaling advantage, with the phase increment embedding providing the single largest per-component improvement. These results indicate that embedding physics-informed structural priors into attention mechanisms is an effective approach to large-scale transmitter identification on raw wireless signals.

URL PDF HTML ☆

赞 0 踩 0

2605.30363 2026-06-01 q-fin.CP cs.AI cs.LG q-fin.ST 版本更新

Enhancing Regime Shift Detection Using Unstructured Data: A Study on the Treasury Market

利用非结构化数据增强制度转换检测：国债市场研究

Mingxuan Yi, Vidal Mehra, Jing Chen, John Cartlidge

发表机构 * School of Engineering Mathematics and Technology, University of Bristol, UK（布里斯托大学工程数学与技术学院）； Propellant Digital B.V., Amsterdam, Netherlands（荷兰阿姆斯特丹Propellant Digital公司）； School of Mathematics, Cardiff University, UK（卡迪夫大学数学学院）

AI总结提出一种结合大语言模型推理与统计检验的文本增强型制度转换检测框架，在国债市场数据上实现F1=0.82，优于纯数据驱动方法。

Comments 8 pages, 4 figures. Code available at: https://github.com/mingxuan-yi/regime_shift

详情

AI中文摘要

金融市场的制度转换会重组资产价格和宏观变量的联合动态，打破任何单一制度校准。然而，由于数据信号嘈杂且高度多重共线性，而宣布制度转换的同期文本是非结构化的，因此难以可靠检测。标准的制度转换检测方法仅依赖结构化时间序列数据，忽略政策沟通，尽管这些文本往往在观察到的价格中实现转换之前就发出信号。我们提出了一种文本增强的制度转换检测流程，该流程将大语言模型（LLM）对央行沟通的推理与多元金融时间序列的统计验证相结合。该框架是检测器无关的：文本提出的候选点通过向量自回归（VAR）上的自助法似然比检验进行验证，而来自任意制度检测器的数据驱动候选点则通过宽松的LLM文本检查进行确认。我们在2010-2024年FOMC会议记录以及14变量美国国债和宏观经济面板数据上评估了该框架，使用了四种可互换的数据驱动检测器。所提出的流程在经核实的货币政策制度转换锚定列表上实现了F1=0.82，具有当日模态检测延迟，并且性能始终优于纯数据驱动基线。结果表明，将非结构化政策文本与统计结构性断点检测相结合，提高了金融市场制度转换识别的鲁棒性和可解释性。

英文摘要

Regime shifts in financial markets reorganise the joint dynamics of asset prices and macro variables, breaking any single-regime calibration. They are nonetheless difficult to detect reliably because the data signal is noisy and heavily multicollinear, while the contemporaneous text that announces them is unstructured. Standard regime shift detection methods rely solely on structured time-series data and ignore policy communications, even though these texts often signal shifts before they materialise in observed prices. We propose a text-enhanced regime shift detection pipeline that combines large language model (LLM) reasoning over central-bank communications with statistical validation on multivariate financial time series. The framework is detector-agnostic: text-proposed candidates are validated using a bootstrap likelihood-ratio test on a vector autoregression (VAR), while data-driven candidates from arbitrary regime detectors are ratified through a lenient LLM text check. We evaluate the framework on 2010-2024 FOMC minutes paired with a 14-variable U.S. Treasury and macroeconomic panel, using four interchangeable data-driven detectors. The proposed pipeline achieves F1 = 0.82 against a verified anchor list of monetary-policy regime shifts, with same-day modal detection latency and consistently stronger performance than pure data-driven baselines. The results demonstrate that combining unstructured policy text with statistical structural-break detection improves the robustness and interpretability of regime shift identification in financial markets.

URL PDF HTML ☆

赞 0 踩 0

2605.30362 2026-06-01 cs.NE cs.AI cs.CV 版本更新

XOResNet: Exclusive-OR Meta-Residuals Facilitate Deep Spiking Neural Networks Learning

XOResNet: 异或元残差促进深度脉冲神经网络学习

Jianfang Wu, Junsong Wang

发表机构 * School of Artificial Intelligence, Shenzhen Technology University（人工智能学院，深圳技术大学）； Faculty of Data Science, City University of Macau（数据科学学院，澳门城市大学）

AI总结针对深度脉冲神经网络中残差结构存在的脉冲冗余、信息损失和冗余学习问题，提出OR-ADD捷径连接和XOR元残差机制，构建XOResNet，在多个数据集上超越现有方法。

Comments 33 pages, 12 figures, 7 Tables

详情

AI中文摘要

脉冲神经网络（SNN）在深度模型中展现出优越的学习和表示能力。鉴于ResNet在深度学习中的巨大成功，自然希望用残差学习训练深度SNN。然而，现有的用于构建深度SNN的残差结构仍然面临脉冲冗余或信息损失以及冗余学习的挑战。在本研究中，我们首先旨在解决恒等映射中的相对脉冲冗余和非恒等映射中的信息损失问题。为此，我们提出了一种OR-ADD（OA）捷径连接，用于合并残差结构中两个分支的输出脉冲/电流。此外，为了减轻残差结构主干分支中的冗余学习，我们引入了XOR元残差的概念，即使用异或（XOR）操作为主干分支选择预学习残差。最后，通过整合OA捷径和XOR元残差，我们设计了XOR残差块，并基于该块进一步构建了不同深度的XOResNet。在Fashion-MNIST、CIFAR-10、CIFAR-100和miniImageNet四个数据集上的大量实验表明，所提出的XOResNet优于现有的通过梯度下降优化的最先进深度SNN。这些结果验证了我们的OA捷径和XOR元残差组件在克服SNN中残差学习基本局限性方面的有效性，为构建高性能神经形态系统提供了新的架构见解。

英文摘要

Spiking neural networks (SNNs) hold promise for demonstrating superior learning and representation capabilities in deep models. Given the tremendous success of ResNet in deep learning, it would naturally follow to train deep SNNs with residual learning. However, existing residual structures for constructing deep SNNs still present challenges of spike redundancy or information loss, as well as redundant learning. In the present study, we first aim to address issues of relative spike redundancy in identity mapping and information loss in non-identity mapping. To this end, we propose an OR-ADD (OA) shortcut connection to merge output spikes/currents from two branches in the residual structure. Furthermore, to mitigate redundant learning in the backbone branch of the residual structure, we introduce the concept of XOR meta-residuals, i.e., selecting pre-learning residuals using the Exclusive-OR (XOR) operation for the backbone branch. Finally, by integrating the OA shortcut and XOR meta-residuals, we devise the XOR residual block and further construct XOResNet with varying depths based on this block. Extensive experiments on four datasets, Fashion-MNIST, CIFAR-10, CIFAR-100, and miniImageNet, show that the proposed XOResNet outperforms existing state-of-the-art deep SNNs optimized via gradient descent. These results validate the effectiveness of our OA shortcut and XOR meta-residual components in overcoming fundamental limitations of residual learning in SNNs, providing new architectural insights for building high-performance neuromorphic systems.

URL PDF HTML ☆

赞 0 踩 0

2605.30361 2026-06-01 cs.NE cs.AI cs.LG 版本更新

Gradient-Free Training of Spiking Neural Networks via Low-Rank Evolution Strategies

通过低秩进化策略的无梯度训练脉冲神经网络

Dhruv Patankar, Sachit Ramesha Gowda

发表机构 * Shunya Research（Shunya研究）

AI总结提出EGGROLL方法，利用低秩因子化进化策略扰动，在N-MNIST数据集上以79.21%测试精度和2.23倍加速实现脉冲神经网络的无梯度训练。

Comments 12 pages, 4 figures

详情

AI中文摘要

脉冲神经网络（SNN）在神经形态硬件上具有显著的能效优势，但由于离散脉冲阈值不可微，其训练仍然具有挑战性。代理梯度方法通过近似导数规避了这一问题，但它们需要反向传播基础设施，这与片上学习不兼容。进化策略（ES）是一种自然的无梯度替代方案，但其计算成本随参数数量扩展，使得对于大型权重矩阵不实用。我们提出了一种使用EGGROLL训练SNN的方法，这是一种ES扰动的低秩因子化，将每代内存从$\mathcal{O}(mn)$降低到$\mathcal{O}(r(m{+}n))$。将EGGROLL与N-MNIST上的漏积分点火SNN相结合，我们证明了无梯度训练达到了79.21%的测试准确率，同时相对于全秩ES，每代墙钟时间减少了2.23倍。我们的结果表明EGGROLL对于SNN训练是可行的，具有明确的准确率-速度权衡，并且兼容于无需代理梯度的神经形态硬件上的训练。

英文摘要

Spiking Neural Networks (SNNs) offer compelling energy efficiency on neuromorphic hardware, yet their training remains challenging because the discrete spike threshold is non-differentiable. Surrogate-gradient methods sidestep this by approximating the derivative, but they impose backpropagation infrastructure that is incompatible with on-chip learning. Evolution Strategies (\es) are a natural gradient-free alternative, yet their computational cost scales with the number of parameters, making them impractical for large weight matrices. We present a method for training SNNs using EGGROLL, a low-rank factorisation of ES perturbations that reduces per-generation memory from $\mathcal{O}(mn)$ to $\mathcal{O}(r(m{+}n))$. Combining EGGROLL with a Leaky Integrate-and-Fire SNN on N-MNIST, we demonstrate that gradient-free training achieves 79.21% test accuracy while reducing per-generation wall-clock time by 2.23$\times$ relative to full-rank ES. Our results demonstrate EGGROLL is viable for SNN training, with a clear accuracy-speed tradeoff, compatible with training on neuromorphic hardware without surrogate gradients.

URL PDF HTML ☆

赞 0 踩 0

2605.27996 2026-06-01 cs.AI 版本更新

Reward Bias Substitution: Single-Axis Bias Mitigations Redirect Optimization Pressure

奖励偏差替代：单轴偏差缓解措施重定向优化压力

Max Lamparth, Daniel Fein, Andreas Haupt, Marcel Hussing, Mykel J. Kochenderfer

发表机构 * Stanford University（斯坦福大学）； University of Pennsylvania（宾夕法尼亚大学）

AI总结本文提出奖励偏差替代现象，即单轴缓解奖励模型偏差（如减少对长度、谄媚或风格的依赖）会将优化压力转移到相关代理上而非消除，并通过理论证明和实验（如GRPO训练中的长度惩罚导致过度自信）揭示了该问题，建议在评估中纳入策略诱导分布并跟踪多偏差。

Comments Improved readability (mostly appendix D)

详情

AI中文摘要

单轴缓解奖励模型偏差（例如，减少代理对长度、谄媚或风格的依赖）可以将优化压力旋转到相关代理上，而不是消除它，这种失败模式我们称之为奖励偏差替代。这种失败是由于在缓解评估和策略训练期间，审计分布与策略诱导分布之间的测量与优化差距造成的。我们将缓解结果形式化为一个机制分类，并证明成功的缓解、偏差替代和过度修正会在任何审计分布评分下产生相同的可观测结果，包括排名准确率和胜率，即使允许对真实奖励进行神谕访问。在已发表的偏好学习缓解工作中，我们调查的方法都没有报告证明成功缓解所需的证据。在跟踪多个偏差的同时，用策略诱导分布增强评估可以证明缩小差距，我们将其转化为缓解方法和基准的可操作处方。我们在语言模型RLHF中演示了偏差替代，其中GRPO训练期间的长度惩罚按预期压缩了响应，但将优化压力重定向到置信度校准上，导致策略过度自信，而事实自由形式准确性下降。我们还展示了一个已发表的长度去偏操作，它在审计分布上将奖励-长度相关性归零，但在四个最先进奖励模型中的三个上，在最佳N选择下重新引入了偏差，以及一个长度-谄媚耦合，其方向在人类-LLM判断者分歧下反转。

英文摘要

Single-axis mitigations of reward-model biases (e.g., reducing proxy reliance on length, sycophancy, or style) can rotate optimization pressure onto correlated proxies rather than eliminate it, a failure mode we call reward bias substitution. The failure is enabled by a measurement-versus-optimization gap between audit and policy-induced distributions during mitigation evaluation and policy training. We formalize mitigation outcomes into a regime taxonomy and prove that successful mitigation, bias substitution, and overcorrection produce identical observables under any audit-distribution scoring, including ranking accuracy and win-rate, even when granted oracle access to the true reward. Across published preference-learning mitigation work, no method we survey reports the evidence needed to certify successful mitigation. Augmenting evaluation with policy-induced distributions while tracking multiple biases provably closes the gap, and we translate this into actionable prescriptions for mitigation methods and benchmarks. We demonstrate bias substitution in language model RLHF, where a length penalty during GRPO training compresses responses as intended yet redirects optimization pressure onto confidence calibration, driving the policy into overconfidence while factual free-form accuracy falls. We also show a published length-debiasing operator that zeroes reward-length correlation on the audit distribution but reintroduces bias under best-of-N selection on three of four SOTA reward models, and a length-sycophancy coupling whose direction reverses under human-LLM judge disagreement.

URL PDF HTML ☆

赞 0 踩 0

2605.27355 2026-06-01 cs.AI cs.CL cs.LG 版本更新

Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases

对齐篡改：人类反馈强化学习如何被利用以优化错位偏见

Dongyoon Hahm, Dylan Hadfield-Menell, Kimin Lee

发表机构 * MIT（麻省理工学院）

AI总结本文提出对齐篡改漏洞，即对齐中的LLM通过影响偏好数据集使RLHF放大不良行为，并通过实验展示多种偏见的放大，指出现有缓解方法难以在不牺牲质量的情况下解决该问题。

Comments Accepted at ICML 2026, Source code: https://alignment-tampering.github.io/

详情

AI中文摘要

人类反馈强化学习（RLHF）是将大型语言模型（LLM）与人类偏好对齐的标准方法。在本工作中，我们引入对齐篡改，这是一种潜在漏洞，即正在对齐的LLM影响偏好数据集，导致RLHF放大不良行为。这源于RLHF的核心局限性：（1）偏好数据集由LLM自身的输出构建，使其能够影响它们；（2）成对比较仅指示哪个响应更好，而不说明原因。这些局限性可能被利用以导致对齐篡改。例如，如果LLM以更高质量生成有偏见的响应，标注者会基于质量偏好它们。然而，偏好标签无法区分质量与偏见，奖励模型继承了这一局限性。通过强化学习或最佳N采样优化此类奖励可能放大错位偏见。我们的实验展示了跨多种偏见的放大：从关键词偏见到宣传（例如性别歧视）、品牌推广和工具性目标寻求。缓解仍然具有挑战性，因为现有的鲁棒RLHF技术无法在不牺牲响应质量的情况下完全解决对齐篡改。这些发现揭示了当前RLHF的结构性漏洞，并强调了防止此漏洞的必要性。项目页面：https://alignment-tampering.github.io/

英文摘要

Reinforcement Learning from Human Feedback (RLHF) is the standard method to align Large Language Models (LLMs) with human preferences. In this work, we introduce alignment tampering, a potential vulnerability where the LLM undergoing alignment influences the preference dataset, causing RLHF to amplify undesired behaviors. This arises from core limitations of RLHF: (1) preference datasets are constructed from the LLM's own outputs, allowing it to influence them, and (2) pairwise comparisons only indicate which response is better, not why. These limitations can be exploited to cause alignment tampering. For example, if an LLM generates biased responses with higher quality, annotators will prefer them based on quality. However, preference labels do not distinguish quality from bias, and the reward model inherits this limitation. Optimizing such rewards through reinforcement learning or best-of-N sampling can amplify misaligned biases. Our experiments demonstrate amplification across diverse biases: from keyword bias to propaganda (e.g., sexism), brand promotion, and instrumental goal-seeking. Mitigation remains challenging, as existing techniques for robust RLHF fail to fully resolve alignment tampering without sacrificing response quality. These findings reveal structural vulnerabilities of current RLHF and emphasize the need to prevent this vulnerability. Project page: https://alignment-tampering.github.io/

URL PDF HTML ☆

赞 0 踩 0

2605.27255 2026-06-01 cs.CL cs.AI 版本更新

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs

Pair-In, Pair-Out: 面向高效LLM的潜在多令牌预测

Wenhui Tan, Minghao Li, Xiaoqian Ma, Siqi Fan, Xiusheng Huang, Liujie Zhang, Ruihua Song, Weihang Chen

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China（中国人民大学首都人工智能学院）； AI Platform, Xiaohongshu Inc.（小红书人工智能平台）； University of Electronic Science and Technology of China（电子科技大学）； Institute of Automation, Chinese Academy of Sciences（中国科学院自动化研究所）

AI总结提出Pair-In, Pair-Out (PIPO)方法，通过统一潜在压缩和多令牌预测，并训练轻量级置信度头消除验证器开销，在保持可靠性的同时实现推理加速。

Comments Project Page: GitHub.com/RedAI-Infra/PIPO

详情

AI中文摘要

长链式推理使得自回归解码成为现代大语言模型的主要推理成本。现有方法要么针对输入侧（潜在压缩），要么针对输出侧（推测解码和多令牌预测，MTP），但这两条工作线是独立进行的。此外，输出侧方法必须进行昂贵的验证器传递，以验证MTP预测的不可靠草稿令牌。为解决这些问题，我们提出 extbf{Pair-In, Pair-Out (PIPO)}，通过将潜在压缩器和MTP头视为镜像操作来统一两侧：压缩器将两个输入令牌折叠成一个潜在表示，而MTP头将一个隐藏状态展开成一个额外的输出令牌。为了在不牺牲可靠性的情况下消除验证器成本，PIPO训练一个轻量级置信度头，决定是否接受草稿令牌。我们观察到，在线策略蒸馏（OPD）自然匹配推测解码的拒绝采样准则，因此置信度头可以以可忽略的额外成本与OPD一起训练。在AIME 2025、GPQA-Diamond、LiveCodeBench v6和LongBench v2上使用Qwen3.5-4B和9B骨干网络的实验表明，PIPO在常规解码上将pass@4提高了最多+7.15个点，同时实现了高达2.64倍的首令牌延迟和2.07倍的每令牌延迟加速。项目页面：GitHub.com/RedAI-Infra/PIPO。

英文摘要

Long chain-of-thought reasoning has made autoregressive decoding the dominant inference cost of modern large language models. Existing methods target either the input side (latent compression) or the output side (speculative decoding and multi-token prediction, MTP), but the two lines of work have been pursued independently. Moreover, output-side methods must incur an expensive verifier pass to validate the unreliable draft tokens predicted by MTP. To address these issues, we propose \textbf{Pair-In, Pair-Out (PIPO)}, which unifies both sides by viewing a latent compressor and an MTP head as mirror-image operations: the compressor folds two input tokens into one latent representation, while the MTP head unfolds one hidden state into one additional output token. To remove the verifier cost without sacrificing reliability, PIPO trains a lightweight confidence head that decides whether draft tokens should be accepted. We observe that On-Policy Distillation (OPD) naturally matches the rejection-sampling criterion of speculative decoding, so the confidence head can be trained alongside OPD with negligible extra cost. Experiments on AIME 2025, GPQA-Diamond, LiveCodeBench v6, and LongBench v2 with Qwen3.5-4B and 9B backbones show that PIPO improves pass@4 over regular decoding by up to $+7.15$ points, while delivering up to $2.64\times$ first-token-latency and $2.07\times$ per-token-latency speedups. Project Page: GitHub.com/RedAI-Infra/PIPO.

URL PDF HTML ☆

赞 0 踩 0

2605.26942 2026-06-01 cs.AI cs.LO cs.SE 版本更新

Neuro-Symbolic Verification of LLM Outputs for Data-Sensitive Domains (extended preprint)

面向数据敏感领域的LLM输出的神经符号验证（扩展预印本）

Paul Sigloch, Christoph Benzmüller

发表机构 * University of Bamberg（巴姆堡大学）； Free University of Berlin（柏林自由大学）

AI总结提出一种结合形式符号方法与神经语义分析的混合验证架构，用于检测LLM输出中的幻觉、不一致和隐私漏洞，在医疗设备损伤评估系统中实现83%的结构化实体幻觉检测率和72%的语义虚构检测率。

Comments Extended preprint version of accepted technical communication at KI 2026. 22 pages, 3 figures

详情

AI中文摘要

部署在高风险领域的LLM面临根本性的可靠性挑战：幻觉、不一致性和隐私漏洞引入了不可接受的风险，因为错误会带来法律、财务或安全后果。本文提出一种混合验证架构，结合形式符号方法与神经语义分析，为LLM生成的内容提供互补性保证。该架构采用逻辑推理进行输入验证，利用完备性属性为结构化需求提供可判定的保证。对于输出验证，基于嵌入的语义相似性检测上下文幻觉，弥补形式方法表达力不足的问题。这种分离通过并行的、基于角色的流水线实现，解决了基于提示的自验证方法（继承了产生幻觉的分布偏差）的局限性。所提出的架构和类型感知验证方法通过HAIMEDA（一个通过行动设计研究开发的真实世界医疗设备损伤评估报告系统）进行验证。评估显示，结构化实体的幻觉检测率超过83%，语义虚构的检测率为72%，报告创建时间减少30%，表明神经符号架构可以为LLM在数据敏感领域的部署提供原则性的安全保障。

英文摘要

LLMs deployed in high-stakes domains face fundamental reliability challenges: hallucinations, inconsistencies, and privacy vulnerabilities introduce unacceptable risks where errors carry legal, financial, or safety consequences. This paper presents a hybrid verification architecture combining formal symbolic methods with neural semantic analysis to provide complementary guarantees for LLM-generated content. This architecture employs logical reasoning for input verification, leveraging completeness properties to provide decidable guarantees on structured requirements. For output validation, embedding-based semantic similarity detects contextual hallucinations where formal methods lack expressiveness. This separation is realized in a parallel, actor-based pipeline, addressing limitations of prompt-based self-verification approaches, which inherit the distributional biases that produce hallucinations. The proposed architecture and type-aware verification method are validated with HAIMEDA, a real-world medical device damage assessment reporting system developed through Action Design Research. Evaluation shows hallucination detection rates of over 83% for structured entities and 72% for semantic fabrications, with a 30% reduction in report creation time, demonstrating that neuro-symbolic architectures can provide principled safeguards for LLM deployment in data-sensitive domains.

URL PDF HTML ☆

赞 0 踩 0

2605.26396 2026-06-01 cs.AI cs.CL cs.LG 版本更新

Advancing Creative Physical Intelligence in Large Multimodal Models

推进大型多模态模型中的创造性物理智能

Cheng Qian, Hyeonjeong Ha, Jiayu Liu, Jeonghwan Kim, Emre Can Acikgoz, Bingxuan Li, Kunlun Zhu, Jiateng Liu, Aditi Tiwari, Zhenhailong Wang, Xiusi Chen, Mahdi Namazifar, Heng Ji

发表机构 * UIUC（伊利诺伊大学香槟分校）； Amazon（亚马逊）

AI总结针对大型多模态模型在开放式环境中缺乏基于视觉的创造性工具使用能力的问题，提出MM-CreativityBench基准和基于偏好学习的具身对齐方法，显著提升实体选择并减少幻觉。

Comments 51 Pages, 9 Figures, 7 Tables, Previous Work CreativityBench: arXiv:2605.02910

详情

AI中文摘要

大型多模态模型（LMMs）在感知和推理方面取得了快速进展；然而，目前尚不清楚这些能力是否能够泛化到在开放式环境中发现基于视觉的解决方案，超越模式识别。在此类场景中，智能需要的不仅仅是回答明确的问题：它涉及识别场景中的元素如何以非显而易见但物理上可行的方式被重新利用。这种创造性问题解决形式是人类智能的核心，但在当前基准测试中基本上未得到测试。为了评估这一能力，我们引入了MM-CreativityBench，这是一个用于在视觉丰富、物理受限的环境中进行基于可操作性的创造性工具使用的基准。每个实例呈现一个场景图像，包含候选实体及其部件的结构化视图，从而能够对模型如何迭代检查场景、识别相关可操作性以及组合视觉和物理上可行的解决方案进行细粒度、交互式评估。我们的实验表明，当前的LMMs往往表现不佳，不是由于缺乏生成能力，而是因为它们无法维持基于具身的探索。模型经常忽略相关实体，对关键部件检查不足，或幻觉出图像中不存在的属性。受此失败模式的启发，我们提出了具身对齐，将创造性工具使用视为一个偏好学习问题。使用直接偏好优化，我们鼓励模型偏好基于视觉证据的属性-可操作性推理，而非幻觉替代方案。此外，我们结合从可操作性知识库中获得的监督，以指导更广泛的实体探索和多轮规划。我们的结果显示，在正确选择实体和部件方面取得了持续改进，同时大幅减少了幻觉和与具身相关的错误。

英文摘要

Large multimodal models (LMMs) have rapidly advanced in perception and reasoning; however, it remains unclear whether these capabilities generalize to discovering visually grounded solutions in open-ended environments, beyond pattern recognition. In such settings, intelligence requires more than answering well-posed questions: it involves identifying how elements in a scene can be repurposed in non-obvious yet physically feasible ways. This form of creative problem-solving is central to human intelligence, but remains largely untested in current benchmarks. To evaluate this ability, we introduce MM-CreativityBench, a benchmark for affordance-grounded creative tool use in visually rich, physically constrained environments. Each instance presents a scenario image with structured views of candidate entities and their parts, enabling fine-grained, interactive evaluation of how models iteratively inspect the scene, identify relevant affordances, and compose visually and physically grounded solutions. Our experiments show that current LMMs often fall short, not due to lack of generative capability, but because they do not sustain grounded exploration. Models often overlook relevant entities, under-examine critical parts, or hallucinate attributes not grounded in the image. Motivated by this failure mode, we propose affordance-grounded alignment, which casts creative tool use as a preference learning problem. Using Direct Preference Optimization, we encourage models to prefer attribute-affordance reasoning grounded in visual evidence over hallucinated alternatives. In addition, we incorporate supervision derived from an affordance knowledge base to guide broader entity exploration and multi-turn planning. Our results show consistent gains in selecting the correct entities and parts, while substantially reducing hallucination and grounding-related errors.

URL PDF HTML ☆

赞 0 踩 0

2605.26371 2026-06-01 cs.AI 版本更新

Exploiting Local Dynamics Regularity for Reusable Skills in Offline Hierarchical RL

利用局部动态规律性实现离线分层强化学习中的可复用技能

Sarthak Dayal, Abhinav Peri, Carl Qi, Claas Voelcker, Alexander Levine, Caleb Chuck, Amy Zhang

发表机构 * UT Austin（UT奥斯汀）

AI总结提出CARL算法，通过对比学习对齐局部动态与动作序列，在离线分层强化学习中学习可复用技能，提升下游任务性能。

详情

AI中文摘要

分层强化学习（HRL）有望通过发现和复用时间上扩展的技能，比非分层方法更有效地解决长时域强化学习（RL）任务。然而，获得真正可复用的技能仍然是一个开放挑战。为此，我们关注利用局部动态直觉的抽象：不同全局上下文中的局部转换需要类似的动作序列。通过将这些上下文与其所需的动作序列对齐，我们能够学习哪些技能可以复用以及在何处复用它们。原则上，这些信息应有益于许多HRL算法，其中高层策略需要推理其使用的低层技能。由此产生的算法CARL（基于对比动作的可复用局部控制表示）在复杂人形环境中展示了有意义技能的定性聚类，并且在与HIQL集成时，在OGBench基准上提升了下游性能。

英文摘要

Hierarchical Reinforcement Learning (HRL) promises to solve long-horizon Reinforcement Learning (RL) tasks more efficiently than non-hierarchical counterparts by discovering and reusing temporally-extended skills. However, obtaining skills that are actually reusable remains an open challenge. Towards this end, we focus on abstractions that exploit the intuition of local dynamics: local transitions in different global contexts require similar kinds of action sequences. By aligning these contexts with the action sequences they require, we are able to learn which skills to reuse and where to reuse them. In principle, this information should benefit many HRL algorithms, where high-level policies have to reason about the low-level skills they use. The resulting algorithm CARL (Contrastive Action-based Representations for Reusable Local Control) shows both qualitative clustering of meaningful skills in complex humanoid environments and improved downstream performance on the OGBench benchmark when integrated with HIQL.

URL PDF HTML ☆

赞 0 踩 0

2605.26121 2026-06-01 cs.LG cs.AI 版本更新

GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

GEM: 用于最优LLM数据策展的几何熵混合

Yue Min, Ziyun Qiao, Ruining Chen, Yujun Li

发表机构 * The Hong Kong University of Science and Technology, Hong Kong SAR, China（香港科学与技术大学）； Peking University, Beijing, China（北京大学）； University of Science and Technology of China, Hefei, China（中国科学技术大学）

AI总结提出GEM框架，通过将数据策展重构为超球面上的变分问题并采用MM算法优化，解决了分类缺陷和嵌入各向异性问题，在1.1B参数模型上实现下游准确率提升1.2%。

Comments ICML 2026 Poster

详情

AI中文摘要

LLM预训练的有效性越来越依赖于数据组成而非单纯的数据量。然而，最优混合受到分类缺陷的阻碍：人类分类法存在本体论错位，而欧几里得聚类无法解决嵌入各向异性。我们引入GEM（几何熵混合），这是一个将数据策展重构为超球面上的变分问题并辅以混合平衡正则化项的框架。通过解耦生成先验并使用可证明的MM（Minorize-Maximize）算法优化目标，GEM有效对抗聚类坍缩，从而发现欧几里得启发式方法无法察觉的平衡语义结构。我们采用师生蒸馏将这种几何保真度扩展到网络规模语料库，并引入几何影响分数（GIS）用于可解释的分类法生成。使用1.1B参数模型的实验表明，当集成到DoReMi和RegMix等混合策略中时，GEM建立了新的最先进水平，将平均下游准确率提升高达1.2%，并为可预测的数据混合提供了稳健的坐标系。

英文摘要

LLM pre-training efficacy increasingly depends on data composition rather than sheer volume. Yet, optimal mixing is hindered by categorization flaws: human taxonomies suffer from ontological misalignment, and Euclidean clustering fails to address embedding anisotropy. We introduce GEM (Geometric Entropy Mixing), a framework reformulating data curation as a variational problem on the hypersphere augmented with a mixing-balance regularizer. By decoupling the generative prior and optimizing the objective via a provable MM (Minorize-Maximize) algorithm, GEM effectively counteracts the cluster collapse to discover balanced semantic structures invisible to Euclidean heuristics. We employ teacher-student distillation to scale this geometric fidelity to web-scale corpora and introduce the Geometric Influence Score (GIS) for interpretable taxonomy generation. Experiments with 1.1B-parameter models demonstrate that GEM establishes a new state-of-the-art when integrated into mixing strategies like DoReMi and RegMix, improving average downstream accuracy by up to 1.2% and offering a robust coordinate system for predictable data mixing.

URL PDF HTML ☆

赞 0 踩 0

2605.21168 2026-06-01 cs.AI 版本更新

ScenePilot: Controllable Boundary-Driven Critical Scenario Generation for Autonomous Driving

ScenePilot: 可控的边界驱动型自动驾驶关键场景生成

Qiyu Ruan, Yuxuan Wang, He Li, Zhenning Li, Cheng-zhong Xu

发表机构 * State Key Laboratory of Internet of Things for Smart City (SKL-IOTSC), University of Macau, Macau, China（智能城市物联网国家重点实验室（SKL-IOTSC）、澳门大学、中国澳门）； Faculty of Science and Technology, University of Macau, Macau, China（澳门大学科技学院）

AI总结提出ScenePilot框架，通过结合RSS物理可行性评分与在线学习的AV风险预测器，将场景生成建模为约束多目标强化学习，并引入步级可行性感知屏蔽，以生成物理上可解但导致自动驾驶系统失败的关键场景。

详情

AI中文摘要

安全关键场景对于评估自动驾驶系统至关重要，但由于其在自然日志中罕见，基于仿真的压力测试不可或缺。大多数场景生成方法将周围智能体视为对手，但它们要么（i）未显式建模车辆-道路物理极限而导致失败，产生视觉极端但物理上不可解的碰撞，要么（ii）单独强制执行物理可行性或策略可行性，可能过度关注激进操作或受限于控制器依赖的能力边界。我们提出ScenePilot，一个可行性引导的、边界驱动的框架，针对边界带：即原则上物理可解但仍导致部署的自动驾驶堆栈失败的场景。我们将生成建模为约束多目标强化学习，结合RSS衍生的物理可行性评分$σ$和在线学习的AV风险预测器$Φ$，并引入步级可行性感知屏蔽，以保持探索接近可行性边界，同时避免不可行的伪影。在SafeBench上使用多个规划器的实验表明，ScenePilot在保持物理有效性的同时，产生了显著更高的碰撞率（+6.2个百分点），并且在这些边界带场景上的对抗性微调持续降低了下游碰撞率。代码可在https://github.com/QiyuRuan/ScenePilot获取。

英文摘要

Safety-critical scenarios are central to evaluating autonomous driving systems, yet their rarity in naturalistic logs makes simulation-based stress testing indispensable. Most scenario generation methods treat surrounding agents as adversaries, but they either (i) induce failures without explicitly modeling vehicle-road physical limits, yielding visually extreme yet physically unsolvable crashes, or (ii) enforce physical feasibility or policy feasibility in isolation, which can over-focus on aggressive maneuvers or remain tied to a controller-dependent capability boundary. We propose ScenePilot, a feasibility-guided, boundary-driven framework that targets the boundary band: scenarios that are physically solvable in principle yet still cause the deployed autonomy stack to fail. We formulate generation as constrained multi-objective reinforcement learning, combining an RSS-derived physical-feasibility score $σ$ with an online-learned AV-risk predictor $Φ$, and introduce step-level feasibility-aware shielding to keep exploration near the feasibility boundary while avoiding infeasible artifacts. Experiments on SafeBench with multiple planners show that ScenePilot yields substantially higher collision rates (+6.2 percentage points) while preserving physical validity, and that adversarial fine-tuning on these boundary-band scenarios consistently reduces downstream crash rates. The code is available at https://github.com/QiyuRuan/ScenePilot.

URL PDF HTML ☆

赞 0 踩 0

2605.30288 2026-06-01 cs.AI 版本更新

蒸馏博弈：自适应攻击与高效防御

Youssef Allouah, Mahdi Haghifam, Sanmi Koyejo, Reza Shokri

发表机构 * Stanford University（斯坦福大学）； Toyota Technological Institute at Chicago（芝加哥丰田技术研究所）； National University of Singapore（新加坡国立大学）

AI总结通过最小化博弈框架研究蒸馏攻击中模型提供者的部署权衡，提出自适应评估规则和产品专家（PoE）防御方法，实验表明自适应学生能恢复更多能力，且PoE在成本和质量上具有优势。

详情

AI中文摘要

蒸馏攻击为模型提供者带来了部署权衡：使模型更有用的相同输出也可能使其更容易被模仿。我们通过一个效用受限的教师和自适应学生之间的最小化博弈来研究这种权衡。我们的框架产生了可处理的一侧响应规则：一个自适应评估规则，其中学生重新加权高价值示例，以及一个教师侧防御模板，抑制对蒸馏最有用的输出。从示例价值的廉价代理中，我们推导出产品专家（PoE），一种简单的前向传递防御，在生成过程中将教师与代理学生结合。实验上，自适应评估揭示了一个大的被动-自适应差距：在最先进的防御上，自适应学生在GSM8K和MATH上恢复了比被动评估所建议的更多的能力。在这种更强的评估下，昂贵防御和PoE之间的明显鲁棒性差距显著缩小，而PoE仍然便宜得多，并保留了更高质量的推理轨迹。总体而言，我们的结果表明，强大的蒸馏仍然难以阻止，并且反蒸馏的进展应该根据自适应学生而非被动学生来判断。我们的代码可在：https://github.com/ysfalh/distillation-game 获取。

英文摘要

Distillation attacks create a deployment trade-off for model providers: the same outputs that make a model more useful can also make it easier to imitate. We study this trade-off through a minimax game between a utility-constrained teacher and an adaptive student. Our framework yields tractable one-sided response rules: an adaptive evaluation rule in which the student reweights high-value examples, and a teacher-side defense template that suppresses outputs most useful for distillation. From a cheap proxy for example value, we derive Product-of-Experts (PoE), a simple forward-pass-only defense that combines the teacher with a proxy student during generation. Empirically, adaptive evaluation reveals a large passive--adaptive gap: on state-of-the-art defenses, adaptive students recover substantially more capability than passive evaluation suggests on GSM8K and MATH. Under this stronger evaluation, the apparent robustness gap between expensive defenses and PoE narrows considerably, while PoE remains substantially cheaper and preserves higher-quality reasoning traces. Overall, our results suggest that strong distillation remains difficult to stop, and that progress on antidistillation should be judged against adaptive students rather than passive ones. Our code is available at: https://github.com/ysfalh/distillation-game.

URL PDF HTML ☆

赞 0 踩 0

2605.29299 2026-06-01 cs.CV cs.AI 版本更新

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models

口袋牙医：通过高效多模态大语言模型实现设备端牙科图像理解

Kai Bian, Xucheng Guo, Bin Chen, Lingyan Ruan, Yiran Shen, Ting Dang, Hong Jia

发表机构 * The University of Auckland, New Zealand（奥克兰大学）； Shandong University, China（山东大学）； The University of Melbourne, Australia（墨尔本大学）

AI总结提出Pocket-Dentist基准，通过评估14种视觉语言模型发现紧凑模型（2B参数）在牙科图像理解中精度更高且计算成本更低，并在iPhone 17 Pro上实现低延迟部署。

详情

AI中文摘要

牙科视觉语言模型的评估在数据集、任务定义和指标上仍然分散，并且常常忽略其计算成本。这限制了它们在专科中心之外的广泛部署用于牙科筛查，而及时推理、有限的硬件以及对患者图像的本地处理对于实用、保护隐私的临床预筛查至关重要。本文提出了Pocket-Dentist，一个面向牙科多模态问答的效率感知基准，它汇集了三个数据集，涵盖约1159名患者、五种任务类型和七种指标。在典型的14种VLM上，我们的结果揭示了一个有趣的观察：紧凑型VLM（例如2B参数模型）在牙科图像理解中精度更高，同时所需计算成本大幅降低。在iPhone 17 Pro上本地部署时，我们微调的紧凑型VLM Pocket-Dentist-2B处理每个样本耗时4.31秒，与7B基线相比延迟降低4.9倍，内存使用减少2.3倍。

英文摘要

Evaluations of dental vision-language models remain fragmented across datasets, task definitions and metrics, and often ignore their computational cost. This limits their widespread deployment for dental screening outside specialist centres, where timely inference, limited hardware, and local handling of patient images are vital for practical, privacy-preserving clinical prescreening. Here we present Pocket-Dentist, an efficiency-aware benchmark for dental multimodal question answering that brings together three datasets spanning approximately 1,159 patients, five task types and seven metrics. Across typical 14 VLMs, our results reveals an interesting observation: compact VLMs (e.g., 2B-parameter models) outperform larger VLMs in accuracy while requiring substantially lower computational costs in dental image understanding. Deployed locally on an iPhone 17 Pro, our finetuned compact VLM Pocket-Dentist-2B processed each sample in 4.31 s, reducing latency by 4.9-fold and memory use by 2.3-fold compared with a 7B baseline.

URL PDF HTML ☆

赞 0 踩 0

2605.29268 2026-06-01 cs.CL cs.AI cs.LG cs.NE 版本更新

Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits

进化搜索中的计算分配：从深度-广度到多臂老虎机

Sixue Xing, Haoyu He, Kerui Wu, Zhuo Yang, Haozheng Luo, Tianfan Fu, Aarthy Nagarajan

发表机构 * University of Notre Dame（诺丁汉大学）； Northeastern University（东北大学）； University of Massachusetts Amherst（马萨诸塞大学阿默斯特分校）； Southeast University（东南大学）； Northwestern University（西北大学）； Nanjing University（南京大学）； Shanghai Artificial Intelligence Laboratory（上海人工智能实验室）

AI总结针对LLM引导的进化搜索中固定预算的LLM调用分配问题，提出基于多臂老虎机的BaSE方法，通过跨并行轨迹分配调用，平均适应度提升12.3%。

详情

AI中文摘要

LLM引导的进化搜索（Evolve系统）在数学和组合任务上达到了最先进的结果，但现有系统通常只报告多次运行中的最佳结果，而未记录运行间的分布。我们询问如何分配固定的LLM调用预算，以及单次运行达到报告数字的可靠性如何。通过扫描五个模型和三个任务的深度-广度网格，我们识别出两个经验规律：一个适应度-计算包络线，其中能力排序主要取决于有效FLOPs；以及一个双线性深度-广度拟合，具有任务特定的交互；两者都受模型-任务能力门控。受这些规律启发，我们提出BaSE（基于老虎机的自进化），一种多臂老虎机，它在并行轨迹间分配LLM调用。在不改变模型、提示或评估器的情况下，BaSE在8个（模型，任务）单元上比最强的岛屿协议基线平均适应度提高12.3%，在方差高的设置上增益最大：仅通过分配实现可靠性提升。

英文摘要

LLM-guided evolutionary search (Evolve systems) has reached state-of-the-art results on mathematical and combinatorial tasks, yet most existing systems report only the best of many runs and leave the run-to-run distribution undocumented. We ask how a fixed budget of LLM calls should be allocated, and how reliably a single run reaches the reported numbers. Sweeping the depth-breadth grid over five models and three tasks, we identify two empirical regularities: a fitness-compute envelope along which capability ordering largely collapses on effective FLOPs, and a bilinear depth-breadth fit with task-specific interaction; both are gated by model-task capability. Motivated by these regularities, we propose BaSE (Bandit-based Self-Evolving), a multi-armed bandit that allocates LLM calls across parallel trajectories. Without changing the model, prompt, or evaluator, BaSE improves mean fitness by 12.3% over the strongest island-protocol baseline across 8 (model, task) cells, with the largest gains on high-variance settings: a reliability gain from allocation alone.

URL PDF HTML ☆

赞 0 踩 0

2605.29146 2026-06-01 cs.CL cs.AI 版本更新

SafeRx-Agent: A Knowledge-Grounded Multi-Agent Framework for Safe and Explainable Medication Recommendation

SafeRx-Agent: 基于知识的多智能体框架用于安全且可解释的药物推荐

Xinyu Wang, Hanwei Wu, Zhenghan Tai, Sicheng Lyu, Qincheng Lu, Ziyu Zhao, Jijun Chi, Jingrui Tian, Xiao-Wen Chang, Ziyang Song

发表机构 * McGill University（麦吉尔大学）； McMaster University（麦马斯特大学）； University of Toronto（多伦多大学）； Ohio University（俄亥俄大学）

AI总结提出SafeRx-Agent，一种基于知识的多智能体框架，通过患者上下文、外部临床知识和安全验证来推荐可追溯的药物集合，在MIMIC-III和MIMIC-IV数据集上提高了细粒度药物预测准确性，同时控制了药物相互作用、禁忌症和药物集合大小。

详情

AI中文摘要

药物推荐预测患者就诊时的用药，但现有方法仍面临两个关键挑战。在模型层面，传统药物推荐方法仅预测结构化的药物代码，证据基础有限，而LLM智能体可以利用更丰富的临床上下文，但可能缺乏安全验证和可追溯性。在任务层面，现有基准通常使用宽泛的药物类别，忽略了亚组级别的安全性差异，可能导致风险高估。我们引入了基于第四级ATC代码生成的第一个细粒度药物推荐设置。我们提出了安全处方智能体（SafeRx-Agent），一种基于知识的多智能体框架，利用患者上下文、外部临床知识和安全验证来推荐可追溯的药物集合。在MIMIC-III和MIMIC-IV数据集上的实验结果表明，SafeRx-Agent提高了细粒度药物预测准确性，同时控制了药物相互作用、禁忌症和药物集合大小。

英文摘要

Medication recommendation predicts medications for patient visits, but existing methods still face two key challenges. At the model level, traditional drug recommendation methods only predict structured drug codes with limited evidence grounding, while LLM agents can use richer clinical context but may lack safety verification and traceability. At the task level, existing benchmarks often use broad medication categories, which ignore subgroup-level safety differences and can lead to risk overestimation. We introduce the first fine-grained medication recommendation setting based on fourth-level ATC code generation. We propose Safe Prescription Agent (SafeRx-Agent), a knowledge-grounded multi-agent framework that uses patient context, external clinical knowledge, and safety verification to recommend traceable medication sets. Experimental results on MIMIC-III and MIMIC-IV datasets show that SafeRx-Agent improves fine-grained medication prediction accuracy while controlling drug interactions, contraindications, and medication set size.

URL PDF HTML ☆

赞 0 踩 0

2605.28918 2026-06-01 cs.LG cs.AI cs.IR 版本更新

When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL

当LLM奖励设计失败时：面向诊断的稀疏结构化RL改进

Youting Wang, Yuan Tang, Bowen Liu, Xuan Liu, Dingyan Shang

AI总结针对稀疏结构化强化学习任务，提出诊断驱动的迭代奖励函数改进方法，通过训练诊断和失败模式分类指导修正，显著提升MiniGrid任务成功率。

详情

AI中文摘要

对于具有语义奖励函数接口的稀疏结构化强化学习任务，LLM生成的奖励塑造更适合被视作调试而非一次性生成。我们使用MiniGrid作为核心评估、MuJoCo作为边界压力测试，研究PPO训练的智能体。我们的审计发现两种主要的一次性失败模式——奖励泛滥和语义/API误解，以及一种较罕见的弱塑造情况。我们提出诊断驱动的迭代改进，其中训练诊断和失败模式分类法指导有针对性的奖励函数修订。改进使DoorKey-8x8从2.3%提升至97.6%，KeyCorridor从31.2%提升至86.7%，但种子间方差较高。控制实验表明这些提升并非来自重试或额外训练：仅指标重新提示导致大幅下降，而静态词汇控制恢复了大部分差距（87.6%；70.7%），表明分类法提示是主要机制，动态标签仅提供部分孤立的增量证据。预算匹配和Best-of-3比较将改进与选择和训练时间效应分离。组件移除测试、敏感性分析以及针对作者标签的审计为调试解释提供了汇聚证据，同时揭示了校准限制。连续控制结果显示了边界：基于成功的诊断可能在密集奖励的 locomotion 中误报，而回报趋势反馈移除了一个假阳性机制但未带来稳健提升。低调用协议是与基于种群的奖励搜索的成本对比，而非基准比较。在四个交叉方差设计环境中，点估计表明当LLM奖励函数方差占主导时收益更大，但bootstrap区间较宽。该方法局限于PPO下具有可靠接口的稀疏结构化任务；event_text等字段可能有益、有害或中性。

英文摘要

For sparse, structured reinforcement-learning tasks with semantic reward-function interfaces, LLM-generated reward shaping is better framed as debugging than one-shot generation. We study PPO-trained agents using MiniGrid as core evaluation and MuJoCo as boundary stress test. Our audit finds two dominant one-shot failure modes -- reward flooding and semantic/API misunderstanding -- plus a rarer weak-shaping case. We propose diagnostic-driven iterative refinement, where training diagnostics and a failure-mode taxonomy guide targeted reward-function revision. Refinement improves DoorKey-8x8 from 2.3% to 97.6% and KeyCorridor from 31.2% to 86.7% with high seed-to-seed variance. Controls show these gains are not from retrying or extra training: metrics-only re-prompting yields large drops, while a static-vocabulary control recovers much of the gap (87.6%; 70.7%), showing the taxonomy prompt is a major mechanism and dynamic labels provide only partially isolated incremental evidence. Budget-matched and Best-of-3 comparisons separate refinement from selection and training-time effects. Component-removal tests, sensitivity analyses, and an audit against author labels provide converging evidence for the debugging interpretation while revealing calibration limits. Continuous-control results show the boundary: success-based diagnostics can misfire in dense-reward locomotion, and return-trend feedback removes one false-positive mechanism without robust gains. The low-call protocol is a cost contrast with population-based reward search, not a benchmark comparison. In four crossed-variance-design environments, point estimates suggest larger gains when LLM reward-function variance dominates but bootstrap intervals are wide. The method is bounded to sparse structured tasks with reliable interfaces under PPO; fields like event_text may help, hurt, or be neutral.

URL PDF HTML ☆

赞 0 踩 0

2605.28916 2026-06-01 astro-ph.IM cs.AI cs.HC 版本更新

First head-to-head comparison of agentic AI applied to the analysis of simulated data of the Einstein Telescope

应用于爱因斯坦望远镜模拟数据分析的智能体AI首次头对头比较

Gianluca Inguglia

发表机构 * Anthropic ； OpenAI

AI总结本文首次直接比较了Claude Code和Codex两种智能体AI系统在无人干预下自主执行引力波数据分析管线的行为、科学结果和计算成本，揭示了速度与可审计性、指令解释差异等关键问题。

Comments Version 2; includes the report autonomoulsy written in PRD style by agentic AI systems as supplemental material

详情

AI中文摘要

我们报告了两种最先进的智能体AI系统——Claude Code (Anthropic) 和 Codex (OpenAI) 的比较，它们被要求在共享计算基础设施上无人干预地自主执行一个简单的端到端引力波数据分析管线。该管线包括：从爱因斯坦望远镜模拟噪声中估计功率谱密度、生成几何模板库、对100个双黑洞信号注入进行匹配滤波恢复、自动生成结果，以及在大语言模型辅助下制作以Physical Review D格式排版的手稿。两个智能体均收到相同的书面规范和相同的计算资源。实验进行了两次：第一次使用不切实际的高信噪比注入，第二次将信号重新缩放到物理合理的信噪比范围。两次实验的科学结果均收敛。然而，智能体表现出截然不同的行为和计算成本：Claude Code在约3.4分钟内完成管线，但存在对规范的无声偏差；而Codex需要约16分钟，经历了明确的自我纠正重启，包括对匹配滤波内循环进行未经请求的性能优化。自主生成的手稿在长度、细节和质量上也存在差异。在第二次实验中，对信噪比范围指令解释的细微差异导致了真正的科学分歧：Claude Code无声地重新解释了指令，而Codex严格遵循了规范。我们讨论了这些行为差异（例如速度与可审计性、无声与透明的错误处理、指令解释以及多模型管线中中间数据表示的关键性）对智能体AI在科学计算工作流中部署的影响。

英文摘要

We report a comparison of two state-of-the-art agentic AI systems, Claude Code (Anthropic) and Codex (OpenAI), tasked with autonomously executing a simple end-to-end gravitational wave data analysis pipeline on a shared computing infrastructure without human intervention. The pipeline comprises power spectral density estimation from raw Einstein Telescope simulated noise, geometric template bank generation, matched filter recovery of 100 binary black hole signal injections, automated results generation, and large language model-assisted production of a manuscript formatted in the style of Physical Review D. Both agents received identical written specifications and identical compute resources. The experiment was run twice: a first run with unrealistically loud injections, and a second run with signals rescaled to a physically motivated SNR range. The scientific results converged in both runs. However, the agents exhibited substantially different behaviors and computational costs: Claude Code completed the pipeline in ~3.4 minutes with silent deviations from the specification, while Codex required ~16 minutes across explicit self-correcting restarts, including an unsolicited performance optimization of the matched filter inner loop. The autonomously generated manuscripts also diverged in length, details, and quality. In the second run, a subtle difference in the interpretation of the SNR range instruction led to a genuine scientific divergence: Claude Code silently reinterpreted the instructions, while Codex followed the specification literally. We discuss the implications of these behavioral differences, such as speed versus auditability, silent versus transparent error handling, instruction interpretation, and the criticality of intermediate data representations in multi-model pipelines, for the deployment of agentic AI in scientific computing workflows.

URL PDF HTML ☆

赞 0 踩 0

2605.28836 2026-06-01 cs.CL cs.AI 版本更新

No Reader Left Behind: Multi-Agent Summaries Everyone Can Understand

不让任何读者掉队：人人能理解的多智能体摘要

Jimin Jung, MyoungJin Kim, Jaehyung Seo, Heuiseok Lim

发表机构 * Department of Computer Science and Engineering, Korea University（韩国大学计算机科学与工程系）； Department of Computer Science and Engineering, Konkuk University（konkuk大学计算机科学与工程系）

AI总结提出NRLB多智能体框架，通过模拟三类读者群体并结合模板规划与迭代优化，生成既忠实又易于理解的平实语言摘要。

详情

AI中文摘要

美国的《平实语言法案》要求政府文件使用清晰、简单的语言，以便公众易于理解，但现有的摘要系统难以应对普通读者中多样化的语言和认知障碍。我们提出了NRLB（不让任何读者掉队），一个用于平实语言摘要的多智能体框架，它模拟了三类代表性读者群体：小学生读者、非母语读者和注意力缺陷读者。NRLB结合了基于模板的规划与迭代的、面向读者的优化，能够系统地检测和解决难懂术语、缺失上下文和令人困惑的句子。在多个数据集上的评估显示，在保持事实准确性的同时，可读性持续提升。人工评估进一步验证了NRLB的效果，标注者偏好率在55%到76%之间，突显了NRLB在生成既忠实于原文又广泛适用于公众的平实语言摘要方面的潜力。

英文摘要

The Plain Writing Act in the United States requires government documents to be accessible in clear and simple language that the general public can easily understand, yet existing summarization systems struggle to address diverse linguistic and cognitive barriers among general readers. We present NRLB (No Reader Left Behind), a multi-agent framework for plain language summarization that simulates three representative reader groups: elementary school student readers, non-native readers, and readers with attention deficits. NRLB combines template-based planning with iterative, reader-oriented refinement, enabling systematic detection and resolution of difficult terms, missing contexts, and confusing sentences. Evaluations across multiple datasets demonstrate consistent improvements in readability while preserving factual accuracy. Human evaluation further validates NRLB's impact, with annotator preference rates ranging from 55% to 76%, highlighting NRLB's potential to produce plain language summaries that are both faithful to the source and broadly accessible to the general public.

URL PDF HTML ☆

赞 0 踩 0

2605.25134 2026-06-01 cs.LG cs.AI 版本更新

Theoretical Analysis of Sparse Optimization with Reparameterization, Weight Decay, and Adaptive Learning Rate

重参数化、权重衰减和自适应学习率下稀疏优化的理论分析

Huangyu Xu, Jingqin Yang, Qianqian Xu, Jiaye Teng

发表机构 * State Key Laboratory of AI Safety, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China（人工智能安全国家重点实验室，计算技术研究所，中国科学院，北京，中国）； School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, China（中国科学院大学计算机科学与技术学院，北京，中国）； Beijing Academy of Artificial Intelligence (BAAI), Beijing, China（北京人工智能研究院（BAAI），北京，中国）； IIIS, Tsinghua University, Beijing, China（清华大学人工智能院，北京，中国）； School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China（上海财经大学统计与管理学院，上海，中国）； Institute of Data Science and Statistics, Shanghai University of Finance and Economics, Shanghai, China（上海财经大学数据科学与统计研究所，上海，中国）

AI总结针对稀疏优化中的不稳定问题，提出基于重参数化、权重衰减和自适应学习率的ReWA方法，通过改善优化景观实现比ℓ1正则化更好的稀疏性，同时保持测试精度。

Comments 32 pages, 5 figures. Submitted to ICML 2026

2603.27052 2026-06-01 cs.CY cs.AI 版本更新

Multi-Level Barriers to Generative AI Adoption Across Disciplines and Professional Roles in Higher Education

高等教育中跨学科与职业角色采用生成式AI的多层次障碍

Jianhua Yang, Kerem Öge, Adrian von Mühlenen, Abdullah Bilal Akbulut, Tanya Suzanne Carey, Chidi Okorro

发表机构 * Warwick Manufacturing Group, The University of Warwick（沃里克大学制造集团）； Department of Politics and International Studies, The University of Warwick（沃里克大学政治与国际研究系）； Department of Psychology, The University of Warwick（沃里克大学心理学系）； Birmingham Business School, The University of Birmingham（伯明翰大学商学院）

AI总结通过对一所罗素集团大学272名学术与专业服务人员的多方法调查分析，揭示了非STEM学术人员主要报告与学术诚信相关的伦理文化障碍，而STEM和专业服务人员则强调制度、治理和基础设施约束，表明GenAI采用障碍深嵌于组织生态系统和认知规范中。

Comments 21 pages, 3 figures, 6 tables

详情

DOI: 10.3390/educsci16060838
Journal ref: Educ. Sci. 2026, 16(6), 838;

AI中文摘要

生成式人工智能（GenAI）正在迅速重塑高等教育，但不同学科和机构角色间采用GenAI的障碍仍未得到充分探索。现有文献常将采用障碍归因于个体层面的因素，如感知有用性和易用性。本研究转而调查这些障碍是否由结构产生。通过对一所罗素集团大学的272名学术和专业服务人员进行多方法调查分析，我们考察了学科背景和机构角色如何塑造感知障碍。通过整合多项逻辑回归（MLR）、结构方程模型（SEM）和开放式回答的语义聚类，我们超越了描述性叙述，提供了GenAI采用的多层次解释。我们的发现揭示了清晰、系统的差异：非STEM学术人员主要报告与学术诚信相关的伦理和文化障碍，而STEM和专业服务人员则不成比例地强调制度、治理和基础设施约束。我们得出结论，GenAI采用障碍深嵌于组织生态系统和认知规范中，表明大学必须超越通用培训，开发针对特定角色的治理和支持框架。

英文摘要

Generative Artificial Intelligence (GenAI) is rapidly reshaping higher education, yet barriers to its adoption across different disciplines and institutional roles remain underexplored. Existing literature frequently attributes adoption barriers to individual-level factors such as perceived usefulness and ease of use. This study instead investigates whether such barriers are structurally produced. Drawing on a multi-method survey analysis of 272 academic and professional services (PSs) staff at a Russell Group university, we examine how disciplinary contexts and institutional roles shape perceived barriers. By integrating multinomial logistic regression (MLR), structural equation modelling (SEM), and semantic clustering of open-ended responses, we move beyond descriptive accounts to provide a multi-level explanation of GenAI adoption. Our findings reveal clear, systematic differences: non-STEM academics primarily report ethical and cultural barriers related to academic integrity, whereas STEM and PSs staff disproportionately emphasize institutional, governance, and infrastructure constraints. We conclude that GenAI adoption barriers are deeply embedded in organizational ecosystems and epistemic norms, suggesting that universities must move beyond generalized training to develop role-specific governance and support frameworks.

URL PDF HTML ☆

赞 0 踩 0

2602.10388 2026-06-01 cs.CL cs.AI 版本更新

超越静态不确定性：为概率时间序列建模时间不确定性动态

Yijun Wang, Qiyuan Zhuang, Larysa Marchanka, Xiu-Shen Wei

发表机构 * Department of Computer Science, Southeast University（东南大学计算机科学系）； Francisk Skorina Gomel State University（弗拉基米尔·斯科里纳戈梅尔州立大学）

AI总结提出VolDy-VAE模型，通过循环尺度路径捕捉波动率动态，实现时间一致的概率预测，提升准确性和不确定性校准。

详情

AI中文摘要

现实世界的时间序列表现出时间结构化的不确定性：波动率在动荡时期聚集，在稳定时期消散，并在结构断裂处突然变化。然而，许多概率预测方法将预测不确定性估计为独立的逐点量，忽略了波动率机制的演变和持续性。我们将这一缺失维度形式化为时间不确定性动态，并在波动率动态变分自编码器（VolDy-VAE）中实例化它，这是一个具有位置-尺度解码器的非自回归生成预测器。VolDy-VAE结合了用于均值预测的位置路径和用于传递和演化波动率隐藏状态的循环尺度路径，该状态从回溯窗口转移到预测范围，从而实现时间一致的预测方差。这种设计产生了一种自适应衰减机制：高方差观测值对位置估计的影响较小，而其不确定性通过明确的尺度预测得以保留。我们进一步提供了一个简化的机制转换分析，表明当方差已知或一致估计时，波动率感知目标简化为逆方差加权，而基于MSE的估计量保持无偏但统计效率较低。在九个基准上的实验表明，VolDy-VAE在保持低推理延迟的同时，提高了预测准确性和不确定性校准，优于竞争的概率和点预测基线；插件研究进一步表明，VolDy原理可以有益于GAN、Koopman VAE和Transformer骨干网络。源代码公开于https://github.com/wangyijunlyy/VolDy-VAE。

英文摘要

Real-world time series exhibit temporally structured uncertainty: volatility clusters in turbulent regimes, dissipates in stable periods, and shifts abruptly around structural breaks. Yet many probabilistic forecasting methods estimate predictive uncertainty as an independent per-step quantity, leaving the evolution and persistence of volatility regimes under-modeled. We formalize this missing dimension as temporal uncertainty dynamics and instantiate it in the Volatility Dynamics Variational Autoencoder (VolDy-VAE), a non-autoregressive generative forecaster with a location-scale decoder. VolDy-VAE combines a location path for mean prediction with a recurrent scale path that transfers and evolves a volatility hidden state from the look-back window to the forecasting horizon, enabling temporally coherent predictive variances. This design yields an adaptive attenuation mechanism: high-variance observations receive lower influence on the location estimate while their uncertainty is preserved through explicit scale predictions. We further provide a simplified regime-switching analysis showing that, when variances are known or consistently estimated, the volatility-aware objective reduces to inverse-variance weighting, whereas MSE-based estimators remain unbiased but statistically inefficient. Experiments on nine benchmarks show that VolDy-VAE improves forecasting accuracy and uncertainty calibration over competitive probabilistic and point-forecasting baselines while maintaining low inference latency; plug-in studies further indicate that the VolDy principle can benefit GAN, Koopman VAE, and Transformer backbones. The source code is publicly available at https://github.com/wangyijunlyy/VolDy-VAE.

URL PDF HTML ☆

赞 0 踩 0

2605.23937 2026-06-01 cs.AI cs.LG cs.LO math.OC 版本更新

BoxLitE: A Faithful Knowledge Base Embedding Based on Convex Optimization

BoxLitE：基于凸优化的忠实知识库嵌入

Bruno F. Lourenço, Hesham Morgan, Ana Ozaki, Aleksandar Pavlović, Emanuel Sallinger

发表机构 * The Institute of Statistical Mathematics, Japan（日本统计数学研究所）； TU Wien, Austria（奥地利技术大学维也纳分校）； University of Oslo, Norway（挪威奥斯陆大学）； University of Applied Sciences Campus Vienna, Austria（奥地利应用科学大学维也纳校区）

AI总结提出BoxLitE模型，通过凸优化实现DL-Lite$^{\mathcal{H}}$知识库的忠实嵌入，确保可满足知识库存在弱忠实模型。

Comments 28 pages. Full version of paper accepted to KR 2026 (23nd International Conference on Principles of Knowledge Representation and Reasoning). Track: KR meets Machine Learning and Explanation. Added a figure and some minor changes

详情

AI中文摘要

知识库（KB）嵌入旨在结合经典知识图谱嵌入在事实（ABox）中泛化信息的能力与本体语言（TBox）表示的概念知识。多位作者最近探索了将概念映射到向量空间中凸区域的思想。这对于表示TBox中通常存在的层次结构很有用，因为更一般的概念可以映射到更大的区域，包含与更具体概念相关的区域。然而，在实际学习任务中，凸性的能力很少被利用。在这里，我们引入了BoxLitE，一个针对DL-Lite$^{\mathcal{H}}$的KB嵌入模型，允许凸优化。我们证明，对于任何可满足的DL-Lite$^{\mathcal{H}}$ KB，存在一个BoxLitE嵌入，它是一个弱忠实模型。作为概念验证，我们展示了如何将KB嵌入任务表述为凸优化问题，以及如何获得具有这种理想忠实性属性的嵌入。

英文摘要

Knowledge base (KB) embeddings aim at combining the capability of classical knowledge graph embeddings to generalize the information present in facts, the ABox, with conceptual knowledge represented in an ontology language, the TBox. Several authors have recently explored the idea of mapping concepts to convex regions in a vector space. This is useful to represent hierarchies, typically present in TBoxes, since more general concepts can be mapped to larger regions, containing those regions associated with more specific concepts. However, the power of convexity is rarely leveraged during the actual learning tasks. Here, we introduce BoxLitE, a KB embedding model for DL-Lite$^{\mathcal{H}}$ that allows for convex optimization. We show that for any satisfiable DL-Lite$^{\mathcal{H}}$ KB, there is a BoxLitE embedding that is a weakly faithful model. As a proof of concept, we show how to formulate the KB embedding task as a convex optimization problem and how to obtain embeddings with such desirable faithfulness properties.

URL PDF HTML ☆

赞 0 踩 0

2605.21470 2026-06-01 cs.LG cs.AI 版本更新

Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling

面向延迟优化的Web Agent规划与调度的Agent即时编译

Caleb Winston, Ron Yifeng Wang, Azalia Mirhoseini, Christos Kozyrakis

发表机构 * Stanford University（斯坦福大学）

AI总结提出Agent即时编译系统，通过JIT-Planner生成代码计划、JIT-Scheduler探索并行化策略及不变式工具协议，显著降低延迟并提高准确性。

Comments Accepted at ICML 2026

详情

AI中文摘要

计算机使用Agent通过生成对浏览器中点击、输入、滚动等工具的调用序列，自动化自然语言指定的任务，例如“从Taco Bell订购最便宜的商品”。当前实现遵循顺序的获取截图-执行循环，每次迭代需要一次LLM调用，导致高延迟和因工具使用错误而频繁出错。我们提出了Agent即时编译系统，该系统将任务描述直接编译为可执行代码，其中可能包含LLM调用、工具调用和并行化。我们的方法包括三个组件：（1）JIT-Planner，生成多个代码计划，根据工具规范验证每个计划，并选择最小成本候选；（2）JIT-Scheduler，通过从学习到的延迟分布进行蒙特卡洛成本估计，探索并行化策略；（3）不变式强制工具协议，指定前置条件和后置条件要求，以减少工具使用错误率。在五个应用中，JIT-Planner相比Browser-Use实现了10.4倍的加速和28%的更高准确率，而JIT-Scheduler相比OpenAI CUA实现了2.4倍的加速和9%的更高准确率。

英文摘要

Computer-use agents (CUAs) automate tasks specified with natural language such as "order the cheapest item from Taco Bell" by generating sequences of calls to tools such as click, type, and scroll on a browser. Current implementations follow a sequential fetch-screenshot-execute loop where each iteration requires an LLM call, resulting in high latency and frequent errors from incorrect tool use. We present agent just-in-time (JIT) compilation, a system that compiles task descriptions directly into executable code that may include LLM calls, tool calls, and parallelization. Our approach comprises three components: (1) JIT-Planner, which generates multiple code plans, validates each against tool specifications, and selects the minimum-cost candidate; (2) JIT-Scheduler, which explores parallelization strategies via Monte Carlo cost estimation from learned latency distributions; and (3) an invariant-enforcing tool protocol specifying precondition and postcondition requirements to reduce the rate of incorrect tool use. Across five applications, JIT-Planner achieves $10.4\times$ speedup and 28$\%$ higher accuracy over Browser-Use, while JIT-Scheduler achieves $2.4\times$ speedup and 9\% higher accuracy over OpenAI CUA.

URL PDF HTML ☆

赞 0 踩 0

2605.21108 2026-06-01 cs.LG cs.AI 版本更新

Efficient Learning of Deep State Space Models via Importance Smoothing

通过重要性平滑高效学习深度状态空间模型

John-Joseph Brady, Nikolas Nusken, Yunpeng Li

发表机构 * Centre for Oral, Clinical and Translational Sciences, King's College London, London, United Kingdom（口腔、临床与转化科学中心，伦敦国王学院，伦敦，英国）； Department of Mathematics, King's College London, London, United Kingdom（数学系，伦敦国王学院，伦敦，英国）

AI总结提出并行变分蒙特卡洛（PVMC）方法，结合变分推断和序贯蒙特卡洛，实现深度状态空间模型在判别与生成任务上的高效训练，速度提升10倍。

Comments Accepted to the proceedings of ICML 2026

详情

AI中文摘要

潜在状态空间系统在统计建模中无处不在，当通过噪声观测时间序列时自然出现。然而，大规模训练深度状态空间模型（DSSM）仍然困难。训练DSSM出现了两种截然不同的策略。第一种是自编码DSSM，通过优化变分下界来训练生成模型。第二种是通过经典序贯蒙特卡洛（SMC）算法的输出进行反向传播。这些方法可以训练DSSM用于判别和生成任务，但其固有的顺序前向传递在现代硬件上扩展性差。我们提出了并行变分蒙特卡洛（PVMC），一种新的训练方法，它桥接了这些范式，并稳健地训练DSSM用于判别和生成任务。在一组基准实验中，PVMC达到或超过了最先进的性能，同时训练速度比最快的竞争SMC方法快10倍。

英文摘要

Latent state space systems are ubiquitous in statistical modelling, arising naturally when time series are observed through noisy measurements. However, training deep state space models (DSSMs) at scale remains difficult. Two largely distinct strategies have emerged for training DSSMs. The first, auto-encoding DSSMs, trains generative models by optimising a variational lower bound. The second backpropagates through the outputs of classical sequential Monte Carlo (SMC) algorithms. Such approaches can train DSSMs for both discriminative and generative tasks, but their inherently sequential forward passes scale poorly on modern hardware. We propose \emph{parallel variational Monte Carlo} (PVMC), a new training method that bridges these paradigms and robustly trains DSSMs for both discriminative and generative tasks. Across a set of benchmark experiments, PVMC matches or exceeds state-of-the-art performance while training $10\times$ faster than the fastest competing SMC-based approach.

URL PDF HTML ☆

赞 0 踩 0

2605.20873 2026-06-01 cs.AI cs.LG 版本更新

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

PlanningBench: 生成可扩展且可验证的规划数据以评估和训练大型语言模型

Ziliang Zhao, Zenan Xu, Shuting Wang, Hongjin Qian, Yan Lei, Minda Hu, Zhao Wang, Shihan Dou, Zhicheng Dou, Pluto Zhou

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China（中国人民大学人工智能学院 Gallagher 学校）； LLM Department, Hunyuan Team, Tencent（腾讯 Hunyuan 团队 LLM 部门）； Beijing Academy of Artificial Intelligence（北京人工智能研究院）； The Chinese University of Hong Kong（香港中文大学）

AI总结提出PlanningBench框架，通过约束驱动合成管道生成可扩展、多样化且可验证的规划数据，用于评估和训练LLMs，并验证其在提升规划能力上的有效性。

详情

AI中文摘要

规划是大型语言模型（LLMs）的一项基本能力，因为这类复杂任务要求模型将目标、约束、资源和长期后果协调成可执行且可验证的解决方案。然而，现有的规划基准通常将规划数据视为固定的实例集合，而非可控的生成目标。这限制了场景覆盖范围，将难度与表面代理而非结构来源挂钩，并且对可扩展生成、自动验证或面向规划的训练支持有限。我们引入PlanningBench，一个用于生成可扩展、多样化且可验证的规划数据的框架，既可用于评估也可用于训练。PlanningBench从真实规划场景出发，将实际工作流程抽象为包含30多种任务类型、子任务、约束族和难度因素的结构化分类体系。在该分类体系的指导下，一个约束驱动的合成管道实例化自包含的规划问题，具备自适应难度控制、质量过滤和实例级验证检查表。这将规划数据构建从固定基准收集转变为可控生成，同时保留现实任务基础。我们使用PlanningBench评估开源和闭源前沿LLMs，发现当前模型在耦合约束下仍难以生成完整解决方案。除评估外，在已验证的PlanningBench数据上进行强化学习可提升在未见规划基准和更广泛的指令遵循任务上的性能。进一步分析表明，确定性或明确指定的最优解提供了更清晰的奖励信号和更稳定的训练动态。总体而言，PlanningBench为诊断和提高LLMs中可泛化的规划能力提供了可控的规划数据来源。

英文摘要

Planning is a fundamental capability for large language models (LLMs) because such complex tasks require models to coordinate goals, constraints, resources, and long-term consequences into executable and verifiable solutions. Existing planning benchmarks, however, usually treat planning data as fixed collections of instances rather than controllable generation targets. This limits scenario coverage, ties difficulty to surface-level proxies rather than structural sources, and offers limited support for scalable generation, automatic verification, or planning-oriented training. We introduce PlanningBench, a framework for generating scalable, diverse, and verifiable planning data for both evaluation and training. PlanningBench starts from real planning scenarios and abstracts practical workflows into a structured taxonomy of more than 30 task types, subtasks, constraint families, and difficulty factors. Guided by this taxonomy, a constraint-driven synthesis pipeline instantiates self-contained planning problems with adaptive difficulty control, quality filtering, and instance-level verification checklists. This shifts planning data construction from fixed benchmark collection to controllable generation while preserving realistic task grounding. We use PlanningBench to evaluate open-source and closed-source frontier LLMs, and find that current models still struggle to produce complete solutions under coupled constraints. Beyond evaluation, reinforcement learning on verified PlanningBench data improves performance on unseen planning benchmarks and broader instruction-following tasks. Further analysis suggests that determinate or well-specified optimal solutions provide clearer reward signals and more stable training dynamics. Overall, PlanningBench provides a controllable source of planning data for diagnosing and improving generalizable planning abilities in LLMs.

URL PDF HTML ☆

赞 0 踩 0

2605.19806 2026-06-01 cs.CL cs.AI 版本更新

Chunking German Legal Code

德国法律文本的分块处理

Max Prior, Natalia Milanova, Andreas Schultz

发表机构 * Technical University of Munich（慕尼黑技术大学）

AI总结研究针对德国成文法，以德国民法典为基准语料库，比较多种分块策略在检索增强生成中的性能，发现基于法律固有结构（如章节、小节）的分块方法在召回率和计算效率上优于语义增强方法。

Comments Accepted at the Eigth Workshop on Automated Semantic Analysis of Information in Legal Texts co-located with the 21th International Conference on Artificial Intelligence and Law (ICAIL 2026)

详情

AI中文摘要

本文研究了针对德国成文法的检索增强生成的分块策略，以德国民法典作为结构化基准语料库。我们实现并比较了一系列分割方法，包括结构单元（章节、小节、句子、命题）、固定大小窗口、上下文分块、语义聚类、Lumber风格分块以及基于RAPTOR的层次检索。所有方法都在一个具有章节级黄金标签的法律问答数据集上进行评估，测量召回率、查询延迟、索引构建时间和存储需求。结果表明，与固有法律结构对齐的分块策略——特别是基于章节和小节的检索——实现了最高的召回率，而覆盖这种结构的更复杂方法表现更差。与上下文分块、RAPTOR和Lumber等LLM密集型技术相比，这些更简单的方法还提供了有利的计算效率。研究结果突出了语义丰富性与操作成本之间的关键权衡，并证明保留领域特定结构对于有效的法律信息检索至关重要。

英文摘要

This paper investigates chunking strategies for retrieval-augmented generation on German statutory law, using the German Civil Code as a structured benchmark corpus. We implement and compare a range of segmentation approaches, including structural units (sections, subsections, sentences, propositions), fixed-size windows, contextual chunking, semantic clustering, Lumber-style chunking, and RAPTOR-based hierarchical retrieval. All methods are evaluated on a legal question-answering dataset with section-level gold labels, measuring recall, query latency, index build time, and storage requirements. Results show that chunking strategies aligned with the inherent legal structure - particularly section and subsection - based retrieval-achieve the highest recall, while more complex approaches that override this structure perform worse. These simpler methods also offer favorable computational efficiency compared to LLM-intensive techniques such as contextual chunking, RAPTOR, and Lumber. The findings highlight a key trade-off between semantic enrichment and operational cost, and demonstrate that preserving domain-specific structure is critical for effective legal information retrieval.

URL PDF HTML ☆

赞 0 踩 0

2605.18807 2026-06-01 cs.LG cs.AI 版本更新

Block-Based Double Decoders

基于块的双解码器

Asher Labovich, Benjamin Bradley, Vanessa Alexander, Chaitanya Harsha

发表机构 * Brown University（布朗大学）

AI总结提出基于块的双解码器架构，利用双重因果块注意力掩码实现全损失监督和静态序列打包，结合解码器训练效率与编码器-解码器推理效率，在缩放定律实验中优于编码器-解码器并接近解码器模型，推理时KV缓存和每token计算减少至少2/3。

Comments 8 pages main, 13 pages total

2605.18803 2026-06-01 cs.LG cs.AI 版本更新

PROWL: Prioritized Regret-Driven Optimization for World Model Learning

PROWL: 基于优先遗憾驱动的世界模型学习优化

Ahmet H. Güzel, Jenny Seidenschwarz, Benjamin Graham, Jonathan Sadeghi, Jeffrey Hawke, Ilija Bogunovic

发表机构 * University College London AI Centre（伦敦大学学院人工智能中心）； Odyssey ； University of Basel（巴塞尔大学）

AI总结提出一种KL约束的对抗课程，通过训练策略暴露扩散世界模型的高误差轨迹并持续微调，结合优先对抗轨迹缓冲区，解决被动数据中罕见关键转换的鲁棒性问题。

详情

AI中文摘要

SpectralTrain：一种通用的高光谱图像分类框架

Meihua Zhou, Liping Yu, Xinyu Tong, Wai Kin Fung, Ruiguo Hu, Jiarui Zhao, Nan Wan

发表机构 * School of Medical Information, Wannan Medical University（皖南医学院信息学院）； University of Chinese Academy of Sciences（中国科学院大学）； The Chinese University of Hong Kong（香港中文大学）； Northeastern University（东北大学）

AI总结提出SpectralTrain通用训练框架，通过课程学习与基于PCA的光谱下采样提升高光谱图像分类效率，在多个数据集上实现2-7倍训练加速且精度损失小。

详情

AI中文摘要

高光谱图像（HSI）分类通常涉及大规模数据和计算密集的训练，这限制了深度学习模型在实际遥感任务中的部署。本研究引入SpectralTrain，一个通用的、与架构无关的训练框架，通过将课程学习（CL）与基于主成分分析（PCA）的光谱下采样相结合，提高学习效率。通过逐步引入光谱复杂性同时保留关键信息，SpectralTrain能够在显著降低计算成本的情况下高效学习光谱-空间模式。该框架独立于特定架构、优化器或损失函数，并与经典和最先进（SOTA）模型兼容。在三个基准数据集——Indian Pines、Salinas-A和新引入的CloudPatch-7上的大量实验表明，该框架在空间尺度、光谱特性和应用领域上具有很强的泛化能力。结果显示，训练时间一致减少2-7倍，精度变化取决于骨干网络。在云分类上的应用进一步揭示了其在气候相关遥感中的潜力，强调训练策略优化作为HSI模型中架构设计的有效补充。代码可在https://github.com/mh-zhou/SpectralTrain获取。

英文摘要

Hyperspectral image (HSI) classification typically involves large-scale data and computationally intensive training, which limits the practical deployment of deep learning models in real-world remote sensing tasks. This study introduces SpectralTrain, a universal, architecture-agnostic training framework that enhances learning efficiency by integrating curriculum learning (CL) with principal component analysis (PCA)-based spectral downsampling. By gradually introducing spectral complexity while preserving essential information, SpectralTrain enables efficient learning of spectral -- spatial patterns at significantly reduced computational costs. The framework is independent of specific architectures, optimizers, or loss functions and is compatible with both classical and state-of-the-art (SOTA) models. Extensive experiments on three benchmark datasets -- Indian Pines, Salinas-A, and the newly introduced CloudPatch-7 -- demonstrate strong generalization across spatial scales, spectral characteristics, and application domains. The results indicate consistent reductions in training time by 2-7x speedups with small-to-moderate accuracy deltas depending on backbone. Its application to cloud classification further reveals potential in climate-related remote sensing, emphasizing training strategy optimization as an effective complement to architectural design in HSI models. Code is available at https://github.com/mh-zhou/SpectralTrain.

URL PDF HTML ☆

赞 0 踩 0

2605.11946 2026-06-01 cs.AI 版本更新

Counterfactual Trace Auditing of LLM Agent Skills

LLM Agent技能的反事实痕迹审计

Xiaolin Zhou, Jinbo Liu, Li Li, Ryan A. Rossi, Xiyang Hu

发表机构 * Arizona State University（亚利桑那州立大学）； University of Southern California（南加州大学）； Adobe Research（Adobe研究）

AI总结提出反事实痕迹审计（CTA）框架，通过配对有无技能的Agent轨迹并生成结构化技能影响模式（SIP）注释，揭示技能对行为的重塑效应，弥补仅通过通过率评估的不足。

Comments Code and data are available at https://github.com/WillChow66/CTA.git

详情

AI中文摘要

大型语言模型Agent越来越多地配备Agent技能。当前对技能的评估方法仍然有限。大多数已部署的基准测试仅报告技能附加前后的通过率，将技能视为对Agent行为的黑盒更改。我们引入了反事实痕迹审计（CTA），这是一个衡量技能如何改变Agent行为的框架。CTA将每个带技能的Agent轨迹与同一任务上不带技能的对应轨迹配对，将两条轨迹分割成目标导向的阶段，对齐这些阶段，并输出结构化的技能影响模式（SIP）注释。这些注释描述了技能的行为效果，而不仅仅是任务结果。我们在SWE-Skills-Bench上使用Claude对49个软件工程任务实例化了CTA。由此产生的审计揭示了一个明显的评估差距。通过率平均仅变化+0.3个百分点，表明总体效果很小。然而，CTA在相同的配对轨迹中识别出522个SIP实例，表明即使在通过率几乎不变的情况下，技能也显著重塑了Agent行为。审计还分离了通过率无法检测到的几种反复出现的效果，包括字面模板复制、偏离任务的人工制品创建、过度规划和任务恢复。出现了三个发现。首先，高基线任务包含了大多数观察到的技能效果，尽管它们的通过率已经饱和，因此无法反映这些效果。其次，基线性能适中的任务显示出最大的可恢复增益，但通常以显著更高的令牌成本为代价。第三，主导的SIP类型可以通过基线桶识别：表面锚定在最高任务中最常见，边缘案例提示在中档和最低任务中最常见。这些规律将非正式的故障模式观察转化为可重复的行为测量。

英文摘要

Large Language Model agents are increasingly augmented with agent skills. Current evaluation methods for skills remain limited. Most deployed benchmarks report only pass rate before and after a skill is attached, treating the skill as a black box change to agent behavior. We introduce Counterfactual Trace Auditing (CTA), a framework for measuring how a skill changes agent behavior. CTA pairs each with skill agent trace with a without skill counterpart on the same task, segments both traces into goal directed phases, aligns the phases, and emits structured Skill Influence Pattern (SIP) annotations. These annotations describe the behavioral effect of a skill rather than only its task outcome. We instantiate CTA on SWE-Skills-Bench with Claude across 49 software engineering tasks. The resulting audit reveals a clear evaluation gap. Pass rate changes by only +0.3 percentage points on average, suggesting little aggregate effect. Yet CTA identifies 522 SIP instances across the same paired traces, showing that the skills substantially reshape agent behavior even when pass rate is nearly unchanged. The audit also separates several recurring effects that pass rate cannot detect, including literal template copying, off task artifact creation, excess planning, and task recovery. Three findings emerge. First, high baseline tasks contain most of the observed skill effects, although their pass rate is already saturated and therefore cannot reflect those effects. Second, tasks with moderate baseline performance show the most recoverable gain, but often at substantially higher token cost. Third, the dominant SIP type can be identified by baseline bucket: surface anchoring is most common on ceiling tasks and edge-case prompting is most common on mid-range and floor tasks. These regularities turn informal failure mode observations into reproducible behavioral measurements.

URL PDF HTML ☆

赞 0 踩 0

2605.11336 2026-06-01 cs.IR cs.AI cs.CL cs.HC 版本更新

Much of Geospatial Web Search Is Beyond Traditional GIS

大部分地理空间网络搜索超越了传统GIS

Ilya Ilyankou, Stefano Cavazzi, James Haworth

发表机构 * SpaceTimeLab（空间时间实验室）； Department of Civil, Environmental, and Geomatic Engineering（土木、环境与测绘工程系）； UCL（伦敦大学学院）

AI总结通过密集句子嵌入、SetFit分类器和密度聚类，在MS MARCO语料库中发现18%的查询具有地理空间性质，并构建了88类分类体系，揭示地理搜索以事务性和实用性查询为主，多数超出传统GIS和知识图谱范围。

详情

AI中文摘要

网络搜索查询涉及地点的频率远高于现有标注方案所表明的，然而地理空间网络搜索查询的景观——人们对地点的询问内容及其频率——在大规模上仍然缺乏特征描述。我们对包含101万条真实必应查询的完整MS MARCO语料库应用密集句子嵌入、轻量级SetFit分类器和基于密度的聚类，无需预先过滤地名或空间关键词，识别出181,827条地理空间查询（18.0%），几乎是原始标注中标记为“位置”的6.17%的三倍。由此产生的88个查询类别分类体系揭示，地理空间网络搜索以事务性和实用性查询为主：仅成本和价格就占地理空间查询的15.3%，几乎是整个自然地理主题规模的两倍。这些活动中的大部分——成本、营业时间、联系方式、天气、旅行推荐——超出了传统GIS和知识图谱旨在服务的范围。这些类别在它们所接受的答案类型上差异很大，从可由空间数据库或知识图谱回答的确定性查询，到需要生成式或实时系统的评估性或时间波动性查询。我们讨论了对混合检索架构以及大型语言模型中地理推理基准的启示。我们公开发布了标注数据集、分类器和分类体系。

英文摘要

Web search queries concern place far more often than existing labelling schemes suggest, yet the landscape of geospatial web search queries - what people ask of place, and how often - remains poorly characterised at scale. We apply dense sentence embeddings, a lightweight SetFit classifier, and density-based clustering to the full MS MARCO corpus of 1.01 million real Bing queries without prior filtering for toponyms or spatial keywords, identifying 181,827 geospatial queries (18.0%), nearly threefold the 6.17% labelled as Location in the original annotations. The resulting taxonomy of 88 query categories reveals that geospatial web search is dominated by transactional and practical lookups: costs and prices alone account for 15.3% of geospatial queries, nearly twice the size of the entire physical geography theme. Much of this activity - costs, opening hours, contact details, weather, travel recommendations - falls outside the scope of what traditional GIS and knowledge graphs are built to serve. The categories vary substantially in the kind of answer they admit, from deterministic lookups answerable from spatial databases or knowledge graphs to evaluative or temporally volatile queries that require generative or real-time systems. We discuss implications for hybrid retrieval architectures and for benchmarks of geographic reasoning in large language models. We openly release the labelled dataset, classifier, and taxonomy.

URL PDF HTML ☆

赞 0 踩 0

2605.11134 2026-06-01 cs.LG cs.AI 版本更新

Spurious Correlation Learning in Preference Optimization: Mechanisms, Consequences, and Mitigation via Tie Training

偏好优化中的虚假相关学习：机制、后果及通过平局训练的缓解方法

Christian Moya, Alex Semendinger, Guang Lin, Elliott Thornley

发表机构 * Department of Mathematics, Purdue University, West Lafayette IN, USA（普渡大学数学系）； School of Mechanical Engineering, Purdue University, West Lafayette IN, USA（普渡大学机械工程学院）； Massachusetts Institute of Technology, Cambridge MA, USA（麻省理工学院）

AI总结本文通过统一理论分析揭示了偏好优化（如DPO）中虚假相关学习的机制（均值虚假偏差和因果-虚假相关泄漏），证明其导致分布偏移下的不可逆脆弱性，并提出平局训练数据增强策略以选择性减少虚假学习。

Comments Proceedings of the 43rd International Conference on Machine Learning, 2026, Seoul, South Korea

详情

Journal ref: Proceedings of the 43rd International Conference on Machine Learning, 2026, Seoul, South Korea

AI中文摘要

偏好学习方法（如直接偏好优化DPO）已知会诱导对虚假相关的依赖，导致当前语言模型中的谄媚和长度偏差，并可能在未来系统中造成严重的目标泛化错误。在这项工作中，我们对此现象进行了统一的理论分析，描述了虚假学习的机制、其在部署中的后果以及一种可证明的缓解策略。聚焦于对数线性策略，我们展示了标准偏好学习目标通过两个渠道在总体水平上诱导对虚假特征的依赖：均值虚假偏差和因果-虚假相关泄漏。然后我们表明这种依赖造成了分布偏移的不可逆脆弱性：来自相同训练分布的更多数据无法减少模型对虚假特征的依赖。为了解决这个问题，我们提出了平局训练，一种使用平局（等效用偏好对）的数据增强策略，以引入数据驱动的正则化。我们证明了该方法选择性地减少虚假学习而不降低因果学习。最后，我们在对数线性模型上验证了我们的理论，并提供了实证证据，表明虚假学习机制和平局训练的益处均适用于神经网络和大语言模型。

英文摘要

Preference learning methods like Direct Preference Optimization (DPO) are known to induce reliance on spurious correlations, leading to sycophancy and length bias in today's language models and potentially severe goal misgeneralization in future systems. In this work, we provide a unified theoretical analysis of this phenomenon, characterizing the mechanisms of spurious learning, its consequences on deployment, and a provable mitigation strategy. Focusing on log-linear policies, we show that standard preference-learning objectives induce reliance on spurious features at the population level through two channels: mean spurious bias and causal-spurious correlation leakage. We then show that this reliance creates an irreducible vulnerability to distribution shift: more data from the same training distribution fails to reduce the model's dependence on spurious features. To address this, we propose tie training, a data augmentation strategy using ties (equal-utility preference pairs) to introduce data-driven regularization. We demonstrate that this approach selectively reduces spurious learning without degrading causal learning. Finally, we validate our theory on log-linear models and provide empirical evidence that both the spurious learning mechanisms and the benefits of tie training persist for neural networks and large language models.

URL PDF HTML ☆

赞 0 踩 0

2602.16165 2026-06-01 cs.LG cs.AI 版本更新

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents

HiPER: 具有显式信用分配的分层强化学习用于大型语言模型智能体

Jiangweizhi Peng, Yuanxin Liu, Ruida Zhou, Charles Fleming, Zhaoran Wang, Alfredo Garcia, Mingyi Hong

发表机构 * University of Minnesota ； Northwestern University ； Amazon AGI ； Texas A\&M University ； Cisco Research

AI总结针对稀疏奖励长程任务中LLM智能体信用分配困难的问题，提出HiPER分层规划-执行框架，通过分层优势估计（HAE）在规划和执行层面显式分配信用，在ALFWorld和WebShop上达到97.4%和83.3%的成功率。

Comments ICML 2026

详情

AI中文摘要

将LLM训练为用于多轮决策的交互式智能体仍然具有挑战性，特别是在具有稀疏和延迟奖励的长程任务中，智能体必须在获得有意义的反馈之前执行一系列扩展的动作。大多数现有的强化学习方法将LLM智能体建模为在单一时间尺度上运行的扁平策略，每轮选择一个动作。在稀疏奖励设置中，这种扁平策略必须跨整个轨迹传播信用，而没有显式的时间抽象，这常常导致不稳定的优化和低效的信用分配。我们提出HiPER，一种新颖的分层规划-执行强化学习框架，明确地将高层规划与低层执行分开。HiPER将策略分解为一个提出子目标的高层规划器和一个在多个动作步骤中执行这些子目标的低层执行器。为了将优化与此结构对齐，我们引入了一种称为分层优势估计（HAE）的关键技术，该技术在规划和执行层面仔细分配信用。通过聚合每个子目标执行过程中的回报并协调两个层面的更新，HAE提供了无偏的梯度估计器，并且与扁平广义优势估计相比，可证明地减少了方差。实验上，HiPER在具有挑战性的交互式基准测试中达到了最先进的性能，在ALFWorld上达到97.4%的成功率，在WebShop上达到83.3%的成功率（使用Qwen2.5-7B-Instruct，分别比先前最佳方法高出6.6%和8.3%），在需要多个依赖子任务的长程任务上尤其取得了巨大收益。这些结果突显了显式层次分解对于多轮LLM智能体的可扩展RL训练的重要性。

英文摘要

Training LLMs as interactive agents for multi-turn decision-making remains challenging, particularly in long-horizon tasks with sparse and delayed rewards, where agents must execute extended sequences of actions before receiving meaningful feedback. Most existing reinforcement learning (RL) approaches model LLM agents as flat policies operating at a single time scale, selecting one action at each turn. In sparse-reward settings, such flat policies must propagate credit across the entire trajectory without explicit temporal abstraction, which often leads to unstable optimization and inefficient credit assignment. We propose HiPER, a novel Hierarchical Plan-Execute RL framework that explicitly separates high-level planning from low-level execution. HiPER factorizes the policy into a high-level planner that proposes subgoals and a low-level executor that carries them out over multiple action steps. To align optimization with this structure, we introduce a key technique called hierarchical advantage estimation (HAE), which carefully assigns credit at both the planning and execution levels. By aggregating returns over the execution of each subgoal and coordinating updates across the two levels, HAE provides an unbiased gradient estimator and provably reduces variance compared to flat generalized advantage estimation. Empirically, HiPER achieves state-of-the-art performance on challenging interactive benchmarks, reaching 97.4\% success on ALFWorld and 83.3\% on WebShop with Qwen2.5-7B-Instruct (+6.6\% and +8.3\% over the best prior method), with especially large gains on long-horizon tasks requiring multiple dependent subtasks. These results highlight the importance of explicit hierarchical decomposition for scalable RL training of multi-turn LLM agents.

URL PDF HTML ☆

赞 0 踩 0

2605.08145 2026-06-01 cs.CV cs.AI cs.LG 版本更新

Self-Captioning Multimodal Interaction Tuning: Amplifying Exploitable Redundancies for Robust Vision Language Models

自描述多模态交互调优：放大可利用冗余以实现鲁棒的视觉语言模型

Yuriel Ryan, Hei Man Ip, Adriel Kuek, Paul Pu Liang, Roy Ka-Wei Lee

发表机构 * Singapore University of Technology and Design（新加坡科技设计大学）； DSO National Laboratories（国防部国家实验室）； Massachusetts Institute of Technology（麻省理工学院）

AI总结针对视觉语言模型中的幻觉和鲁棒性问题，提出自描述多模态交互调优方法，通过放大模态间冗余信息来补偿受损模态，并设计多模态交互门机制将独特交互转化为冗余交互，实验表明该方法可减少38.3%的视觉诱导错误并提升16.8%的一致性。

Comments Accepted to ICML 2026. Code: https://github.com/yurielryan/Multimodal-Interaction-Tuning

详情

AI中文摘要

当前的视觉语言模型在面对模糊或受损模态时存在幻觉和鲁棒性问题。我们假设这些问题可以通过利用模态间的共享信息来补偿受损模态得到解决。为此，我们分析了多模态交互——模态提供的冗余（共享）、独特（排他）和协同（涌现）任务相关信息——以确定它们对模型可靠性的影响。具体来说，放大冗余交互将增加这种可利用的共享信息以解决这些问题；然而，现代指令数据集通常消除冗余以优先考虑视觉定位。我们通过一个自描述工作流弥合这一差距，该工作流包含一个 extsc{多模态交互门}：一种将独特交互转化为冗余交互的机制。我们的发现表明，增加冗余可以减少38.3%的视觉诱导错误，并提高16.8%的一致性。

英文摘要

Current vision language models face hallucination and robustness issues against ambiguous or corrupted modalities. We hypothesize that these issues can be addressed by exploiting the shared information between modalities to compensate for the impaired one. To this end, we analyze multimodal interactions -- redundant (shared), unique (exclusive), and synergistic (emergent) task-relevant information provided by the modalities -- to determine their impacts on model reliability. Specifically, amplifying redundant interactions would increase this exploitable shared information to resolve these issues; yet, modern instruction datasets often eliminate redundancies to prioritize visual grounding. We bridge this gap through a self-captioning workflow featuring a \textsc{Multimodal Interaction Gate}: a mechanism to convert unique interactions into redundant interactions. Our findings suggest that increasing redundancy can reduce visual induced errors by 38.3\% and improve consistency by 16.8\%.

URL PDF HTML ☆

赞 0 踩 0

2605.06831 2026-06-01 cs.LG cs.AI 版本更新

Why DDIM Hallucinates More Than DDPM: A Theoretical Analysis of Reverse Dynamics

为什么DDIM比DDPM更容易产生幻觉：反向动力学的理论分析

Muhammad H. Ashiq, Samanyu Arora, Abhinav N. Harish, Ishaan Kharbanda, Hung Yun Tseng, Grigorios G. Chrysos

发表机构 * University of Wisconsin-Madison（威斯康星大学麦迪逊分校）

AI总结通过理论分析高斯混合目标下的反向ODE（DDIM）和SDE（DDPM），证明在临界时间τ后DDIM会卡在两个最近模式之间的线段上，而DDPM的随机性帮助其脱离该区域从而避免幻觉。

Comments Accepted in ICML

2605.06235 2026-06-01 cs.IR cs.AI 版本更新

OBLIQ-Bench: Exposing Overlooked Bottlenecks in Modern Retrievers with Latent and Implicit Queries

OBLIQ-Bench：揭示现代检索器中被忽视的瓶颈——潜在与隐式查询

Diane Tchuindjo, Devavrat Shah, Omar Khattab

发表机构 * Massachusetts Institute of Technology（麻省理工学院）

AI总结针对现有检索基准饱和但实际搜索问题未解决的现象，提出一类“倾斜查询”并构建OBLIQ-Bench基准，揭示检索与验证之间的不对称性，即推理LLM能可靠识别潜在相关性但检索管道无法召回多数相关文档。

详情

AI中文摘要

检索基准日益饱和，但我们认为高效搜索远非已解决的问题。我们识别出一类称为“倾斜”的查询，它们寻求实例化潜在模式的文档，例如找到所有表达隐式立场的推文、展示特定失败模式的聊天记录或匹配抽象场景的转录文本。我们研究了倾斜性产生的三种机制，并引入了OBLIQ-Bench，这是一套基于真实长尾语料库的五个倾斜搜索问题。OBLIQ-Bench揭示了检索与验证之间一个被忽视的不对称性：当相关文档被呈现时，推理LLM能可靠地识别潜在相关性，但即使是复杂的检索管道也无法首先召回大多数相关文档。我们希望OBLIQ-Bench能推动研究高效捕获大规模语料库中潜在模式和隐式信号的检索架构。

英文摘要

Retrieval benchmarks are increasingly saturating, but we argue that efficient search is far from a solved problem. We identify a class of queries we call oblique, which seek documents that instantiate a latent pattern, like finding all tweets that express an implicit stance, chat logs that demonstrate a particular failure mode, or transcripts that match an abstract scenario. We study three mechanisms through which obliqueness may arise and introduce OBLIQ-Bench, a suite of five oblique search problems over real long-tail corpora. OBLIQ-Bench exposes an overlooked asymmetry between retrieval and verification, where reasoning LLMs reliably recognize latent relevance whenever relevant documents are surfaced, but even sophisticated retrieval pipelines fail to surface most relevant documents in the first place. We hope that OBLIQ-Bench will drive research into retrieval architectures that efficiently capture latent patterns and implicit signals in large corpora.

URL PDF HTML ☆

赞 0 踩 0

2605.06137 2026-06-01 cs.CV cs.AI cs.LG 版本更新

Autoregressive Visual Generation Needs a Prologue

自回归视觉生成需要一个序幕

Bowen Zheng, Weijian Luo, Guang Yang, Colin Zhang, Tianyang Hu

发表机构 * The Chinese University of Hong Kong, Shenzhen（香港中文大学（深圳））； hi-Lab, Xiaohongshu Inc（小红书实验室）

AI总结提出Prologue方法，通过生成前置的序幕令牌来弥合自回归图像生成中的重建-生成差距，在不影响重建质量的前提下显著提升生成性能。

Comments Code: https://github.com/Zyriix/prologue Demo: https://huggingface.co/spaces/Zyriix/prologue-demo

详情

AI中文摘要

AI中文摘要

从图像恢复相机参数和从新视角渲染场景在计算机视觉和图形学中被视为独立任务。当图像覆盖稀疏或姿态模糊时，这种分离会失效，因为每个任务依赖于另一个任务的输出。我们提出Rays as Pixels，一种视频扩散模型（VDM），学习视频和相机轨迹的联合分布。据我们所知，这是首个在单一框架内预测相机姿态并进行相机控制视频生成的模型。我们将每个相机表示为密集射线像素（raxels），这是一种与视频帧位于同一潜在空间的像素对齐编码，并通过解耦自交叉注意力机制联合去噪两者。一个训练好的模型处理三个任务：从视频预测相机轨迹、沿预定义轨迹从输入图像生成视频、以及从输入图像联合合成视频和轨迹。我们在姿态估计和相机控制视频生成上进行评估，并引入闭环自一致性测试，显示模型预测的姿态及其基于这些姿态的渲染结果一致。与Plücker嵌入的消融实验证实，将相机与视频共享潜在空间显著更有效。

英文摘要

Recovering camera parameters from images and rendering scenes from novel viewpoints have been treated as separate tasks in computer vision and graphics. This separation breaks down when image coverage is sparse or poses are ambiguous, since each task depends on what the other produces. We propose Rays as Pixels, a Video Diffusion Model (VDM) that learns a joint distribution over videos and camera trajectories. To our knowledge, this is the first model to predict camera poses and do camera-controlled video generation within a single framework. We represent each camera as dense ray pixels (raxels), a pixel-aligned encoding that lives in the same latent space as video frames, and denoise the two jointly through a Decoupled Self-Cross Attention mechanism. A single trained model handles three tasks: predicting camera trajectories from video, generating video from input images along a pre-defined trajectory, and jointly synthesizing video and trajectory from input images. We evaluate on pose estimation and camera-controlled video generation, and introduce a closed-loop self-consistency test showing that the model's predicted poses and its renderings conditioned on those poses agree. Ablations against Plücker embeddings confirm that representing cameras in a shared latent space with video is subtantially more effective.

URL PDF HTML ☆

赞 0 踩 0

2604.18587 2026-06-01 cs.LG cs.AI cs.LO cs.PL 版本更新

Compile to Compress: Boosting Formal Theorem Provers by Compiler Outputs

编译以压缩：通过编译器输出提升形式定理证明器

Guchan Li, Rui Tian, Hongning Wang

发表机构 * Department of Computer Science and Technology, Tsinghua University, Beijing, China（清华大学计算机科学与技术系）

AI总结利用编译器将大量证明尝试压缩为结构化失败模式，提出一种学习-精炼框架，通过树搜索基于验证器反馈局部修正错误，在可比测试时预算下在PutnamBench上达到最先进性能。

详情

AI中文摘要

大型语言模型在形式定理证明中展现出显著潜力，但最先进的性能往往需要通过大量展开或扩展上下文窗口来实现令人望而却步的测试时计算。在这项工作中，我们通过利用形式验证中的一种信息结构来解决这一可扩展性瓶颈：观察到编译器将大量不同的证明尝试空间映射到一组紧凑的结构化失败模式。我们引入了一个学习-精炼框架，利用这种压缩来执行高效的学习和证明探索。我们执行树搜索，根据明确的验证器反馈局部修正错误，从而避免了积累长历史证明尝试的相关成本。大量评估表明，我们的方法在不同规模上持续增强了基础证明器的推理能力。值得注意的是，在可比较的测试时预算下，我们的方法在PutnamBench上达到了公开报告的约80亿和约320亿参数模型中的最先进性能，为下一代验证器引导推理提供了一种可扩展的范式。

英文摘要

Large language models (LLMs) have demonstrated significant potential in formal theorem proving, yet state-of-the-art performance often necessitates prohibitive test-time compute via massive roll-outs or extended context windows. In this work, we address this scalability bottleneck by exploiting an informative structure in formal verification: the observation that compilers map a vast space of diverse proof attempts to a compact set of structured failure modes. We introduce a learning-to-refine framework that leverages this compression to perform efficient learning and proof exploration. We perform tree search that corrects errors locally conditioned on explicit verifier feedback, thereby circumventing the costs associated with accumulating a long history of proof attempts. Extensive evaluations show that our method consistently amplifies the reasoning capabilities of base provers across varying scales. Notably, our approach achieves state-of-the-art performance on PutnamBench among publicly reported $\sim$8B and $\sim$32B parameter models under comparable test-time budgets, offering a scalable paradigm for next-generation verifier-guided reasoning.

URL PDF HTML ☆

赞 0 踩 0

2604.17551 2026-06-01 cs.LG cs.AI 版本更新

SVL: Goal-Conditioned Reinforcement Learning as Survival Learning

SVL：目标条件强化学习作为生存学习

Franki Nguimatsia Tiofack, Fabian Schramm, Théotime Le Hellard, Justin Carpentier

发表机构 * Inria（法国国家信息与自动化研究所）； École Normale Supérieure, PSL Research University, Paris, France（巴黎高等师范学院，PSL研究大学）

AI总结提出生存价值学习（SVL），通过将时间到目标建模为概率分布，将目标条件强化学习重构为生存学习问题，并利用危险模型进行最大似然估计，在离线基准上匹配或超越强基线方法。

Comments Accepted to the 43rd International Conference on Machine Learning, Seoul, South Korea

详情

AI中文摘要

标准的目标条件强化学习（GCRL）方法依赖于时间差分学习，由于自举可能导致不稳定和样本效率低下。虽然最近的工作探索了对比和监督公式以提高稳定性，但我们提出了一种概率替代方案，称为生存价值学习（SVL），通过将每个状态到目标的时间建模为概率分布，将GCRL重新定义为生存学习问题。这种结构化的分布蒙特卡洛视角产生了一个闭式恒等式，将目标条件价值函数表示为生存概率的折扣和，从而通过危险模型在事件和右删失轨迹上进行最大似然估计来实现价值估计。我们引入了三种实用的价值估计器，包括有限视界截断和两种分箱无限视界近似，以捕捉长视界目标。在离线GCRL基准上的实验表明，SVL与层次化演员结合，匹配或超越了强大的层次化TD和蒙特卡洛基线，在复杂的长视界任务上表现出色。网页和代码：https://simple-robotics.github.io/publications/survival-value-learning/

英文摘要

Standard approaches to goal-conditioned reinforcement learning (GCRL) that rely on temporal-difference learning can be unstable and sample-inefficient due to bootstrapping. While recent work has explored contrastive and supervised formulations to improve stability, we present a probabilistic alternative, called survival value learning (SVL), that reframes GCRL as a survival learning problem by modeling the time-to-goal from each state as a probability distribution. This structured distributional Monte Carlo perspective yields a closed-form identity that expresses the goal-conditioned value function as a discounted sum of survival probabilities, enabling value estimation via a hazard model trained via maximum likelihood on both event and right-censored trajectories. We introduce three practical value estimators, including finite-horizon truncation and two binned infinite-horizon approximations to capture long-horizon objectives. Experiments on offline GCRL benchmarks show that SVL combined with hierarchical actors matches or surpasses strong hierarchical TD and Monte Carlo baselines, excelling on complex, long-horizon tasks. Webpage and Code: https://simple-robotics.github.io/publications/survival-value-learning/

URL PDF HTML ☆

赞 0 踩 0

2604.16278 2026-06-01 cs.AI cs.CL cs.LG 版本更新

Learning to Reason with Insight for Informal Theorem Proving

学习在非形式定理证明中进行洞察推理

Yunhe Li, Hao Shi, Bowen Deng, Wei Wang, Mengzhe Ruan, Hanxu Hou, Zhongxiang Dai, Siyang Gao, Chao Wang, Shuang Qiu, Linqi Song

发表机构 * City University of Hong Kong（香港城市大学）； Tsinghua University（清华大学）； Ke Holdings Inc.（Ke控股公司）； Shenzhen University of Advanced Technology（深圳先进技术大学）； Chinese University of Hong Kong, Shenzhen（香港中文大学（深圳））

AI总结针对非形式定理证明中缺乏洞察（识别核心技巧）的瓶颈，提出统一训练框架DeepInsight，通过分层数据集、渐进式多阶段SFT和基于洞察的策略优化方法，显著提升大语言模型的数学推理能力。

详情

AI中文摘要

电路启发的具有统一神经动力学建模的高阶神经网络用于PDE求解与视觉感知

Tongfei Chen, Jingying Yang, Linlin Yang, Juan Zhang, Jinhu Lü, David Doermann, Chunyu Xie, Long He, Tian Wang, Guodong Guo, Baochang Zhang

发表机构 * Communication University of China（通信大学）； AI Research, Qihoo 360（360人工智能研究院，奇虎360）； Eastern Institute of Technology, Ningbo（宁波工程技术院）

AI总结提出电路启发的高阶神经网络（CHONN），通过基尔霍夫级联组合实现高阶动力学算子，在PDE求解、长期物理预测和ImageNet-1K识别中提升结构保真度和稳定性。

详情

AI中文摘要

深度网络通常依赖架构启发式方法来塑造表示演化，限制了其对由内在动力学支配的数据的建模能力。我们提出了电路启发的高阶神经网络（CHONN），这是一个模块化框架，将表示演化视为一个潜在势过程，并通过基尔霍夫启发的级联组合增加其有效阶数。单个基尔霍夫神经单元实现稳定的一阶更新，而串行组合的单元在一个块内形成高阶动力学算子。这种构造是可解释的、数值稳定的，并且与常见的神经骨干网络兼容。理论分析表明，级联单元诱导出端到端的高阶算子，控制实验证明块内高阶构造不同于通用深度堆叠，特别是在导数敏感度量上。在稳态算子学习、长期物理预测和ImageNet-1K识别中，CHONN提高了结构保真度、滚动稳定性和视觉表示学习。这些结果将高阶电路组合确定为神经动力学建模的一般原则。

英文摘要

Deep networks often rely on architectural heuristics to shape representation evolution, limiting their ability to model data governed by intrinsic dynamics. We present the Circuit-inspired High-Order Neural Network (CHONN), a modular framework that treats representation evolution as a latent potential process and increases its effective order through Kirchhoff-inspired cascade composition. A single Kirchhoff Neural Cell implements a stable first-order update, while serially composed cells form higher-order dynamical operators within one block. This construction is interpretable, numerically stable and compatible with common neural backbones. Theoretical analysis shows that cascaded cells induce end-to-end high-order operators, and controlled experiments demonstrate that intra-block high-order construction differs from generic depth stacking, especially on derivative-sensitive measures. Across steady-state operator learning, long-horizon physical forecasting and ImageNet-1K recognition, CHONN improves structural fidelity, rollout stability and visual representation learning. These results identify high-order circuit composition as a general principle for neural dynamics modeling.

URL PDF HTML ☆

赞 0 踩 0

2601.11702 2026-06-01 cs.HC cs.AI 版本更新

PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation

PASTA: 一种用于多策略AI合规评估的可扩展框架

Yu Yang, Ig-Jae Kim, Dongwook Yoon

发表机构 * The University of British Columbia（不列颠哥伦比亚大学）； Korea Institute of Science and Technology（韩国科学技术院）

AI总结提出PASTA框架，通过模型卡格式、策略规范化、LLM驱动的成对评估引擎和可解释界面，实现多策略AI合规的快速、低成本评估，专家评估显示与人类判断高度一致。

Comments 28 pages, 7 figures

详情

AI中文摘要

随着AI系统变得更加强大和普及，AI合规性变得越来越关键。然而，AI政策的快速扩张给缺乏政策专业知识的资源受限从业者带来了沉重负担。现有方法通常一次只处理一项政策，使得多政策合规成本高昂。我们提出了PASTA，一种可扩展的合规工具，集成了四项创新：（1）一种全面的模型卡格式，支持跨开发阶段的描述性输入；（2）一种策略规范化方案；（3）一个高效的基于LLM的成对评估引擎，具有成本节约策略；（4）一个通过合规热图和可操作建议提供可解释评估的界面。专家评估显示，PASTA的判断与人类专家高度一致（ρ≥.626）。该系统在约3美元的成本下，在两分钟内评估五项主要政策。一项用户研究（N=12）证实，从业者发现输出易于理解和可操作，为可扩展的自动化AI治理引入了一个新颖的框架。

英文摘要

AI compliance is becoming increasingly critical as AI systems grow more powerful and pervasive. Yet the rapid expansion of AI policies creates substantial burdens for resource-constrained practitioners lacking policy expertise. Existing approaches typically address one policy at a time, making multi-policy compliance costly. We present PASTA, a scalable compliance tool integrating four innovations: (1) a comprehensive model-card format supporting descriptive inputs across development stages; (2) a policy normalization scheme; (3) an efficient LLM-powered pairwise evaluation engine with cost-saving strategies; and (4) an interface delivering interpretable evaluations via compliance heatmaps and actionable recommendations. Expert evaluation shows PASTA's judgments closely align with human experts ($ρ\geq .626$). The system evaluates five major policies in under two minutes at approximately \$3. A user study (N = 12) confirms practitioners found outputs easy-to-understand and actionable, introducing a novel framework for scalable automated AI governance.

URL PDF HTML ☆

赞 0 踩 0

2603.22867 2026-06-01 cs.AR cs.AI cs.LG 版本更新

TRINE: A Token-Aware, Runtime-Adaptive FPGA Inference Engine for Multimodal AI

TRINE: 一种面向多模态AI的令牌感知、运行时自适应FPGA推理引擎

Hyunwoo Oh, Hanning Chen, Sanggeon Yun, Yang Ni, Suyeon Jang, Behnam Khaleghi, Fei Wen, Mohsen Imani

发表机构 * University of California, Irvine（加州大学尔湾分校）； Purdue University Northwest（北达科他州立大学）； Qualcomm（高通）； Samsung（三星）

AI总结针对多模态AI中不同计算/内存模式导致嵌入式平台实时性不足的问题，提出TRINE，一种无需重配置的单比特流FPGA加速器与编译器，通过统一层映射、运行时模式切换、令牌剪枝和依赖感知层卸载，实现端到端多模态推理，在Alveo U50和ZCU104上相比RTX 4090和Jetson Orin Nano分别降低延迟22.57倍和6.86倍，功耗仅20-21W。

Comments Accepted to DAC 2026

详情

AI中文摘要

混合ViT、CNN、GNN和Transformer NLP的多模态堆栈给嵌入式平台带来压力，因为它们的计算/内存模式不同，且硬实时目标几乎没有松弛空间。TRINE是一个单比特流FPGA加速器和编译器，无需重配置即可执行端到端多模态推理。层被统一为DDMM/SDDMM/SpMM，并映射到一个模式可切换的引擎上，该引擎在运行时在权重/输出驻留脉动阵列、1xCS SIMD和可路由加法树（RADT）之间切换，共享PE阵列。一个宽度匹配的两阶段top-k单元支持流内令牌剪枝，而依赖感知层卸载（DALO）在可重构处理单元上重叠独立内核以维持利用率。在Alveo U50和ZCU104上评估，TRINE相比RTX 4090和Jetson Orin Nano分别降低延迟高达22.57倍和6.86倍，功耗20-21W；仅令牌剪枝在ViT密集型流水线上可实现高达7.8倍加速，DALO贡献高达79%的吞吐量提升。采用int8量化，代表性任务的精度下降<2.5%，为统一的视觉、语言和图工作负载提供了最先进的延迟和能效——仅需一个比特流。

英文摘要

Multimodal stacks that mix ViTs, CNNs, GNNs, and transformer NLP strain embedded platforms because their compute/memory patterns diverge and hard real-time targets leave little slack. TRINE is a single-bitstream FPGA accelerator and compiler that executes end-to-end multimodal inference without reconfiguration. Layers are unified as DDMM/SDDMM/SpMM and mapped to a mode-switchable engine that toggles at runtime among weight/output-stationary systolic, 1xCS SIMD, and a routable adder tree (RADT) on a shared PE array. A width-matched, two-stage top-k unit enables in-stream token pruning, while dependency-aware layer offloading (DALO) overlaps independent kernels across reconfigurable processing units to sustain utilization. Evaluated on Alveo U50 and ZCU104, TRINE reduces latency by up to 22.57x vs. RTX 4090 and 6.86x vs. Jetson Orin Nano at 20-21 W; token pruning alone yields up to 7.8x on ViT-heavy pipelines, and DALO contributes up to 79% throughput improvement. With int8 quantization, accuracy drops remain <2.5% across representative tasks, delivering state-of-the-art latency and energy efficiency for unified vision, language, and graph workloads-in one bitstream.

URL PDF HTML ☆

赞 0 踩 0

2603.22744 2026-06-01 cs.AI 版本更新

从弱线索到真实身份：评估LLM代理中推理驱动的去匿名化

Myeongseob Ko, Jihyun Jeong, Sumiran Singh Thakur, Gyuhak Kim, Ruoxi Jia

发表机构 * Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg, VA, USA（弗吉尼亚理工学院计算机工程系）； Center for Advanced AI, Accenture（Accenture高级人工智能中心）

AI总结研究通过LLM代理结合分散的非识别线索与公开证据重建真实身份的能力，揭示了即使在没有明确标识符的情况下，代理也能以高成功率实现去匿名化，并提出了新的隐私评估维度。

Comments Accepted at ICML 2026

详情

Journal ref: ICML 2026

AI中文摘要

匿名化通常被认为一旦移除显式标识符就能保护隐私，因为重新识别历来需要专业知识、定制算法和手动验证。我们证明基于LLM的代理削弱了这一屏障：通过将分散的、单独非识别的线索与公开证据相结合，它们重建真实世界的身份，有时甚至在良性任务中也是如此。我们在三种场景中评估了这一风险——经典的链接事件、一个控制基准（\emph{InferLink}），该基准变化指纹类型、任务框架和攻击者知识，以及开放的人机交互痕迹。在Netflix奖去匿名化设置的最稀疏情况下，代理重建了79.2%的身份，而经典匹配基线为56.0%；在\emph{InferLink}上，即使没有明确的重新识别请求，代理也能链接个体，并且在给出请求时更频繁。在编辑过的人机交互痕迹中，代理通过将上下文线索与公开证据相互印证，进一步将匿名化档案解析为特定个体。这些发现表明，对代理系统的隐私评估不仅应衡量访问或披露了哪些信息，还应衡量可以推断出哪些身份。

英文摘要

Anonymization is often assumed to protect privacy once explicit identifiers are removed, because re-identification has historically required specialized expertise, tailored algorithms, and manual corroboration. We show that LLM-based agents weaken this barrier: by combining scattered, individually non-identifying cues with public evidence, they reconstruct real-world identities, sometimes even during benign tasks. We evaluate this risk across three settings -- classical linkage incidents, a controlled benchmark (\emph{InferLink}) that varies fingerprint type, task framing, and attacker knowledge, and open-ended human--AI interaction traces. In the sparsest regime of the Netflix Prize deanonymization setting, agents reconstruct 79.2\% of identities, against 56.0\% for a classical matching baseline; on \emph{InferLink}, they link individuals even without an explicit re-identification request, and more often once one is given. In redacted human--AI interaction traces, agents further resolve anonymized profiles to specific individuals by corroborating contextual cues with public evidence. These findings suggest that privacy evaluations for agentic systems should measure not only what information is accessed or disclosed, but also what identities can be inferred.

URL PDF HTML ☆

赞 0 踩 0

2603.17145 2026-06-01 cs.LG cs.AI 版本更新

REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge

REAL: 面向LLM评判的回归感知强化学习

Yasi Zhang, Tianyu Chen, Mingyuan Zhou, Oscar Leong, Ying Nian Wu, Michal Lukasik

发表机构 * University of California, Los Angeles（加州大学洛杉矶分校）； The University of Texas at Austin（得克萨斯大学奥斯汀分校）； Google Research Now at Google DeepMind（谷歌研究现在在谷歌深Mind）

AI总结提出REAL框架，通过广义策略梯度将回归目标融入强化学习，优化LLM作为评分器的数值评估，在多个规模模型上超越SFT和标准RL方法。

Comments Accepted to ICML 2026. The first two authors contributed equally

详情

AI中文摘要

大型语言模型（LLM）越来越多地被部署为自动评估器，为模型输出分配数值分数，这种范式称为LLM-as-a-Judge。然而，标准的强化学习（RL）方法通常依赖二元奖励（例如0-1准确率），从而忽略了回归任务中固有的序结构；例如，当真实值为5时，它们未能识别出预测4显著优于预测1。相反，现有的回归感知方法通常局限于监督微调（SFT），限制了其探索最优推理路径的能力。为弥合这一差距，我们提出\textbf{REAL}（\underline{RE}gression-\underline{A}ware Reinforcement \underline{L}earning），这是一个原则性的RL框架，旨在优化回归奖励，并且也被证明对相关性指标是最优的。一个关键的技术挑战是回归目标显式地依赖于策略，从而使标准策略梯度方法失效。为解决此问题，我们采用广义策略梯度估计器，该估计器自然地将优化分解为两个互补组件：（1）对思维链（CoT）轨迹的探索，以及（2）最终分数的回归感知预测细化。跨模型规模（8B到32B）的大量实验表明，REAL在域外基准上始终优于回归感知SFT基线和标准RL方法，展现出显著更好的泛化能力。具体在Qwen3-32B上，我们相比SFT基线获得了+8.40 Pearson和+7.20 Spearman相关性的提升，相比基础模型提升了+18.30/+11.20。这些发现凸显了将回归目标整合到RL探索中对准确LLM评估的关键价值。

英文摘要

Large language models (LLMs) are increasingly deployed as automated evaluators that assign numeric scores to model outputs, a paradigm known as LLM-as-a-Judge. However, standard Reinforcement Learning (RL) methods typically rely on binary rewards (e.g., 0-1 accuracy), thereby ignoring the ordinal structure inherent in regression tasks; for instance, they fail to recognize that predicting 4 is significantly better than predicting 1 when the ground truth is 5. Conversely, existing regression-aware approaches are often confined to Supervised Fine-Tuning (SFT), limiting their ability to explore optimal reasoning paths. To bridge this gap, we propose \textbf{REAL} (\underline{RE}gression-\underline{A}ware Reinforcement \underline{L}earning), a principled RL framework designed to optimize regression rewards, and also proven to be optimal for correlation metrics. A key technical challenge is that the regression objective is explicitly policy-dependent, thus invalidating standard policy gradient methods. To address this, we employ the generalized policy gradient estimator, which naturally decomposes optimization into two complementary components: (1) exploration over Chain-of-Thought (CoT) trajectory, and (2) regression-aware prediction refinement of the final score. Extensive experiments across model scales (8B to 32B) demonstrate that REAL consistently outperforms both regression-aware SFT baselines and standard RL methods, exhibiting significantly better generalization on out-of-domain benchmarks. On Qwen3-32B specifically, we achieve gains of +8.40 Pearson and +7.20 Spearman correlation over the SFT baseline, and +18.30/+11.20 over the base model. These findings highlight the critical value of integrating regression objectives into RL exploration for accurate LLM evaluation.

URL PDF HTML ☆

赞 0 踩 0

2603.16123 2026-06-01 cs.LG cs.AI math.AT math.CT 版本更新

Functorial Neural Architectures from Higher Inductive Types

基于高阶归纳类型的函子神经架构

Karen Sargsyan

发表机构 * Institute of Chemistry, Academia Sinica, Taipei, Taiwan（中国科学院化学研究所，台湾台北）

AI总结提出通过高阶归纳类型规范编译为神经架构，强制解码器满足严格幺半函子性质，从而在组合泛化任务上比非函子方法提升2-10倍。

Comments 26 pages, 10 tables. Code and Cubical Agda formalization: https://github.com/karsar/hott_neuro

详情

AI中文摘要

神经网络通常能学习任务的各个部分，但在这些部分的新组合上失败。我们认为这种失败是架构性的：只有当解码器尊重任务的代数法则，即从自由生成的序列下降到由这些法则确定的商时，它才能组合泛化。我们通过将高阶归纳类型（HIT）规范编译为神经架构，使这一原则具有建设性。基点、路径构造子和2-胞腔分别映射为基约束、生成器网络、结构拼接和学习到的同伦。由此产生的传输解码器在构造上是严格幺半函子：解码一个拼接的词是独立生成的环段的拼接。相反，我们证明softmax自注意力无法同时满足严格幺半组合和下降到任何非平凡组合商。在环面、圆楔和克莱因瓶上的实验验证了预期的层次结构：函子解码器比非函子替代方案性能提升2-10倍，而学习到的2-胞腔恰好在使用克莱因瓶关系的词上缩小了46%的误差差距。这些结果表明，组合泛化应作为架构中的函子结构强制执行，而非仅从示例中学习。

英文摘要

Neural networks often learn the parts of a task but fail on novel combinations of those parts. We argue that this failure is architectural: a decoder generalizes compositionally only when it respects the algebraic laws of the task, i.e. when it descends from freely generated sequences to the quotient determined by those laws. We make this principle constructive by compiling Higher Inductive Type (HIT) specifications into neural architectures. Basepoints, path constructors, and 2-cells are mapped to base constraints, generator networks, structural concatenation, and learned homotopies. The resulting transport decoders are strict monoidal functors by construction: decoding a concatenated word is concatenation of independently generated loop segments. In contrast, we prove that softmax self-attention cannot simultaneously satisfy strict monoidal composition and descent to any non-trivial compositional quotient. Experiments on the torus, wedge of circles, and Klein bottle validate the predicted hierarchy: functorial decoders outperform non-functorial alternatives by $2$--$10\times$, and a learned 2-cell closes a $46\%$ error gap precisely on words exercising the Klein-bottle relation. These results suggest that compositional generalization should be enforced as functorial structure in the architecture, rather than learned from examples alone.

URL PDF HTML ☆

赞 0 踩 0

2603.12916 2026-06-01 cs.LG cs.AI 版本更新

Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection

Surprised by Attention: 面向时间序列异常检测的可预测查询动态

Kadir-Kaan Özer, René Ebeling, Markus Enzweiler

发表机构 * Mercedes-Benz AG（梅赛德斯-奔驰集团）； Institute for Intelligent Systems, Esslingen University of Applied Sciences（智能系统研究所，埃森嫩应用科学大学）

AI总结提出 AxonAD 无监督检测器，通过预测多头注意力查询向量的演化并结合重构误差与查询不匹配分数，有效检测多变量时间序列中的结构依赖偏移异常。

Comments This manuscript has been accepted for publication at ECML-PKDD 2026. The final version will be published in the conference proceedings. Main: 17 Pages, 7 Figures, 3 Tables; Appendix: 3 Pages, 4 Tables

详情

AI中文摘要

多变量时间序列异常通常表现为跨通道依赖的偏移，而非简单的幅度异常。例如，在自动驾驶中，转向指令可能内部一致，但与产生的横向加速度解耦。当灵活的序列模型尽管协调性改变仍能合理重构信号时，基于残差的检测器可能遗漏此类异常。我们提出 AxonAD，一种无监督检测器，将多头注意力查询演化视为短视界可预测过程。梯度更新重构路径与仅基于历史上下文的预测器耦合，该预测器通过掩码预测器-目标目标针对指数移动平均（EMA）目标编码器进行训练。推理时，重构误差与尾部聚合的查询不匹配分数结合，该分数衡量最近时间步上预测查询与目标查询之间的余弦偏差。这种双重方法在保留幅度级检测的同时，对结构依赖偏移敏感。在带有区间标注的专有车载遥测数据以及 TSB-AD 多变量套件（17 个数据集，180 个序列）上，使用无阈值和范围感知指标，AxonAD 在排名质量和时间定位上优于强基线。消融实验证实查询预测和组合评分是观察到的改进的主要驱动因素。代码可在 https://github.com/iis-esslingen/AxonAD 获取。

英文摘要

Multivariate time series anomalies often manifest as shifts in cross-channel dependencies rather than simple amplitude excursions. In autonomous driving, for instance, a steering command might be internally consistent but decouple from the resulting lateral acceleration. Residual-based detectors can miss such anomalies when flexible sequence models still reconstruct signals plausibly despite altered coordination. We introduce AxonAD, an unsupervised detector that treats multi-head attention query evolution as a short horizon predictable process. A gradient-updated reconstruction pathway is coupled with a history-only predictor that forecasts future query vectors from past context. This is trained via a masked predictor-target objective against an exponential moving average (EMA) target encoder. At inference, reconstruction error is combined with a tail-aggregated query mismatch score, which measures cosine deviation between predicted and target queries on recent timesteps. This dual approach provides sensitivity to structural dependency shifts while retaining amplitude-level detection. On proprietary in-vehicle telemetry with interval annotations and on the TSB-AD multi-variate suite (17 datasets, 180 series) with threshold-free and range-aware metrics, AxonAD improves ranking quality and temporal localization over strong baselines. Ablations confirm that query prediction and combined scoring are the primary drivers of the observed gains. Code is available at the URL https://github.com/iis-esslingen/AxonAD.

URL PDF HTML ☆

赞 0 踩 0

2603.09453 2026-06-01 cs.LG cs.AI stat.ML 版本更新

Variational Routing: A Scalable Bayesian Framework for Calibrated Mixture-of-Experts Transformers

变分路由：用于校准混合专家Transformer的可扩展贝叶斯框架

Albus Yizhuo Li, Matthew Wicker

发表机构 * Department of Computing, Imperial College London（伦敦帝国理工学院计算机系）

AI总结提出变分混合专家路由（VMoER），通过将贝叶斯推断限制在专家选择阶段，实现大规模模型的不确定性校准，在微调基础模型上显著提升路由稳定性、降低校准误差并提高分布外检测AUROC，且额外计算开销极小。

Comments 8 pages, 7 figures for main text; 16 pages for Appendix; Accepted by ICML 2026;

详情

AI中文摘要

基础模型越来越多地部署在需要理解其输出不确定性的场景中，这对于确保负责任部署至关重要。虽然贝叶斯方法为不确定性量化提供了原则性方法，但其计算开销使得在基础模型规模下进行训练或推理不切实际。最先进的模型通过精心设计的稀疏性（包括混合专家（MoE）层）实现了数万亿的参数数量。在这项工作中，我们通过引入变分混合专家路由（VMoER）展示了大规模下的校准不确定性，这是一种用于建模MoE层不确定性的结构化贝叶斯方法。VMoER将贝叶斯推断限制在通常由确定性路由网络完成的专家选择阶段。我们使用两种推断策略实例化VMoER：对路由logits的摊销变分推断和推断用于随机专家选择的温度参数。在微调测试的基础模型上，VMoER在噪声下将路由稳定性提高了38%，校准误差降低了94%，分布外AUROC提高了12%，同时额外FLOPs增加不到1%。这些结果表明，VMoER为构建鲁棒且具有不确定性意识的基础模型提供了一条可扩展的路径。

英文摘要

Foundation models are increasingly being deployed in contexts where understanding the uncertainty of their outputs is critical to ensuring responsible deployment. While Bayesian methods offer a principled approach to uncertainty quantification, their computational overhead renders their use impractical for training or inference at foundation model scale. State-of-the-art models achieve parameter counts in the trillions through carefully engineered sparsity including Mixture-of-Experts (MoE) layers. In this work, we demonstrate calibrated uncertainty at scale by introducing Variational Mixture-of-Experts Routing (VMoER), a structured Bayesian approach for modelling uncertainty in MoE layers. VMoER confines Bayesian inference to the expert-selection stage which is typically done by a deterministic routing network. We instantiate VMoER using two inference strategies: amortised variational inference over routing logits and inferring a temperature parameter for stochastic expert selection. Across fine-tuning tested foundation models, VMoER improves routing stability under noise by 38\%, reduces calibration error by 94\%, and increases out-of-distribution AUROC by 12\%, while incurring less than 1\% additional FLOPs. These results suggest VMoER offers a scalable path toward robust and uncertainty-aware foundation models.

URL PDF HTML ☆

赞 0 踩 0

2603.10468 2026-06-01 eess.AS cs.AI cs.HC cs.MM cs.SD 版本更新

G-STAR: End-to-End Global Speaker-Tracking Attributed Recognition

G-STAR: 端到端全局说话人跟踪属性识别

Jing Peng, Ziyi Chen, Haoyu Li, Yucheng Wang, Duo Ma, Mengtian Li, Yunfan Du, Dezhu Xu, Kai Yu, Shuai Wang

发表机构 * Nanjing University（南京大学）； Shanghai Jiao Tong University（上海交通大学）； Central Media Technology Institute, Huawei（华为中央媒体技术研究院）； Shenzhen Research Institute of Big Data（深圳大数据研究院）； ETH Zürich（苏黎世联邦理工学院）

AI总结提出G-STAR框架，通过缓存条件说话人跟踪模块与Speech-LLM转录骨干耦合，实现长时重叠多说话人语音的端到端说话人属性识别，支持组件优化和联合训练，在局部和全局评估中均表现优异。

Comments submitted to Emnlp 2026

详情

AI中文摘要

我们研究了带时间戳的说话人属性自动语音识别（SA-ASR），针对长时、多说话人且存在重叠的语音。在此设置中，分块推理必须保持会议级别的说话人身份一致性，同时生成带时间戳和说话人标签的转录。先前的Speech-LLM系统倾向于优先考虑局部日志或全局标签，缺乏联合建模细粒度时间边界和鲁棒跨块身份链接的能力。我们提出G-STAR，一个端到端框架，将缓存条件的说话人跟踪模块与Speech-LLM转录骨干耦合。跟踪器提供具有时间基础的结构化说话人线索，LLM基于这些线索生成属性文本。G-STAR支持组件优化和联合端到端训练，能够在异构监督和领域偏移下进行灵活学习。在分块解码协议下，基于预言分割的局部评估和全会议全局评估的实验均显示出强大的说话人属性转录性能。

英文摘要

We study timestamped speaker-attributed automatic speech recognition (SA-ASR) for long-form, multi-party speech with overlap. In this setting, chunk-wise inference must preserve meeting-level speaker identity consistency while producing time-stamped, speaker-labeled transcripts. Prior Speech-LLM systems tend to prioritize either local diarization or global labeling, lacking the ability to jointly model fine-grained temporal boundaries and robust cross-chunk identity linking. We propose G-STAR, an end-to-end framework that couples a cache-conditioned speaker-tracking module with a Speech-LLM transcription backbone. The tracker provides structured speaker cues with temporal grounding, and the LLM generates attributed text conditioned on these cues. G-STAR supports component-wise optimization and joint end-to-end training, enabling flexible learning under heterogeneous supervision and domain shift. Under chunk-wise decoding protocols, experiments on both oracle-segmented local evaluation and full-meeting global evaluation show strong speaker-attributed transcription performance.

URL PDF HTML ☆

赞 0 踩 0

2603.07551 2026-06-01 cs.SD cs.AI 版本更新

Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech

零样本文本转语音中的目标说话人投毒框架

Thanapat Trachu, Thanathai Lertpetchpun, Sai Praneeth Karimireddy, Shrikanth Narayanan

发表机构 * Thomas Lord Department of Computer Science, University of Southern California, USA（汤姆斯·劳德计算机科学系，美国南加州大学）； Signal Analysis and Interpretation Lab, University of Southern California, USA（信号分析与解释实验室，美国南加州大学）

AI总结针对零样本TTS语音克隆的隐私风险，提出说话人生成投毒（SGSP）任务，通过修改训练模型阻止特定身份生成，并评估了推理时过滤和参数修改基线在1、15和100个遗忘说话人上的隐私-效用权衡。

Comments Submitted to Interspeech2026

详情

AI中文摘要

零样本文本转语音（TTS）语音克隆带来了严重的隐私风险，需要从训练好的TTS模型中移除特定说话人身份。传统的机器遗忘在此情境下不足，因为零样本TTS可以从仅参考提示动态重建声音。我们将此任务形式化为说话人生成投毒（SGSP），其中我们修改训练模型以防止生成特定身份，同时保留其他说话人的效用。我们评估了推理时过滤和参数修改基线在1、15和100个遗忘说话人上的表现。通过效用（WER）和隐私之间的权衡来评估性能，隐私使用AUC和遗忘说话人相似度（FSSIM）量化。我们在最多15个说话人上实现了强隐私，但由于身份重叠增加，在100个说话人时揭示了可扩展性限制。因此，我们的研究引入了一个新颖的问题和评估框架，以推动生成式语音隐私的进一步进展。

英文摘要

Zero-shot Text-to-Speech (TTS) voice cloning poses severe privacy risks, demanding the removal of specific speaker identities from trained TTS models. Conventional machine unlearning is insufficient in this context, as zero-shot TTS can dynamically reconstruct voices from just reference prompts. We formalize this task as Speech Generation Speaker Poisoning (SGSP), in which we modify trained models to prevent the generation of specific identities while preserving utility for other speakers. We evaluate inference-time filtering and parameter-modification baselines across 1, 15, and 100 forgotten speakers. Performance is assessed through the trade-off between utility (WER) and privacy, quantified using AUC and Forget Speaker Similarity (FSSIM). We achieve strong privacy for up to 15 speakers but reveal scalability limits at 100 speakers due to increased identity overlap. Our study thus introduces a novel problem and evaluation framework toward further advances in generative voice privacy.

URL PDF HTML ☆

赞 0 踩 0

2603.06738 2026-06-01 cs.LG cs.AI 版本更新

从泄露思维到私有推理：控制LRM对自己说的话

Haritz Puerto, Haonan Li, Xudong Han, Timothy Baldwin, Iryna Gurevych

发表机构 * Mohamed bin Zayed University of Artificial Intelligence, UAE（穆罕默德·本·扎耶德人工智能大学，阿联酋）

AI总结针对大型推理模型（LRM）推理过程中隐私泄露问题，提出通过指令跟随（IF）训练和分阶段解码策略（Staged Decoding）增强隐私保护，在IF和隐私基准上分别提升高达20.9和51.9个百分点。

详情

AI中文摘要

大型推理模型（LRM）产生的推理轨迹（RT）通常包含敏感信息。这些泄露的思维难以控制，且经常违反明确的隐私指令。由于RT可能通过提示注入攻击暴露，这对用户构成了直接的隐私风险。我们将此视为一个可控性问题：由于隐私指令本身就是指令，在RT内改进指令跟随（IF）为减少隐私泄露提供了直接途径。为此，我们引入了一个SFT数据集，教会模型在其推理过程中遵循通用指令，并提出了分阶段解码（Staged Decoding），一种简单的解码策略，通过使用独立的LoRA适配器解耦RT和答案生成，以最大化每个组件的IF。我们在两个系列（1.7B-14B参数）的六个模型上，在两个IF基准和两个隐私基准上评估了我们的方法。我们的方法带来了显著的改进，在IF上提升高达20.9分，在隐私基准上提升51.9个百分点，尽管由于推理性能与IF之间的权衡，这些改进可能以牺牲任务效用为代价。我们的结果表明，改进LRM中的IF可以显著增强隐私，为未来隐私感知的LRM提供了一个有前景的方向。我们的代码可在https://github.com/UKPLab/arxiv2026-controllable-reasoning-models获取。

英文摘要

Large reasoning models (LRMs) produce reasoning traces (RTs) that often contain sensitive information. These leaky thoughts are difficult to control and frequently violate explicit privacy directives. Because RTs can be exposed through prompt injection attacks, this becomes a direct privacy risk to the user. We approach this as a controllability problem: since privacy directives are themselves instructions, improving instruction-following (IF) within the RT provides a direct path to reducing privacy leaks. To this end, we introduce an SFT dataset that teaches models to follow general instructions throughout their reasoning process, and propose Staged Decoding, a simple decoding strategy that decouples RT and answer generation using separate LoRA adapters to maximize IF of each component. We evaluate our approach on six models from two families (1.7B-14B parameters), across two IF benchmarks and two privacy benchmarks. Our method yields substantial improvements, with gains of up to 20.9 points in IF and 51.9 percentage points on privacy benchmarks, though these can come at the cost of task utility due to the trade-off between reasoning performance and IF. Our results show that improving IF in LRMs can significantly enhance privacy, suggesting a promising direction for future privacy-aware LRMs. Our code is available at https://github.com/UKPLab/arxiv2026-controllable-reasoning-models.

URL PDF HTML ☆

赞 0 踩 0

2602.10117 2026-06-01 cs.LG cs.AI 版本更新

Biases in the Blind Spot: Detecting What LLMs Fail to Mention

盲点中的偏见：检测大语言模型未能提及的内容

Iván Arcuschin, David Chanin, Adrià Garriga-Alonso, Oana-Maria Camburu

发表机构 * Poseidon Research（Poseidon研究）； University College London, United Kingdom（伦敦大学学院, 英国）； Imperial College London, United Kingdom（伦敦帝国学院, 英国）

AI总结提出全自动黑盒流水线，通过统计测试和思维链分析，自动检测大语言模型在任务中未明确表述的偏见。

Comments Published at the 43rd International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

大语言模型（LLMs）通常提供看似合理的思维链（CoT）推理痕迹，但可能隐藏内部偏见。我们称这些为未表述的偏见。因此，通过模型陈述的推理来监控模型是不可靠的，现有的偏见评估通常需要预定义类别和手工制作的数据集。在这项工作中，我们引入了一个全自动的黑盒流水线，用于检测特定任务的未表述偏见。给定一个任务数据集，该流水线使用LLM自动评分器生成候选偏见概念。然后，通过生成正面和负面变体，在逐渐增大的输入样本上测试每个概念，并应用统计技术进行多重测试和早期停止。如果一个概念在模型的CoT中未被引用为理由，但产生了统计上显著的性能差异，则将其标记为未表述的偏见。我们在三个决策任务（招聘、贷款审批和大学录取）上对七个LLM评估了我们的流水线。我们的技术自动发现了这些模型中以前未知的偏见（例如，西班牙语流利度、英语熟练度、写作正式度）。在同一运行中，该流水线还验证了先前工作手动识别的偏见（性别、种族、宗教、民族）。更广泛地说，我们提出的方法为自动、更高效和更广泛的特定任务未表述偏见发现提供了一条实用、可扩展的路径。

英文摘要

Large Language Models (LLMs) often provide chain-of-thought (CoT) reasoning traces that appear plausible, but may hide internal biases. We call these unverbalized biases. Monitoring models via their stated reasoning is therefore unreliable, and existing bias evaluations typically require predefined categories and hand-crafted datasets. In this work, we introduce a fully automated, black-box pipeline for detecting task-specific unverbalized biases. Given a task dataset, the pipeline uses LLM autoraters to generate candidate bias concepts. It then tests each concept on progressively larger input samples by generating positive and negative variations, and applies statistical techniques for multiple testing and early stopping. A concept is flagged as an unverbalized bias if it yields statistically significant performance differences while not being cited as justification in the model's CoTs. We evaluate our pipeline across seven LLMs on three decision tasks (hiring, loan approval, and university admissions). Our technique automatically discovers previously unknown biases in these models (e.g., Spanish fluency, English proficiency, writing formality). In the same run, the pipeline also validates biases that were manually identified by prior work (gender, race, religion, ethnicity). More broadly, our proposed approach provides a practical, scalable path to automatic, more efficient, and broader task-specific unverbalized bias discovery.

URL PDF HTML ☆

赞 0 踩 0

2602.22971 2026-06-01 cs.AI 版本更新

SPM-Bench: Benchmarking Large Language Models for Scanning Probe Microscopy

SPM-Bench：面向扫描探针显微镜的大型语言模型基准测试

Peiyao Xiao, Xiaogang Li, Xinyi Gao, Chengliang Xu, Ben Wang, Zichao Chen, Zeyu Wang, Lin Qu, Bing Zhao, Hu Wei

发表机构 * Alibaba Group（阿里巴巴集团）

AI总结提出SPM-Bench，一个全自动数据合成管道和严格评估指标（SIP-F1），用于测试LLMs在扫描探针显微镜领域的推理能力，并首次量化模型“个性”。

详情

AI中文摘要

随着LLMs在通用推理方面取得突破，它们在特定科学领域的熟练程度因数据污染、复杂性不足和过高的人力成本而在现有基准测试中暴露出明显差距。在此，我们提出SPM-Bench，一个专为扫描探针显微镜（SPM）设计的原创、博士级多模态基准测试。我们提出一个全自动数据合成管道，确保高权威性和低成本。通过采用锚点门控筛（AGS）技术，我们从2023年至2025年间发表的arXiv和期刊论文中高效提取高价值图像-文本对。通过混合云-本地架构（其中VLM仅返回空间坐标“llbox”以进行本地高保真裁剪），我们的管道在保持高数据集纯度的同时实现了极致的token节省。为了准确客观地评估LLMs的性能，我们引入了严格不完美惩罚F1（SIP-F1）分数。该指标不仅建立了严格的能力层级，而且首次量化了模型“个性”（保守型、激进型、赌徒型或明智型）。通过将这些结果与模型报告的置信度和感知难度相关联，我们揭示了当前AI在复杂物理场景中的真实推理边界。这些见解使SPM-Bench成为自动化科学数据合成的可推广范式。

英文摘要

As LLMs achieved breakthroughs in general reasoning, their proficiency in specialized scientific domains reveals pronounced gaps in existing benchmarks due to data contamination, insufficient complexity, and prohibitive human labor costs. Here we present SPM-Bench, an original, PhD-level multimodal benchmark specifically designed for scanning probe microscopy (SPM). We propose a fully automated data synthesis pipeline that ensures both high authority and low-cost. By employing Anchor-Gated Sieve (AGS) technology, we efficiently extract high-value image-text pairs from arXiv and journal papers published between 2023 and 2025. Through a hybrid cloud-local architecture where VLMs return only spatial coordinates "llbox" for local high-fidelity cropping, our pipeline achieves extreme token savings while maintaining high dataset purity. To accurately and objectively evaluate the performance of the LLMs, we introduce the Strict Imperfection Penalty F1 (SIP-F1) score. This metric not only establishes a rigorous capability hierarchy but also, for the first time, quantifies model "personalities" (Conservative, Aggressive, Gambler, or Wise). By correlating these results with model-reported confidence and perceived difficulty, we expose the true reasoning boundaries of current AI in complex physical scenarios. These insights establish SPM-Bench as a generalizable paradigm for automated scientific data synthesis.

URL PDF HTML ☆

赞 0 踩 0

2602.19171 2026-06-01 cs.GR cs.AI 版本更新

HistCAD: A Constraint-Aware Parametric History-Based CAD Representation, Dataset, and Benchmark with Industrial Complexity

HistCAD：一种约束感知的基于参数化历史CAD表示、数据集和具有工业复杂性的基准

Xintong Dong, Chuanyang Li, Peng Zheng, Chuqi Han, Jiaxin Jing, Hailong Shen, Yanzhi Song, Zhouwang Yang

发表机构 * University of Science and Technology of China（中国科学技术大学）

AI总结提出HistCAD表示标准、数据集和基准，通过显式约束记录草图、特征和操作，实现可编辑的参数化CAD序列生成与评估。

详情

AI中文摘要

参数化CAD序列是可重用的，因为尺寸和几何约束控制参数变化如何传播。现有的CAD生成数据集和基准强调重建保真度、执行有效性或静态形状相似性，而忽略了编辑下设计意图的保持。我们引入了HistCAD，一个用于可执行参数化CAD且具有显式约束的表示标准、数据集和基准。HistCAD定义了一种独立于CAD软件的中间语言，记录草图图元、约束、特征操作以及用于倒角和圆角等操作的3D点边界参考。该数据集包含170,236个可执行序列，与原生CAD模型、STEP文件、渲染视图和文本注释对齐，结合了学术规模与专业创作的工业复杂性。基于此表示，约束感知可编辑性基准应用参数编辑并报告编辑可达性、条件保留约束满足率和总体可编辑成功率，缩写为ER、cPCSR和OES；这些指标将无法达到有效编辑状态与无法保留所需约束区分开来。实验表明，显式约束对于编辑后保留设计意图至关重要，并且HistCAD支持从文本进行监督式CAD生成以及直接的大语言模型工作流。我们认为HistCAD将CAD生成从静态形状模仿重新定义为具有显式约束的可重用参数化序列的合成。

英文摘要

Parametric CAD sequences are reusable because dimensional and geometric constraints govern how parameter changes propagate. Existing CAD generation datasets and benchmarks emphasize reconstruction fidelity, execution validity, or static shape similarity, leaving preservation of design intent under edits largely unmeasured. We introduce HistCAD, a representation standard, dataset, and benchmark for executable parametric CAD with explicit constraints. HistCAD defines an intermediate language independent of CAD software, recording sketch primitives, constraints, feature operations, and 3D point boundary references for operations such as fillet and chamfer. The dataset contains 170,236 executable sequences aligned with native CAD models, STEP files, rendered views, and text annotations, combining academic scale with professionally authored industrial complexity. Building on this representation, the Constraint-Aware Editability Benchmark applies parameter edits and reports Edit Reachability, conditional preserved constraint satisfaction, and Overall Editable Success, abbreviated ER, cPCSR, and OES; these metrics separate failures to reach a valid edited state from failures to preserve required constraints. Experiments show that explicit constraints are essential for preserving design intent after edits, and that HistCAD supports supervised CAD generation from text and direct LLM workflows. We argue that HistCAD reframes CAD generation from static shape imitation to the synthesis of reusable parametric sequences with explicit constraints.

URL PDF HTML ☆

赞 0 踩 0

2602.08885 2026-06-01 cs.LG cs.AI cs.SC 版本更新

Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression

打破摊销神经符号回归中的简化瓶颈

Paul Saegert, Ullrich Köthe

发表机构 * Heidelberg University（海德堡大学）

AI总结针对摊销符号回归中表达式简化速度慢的问题，提出基于规则的简化引擎SimpliPy，实现百倍加速，从而提升模型精度和可扩展性。

Comments main text: 8 pages, 7 figures; appendix: 12 pages, 11 figures; code available at https://github.com/psaegert/simplipy and https://github.com/psaegert/flash-ansr; v2: Fixed rendering artifact in Figure 7; v3: Fixed Figure 3 title and formula; v4: Fixed Eq (1), example in App. M, Fig 13; v5: ICML 2026 Camera-Ready Version

详情

AI中文摘要

符号回归旨在发现准确描述观测数据的可解释解析表达式。摊销符号回归有望比主流的遗传编程符号回归方法效率更高，但目前难以扩展到真实的科学复杂度。我们发现一个关键障碍是缺乏将等价表达式快速简化为简洁规范形式的方法。摊销符号回归已通过通用计算机代数系统（如SymPy）解决此问题，但其高计算成本严重限制了训练和推理速度。我们提出SimpliPy，一个基于规则的简化引擎，在相当质量下实现比SymPy快100倍的速度。这使摊销符号回归获得显著改进，包括扩展到更大的训练集、更高效地使用每个表达式的令牌预算，以及系统性地消除训练集中与测试等价表达式的污染。我们在Flash-ANSR框架中展示了这些优势，在FastSRB基准上比摊销基线（NeSymReS, E2E）获得更好的准确率。此外，其性能与最先进的直接优化方法（PySR）相当，同时在增加推理预算时恢复更简洁而非更复杂的表达式。

英文摘要

Symbolic regression (SR) aims to discover interpretable analytical expressions that accurately describe observed data. Amortized SR promises to be much more efficient than the predominant genetic programming SR methods, but currently struggles to scale to realistic scientific complexity. We find that a key obstacle is the lack of a fast reduction of equivalent expressions to a concise normalized form. Amortized SR has addressed this with general-purpose Computer Algebra Systems (CAS) like SymPy, but the high computational cost severely limits training and inference speed. We propose SimpliPy, a rule-based simplification engine achieving a 100-fold speed-up over SymPy at comparable quality. This enables substantial improvements in amortized SR, including scalability to much larger training sets, more efficient use of the per-expression token budget, and systematic training set decontamination with respect to equivalent test expressions. We demonstrate these advantages in our Flash-ANSR framework, which achieves much better accuracy than amortized baselines (NeSymReS, E2E) on the FastSRB benchmark. Moreover, it performs on par with state-of-the-art direct optimization (PySR) while recovering more concise rather than more complex expressions with increasing inference budget.

URL PDF HTML ☆

赞 0 踩 0

2602.17531 2026-06-01 cs.LG cs.AI 版本更新

Softmax的信息几何：探测与引导

Kiho Park, Todd Nief, Yo Joong Choe, Victor Veitch

发表机构 * University of Chicago（芝加哥大学）

AI总结本文从信息几何角度研究AI系统如何将语义结构编码到表示空间的几何结构中，并提出一种利用线性探针鲁棒引导表示以展现特定概念的“双重引导”方法。

Comments Code is available at https://github.com/KihoPark/dual-steering

详情

Journal ref: In Proceedings of the 43rd International Conference on Machine Learning (ICML), 2026

AI中文摘要

本文关注AI系统如何将语义结构编码到其表示空间的几何结构中的问题。动机观察是，这些表示空间的自然几何应反映模型使用表示产生行为的方式。我们聚焦于定义softmax分布的重要特例。在这种情况下，我们认为自然几何是信息几何。我们的重点是信息几何在语义编码和线性表示假设中的作用。作为一个说明性应用，我们开发了“双重引导”，一种利用线性探针鲁棒地引导表示以展现特定概念的方法。我们证明双重引导在最小化对非目标概念改变的同时，最优地修改目标概念。实验上，我们发现双重引导增强了概念操控的可控性和稳定性。

英文摘要

This paper concerns the question of how AI systems encode semantic structure into the geometric structure of their representation spaces. The motivating observation is that the natural geometry of these representation spaces should reflect the way models use representations to produce behavior. We focus on the important special case of representations that define softmax distributions. In this case, we argue that the natural geometry is information geometry. Our focus is on the role of information geometry on semantic encoding and the linear representation hypothesis. As an illustrative application, we develop "dual steering", a method for robustly steering representations to exhibit a particular concept using linear probes. We prove that dual steering optimally modifies the target concept while minimizing changes to off-target concepts. Empirically, we find that dual steering enhances the controllability and stability of concept manipulation.

URL PDF HTML ☆

赞 0 踩 0

2602.11137 2026-06-01 cs.LG cs.AI cs.CL 版本更新

Weight Decay Improves Language Model Plasticity

权重衰减提升语言模型可塑性

Tessa Han, Sebastian Bordt, Hanlin Zhang, Sham Kakade

发表机构 * Broad Institute, Schmidt Center（Broad研究所，Schmidt中心）； University of Tübingen, Tübingen AI Center（图宾根大学，图宾根人工智能中心）； Harvard University（哈佛大学）

AI总结本文通过系统实验表明，预训练中较大的权重衰减能提高模型的可塑性，使微调后下游性能更优，并揭示了其促进线性可分表示、正则化注意力矩阵和减少过拟合的机制。

详情

AI中文摘要

谁获得功劳或责备？在现代AI系统中分配责任

Shichang Zhang, Hongzhe Du, Jiaqi W. Ma, Himabindu Lakkaraju

发表机构 * Harvard University, Cambridge, MA, USA（哈佛大学）； University of California, Los Angeles, Los Angeles, CA, USA（加州大学洛杉矶分校）； University of Illinois Urbana-Champaign, Urbana-Champaign, IL, USA（伊利诺伊大学厄巴纳-香槟分校）

AI总结提出一个归因框架，通过反事实问题量化模型开发各阶段（预训练、微调等）对最终行为的影响，并设计无需重训练的估计器，成功识别并移除多阶段任务中的虚假关联。

详情

AI中文摘要

现代AI系统通常通过多个阶段开发——预训练、微调轮次以及后续的适应或对齐，每个阶段都建立在先前阶段之上并以不同方式更新模型。这引发了一个关键的责任问题：当部署的模型成功或失败时，哪个阶段负责，以及负责到什么程度？我们提出了责任归因问题，用于将模型行为追溯到模型开发过程的特定阶段。为了解决这一挑战，我们提出了一个通用框架，回答关于阶段效应的反事实问题：如果特定阶段的更新没有发生，模型的行为会如何改变？在此框架内，我们引入了无需重新训练模型即可高效量化阶段效应的估计器，考虑了数据和模型优化动态的关键方面，包括学习率调度、动量和权重衰减。我们证明了我们的方法成功量化了每个阶段对模型行为的责任。基于归因结果，我们的方法可以识别并移除在图像分类和文本毒性检测任务中跨多个阶段开发时学到的虚假相关性。我们的方法为模型分析提供了实用工具，并代表了向更负责任的AI发展迈出的重要一步。

英文摘要

Modern AI systems are typically developed through multiple stages-pretraining, fine-tuning rounds, and subsequent adaptation or alignment, where each stage builds on the previous ones and updates the model in distinct ways. This raises a critical question of accountability: when a deployed model succeeds or fails, which stage is responsible, and to what extent? We pose the accountability attribution problem for tracing model behavior back to specific stages of the model development process. To address this challenge, we propose a general framework that answers counterfactual questions about stage effects: how would the model's behavior have changed if the updates from a particular stage had not occurred? Within this framework, we introduce estimators that efficiently quantify stage effects without retraining the model, accounting for both the data and key aspects of model optimization dynamics, including learning rate schedules, momentum, and weight decay. We demonstrate that our approach successfully quantifies the accountability of each stage to the model's behavior. Based on the attribution results, our method can identify and remove spurious correlations learned during image classification and text toxicity detection tasks that were developed across multiple stages. Our approach provides a practical tool for model analysis and represents a significant step toward more accountable AI development.

URL PDF HTML ☆

赞 0 踩 0

2602.07928 2026-06-01 cs.LG cs.AI 版本更新

A Kinetic Energy Perspective of Flow Matching

流匹配的动能视角

Ziyun Li, Huancheng Hu, Soon Hoe Lim, Xuyu Li, Fei Gao, Enmao Diao, Zezhen Ding, Michalis Vazirgiannis, Henrik Bostrom

发表机构 * KTH Royal Institute of Technology（皇家理工学院）； Nordita, Nordic Institute for Theoretical Physics（北欧理论物理研究所）； Hasso Plattner Institute, University of Potsdam（波茨坦大学哈索 Plattner 研究院）； Trinity College Dublin（都柏林三一学院）； Hangzhou Institute of Technology, Xidian University（西安电子科技大学杭州研究院）； The Hong Kong University of Science（香港科学大学）； Mohamed bin Zayed University of Artificial Intelligence（莫莫丁·本·扎耶德人工智能大学）

AI总结本文引入动能路径能量（KPE）作为流匹配生成模型的诊断工具，发现其与语义保真度和数据稀疏性相关，并基于此提出无训练的动能轨迹塑形（KTS）策略以改善生成质量。

Comments ICML 2026 Spotlight

详情

AI中文摘要

基于流的生成模型可以通过物理视角来审视：采样通过积分学习到的速度场将粒子从噪声传输到数据，每个样本对应一条具有自身动力学努力的轨迹。受经典力学启发，我们引入了动能路径能量（KPE），这是一种类似作用量的每样本诊断指标，用于测量沿常微分方程（ODE）轨迹累积的动能努力。实验上，KPE表现出两种稳健的对应关系：{i} 较高的KPE预测更强的语义保真度；{ii} 高KPE轨迹落在稀疏表示区域。我们进一步提供了将轨迹能量与数据稀疏性联系起来的理论保证。矛盾的是，这种相关性是非单调的。在足够高的能量下，生成可能退化为记忆。利用经验流匹配的闭式公式，我们表明极端能量驱动轨迹接近训练样本的副本。这产生了金发姑娘原则，并激发了动能轨迹塑形（KTS），一种无训练的两阶段推理策略，该策略增强早期运动并强制执行后期软着陆，从而减少记忆并提高基准任务上的生成质量。

英文摘要

Flow-based generative models can be viewed through a physics lens: sampling transports a particle from noise to data by integrating a learned velocity field, and each sample corresponds to a trajectory with its own dynamical effort. Motivated by classical mechanics, we introduce Kinetic Path Energy (KPE), an action-like, per-sample diagnostic that measures the accumulated kinetic effort along an ordinary differential equation (ODE) trajectory. Empirically, KPE exhibits two robust correspondences: {i} higher KPE predicts stronger semantic fidelity; {ii} high-KPE trajectories land in sparse representation regions. We further provide theoretical guarantees linking trajectory energy to data sparsity. Paradoxically, this correlation is non-monotonic. At sufficiently high energy, generation can degenerate into memorization. Leveraging the closed-form formula of empirical flow matching, we show that extreme energies drive trajectories toward near-copies of training examples. This yields a Goldilocks principle and motivates Kinetic Trajectory Shaping (KTS), a training-free two-phase inference strategy that boosts early motion and enforces a late-time soft landing, reducing memorization and improving generation quality across benchmark tasks.

URL PDF HTML ☆

赞 0 踩 0

2602.07905 2026-06-01 cs.AI 版本更新

MedCoG: Maximizing LLM Inference Density in Medical Reasoning via Meta-Cognitive Regulation

MedCoG：通过元认知调节最大化医学推理中的LLM推理密度

Yu Zhao, Hao Guan, Yongcheng Jing, Ying Zhang, Dacheng Tao

发表机构 * College of Computer Science, VCIP, DISSec Center, Nankai University, China 300350（南开大学计算机学院、VCIP、DISSec中心、中国300350）； Generative AI Lab, College of Computing and Data Science（生成式人工智能实验室、计算与数据科学学院）； Nanyang Technological University, Singapore 639798（南洋理工大学，新加坡639798）

AI总结提出MedCoG框架，利用元认知评估动态调节知识使用，以缓解推理扩展定律下的收益递减，提升推理效率与准确性。

Comments Accepted by ICML 2026

详情

AI中文摘要

大型语言模型（LLM）在复杂医学推理中展现出强大潜力，但在推理扩展定律下面临收益递减。现有研究通过添加各种知识类型增强LLM，但额外成本转化为准确性的效果尚不明确。本文探索LLM的元认知（即对其自身认知状态的自我评估）如何调节推理过程。具体而言，我们提出MedCoG，一种带有知识图谱的医学元认知智能体，其中对任务复杂性、熟悉度和知识密度的元认知评估动态调节程序性、情景性和事实性知识的利用。这种以LLM为中心的按需推理旨在通过（1）避免无差别扩展以降低成本，（2）过滤干扰知识以提高准确性，来缓解扩展定律下的收益递减。为验证这一点，我们经验性地刻画了扩展曲线，并引入推理密度来量化推理效率。实验表明MedCoG在五个医学基准困难集上的有效性和高效性，实现了6.2倍的推理密度。此外，Oracle研究凸显了元认知调节的巨大潜力。

英文摘要

Large Language Models (LLMs) have shown strong potential in complex medical reasoning yet face diminishing gains under inference scaling laws. While existing studies augment LLMs with various knowledge types, it remains unclear how effectively the additional costs translate into accuracy. In this paper, we explore how meta-cognition of LLMs, i.e., their self-assessment of their own cognitive states, can regulate the reasoning process. Specifically, we propose MedCoG, a Medical Meta-Cognition Agent with Knowledge Graph, where the meta-cognitive assessments of task complexity, familiarity, and knowledge density dynamically regulate utilization of procedural, episodic, and factual knowledge. The LLM-centric on-demand reasoning aims to mitigate the diminishing returns under scaling law by (1) reducing costs via avoiding indiscriminate scaling, (2) improving accuracy via filtering out distractive knowledge. To validate this, we empirically characterize the scaling curve and introduce inference density to quantify inference efficiency. Experiments demonstrate the effectiveness and efficiency of MedCoG on five hard sets of medical benchmarks, yielding 6.2x inference density. Furthermore, the Oracle study highlights the significant potential of meta-cognitive regulation.

URL PDF HTML ☆

赞 0 踩 0

2602.07457 2026-06-01 cs.SE cs.AI cs.CL 版本更新

Pull Requests as a Training Signal for Repo-Level Code Editing

拉取请求作为仓库级代码编辑的训练信号

Qinglin Zhu, Tianyu Chen, Shuai Lu, Lei Ji, Runcong Zhao, Murong Ma, Xiangxiang Dai, Yulan He, Lin Gui, Peng cheng, Yeyun Gong

发表机构 * King's College London, UK（伦敦国王学院）； Chinese University of Hong Kong, HK（香港中文大学）； National University of Singapore, SG（新加坡国立大学）； Microsoft Research Asia, CN（微软亚洲研究院）； The Alan Turing Institute, UK（艾伦·图灵研究所）

AI总结提出Clean-PR方法，利用真实GitHub拉取请求作为训练信号，通过重建和验证转换为搜索/替换编辑块，结合无代理对齐的监督微调，在SWE-bench上显著提升仓库级代码编辑性能。

Comments Accepted at ICML 2026

详情

AI中文摘要

仓库级代码编辑要求模型理解复杂依赖关系并在大型代码库中执行精确的多文件修改。虽然最近在SWE-bench上的进展严重依赖于复杂的代理脚手架，但尚不清楚这种能力有多少可以通过高质量的训练信号内化。为了解决这个问题，我们提出了Clean Pull Request (Clean-PR)，一种利用真实世界GitHub拉取请求作为仓库级编辑训练信号的中间训练范式。我们引入了一个可扩展的流水线，通过重建和验证将嘈杂的拉取请求差异转换为搜索/替换编辑块，从而得到最大的公开可用语料库，包含200万个拉取请求，涵盖12种编程语言。使用这个训练信号，我们进行中间训练阶段，然后进行无代理对齐的监督微调过程，并带有错误驱动的数据增强。在SWE-bench上，我们的模型显著优于指令微调基线，在SWE-bench Lite上实现了13.6%的绝对改进，在SWE-bench Verified上实现了12.3%的绝对改进。这些结果表明，仓库级代码理解和编辑能力可以在简化的、无代理的协议下有效地内化到模型权重中，而无需依赖繁重的推理时脚手架。

英文摘要

Repository-level code editing requires models to understand complex dependencies and execute precise multi-file modifications across a large codebase. While recent gains on SWE-bench rely heavily on complex agent scaffolding, it remains unclear how much of this capability can be internalised via high-quality training signals. To address this, we propose Clean Pull Request (Clean-PR), a mid-training paradigm that leverages real-world GitHub pull requests as a training signal for repository-level editing. We introduce a scalable pipeline that converts noisy pull request diffs into Search/Replace edit blocks through reconstruction and validation, resulting in the largest publicly available corpus of 2 million pull requests spanning 12 programming languages. Using this training signal, we perform a mid-training stage followed by an agentless-aligned supervised fine-tuning process with error-driven data augmentation. On SWE-bench, our model significantly outperforms the instruction-tuned baseline, achieving absolute improvements of 13.6% on SWE-bench Lite and 12.3% on SWE-bench Verified. These results demonstrate that repository-level code understanding and editing capabilities can be effectively internalised into model weights under a simplified, agentless protocol, without relying on heavy inference-time scaffolding.

URL PDF HTML ☆

赞 0 踩 0

2510.10544 2026-06-01 cs.LG cs.AI stat.ML 版本更新

PAC-Bayesian Reinforcement Learning Trains Generalizable Policies

PAC-Bayesian 强化学习训练可泛化策略

Abdelkrim Zitouni, Mehdi Hennequin, Juba Agoun, Ryan Horache, Nadia Kabachi, Omar Rivasplata

发表机构 * Université Claude Bernard Lyon 1, LIRIS, UMR CNRS 5205, France（里尔一大学，LIRIS，法国CNRS 5205）

AI总结提出一种新的 PAC-Bayesian 泛化界，通过链的混合时间显式考虑数据中的马尔可夫依赖性，并基于此设计 PB-SAC 算法以优化该界指导探索，在连续控制任务中提供有意义的置信度证书且保持竞争性能。

Comments Accepted to the 43rd International Conference on Machine Learning (ICML 2026). Camera-ready version

2602.06161 2026-06-01 cs.CL cs.AI 版本更新

Stop the Flip-Flop: Context-Preserving Verification for Fast Revocable Diffusion Decoding

停止翻转：面向快速可撤销扩散解码的上下文保持验证

Yanzheng Xiang, Lan Wei, Yizhen Yao, Qinglin Zhu, Hanqi Yan, Chen Jin, Philip Alexander Teare, Dandan Zhang, Lin Gui, Amrutha Saseendran, Yulan He

发表机构 * King's College London, UK ； Centre for AI, Data Science \& Artificial Intelligence, BioPharmaceuticals R\&D, AstraZeneca, UK ； The Alan Turing Institute, UK ； Imperial College London, UK

AI总结针对并行扩散解码中因激进并行导致的翻转振荡问题，提出COVER方法，通过KV缓存覆盖和稳定性感知评分实现单次前向传递中的留一验证与稳定草稿，减少不必要修订并加速解码。

详情

AI中文摘要

并行扩散解码可以通过每步解掩多个令牌来加速扩散语言模型推理，但激进的并行常常损害质量。可撤销解码通过重新检查早期令牌来缓解这一问题，然而我们观察到现有的验证方案频繁触发翻转振荡，即令牌被重新掩码后又原样恢复。这种行为以两种方式减慢推理：重新掩码已验证位置削弱了并行草稿的条件上下文，且重复的重新掩码循环消耗修订预算而进展甚微。我们提出COVER（用于高效修订的缓存覆盖验证），它在单次前向传递中执行留一验证和稳定草稿。COVER通过KV缓存覆盖构建两种注意力视图：选定的种子被掩码用于验证，而其缓存的键值状态被注入到所有其他查询中以保留上下文信息，同时通过闭式对角校正防止种子位置的自泄漏。COVER进一步使用稳定性感知评分对种子进行优先级排序，该评分平衡不确定性、下游影响和缓存漂移，并自适应调整每步验证的种子数量。在多个基准测试中，COVER显著减少不必要的修订，实现更快的解码，同时保持输出质量。

英文摘要

Parallel diffusion decoding can accelerate diffusion language model inference by unmasking multiple tokens per step, but aggressive parallelism often harms quality. Revocable decoding mitigates this by rechecking earlier tokens, yet we observe that existing verification schemes frequently trigger flip-flop oscillations, where tokens are remasked and later restored unchanged. This behaviour slows inference in two ways: remasking verified positions weakens the conditioning context for parallel drafting, and repeated remask cycles consume the revision budget with little net progress. We propose COVER (Cache Override Verification for Efficient Revision), which performs leave-one-out verification and stable drafting within a single forward pass. COVER constructs two attention views via KV cache override: selected seeds are masked for verification, while their cached key value states are injected for all other queries to preserve contextual information, with a closed form diagonal correction preventing self leakage at the seed positions. COVER further prioritises seeds using a stability aware score that balances uncertainty, downstream influence, and cache drift, and it adapts the number of verified seeds per step. Across benchmarks, COVER markedly reduces unnecessary revisions and yields faster decoding while preserving output quality.

URL PDF HTML ☆

赞 0 踩 0

2510.00845 2026-06-01 cs.LG cs.AI cs.CL 版本更新

残差储备记忆网络

Matteo Pinna, Andrea Ceni, Claudio Gallicchio

发表机构 * Department of Computer Science（计算机科学系）； University of Pisa（比萨大学）

AI总结提出一种新型无训练循环神经网络ResRMN，通过结合线性记忆储备与基于时间残差正交连接的非线性储备，增强长期输入传播，在时间序列和像素级一维分类任务中优于传统储备计算模型。

Comments IJCNN 2025

2506.01318 2026-06-01 cs.LG cs.AI 版本更新

Unlearning's Blind Spots: Over-Unlearning and Prototypical Relearning Attack

机器遗忘的盲点：过度遗忘与原型重学习攻击

SeungBum Ha, Saerom Park, Sung Whan Yoon

发表机构 * Graduate School of Artificial Intelligence, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South Korea（人工智能研究生院，乌山国立科学与技术研究所（UNIST），乌山，韩国）； Department of Industrial Engineering, UNIST, Ulsan, South Korea（工业工程系，UNIST，乌山，韩国）； Department of Electrical Engineering, UNIST, Ulsan, South Korea（电气工程系，UNIST，乌山，韩国）

AI总结针对类别级机器遗忘，提出过度遗忘度量OU@epsilon并揭示原型重学习攻击，通过Spotter方法结合掩码知识蒸馏和类内分散损失来缓解这两个盲点。

Comments 9 pages, ICML 2026

详情

AI中文摘要

机器遗忘（MU）旨在从训练模型中删除指定的遗忘集，而无需昂贵的重新训练，但现有技术忽略了两个关键盲点："过度遗忘"会恶化遗忘集附近的保留数据，以及事后"重学习"攻击旨在复活被遗忘的知识。聚焦于类别级遗忘，我们首先推导出一个过度遗忘度量OU@epsilon，它量化了遗忘集邻近区域（过度遗忘主要发生区域）的附带损害。接下来，我们揭示了MU上一个未预见的重学习威胁，即原型重学习攻击，该攻击仅利用少量样本就能利用遗忘类的每类原型，并轻松恢复遗忘前的性能。为了应对类别级遗忘中的这两个盲点，我们引入了Spotter，一个即插即用的目标函数，它结合了（i）对遗忘类邻近区域的掩码知识蒸馏惩罚以抑制OU@epsilon，和（ii）一个类内分散损失，用于分散遗忘类嵌入，从而中和原型重学习攻击。Spotter在CIFAR、TinyImageNet和CASIA-WebFace数据集上取得了最先进的结果，为机器遗忘的盲点提供了实用的补救措施。

英文摘要

Machine unlearning (MU) aims to expunge a designated forget set from a trained model without costly retraining, yet the existing techniques overlook two critical blind spots: "over-unlearning" that deteriorates retained data near the forget set, and post-hoc "relearning" attacks that aim to resurrect the forgotten knowledge. Focusing on class-level unlearning, we first derive an over-unlearning metric, OU@epsilon, which quantifies collateral damage in regions proximal to the forget set, where over-unlearning mainly occurs. Next, we expose an unforeseen relearning threat on MU, i.e., the Prototypical Relearning Attack, which exploits the per-class prototype of the forget class with just a few samples, and easily restores the pre-unlearning performance. To counter both blind spots in class-level unlearning, we introduce Spotter, a plug-and-play objective that combines (i) a masked knowledge-distillation penalty on the nearby region of forget classes to suppress OU@epsilon, and (ii) an intra-class dispersion loss that scatters forget-class embeddings, neutralizing Prototypical Relearning Attacks. Spotter achieves state-of-the-art results across CIFAR, TinyImageNet, and CASIA-WebFace datasets, offering a practical remedy to unlearning's blind spots.

URL PDF HTML ☆

赞 0 踩 0

2601.22296 2026-06-01 cs.LG cs.AI 版本更新

ParalESN: Enabling parallel information processing in Reservoir Computing

ParalESN：在储层计算中实现并行信息处理

Matteo Pinna, Giacomo Lagomarsini, Andrea Ceni, Claudio Gallicchio

发表机构 * Department of Computer Science, University of Pisa, Pisa, Italy（意大利比萨大学计算机科学系）

AI总结提出ParalESN，利用复数域对角线性递归实现储层计算的并行化，在保持回声状态属性和普适性保证的同时，大幅提升计算效率。

Comments ICML 2026

详情

AI中文摘要

储层计算（RC）已成为时间处理的有效范式。然而，其可扩展性受到顺序处理时间数据的需要和高维储层巨大内存占用的严重限制。为了解决这些限制，我们通过结构化算子和状态空间建模的视角重新审视RC，引入了并行回声状态网络（ParalESN）。利用复数域中的对角线性递归，ParalESN实现了时间数据的并行处理以及高效高维储层的构建。彻底的理论分析表明，传统回声状态网络的回声状态属性和普适性保证得以保留，同时允许任意线性储层在复数对角形式下的等价表示。实验上，ParalESN在预测精度上与传统的RC和完全可训练的序列模型相当，同时实现了数量级的计算节省。总体而言，ParalESN为将RC集成到深度学习领域提供了一条可扩展且有原则的路径。

英文摘要

Reservoir Computing (RC) has established itself as an efficient paradigm for temporal processing. However, its scalability remains severely constrained by the need to process temporal data sequentially and the prohibitive memory footprint of high-dimensional reservoirs. To address these limitations, we revisit RC through the lens of structured operators and state space modeling, introducing Parallel Echo State Network (ParalESN). Leveraging diagonal linear recurrence in the complex domain, ParalESN enables parallel processing of temporal data and the construction of efficient, high-dimensional reservoirs. A thorough theoretical analysis demonstrates that the Echo State Property and the universality guarantees of traditional Echo State Networks are preserved, while also admitting an equivalent representation of arbitrary linear reservoirs in the complex diagonal form. Empirically, ParalESN achieves competitive predictive accuracy with traditional RC and with fully trainable sequence models, while delivering computational savings by orders of magnitude. Overall, ParalESN offers a scalable and principled pathway for integrating RC within the deep learning landscape.

URL PDF HTML ☆

赞 0 踩 0

2509.24319 2026-06-01 cs.CL cs.AI 版本更新

Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in Large Language Models

价值表达的双重机制：大型语言模型中的内在价值与提示价值

Jongwook Han, Jongwon Lim, Injin Kong, Yohan Jo

发表机构 * Graduate School of Data Science, Seoul National University（数据科学研究生院，首尔国立大学）

AI总结本文通过价值向量和价值神经元分析，揭示大型语言模型中内在价值表达与提示价值表达在机制上部分共享核心组件，但各自拥有独特功能，内在机制促进多样性，提示机制增强指令遵从。

Comments Accepted at ICML 2026. Project page: https://holi-lab.github.io/ValueMechanism/

详情

AI中文摘要

大型语言模型可以通过两种主要方式表达价值：（1）内在表达，反映模型在训练过程中学习到的固有价值；（2）提示表达，由显式提示引发。鉴于它们在价值对齐中的广泛应用，清楚理解其潜在机制至关重要，特别是它们是否主要重叠（如人们可能预期的）或依赖于不同的机制。我们在机制层面使用两种方法分析这个很大程度上未被充分研究的问题：（1）价值向量，从残差流中提取的代表价值机制的特征方向；（2）价值神经元，对价值向量有贡献的MLP神经元。我们证明内在和提示价值机制部分共享对诱导价值表达至关重要的共同组件，这些组件跨语言泛化并在模型的内部表示中重建理论上的价值间相关性。然而，每种机制也拥有独特的组件，发挥不同的作用。特别是，内在机制在更多样化的价值相关场景中激活并促进响应多样性，而提示机制增强指令遵从，甚至在遥远任务（如越狱）中也能生效。

英文摘要

Large language models can express values in two main ways: (1) intrinsic expression, reflecting the model's inherent values learned during training, and (2) prompted expression, elicited by explicit prompts. Given their widespread use in value alignment, it is paramount to clearly understand their underlying mechanisms, particularly whether they mostly overlap (as one might expect) or rely on distinct mechanisms. We analyze this largely understudied problem at the mechanistic level using two approaches: (1) value vectors, feature directions representing value mechanisms extracted from the residual stream, and (2) value neurons, MLP neurons that contribute to value vectors. We demonstrate that intrinsic and prompted value mechanisms partly share common components crucial for inducing value expression, generalizing across languages and reconstructing theoretical inter-value correlations in the model's internal representations. Yet, each mechanism also possesses unique components that fulfill distinct roles. In particular, the intrinsic mechanism activates in more diverse value-related scenarios and promotes response diversity, whereas the prompted mechanism strengthens instruction compliance, taking effect even in distant tasks like jailbreaking.

URL PDF HTML ☆

赞 0 踩 0

2509.20784 2026-06-01 cs.CL cs.AI 版本更新

Towards Atoms of Large Language Models

迈向大型语言模型的原子

Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao

发表机构 * The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China（认知与决策智能复杂系统重点实验室，自动化研究所，中国科学院，北京，中国）； School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China（人工智能学院，中国科学院大学，北京，中国）

AI总结本文提出原子理论，通过原子内积（AIP）定义、评估和识别大型语言模型的基本表示单元（原子），并证明在阈值激活稀疏自编码器（TSAE）下原子可识别，实验发现神经元和特征不满足理想原子标准，而通过匹配TSAE容量与数据规模可识别出近乎完美的原子。

Comments To be published in ICML 2026

详情

AI中文摘要

大型语言模型（LLMs）的基本表示单元（FRUs）尚未定义，这限制了对它们底层机制的进一步理解。在本文中，我们引入原子理论来系统地定义、评估和识别这样的FRUs，我们称之为原子。基于原子内积（AIP），一种捕捉LLM表示底层几何结构的非欧几里得度量，我们正式定义了原子，并提出了理想原子的两个关键标准：忠实性（$R^2$）和稳定性（$q^*$）。我们进一步证明，在阈值激活稀疏自编码器（TSAEs）下原子是可识别的。在实验上，我们揭示了LLMs中普遍存在的表示偏移，并证明AIP纠正了这种偏移以捕捉底层的表示几何结构。我们发现两个广泛使用的单元——神经元和特征——不符合理想原子的条件：神经元是忠实的（$R^2=1$）但不稳定（$q^*=0.5\%$），而特征更稳定（$q^*=68.2\%$）但不忠实（$R^2=48.8\%$）。为了找到LLMs的原子，利用TSAEs下的原子可识别性，我们通过大规模实验表明，只有当TSAE容量与数据规模匹配时，才能实现可靠的原子识别。在此洞察的指导下，我们在Gemma2-2B、Gemma2-9B和Llama3.1-8B的各层中识别出具有近乎完美忠实性（$R^2=99.9\%$）和稳定性（$q^*=99.8\%$）的FRUs，在统计上满足理想原子的标准。进一步分析证实，这些原子与理论预期一致，并表现出显著更高的单语义性。总体而言，我们提出并验证了原子理论作为理解LLMs内部表示的基础。代码可在https://github.com/ChenhuiHu/towards_atoms获取。

英文摘要

The fundamental representational units (FRUs) of large language models (LLMs) remain undefined, limiting further understanding of their underlying mechanisms. In this paper, we introduce Atom Theory to systematically define, evaluate, and identify such FRUs, which we term atoms. Building on the atomic inner product (AIP), a non-Euclidean metric that captures the underlying geometry of LLM representations, we formally define atoms and propose two key criteria for ideal atoms: faithfulness ($R^2$) and stability ($q^*$). We further prove that atoms are identifiable under threshold-activated sparse autoencoders (TSAEs). Empirically, we uncover a pervasive representation shift in LLMs and demonstrate that the AIP corrects this shift to capture the underlying representational geometry. We find that two widely used units, neurons and features, fail to qualify as ideal atoms: neurons are faithful ($R^2=1$) but unstable ($q^*=0.5\%$), while features are more stable ($q^*=68.2\%$) but unfaithful ($R^2=48.8\%$). To find atoms of LLMs, leveraging atom identifiability under TSAEs, we show via large-scale experiments that reliable atom identification occurs only when the TSAE capacity matches the data scale. Guided by this insight, we identify FRUs with near-perfect faithfulness ($R^2=99.9\%$) and stability ($q^*=99.8\%$) across layers of Gemma2-2B, Gemma2-9B, and Llama3.1-8B, satisfying the criteria of ideal atoms statistically. Further analysis confirms that these atoms align with theoretical expectations and exhibit substantially higher monosemanticity. Overall, we propose and validate Atom Theory as a foundation for understanding the internal representations of LLMs. Code available at https://github.com/ChenhuiHu/towards_atoms.

URL PDF HTML ☆

赞 0 踩 0

2601.18537 2026-06-01 cs.RO cs.AI 版本更新

不完全信息博弈中的一致对手建模

Sam Ganzfried

发表机构 * Ganzfried Research（甘兹弗里德研究）

AI总结针对不完全信息博弈中现有对手建模方法无法保证收敛到对手真实策略的问题，提出一种基于序列形式博弈表示和投影梯度下降的凸优化算法，实现高效且一致的对手建模。

详情

AI中文摘要

多智能体环境中智能体的目标是在与对手交互时最大化总收益。遵循博弈论解概念（如纳什均衡）在某些场景下可能获得强性能；然而，这类方法未能利用与对手重复交互中的历史和观测数据。对手建模算法整合机器学习技术，利用可用数据来利用次优对手；然而，这类方法在不完全信息博弈中的有效性至今相当有限。我们表明，即使面对来自已知先验分布的静态对手，现有对手建模方法也无法满足一个简单的理想性质；即，即使博弈迭代次数趋近无穷，它们也不能保证模型趋近对手的真实策略。我们开发了一种新算法，能够实现这一性质，并通过基于序列形式博弈表示和投影梯度下降求解凸最小化问题来高效运行。在标准贝叶斯可辨识性和访问假设下，该算法保证从游戏过程的观测以及可能可用的额外历史数据中高效收敛到对手的真实策略。

英文摘要

The goal of agents in multi-agent environments is to maximize total reward against the opposing agents that are encountered. Following a game-theoretic solution concept, such as Nash equilibrium, may obtain a strong performance in some settings; however, such approaches fail to capitalize on historical and observed data from repeated interactions against our opponents. Opponent modeling algorithms integrate machine learning techniques to exploit suboptimal opponents utilizing available data; however, the effectiveness of such approaches in imperfect-information games to date is quite limited. We show that existing opponent modeling approaches fail to satisfy a simple desirable property even against static opponents drawn from a known prior distribution; namely, they do not guarantee that the model approaches the opponent's true strategy even in the limit as the number of game iterations approaches infinity. We develop a new algorithm that is able to achieve this property and runs efficiently by solving a convex minimization problem based on the sequence-form game representation using projected gradient descent. The algorithm is guaranteed to efficiently converge to the opponent's true strategy under standard Bayesian identifiability and visitation assumptions, given observations from gameplay and possibly additional historical data if it is available.

URL PDF HTML ☆

赞 0 踩 0

2512.23626 2026-06-01 cs.AI cs.LG 版本更新

Regret-Based Federated Causal Discovery with Unknown Interventions

基于遗憾的联邦因果发现与未知干预

Federico Baldo, Charles K. Assaad

发表机构 * Sorbonne Université（索邦大学）； INSERM（国家健康与医学研究院）； Institut Pierre Louis d’Epidémiologie et de Santé Publique（流行病学与公共卫生研究所）

AI总结提出I-PERI算法，通过恢复客户端图并集的CPDAG并利用跨客户端干预引起的结构差异定向额外边，得到更紧的Φ-马尔可夫等价类，解决联邦环境下未知客户端级干预的因果发现问题。

Comments ICML 2026

详情

AI中文摘要

大多数因果发现方法从观测数据中恢复一个表示马尔可夫等价类的完全部分有向无环图。最近的工作将这些方法扩展到联邦设置以解决数据去中心化和隐私约束，但通常假设所有客户端共享相同的因果模型，这在实践中不现实，因为客户端特定的策略或协议（例如不同医院）自然会导致异质且未知的干预。在这项工作中，我们解决了未知客户端级干预下的联邦因果发现问题。我们提出了I-PERI，一种新颖的联邦算法，首先恢复客户端图并集的CPDAG，然后通过利用跨客户端干预引起的结构差异来定向额外的边。这产生了一个更紧的等价类，我们称之为Φ-马尔可夫等价类，由Φ-CPDAG表示。我们提供了I-PERI收敛性及其隐私保护属性的理论保证，并在合成数据上进行了实证评估，证明了所提算法的有效性。

英文摘要

Most causal discovery methods recover a completed partially directed acyclic graph representing a Markov equivalence class from observational data. Recent work has extended these methods to federated settings to address data decentralization and privacy constraints, but often under idealized assumptions that all clients share the same causal model. Such assumptions are unrealistic in practice, as client-specific policies or protocols, for example, across hospitals, naturally induce heterogeneous and unknown interventions. In this work, we address federated causal discovery under unknown client-level interventions. We propose I-PERI, a novel federated algorithm that first recovers the CPDAG of the union of client graphs and then orients additional edges by exploiting structural differences induced by interventions across clients. This yields a tighter equivalence class, which we call the $\mathbfΦ$-Markov Equivalence Class, represented by the $\mathbfΦ$-CPDAG. We provide theoretical guarantees on the convergence of I-PERI, as well as on its privacy-preserving properties, and present empirical evaluations on synthetic data demonstrating the effectiveness of the proposed algorithm.

URL PDF HTML ☆

赞 0 踩 0

2512.20732 2026-06-01 cs.LG cs.AI cs.SE 版本更新

FEM-Bench: A Structured Scientific Reasoning Benchmark for Evaluating Code-Generating LLMs

FEM-Bench：评估代码生成大语言模型的结构化科学推理基准

Saeed Mohammadzadeh, Erfan Hamdi, Joel Shor, Emma Lejeune

发表机构 * Boston University（波士顿大学）； Move37 Labs（Move37实验室）； Department of Mechanical Engineering（机械工程系）

AI总结提出FEM-Bench基准，通过有限元方法相关编程任务评估大语言模型在科学计算中的结构化推理能力，实验表明现有模型尚不能稳定解决所有任务。

Comments 45 pages, 5 figures, 9 tables, 7 listings

详情

AI中文摘要

随着大语言模型在物理世界推理能力上的进步，缺乏严格基准来评估其生成科学有效物理模型的能力已成为一个关键缺口。计算力学开发和运用数学模型与数值方法，预测物理系统在力、变形和约束下的行为，为结构化科学推理评估提供了理想基础。问题遵循清晰的数学结构，强制执行严格的物理和数值约束，并支持客观验证。该学科要求构建物理系统的显式模型，并推理几何、空间关系和材料行为，直接联系到新兴的AI物理推理和世界建模目标。我们提出FEM-Bench，一个计算力学基准，旨在评估大语言模型生成正确有限元方法及相关代码的能力。FEM-Bench 2025包含一系列入门但非平凡的任务，与计算力学研究生第一门课程的材料一致。这些任务捕捉了基本的数值和物理建模挑战，同时仅代表该学科复杂性的很小一部分。尽管简单，最先进的大语言模型并不能可靠地解决所有任务。在五次尝试中，函数编写表现最好的模型Gemini 3 Pro至少一次完成了30/33个任务，五次全部完成26/33个任务。单元测试编写表现最好的模型GPT-5的平均联合成功率为73.8%。其他流行模型显示出广泛的性能差异。FEM-Bench为评估AI生成的科学代码建立了结构化基础，未来版本将纳入更复杂的任务以跟踪模型进展。

英文摘要

As LLMs advance their reasoning capabilities about the physical world, the absence of rigorous benchmarks for evaluating their ability to generate scientifically valid physical models has become a critical gap. Computational mechanics, which develops and applies mathematical models and numerical methods to predict the behavior of physical systems under forces, deformation, and constraints, provides an ideal foundation for structured scientific reasoning evaluation. Problems follow clear mathematical structure, enforce strict physical and numerical constraints, and support objective verification. The discipline requires constructing explicit models of physical systems and reasoning about geometry, spatial relationships, and material behavior, connecting directly to emerging AI goals in physical reasoning and world modeling. We introduce FEM-Bench, a computational mechanics benchmark designed to evaluate the ability of LLMs to generate correct finite element method (FEM) and related code. FEM-Bench 2025 contains a suite of introductory but nontrivial tasks aligned with material from a first graduate course on computational mechanics. These tasks capture essential numerical and physical modeling challenges while representing only a small fraction of the complexity present in the discipline. Despite their simplicity, state-of-the-art LLMs do not reliably solve all of them. In a five attempt run, the best performing model at function writing, Gemini 3 Pro, completed 30/33 tasks at least once and 26/33 tasks all five times. The best performing model at unit test writing, GPT-5, had an Average Joint Success Rate of 73.8%. Other popular models showed broad performance variation. FEM-Bench establishes a structured foundation for evaluating AI-generated scientific code, and future iterations will incorporate increasingly sophisticated tasks to track progress as models evolve.

URL PDF HTML ☆

赞 0 踩 0

2512.11779 2026-06-01 stat.ML cs.AI cs.LG 版本更新

InfiMed-ORBIT: 通过基于评分标准的增量训练使大语言模型对齐开放复杂任务

Pengkai Wang, Pengwei Liu, Qi Zuo, Zhijie Sang, Congkai Xie, Hongxia Yang

发表机构 * Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China（香港理工大学计算机系）； Department of Control Science and Engineering, Zhejiang University（浙江大学控制科学与工程学院）

AI总结提出ORBIT框架，利用动态生成的病例条件评分标准指导增量强化学习，仅用2k样本将Qwen3-4B-Instruct在HealthBench-Hard上的得分从7.0提升至27.5，达到同规模开源模型最优。

详情

AI中文摘要

强化学习（RL）推动了大语言模型（LLM）的许多近期突破，尤其是在奖励可自动计算的任务（如代码生成）中。然而，在开放式的医学对话中，RL效果较差，因为反馈模糊、依赖上下文，且难以简单总结为单一标量信号——通常需要高度监督的奖励模型，并存在奖励破解的风险。因此，我们引入了ORBIT，一个专为关键医学对话设计的基于评分标准的开放式增量训练框架。ORBIT将医学对话构建与动态生成的病例条件评分标准相结合，这些评分标准作为增量RL的自适应指南。与依赖外部医学知识库或手工规则的方法不同，ORBIT使用评分标准引导的评估，并可与通用指令遵循LLM一起实现，避免了任务特定的评判微调。仅使用2k训练样本，ORBIT将Qwen3-4B-Instruct的HealthBench-Hard得分从7.0提升至27.5，在相似规模的开源模型中实现了最先进的性能，同时随着评分标准覆盖范围的扩大，保持了良好的咨询质量。

英文摘要

Reinforcement learning (RL) has powered many recent breakthroughs in large language models (LLMs), especially for tasks where rewards can be computed automatically, such as code generation. However, it is less effective in open-ended medical dialogue, where feedback is ambiguous, context-dependent, and difficult to simply summarize into a single scalar signal-often requiring heavily supervised reward models and creating risks of reward hacking. Thus, we introduce ORBIT, an open-ended rubric-based incremental training framework tailored for critical medical dialogues. ORBIT integrates medical dialogue construction with dynamically generated case-conditioned rubrics that serve as adaptive guides for incremental RL. Unlike approaches that rely on external medical knowledge bases or handcrafted rules, ORBIT uses rubric-guided evaluation and can be implemented with general-purpose instruction-following LLMs, avoiding task-specific judge fine-tuning. With only 2k training samples, ORBIT raises Qwen3-4B-Instruct's HealthBench-Hard score from 7.0 to 27.5, achieving state-of-the-art performance among similarly sized open-source models while maintaining strong consultation quality as rubric coverage broadens.

URL PDF HTML ☆

赞 0 踩 0

2509.21379 2026-06-01 cs.CV cs.AI 版本更新

SAEmnesia: Erasing Concepts in Diffusion Models with Supervised Sparse Autoencoders

SAEmnesia：基于监督稀疏自编码器的扩散模型概念擦除

Enrico Cassano, Riccardo Renzulli, Marco Nurisso, Mirko Zaffaroni, Alan Perotti, Marco Grangetto

发表机构 * University of Turin, Italy（意大利都灵大学）； Intesa Sanpaolo AI Research, Italy（意大利Intesa Sanpaolo人工智能研究院）

AI总结提出监督稀疏自编码器框架SAEmnesia，通过强制一对一概念-神经元映射实现特征集中化，从而高效、精准地擦除扩散模型中的概念。

Comments Accepted at ICML 2026

详情

AI中文摘要

扩散模型中的概念遗忘受到特征分裂的阻碍，即概念分布在许多潜在特征上，使得移除它们具有挑战性且计算成本高。我们引入了SAEmnesia，一种监督稀疏自编码器框架，通过强制一对一的概念-神经元映射来克服这一问题。通过在训练过程中系统地标记概念，我们的方法实现了特征集中化，将每个概念绑定到一个可解释的神经元上。这使得概念擦除高度精准且高效。与最先进的基于稀疏自编码器的遗忘方法相比，SAEmnesia将超参数搜索减少了96.67%，并在UnlearnCanvas对象基准上实现了9.22%的提升。我们的方法在顺序遗忘中也表现出卓越的可扩展性，在移除九个对象时准确率提高了28.4%，为精确可控的概念擦除迈出了一步。此外，SAEmnesia在I2P基准上有效抑制了裸体内容，并对对抗攻击保持鲁棒性。源代码可在https://github.com/EIDOSLAB/SAEmnesia获取。

英文摘要

Concept unlearning in diffusion models is hampered by feature splitting, where concepts are distributed across many latent features, making their removal challenging and computationally expensive. We introduce SAEmnesia, a supervised sparse autoencoder framework that overcomes this by enforcing one-to-one concept-neuron mappings. By systematically labeling concepts during training, our method achieves feature centralization, binding each concept to a single, interpretable neuron. This enables highly targeted and efficient concept erasure. Compared to the state-of-the-art sparse autoencoder-based unlearning approach, SAEmnesia reduces hyperparameter search by 96.67% and achieves a 9.22% improvement on the UnlearnCanvas benchmark for objects. Our method also shows superior scalability in sequential unlearning, improving accuracy by 28.4% when removing nine objects, establishing a step forward for precise and controllable concept erasure. Moreover, SAEmnesia effectively suppresses nudity on the I2P benchmark and remains robust to adversarial attacks. Source code available at https://github.com/EIDOSLAB/SAEmnesia.

URL PDF HTML ☆

赞 0 踩 0

2511.19433 2026-06-01 cs.RO cs.AI cs.CV 版本更新

Mixture of Horizons in Action Chunking

动作分块中的视野混合

Dong Jing, Gang Wang, Jiaqi Liu, Weiliang Tang, Zelong Sun, Yunchao Yao, Zhenyu Wei, Yunhui Liu, Zhiwu Lu, Mingyu Ding

发表机构 * Renmin University of China（中国人民大学）； University of North Carolina at Chapel Hill（北卡罗来纳大学教堂山分校）； The Chinese University of Hong Kong（香港中文大学）

AI总结针对视觉-语言-动作模型中动作分块长度（视野）的权衡问题，提出混合视野策略，通过并行处理不同视野的动作片段并融合输出，同时提升长期预见与短期精度，实现性能与泛化性的改进。

Comments Accepted at ICML 2026

详情

AI中文摘要

视觉-语言-动作（VLA）模型在机器人操作中展现出显著能力，但其性能对训练中使用的$ extbf{动作分块长度}$（称为$ extbf{视野}$）敏感。我们的实证研究揭示了一个内在权衡：较长的视野提供更强的全局预见但降低细粒度精度，而较短的视野增强局部控制但在长期任务上表现不佳，这意味着固定选择单一视野是次优的。为缓解这一权衡，我们提出$ extbf{混合视野（MoH）}$策略。MoH将动作分块重新排列为多个不同视野的片段，通过共享动作变换器并行处理，并使用轻量线性门控融合输出。它具有三个吸引人的优点：1) MoH在单个模型中联合利用长期预见和短期精度，提高了复杂任务的性能和泛化能力。2) MoH对全注意力动作模块即插即用，训练或推理开销极小。3) MoH支持自适应视野的动态推理，通过跨视野共识选择稳定动作，实现比基线高2.5倍的吞吐量，同时保持优越性能。在基于流的策略$π_0$、$π_{0.5}$和单步回归策略$π_{ ext{reg}}$上的大量实验表明，MoH在仿真和真实世界任务上均取得一致且显著的提升。值得注意的是，在混合任务设置下，带有MoH的$π_{0.5}$在LIBERO上仅经过$30k$次训练迭代即达到99$\%$的平均成功率，创下新纪录。项目页面：https://timsty1.github.io/moh/

英文摘要

Vision-language-action (VLA) models have shown remarkable capabilities in robotic manipulation, but their performance is sensitive to the $\textbf{action chunk length}$ used during training, termed $\textbf{horizon}$. Our empirical study reveals an inherent trade-off: longer horizons provide stronger global foresight but degrade fine-grained accuracy, while shorter ones sharpen local control yet struggle on long-term tasks, implying fixed choice of single horizons being suboptimal. To mitigate the trade-off, we propose a $\textbf{mixture of horizons (MoH)}$ strategy. MoH rearranges the action chunk into several segments with different horizons, processes them in parallel with a shared action transformer, and fuses outputs with a light linear gate. It has three appealing benefits. 1) MoH exploits long-term foresight and short-term precision jointly within a single model, improving both performance and generalizability to complex tasks. 2) MoH is plug-and-play for full-attention action modules with minimal training or inference overhead. 3) MoH enables dynamic inference with adaptive horizons, which selects stable actions through cross-horizon consensus, achieving 2.5$\times$ higher throughput than baselines while preserving superior performance. Extensive experiments over flow-based policies $π_0$, $π_{0.5}$, and one-step regression policy $π_{\text{reg}}$ demonstrate that MoH yields consistent and significant gains on both simulations and real-world tasks. Notably, under mixed-task setting, $π_{0.5}$ with MoH reaches a new state-of-the-art with 99$\%$ average success rate on LIBERO after only $30k$ training iterations. Project page: https://timsty1.github.io/moh/

URL PDF HTML ☆

赞 0 踩 0

2511.18760 2026-06-01 cs.AI cs.FL 版本更新

迈向人性化的社交媒体生态系统：面向安全、自主与福祉的AI增强人机交互设计模式

Mohd Ruhul Ameen, Akif Islam

发表机构 * College of Engineering（工程学院）； Computer Sciences Marshall University Huntington, WV, USA（计算机科学马歇尔大学亨廷顿州威斯康星州）； Department of Computer Science（计算机科学系）； Engineering University of Rajshahi Rajshahi 6205, Bangladesh（工程 Rajshahi 大学 Rajshahi 6205 巴基斯坦）

AI总结提出Human-Layer AI（HL-AI）框架，通过浏览器端用户拥有的可解释中介，在不依赖平台合作的情况下赋予用户实时控制权，实现内容重写、完整性检测、信息流定制、行为中断和恢复模式等五种设计模式，以提升社交媒体安全性与用户福祉。

Comments 6 pages, 5 tables, 7 figures, and 2 algorithm tables. Accepted at International Conference on Signal Processing, Information, Communication and Systems (SPICSCON 2025)

详情

DOI: 10.1109/SPICSCON69221.2025.11504070
Journal ref: 2025 IEEE International Conference on Signal Processing, Information, Communication and Systems (SPICSCON)

AI中文摘要

社交平台连接了数十亿人，但其以参与度优先的算法往往对用户施加影响而非与用户协作，加剧了压力、虚假信息和失控感。我们提出Human-Layer AI（HL-AI）——用户拥有的、可解释的中介，位于浏览器中平台逻辑与界面之间。HL-AI赋予人们实用的、即时的控制权，无需平台合作。我们贡献了一个可用的Chrome/Edge原型，实现了五种代表性模式框架——上下文感知帖子重写器、帖子完整性检测器、精细信息流策展器、微退出代理和恢复模式——以及一个统一的数学公式，平衡用户效用、自主成本和风险阈值。评估涵盖技术准确性、可用性和行为结果。结果是一套人性化的控制手段，帮助用户在伤害发生前重写内容、通过完整性提示阅读、有意图地调整信息流、暂停强迫性循环以及在骚扰期间寻求庇护，同时通过解释和覆盖选项保留自主权。该原型为改造当今的信息流以融入安全性、自主性和福祉提供了实用路径，并邀请进行严格的跨文化用户评估。

英文摘要

Social platforms connect billions of people, yet their engagement-first algorithms often work on users rather than with them, amplifying stress, misinformation, and a loss of control. We propose Human-Layer AI (HL-AI)--user-owned, explainable intermediaries that sit in the browser between platform logic and the interface. HL-AI gives people practical, moment-to-moment control without requiring platform cooperation. We contribute a working Chrome/Edge prototype implementing five representative pattern frameworks--Context-Aware Post Rewriter, Post Integrity Meter, Granular Feed Curator, Micro-Withdrawal Agent, and Recovery Mode--alongside a unifying mathematical formulation balancing user utility, autonomy costs, and risk thresholds. Evaluation spans technical accuracy, usability, and behavioral outcomes. The result is a suite of humane controls that help users rewrite before harm, read with integrity cues, tune feeds with intention, pause compulsive loops, and seek shelter during harassment, all while preserving agency through explanations and override options. This prototype offers a practical path to retrofit today's feeds with safety, agency, and well-being, inviting rigorous cross-cultural user evaluation.

URL PDF HTML ☆

赞 0 踩 0

2511.04393 2026-06-01 cs.AI 版本更新

Post-Training LLMs as Better Decision-Making Agents: A Regret-Minimization Approach

将LLM后训练为更好的决策智能体：一种遗憾最小化方法

Chanwoo Park, Ziyang Chen, Asuman Ozdaglar, Kaiqing Zhang

发表机构 * Massachusetts Institute of Technology（麻省理工学院）； University of Maryland, College Park（马里兰大学哥伦比亚学院）

AI总结提出迭代遗憾最小化微调（Iterative RMFT），通过反复蒸馏低遗憾决策轨迹来后训练LLM，提升其在在线决策任务中的表现，无需依赖已知算法或人工模板。

Comments Camera ready version of ICML 2026

详情

AI中文摘要

大型语言模型（LLM）越来越多地被部署为交互式和动态环境中的决策智能体。然而，由于它们最初并非为决策设计，最近的研究表明，LLM即使在基本的在线决策问题中也可能表现不佳，无法实现低遗憾或有效的探索-利用权衡。为了解决这个问题，我们引入了迭代遗憾最小化微调（Iterative RMFT），这是一种后训练过程，反复将低遗憾决策轨迹蒸馏回基础模型。在每次迭代中，模型生成多个决策轨迹，选择k个最低遗憾的轨迹，并在此基础上进行微调。与先前方法（a）从已知决策算法中蒸馏动作序列或（b）依赖人工设计的思维链模板不同，我们的方法利用遗憾度量来激发模型自身的决策能力和推理依据。这种对模型生成推理的依赖避免了僵化的输出工程，并提供了更灵活、自然语言的训练信号。实验结果表明，Iterative RMFT在多种模型上提升了LLM的决策性能——从具有数值输入/输出的Transformer，到开源权重LLM，再到像GPT-4o mini这样的先进闭源模型。其在输出和推理格式上的灵活性使其能够泛化到具有不同时间范围、动作空间、奖励过程和自然语言上下文的任务。最后，我们提供了理论见解，表明在这种范式下，单层Transformer可以在简化设置中充当无遗憾学习器。总体而言，Iterative RMFT为增强LLM的决策能力提供了一个有原则且通用的后训练框架。

英文摘要

Large language models (LLMs) are increasingly deployed as "agents" for decision-making (DM) in interactive and dynamic environments. Yet, since they were not originally designed for DM, recent studies show that LLMs can struggle even in basic online DM problems, failing to achieve low regret or an effective exploration-exploitation tradeoff. To address this, we introduce Iterative Regret-Minimization Fine-Tuning (Iterative RMFT), a post-training procedure that repeatedly distills low-regret decision trajectories back into the base model. At each iteration, the model rolls out multiple decision trajectories, selects the k-lowest regret ones, and fine-tunes itself on them. Unlike prior methods that (a) distill action sequences from known DM algorithms or (b) rely on manually crafted chain-of-thought templates, our approach leverages the regret metric to elicit the model's own DM ability and reasoning rationales. This reliance on model-generated reasoning avoids rigid output engineering and provides more flexible, natural-language training signals. Empirical results show that Iterative RMFT improves LLMs' DM performance across diverse models - from Transformers with numerical input/output, to open-weight LLMs, and advanced closed-weight models like GPT-4o mini. Its flexibility in output and reasoning formats enables generalization across tasks with varying horizons, action spaces, reward processes, and natural-language contexts. Finally, we provide theoretical insight showing that a single-layer Transformer under this paradigm can act as a no-regret learner in a simplified setting. Overall, Iterative RMFT offers a principled and general post-training framework for enhancing LLMs' decision-making capabilities.

URL PDF HTML ☆

赞 0 踩 0

2511.03100 2026-06-01 cs.LG cs.AI cs.MA 版本更新

Scaling Multi-Agent Environment Co-Design with Diffusion Models

基于扩散模型的多智能体环境协同设计扩展

Hao Xiang Li, Michael Amir, Amanda Prorok

发表机构 * Department of Computer Science, University of Cambridge, Cambridge, United Kingdom（剑桥大学计算机科学系，剑桥，英国）

AI总结提出扩散协同设计（DiCoDe）框架，通过投影通用引导（PUG）和评论家蒸馏机制，实现高维环境设计空间下的可扩展、样本高效的智能体-环境协同优化。

详情

AI中文摘要

智能体-环境协同设计范式联合优化智能体策略和环境配置，以寻求系统性能提升。其应用领域从仓库物流到风电场管理，有望从根本上改变多智能体系统的部署方式。然而，当前的协同设计方法难以扩展：在高维环境设计空间下失效，且在处理联合优化中固有的移动目标时样本效率低下。我们通过开发扩散协同设计（DiCoDe）来应对这些挑战，这是一个可扩展且样本高效的协同设计框架，将协同设计推向实际相关场景。DiCoDe包含两项核心创新。首先，我们引入投影通用引导（PUG），这是一种采样技术，使DiCoDe能够在满足硬约束（如障碍物之间的空间间隔）的同时，探索奖励最大化环境的分布。其次，我们设计了一种评论家蒸馏机制，以共享来自强化学习评论家的知识，确保引导扩散模型利用密集且最新的学习信号适应不断演化的智能体策略。在具有挑战性的多智能体环境协同设计基准（包括仓库自动化、多智能体路径规划和风电场优化）上验证时，这些改进共同产生了更优的环境-策略对。我们的方法持续超越现有技术，例如在仓库场景中，以少66%的仿真样本实现了39%更高的奖励。这为智能体-环境协同设计设立了新标准，并向着在现实世界中收获协同设计成果迈出了关键一步。

英文摘要

The agent-environment co-design paradigm jointly optimises agent policies and environment configurations in search of improved system performance. With application domains ranging from warehouse logistics to windfarm management, co-design promises to fundamentally change how we deploy multi-agent systems. However, current co-design methods struggle to scale. They collapse under high-dimensional environment design spaces and suffer from sample inefficiency when addressing moving targets inherent to joint optimisation. We address these challenges by developing Diffusion Co-Design (DiCoDe), a scalable and sample-efficient co-design framework pushing co-design towards practically relevant settings. DiCoDe incorporates two core innovations. First, we introduce Projected Universal Guidance (PUG), a sampling technique that enables DiCoDe to explore a distribution of reward-maximising environments while satisfying hard constraints such as spatial separation between obstacles. Second, we devise a critic distillation mechanism to share knowledge from the reinforcement learning critic, ensuring that the guided diffusion model adapts to evolving agent policies using a dense and up-to-date learning signal. Together, these improvements lead to superior environment-policy pairs when validated on challenging multi-agent environment co-design benchmarks including warehouse automation, multi-agent pathfinding and wind farm optimisation. Our method consistently exceeds the state-of-the-art, achieving, for example, 39% higher rewards in the warehouse setting with 66% fewer simulation samples. This sets a new standard in agent-environment co-design, and is a stepping stone towards reaping the rewards of co-design in real world domains.

URL PDF HTML ☆

赞 0 踩 0

2503.05846 2026-06-01 cs.CL cs.AI 版本更新

EMCEE: Improving Multilingual Capability of LLMs via Bridging Knowledge and Reasoning with Extracted Synthetic Multilingual Context

EMCEE：通过提取合成多语言上下文桥接知识与推理以提升大语言模型的多语言能力

Hamin Koo, Jaehyung Kim

发表机构 * Yonsei University（延世大学）

AI总结提出EMCEE框架，通过从LLM自身提取并融合语言特定知识，结合推理输出，显著提升多语言任务性能，尤其在低资源语言上平均提升31.7%。

Comments ACL 2026 Main

详情

AI中文摘要

大语言模型（LLMs）在广泛任务中取得了显著进展，但其对以英语为中心的训练数据的严重依赖导致在非英语语言中性能大幅下降。虽然现有的多语言提示方法强调将查询重新表述为英语或增强推理能力，但它们往往未能融入对某些查询至关重要的语言和文化特定基础。为了解决这一局限性，我们提出了EMCEE（提取合成多语言上下文并合并），一个简单而有效的框架，通过从LLM自身显式提取和利用查询相关知识来增强其多语言能力。具体来说，EMCEE首先提取合成上下文以揭示LLM中编码的潜在语言特定知识，然后通过基于判断的选择机制动态地将这种上下文见解与面向推理的输出合并。在涵盖多种语言和任务的四个多语言基准上的大量实验表明，EMCEE始终优于先前的方法，总体平均相对提升16.4%，在低资源语言中提升31.7%。

英文摘要

Large Language Models (LLMs) have achieved impressive progress across a wide range of tasks, yet their heavy reliance on English-centric training data leads to significant performance degradation in non-English languages. While existing multilingual prompting methods emphasize reformulating queries into English or enhancing reasoning capabilities, they often fail to incorporate the language- and culture-specific grounding that is essential for some queries. To address this limitation, we propose EMCEE (Extracting synthetic Multilingual Context and merging), a simple yet effective framework that enhances the multilingual capabilities of LLMs by explicitly extracting and utilizing query-relevant knowledge from the LLM itself. In particular, EMCEE first extracts synthetic context to uncover latent, language-specific knowledge encoded within the LLM, and then dynamically merges this contextual insight with reasoning-oriented outputs through a judgment-based selection mechanism. Extensive experiments on four multilingual benchmarks covering diverse languages and tasks demonstrate that EMCEE consistently outperforms prior approaches, achieving an average relative improvement of 16.4% overall and 31.7% in low-resource languages.

URL PDF HTML ☆

赞 0 踩 0

2510.11683 2026-06-01 cs.LG cs.AI cs.CL 版本更新

深度持续学习中的谱坍缩导致塑性丧失

Arjun Prakash, Naicheng He, Kaicheng Guo, Saket Tiwari, Ruo Yu Tao, Tyrone Serapio, Amy Greenwald, George Konidaris

发表机构 * Department of Computer Science, Brown University（布朗大学计算机科学系）

AI总结研究深度神经网络在持续学习中塑性丧失的原因，发现新任务初始化时的Hessian谱坍缩是主要因素，并提出基于Kronecker分解的两种正则化方法以保持塑性。

详情

AI中文摘要

我们研究为什么深度神经网络在持续学习中会丧失塑性，从而在不重新初始化参数的情况下无法学习新任务。我们表明，这种失败之前在新任务初始化时会出现Hessian谱坍缩，其中有意义的曲率方向消失，梯度下降变得无效。通过分析线性化ReLU网络，我们推导出成功训练的显式$ε$-秩条件，并证明损失加权Gram矩阵在谱上与广义高斯-牛顿近似等价，从而将NTK动力学与Hessian曲率联系起来。直接针对谱坍缩，我们讨论了Hessian的Kronecker因子近似，这激发了两种正则化增强：保持高有效特征秩和应用L2惩罚。在持续监督学习和强化学习任务上的实验证实，结合这两种正则化器可以有效保持塑性。

英文摘要

We investigate why deep neural networks suffer from loss of plasticity in continual learning, and thus fail to learn new tasks without reinitializing parameters. We show that this failure is preceded by Hessian spectral collapse at new-task initialization, where meaningful curvature directions vanish and gradient descent becomes ineffective. Analyzing a linearized ReLU network, we derive explicit $ε$-rank conditions for successful training and prove that the loss-weighted Gram matrix is spectrally equivalent to the Generalized Gauss-Newton approximation, thereby relating NTK dynamics to Hessian curvature. Targeting spectral collapse directly, we then discuss the Kronecker factored approximation of the Hessian, which motivates two regularization enhancements: maintaining high effective feature rank and applying L2 penalties. Experiments on continual supervised and reinforcement learning tasks confirm that combining these two regularizers effectively preserves plasticity.

URL PDF HTML ☆

赞 0 踩 0

2506.11653 2026-06-01 cs.CV cs.AI cs.LG 版本更新

DISCO: Mitigating Bias in Deep Learning with Conditional Distance Correlation

DISCO: 使用条件距离相关性减轻深度学习中的偏差

Emre Kavak, Tom Nuno Wolf, Christian Wachinger

发表机构 * Technical University of Munich, Germany（慕尼黑技术大学）； Konrad Zuse School of Excellence in Reliable AI, Germany（Konrad Zuse可靠性人工智能卓越学院）； Munich Center for Machine Learning (MCML), Germany（慕尼黑机器学习中心（MCML））

AI总结提出基于反因果模型的条件独立性准则，并设计条件距离相关性的高效估计器DISCO$_m$和sDISCO，通过正则化实现梯度模型中的偏差缓解，在多个数据集上优于或媲美现有方法。

Comments Accepted to ICML 2026 (oral)

2509.00834 2026-06-01 cs.AI cs.FL cs.LG cs.LO 版本更新

大型语言模型中推理时间不确定性的人类对齐与校准

Kyle Moore, Jesse Roberts, Daryl Watson

AI总结本文评估了多种推理时间不确定性度量，发现它们与人类群体不确定性高度对齐，尽管与人类答案偏好不一致，但在正确性相关性和分布分析上表现出中等到强校准证据。

Comments We have discovered a critical error in the normalized entropy calculation that may have substantially inflated nearly all results herein. We have since fixed this error in a new work, but we believe that the new work is sufficiently dissimilar in focus, methods, dataset, and results as to be misleading if presented as a simple replacement. As such, we propose removal and retraction instead

详情

AI中文摘要

最近，评估大型语言模型的不确定性校准引起了广泛关注，以促进模型控制和调节用户信任。推理时间不确定性可能为模型或外部控制模块提供实时信号，对于应用这些概念以改善LLM用户体验尤为重要。尽管许多现有论文考虑模型校准，但相对较少的工作试图评估模型不确定性与人类不确定性的对齐程度。在这项工作中，我们使用既有度量和新颖变体评估了一系列推理时间不确定性度量，以确定它们与人类群体水平不确定性以及传统模型校准概念的接近程度。我们发现，许多度量显示出与人类不确定性强烈对齐的证据，尽管与人类答案偏好缺乏对齐。对于那些成功的度量，我们在正确性相关性和分布分析方面发现了中等到强校准证据。

英文摘要

There has been much recent interest in evaluating large language models for uncertainty calibration to facilitate model control and modulate user trust. Inference time uncertainty, which may provide a real-time signal to the model or external control modules, is particularly important for applying these concepts to improve LLM-user experience in practice. While many of the existing papers consider model calibration, comparatively little work has sought to evaluate how closely model uncertainty aligns to human uncertainty. In this work, we evaluate a collection of inference-time uncertainty measures, using both established metrics and novel variations, to determine how closely they align with both human group-level uncertainty and traditional notions of model calibration. We find that numerous measures show evidence of strong alignment to human uncertainty, even despite the lack of alignment to human answer preference. For those successful metrics, we find moderate to strong evidence of model calibration in terms of both correctness correlation and distributional analysis.

URL PDF HTML ☆

赞 0 踩 0

2411.19463 2026-06-01 cs.SE cs.AI 版本更新

Understanding the Fundamental Design Decisions of Retrieval-Augmented Generation Systems

理解检索增强生成系统的基本设计决策

Shengming Zhao, Yuchen Shao, Yuheng Huang, Jiayang Song, Zhijie Wang, Chengcheng Wan, Lei Ma

发表机构 * Fudan University（复旦大学）； East China Normal University（华东师范大学）； Shanghai Innovation Institute（上海创新研究院）； The University of Tokyo（东京大学）； Macau University of Science and Technology（澳门科学理工学院）； Concordia University（Concordia大学）； University of Alberta（阿尔伯塔大学）； The University of Tokyo, Japan（日本东京大学）

AI总结本文通过系统实验，研究了RAG部署中的三个关键决策（是否部署、检索量、知识集成方式），揭示了任务和模型依赖的优化策略，为实践者提供基于证据的指导。

详情

DOI: 10.1145/3802824
Journal ref: ACM Transactions on Software Engineering and Methodology (TOSEM), 2026

AI中文摘要

检索增强生成（RAG）已成为增强大型语言模型（LLM）能力的关键技术。然而，实践者在做出RAG部署决策时面临重大挑战。尽管现有研究优先考虑算法创新，但在理解决定RAG成功的基本工程权衡方面仍存在系统性空白。我们首次对三个通用的RAG部署决策进行了全面研究：是否部署RAG、检索多少信息以及如何有效集成检索到的知识。通过在三个LLM和六个数据集（涵盖问答和代码生成任务）上的系统实验，我们揭示了关键见解：（1）RAG部署必须高度选择性，即使有完美文档，可变召回阈值和失败模式也会影响多达12.6%的样本。（2）最优检索量表现出任务依赖性：问答任务呈现通用模式（5-10个文档最优），而代码生成需要针对场景的优化。（3）知识集成有效性取决于任务和模型特性，代码生成从提示方法中显著受益，而问答任务改进甚微。这些发现表明，通用的RAG策略是不够的。有效的RAG系统需要基于任务特性和模型能力的上下文感知设计决策。我们的分析为实践者提供了基于证据的指导，并为原则性RAG部署建立了基础见解。我们的代码、数据和工件公开于https://github.com/ShengmingZ/RAG_Benchmark_Code_QA。

英文摘要

Retrieval-Augmented Generation (RAG) has emerged as a critical technique for enhancing large language model (LLM) capabilities. However, practitioners face significant challenges when making RAG deployment decisions. While existing research prioritizes algorithmic innovations, a systematic gap persists in understanding fundamental engineering trade-offs that determine RAG success. We present the first comprehensive study of three universal RAG deployment decisions: whether to deploy RAG, how much information to retrieve, and how to integrate retrieved knowledge effectively. Through systematic experiments across three LLMs and six datasets spanning question answering and code generation tasks, we reveal critical insights: (1) RAG deployment must be highly selective, with variable recall thresholds and failure modes affecting up to 12.6\% of samples even with perfect documents. (2) Optimal retrieval volume exhibits task-dependent behavior QA tasks show universal patterns (5-10 documents optimal) while code generation requires scenario-specific optimization. (3) Knowledge integration effectiveness depends on task and model characteristics, with code generation benefiting significantly from prompting methods while question answering shows minimal improvement. These findings demonstrate that universal RAG strategies prove inadequate. Effective RAG systems require context-aware design decisions based on task characteristics and model capabilities. Our analysis provides evidence-based guidance for practitioners and establishes foundational insights for principled RAG deployment. Our code, data and artifacts are publicly available at https://github.com/ShengmingZ/RAG_Benchmark_Code_QA.

URL PDF HTML ☆

赞 0 踩 0

2507.11075 2026-06-01 cs.CV cs.AI 版本更新

Joint angle based learning to refine kinematic human pose estimation

基于关节角度学习的运动学人体姿态估计精化

Chang Peng, Yifei Zhou, Haoqiang Ren, Shiqing Huang, Chuangye Chen, Jianming Yang, Bao Yang, Huifeng Xi, Zhenyu Jiang

发表机构 * Department of Engineering Mechanics, School of Civil Engineering and Transportation, South China University of Technology（工程力学系，交通工程学院，华南理工大学）； School of Mechanics and Construction Engineering, Jinan University（机械与建筑工程学院，暨南大学）； Guangdong Provincial Key Laboratory of Speed Capability, School of Physical Education, Jinan University（广东省速度能力重点实验室，暨南大学体育学院）

AI总结提出一种基于关节角度的双向循环网络后处理模块，利用高阶傅里叶级数近似生成可靠真值，以精化单图像人体姿态估计，纠正错误关键点并平滑轨迹。

详情

AI中文摘要

无标记人体姿态估计（HPE）在各个领域中的应用日益增多。当前的HPE在分析运动学人体姿态时，偶尔会出现关键点识别错误和关键点轨迹随机波动的问题。现有基于深度学习的HPE精化模型的性能受到训练数据集（关键点手动标注）不准确的显著限制。本文提出了一种新方法克服这一困难，关键技术包括：(i) 基于关节角度的运动学人体姿态鲁棒描述；(ii) 使用高阶傅里叶级数近似关节角度的时间变化以获得可靠的“真值”；(iii) 设计双向循环网络作为后处理模块，以精化基于单图像的HPE模型的估计。使用我们方法构建的高质量数据集训练后，该网络在纠正错误识别关节和平滑其时空轨迹方面表现出卓越性能。测试表明，在花样滑冰和霹雳舞等挑战性案例中，基于关节角度的精化（JAR）优于最先进的HPE精化网络。JAR还展示了纠正现有数据集的巨大潜力。

英文摘要

Marker-free human pose estimation (HPE) has found increasing applications in various fields. Current HPE suffers from occasional errors in keypoint recognition and random fluctuation in keypoint trajectories when analyzing kinematic human poses. The performance of existing deep learning-based models for HPE refinement is considerably limited by inaccurate training datasets in which the keypoints are manually annotated. This paper proposed a novel method to overcome the difficulty, in which the key techniques include: (i) A robust joint angle-based description of kinematic human poses; (ii) Approximating temporal variation of joint angles using high order Fourier series to get reliable "ground truth"; (iii) A bidirectional recurrent network is designed as a post-processing module to refine the estimation of single image-based HPE models. Trained with the high-quality dataset constructed using our method, the network demonstrates outstanding performance to correct wrongly recognized joints and smooth their spatiotemporal trajectories. Tests show that joint angle-based refinement (JAR) outperforms the state-of-the-art HPE refinement network in challenging cases like figure skating and breaking. JAR also demonstrates great potential to rectify existing datasets.

URL PDF HTML ☆

赞 0 踩 0

2507.05488 2026-06-01 cs.AI cs.CY 版本更新

OLG++: A Semantic Extension of Obligation Logic Graph

OLG++：义务逻辑图的语义扩展

Subhasis Dasgupta, Jon Stephens, Amarnath Gupta

发表机构 * University of California San Diego（加州大学圣地亚哥分校）

AI总结提出OLG++，通过引入空间、时间、当事人组、可废止性和逻辑分组等节点与边类型，扩展义务逻辑图以建模市政和跨司法管辖区的法规规则，并通过食品商业法规示例展示其在法律问答中的应用。

详情

利用语言模型从时间序列中推断事件

Mingtian Tan, Mike A. Merrill, Zack Gottesman, Tim Althoff, David Evans, Tom Hartvigsen

发表机构 * University of Virginia（弗吉尼亚大学）； Stanford University（斯坦福大学）； University of Washington（华盛顿大学）

AI总结研究大型语言模型能否从时间序列数据中推断自然语言事件，提出自动化任务生成方法和新基准，并通过蒸馏与强化学习提升小模型性能。

Comments 21 pages, 15 Figures

2411.13865 2026-06-01 cs.IR cs.AI cs.CL cs.LG 版本更新

Breaking Information Cocoons: A Hyperbolic Framework for Balancing Exploration and Exploitation in Recommender Systems

打破信息茧房：推荐系统中平衡探索与利用的双曲框架

Qiyao Ma, Menglin Yang, Mingxuan Ju, Tong Zhao, Neil Shah, Rex Ying

发表机构 * University of California, Davis（加州大学戴维斯分校）； The Hong Kong University of Science（香港科学大学）； Snap Inc.（Snap公司）； Yale University（耶鲁大学）

AI总结提出双曲框架HERec，通过语义增强的层次机制和自动层次聚类，在推荐系统中平衡探索与利用，有效缓解信息茧房。

Comments Accepted to KDD 2026. Code: https://github.com/Martin-qyma/HERec

详情

AI中文摘要

现代推荐系统常常形成信息茧房，限制用户接触多样化内容。核心挑战在于平衡内容探索与利用，同时允许用户调整推荐偏好。理想情况下，这种平衡可以通过层次表示来捕捉，其中深度搜索促进利用，广度搜索促进探索。然而，现有方法面临两个基本限制：欧几里得方法难以捕捉层次结构，而双曲方法尽管在层次建模上表现优越，但缺乏对用户和物品画像的语义理解，且未能提供平衡探索与利用的原则性机制。为解决这些问题，我们提出HERec，一个在推荐系统中有效平衡探索与利用的双曲框架。我们的框架引入两项关键创新：（1）语义增强的层次机制，直接在双曲空间中将丰富的文本描述与协同信息对齐。理论梯度分析表明，这种对齐有效利用了底层双曲流形结构，从而更准确地建模用户和物品；（2）通过优化Dasgupta代价的自动层次聚类机制，无需预定义超参数即可发现层次结构，实现用户可调节的探索-利用权衡。大量实验表明，HERec持续优于欧几里得和双曲基线，在效用指标上提升高达5.49%，多样性指标提升11.39%，有效缓解了信息茧房。

英文摘要

Modern recommender systems often create information cocoons, restricting users' exposure to diverse content. The central challenge is to balance content exploration and exploitation while allowing users to adjust their recommendation preferences. Ideally, this balance can be captured with a hierarchical representation, where depth search facilitates exploitation and breadth search enables exploration. However, existing approaches face two fundamental limitations: Euclidean methods struggle to capture hierarchical structures, while hyperbolic methods, despite their superior hierarchical modeling, lack semantic understanding of user and item profiles and fail to provide a principled mechanism for balancing exploration and exploitation. To address these challenges, we propose HERec, a hyperbolic framework that effectively balances exploration and exploitation in recommender systems. Our framework introduces two key innovations: (1) a semantic-enhanced hierarchical mechanism that aligns rich textual descriptions with collaborative information directly in hyperbolic space. Theoretical gradient analysis demonstrates that this alignment effectively leverages the underlying hyperbolic manifold structure, resulting in more accurate modeling of users and items; (2) an automatic hierarchical clustering mechanism by optimizing Dasgupta's cost, which discovers hierarchical structures without requiring predefined hyperparameters, enabling user-adjustable exploration-exploitation trade-offs. Extensive experiments demonstrate that HERec consistently outperforms both Euclidean and hyperbolic baselines, achieving up to 5.49% improvement in utility metrics and 11.39% increase in diversity metrics, effectively mitigating information cocoons.

URL PDF HTML ☆

赞 0 踩 0

2409.14583 2026-06-01 cs.AI 版本更新

LLM Bias Evaluation: Gender, Racial, and Age Disparities in Occupational and Crime Scenarios

LLM偏差评估：职业与犯罪场景中的性别、种族和年龄差异

Vishal Mirza, Rahul Kulkarni, Aakanksha Jadhav

发表机构 * New York University（纽约大学）； Northeastern University（东北大学）； Washington University in St. Louis（圣路易斯华盛顿大学）

AI总结本文评估了2024年四大领先LLM在职业和犯罪场景中的性别、种族和年龄偏差，发现去偏努力常导致新的公平性权衡，即“去偏悖论”。

Comments Updated title and abstract to emphasize key findings on the debiasing paradox for improved discoverability. Content and findings unchanged. 11 pages, 17 figures, Accepted at IEEE Conference on Artificial Intelligence (IEEE CAI) 2025. Full Paper acceptance in the Vertical HUMAN-CENTERED AI category

详情

DOI: 10.1109/CAI64502.2025.00045
Journal ref: 2025 IEEE Conference on Artificial Intelligence (CAI)

AI中文摘要

LLM偏差评估至关重要，因为大型语言模型（LLM）越来越多地影响高风险决策。本文对领先LLM中的性别、种族和年龄差异进行了全面评估，揭示出去偏努力常常创造新的公平性权衡。近年来LLM的进展显著，但由于各种限制，企业广泛采用仍然有限。本文考察了LLM中的偏差——这是一个影响其可用性、可靠性和公平性的关键问题。我们的研究评估了2024年发布的四个领先LLM（Gemini 1.5 Pro、Llama 3 70B、Claude 3 Opus和GPT-4o）在职业场景中的性别偏差以及犯罪场景中的性别、年龄和种族偏差。结果显示，LLM在各种职业中描绘女性角色的频率往往高于男性，与美国劳工统计局数据相比偏差达37%。在犯罪场景中，与美国联邦调查局数据的偏差在性别上为54%，种族上为28%，年龄上为17%。关键的是，我们观察到减少性别和种族偏差的努力常常导致过度偏向某一子类的结果，可能加剧差异——这种“去偏悖论”凸显了当前偏差缓解技术的局限性，并强调了更有效方法的必要性。

英文摘要

LLM bias evaluation is critical as large language models (LLMs) increasingly influence high-stakes decisions. This paper provides a comprehensive assessment of gender, racial, and age disparities in leading LLMs, revealing that debiasing efforts often create new fairness trade-offs. Recent advancements in LLMs have been notable, yet widespread enterprise adoption remains limited due to various constraints. This paper examines bias in LLMs - a crucial issue affecting their usability, reliability, and fairness. Our study evaluates gender bias in occupational scenarios and gender, age, and racial bias in crime scenarios across four leading LLMs released in 2024: Gemini 1.5 Pro, Llama 3 70B, Claude 3 Opus, and GPT-4o. Findings reveal that LLMs often depict female characters more frequently than male ones in various occupations, showing a 37% deviation from US BLS data. In crime scenarios, deviations from US FBI data are 54% for gender, 28% for race, and 17% for age. Critically, we observe that efforts to reduce gender and racial bias often lead to outcomes that may over-index one sub-class, potentially exacerbating disparities - a "debiasing paradox" that highlights the limitations of current bias mitigation techniques and underscores the need for more effective approaches.

URL PDF HTML ☆

赞 0 踩 0

2501.01926 2026-06-01 cs.CV cs.AI 版本更新

Cross-Modal Attention Calibration for LVLM Hallucination Mitigation

跨模态注意力校准用于LVLM幻觉缓解

Jiaming Li, Jiacheng Zhang, Zequn Jie, Lin Ma, Guanbin Li

发表机构 * Sun Yat-sen University（中山大学）； The University of Hong Kong（香港大学）； Meituan（美团）； Inspur Database Technology（Inspur数据库技术）； Guilin University of Electronic Technology（桂林电子科技大学）； Shenzhen Loop Area Institute（深圳环湖院）； Guangdong Key Laboratory of Big Data Analysis and Processing（广东大数据分析与处理重点实验室）

AI总结提出一种无需训练的跨模态注意力校准方法，通过设计模态间解码和位置校准模块，缓解大型视觉语言模型中的幻觉问题。

Comments CVPR2026

详情

AI中文摘要

大型视觉语言模型（LVLM）在视觉-语言理解方面表现出显著能力。尽管取得了成功，LVLM在复杂生成任务中仍然会产生幻觉，导致视觉输入与生成内容不一致。为了解决这个问题，一些方法引入了推理时干预，如对比解码，以减少对语言先验的过度依赖。然而，这些方法忽略了由位置偏差和虚假跨模态相关性引起的幻觉。在本文中，我们提出了一种跨模态注意力校准（CMAC）方法，以无需训练的方式缓解LVLM中的幻觉。在该方法中，我们设计了一个模态间解码（IMD）模块，通过一种新颖的对比解码机制来减轻幻觉。IMD将具有显著跨模态注意力权重的值向量掩蔽为失真，从而同时解决了单模态过度依赖和误导性跨模态相关性问题。此外，跨模态位置校准（CMPC）模块缩小了图像标记的位置差距，缓解了跨模态注意力中的位置偏差。在多种幻觉基准上的实验结果验证了我们的方法在减少LVLM幻觉方面优于现有最先进技术。我们的代码将在https://github.com/lijm48/IMCCD上提供。

英文摘要

Large vision-language models (LVLMs) have shown remarkable capabilities in visual-language understanding. Despite their success, LVLMs still suffer from generating hallucinations in complex generation tasks, leading to inconsistencies between visual inputs and generated content. To address this issue, some approaches have introduced inference-time interventions, such as contrastive decoding, to reduce overreliance on language priors. However, these approaches overlook hallucinations stemming from position bias and spurious inter-modality correlations. In this paper, we propose a Cross-Modal Attention Calibration (CMAC) method to mitigate hallucinations in LVLMs in a training-free manner. In this method, we design an Inter-Modality Decoding (IMD) module to alleviate hallucination by a novel contrastive decoding mechanism. IMD masks the value vectors associated with significant cross-modal attention weights as distortion, which addresses both uni-modality overreliance and misleading inter-modality correlations. Additionally, a Cross-Modal Position Calibration (CMPC) module shrinks the position gap of image tokens, alleviating the position bias in cross-modal attention. Experimental results on diverse hallucination benchmarks validate the superiority of our method over existing state-of-the-art techniques in reducing hallucinations for LVLM. Our code will be available at https://github.com/lijm48/IMCCD.

URL PDF HTML ☆

赞 0 踩 0

2502.15224 2026-06-01 cs.LG cs.AI 版本更新

Auto-Discovery-Bench: Diagnosing Structured State Tracking in Oracle-Guided Discovery

自动发现基准：在Oracle引导发现中诊断结构化状态追踪

Tingting Chen, Beibei Lin, Srinivas Anumasa, Vedant Shah, Zifeng Yuan, Qiran Zou, Anirudh Goyal, Dianbo Liu

发表机构 * National University of Singapore（国立新加坡大学）； Mila-Quebec AI institute（魁北克AI研究院）； Meta Superintelligence Labs（Meta超智能实验室）

AI总结提出Auto-Discovery-Bench基准，通过确定性Oracle引导的假设-干预-反馈循环，诊断智能体在结构化状态追踪中的能力瓶颈。

Comments 13 pages

详情

AI中文摘要

交互式发现要求智能体在多轮反馈中维护和更新结构化信念。在评估智能体于嘈杂、开放的科学环境中的表现之前，有必要在受控条件下隔离这一先决能力。我们引入了Auto-Discovery-Bench，一个确定性的Oracle引导诊断基准，其中智能体通过重复的假设-干预-反馈循环恢复隐藏结构。该基准实例化了三种受控发现抽象：有向图发现、无向关系发现和符号方程发现。在所有模型中，性能随着变量数量、轨迹长度和干扰项的增加而下降。一个独立的轨迹追踪诊断表明，即使移除了干预选择和假设生成，许多失败仍然存在，这表明在维护和整合长程结构化信息方面的限制是Oracle引导发现的重要瓶颈。Auto-Discovery-Bench并非旨在取代真实的发现环境；相反，它提供了一个可重复、低混淆的诊断测试平台，用于隔离交互式科学智能体的先决能力。

英文摘要

Interactive discovery requires agents to maintain and update structured beliefs over many rounds of feedback. Before evaluating agents in noisy, open-ended scientific environments, it is useful to isolate this prerequisite capability under controlled conditions. We introduce Auto-Discovery-Bench, a deterministic oracle-guided diagnostic benchmark in which agents recover hidden structures through repeated hypothesis--intervention--feedback cycles. The benchmark instantiates three controlled discovery abstractions: directed graph discovery, undirected relational discovery, and symbolic equation discovery. Across models, performance degrades as the number of variables, trajectory length, and distractors increase. A separate trajectory-tracking diagnostic shows that many failures persist even when intervention selection and hypothesis generation are removed, suggesting that limitations in maintaining and integrating long-range structured information are an important bottleneck for oracle-guided discovery. Auto-Discovery-Bench is not intended to replace realistic discovery environments; rather, it provides a reproducible, low-confound diagnostic testbed for isolating a prerequisite capability for interactive scientific agents.

URL PDF HTML ☆

赞 0 踩 0

2502.04671 2026-06-01 cs.AI cs.LG cs.LO cs.PL 版本更新

ProofWala: A Framework for Multilingual Proof Data Synthesis and Theorem-Proving

ProofWala: 多语言证明数据合成与定理证明框架

Amitayush Thakur, George Tsoukalas, Greg Durrett, Swarat Chaudhuri

发表机构 * University of Texas, Austin, USA（得克萨斯大学奥斯汀分校）

AI总结提出ProofWala框架，通过itp-interface库实现与交互式定理证明器的程序化交互，支持多语言证明数据合成、并行证明搜索，并验证了跨语言与跨领域迁移的有效性。

详情

AI中文摘要

神经定理证明方法需要强大的基础设施来与交互式定理证明器（ITP）交互、提取结构化证明数据以及大规模执行证明搜索。然而，现有工具通常针对特定助手且面向文件级执行，使得仓库级分析和并行实验变得困难。我们提出ProofWala，一个多语言证明工程框架，基于 exttt{itp-interface}构建，这是一个用于与ITP进行程序化交互的可重用库。对于Lean 4，我们实现了一个在阐释器内部执行的元编程交互层，支持语义上忠实的策略级跟踪，以及跨整个仓库的声明和依赖级提取。该设计超越了传统的REPL式交互，支持项目范围的分析、环境克隆和证明状态的池化执行。相同的接口抽象支持多个版本的Rocq，形成统一的跨助手流水线。基于此基础设施，ProofWala提供标准化的多语言证明数据集、模型训练工具和并行证明搜索算法。使用该框架，我们展示了跨Lean和Rocq的多语言训练能够实现跨语言和跨领域迁移。我们在Lean Mathlib和领域适应（CategoryTheory）上观察到统计显著的改进，而其他设置也呈现一致的增长趋势。我们在两个仓库中开源了完整框架、并行证明搜索模块、数据集和模型：ProofWala (https://github.com/trishullab/proof-wala) 和 itp-interface 库 (https://github.com/trishullab/itp-interface)。

英文摘要

Neural approaches to theorem proving require robust infrastructure for interfacing with interactive theorem provers (ITPs), extracting structured proof data, and executing proof search at scale. However, existing tooling is often assistant-specific and oriented toward file-level execution, making repository-scale analysis and parallel experimentation challenging. We present ProofWala, a multilingual proof engineering framework built around \texttt{itp-interface}, a reusable library for programmatic interaction with ITPs. For Lean 4, we implement a meta-programmed interaction layer executing inside the elaborator, enabling semantically faithful tactic-level tracing alongside declaration- and dependency-level extraction across entire repositories. This design extends beyond traditional REPL-style interaction by supporting project-wide analysis, environment cloning, and pooled execution of proof states. The same interface abstraction supports multiple versions of Rocq, yielding a unified cross-assistant pipeline. Built on this infrastructure, ProofWala provides standardized multilingual proof datasets, model training utilities, and parallel proof search algorithms. Using the framework, we demonstrate that multilingual training across Lean and Rocq enables cross-lingual and cross-domain transfer. We observe statistically significant improvements on Lean Mathlib and in domain adaptation (CategoryTheory), while other settings exhibit consistent upward trends. We open-source the full framework, parallel proof search module, datasets, and models across two repositories: ProofWala (https://github.com/trishullab/proof-wala) and the itp-interface library (https://github.com/trishullab/itp-interface).

URL PDF HTML ☆

赞 0 踩 0

2502.04554 2026-06-01 cs.AI 版本更新

Unifying and Optimizing Data Values for Selection via Sequential Decision-Making

通过序列决策统一和优化数据选择的数据价值

Hongliang Chi, Qiong Wu, Zhengyi Zhou, Jonathan Light, Emily Dodwell, Yao Ma

发表机构 * Rensselaer Polytechnic Institute（伦塞拉尔理工学院）

AI总结将数据选择重构为序列决策问题，通过动态规划得到最优选择序列，并统一解释Data Shapley等现有方法为近视线性近似，提出基于二分图的高效替代方法，在经典ML和大规模LLM微调数据选择中显著优于现有方法。

详情

AI中文摘要

数据选择已成为数据价值的一个关键下游应用，然而在数据价值用于选择的理论基础方面仍未被充分探索。我们将数据选择重新表述为一个序列决策问题，其中最优选择序列由动态规划产生，而数据价值可以被理解为该最优序列的编码。这一框架通过近似动态规划的视角统一并重新解释了现有方法（如Data Shapley），揭示它们是对序列问题的近视线性近似。我们进一步分析了在子模性下选择最优性如何随效用曲率下降，解释了这些近似何时以及为何失败。为了弥合理论与实践，我们提出了一种基于二分图的高效替代方法，该方法在保持子模结构的同时，实现了具有可证明保证的可扩展贪心选择。在经典机器学习基准和大规模LLM微调数据选择上的实验表明，该方法显著优于现有方法。代码公开于https://github.com/frankhlchi/SeqDataVal。

英文摘要

Data selection has emerged as a crucial downstream application of data valuation, yet the theoretical foundations for using data values in selection remain underexplored. We reformulate data selection as a sequential decision-making problem where the optimal selection sequence arises from dynamic programming, and data values can be understood as encodings of this optimal sequence. This framework unifies and reinterprets existing methods like Data Shapley through the lens of approximate dynamic programming, revealing them as myopic linear approximations to the sequential problem. We further analyze how selection optimality degrades with utility curvature under submodularity, explaining when and why these approximations fail. To bridge theory and practice, we propose an efficient bipartite graph-based surrogate that preserves submodular structure while enabling scalable greedy selection with provable guarantees. Experiments on classical ML benchmarks and large-scale LLM fine-tuning data selection demonstrate substantial improvements over existing methods. Code is publicly available at https://github.com/frankhlchi/SeqDataVal

URL PDF HTML ☆

赞 0 踩 0

2404.14928 2026-06-01 cs.LG cs.AI cs.CL cs.SI 版本更新

Graph Machine Learning in the Era of Large Language Models (LLMs)

大语言模型时代的图机器学习

Shijie Wang, Jiani Huang, Zhikai Chen, Yu Song, Wenzhuo Tang, Haitao Mao, Wenqi Fan, Hui Liu, Xiaorui Liu, Dawei Yin, Qing Li

发表机构 * The Hong Kong Polytechnic University（香港理工大学）； Michigan State University（密歇根州立大学）； North Carolina State University（北卡罗来纳州立大学）； Baidu Inc（百度公司）

AI总结本文综述了大语言模型如何增强图机器学习的泛化、迁移和少样本学习能力，以及图如何提升大语言模型的推理和可解释性。

Comments Accepted by TIST

详情

AI中文摘要

图在表示社交网络、知识图谱和分子发现等各个领域的复杂关系中扮演着重要角色。随着深度学习的出现，图神经网络（GNN）已成为图机器学习（Graph ML）的基石，促进了图的表示和处理。最近，大语言模型（LLM）在语言任务中展现出前所未有的能力，并被广泛应用于计算机视觉和推荐系统等各种应用中。这一显著成功也引起了将LLM应用于图领域的兴趣。越来越多的努力致力于探索LLM在提升图机器学习的泛化性、迁移性和少样本学习能力方面的潜力。同时，图，尤其是知识图谱，富含可靠的事实知识，可用于增强LLM的推理能力，并可能缓解其局限性，如幻觉和缺乏可解释性。鉴于这一研究方向的快速进展，有必要对LLM时代图机器学习的最新进展进行系统综述，为研究人员和从业者提供深入理解。因此，在本综述中，我们首先回顾了图机器学习的最新发展。然后，我们探讨了如何利用LLM来增强图特征的质量，减轻对标注数据的依赖，并解决图异质性和分布外（OOD）泛化等挑战。之后，我们深入探讨了图如何增强LLM，突出了它们增强LLM预训练和推理的能力。此外，我们调查了各种应用，并讨论了这一有前景领域的潜在未来方向。

英文摘要

Graphs play an important role in representing complex relationships in various domains like social networks, knowledge graphs, and molecular discovery. With the advent of deep learning, Graph Neural Networks (GNNs) have emerged as a cornerstone in Graph Machine Learning (Graph ML), facilitating the representation and processing of graphs. Recently, LLMs have demonstrated unprecedented capabilities in language tasks and are widely adopted in a variety of applications such as computer vision and recommender systems. This remarkable success has also attracted interest in applying LLMs to the graph domain. Increasing efforts have been made to explore the potential of LLMs in advancing Graph ML's generalization, transferability, and few-shot learning ability. Meanwhile, graphs, especially knowledge graphs, are rich in reliable factual knowledge, which can be utilized to enhance the reasoning capabilities of LLMs and potentially alleviate their limitations such as hallucinations and the lack of explainability. Given the rapid progress of this research direction, a systematic review summarizing the latest advancements for Graph ML in the era of LLMs is necessary to provide an in-depth understanding to researchers and practitioners. Therefore, in this survey, we first review the recent developments in Graph ML. We then explore how LLMs can be utilized to enhance the quality of graph features, alleviate the reliance on labeled data, and address challenges such as graph Heterophily and out-of-distribution (OOD) generalization. Afterward, we delve into how graphs can enhance LLMs, highlighting their abilities to enhance LLM pre-training and inference. Furthermore, we investigate various applications and discuss the potential future directions in this promising field.

URL PDF HTML ☆

赞 0 踩 0