arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21512
2606.04238 2026-06-04 cs.LG cs.AI

Recover-LoRA for Aggressive Quantization: Reclaiming Accuracy in 2-Bit Language Models via Low-Rank Adaptation with Knowledge Distillation on Synthetic Data

Recover-LoRA 用于激进量化:通过低秩适配与合成数据知识蒸馏恢复2比特语言模型的精度

Devleena Das, Rajeev Patwari, Elliott Delaye, Ashish Sirasao

发表机构 * Advanced Micro Devices, Inc.(先进微器件公司)

AI总结 针对2比特激进量化导致的大语言模型精度严重下降问题,提出Recover-LoRA方法,结合选择性混合精度策略(仅MLP的gate和up层量化为2比特)和基于合成数据蒸馏的低秩适配训练,在Qwen3-4B上以1万合成样本在12个基准中恢复9个基准80-95%的精度。

详情
AI中文摘要

将权重激进量化至2比特精度可大幅提升大语言模型推理的吞吐量和内存效率,但通常会导致严重的精度下降。这些增益对于内存容量和带宽为主要限制的边缘和设备端部署尤为重要。在本工作中,我们将Recover-LoRA——一种最初为通用模型权重损坏设计的轻量级、无需数据的精度恢复方法——扩展到超低比特量化场景。我们提出了一种选择性混合精度策略,其中仅MLP的gate和up投影层被量化为2比特(W2),而所有其他线性层保持更高精度,从而形成混合精度的GateUp配置。通过三个模型系列(4B-20B)和两个硬件平台的屋顶线分析,我们证明W4/W2-GateUp部署(4比特基础加2比特gate/up)相比均匀W4可实现7.5-23.3%的TPS提升(取决于模型和上下文长度),同时将量化误差限制在可预测的层子集内。然后,我们应用Recover-LoRA——在量化层上通过合成数据的logit蒸馏训练低秩适配器——来恢复因gate和up层的2比特量化而损失的精度。在Qwen3-4B的案例研究中,Recover-LoRA仅使用1万合成训练样本且无需标注数据,就在12个基准中的9个上实现了80-95%的精度恢复。我们进一步证明,对于基于蒸馏的恢复,合成数据的表现与精心整理的标注数据相当,并且恢复结果可泛化到分布外评估任务。我们的结果表明,Recover-LoRA是一种实用的后量化精度恢复工具,适用于部署场景中的激进权重压缩。

英文摘要

Aggressive weight quantization to 2-bit precision offers substantial throughput and memory gains for large language model (LLM) inference, but typically incurs severe accuracy degradation. These gains are particularly relevant for edge and on-device deployment, where memory capacity and bandwidth are primary constraints. In this work, we extend Recover-LoRA -- a lightweight, data-free accuracy recovery method originally developed for general model weight corruption -- to the setting of ultra-low-bit quantization. We propose a selective mixed-precision strategy in which only gate and up projection layers of the MLP are quantized to 2-bit (W2), while all other linear layers remain at higher precision, yielding a mixed-precision GateUp configuration. We demonstrate via roofline analysis across three model families (4B--20B) and two hardware platforms that a W4/W2-GateUp deployment (4-bit base with 2-bit gate/up) delivers 7.5--23.3\% TPS improvement over uniform W4 depending on model and context length, while confining quantization error to a predictable subset of layers. We then apply Recover-LoRA -- training low-rank adapters on the quantized layers via logit distillation with synthetic data -- to recover accuracy lost from 2-bit quantization of the gate and up layers. In a case study on Qwen3-4B, Recover-LoRA achieves 80--95\% accuracy recovery on 9 of 12 benchmarks, using only 10k synthetic training samples and no labeled data. We further demonstrate that synthetic data performs comparably to curated labeled data for distillation-based recovery, and that recovery generalizes to out-of-distribution evaluation tasks. Our results present Recover-LoRA as a practical post-quantization accuracy recovery tool for aggressive weight compression in deployment settings.

2606.04236 2026-06-04 cs.CL cs.AI cs.LG

Supportive Token Revealing for Fast Diffusion Language Model Decoding

支持性标记揭示:快速扩散语言模型解码

Giries Abu Ayoub, Mario Barbara, Lluís Pastor-Pérez, Tanja Bien, Aneesh Barthakur, Alaa Maalouf, Loay Mualem

发表机构 * Department of Computer Science, University of Haifa(海法大学计算机科学系) Institute for AI, University of Stuttgart(斯图加特大学人工智能研究所) IMPRS-IS

AI总结 提出AXON模块,通过选择注意力、不确定性和置信度信号中的锚点标记来改善扩散语言模型并行解码的质量-延迟权衡。

详情
AI中文摘要

离散扩散语言模型可以通过并行更新多个掩码位置来高效生成文本,但这种并行性引入了质量-延迟权衡。激进的解码可能过早提交相互依赖的标记,而保守的解码则需要大量去噪步骤。现有方法通过使用置信度或依赖性标准决定哪些标记可以安全揭示来解决这一矛盾。然而,避免不安全提交并不一定使剩余的掩码序列易于解码,因为不确定的标记可能依赖于掩码标记,从而成为去噪步骤的瓶颈。我们提出AXON,一个无需训练的模块,可添加到现有扩散语言模型的并行解码策略之上。AXON不替换基础解码器,而是监控剩余不确定的掩码标记,并仅当它们当前状态表明需要额外上下文时才进行干预。然后它将标准从揭示哪些标记最安全转变为哪些自信揭示最能支持后续去噪。AXON使用注意力、不确定性和置信度信号选择锚点,即不确定位置关注的自信掩码标记。在多个扩散语言模型的推理和代码生成基准上的实验表明,AXON改善了现有并行解码器的质量-延迟权衡,通常减少函数评估次数,同时保持或提高准确性。

英文摘要

Discrete diffusion language models can generate text efficiently by updating multiple masked positions in parallel, but this parallelism introduces a quality-latency trade-off. Aggressive decoding may commit mutually dependent tokens too early, while conservative decoding requires many denoising steps. Existing methods address this tension by deciding which tokens are safe to reveal using confidence or dependency criteria. However, avoiding unsafe commits does not necessarily make the remaining masked sequence easy to decode, since uncertain tokens may depend on masked tokens, creating a bottleneck for denoising steps. We propose AXON, a training-free module that can be added on top of existing parallel decoding strategies for diffusion language models. Rather than replacing the base decoder, AXON monitors the remaining uncertain masked tokens and intervenes only when their current state suggests that additional context is needed. It then shifts the criterion from which tokens are safest to reveal to which confident reveals would best support later denoising. AXON selects anchors, confident masked tokens that uncertain positions attend to, using attention, uncertainty, and confidence signals. Experiments on reasoning and code-generation benchmarks across multiple diffusion language models show that AXON improves the quality-latency trade-off of existing parallel decoders, often reducing the number of function evaluations while maintaining or improving accuracy.

2606.04233 2026-06-04 cs.RO

What Are We Actually Benchmarking in Robot Manipulation?

我们究竟在机器人操作中基准测试什么?

Tianchong Jiang, Xiangshan Tan, Samuel Wheeler, Luzhe Sun, Tewodros W. Ayalew, Matthew Walter

发表机构 * Toyota Technological Institute at Chicago(丰田技术研究所芝加哥分校) University of Chicago(芝加哥大学) Argonne National Laboratory(阿贡国家实验室)

AI总结 本文通过识别基准测试的四种失效模式(捷径可解性、缺乏统计显著性、渐进过拟合和数据源依赖性),并提出相应诊断方法,对LIBERO、CALVIN、SimplerEnv、RoboCasa和RoboTwin 2.0进行审计,发现多数基准测试存在缺陷,并发布了诊断工具。

Comments 31 pages, 6 figures

详情
AI中文摘要

机器人基准测试分数衡量的是在固定评估设置下的成功率,但通常被当作通用操作能力的证据。我们识别出四种失效模式,每种模式都会削弱或否定基准测试作为该能力有效代理的作用:捷径可解性、缺乏统计显著性、渐进过拟合和数据源依赖性。我们为每种失效模式提出一种诊断方法。我们使用这些诊断方法审计了LIBERO、CALVIN、SimplerEnv、RoboCasa和RoboTwin 2.0。LIBERO和CALVIN未通过多项诊断。RoboCasa和RoboTwin 2.0未通过较少,尽管它们在近期进展声明中出现的频率远低于前者。在LIBERO上,一个没有语言编码器的0.09B探针得分达到或接近报告的最优结果,且大多数报告的性能提升无法证明具有统计显著性。在CALVIN上,在训练范围内随机化块的位置会降低所有测试策略的性能。我们发布了四种诊断方法及其参考实现,供作者和审稿人在将基准测试分数视为进展证据之前使用。代码和工件可在https://ripl.github.io/manipulation_benchmark_audit/获取。

英文摘要

A robotics benchmark score measures success under one fixed evaluation setup, yet is routinely treated as evidence of general manipulation capability. We identify four failure modes, each of which weakens or invalidates a benchmark's role as a valid proxy for that capability: shortcut solvability, lack of statistical significance, creeping overfitting, and data-source dependence. We propose one diagnostic per failure mode. We audit LIBERO, CALVIN, SimplerEnv, RoboCasa, and RoboTwin 2.0 under these diagnostics. LIBERO and CALVIN fail multiple diagnostics. RoboCasa and RoboTwin 2.0 fail fewer, despite appearing far less often in recent progress claims. On LIBERO, a 0.09B probe with no language encoder scores at or near reported SOTA, and most reported gains are not provably statistically significant. On CALVIN, randomizing block poses within the training range drops performance for every tested policy. We release the four diagnostics with reference implementations for authors and reviewers to apply before treating a benchmark score as evidence of progress. Code and artifacts are available at https://ripl.github.io/manipulation_benchmark_audit/.

2606.04231 2026-06-04 cs.CL cs.AI

MM-BizRAG: Rethinking Multimodal Retrieval-Augmented Generation for General Purpose Enterprise Q&A

MM-BizRAG:面向通用企业问答的多模态检索增强生成再思考

Hanoz Bhathena, Parin Rajesh Jhaveri, Rohan Mittal, Prateek Singh, Aymen Kallala, Rachneet Kaur, Yiqiao Jin, Zhen Zeng, Adwait Ratnaparkhi, Denis Kochedykov

发表机构 * JPMorgan Chase & Co.(摩根大通公司) Georgia Institute of Technology(佐治亚理工学院)

AI总结 提出MM-BizRAG框架,通过文档结构感知分割和布局感知解析,结合统一LLM驱动的工件转换与推理时多模态组装,无需微调即可提升企业文档问答性能,在异构企业数据集和两个公开基准上超越基线最多32个百分点。

Comments Accepted at ACL 2026 (Industry Track)

详情
AI中文摘要

近期多模态检索增强生成(MM-RAG)的进展倾向于最小化解析,依赖页面级图像来生成检索器嵌入和答案生成。虽然高效,但这种趋势往往忽略了对复杂企业文档中丰富结构化信息的显式处理,而是依赖预训练嵌入或视觉语言模型隐式捕获这种结构。在本工作中,我们采取更直接的方法:MM-BizRAG通过文档结构感知分割主动提取和表示文档结构,该分割根据文档方向动态路由文档至特定方向的摄取管道,对垂直结构文档(如报告)应用显式布局感知解析,对水平结构文档(如幻灯片)应用整体页面级表示。统一的LLM驱动的工件转换管道通过基于占位符的位置对齐保留自然阅读顺序,而推理时的多模态组装将检索表示与生成上下文解耦,无需任何微调即可生成更丰富、更基于事实的答案。通过在大型异构企业数据集和两个公开基准(SlideVQA和FinRAGBench-V)上的实验,MM-BizRAG一致地超越最先进的以视觉为中心的基线最多32个百分点,在报告式布局上尤其强劲。此外,我们引入了FastRAGEval,一种单次调用的LLM评判指标,用于细粒度生成召回,将RAGChecker的成本减半,同时实现更强的人类对齐。

英文摘要

Recent advances in multimodal retrieval-augmented generation (MM-RAG) have shifted toward minimal parsing, relying on page-level images for producing retriever embeddings and for answer generation. While efficient, this trend often neglects explicit handling of the rich, structured information in complex enterprise documents, instead depending on pre-trained embeddings or vision-language models to implicitly capture such structure. In this work, we take a more direct approach: MM-BizRAG proactively extracts and represents document structure via a document structure-aware split that dynamically routes documents through orientation-specific ingestion pipelines, applying explicit layout-aware parsing for vertically structured documents (e.g., reports) and holistic page-level representations for horizontally structured documents (e.g., slide decks). A unified LLM-driven artifact transformation pipeline with placeholder-based positional alignment preserves natural reading order, while inference-time multimodal assembly decouples retrieval representations from generation context, enabling richer, more grounded answers without any finetuning requirement. Through experiments on a large, heterogeneous enterprise dataset and two public benchmarks (SlideVQA and FinRAGBench-V), MM-BizRAG consistently outperforms state-of-the-art vision-centric baselines by up to 32% points, with especially strong gains on report-style layouts. Furthermore, we introduce FastRAGEval, a single-call LLM Judge metric for fine-grained generative recall that halves RAGChecker's cost while achieving stronger human alignment.

2606.04226 2026-06-04 cs.RO cs.AI

PerceptTwin: Semantic Scene Reconstruction for Iterative LLM Planning and Verification

PerceptTwin:面向迭代LLM规划与验证的语义场景重建

Charlie Gauthier, Sacha Morin, Liam Paull

发表机构 * Department of Computer Science and Operations Research, Université de Montréal(蒙特利尔大学计算机科学与运筹学系) Mila - Quebec AI Institute(魁北克人工智能研究所) CIFAR AI Chair(CIFAR人工智能主席)

AI总结 提出PerceptTwin自动管道,从机器人感知的语义场景表示构建交互式仿真,结合LLM法官验证规划正确性与人类偏好,提升规划成功率约39%。

Comments Accepted at ICRA 2026 (Vienna); published on arxiv for archival purposes. See also https://percept-twin.github.io/

详情
AI中文摘要

仿真环境对于机器人策略学习以及规划验证与确认都很有用。传统上,创建仿真的过程是繁重的。为机器人运行的每个单独环境创建定制的仿真环境是不可行的。在这项工作中,我们引入了PerceptTwin,这是一个全自动管道,直接从机器人感知栈产生的语义场景表示构建交互式仿真。PerceptTwin结合了开放词汇对象地图与3D资产生成、 afford预测和常识条件检查。这些交互式仿真可用于在机器人硬件上执行规划之前验证和完善规划。借鉴AI对齐文献,我们还引入了一个LLM法官,用于验证规划的正确性和与人类偏好的一致性。实验表明,PerceptTwin反馈允许LLM规划器完善规划、增强安全性并抵抗有害的黑盒提示攻击。在我们的任务套件中,PerceptTwin使GPT5、GPT5Mini和GPT5Nano规划器的规划成功率平均提高约39%。此外,对于因未满足技能前提条件而失败的规划,PerceptTwin还将人类规划验证平均提高高达18%。我们的结果证明了从机器人感知进行开放词汇场景仿真作为更安全、更可靠的机器人规划基础的潜力。

英文摘要

Simulation environments are useful for both robot policy learning and planning verification and validation. Traditionally, the process of creating a simulation was onerous. Creating a bespoke simulation environment for each individual environment that a robot would operate in was simply infeasible. In this work, we introduce PerceptTwin, a fully automatic pipeline that constructs interactive simulations directly from semantic scene representations produced by a robot's perception stack. PerceptTwin combines open-vocabulary object maps with 3D asset generation, affordance prediction, and commonsense condition checking. These interactive simulations can be used to validate and refine plans before they are executed on the robot hardware. Borrowing from the AI alignment literature, we also introduce an LLM judge that verifies plan correctness and alignment with human preferences. Experiments show that PerceptTwin feedback allows LLM planners to refine plans, enhance safety, and resist harmful black-box prompting attacks. In our suite of tasks, PerceptTwin improves plan success by an average of approximately 39% for GPT5, GPT5Mini, and GPT5Nano planners. Additionally, PerceptTwin also improves human plan verification by up to 18% on average for plans that fail due to unfilled skill preconditions. Our results demonstrate the potential of open-vocabulary scene simulation from robot perception as a foundation for safer, more reliable robot planning.

2606.04223 2026-06-04 cs.AI

Consensus is Strategically Insufficient: Reasoning-Trace Disagreement as a Knowledge-Representation Signal

共识在策略上是不充分的:推理轨迹分歧作为知识表示信号

Michał Wawer, Jarosław A. Chudziak

发表机构 * Laboratory of The New Ethos(新伦理实验室) Warsaw University of Technology(华沙理工大学) Institute of Computer Science(计算机科学研究所) Faculty of Electronics and Information Technology(电子与信息技术学院)

AI总结 本文提出在价值负载任务中,分歧可能反映规范不确定性而非错误,通过将推理轨迹和决策抽象为符号分歧状态,构建知识表示层以支持可废止策略路由,连接亚符号LLM审议与符号知识表示。

Comments Accepted to LAMAS&SR workshop at FLoC 2026 (KR + ICPL + LICS + CP + FSCD)

详情
AI中文摘要

多智能体系统通常通过投票、共识协议、辩论或容错聚合来减少分歧。我们认为,对于价值负载任务,这一目标是不充分的,因为分歧可能反映真正的规范不确定性而非智能体错误。基于先前关于人机协作审核中推理轨迹分歧的工作,我们提出一个知识表示层,其中推理轨迹和智能体决策被抽象为符号分歧状态。给定产生显式推理轨迹和二元决策的智能体,我们根据推理相似性和结论一致性区分四种状态:收敛一致、发散一致、收敛分歧和发散分歧。这些状态支持可废止的策略路由规则。我们在内容审核中实例化该框架,并论证分歧感知路由为多智能体策略推理中亚符号LLM审议与符号知识表示之间提供了桥梁。

英文摘要

Multi-agent systems are commonly designed to reduce disagreement through voting, consensus protocols, debate, or fault-tolerant aggregation. We argue that this objective is insufficient for value-laden tasks, where disagreement may reflect genuine normative uncertainty rather than agent error. Building on prior work on reasoning-trace disagreement in human-AI collaborative moderation, we propose a knowledge-representation layer in which reasoning traces and agent decisions are abstracted into symbolic disagreement states. Given agents producing explicit reasoning traces and binary decisions, we distinguish four states according to reasoning similarity and conclusion agreement: convergent agreement, divergent agreement, convergent disagreement and divergent disagreement. These states support defeasible strategic routing rules. We instantiate the framework in content moderation and argue that disagreement-aware routing provides a bridge between sub-symbolic LLM deliberation and symbolic knowledge representation for multi-agent strategic reasoning.

2606.04222 2026-06-04 cs.RO

Towards Estimating Normal and Shear Interface Pressures in Prosthetic Sockets via Least Squares and Mechanics Modeling

通过最小二乘和力学建模估算假肢接受腔中的法向和剪切界面压力

Axel González Cornejo, Tianhao Yu, Chi Hwan Lee, Edgar Bolívar-Nieto

发表机构 * University of California, Berkeley(加州大学伯克利分校) University of Michigan(密歇根大学)

AI总结 针对假肢接受腔界面压力测量中剪切力缺失和传感器串扰问题,提出一种基于稀疏传感和最小二乘的准静态弹簧-质量接触模型,通过全局力/力矩和局部压力数据验证模型性能。

详情
AI中文摘要

假肢接受腔的适配仍然主要依靠手工和迭代,客观适配指标仍然有限。挑战之一在于缺乏残肢-接受腔界面的长期真实压力数据。传统压力传感器随时间漂移,且仅能捕捉接受腔内稀疏位置的法向压力,缺失了生物力学分析的关键分量:剪切力。尽管某些传感器可以同时报告法向和剪切界面应力,但由于测量串扰,这些分量往往难以解耦。一个潜在的解决途径是开发能够增强现有测量的模型。本文引入了一个测试平台,使用两种互补的验证信号评估稀疏压力传感下的模型性能:(i)通过人工残肢传递的全局力螺旋(即正交坐标系中的总力和力矩),以及(ii)由稀疏传感簇(每个簇由四个电容传感通道组成)测量的局部界面载荷(即每个仪器位置处右手正交坐标系中解耦的法向和剪切压力分量)。本文不呈现全场压力估计,而是聚焦于一个分析序列,量化候选力学模型在受控条件下解释全局和局部测量的能力。评估了一个准静态弹簧-质量接触模型,并通过两阶段凸最小二乘问题识别其参数。静态加载下的验证表明,估计恒定偏置项可以减少力螺旋通道的稳态偏移,并改善与局部测量的一致性。帕累托前沿敏感性分析进一步说明了当包含偏置项时,全局和局部目标之间的权衡如何变化。

英文摘要

Prosthetic socket fitting remains largely manual and iterative, and objective fit metrics are still limited. Part of the challenge is the lack of long-term real-life pressure data at the residual limb--socket interface. Traditional pressure sensors are prone to drift over time, and capture only normal pressures at sparse locations within the socket, missing a critical component for biomechanical analysis: shear. Although some sensors can report both normal and shear interface stresses, these components are often difficult to decouple because of measurement crosstalk. One potential path forward is to develop models that can augment available measurements. This work introduces a testbed to evaluate model performance under sparse pressure sensing using two complementary validation signals: (i) the global wrench (\ie, total forces and moments expressed in an orthonormal frame) transmitted through the socket, by an artificial residual-limb, and (ii) local interface loads (\ie, decoupled normal and shear pressure components in a right-hand-rule orthogonal frame that lives in each instrumented location) measured by sparse sensing clusters, each composed of four capacitance-sensing channels. Rather than presenting full-field pressure estimates, the focus is on an analysis sequence that quantifies how well candidate mechanical models explain both global and local measurements under controlled conditions. A quasi-static spring--mass contact model is evaluated, and its parameters are identified via a two-stage convex least-squares problem. Validation under static loading shows that estimating constant bias terms reduces steady offsets in the wrench channels and improves agreement with local measurements. A Pareto-front sensitivity analysis further illustrates how the trade-off between global and local objectives changes when bias terms are included.

2606.04221 2026-06-04 cs.SD cs.AR eess.AS

Feasibility of Time-Domain DNN-Based Speech Enhancement on Embedded FPGA for Hearing Aid

基于时域DNN的助听器嵌入式FPGA语音增强可行性研究

Feyisayo Olalere, Umut Altin, Kiki van der Heijden, Marcel van Gerven

发表机构 * Radboud University, Donders Institute for Brain, Cognition, and Behaviour, The Netherlands(拉德堡德大学,脑认知行为研究所,荷兰) Mortimer B. Zuckerman Mind, Brain, Behavior Institute, Columbia University, USA(莫蒂默·B·齐克曼心智、大脑与行为研究所,哥伦比亚大学,美国)

AI总结 本文在AMD-Xilinx Kria KV260上部署轻量级SuDoRM-RF++模型,通过FP32和16位定点精度评估语音分离和降噪,发现数据移动是主要瓶颈,定点降噪加速器达到9.7ms首样本延迟,满足10ms临床阈值。

Comments 13 pages

详情
AI中文摘要

助听器对延迟和功耗有严格限制,当前基于DNN的语音增强系统在嵌入式硬件上难以满足这些要求。我们通过在AMD-Xilinx Kria KV260上部署轻量级SuDoRM-RF++架构进行语音分离和降噪,对每个任务评估了FP32和16位定点精度。在这些配置中,首样本延迟与片上参数缓存相关而非算术吞吐量,表明数据移动是主要瓶颈。精度降低使模型内存占用减半而不损害客观语音质量。定点降噪加速器达到9.7毫秒的首样本延迟,满足10毫秒的临床阈值,而语音分离达到16.0毫秒。这些测量结果为嵌入式DNN语音增强建立了具体的资源需求,并量化了与助听器部署之间的剩余差距。

英文摘要

Hearing aids impose strict latency and power constraints that current DNN-based speech enhancement systems struggle to meet on embedded hardware. We characterize this gap by deploying both speech separation and denoising using the lightweight SuDoRM-RF++ architecture on the AMD-Xilinx Kria KV260, evaluated at FP32 and 16-bit fixed-point precision for each task. Across these configurations, first-sample latency tracks with on-chip parameter caching rather than arithmetic throughput, identifying data movement as the primary bottleneck. Precision reduction halves the model memory footprint without compromising objective speech quality. The fixed-point denoising accelerator achieves a first-sample latency of 9.7~ms, meeting the 10~ms clinical threshold, while speech separation reaches 16.0~ms. These measurements establish concrete resource requirements for embedded DNN-based speech enhancement and quantify the remaining gap to hearing aid deployment.

2606.04209 2026-06-04 cs.LG

A Geometric View of Counterfactual Behavior: Interaction of Boundary Proximity and Local Support

反事实行为的几何视角:边界接近度与局部支持的交互作用

Ioanna Gemou, Matteo Gamba, Randall Balestriero, Ritambhara Singh

发表机构 * Brown University(布朗大学)

AI总结 本文通过几何视角研究反事实行为,发现决策边界接近度与局部数据支持的交互作用决定了反事实的可行性,且反事实行为是独立于预测性能的维度,可在不改变准确率的情况下被改变。

详情
AI中文摘要

反事实解释寻求对输入进行小的、语义上有意义的改变,以改变模型的预测,并广泛用于解释和审计机器学习系统。在现代视觉、语言和多模态系统中,预训练编码器将输入映射到表示空间,下游分类器头在这些空间内施加决策边界。因此,附近反事实的可行性和距离取决于边界相对于数据的位置。然而,具有相似预测性能的模型在是否能够实现此类改变以及表示必须移动多远方面可能存在显著差异。本文通过使用标准化局部搜索探针,在多个预训练编码器和线性分类器头上检验了这种变化。结果表明,尽管预测性能相似,但模型在反事实行为上存在显著差异。在固定表示下,仅改变分类器头就会改变反事实结果,而预测性能基本保持不变。这种变化由决策边界接近度和局部数据支持的交互作用解释,两者共同决定了预测变化是否可行且位于数据支持的区域内,并且还可以改进固定模型内的反事实搜索。总之,这些发现将反事实行为识别为超越预测性能的独立维度,并表明可以在不改变准确率的情况下改变它,这对模型选择、鲁棒性和反事实方法的可靠性具有启示意义。

英文摘要

Counterfactual explanations seek small, semantically meaningful changes to an input that alter a model's prediction, and are widely used to interpret and audit machine learning systems. In modern vision, language, and multimodal systems, pretrained encoders map inputs to representation spaces, and downstream classifier heads impose decision boundaries within those spaces. As a result, the feasibility and distance of nearby counterfactuals depend on boundary placement relative to the data. Yet models with similar predictive performance can differ substantially in whether such changes are achievable and how far representations must move. This work examines this variation using a standardized local search probe across several pretrained encoders and linear classifier heads. Results show that despite similar predictive performance, models differ substantially in their counterfactual behavior. Under fixed representations, varying only the classifier head alters counterfactual outcomes while leaving predictive performance largely unchanged. This variation is explained by the interaction of decision-boundary proximity and local data support, which jointly determine whether prediction changes are both feasible and lie in regions supported by the data, and can also improve counterfactual search within fixed models. Together, these findings identify counterfactual behavior as a distinct dimension beyond predictive performance and show that it can be altered without changing accuracy, with implications for model selection, robustness, and the reliability of counterfactual methods.

2606.04206 2026-06-04 cs.RO

DLO-Lab: Benchmarking Deformable Linear Object Manipulations with Differentiable Physics

DLO-Lab: 基于可微物理的可变形线性物体操作基准测试

Junyi Cao, Yian Wang, Ziyan Xiong, Chunru Lin, Zhehuan Chen, Chuang Gan

发表机构 * DLO-Lab(DLO实验室)

AI总结 针对机器人操作绳索、电缆等可变形线性物体(DLO)的挑战,提出一个可微模拟器,支持多种材料属性,并构建基准任务套件,结合专用DLO智能体,评估策略学习算法并验证仿真到现实的迁移。

Comments ICML 2026, the project page: https://dlo-lab-26.github.io/

详情
AI中文摘要

我们解决了使机器人能够操作可变形线性物体(DLO),如绳索、电缆和橡皮筋的挑战。先前的工作主要集中于狭窄的、任务特定的问题,通常依赖于真实世界的演示或手工制作的启发式方法。然而,这些方法难以扩展到实践中遇到的各种材料和任务,并且收集足够多样化的真实世界数据通常是不切实际的。此外,现有的仿真环境对可泛化DLO操作所需的广泛材料行为支持有限。为了克服这些限制,我们引入了一个明确设计用于多功能DLO操作的可微模拟器。我们的模拟器模拟了广泛的材料属性——包括(不可)延伸性、弹性、弯曲塑性以及与其他物体的复杂交互——为学习和评估操作技能提供了坚实的基础。基于此模拟器,我们提出了一个代表性任务的基准套件,突出了DLO操作的独特挑战。这些任务的成功执行通常受到DLO固有的拓扑复杂性和抓取敏感性的阻碍。因此,我们引入了一个专门的DLO智能体,通过提出战略性抓取点并将长视界任务分解以最大化控制权,明确管理这些挑战。最后,我们使用我们的框架评估了各种策略学习算法,并进行了仿真到现实的迁移实验,展示了我们平台在推进DLO操作方面的潜力。

英文摘要

We address the challenge of enabling robots to manipulate deformable linear objects (DLOs), such as ropes, cables, and rubber bands. Prior work has primarily focused on narrow, task-specific problems, often relying on real-world demonstrations or handcrafted heuristics. Such approaches, however, struggle to scale to the wide variety of materials and tasks encountered in practice, and collecting sufficiently diverse real-world data is often impractical. Additionally, existing simulation environments offer limited support for the broad spectrum of material behaviors necessary for generalizable DLO manipulation. To overcome these limitations, we introduce a differentiable simulator explicitly designed for versatile DLO manipulation. Our simulator models a wide range of material properties-including (in)extensibility, elasticity, bending plasticity, and complex interactions with other objects-providing a robust foundation for learning and evaluating manipulation skills. Building on this simulator, we propose a benchmark suite of representative tasks that highlight the unique challenges of DLO manipulation. The successful execution of these tasks is often hindered by the topological complexity and grasp sensitivity inherent to DLOs. Therefore, we introduce a specialized DLO agent that explicitly manages these challenges by proposing strategic grasping points and decomposing long-horizon tasks to maximize control authority. Finally, we evaluate various policy-learning algorithms using our framework, alongside sim-to-real transfer experiments, demonstrating our platform's potential to advance DLO manipulation.

2606.04202 2026-06-04 cs.AI

SMAC-Talk: A Natural Language Extension of the StarCraft Multi-Agent Challenge for Large Language Models

SMAC-Talk: 面向大型语言模型的星际争霸多智能体挑战的自然语言扩展

Joel Sol, Homayoun Najjaran

发表机构 * Faculty of Engineering and Computer Science(工程与计算机科学学院) University of Victoria(维多利亚大学)

AI总结 提出SMAC-Talk环境,通过自然语言通信通道评估LLM智能体在合作多智能体场景中的协调与信任,并构建含欺骗性通信者的评估场景。

Comments 8 pages, 1 figure

详情
AI中文摘要

随着LLM的广泛部署,它们越来越需要与其他AI智能体协同工作而非孤立运行。在这些场景中,有效协调要求智能体进行通信、共享信息并在不确定性下做出决策。我们提出了SMAC-Talk,这是星际争霸多智能体挑战的自然语言扩展,用于评估基于LLM的智能体在合作多智能体环境中的表现。该环境具有分散控制、部分可观测性和长周期决策等关键特征。SMAC-Talk包含一个自然语言通信通道,用于探测智能体的协调与信任。我们利用该通信通道构建了不同的评估场景,包括嵌入欺骗性通信者的设置,该通信者试图仅通过通信来干扰和欺骗盟友。我们提供了三个基准测试智能体,使用Qwen3.5系列的4个模型,并研究了推理结构、记忆和模型规模如何影响智能体间的协调。我们将SMAC-Talk作为开放基准发布,以支持研究社区在合作多智能体场景中开发和评估LLM智能体。

英文摘要

As LLMs become more widely deployed, they are increasingly expected to work alongside other AI agents rather than operating in isolation. Effective coordination in these settings requires agents to communicate, share information and make decisions under uncertainty. We introduce SMAC-Talk, a natural language extension of the StarCraft Multi-Agent Challenge for evaluating LLM-based agents in cooperative multi-agent environments. The environment has several key features such as decentralized control, partial observability and long-horizon decision making. SMAC-Talk includes a natural language communication channel which is used to probe agent coordination and trust. We use this communication channel to construct different evaluation scenarios, including settings with an embedded deceptive communicator that tries to disrupt and deceive allies through communication alone. We provide three agents for benchmarking using 4 models from the Qwen3.5 family and study how reasoning structure, memory and model scale affect coordination between agents. We release SMAC-Talk as an open benchmark to support the research community in developing and evaluating LLM agents in cooperative multi-agent settings.

2606.04199 2026-06-04 cs.CL cs.LG

Cross-Prompt Generalization in Detecting AI-Generated Fake News Using Interpretable Linguistic Features

使用可解释语言特征检测AI生成假新闻的跨提示泛化

Aya Vera-Jimenez, Samuel Jaeger, Calvin Ibenye, Dhrubajyoti Ghosh

发表机构 * Department of Mathematics(数学系) School of Data Science and Analytics(数据科学与分析学院) Department of Computer Science(计算机科学系)

AI总结 研究通过提取词汇多样性、可读性和情感特征,在跨提示框架下使用随机森林分类器检测AI生成假新闻,发现模型在不同提示下均表现稳定(AUC 0.988-1.000),表明这些特征可泛化。

详情
AI中文摘要

大型语言模型的日益普及引发了对AI生成假新闻传播的担忧,尤其是在不同的提示策略下。大多数现有的检测模型是在单一生成设置下训练和评估的,其跨未见提示的泛化能力尚不清楚。在本研究中,我们使用三个在不同提示下生成的AI文章数据集以及真实新闻文章,研究了假新闻检测中的跨提示泛化。我们提取了捕捉词汇多样性、可读性和情感特征的可解释语言特征,并在跨提示框架下评估了随机森林分类器,其中在一个提示上训练的模型在另一个提示上进行测试。在所有六个训练-测试组合中,性能始终保持较高,AUC值在0.988到1.000之间。特征分布分析显示,与整体数据集相比,AI生成文本表现出更高的词汇多样性、更低的可读性和显著较低的情感强度,且不同提示间存在差异。尽管存在这些分布变化,分类器仍保持强劲性能,表明这些特征捕捉了AI生成文本的稳定属性,这些属性可跨提示策略泛化。这些发现表明,基于特征的方法可以在提示变化下提供对AI生成假新闻的稳健检测。

英文摘要

The increasing use of large language models has raised concerns about the spread of AI-generated fake news, particularly under varying prompting strategies. Most existing detection models are trained and evaluated under a single generation setting, leaving their ability to generalize across unseen prompts unclear. In this study, we investigate cross-prompt generalization in fake news detection using three datasets of AI-generated articles produced under distinct prompts, combined with real news articles. We extract interpretable linguistic features capturing lexical diversity, readability, and emotion-based characteristics and evaluate a random forest classifier under a cross-prompt framework, where models trained on one prompt are tested on another. Across all six train-test combinations, performance remains consistently high, with AUC values ranging from 0.988 to 1.000. Analysis of feature distributions shows that AI-generated text exhibits increased lexical diversity, reduced readability, and substantially lower emotional intensity compared to the overall dataset, with variations across prompts. Despite these distributional shifts, the classifier maintains strong performance, indicating that these features capture stable properties of AI-generated text that generalize across prompting strategies. These findings suggest that feature-based approaches can provide robust detection of AI-generated fake news under prompt variability.

2606.04198 2026-06-04 cs.CV

Spatial Artifact Coherence Determines Codec Robustness in Patch-Based rPPG

空间伪影相干性决定基于补丁的rPPG中的编解码鲁棒性

Achraf Ben Ahmed

发表机构 * PlesmoSense SARL(PlesmoSense公司)

AI总结 提出空间伪影相干性(SAC)度量,解释编解码压缩下基于补丁的rPPG方法优于全局投影方法的原因,并设计PatchPCA算法族,实验表明SAC解释了93.8%的PCA优势方差。

详情
AI中文摘要

远程光电容积描记法(rPPG)在未压缩基准上实现了低心率误差,但在远程医疗、新生儿ICU和驾驶员疲劳应用中通过压缩视频通道部署。先前没有工作确定在编解码压缩下空间分解优于全局投影方法的物理量。我们提出空间伪影相干性(SAC),定义为4x4块间绿色通道协方差矩阵(带通0.75-2.5 Hz)的非对角能量与对角能量之比,以及PatchPCA算法族(四种编解码感知的rPPG算法)。我们在三个公共数据集上评估了280名受试者、11种编解码退化变体(MPEG-4、H.265、H.264、JPEG、色度子采样)和13种算法,通过Wilcoxon检验(BH-FDR,q < 0.05,904次检验)。SAC解释了PCA优势中93.8%的变体间方差(r = +0.969),编解码族之间零重叠:非MPEG-4变体聚集在SAC 0.10-0.18,PCA胜率84-90%;而MPEG-4变体聚集在SAC 0.48-0.59,胜率61%,平均改进降低5.8倍。在受试者内部,78%确认了预期模式(p < 10^-22,dz = 0.73)。变体内部受试者水平SAC相关性为r = +0.099,确认SAC分类编解码族而非预测个体结果。MPEG-4的影响是结构性的(宏块DCT几何,而非噪声幅度),由源编解码状态而非分辨率决定。P-Hybrid被确定为最部署鲁棒的算法。建立了PatchPCA优势的两个必要操作条件:SAC < 0.30和低到中等运动,直接排除了原始到MPEG-4转码流水线。SAC为临床远程监测系统中编解码感知的rPPG算法选择提供了物理基础度量。

英文摘要

Remote photoplethysmography (rPPG) achieves low heart-rate error on uncompressed benchmarks yet is deployed over compressed video channels in telehealth, neonatal ICU, and driver fatigue applications. No prior work identifies the physical quantity determining when spatial decomposition outperforms global-projection methods under codec compression. We propose Spatial Artifact Coherence (SAC), defined as the ratio of off-diagonal to diagonal energy in the 4x4 inter-patch Green-channel covariance matrix (bandpass 0.75-2.5 Hz), and the PatchPCA algorithm family (four codec-aware rPPG algorithms). We evaluate 280 subjects across three public datasets, 11 codec degradation variants (MPEG-4, H.265, H.264, JPEG, chroma subsampling), and 13 algorithms via Wilcoxon tests (BH-FDR, q < 0.05, 904 tests). SAC explains 93.8% of between-variant variance in PCA advantage (r = +0.969), with zero overlap between codec families: non-MPEG-4 variants cluster at SAC 0.10-0.18 with 84-90% PCA win rates, while MPEG-4 variants cluster at SAC 0.48-0.59 with 61% win rate and a 5.8x reduction in mean improvement. Within subjects, 78% confirm the expected pattern (p < 10^-22, dz = 0.73). Within-variant subject-level SAC correlation is r = +0.099, confirming SAC classifies codec families rather than predicting individual outcomes. MPEG-4's effect is structural (macroblock DCT geometry, not noise amplitude), governed by source codec state, not resolution. P-Hybrid is identified as the most deployment-robust algorithm. Two necessary operating conditions for PatchPCA advantage are established: SAC < 0.30 and low-to-moderate motion, directly ruling out raw-to-MPEG-4 transcoding pipelines. SAC provides a physically grounded metric for codec-aware rPPG algorithm selection in clinical remote monitoring systems.

2606.04194 2026-06-04 cs.LG cs.CL cs.IR

Training-Free Lexical-Dense Fusion for Conversational-Memory Retrieval

免训练的词汇-稠密融合用于对话记忆检索

Christian Lysenstøen

发表机构 * Inland Norway University of Applied Sciences(内陆挪威应用科学大学) University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出一种免训练、仅CPU的检索方法,通过分数级融合最大查询-轮次相似度(后期交互)与BM25,显著提升多会话对话记忆检索的命中率,并分析了不同编码器和池化策略的影响。

Comments 9 pages, 3 figures, 10 tables. Code, data, and per-table receipts: https://github.com/Chrislysen/opsem

详情
AI中文摘要

在跨长多会话历史中检索回答新查询的过去几轮是长期对话记忆(LoCoMo, LongMemEval)背后的检索瓶颈。最近的并行工作Nano-Memory表明,通过最大查询-轮次相似度(后期交互,“轮次隔离检索”)对会话进行评分优于均值池化的会话嵌入。我们不声称该效果;我们复现它并询问一个免训练、仅CPU的检索阶段应在其周围添加什么。我们报告四个发现。(1)融合:在单个留一对话权重下,后期交互稠密分数与BM25的分数级融合,在六个编码器上比单独后期交互增加+8.8到+17.2个LoCoMo Hit@1点(所有p<1e-4),达到Hit@1 0.752 / NDCG@5 0.829(e5-large-v2),比BM25高+11.2个百分点。(2)一个现成的网络搜索交叉编码器重排序器在融合的前10个结果上效果不佳,将Hit@1降低6.9个百分点(一个重排序器,一种配置)。(3)池化算子消融显示top-k后期交互匹配最大相似度,但朴素的平滑最大值(log-sum-exp)对一半编码器失效。(4)所有六个编码器的后期减早期差距很大,且较大的编码器差距往往更大,而边际融合增益缩小;在LongMemEval-S上,一个BM25饱和的词汇机制中,相对于BM25的净融合增益很小且不显著。按类别分析将增益视为分工:稠密后期交互在多跳和时间问题上帮助最大,但在对抗性问题上落后于BM25。贡献是对一个强大的免训练检索方案的可控、可复现的描述,而非后期交互检索器本身(Nano-Memory的)。我们不声称完整的记忆架构;这是一个检索阶段的研究。

英文摘要

Retrieving the few past turns that answer a new query across long multi-session histories is the retrieval bottleneck behind long-term conversational memory (LoCoMo, LongMemEval). Recent concurrent work, Nano-Memory, shows that scoring a session by the maximum query-turn similarity (late interaction, "Turn Isolation Retrieval") beats mean-pooled session embeddings. We do not claim that effect; we replicate it and ask what a training-free, CPU-only retrieval stage should add around it. We report four findings. (1) Fuse: score-level fusion of the late-interaction dense score with BM25, under a single leave-one-conversation-out weight, adds +8.8 to +17.2 points of LoCoMo Hit@1 over late interaction alone across six encoders (all p<1e-4), reaching Hit@1 0.752 / NDCG@5 0.829 (e5-large-v2), +11.2 pp over BM25. (2) An off-the-shelf web-search cross-encoder reranker over the fused top-10 hurts here, degrading Hit@1 by 6.9 pp (one reranker, one configuration). (3) A pooling-operator ablation shows top-k late interaction matches max-similarity, but a naive smooth-max (log-sum-exp) collapses for half the encoders. (4) The late-minus-early gap is large for all six encoders and tends to be larger for larger ones, while the marginal fusion gain shrinks; on LongMemEval-S, a lexical regime where BM25 saturates, the net fusion gain over BM25 is small and not significant. A per-category analysis frames the gain as a division of labor: dense late interaction helps most on multi-hop and temporal questions but trails BM25 on adversarial ones. The contribution is a controlled, reproducible account of a strong training-free retrieval recipe, not the late-interaction retriever itself (Nano-Memory's). We make no claim to a complete memory architecture; this is a retrieval-stage study.

2606.04191 2026-06-04 cs.LG cs.AI

Metric-Aware Hybrid Forecasting for the CTF4Science Lorenz Challenge

CTF4Science Lorenz挑战的度量感知混合预测

Cen Lu

发表机构 * EPFL & Idiap Research Institute(瑞士联邦理工学院(EPFL)及Idiap研究所)

AI总结 针对CTF4Science Lorenz挑战,提出一种度量感知混合系统,通过为不同度量族分配专用预测器(去噪器、ODE拟合、直方图替换),在九项任务对上取得高分。

详情
AI中文摘要

我们描述了针对CTF4Science Lorenz挑战的方法,该基准混合了短时预测、长时间分布匹配和轨迹重建,涵盖九项任务对。关键发现是,没有单一模型族在所有度量上占优。相反,我们构建了一个度量感知混合系统,为每个度量族分配不同的预测器:(1)用于全轨迹重建的合成预训练去噪器,(2)用于前20个预测步的Lorenz ODE拟合和轨迹射击,以及(3)使用合成Lorenz库的直方图尾部替换用于长时间评估。该系统中一个具有代表性的成熟提交在公共排行榜上得分为83.83551,而采用相同思想的小型后续堆栈达到了83.85529。我们专注于更干净的中间系统,因为它捕获了完整方法,同时足够简单以重现和分析,而最终提交可以理解为同一骨干的保守扩展。

英文摘要

We describe our approach to the CTF4Science Lorenz challenge, a benchmark that mixes short-horizon forecasting, long-time distribution matching, and trajectory reconstruction across nine task pairs. The key discovery is that no single model family dominated all metrics. Instead, we built a metric-aware hybrid system that assigned a different predictor to each metric family: (1) synthetic-pretrained denoisers for full-trajectory reconstruction, (2) Lorenz ODE fitting and trajectory shooting for the first 20 forecast steps, and (3) histogram-tail substitution using synthetic Lorenz libraries for long-time evaluation. A representative mature submission from this system family scored 83.83551 on the public leaderboard, and a small follow-up stack of the same ideas reached 83.85529. We focus on the cleaner intermediate system because it captures the full method while remaining simple enough to reproduce and analyze, while the final submission can be understood as a conservative extension of the same backbone.

2606.04189 2026-06-04 cs.CL

ACAT: A Collaborative Platform for Efficient Aspect-Based Sentiment Dataset Annotation

ACAT:一种用于高效方面级情感数据集标注的协作平台

Ana-Maria Luisa Mocanu, Ciprian-Octavian Truica, Elena-Simona Apostol

发表机构 * National University of Science and Technology POLITEHNICA Bucharest(波兰那尔大学科学与技术学院)

AI总结 提出ACAT平台,通过自动化ETL流程和原生支持四种ABSA工作流,解决多标注者数据整合与一致性计算问题,实现高效标注并直接导出训练就绪数据集。

Comments Accepted at The 28th International Conference on Big Data Analytics and Knowledge Discovery (DaWak 2026)

详情
AI中文摘要

方面级情感分析(ABSA)需要高质量的数据集来训练可靠的模型。然而,现有的标注工具将输出视为平面文件,使得研究人员不得不通过自定义脚本手动整合多标注者数据、重建关系结构并计算可靠性指标。本文介绍了ACAT(基于方面的情感分析协作标注工具),这是一个基于Web的平台,原生支持四种ABSA工作流:(1)方面类别情感分析,(2)子句级分割,(3)具有字符级位置跟踪的方面术语情感分析,以及(4)具有双跨度偏移保留的方面情感三元组提取。其核心贡献是一个自动化的提取、转换、加载(ETL)管道,该管道在导出时直接对齐协作标注并计算标注者间一致性(IAA)指标,生成训练就绪的数据集。在1002条餐厅评论的初步验证中,由两名不同专业水平的标注者进行标注,ACAT的中位标注时间为31.58秒,所有任务的原始IAA在0.78到0.86之间。

英文摘要

Aspect-Based Sentiment Analysis (ABSA) requires high-quality datasets to train reliable models. However, existing annotation tools treat output as flat files, leaving researchers to manually consolidate multi-annotator data, reconstruct relational structures, and compute reliability metrics through custom scripts. This paper introduces ACAT (Aspect-based sentiment analysis Collaborative Annotation Tool), a web-based platform natively supporting four ABSA workflows: (1) Aspect-Category Sentiment Analysis, (2) Clause-Level Segmentation, (3) Aspect-Term Sentiment Analysis with character-level position tracking, and (4) Aspect Sentiment Triplet Extraction with dual span offset preservation. Its core contribution is an automated Extract, Transform, Load (ETL) pipeline that aligns collaborative annotations and computes Inter-Annotator Agreement (IAA) metrics directly at export, yielding training-ready datasets. In a preliminary validation on 1,002 restaurant reviews with two annotators of differing expertise, ACAT achieves a median annotation time of 31.58 seconds and a raw IAA ranging from 0.78 to 0.86 across all tasks.

2606.04188 2026-06-04 cs.LG cs.AI cs.RO

Dual Advantage Fields

双优势场

Alexey Zemtsov, Maxim Bobrin, Alexander Nikulin, Dmitry V. Dylov, Fakhri Karray, Vladislav Kurenkov, Martin Takáč, Arip Asadulaev

发表机构 * NUST MISIS(努斯大学材料科学与工程学院) MSU(莫斯科大学) Computational Imaging Lab(计算成像实验室) MBZUAI(马斯喀特人工智能研究院) dunnolab(杜诺实验室) Innopolis University(因诺波利斯大学)

AI总结 提出双优势场(DAF)方法,利用双线性对偶值模型生成局部优势信号,通过动作-效应模型预测折扣特征位移并与目标方向对齐来评分动作,实现离线目标条件强化学习中的策略提取。

Comments Accepted by ICML 2026 Workshop on Decision-Making from Offline Datasets to Online Adaptation: Black-Box Optimization to Reinforcement Learning

详情
AI中文摘要

离线目标条件强化学习需要长期可达性估计和局部动作比较。双目标表示提供捕获全局目标可达性的值场,但它们不直接指定在给定状态下应优先选择哪个动作。我们提出双优势场(DAF),一种策略提取方法,将双线性对偶值模型转化为局部优势信号。在双线性对偶参数化下,目标嵌入是值场关于状态表示的梯度。DAF学习一个动作-效应模型,预测由动作引起的折扣特征位移,并通过该位移与目标方向的对齐程度对动作进行评分。在可实现的情况下,该分数等于目标条件Bellman优势,从而提供标准的局部策略改进保证。在OGBench的 locomotion、manipulation 和 puzzle 任务上,DAF改进了聚合RLiable指标,并在局部正确动作与直接朝向最终目标移动不同的设置中表现强劲。

英文摘要

Offline goal-conditioned reinforcement learning requires both long-horizon reachability estimates and local action comparisons. Dual goal representations provide value fields that capture global goal reachability, but they do not directly specify which action should be preferred at a given state. We propose Dual Advantage Fields, a policy-extraction method that turns a bilinear dual value model into a local advantage signal. Under bilinear dual parameterization, the goal embedding is the gradient of the value field with respect to the state representation. DAF learns an action-effect model that predicts the discounted feature displacement induced by an action and scores actions by the alignment between this displacement and the goal direction. In the realizable case, this score equals the goal-conditioned Bellman advantage, yielding a standard local policy-improvement guarantee. On OGBench locomotion, manipulation, and puzzle tasks, DAF improves aggregate RLiable metrics and performs strongly in settings where locally correct actions differ from direct movement toward the final goal.

2606.04185 2026-06-04 cs.RO

Distribution-Free Risk-Aware Planning and Control Under Uncertainty Using Conformal Spectral Risk Control

基于共形谱风险控制的免分布风险感知规划与控制

Junsik Eom, Tulga Ersal

发表机构 * Department of Mechanical Engineering, University of Michigan(密歇根大学机械工程系)

AI总结 提出一种免分布的风险感知模型预测控制框架,通过扩展共形风险控制到谱风险度量,生成预测集以在不确定性下保证风险低于用户指定阈值,并在车辆避障仿真中验证了安全性和效率提升。

Comments Submitted to IEEE Robotics and Automation Letters

详情
AI中文摘要

在动态和不确定环境中的安全导航通常依赖于对真实潜在不确定性的准确估计或假设。然而,由于数据有限或信息不完善,准确描述真实不确定性分布往往很困难。即使在高风险规避水平下,对不确定性及其相关风险的错误理解也可能导致危险决策。为了解决这个问题,我们提出了一种风险感知模型预测控制(RA-MPC)框架,该框架结合预测集来保证风险控制在用户指定阈值以下,而无需对潜在不确定性分布做出假设。为了生成预测集,我们开发了一种免分布的风险量化框架,将共形风险控制(CRC)扩展到一般谱风险度量。然后,我们证明将预测集纳入MPC框架即使在不确定性错误指定的情况下也能提供关于谱风险约束满足的统计安全保证。我们在模拟的车辆避障场景中验证了所提出的框架,与基线RA-MPC框架相比,展示了更高的安全性和更短的求解时间。

英文摘要

Safe navigation in dynamic and uncertain environments often relies on accurate estimation of, or assumptions about, the true underlying uncertainty. However, accurately characterizing the true uncertainty distribution is often difficult due to limited data or imperfect information. An incorrect understanding of the uncertainty and its associated risk may lead to dangerous decisions even under high levels of risk aversion. To address this issue, we propose a risk-aware model predictive control (RA-MPC) framework that incorporates prediction sets to guarantee risk control below a user-specified threshold without requiring assumptions about the underlying uncertainty distribution. To generate the prediction sets, we develop a distribution-free risk quantification framework that extends conformal risk control (CRC) to general spectral risk measures. We then show that incorporating the prediction sets into the MPC framework provides statistical safety guarantees in terms of spectral risk constraint satisfaction even under uncertainty misspecification. We validate the proposed framework in simulated vehicle obstacle avoidance scenarios, demonstrating improved safety and reduced solve time compared to a baseline RA-MPC framework.

2606.04182 2026-06-04 cs.LG cs.AI stat.ML

Exact Unlearning in Reinforcement Learning

强化学习中的精确遗忘

Thanh Nguyen-Tang, Raman Arora

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文提出强化学习中的精确遗忘问题,通过ρ-TV稳定算法实现数据删除后输出与从未学习该数据时不可区分,并给出近乎最优的遗憾界。

Comments ICML Spotlight

详情
AI中文摘要

我们提出了强化学习中的精确遗忘问题,目标是设计一个高效框架,使得在收到删除请求后能够移除任何用户的数据,即遗忘后在线学习者的输出与从未与学习者交互过的用户所产生的结果不可区分。对于任意 $ρ>0$,我们证明存在一个 $ρ$-TV 稳定的强化学习算法,支持精确遗忘过程,其期望计算成本仅为从头重新训练计算成本的 $ρ\sqrt{\ln T}$ 分之一。我们为表格型马尔可夫决策过程构造了这样一个 $ρ$-TV 稳定的强化学习算法,其遗憾界为 $\mathcal{O}(H^2 \sqrt{SAT} + H^3 S^2 A + {H^{2.5} S^2 A}/ρ)$,其中 $S, A, H, T$ 分别表示状态数、动作数、回合长度和回合数。我们还为 $ρ$-TV 稳定的强化学习算法建立了 $\Omega(H\sqrt{\!SAT}\! +\! {SAH}/ρ)$ 的下界,表明我们的算法几乎是极小化最优的。

英文摘要

We formulate the problem of \emph{exact unlearning} in reinforcement learning, where the goal is to design an efficient framework that enables the removal of any user's data upon deletion request, i.e., the online learner's output after unlearning is \emph{indistinguishable} from what would have been produced had the deleted user never interacted with the learner. For any $ρ>0$, we show that there exists a reinforcement learning (RL) algorithm that is $ρ$-TV-stable and supports an exact unlearning procedure whose expected computational cost is only a $ρ\sqrt{\ln T}$ fraction of the computational cost of retraining from scratch. We construct such a $ρ$-TV-stable RL algorithm for tabular Markov decision processes (MDPs), which achieves a regret bound of $\mathcal{O}(H^2 \sqrt{SAT} + H^3 S^2 A + {H^{2.5} S^2 A}/ρ)$, where $S, A, H$, and $T$ denote the number of states, the number of actions, the episode horizon, and the number of episodes, respectively. We also establish a lower bound of $Ω(H\sqrt{\!SAT}\! +\! {SAH}/ρ)$ for $ρ$-TV-stable RL algorithms, showing that our algorithm is nearly minimax optimal.

2606.04180 2026-06-04 cs.LG cs.IT math.IT

KODA: Contrastive Representation Comparison and Alignment for Vision-Language Foundation Models

KODA: 视觉-语言基础模型的对比表示比较与对齐

Youqi Wu, Mohammad Jalali, Farzan Farnia

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 提出KODA框架,通过核优化方法对比分析视觉-语言基础模型的表示差异,并识别弱聚类与强聚类的样本子集,实现表示对齐。

详情
AI中文摘要

视觉-语言基础模型(如CLIP和SigLIP)为多模态学习系统提供了广泛使用的表示。虽然这些模型通常通过下游性能进行比较,但这种评估往往不能解释它们的表示在结构上如何不同。在本文中,我们通过对比嵌入聚类任务研究这一问题:识别在一个表示下弱聚类但在另一个表示下强聚类的样本子集。我们提出了\emph{核优化差异分析(KODA)},一个基于核的对比表示比较与对齐框架。KODA通过模态核组合构建统一的多模态核,并将差异发现形式化为一个约束优化问题,该问题在一个表示中搜索一致结构,同时抑制参考表示中的一致性。这产生了与特定样本子集和模态交互相关的可解释差异方向。为了将KODA扩展到大型视觉-语言数据集,我们开发了使用随机投影的联合核随机低维近似,包括用于平移不变核的随机傅里叶特征。实验上,KODA在视觉-语言表示中识别出一致且可解释的差异结构,并为表示对齐提供了样本子集。代码可在https://github.com/yokiwuuu/KODA获取。

英文摘要

Vision-language foundation models such as CLIP and SigLIP provide widely used representations for multimodal learning systems. While these models are typically compared through downstream performance, such evaluations often do not explain how their representations differ structurally. In this work, we study this problem through the task of Contrastive Embedding Clustering: identifying sample subsets that are weakly clustered under one representation but strongly clustered under another. We propose \emph{Kernel Optimization for Discrepancy Analysis (KODA)}, a kernel-based framework for contrastive representation comparison and alignment. KODA constructs unified multimodal kernels through modality-wise kernel composition and formulates discrepancy discovery as a constrained optimization problem that searches for coherent structures in one representation while suppressing coherence in a reference representation. This yields interpretable discrepancy directions associated with specific sample subsets and modality interactions. To scale KODA to large vision-language datasets, we develop randomized low-dimensional approximations of joint kernels using random projections, including Random Fourier Features for shift-invariant kernels. Empirically, KODA identifies consistent and interpretable discrepancy structures across vision-language representations and provides sample subsets for representation alignment. The code is available at https://github.com/yokiwuuu/KODA.

2606.04177 2026-06-04 cs.CL cs.AI

A Systematic Analysis of Linguistic Features in AI-Generated Text Detection Across Domains and Models

跨领域与模型的人工智能生成文本检测中语言特征的系统分析

Yassir El Attar, Esra Dönmez, Maximilian Maurer, Agnieszka Falenska

发表机构 * Institute for Natural Language Processing, University of Stuttgart(斯图加特大学自然语言处理研究所) Interchange Forum for Reflecting on Intelligent Systems, University of Stuttgart(智能系统反思交流论坛,斯图加特大学) GESIS Leibniz Institute for the Social Sciences(莱比锡社会科学院) Heinrich-Heine University Düsseldorf(杜塞尔多夫海因里希-海涅大学)

AI总结 通过大规模实证研究,系统评估284个可解释语言特征在27个LLM和10个文本领域中的鲁棒性,发现词汇丰富度是跨模型和领域的最可靠信号。

Comments preprint

详情
AI中文摘要

可解释的语言特征为解释给定文本为何看似机器生成提供了一种有前景的方法,尤其对于非专业用户。然而,关于哪些特征可靠地指示LLM生成文本的现有发现仍然分散在不同的特征集、模型和文本领域中。为解决这一差距,我们进行了一项大规模实证研究,评估语言信号在表征AI生成文本方面的鲁棒性。我们的分析涵盖了来自27个LLM和十个文本领域的输出中的284个可解释语言特征,并在跨模型和跨领域泛化设置下进行。我们表明,仅基于语言特征的分类器可以可靠地区分AI生成文本和人类撰写文本。然而,许多先前提出的指标被证明高度依赖上下文,但词汇丰富度指标除外,这些指标在模型家族和文本领域中保持鲁棒信号。这些结果展示了哪些语言信号在上下文中泛化,并为更可靠、可解释的AI生成语言分析提供了基础。

英文摘要

Interpretable linguistic features offer a promising approach for explaining why a given text appears machine-generated, particularly for non-expert users. However, existing findings on which features reliably indicate LLM-generated text remain fragmented across feature sets, models, and text domains. To address this gap, we conduct a large-scale empirical study assessing the robustness of linguistic signals for characterizing AI-generated text. Our analysis covers 284 interpretable linguistic features across outputs from 27 LLMs and ten text domains under cross-model and cross-domain generalization settings. We show that classifiers based solely on linguistic features can reliably distinguish AI-generated from human-written text. However, many previously proposed indicators prove strongly context-dependent, with the exception of measures of lexical richness, which remain robust signals across model families and text domains. These results demonstrate which linguistic signals generalize across contexts and provide a foundation for more reliable, interpretable analyses of AI-generated language.

2606.04176 2026-06-04 cs.LG math.ST stat.ML stat.TH

Low-rank Distributional Matrix Completion

低秩分布矩阵补全

Jiayi Wang, Raymond K. W. Wong

发表机构 * University of Texas at Dallas(德克萨斯大学达拉斯分校) Texas A&M University(德克萨斯农工大学)

AI总结 针对每个条目为概率分布的矩阵,提出基于核均值嵌入和Tucker秩的低秩结构,通过函数展开算子连接无限维与有限维,实现分布矩阵补全并给出非渐近误差界。

详情
AI中文摘要

我们研究了矩阵补全问题的分布推广,其中目标矩阵的每个条目是概率分布而非标量。在此设置中,仅观察到矩阵条目的一个子集,即使对于观察到的条目,底层分布也无法直接获取;相反,我们观察到从这些分布中抽取的有限样本。为了表示分布条目,我们采用核均值嵌入,并引入分布值矩阵的Tucker秩概念以捕捉其低秩结构。核嵌入的无限维性质带来了重大的方法论挑战。为解决此问题,我们引入了函数展开算子,将所提出的分布低秩结构与有限维张量的经典Tucker秩联系起来。基于此框架,我们提出了一种用于分布矩阵补全的新估计器。我们建立了非渐近误差界,刻画了估计器的统计性能。在合成数据和真实世界应用上的大量实验证明了所提方法的有效性。

英文摘要

We study a distributional generalization of the matrix completion problem in which each entry of the target matrix is a probability distribution rather than a scalar. In this setting, only a subset of matrix entries is observed, and even for observed entries, the underlying distributions are not directly accessible; instead, we observe finitely many samples drawn from them. To represent distributional entries, we employ kernel mean embeddings and introduce a notion of Tucker rank for distribution-valued matrices to capture their low-rank structure. The infinite-dimensional nature of kernel embeddings poses significant methodological challenges. To address this, we introduce functional unfolding operators that link the proposed distributional low-rank structure to the classical Tucker rank for finite-dimensional tensors. Based on this framework, we propose a novel estimator for distributional matrix completion. We establish non-asymptotic error bounds that characterize the statistical performance of the estimator. Extensive experiments on synthetic data and a real-world application demonstrate the effectiveness of the proposed method.

2606.04172 2026-06-04 cs.RO

Affordance2Action: Task-Conditioned Scene-level Affordance Grounding for Real-Time Manipulation

Affordance2Action: 任务条件下的场景级功能区域定位用于实时操作

Litao Liu, Yifan Han, Pengfei Yi, Wenbo Yu, Hanqing Wang, Haoran Du, Enze Yuan, Zilin Yuan, Ruiding Feng, Michael Liu, Qi Zhang, Jingjin Yu

发表机构 * Department of Computer Science, Rutgers University-New Brunswick(罗格斯大学新布朗斯维尔回声分校计算机科学系) The Hong Kong University of Science and Technology (GZ)(香港科学与技术大学(GZ)) Shanghai AI Laboratory(上海人工智能实验室)

AI总结 提出Affordance2Action框架,通过构建A2A-Bench基准和A2A-AffordGen标注流程,解决场景级任务条件功能区域定位中的多区域对应问题,并支持实时操作。

Comments 23 pages

详情
AI中文摘要

任务条件操作需要将指令定位到与任务相关的功能部件,而非物体类别。这种设置依赖于场景,并且在杂乱场景中通常是一对多的:同一物体在不同任务中可能提供不同的交互,而单个任务可能对应一个功能区域或多个有效功能区域,具体取决于场景布局。现有的功能区域数据集和基准与此设置不一致,因为它们通常侧重于抓取或物体级功能区域,依赖合成场景,或假设单一的指令-区域对应关系。我们提出了Affordance2Action (A2A),一个以基准为中心的学习框架,用于场景级、任务条件的功能区域定位。其核心是A2A-Bench,一个面向操作的基准,涵盖了日常场景中的单区域和多区域指令对应关系,其中多区域对应关系突显了现实多物体环境中功能区域定位的模糊性和多样性。为了大规模构建该基准,我们构建了A2A-AffordGen,一个代理辅助的标注流程,结合了语言模型过滤、交互式部件分割、实例级遮罩细化、任务推理指令生成和人工验证。A2A-Bench的监督进一步支持多种下游应用,其中实时功能区域定位和功能区域条件操作策略是两个代表性示例。实验表明,A2A暴露了通用分割、基于VLM的定位和功能区域蒸馏基线中的显著差距,同时改进了任务级定位并为下游操作提供了有用的空间先验。所有数据集和代码将公开发布,以促进开放研究。

英文摘要

Task-conditioned manipulation requires grounding instructions to task-relevant functional parts rather than object categories. This setting is scene-dependent and often one-to-many in cluttered scenes: the same object may afford different interactions across tasks, while a single task may correspond to either one functional region or multiple valid functional regions, depending on the scene layout. Existing affordance datasets and benchmarks remain misaligned with this setting, as they typically focus on grasping or object-level affordances, rely on synthetic scenes, or assume a single instruction-region correspondence. We present Affordance2Action (A2A), a benchmark-centered learning framework for scene-level, task-conditioned part affordance grounding. At its core is A2A-Bench, a manipulation-oriented benchmark that covers both single-region and multi-region instruction correspondences in everyday scenes, with the latter highlighting the ambiguity and diversity of affordance grounding in realistic multi-object environments. To construct it at scale, we build A2A-AffordGen, an agent-assisted annotation pipeline that combines language-model filtering, interactive part segmentation, instance-level mask-out refinement, task-reasoning instruction generation, and human verification. A2A-Bench's supervision further supports diverse downstream applications, with real-time affordance grounding and affordance-conditioned manipulation policies as two representative examples. Experiments show that A2A exposes substantial gaps in generic segmentation, VLM-based grounding, and affordance distillation baselines, while improving task-level localization and providing useful spatial priors for downstream manipulation. All datasets and code will be publicly released to promote open research.

2606.04168 2026-06-04 cs.LG cs.CR

When Autoregressive Consistency Hurts Safety Alignment

当自回归一致性损害安全对齐

Bochen Lyu, Yiyang Jia, Xiaohao Cai, Zhanxing Zhu

发表机构 * University of Southampton(索姆塞特大学)

AI总结 本文通过分析自回归一致性机制,揭示了大语言模型安全对齐的浅层性,并提出随机插入攻击和对抗性安全对齐方法。

Comments 21 pages

详情
AI中文摘要

大语言模型(LLMs)的安全对齐是脆弱的,部分原因在于它通常是浅层的:微调主要重塑模型在最初几个输出标记附近的行为。我们认为,这种现象可以通过自回归一致性来理解,即下一个标记预测倾向于一致地保持和扩展当前响应轨迹。通过分析安全对齐的学习动态,我们表明自回归一致性可以将对齐更新集中在早期标记上,为浅层安全对齐提供机制解释。同样的机制还预测了一类更广泛的LLM攻击:在输出轨迹的任意位置诱导有害延续状态的攻击。作为一个具体例子,我们引入了随机插入攻击,该攻击将一个短的有害片段插入原本安全的拒绝轨迹中,并利用自回归一致性维持由此产生的有害分支,从而绕过安全对齐。值得注意的是,即使在一个长的拒绝前缀之后,一个短的有害片段也能将生成重定向为有害,这突显了自回归一致性作为一个潜在的更广泛失败机制。这表明安全对齐还应该在整个输出轨迹中打破有害的自回归一致性。因此,我们提出了对抗性安全对齐,一个基于最坏情况有害延续状态的初始框架,并通过随机最坏插入训练实例化它。总体而言,我们的结果表明,自回归一致性应被视为安全对齐和攻击设计中的核心考虑因素。

英文摘要

Safety alignment in large language models (LLMs) is fragile in part because it is often shallow: fine-tuning mainly reshapes the model's behavior near the first few output tokens. We argue that this phenomenon can be understood through autoregressive consistency, the tendency of next-token prediction to preserve and extend the current response trajectory consistently. By analyzing the learning dynamics of safety alignment, we show that autoregressive consistency can concentrate alignment updates on early tokens, offering a mechanistic explanation for shallow safety alignment. The same mechanism also predicts a broader class of attacks on LLMs: attacks that induce harmful continuation states at arbitrary positions in the output trajectory. As a concrete example, we introduce random insertion attack, which inserts a short harmful span into an otherwise safe refusal trajectory and exploits autoregressive consistency to sustain the resulting harmful branch, thereby bypassing safety alignment. Notably, a short harmful span can redirect the generation to be harmful even after a long refusal prefix, highlighting autoregressive consistency as a potential broader failure mechanism. This suggests that safety alignment should also break harmful autoregressive consistency throughout the output trajectory. We therefore propose adversarial safety alignment, an initial framework based on worst-case harmful continuation states, and instantiate it with random worst-insertion training. Overall, our results suggest that autoregressive consistency should be treated as a central consideration in both safety alignment and attack design.

2606.04167 2026-06-04 cs.LG cs.AI

Smart Transportation Without Neurons -- Fair Metro Network Expansion with Tabular Reinforcement Learning

无神经元的智能交通——基于表格强化学习的公平地铁网络扩展

Dimitris Michailidis, Sennay Ghebreab, Fernando P. Santos

发表机构 * Socially Intelligent Artificial Systems University of Amsterdam(社会智能人工智能系统大学阿姆斯特丹)

AI总结 针对地铁网络扩展问题,提出将非马尔可夫奖励决策过程与表格强化学习相结合的方法,在保证性能的同时大幅降低训练轮次和碳排放,并融入社会公平性指标。

Comments 16 pages

详情
AI中文摘要

我们解决了地铁网络扩展问题(MNEP),这是交通网络设计问题(TNDP)的一个子集,专注于扩展地铁系统以满足出行需求。传统方法依赖于精确和启发式方法,需要专家定义的约束来缩小搜索空间。最近,深度强化学习(Deep RL)因其在复杂序列决策过程中的有效性而出现,但它仍然计算成本高、环境成本高,并且需要额外的工程来解释。我们表明,MNEP问题规模足够小,不需要深度强化学习方法。将MNEP重新表述为非马尔可夫奖励决策过程(NMRDP),我们使用表格强化学习以显著更少的训练轮次实现类似的性能,此外还提供了更高的可解释性。此外,我们将社会公平标准纳入奖励函数,侧重于效率和公平性,突出了我们方法的多功能性。在现实场景中——西安和阿姆斯特丹——我们的方法平均将总轮次减少了18倍,总碳排放减少了12倍,同时与深度强化学习保持竞争力。这种方法提供了一种可复制、模块化、可解释且资源高效的解决方案,并具有应用于其他组合优化问题的潜力。

英文摘要

We tackle the Metro Network Expansion Problem (MNEP), a subset of the Transport Network Design Problem (TNDP), which focuses on expanding metro systems to satisfy travel demand. Traditional methods rely on exact and heuristic approaches that require expert-defined constraints to reduce the search space. Recently, deep reinforcement learning (Deep RL) has emerged due to its effectiveness in complex sequential decision-making processes-it remains, however, computationally expensive, environmentally costly, and requires additional engineering to interpret. We show that MNEP problems are small enough to not require Deep RL methods. Reformulating the MNEP as a Non-Markovian Rewards Decision Process (NMRDP), we use tabular RL to achieve similar performance with significantly fewer training episodes, additionally offering greater interpretability. Additionally, we incorporate social equity criteria into the reward functions, focusing on efficiency and fairness, highlighting the versatility of our method. Evaluated in real-world settings-Xi'an and Amsterdam-our method reduces total episodes by a factor of 18 and total carbon emissions by a factor of 12 on average, while remaining competitive with Deep RL. This approach offers a replicable, modular, interpretable, and resource-efficient solution with potential applications to other combinatorial optimization problems.

2606.04166 2026-06-04 cs.CV

End-to-End Text Line Detection and Ordering

端到端文本行检测与排序

Benjamin Kiessling

发表机构 * ALMAnaCH, Inria, France(ALMAnaCH、法国国家信息与自动化研究所)

AI总结 提出Orli模型,将文本行检测与阅读顺序排序统一为图像到序列问题,通过自回归生成基线实现端到端处理,在多种历史文档上达到先进性能。

详情
AI中文摘要

实际的历史文档文本识别流程通常将布局分析分解为行检测和单独的阅读顺序步骤,后者通常由手工编码的几何启发式方法处理,但难以应对旁注、多列、表格和特定来源的编辑惯例。本文介绍了Orli(行的有序回归),一个端到端模型,将两个子任务视为单一的图像到序列问题:从页面图像中,Orli以自回归方式直接按阅读顺序生成文本行基线。基线采用弦框架参数化表示,该参数化锚定行的位置、方向和范围,同时通过垂直偏移编码局部几何;迭代细化头和局部视觉细化器生成最终曲线。在涵盖十种书写系统的196,691页异构语料库上训练,Orli在没有数据集特定训练的情况下,略微超过了之前报道的cBAD行检测的最先进水平,在多个阅读顺序基准测试中零样本达到近乎完美的覆盖率和排序,并通过有限的微调适应更专业的域外布局。该方法的源代码和模型权重在开放许可下可从https://github.com/mittagessen/orli获取。

英文摘要

Practical text-recognition pipelines for historical documents typically decompose layout analysis into line detection followed by a separate reading-order step, with the latter most often handled by a hand-coded geometric heuristic that struggles with marginalia, multiple columns, tables, and source-specific editorial conventions. This article introduces Orli (Ordered Regression of Lines), an end-to-end model that casts both sub-tasks as a single image-to-sequence problem: from a page image, Orli autoregressively generates text-line baselines directly in reading order. Baselines are represented in a chord-frame parameterization that anchors a line's position, orientation, and extent while encoding local geometry through perpendicular offsets; an iterative refinement head and a local visual refiner produce the final curve. Trained on a heterogeneous corpus of 196,691 pages spanning ten writing systems, Orli marginally exceeds the previously reported state of the art for cBAD line detection without dataset-specific training, reaches near perfect coverage and ordering on multiple reading-order benchmarks zero-shot, and adapts to more specialized out-of-domain layouts with limited fine-tuning. The method's source code and model weights are available under an open license at https://github.com/mittagessen/orli.

2606.04164 2026-06-04 cs.LG cs.AI

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

ADAPTOOD:面向分布外心电图时间序列模型的不确定性感知微调

Sotirios Vavaroutas, Yu Yvonne Wu, Ali Etemad, Cecilia Mascolo

发表机构 * University of Cambridge(剑桥大学) Dartmouth College(达特茅斯学院) Queen’s University(皇后大学)

AI总结 提出ADAPTOOD框架,利用数据不确定性量化分布偏移严重性,结合低秩更新和自适应超参数优化,在分布外心电图时间序列任务上提升准确率高达7%和精确率12.9%。

Comments 11 pages

详情
AI中文摘要

用于训练的数据样本通常与微调和部署期间遇到的数据不同,尽管机器学习模型显示出潜力,但在只有少量标注数据集可用时,其性能仍然有限。在由不同传感器、人群和应用设置引起的分布偏移下,性能通常会下降。尽管预训练有所帮助,但模型在现实环境中经常遇到分布外(OOD)数据,导致鲁棒性降低。现有的自适应方法通常假设固定的分布偏移,并在出现多种类型或严重性时难以应对。特别是,它们忽略了偏移的严重性,例如将适应大型熟悉数据集与适应带有新任务的小型数据集同等对待,这限制了泛化能力。为了解决这个问题,我们提出了ADAPTOOD,这是一个新颖的框架,利用数据不确定性来量化分布偏移的严重性并指导时间序列的微调。这种不确定性衡量目标部署分布中的样本与预训练分布偏离的程度,提供了OOD严重性的直接信号。我们的框架将这种不确定性与低秩模型更新和自适应超参数优化相结合,以改进自适应。我们表明,在OOD任务中,ADAPTOOD比现有方法实现了高达7%的准确率和12.9%的精确率提升,在分布偏移严重性增加时仍保持强劲性能。

英文摘要

Data samples used for training often differ from those encountered during fine-tuning and deployment, and while ML models show promise, their performance remains limited when only small annotated datasets are available. Performance often degrades under distribution shifts caused by diverse sensors, populations, and application settings. Although pre-training helps, models frequently encounter out-of-distribution (OOD) data in real-world settings, leading to reduced robustness. Existing adaptation methods usually assume fixed distribution shifts and struggle when multiple types or severities occur. In particular, they overlook shift severity, for example treating adaptation to a large familiar dataset the same as adaptation to a small dataset with a new task, which limits generalisation. To address this, we propose ADAPTOOD, a novel framework that leverages data uncertainty to quantify distribution shift severity and guide fine-tuning for time series. This uncertainty measures how strongly samples from the target deployment distribution deviate from the pre-training distribution, providing a direct signal of OOD severity. Our framework combines this uncertainty with low-rank model updates and adaptive hyperparameter optimisation to improve adaptation. We show that ADAPTOOD achieves up to 7% higher accuracy and 12.9% higher precision than existing methods in OOD tasks, maintaining strong performance as distribution shift severity increases.

2606.04161 2026-06-04 cs.LG

When Offline Selectors Cannot Beat the Best Single Model: A Diagnostic Study on edX Dropout Prediction

当离线选择器无法超越最佳单一模型:基于edX辍学预测的诊断研究

Tyler Crosse, Alan Nadelsticher Ruvalcaba, Dustin Khang LeDuc, Thomas Trask, Nicholas Lytle, David Joyner

发表机构 * edX

AI总结 针对离线选择器在实践中的表现常不如最佳单一模型的问题,提出三阶段诊断方法,通过k-NN标签一致性、离线学习器性能比较和状态特征消融实验,识别瓶颈为局部表示模糊性,建议改进状态或收集新数据而非调优学习器。

详情
AI中文摘要

不同的预测器通常在不同输入上表现优异,因此每实例选择最佳预测器有望比固定单一模型获得更高准确率。在实践中,从日志数据训练的选择器经常无法击败最强的单一预测器。在进一步调优之前,三个原因通常未被区分:不匹配的学习器、无法预测哪个模型获胜的状态、或从缓存到部署的标签偏移。 一个三阶段诊断在共享缓存上排除这些原因。第一阶段通过k-NN标签一致性估计oracle恢复的局部上限。第二阶段询问配对BC和离线RL学习器(BC、DQN和CQL,跨惩罚权重)是否达到该上限。第三阶段消融选择器状态,测试更丰富的特征是否会提高上限。综合结论指向最有希望的下一步:调优学习器、重新设计状态或收集新数据。 我们将其应用于在edX点击流数据上选择五个辍学预测模型。在16个时间窗口上,oracle平均比最强单一基模型高出9.7个准确率点,但BC、DQN和CQL均落在其下方的相同测试准确率带内(对十倍缓存扫描和N=2,000个保留样本鲁棒)。瓶颈是局部表示模糊性:CQL缩小了模仿差距但无部署增益(非保守性),遗憾在学习器间紧密聚集(非打破平局),三个学习器在测试准确率上收敛(非偏移)。下一次迭代应改变状态或收集新数据,而非进一步调优离线学习器。

英文摘要

Different predictors often excel on different inputs, so picking the best one per instance promises higher accuracy than committing to a single model. In practice, selectors trained from logged data routinely fail to beat the strongest single predictor. Three causes typically go unseparated before more tuning is applied: a mismatched learner, a state that does not predict which model wins, or buffer-to-deployment label shift. A three-stage diagnostic rules them out on a shared buffer. Stage~1 estimates a local ceiling on oracle recovery from $k$-NN label consistency. Stage~2 asks whether paired BC and offline-RL learners (BC, DQN, and CQL across penalty weights) reach that ceiling. Stage~3 ablates the selector state to test whether richer features would raise it. The combined verdict points to the most promising next step: tuning the learner, redesigning the state, or collecting new data. We apply it to selecting among five dropout-prediction models on edX clickstream data. Across 16 windows, the oracle beats the strongest single base model by 9.7 accuracy points on average, yet BC, DQN, and CQL land in the same test-accuracy band below it (robust to a tenfold buffer sweep and $N{=}2{,}000$ held-out examples). The bottleneck is local representational ambiguity: CQL closes the imitation gap without a deployment gain (not conservatism), regret clusters tightly across learners (not tie-breaking), and the three learners converge on test accuracy (not shift). The next iteration should change the state or collect new data, not tune the offline learner further.

2606.04160 2026-06-04 cs.CL cs.LG

Expert-Aware Refusal Steering

专家感知的拒绝引导

Anna C. Marbut, Daniel R. Olson, Travis J. Wheeler

发表机构 * Department of Interdisciplinary Studies(交叉学科研究部) University of Montana(蒙大拿大学) Department of Pharmacy Practice & Science(药学与科学系) University of Arizona(亚利桑那大学) European Bioinformatics Institute(欧洲生物信息研究所) European Molecular Biology Laboratory(欧洲分子生物学实验室) Wellcome Genome Campus(沃氏基因组校园)

AI总结 研究在混合专家(MoE)大语言模型中,通过专家感知的引导向量抑制拒绝行为,发现单个专家输出即可有效引导,且注意力机制在MoE拒绝行为中起重要作用。

Comments Under review for COLM 2026

详情
AI中文摘要

指令调优的大语言模型(LLM)的安全对齐依赖于模型可靠地拒绝回答有害或不允许请求的能力。最近的研究表明,在推理过程中对密集LLM应用引导向量可以有效抑制拒绝行为,诱导模型响应有害请求。我们将这种拒绝引导方法扩展到三个开源混合专家(MoE)LLM,并发现引导性能不受MoE架构固有的复杂路由模式影响。然后,我们提出了两种专家感知的拒绝引导方法,利用拒绝特定的专家路由模式和专家特定的引导方向来抑制正常的拒绝行为。我们发现,基于单个专家的输出即可有效引导拒绝行为。我们的结果表明,引导方法捕获的拒绝信号与专家路由行为不同,这表明注意力在MoE拒绝行为中扮演重要角色。

英文摘要

Safety alignment in instruction-tuned large language models (LLMs) depends on a model's ability to reliably refuse to respond to harmful or disallowed requests. Recent work has shown that a steering vector can be applied to a dense LLM during inference to effectively suppress refusal behavior, inducing response to harmful requests. We extend this refusal steering method to three open-source Mixture-of-Experts (MoE) LLMs and find that steering performance is uninhibited by the complex routing patterns inherent to the MoE architecture. We then propose two expert-aware refusal steering methods that leverage refusal-specific expert routing patterns and expert-specific steering directions to suppress normal refusal behavior. We find that refusal behavior can be effectively steered based on the output of a single expert. Our results show that refusal signals captured by steering methods differ from expert routing behavior, suggesting a substantial role for attention in MoE refusal behavior.

2606.04158 2026-06-04 cs.RO

Multi-Agent Next-Best-View Optimization for Risk-Averse Planning

多智能体风险规避规划中的下一最佳视角优化

Amirhossein Mollaei Khass, Vivek Pandey, Guangyi Liu, Athanasios Cosse, Emrah Bayrak, Nader Motee

发表机构 * Department of Mechanical Engineering and Mechanics, Lehigh University(莱文大学机械工程与力学系) Amazon Robotics(亚马逊机器人)

AI总结 提出一种分布式、风险感知的多智能体下一最佳视角框架,通过共识ADMM优化信息增益并建模碰撞风险,在降低通信开销的同时接近集中式方法的映射质量和轨迹安全性。

Comments 8 pages, 5 figures. Submitted to IROS 2026

详情
AI中文摘要

在不确定和未知环境中,多智能体下一最佳视角选择用于安全路径规划需要信息丰富、安全感知且高效的协调。集中式方法依赖于共享原始传感器数据或大量通信开销,导致可扩展性有限。我们提出一种分布式、风险感知的多智能体NBV框架,其中每个机器人维护一个私有的局部3D高斯溅射地图,团队共同最大化沿规划轨迹的掩蔽区域内的期望信息增益。通过通信图上的共识ADMM求解分布式目标,每个机器人仅交换候选视角、规划轨迹描述符和标量EIG贡献。通过局部3DGS地图上的平均风险价值对每条轨迹的碰撞风险进行建模,并用于塑造掩蔽半径和评分规划路径。在多个团队规模的Gibson环境中的实验表明,分布式公式在映射质量和轨迹安全性方面接近集中式基线,同时将通信量降低数个数量级。

英文摘要

Multi-agent Next-Best-View (NBV) selection for safe path planning in uncertain and unknown environments requires informative, safety-aware, and efficient coordination. Centralized approaches rely on sharing raw sensor data or significant communication overhead, resulting in limited scalability. We propose a distributed, risk-aware multi-agent NBV framework in which each robot maintains a private local 3D Gaussian Splatting map and the team jointly maximizes expected information gain (EIG) restricted to masked zones along planned trajectories. The resulting distributed objective is solved by Consensus ADMM (C-ADMM) over a communication graph, with each robot exchanging only candidate viewpoints, planned trajectory descriptors, and scalar EIG contributions. Collision risk along each trajectory is modeled via Average Value-at-Risk (AV@R) over the local 3DGS map and used both to shape the masking radius and to score planned paths. Experiments in Gibson environments at multiple team sizes show that the distributed formulation approaches the centralized baseline in mapping quality and trajectory safety while reducing communication by orders of magnitude.