arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2029
2606.05614 2026-06-05 cs.AI

Safety Paradox: How Enhanced Safety Awareness Leaves LLMs Vulnerable to Posterior Attack

安全悖论:增强的安全意识如何使LLM易受后验攻击

Long P. Hoang, Hai V. Le, Shaoyang Xu, Wei Lu, Wenxuan Zhang

发表机构 * Singapore University of Technology and Design(新加坡科技设计大学) Nanyang Technological University(南洋理工大学)

AI总结 本文揭示安全对齐增强的LLM因内部安全评估能力而面临后验攻击漏洞,通过实验和理论分析证明安全判断能力越强越易被利用,并提出因果干预验证。

详情
AI中文摘要

大型语言模型(LLM)经过严格对齐以拒绝有害请求,这一过程内在培养了评估和识别不安全内容的潜在能力。在这项工作中,我们揭示了这种高级安全意识无意中引入了一个致命漏洞。我们提出了后验攻击(Posterior Attack),一种单次查询的越狱方法,通过提示模型生成其内部分类器通常会标记为不安全的精确有害响应来绕过防护栏。通过对30个开源LLM(参数规模高达35B)和前沿模型(如GPT-5、Claude 4.6)的广泛实证评估,我们观察到一个显著现象:具有更优安全判断能力的模型更容易受到这种利用。为了解释这一点,我们形式化了安全悖论(Safety Paradox),分析表明安全对齐的单调改进自然放大了后验漏洞。最后,我们通过强化学习干预建立了因果联系,示例说明人为降低模型的安全判断能力可使其免疫攻击,而增强判断则会加剧漏洞。我们的发现揭示了当前对齐范式中的潜在缺陷,表明防御机制可能需要进一步的结构性改进。

英文摘要

Large language models (LLMs) are rigorously aligned to refuse harmful requests, a process that inherently cultivates a latent capacity to evaluate and recognize unsafe content. In this work, we reveal that this advanced safety awareness inadvertently introduces a fatal vulnerability. We introduce Posterior Attack, a single-query jailbreak that bypasses guardrails by prompting the model to generate the exact harmful response its internal classifier would normally flag as unsafe. Through extensive empirical evaluation across 30 open-source LLMs (up to 35B parameters in size) and frontier models (e.g., GPT-5, Claude 4.6), we observe a striking phenomenon: models with superior safety-judgment capabilities are disproportionately more susceptible to this exploitation. To explain this, we formalize the Safety Paradox, analytically showing that monotonic improvements in safety alignment naturally amplify posterior vulnerability. Finally, we establish a causal link via reinforcement learning interventions, exemplifying that artificially degrading a model's safety judgment immunizes it against the attack, whereas enhancing judgment exacerbates the vulnerability. Our findings highlight potential flaws in current alignment paradigms, indicating that defense mechanisms may require further structural refinement.

2606.05613 2026-06-05 cs.AI

Multilingual Fine-Tuning via Localized Gradient Conflict Resolution

通过局部梯度冲突解决的多语言微调

Long P. Hoang, Yiran Zhao, Wei Lu, Wenxuan Zhang

发表机构 * Singapore University of Technology and Design(新加坡科技设计大学) Salesforce AI Research(Salesforce人工智能研究) Nanyang Technological University(南洋理工大学)

AI总结 提出Bucket-Level MOO框架,将多语言微调重构为多目标优化问题,通过局部梯度冲突解决提升多语言性能。

详情
AI中文摘要

大型语言模型(LLMs)的快速发展已将跨语言多功能性确立为现代系统的定义特征。然而,微调这些模型经常引发跨语言的负面干扰。为了解决这个问题,我们将多语言微调重构为多目标优化(MOO)问题。具体来说,我们引入了Bucket-Level MOO,一个可扩展的分布式框架,它在参数桶上局部应用基于梯度的MOO算法。这使得冲突感知更新成为可能,而无需重建完整梯度向量的高昂通信开销。理论上,我们证明了这种局部解决自然地强制执行精炼帕累托平稳性,这是帕累托最优性的一个严格更紧的必要条件。实验上,Bucket-Level MOO通过驱动LLMs构建特定的语言维度来减轻干扰,提高了表示的可分离性。在四个基础LLM上的广泛实验表明,我们的方法在标准微调范式上显著提高了所见和未见的多语言性能。

英文摘要

The rapid evolution of Large Language Models (LLMs) has established cross-lingual versatility as a defining feature of modern systems. However, fine-tuning these models frequently induces negative interference across languages. To address this, we reformulate multilingual fine-tuning as a multi-objective optimization (MOO) problem. Specifically, we introduce Bucket-Level MOO, a scalable distributed framework that applies gradient-based MOO algorithms locally on parameter buckets. This enables conflict-aware updates without the prohibitive communication overhead of reconstructing full gradient vectors. Theoretically, we prove this localized resolution natively enforces Refined Pareto Stationarity, a strictly tighter necessary condition for Pareto optimality. Empirically, Bucket-Level MOO mitigates interference by driving LLMs to construct distinct language-specific dimensions, improving representational separability. Extensive experiments across four base LLMs demonstrate that our method significantly improves both seen and unseen multilingual performance over standard fine-tuning paradigms.

2606.05610 2026-06-05 cs.CL

Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training

LLM持续预训练中最优超参数的可预测缩放定律

Yongwei Zhou, Juncheng Diao, Junlin Shang, Peiguang Li, Rongxiang Weng

发表机构 * MeiTuan(美团) University of Chinese Academy of Sciences(中国科学院大学) Harbin Institute of Technology(哈尔滨工业大学)

AI总结 本文发现持续预训练中学习率和批大小等最优超参数遵循稳定可预测的缩放定律,并提出一个两阶段框架,通过小规模代理模型和状态感知预测,将超参数搜索开销降低90%且性能相当或更优。

详情
AI中文摘要

大型语言模型(LLM)持续预训练的效果取决于超参数配置,如学习率和批大小。然而,当前实践通常依赖启发式方法或网格搜索,导致训练不稳定和成本过高。在这项工作中,我们首先通过实验发现,在整个持续预训练过程中,最优超参数遵循稳定且可预测的缩放定律。利用这些见解,我们提出了一个新框架,用于建立给定检查点的计算预算与最优超参数之间的定量关系。我们的方法分为两个阶段:(1)经验定律发现,其中我们训练小规模代理模型,通过标准的损失-计算缩放定律推导出将计算预算映射到最优超参数的函数;(2)状态感知超参数预测,其中我们评估初始检查点的验证损失,并使用逆缩放定律估计其等效预训练计算量——即从零开始达到相同损失所需的计算量。结合计划的计算预算,我们预测目标运行的最优超参数。实验结果表明,我们的方法将超参数搜索开销降低了高达90%,同时实现了与基线相当或更优的性能。这个与模型无关的框架可跨架构推广,为从任意给定点开始的多样化持续预训练场景提供了一种原则性且高效的方法。

英文摘要

The efficacy of continued pre-training for Large Language Models (LLMs) hinges upon hyperparameter configurations, such as learning rate and batch size. However, current practices often rely on heuristics or grid searches, leading to training instability and excessive costs. In this work, we first empirically discover that optimal hyperparameters follow stable and predictable scaling laws throughout the continued pre-training process. Leveraging these insights, we propose a novel framework to establish quantitative relationships between compute budget and optimal hyperparameters for a given checkpoint. Our approach has two stages: (1) \textit{Empirical Law Discovery}, where we train small-scale proxy models to derive functions mapping compute budget to optimal hyperparameters via standard loss-compute scaling laws; and (2) \textit{State-Aware Hyperparameter Prediction}, where we evaluate an initial checkpoint's validation loss and use the inverse scaling law to estimate its \textit{equivalent pre-training compute} -- the compute needed to achieve the same loss from scratch. Combining this with the planned compute budget, we predict optimal hyperparameters for the target run. Empirical results demonstrate that our method reduces the hyperparameter search overhead by up to 90\% while achieving comparable or superior performance relative to baselines. This model-agnostic framework generalizes across architectures, providing a principled and efficient methodology for diverse continued pre-training scenarios starting from any given point.

2606.05606 2026-06-05 cs.LG cs.AI math.OC

Cross-Epoch Adaptive Rollout Optimization for RL Post-Training

跨时代自适应展开优化用于强化学习后训练

Yiming Zong, Yige Wang, Jiashuo Jiang

发表机构 * Department of Industrial Engineering & Decision Analytics, Hong Kong University of Science and Technology(工业工程与决策分析系,香港科学与技术大学)

AI总结 针对提示词训练信号差异大的问题,提出CERO方法,通过贝叶斯估计提示词成功概率并利用Fenchel对偶优化自适应分配展开预算,在固定总预算下提升样本效率。

详情
AI中文摘要

LLM后训练通常依赖于对每个提示采样多次展开的强化学习方法,但大多数现有方法对每个提示使用固定的展开预算,尽管不同提示提供的训练信号差异很大。本文研究在固定全局预算下的自适应展开分配,并将问题形式化为具有提示级递减收益的在线资源分配。我们的方法CERO维护每个提示成功概率的Beta后验分布,并使用后验期望伯努利方差作为额外展开价值的贝叶斯估计。我们利用该估计构建累积分配上的凹饱和效用函数,得到一个目标函数,其中跨提示和跨时代的决策通过全局预算耦合。由于所得目标在时间上不可分离,我们推导出Fenchel对偶重写,并通过投影在线梯度下降更新提示级和预算级对偶变量。在固定提示效用下,我们证明相对于离线分配基准的$O(\sqrt{K})$遗憾界。在数学推理问题上的实验表明,CERO在多个开源LLM和基准上持续优于GRPO,证明自适应展开预算可以提高样本效率。

英文摘要

LLM post-training often relies on reinforcement learning methods that sample multiple rollouts per prompt, yet most existing approaches use a fixed rollout budget for every prompt, despite large differences in the training signal different prompts provide. In this paper, we study adaptive rollout allocation under a fixed global budget and formulate the problem as online resource allocation with prompt-level diminishing returns. Our method, CERO, maintains a Beta posterior over each prompt's success probability and uses the posterior expected Bernoulli variance as a Bayesian estimate of the value of additional rollouts. We use this estimate to construct a concave, saturating utility over cumulative allocations, yielding an objective in which decisions across prompts and epochs are coupled by the global budget. Since the resulting objective is temporally nonseparable, we derive a Fenchel-dual reformulation and update both prompt-level and budget-level dual variables via projected online gradient descent. Under fixed prompt utilities, we prove an $O(\sqrt{K})$ regret bound against the offline allocation benchmark. Experiments on mathematical-reasoning problems show that CERO consistently outperforms GRPO across multiple open-weight LLMs and benchmarks, demonstrating that adaptive rollout budgeting can improve sample efficiency.

2606.05605 2026-06-05 cs.LG cs.NE

From Prediction to Self: Developmental Conditions for Agency in Minimal Neural Systems

从预测到自我:最小神经系统中能动性的发展条件

Evan Ye

发表机构 * Independent Researcher(独立研究者)

AI总结 通过40个逐步增加的实验,研究最小GRU系统如何区分自我与世界因果影响,发现四个严格顺序的发展条件,并提出能动性增益作为度量指标。

Comments 18 pages, 6 figures

详情
AI中文摘要

一个仅仅预测世界的系统如何区分自身的因果影响与其他一切?我们在一个最小192维GRU中通过40个受控实验(按发展序列排列,一次添加一个组件)追踪这一转变,并跟踪系统是否能区分自我引起的变化与世界引起的变化。发展路径揭示了必须严格按顺序满足的四个条件:(1)形成稳定吸引子的持久状态,(2)连接输出到输入的因果动作循环,(3)使隐式因果知识显式的本体感觉反馈,以及(4)异步觉醒——感知学习必须在动作学习开始之前巩固。我们提出能动性增益(A = Err_world - Err_self),即了解自身动作的预测优势,作为跟踪这一过程的度量。自我感知预测器在周期性(正弦)和混沌(洛伦兹)环境中始终优于自我盲预测器,并且该度量在移除所有辅助组件后仍然有效。只有前向采样的动作选择产生有意义的能动性增益;两种基于梯度的替代方案退化。同样重要的是12个被证伪的假设,它们映射了发展停滞的地方:仅靠预测编码不会产生自我表征。

英文摘要

How does a system that merely predicts the world come to distinguish its own causal influence from everything else? We trace this transition in a minimal 192-dimensional GRU through 40 controlled experiments arranged as a developmental sequence, adding components one at a time and tracking whether the system can distinguish self-caused from world-caused changes. The developmental path reveals four conditions that must be satisfied in strict order: (1) persistent state forming stable attractors, (2) a causal action loop linking output to input, (3) proprioceptive feedback that makes implicit causal knowledge explicit, and (4) asynchronous awakening - perceptual learning must consolidate before action learning begins. We propose agency gain (A = Err_world - Err_self), the predictive advantage of knowing one's own action, as a metric to track this process. The self-aware predictor consistently outperforms the self-blind predictor across periodic (sinusoidal) and chaotic (Lorenz) environments, and the metric survives ablation of all auxiliary components. Only forward-sampled action selection produces meaningful agency gain; two gradient-based alternatives degenerate. Equally significant are 12 falsified hypotheses mapping where development stalls: predictive coding alone does not produce self-represent

2606.05602 2026-06-05 cs.AI cs.HC cs.LG

Fix the Mind, Not the Move: Interpretable AI Assistance via Knowledge-Gap Localization

修正思维,而非动作:通过知识缺口定位实现可解释的AI辅助

Ayano Hiranaka, Ya-Chuan Hsu, Stefanos Nikolaidis, Erdem Bıyık, Daniel Seita

发表机构 * University of Tokyo(东京大学) National Institute of Information and Communications Technology(信息与通信技术国家研究所)

AI总结 提出SENSEI框架,通过结构化知识表示推断用户误解并提供针对性建议,在长时任务中实现零样本组合泛化,纠正90%的学生误解。

Comments Accepted to International Conference on Machine Learning (ICML) 2026

详情
AI中文摘要

在人机协作中,AI助手通常通过行为反馈(例如辅助驾驶中的警报或方向盘提示)来纠正次优的人类行为。此类干预可以缓解即时错误,但长期改进需要解决导致重复错误的潜在误解。我们引入了SENSEI,一个从交互行为推断用户误解并提供针对性、最小但充分建议的框架。我们的方法通过操作结构化知识表示来定位和纠正错误行为的根源,从而脱离动作或轨迹层面的干预。在具有不同误解和相应行为的三个长时任务中,SENSEI展示了零样本组合泛化能力,尽管仅针对单一误解案例进行训练,却能解开多个重叠的误解。一项用户研究进一步表明,我们的方法能够识别真实的人类误解,并提供有效的指导,从而提高长时任务表现,成功纠正了90%的学生误解。代码和项目页面见https://misoshiruseijin.github.io/SENSEI/。

英文摘要

AI assistants in human-AI collaboration often correct suboptimal human actions through behavioral feedback (e.g., alerts or steering-wheel nudges in assistive driving). Such interventions can mitigate immediate errors, but long-term improvement requires addressing the underlying misconceptions that cause repeated mistakes. We introduce SENSEI, a framework that infers user misconceptions from interaction behavior and provides targeted, minimal yet sufficient suggestions to correct them. Our approach departs from action- or trajectory-level interventions by operating over a structured knowledge representation to localize and correct the sources of erroneous behavior. Across three long-horizon tasks with diverse misconceptions and corresponding behaviors, SENSEI demonstrates zero-shot compositional generalization, disentangling multiple overlapping misconceptions despite training only on single-misconception cases. A user study further shows that our method identifies real human misconceptions and provides effective guidance that improves long-horizon task performance, successfully correcting $90\%$ of student misconceptions. Code and project page are available at https://misoshiruseijin.github.io/SENSEI/.

2606.05599 2026-06-05 cs.LG math.ST stat.ME stat.ML stat.TH

Mitigating the Curse of Dimensionality in Uniform Convergence of Deep Neural Networks via Smooth Activations

通过平滑激活函数缓解深度神经网络一致收敛中的维度灾难

Yizhe Ding, Runze Li, Jia Liu, Lingzhou Xue

发表机构 * Department of Statistics, The Pennsylvania State University(宾夕法尼亚州立大学统计学系)

AI总结 本文通过分析平滑激活深度神经网络,建立了统一收敛的理论框架,证明其能够通过自适应利用目标函数的低维层次组合结构来缓解维度灾难。

Comments 30 pages, 5 figures

详情
AI中文摘要

本文为平滑激活深度神经网络(DNN)估计量的一致收敛建立了理论框架。虽然标准ReLU网络在各种非参数回归任务中,在$L^2(P)$范数下达到了极小化最优速率,但我们建立了一个理论下界,表明最小二乘ReLU估计量在其一致收敛行为中可能遭受维度灾难。受下游任务中对最坏情况可靠性的需求驱动,我们通过分析平滑激活DNN(平滑DNN),包括前馈和残差结构,来解决这一局限性。我们为这些模型的逼近器建立了新的伪维数界、非渐近逼近保证和Hölder范数界。利用这些结果,我们推导了平滑DNN估计量在多种统计上下文(包括Huber回归、最小二乘回归、分位数回归和逻辑回归)中的非渐近一致收敛速率。我们证明,平滑DNN可以通过自适应利用目标函数的低维层次组合结构来缓解一致收敛中的维度灾难。通过模拟研究和实际应用的支持,我们的结果将平滑DNN定位为在需要一致保证的统计学习任务中,理论上合理且实践上可行的ReLU网络替代方案。

英文摘要

This paper establishes a theoretical framework for the uniform convergence of smoothly activated deep neural network (DNN) estimators. While standard ReLU networks achieve minimax-optimal rates in the $L^2(P)$ norm for various nonparametric regression tasks, we establish a theoretical lower bound demonstrating that least-squares ReLU estimators can suffer from the curse of dimensionality in their uniform convergence behavior. Motivated by the need for reliable uniform guarantees in downstream tasks requiring worst-case reliability, we address this limitation by analyzing smoothly activated DNNs (smooth DNNs), encompassing both feedforward and residual structures. We establish novel pseudo-dimension bounds, non-asymptotic approximation guarantees, and Hölder-norm bounds for the approximators of these models. Leveraging these results, we derive non-asymptotic uniform convergence rates for smooth DNN estimators across multiple statistical contexts, including Huber, least-squares, quantile, and logistic regression. We prove that smooth DNNs can mitigate the {curse of dimensionality} in uniform convergence by adaptively exploiting the low-dimensional hierarchical composition structure of the target function. Supported by both simulation studies and a real-world application, our results position smooth DNNs as a theoretically grounded and practically viable alternative to ReLU networks for statistical learning tasks requiring uniform guarantees.

2606.05588 2026-06-05 cs.RO cs.LG

Auditing Demonstration Curation Metrics: Action-Only Scorers Fail on the Structural Defects That Degrade Imitation Policies

审计示范策展指标:仅动作评分器在降低模仿策略的结构缺陷上失败

Aarav Bedi

发表机构 * Aarav Bedi

AI总结 本研究构建受控测试平台,注入两类示范缺陷(细微扰动和结构错误),审计七种策展指标,发现仅动作指标无法检测结构错误,且部分指标评分倒置,而状态轨迹指标能部分检测但下游性能恢复有限。

Comments 5 pages, 3 figures, 4 tables

详情
AI中文摘要

模仿学习策略继承了其训练示范的质量,越来越多的策展指标声称能自动评分和过滤低质量示范。这些指标各自在不同协议的不同数据上验证,因此不清楚哪些指标真正识别出损害策略的示范。我们构建了一个受控测试平台,其中示范缺陷以已知类型注入,并沿两个轴审计七种策展指标:每个指标区分缺陷示范与清洁示范的效果,以及基于每个指标策展的子集训练行为克隆策略是否提高任务成功率。我们研究两种缺陷机制。细微扰动(相关动作噪声、震颤、截断)可通过多变量离群值评分检测,一旦移除,可恢复全部下游差距。结构错误,即示范在关键时刻执行错误动作,对我们测试的每个仅动作指标都是不可见的,其中两个指标是倒置的:它们将缺陷示范评分为更高质量,并用于策展时,往往使策略处于或低于未策展基线,而非高于基线。只有检查状态轨迹的指标能检测结构错误,即使最好的指标也只能恢复三分之一的下游差距。高检测准确性并不保证下游改进。我们发布了测试平台和所有策展实现。

英文摘要

Imitation-learning policies inherit the quality of the demonstrations they are trained on, and a growing set of curation metrics promise to score and filter low-quality demonstrations automatically. These metrics are each validated on different data with different protocols, so it is unclear which of them actually identify the demonstrations that harm a policy. We build a controlled testbed in which demonstration defects are injected with known type, and audit seven curation metrics along two axes: how well each separates defective from clean demonstrations, and whether training a behavior-cloning policy on each metric's curated subset improves task success. We study two defect regimes. Subtle perturbations (correlated action noise, tremor, truncation) are detectable by multivariate outlier scoring and, once removed, recover the full downstream gap. Structural errors, where the demonstration executes a wrong action at a key moment, are invisible to every action-only metric we test, and two of them are inverted: they score defective demonstrations as higher quality and, used for curation, tend to leave the policy at or below the uncurated baseline rather than above it. Only metrics that examine the state trajectory detect structural errors, and even the best of them recovers just a third of the downstream gap. High detection accuracy does not guarantee downstream improvement. We release the testbed and all curation implementations.

2606.05587 2026-06-05 cs.CV cs.AI cs.LG

HDST-GNN: Heterogeneous Dynamic Spatiotemporal Graph Neural Networks for Multi-Object Tracking in UAV Aerial Imagery

HDST-GNN:用于无人机航拍图像多目标跟踪的异质动态时空图神经网络

Phillip Jiang

发表机构 * Phillip Jiang(菲利普·姜)

AI总结 针对无人机航拍中目标小、密集、遮挡导致身份切换的问题,提出异质动态时空图神经网络HDST-GNN,通过高度自适应边构建、异质节点表示和遮挡门控时序聚合提升跟踪性能。

Comments 18 pages, 4 figures, 6 tables

详情
AI中文摘要

无人机航拍图像的多目标跟踪(MOT)面临独特挑战:序列间高度变化、目标小而密集、频繁遮挡导致身份切换。现有基于图的跟踪器假设固定空间上下文并统一处理所有目标,忽略了检测、活跃轨迹和丢失目标等异质生命周期状态。我们提出HDST-GNN,一种异质动态时空图神经网络,包含三项创新。首先,高度自适应边构建根据平均目标面积估计相机高度代理,并相应调整图连接半径。其次,异质节点表示将检测(D型)、确认轨迹(T型)和丢失轨迹(L型)建模为不同节点类型,具有专用投影和类型化边关系。第三,遮挡门控时序聚合根据每个节点的遮挡置信度门控其注意力贡献,防止被遮挡节点破坏邻居嵌入。HDST-GNN使用可微Sinkhorn头部,结合交叉熵和三元组损失进行端到端训练。在VisDrone2019-MOT上使用oracle检测时,HDST-GNN达到94.51% MOTA和97.24% IDF1,比SORT高出+5.0 MOTA点,身份切换减少81%。使用真实YOLOv8n检测时,HDST-GNN相比SORT身份切换减少49%。消融研究证实了每个组件的独立贡献。

英文摘要

Multi-object tracking (MOT) from UAV imagery presents unique challenges: altitude varies across sequences, objects are small and densely packed, and frequent occlusion causes identity switches. Existing graph-based trackers assume fixed spatial context and treat all objects uniformly, ignoring the heterogeneous lifecycle states of detections, active tracklets, and lost targets. We propose HDST-GNN, a Heterogeneous Dynamic Spatiotemporal Graph Neural Network with three novel contributions. First, Altitude-Adaptive Edge Construction estimates a camera-altitude proxy from mean object area and adjusts the graph connectivity radius accordingly. Second, Heterogeneous Node Representation models detections (Type-D), confirmed tracklets (Type-T), and lost tracklets (Type-L) as distinct node types with dedicated projections and typed edge relations. Third, Occlusion-Gated Temporal Aggregation gates each node's attention contribution by its occlusion confidence, preventing occluded nodes from corrupting neighbour embeddings. HDST-GNN is trained end-to-end with a differentiable Sinkhorn head using joint cross-entropy and triplet loss. On VisDrone2019-MOT with oracle detections, HDST-GNN achieves 94.51% MOTA and 97.24% IDF1, outperforming SORT by +5.0 MOTA points and reducing identity switches by 81%. With real YOLOv8n detections, HDST-GNN reduces identity switches by 49% vs. SORT. Ablation studies confirm the independent contribution of each component.

2606.05586 2026-06-05 cs.CV cs.MM

BMCR: Adaptive Backbone Module Composition via Reinforcement Learning for Remote Sensing Object Detection

BMCR: 基于强化学习的自适应主干模块组合用于遥感目标检测

Wenlin Liu, Xikun Hu, Ping Zhong

发表机构 * College of Electronic Science and Technology, National University of Defense Technology(电子科学与技术学院,国防科技大学)

AI总结 提出BMCR方法,通过强化学习动态组合CNN和ViT的模块化主干,解决遥感目标检测中不同复杂度输入的自适应特征提取问题,在多个数据集上取得领先性能。

详情
AI中文摘要

在遥感目标检测中,卷积神经网络擅长捕捉局部细节,而视觉Transformer更擅长全局上下文建模。然而,现有检测器通常依赖单一固定主干或手动设计的混合架构,无法自适应地利用这些互补优势处理不同复杂度的输入。为解决这一局限,我们提出基于强化学习的主干模块组合(BMCR)。BMCR从现成的CNN和ViT主干中分解出可重用模块,动态组装输入自适应推理路径。为实现跨家族组合,我们首先构建了一个可扩展的模块工具箱。具体而言,我们将代表性的CNN和ViT主干分解为可重用的功能模块,并为每个模块封装明确的结构、语义和计算元数据,以实现兼容性感知的组装。为弥合基于网格的CNN特征与基于令牌的ViT表示之间的差距,我们设计了一种轻量级的基于最优传输(OT)的过渡接口,在保持空间一致性的同时确保分布感知对齐。然后,将主干组合过程建模为序列决策问题,其中策略网络根据中间多尺度观测逐步选择任务相关模块。为稳定可重用模块和路由策略的联合优化,我们进一步开发了自适应模块协同优化(AMCO)策略,在训练过程中协调模块更新、路由探索和奖励分配。在DOTA-v1.0、DOTA-v1.5和DIOR-R上,BMCR分别达到79.31%、73.41%和71.86%的mAP,在保持竞争效率的同时,超越强静态和动态基线最多2.5个百分点。

英文摘要

In remote sensing object detection, Convolutional Neural Networks (CNNs) excel at capturing local details while Vision Transformers (ViTs) are better at global context modeling. However, existing detectors typically rely on a single fixed backbone or a manually designed hybrid architecture, and thus fail to adaptively exploit these complementary strengths across inputs of diverse complexity. To address this limitation, we propose Backbone Module Composition via Reinforcement Learning (BMCR). BMCR dynamically assembles input-adaptive inference paths from reusable modules decomposed from off-the-shelf CNN and ViT backbones. To enable such cross-family composition, we first construct an extensible module toolbox. Specifically, we decompose representative CNN and ViT backbones into reusable functional modules and encapsulate each module with explicit structural, semantic, and computational metadata for compatibility-aware assembly. To bridge the gap between grid-based CNN features and token-based ViT representations, we design a lightweight Optimal Transport (OT) based transition interface that ensures distribution-aware alignment while respecting spatial consistency. The backbone composition process is then formulated as a sequential decision problem, in which a policy network progressively selects task-relevant modules according to intermediate multi-scale observations. To stabilize the joint optimization of reusable modules and the routing policy, we further develop an Adaptive Module Cooperative Optimization (AMCO) strategy that coordinates module updating, routing exploration, and reward assignment during training. On DOTA-v1.0, DOTA-v1.5 and DIOR-R, BMCR achieves 79.31\%, 73.41\% and 71.86\% mAP, respectively, surpassing strong static and dynamic baselines by up to 2.5 points while maintaining competitive efficiency.

2606.05576 2026-06-05 cs.CV

UltraVR: A Diagnostic Ultra-Resolution Image-VQA Benchmark for Evidence-Grounded Reasoning

UltraVR:面向证据推理的诊断性超分辨率图像VQA基准

Gexin Huang, Yanting Yang, Myeongkyun Kang, Beidi Zhao, Jun Zhou, Chen Zhou, Gang Wang, Zu-hua Gao, Xiaoxiao Li

发表机构 * University of British Columbia(不列颠哥伦比亚大学) Vector Institute(向量研究所) BC Cancer Agency(不列颠哥伦比亚癌症中心) The Hong Kong Polytechnic University(香港理工大学)

AI总结 提出UltraVR基准,通过结构化思维链标注诊断视觉语言模型在超分辨率图像上的证据推理能力,发现模型在证据定位和局部感知环节错误集中。

Comments 10 pages, 1 figure

详情
AI中文摘要

视觉语言模型(VLM)在视觉问答和多模态推理基准上表现出色。然而,它们在超分辨率图像上的能力——其中关键证据微小、细微、空间遥远或分布广泛——仍不清楚。现有评估主要报告最终答案准确率,对模型是否获取并整合必要视觉证据的洞察有限。我们引入UltraVR,一个面向超分辨率图像上基于证据的视觉推理的诊断性基准。UltraVR涵盖四个高价值场景:CCTV监控、遥感(RS)、全切片图像(WSI)病理学和工业异常检测(AD)。这些领域提出互补挑战:拥挤CCTV场景中的细粒度目标定位、RS中的长程空间比较、WSI中的多尺度证据导航以及重复工业布局中的细微不规则检测。除了标准QA三元组,每个实例包括一个结构化的真实思维链,包含步骤级问题、中间答案和推理标签。这些标签将推理分解为证据定位、局部感知、量化、证据整合和决策推断,从而实现对黑盒评分的流程级诊断。使用UltraVR,我们评估前沿VLM,并表明当前模型在超分辨率推理上仍远不可靠。重要的是,结构化注释使我们能够定位从视觉到决策流水线中的失败:错误集中在证据定位和局部感知,而当提供中间视觉事实时,下游推理通常能够恢复。这些发现表明UltraVR是一个诊断性测试平台,不仅衡量VLM是否回答正确,还衡量其超分辨率推理过程在何处中断。

英文摘要

Vision-language models (VLMs) excel on visual question answering and multimodal reasoning benchmarks. Yet their capability on ultra-resolution images - where critical evidence is tiny, subtle, spatially distant, or distributed - remains unclear. Existing evaluations largely report final-answer accuracy, offering limited insight into whether models acquire and integrate the necessary visual evidence. We introduce UltraVR, a diagnostic benchmark for evidence-grounded visual reasoning over ultra-resolution images. UltraVR spans four high-value scenarios: CCTV surveillance, remote sensing (RS), whole-slide image (WSI) pathology, and industrial anomaly detection (AD). These domains pose complementary challenges: fine-grained object grounding in crowded CCTV scenes, long-range spatial comparison in RS, multi-scale evidence navigation in WSI, and subtle irregularity detection in repetitive industrial layouts. Beyond standard QA triples, each instance includes a structured ground-truth chain of thought with step-level questions, intermediate answers, and reasoning labels. These labels decompose reasoning into evidence grounding, local perception, quantification, evidence integration, and decision inference, enabling process-level diagnosis over black-box scoring. Using UltraVR, we evaluate frontier VLMs and show that current models remain far from reliable on ultra-resolution reasoning. Importantly, the structured annotations allow us to localize failures across the visual-to-decision pipeline: errors concentrate in evidence grounding and local perception, while downstream inference often recovers when intermediate visual facts are supplied. These findings demonstrate UltraVR as a diagnostic testbed for measuring not only whether VLMs answer correctly, but where their ultra-resolution reasoning process breaks.

2606.05575 2026-06-05 cs.SD eess.AS

SB-RF: Schrödinger Bridge Rectified Flow for One-Step Robust Speech Enhancement

SB-RF: 用于一步鲁棒语音增强的薛定谔桥整流流

Caixia Lu, Xueyang Lv, Penglong Hu, Jiaming Xu

发表机构 * Xiaomi Corporation, Beijing, China(小米公司,北京,中国)

AI总结 提出SB-RF,一种结合整流流与薛定谔桥理论的一步生成式语音增强框架,通过熵正则化最优传输构建条件桥,实现高质量一步生成,在VoiceBank-DEMAND基准上达到领先性能,并在低信噪比场景下展现出强鲁棒性和高效率。

详情
AI中文摘要

生成模型在语音增强中表现出令人印象深刻的结果,但通常受限于多步推理。我们提出SB-RF,一种将整流流(RF)与薛定谔桥(SB)理论相结合的一步生成框架。SB-RF通过熵正则化最优传输在干净和带噪语音分布之间构建条件桥。通过RF的速度匹配目标将SB轨迹与最优传输测地线对齐,SB-RF能够通过一步生成实现高质量增强。实验表明,SB-RF在VoiceBank-DEMAND基准上达到了生成方法中的领先性能。此外,为了全面评估在具有挑战性的真实场景中的性能,我们在一个模拟的低信噪比测试集上使用扩大的训练数据集评估SB-RF。在这些条件下,SB-RF展现出强大且具有竞争力的鲁棒性和高效率,验证了其在现实应用中的潜力。

英文摘要

Generative models have shown impressive results in speech enhancement but often suffer from multi-step inference. We propose SB-RF, a one-step generative framework integrating Rectified Flow (RF) with Schrödinger Bridge (SB) theory. SB-RF constructs a conditional bridge between clean and noisy speech distributions via entropy-regularized optimal transport. By aligning SB trajectories with the optimal transport geodesic through the velocity-matching objective of RF, SB-RF enables high-quality enhancement with one-step generation. Experiments demonstrate that SB-RF achieves leading performance among generative methods on the VoiceBank-DEMAND benchmark. Furthermore, to fully assess performance in challenging real-world scenarios, we evaluate SB-RF on a simulated low signal-to-noise ratio test set using an expanded training dataset. Under these conditions, SB-RF exhibits strong and competitive robustness with high efficiency, validating its potential for real-world applications.

2606.05571 2026-06-05 cs.SD eess.AS

Sound Effects Dataset Unification With the Universal Category System

使用通用分类系统统一音效数据集

Jun Woo Beck, Alexander Lerch

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 提出一个基于通用分类系统(UCS)的模块化数据集重新标注框架,通过规则驱动的多阶段流水线和冲突解决实现高自动转换率,并创建了包含58,057个音频片段的统一数据集EnvSound-UCS。

Comments DAFx 2026 camera-ready version

详情
AI中文摘要

音效(SFX)数据集和库通常采用不同的标注方案、分类法和元数据结构。这给SFX分类和生成的研究带来了挑战,因为不兼容的分类法导致数据集孤立,可能需要个性化方法,产生不可比较的结果,并阻碍数据合并策略。我们提出了一个模块化的数据集重新标注框架,采用通用分类系统(UCS)——一种行业标准的音效层次分类法——作为共享结构基础。这个开源框架使我们能够(i)通过基于规则的多阶段流水线和冲突解决,将现有数据集的标签转换为UCS,实现高自动转换率;(ii)为新标签建议分层数据集划分;(iii)合并多个数据集。为了展示实际效用,我们引入了EnvSound-UCS数据集,这是一个公开可用的、符合UCS的统一环境声音数据集,包含来自AudioSet、FSD50K和ESC-50三个来源的58,057个音频片段。

英文摘要

Sound effects (SFX) datasets and libraries often employ distinct tagging schemes, taxonomies, and metadata structures. This creates challenges for research on SFX classification and generation because incompatible taxonomies lead to siloed datasets that might require individualized approaches, result in non-comparable outcomes, and prevent data merging strategies. We propose a modular dataset relabeling framework that adopts the Universal Category System (UCS), an industry-standard hierarchical taxonomy for sound effects, as a shared structural foundation. This open-source framework enables us (i) to convert tags of existing datasets to UCS with a rule-based multi-stage pipeline and conflict resolution to achieve high automatic conversion rates, (ii) to suggest a stratified dataset split for the new labels, and (iii) to combine multiple datasets. To showcase the practical utility, we introduce the EnvSound-UCS dataset, a publicly available unified UCS-compliant dataset of environmental sounds with 58,057 sound clips from three sources: AudioSet, FSD50K, and ESC-50.

2606.05570 2026-06-05 cs.CL cs.AI

TensorBench: Benchmarking Coding Agents on a Compiler-Based Tensor Framework

TensorBench: 在基于编译器的张量框架上对编码智能体进行基准测试

Bobby Yan, Fredrik Kjolstad

发表机构 * Department of Computer Science, Stanford University(计算机科学系,斯坦福大学)

AI总结 本文提出 TensorBench,一个包含199个特征添加和重构任务的基准测试,用于评估编码智能体在基于编译器的张量框架上的表现,并通过测试套件自动评分。

详情
AI中文摘要

仓库级别的编码基准测试面临任务难度与评估可靠性之间的权衡:挑战前沿模型的任务通常涉及代码库庞大且测试覆盖不完整,而人工审查难以扩展。我们引入了 TensorBench,这是一个包含199个特征添加和重构任务的基准测试,基于一个开源的基于编译器的张量框架,该框架通过一流的密集和稀疏张量支持扩展了 PyTorch。任务涵盖新的稀疏格式、密集优化过程、IR 转换、调度器更改、运行时组件以及高级数值算子。TensorBench 通过应用智能体的补丁并运行框架的测试套件(包括预先存在的随机回归测试和智能体添加的任何测试)来对每次运行进行评分。对于特征添加任务,通过意味着修补后的仓库保留了测试过的预先存在的行为,并满足了智能体为请求特征添加的检查。我们评估了七个编码智能体,涵盖三个前沿模型系列和一个开放权重模型。在此标准下的通过率从最强智能体的 $64.8\%$ 到最弱智能体的 $22.1\%$ 不等。智能体通过不同的任务子集:成对 Cohen's $κ$ 范围从 $-0.07$ 到 $0.43$,两个最强智能体的 $κ= 0.05$。

英文摘要

Repository-level coding benchmarks face a trade-off between task difficulty and evaluation reliability: tasks that challenge frontier models often involve large codebases with incomplete test coverage, while human review does not scale. We introduce TensorBench, a benchmark of 199 feature-addition and refactoring tasks on an open-source compiler-based tensor framework that extends PyTorch with first-class support for dense and sparse tensors. Tasks cover new sparse formats, dense optimization passes, IR transformations, scheduler changes, runtime components, and high-level numerical operators. TensorBench grades each run by applying the agent's patch and running the framework's test suite, which includes the pre-existing randomized regression tests and any tests the agent adds. For feature-addition tasks, a pass means that the patched repository preserves the tested pre-existing behavior and satisfies the agent-added checks for the requested feature. We evaluate seven coding agents spanning three frontier model families and one open-weight model. Pass rates under this criterion range from $64.8\%$ for the strongest agent to $22.1\%$ for the weakest. Agents pass different subsets of tasks: pairwise Cohen's $κ$ ranges from $-0.07$ to $0.43$, with $κ= 0.05$ for the two strongest agents.

2606.05569 2026-06-05 cs.CL cs.SD eess.AS

Domain-Aware Mispronunciation Detection and Diagnosis Using Language-Specific Statistical Graphs

基于语言特定统计图的领域感知发音错误检测与诊断

Huu Tuong Tu, Hanh Nguyen, Thien Van Luong, Nguyen Tien Cuong, Vu Huan, Nguyen Thi Thu Trang

发表机构 * Hanoi University of Science and Technology(河内理工大学) VNPT AI, VNPT Group(VNPT AI,VNPT集团) National Economics University(国家经济大学)

AI总结 提出一种利用语言特定统计图学习音素混淆模式的方法,在L2-ARCTIC基准上实现59.52%的F1分数,优于多个基线。

Comments Accepted at Interspeech 2026

详情
AI中文摘要

近年来,发音错误检测与诊断(MDD)在计算机辅助语言学习和语音技术中变得越来越重要。本文提出了一种构建统计图的方法,使模型能够学习表示为有向图的音素混淆模式。此外,我们引入了一种语言特定策略,以捕捉不同母语(L1)背景下的系统性发音差异。通过在L2-ARCTIC基准上的大量实验证明了我们方法的有效性,该方法达到了59.52%的F1分数,优于多个竞争基线。

英文摘要

Mispronunciation Detection and Diagnosis (MDD) has gained increasing importance in computer-assisted language learning and speech technology in recent years. In this paper, we propose a method for constructing statistical graphs that enable models to learn phoneme confusion patterns represented as directed graphs. Furthermore, we introduce a language-specific strategy to capture systematic pronunciation differences across various native language (L1) backgrounds. The effectiveness of our approach is demonstrated through extensive experiments on the L2-ARCTIC benchmark, where it achieves an F1-score of 59.52%, outperforming several competitive baselines.

2606.05566 2026-06-05 cs.AI cs.CR

GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection

GuardNet: 用于鲁棒提示注入和越狱检测的浅层神经网络集成策略

Paulo Ricardo Ferreira Neves, Edson Rodrigues da Cruz Filho, Paulo Henrique Eleuterio Falsetti, João Vitor Pavan, Ian Degaspari, Henrique Vieira Laturrague, Patrick Vieira Laturrague, Guilherme Nielsen Dias, Marccello Wilson Perez Berto, Gustavo Voltani Von Atzingen

发表机构 * Quickium Technology Ltd.(Quickium技术有限公司) Federal University of São Carlos (UFSCar)(萨尔瓦多·卡罗斯联邦大学) Federal Institute of Education, Science and Technology of São Paulo (IFSP)(圣保罗教育、科学和技术联邦研究所)

AI总结 提出GuardNet,一种基于浅层神经网络(BiLSTM)集成的护栏系统,通过多样性示例覆盖和阈值校准实现对抗鲁棒性,在低延迟下达到与轻量检测器竞争的性能。

详情
AI中文摘要

大型语言模型(LLMs)已经改变了自然语言处理,但它们仍然容易受到提示注入(PI)和越狱(JB)攻击。此外,基准评估可能受到污染和部分信息泄漏的影响,从而损害性能估计。本文提出了GuardNet,一个基于浅层神经网络(BiLSTM)集成的护栏系统,参数约4700万。我们研究了这样一个假设:对抗场景中的鲁棒性更多地取决于示例覆盖的多样性和阈值校准,而不是模型规模。结果表明,GuardNet与轻量检测器相比达到了竞争性能,并在低延迟下具有高效率,尽管更大的LLMs(如Mistral-7B和Llama-3.1-8B)在盲测JBB-Behaviors基准上仍在F1分数和AUROC方面表现更优。尽管如此,GuardNet在盲测数据集(n=200)上实现了0.747的AUROC,在专有基准(n=50)上实现了0.92的F1分数,这是在阈值校准和声明部分信息泄漏的情况下评估的。该系统在CPU上的平均延迟约为50毫秒,使其适合部署在成本和基础设施受限的生产环境中。

英文摘要

Large Language Models (LLMs) have transformed natural language processing, but they remain vulnerable to Prompt Injection (PI) and Jailbreak (JB) attacks. In addition, benchmark evaluations may be affected by contamination and partial information leakage, compromising performance estimates. This work presents GuardNet, a guardrail system based on an ensemble of shallow neural networks (BiLSTMs) with approximately 47 million parameters. We investigate the hypothesis that robustness in adversarial scenarios depends more on the diversity of example coverage and threshold calibration than on model scale. The results indicate that GuardNet achieves competitive performance compared with lightweight detectors and high efficiency at low latency, although larger LLMs such as Mistral-7B and Llama-3.1-8B still achieve superior performance in terms of F1 score and AUROC on the blind JBB-Behaviors benchmark. Nevertheless, GuardNet achieves an AUROC of 0.747 on the blind dataset (n = 200) and an F1 score of 0.92 on a proprietary benchmark (n = 50), under threshold calibration and evaluation with declared partial information leakage. The system operates with an average latency of approximately 50 ms on CPU, making it suitable for deployment in production environments with cost and infrastructure constraints.

2606.05564 2026-06-05 cs.CL

Using Large Language Models to Support High Volume Application Review for an Undergraduate Research Program

使用大型语言模型支持本科研究项目的高容量申请评审

Varun Aggarwal, Kay Kobak, John Howarter

发表机构 * Engineering Undergraduate Research Office, Purdue University(普渡大学本科生研究办公室) Elmore School of Electrical and Computer Engineering, Purdue University(普渡大学电子与计算机工程学院) School of Materials Engineering, Purdue University(材料工程学院)

AI总结 本研究开发并部署基于GPT模型(GPT-4o、GPT-5-mini、GPT-5.2)的工具,对普渡大学SURF项目约1200份目的陈述进行自动化评分与理由注释,将评审时间从数周缩短至约4小时。

详情
AI中文摘要

本科研究项目(如普渡大学的暑期本科生研究奖学金SURF)每年收到数千份申请,需要项目工作人员花费大量时间和精力在紧迫的时间线内一致地评估每份提交。这篇进行中的论文描述了一个基于大型语言模型(LLM)的工具的开发和初步部署,用于协助评估普渡大学SURF 2026周期的约1200份学生目的陈述(SoP)。该工作流程使用OpenAI GPT模型(GPT-4o、GPT-5-mini和GPT-5.2),并采用一个包含六个子类别的结构化评分标准,每个子类别按0-3分评分。少数由项目工作人员评分的SoP用于调整模型响应。模型提示设计为生成数值分数、理由(包括正面和负面方面)以及每份提交的简短摘录。使用GPT-5.2,全部1200份SoP在约4.6小时的计算时间内处理完毕,平均每份SoP约14秒(每份SoP的处理时间随其长度变化,范围从500到2000词)。不同模型版本在评分标准遵循度上存在显著差异,其中GPT-5.2遵循最严格。模型分数的不一致在低分提交中更为明显。LLM输出复制了之前由分布式人工评分员扮演的角色,为项目协调员提供了整个申请人群体的评分和理由注释输出。然后,项目协调员将这些输出与每位申请人的SoP一起审查,应用与之前SURF周期相同的下游办公室标准,以产生强候选人的短名单。这次协调员审查在大约4小时内完成,而之前项目周期需要数周的协调工作。

英文摘要

Undergraduate research programs such as the Summer Undergraduate Research Fellowship (SURF) at Purdue University receive thousands of applications every year, requiring significant time and effort for program staff to evaluate each submission consistently and within tight timelines. This work-in-progress paper describes the development and initial deployment of a large language model (LLM)-based tool to assist in the evaluation of approximately 1,200 student Statements of Purpose (SoPs) for the SURF 2026 cycle at Purdue University. The workflow utilizes OpenAI GPT models (GPT-4o, GPT-5-mini, and GPT-5.2) and uses a structured rubric across six subcategories, each scored on a 0-3 scale. A few SoPs, graded by program staff, were used to tune the model responses. The model prompt was designed to generate both numerical scores, rationales (including positive and negative aspects) and short excerpts from each submission. Using GPT-5.2, the full batch of 1,200 SoPs was processed in approximately 4.6 hours of compute time, averaging roughly 14 seconds per SoP (with per-SoP timing varying with SoP length, which ranged from 500 to 2,000 words). Notable differences in rubric adherence were observed across model versions, with GPT-5.2 adhering most closely. Disagreement in model scores was more pronounced for lower-scoring submissions. The LLM outputs replicated the role previously played by distributed human graders, providing the program coordinator with scored and rationale-annotated outputs for the entire applicant pool. The program coordinator then reviewed these outputs alongside each applicant's SoP, applying the same downstream office criteria used in prior SURF cycles, to produce a shortlist of strong candidates. This coordinator review was completed in approximately 4 hours, compared to the multi-week coordination effort required in prior program cycles.

2606.05563 2026-06-05 cs.AI cs.CL

SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations

SoCRATES:跨领域和社会认知变异的前瞻性LLM调解的可靠自动化评估

Taewon Yun, Hyeonseong Park, Jeonghwan Choi, Hayoon Park, Yeeun Choi, Hwanjun Song

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出SoCRATES基准,通过多领域真实冲突场景和五维社会认知适应轴评估LLM调解员,使用主题定位评估器实现0.82的人类专家一致性,发现最强模型仅缩小约三分之一的未调解共识差距。

详情
AI中文摘要

评估LLM调解员仍然具有挑战性,因为调解是一个实时轨迹,由争议者不断变化的情感、意图和背景塑造。现有的测试平台依赖于少数专家撰写的领域,主要变化战略姿态,并对每个话题的每一轮进行评分,引入了离题噪声。我们引入了SoCRATES,一个用于在现实的多领域测试平台中评估前瞻性LLM调解员的基准。它通过一个跨八个领域的代理管道从真实冲突中构建场景,探测五个社会认知适应轴(战略姿态、参与者组成、历史长度、情感反应和文化身份),并通过主题定位评估器仅对推进每个话题的轮次进行评分。该评估器与人类专家的一致性达到0.82,是每轮基线的两倍以上。对八个前沿LLM的基准测试发现,即使是最强的调解员,在多样化和现实的测试平台下,也仅能缩小约三分之一的未调解共识差距,且性能因社会认知轴而异,突显出进步在于对不同条件的社会适应。

英文摘要

Evaluating LLM mediators remains challenging, as mediation unfolds as a real-time trajectory shaped by disputants' shifting emotions, intentions, and context. Existing testbeds rely on a few expert-authored domains, vary mainly strategic posture, and score every turn against every topic, introducing off-topic noise. We introduce SoCRATES, a benchmark for evaluating proactive LLM mediators in realistic, multi-domain testbeds. It constructs scenarios from real conflicts through an agentic pipeline across eight domains, probes five socio-cognitive adaptation axes (strategic posture, party composition, history length, emotional reactivity, and cultural identity), and scores each topic only on the turns that advance it via a topic-localized evaluator. The evaluator reaches 0.82 alignment with human experts, more than doubling a per-turn baseline. Benchmarking eight frontier LLMs, we find that even the strongest mediator closes only about a third of the unmediated consensus gap under diverse and realistic testbeds, with performance varying sharply by socio-cognitive axis, highlighting that progress lies in social adaptation to diverse conditions.

2606.05561 2026-06-05 cs.CL cs.AI

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

InfoShield:通过信息论优化实现心理健康筛查的隐私保护语音表示

Xueyang Wu, Siyuan Liu, Kezhuo Yang, Guang Ling

发表机构 * Shenzhen NeurStar Inc., China(深圳NeurStar公司,中国) University of York, United Kingdom(约克大学,英国) Shanghai Jiao Tong University, China(上海交通大学,中国)

AI总结 提出InfoShield框架,通过最小化语音表示与敏感属性间的互信息,在保持抑郁分类性能的同时有效降低人口统计信息泄露风险。

详情
AI中文摘要

基于语音的心理健康筛查提供了可扩展的抑郁症检测方法,但临床部署面临一个重大障碍:用户对人口统计信息暴露的隐私担忧。当前技术难以解决这一冲突。对抗训练通常无法应对未知威胁,而差分隐私则倾向于通过向所有特征注入噪声来损害诊断性能。本文提出InfoShield,它在保持抑郁分类准确性的同时最小化语音表示与敏感属性之间的互信息。我们发现标准MINE估计器因时间-静态错位而难以处理序列语音,并引入带有跨模态注意力的TimeAwareMINE来对齐声学帧与属性嵌入。在Androids语料库上的实验表明,InfoShield将性别推断从92.6%降至55.5%,年龄推断从55.7%降至30.3%,且效用损失有限(F1降低6%),达到F1=0.784,而先前SOTA为0.723。

英文摘要

Speech-based mental health screening offers scalable depression detection, yet clinical deployment faces a significant barrier: users' privacy concerns about demographic information exposure. Current techniques struggle to resolve this conflict. Adversarial training often fails against unseen threats, whereas Differential Privacy tends to compromise diagnostic performance by injecting noise across all features. This paper presents InfoShield, which minimizes mutual information between speech representations and sensitive attributes while preserving depression classification accuracy. We identify that standard MINE estimators struggle with sequential speech due to temporal-static misalignment, and introduce TimeAwareMINE with cross-modal attention to align acoustic frames with attribute embeddings. Experiments on the Androids Corpus show InfoShield reduces gender inference from 92.6\% to 55.5\% and age inference from 55.7\% to 30.3\% with limited utility loss (6\% F1 reduction), achieving F1=0.784 compared to prior SOTA's 0.723.

2606.05559 2026-06-05 cs.LG

CLaaS: Continual learning as a service for sample efficient online learning

CLaaS: 作为服务的持续学习,用于样本高效的在线学习

Kion Fallah, Silen Naihin, Barak Widawsky, Qingqing Mao

发表机构 * Resolute Labs(Resolute实验室) Incept Labs(Incept实验室)

AI总结 提出CLaaS系统,通过经验回放缓冲区实现异步训练中的梯度复用,在对抗性任务中展示参数更新优于上下文学习的前向迁移和遗忘减少。

Comments 4 pages main content, 7 figures

详情
AI中文摘要

部署的大型语言模型代理必须适应动态环境中的分布偏移。理想情况下,可以从累积的代理经验中进行适应,并在转移到未来任务时保留先前的能力。然而,由于真实环境无法轻易重置,每个场景中代理的动作和环境转换只能采样一次。为此,我们研究了一种体验式和在线持续学习设置,其中代理从一系列场景中学习。我们提出了持续学习即服务(CLaaS),这是一个系统,使代理能够在部署期间改进,并通过聊天API抽象化。为了提高样本效率,CLaaS将轨迹存储在经验回放缓冲区中,以便在异步训练期间重用梯度。我们在对抗性任务上评估了CLaaS,证明参数更新比上下文学习具有更好的前向迁移和更少的遗忘,其中回放是样本效率的关键选择。

英文摘要

Deployed large language model agents must adapt to distribution shift in dynamic environments. Ideally, adaptation can be performed from accumulated agent experiences and retain prior capabilities while transferring to future tasks. However, agent actions and environmental transitions can only be sampled once per scenario, as real-world environments cannot be trivially reset. To this end, we investigate an experiential and online continual learning setting in which agents learn from a stream of scenarios. We propose continual learning as-a-service (CLaaS), a system which enables agents to improve during deployment, abstracted behind a chat API. To increase sample efficiency, CLaaS stores rollouts in an experience replay buffer for gradient reuse during asynchronous training. We evaluate CLaaS on an adversarial task, demonstrating that parametric updates lead to superior forward transfer and less forgetting than in-context learning, with replay being a critical choice for sample efficiency.

2606.05558 2026-06-05 cs.LG

Autoregressive Diffusion World Models for Off-Policy Evaluation of LLM Agents

自回归扩散世界模型用于LLM智能体的离线评估

Kaixuan Liu, Guojun Xiong, Weinan Zhang, Shengpu Tang

发表机构 * Department of Computer Science, Emory University(埃默里大学计算机科学系) School of Computer Science, Shanghai Jiao Tong University(上海交通大学计算机科学学院)

AI总结 提出ADWM框架,通过自回归扩散世界模型从预收集轨迹中模拟环境响应,实现无需在线交互的LLM智能体策略离线评估。

详情
AI中文摘要

在多轮交互环境中评估大语言模型(LLM)智能体成本高且风险大,因为它需要在线环境交互。我们提出ADWM(自回归扩散世界模型),一个仅从预收集轨迹中估计新LLM智能体策略性能的评估框架。核心思想是学习一个潜在扩散世界模型,模拟环境如何响应评估策略,而无需在真实环境中执行。现有的基于扩散的OPE方法通过联合扩散状态和动作,在单次传递中引导完整轨迹,这一假设对于动作是离散文本且必须在观察环境后从策略中采样的LLM智能体不成立。与遭受复合误差的自回归世界模型不同,ADWM将每个转移建模为独立的去噪过程,实现可靠的逐步展开,其中世界模型和智能体按因果顺序交替。关键的是,被评估的LLM智能体通过策略条件得分函数直接引导每一步的扩散生成,确保模拟轨迹准确反映其决策模式。实验上,ADWM在多种多轮智能体任务中实现了准确的价值估计和评估可靠性,展示了其作为离线LLM智能体评估实用框架的前景。

英文摘要

Evaluating large language model (LLM) agents in multi-turn interactive environments is expensive and risky, as it requires online environment interaction. We propose ADWM (Autoregressive Diffusion World Model), an evaluation framework that estimates the performance of a new LLM agent policy purely from pre-collected trajectories. The core idea is to learn a latent diffusion world model that simulates how the environment responds to the evaluation policy, without ever executing it in the real environment. Existing diffusion-based OPE methods guide full trajectories in a single pass by jointly diffusing states and actions, an assumption that breaks down for LLM agents whose actions are discrete text that must be sampled from the policy after observing the environment. Unlike autoregressive world models that suffer from compounding errors, ADWM models each transition as an independent denoising process, enabling reliable step-by-step rollouts where the world model and agent alternate in causal order. Crucially, the LLM agent under evaluation directly guides the diffusion generation at each step via a policy-conditioned score function, ensuring that simulated trajectories accurately reflect its decision-making patterns. Empirically, ADWM achieves accurate value estimates and evaluation reliability across diverse multi-turn agent tasks, demonstrating its promise as a practical framework for offline LLM agent evaluation.

2606.05557 2026-06-05 cs.CL

AURA: Intent-Directed Probing for Implicit-Need Surfacing in Situated LLM Agents

AURA: 面向情境化LLM代理中隐式需求挖掘的意图导向探测

Yang Li, Jiaxiang Liu, Jiang Cai, Mingkun Xu

发表机构 * Guangdong Institute of Intelligence Science and Technology(广东省智能科学与技术研究院)

AI总结 提出AURA方法,通过在场景感知和工具使用之间插入意图推理步骤生成IntentFrame,以结构化估计隐式需求并控制探测预算,在隐式意图基准上提升覆盖率达+0.07,同时减少82%的探测次数并避免隐私违规。

Comments Submitted to EMNLP 2026. Code, simulator, and benchmark: https://github.com/innovation64/AURA

详情
AI中文摘要

像“Lin Wei在哪里?”这样的情境化查询通常编码了比字面内容更多的信息:用户可能还想知道Lin Wei是否有空、心情好或是否值得现在打扰。标准的工具使用代理回答字面问题后就停止了。AURA在场景感知和工具使用之间插入一个推理步骤,生成IntentFrame:一个对隐式需求的结构化估计,带有一个标量差距分数,用于控制每次查询的探测预算和工具选择。在一个包含100个查询、四个场景的隐式意图基准上,AURA相比ReAct风格的探测将隐式需求覆盖率提高了(Delta = +0.07,p < 10^-6);四个场景中有三个单独显著,该增益在第二个骨干网络上重现,并且提示消融将提升归因于差距校准而非答案记忆。在事实查找上,控制器以原始准确度为代价,减少了82%的探测次数,并在一个隐私敏感切片上实现了零违禁工具违规;范围条件在局限性中详述。代码、模拟器和基准测试已在https://github.com/innovation64/AURA发布。

英文摘要

A situated query like "where is Lin Wei?" often encodes more than its literal content: the user may also want to know whether Lin Wei is free, in a good mood, or worth interrupting now. Standard tool-use agents answer the literal question and stop. AURA inserts an inference step between scene perception and tool use that produces an IntentFrame: a structured estimate of the implicit need with a scalar gap score that controls per-query probe budget and tool selection. On a 100-query four-scene implicit-intent benchmark, AURA improves implicit-need coverage over ReAct-style probing (Delta = +0.07, p < 10^-6); three of four scenes are individually significant, the gain reproduces on a second backbone, and a prompt ablation attributes the lift to gap calibration rather than answer memorisation. On factual lookup the controller trades raw accuracy for 82% fewer probes and zero forbidden-tool violations on a privacy-sensitive slice; scope conditions are detailed in Limitations. Code, simulator, and benchmark are released at https://github.com/innovation64/AURA.

2606.05555 2026-06-05 cs.LG cs.AI

Representation Learning Enables Scalable Multitask Deep Reinforcement Learning

表示学习实现可扩展的多任务深度强化学习

Johan Obando-Ceron, Lu Li, Scott Fujimoto, Pierre-Luc Bacon, Aaron Courville, Pablo Samuel Castro

发表机构 * Mila – Québec AI Institute(魁北克AI研究所) Université de Montréal(蒙特利尔大学) McGill University(麦吉尔大学) CIFAR AI Chair(CIFAR人工智能 chair) Google DeepMind(谷歌DeepMind)

AI总结 本文提出一种结合预测性表示学习与高容量值函数近似的无模型算法MR.Q,在无需规划的情况下,在多任务连续控制任务中超越基于世界模型的方法和多种深度强化学习基线,并显著降低计算开销。

详情
AI中文摘要

将强化学习扩展到多样化的多任务设置仍然是一个核心挑战。虽然基于模型的强化学习的最新进展取得了强劲的性能,但它们依赖于规划和复杂的训练流程,使得不清楚哪些组件对可扩展性至关重要。我们重新审视这个问题,并认为可扩展多任务强化学习的主要驱动力不是基于模型的控制,而是\emph{表示学习}。特别地,我们表明,将预测性的、基于模型的表示与高容量值函数逼近相结合,即使没有规划,也足以实现强劲的性能。我们评估了一种简单的无模型算法MR.Q,将辅助预测目标与可扩展的actor-critic架构相结合。这种方法在多样化的多任务连续控制任务套件中优于最近基于世界模型的方法和一系列深度强化学习基线,同时显著降低了计算开销并提高了实际时间效率。我们观察到随着模型容量的增加而持续改进,并通过消融实验表明预测性表示学习对性能至关重要。

英文摘要

Scaling reinforcement learning (RL) to diverse multitask settings remains a central challenge. While recent advances in model-based RL achieve strong performance, they rely on planning and complex training pipelines, making it unclear which components are essential for scalability. We revisit this question and argue that the primary driver of scalable multitask RL is not model-based control, but \emph{representation learning}. In particular, we show that combining predictive, model-based representations with high-capacity value function approximation is sufficient to achieve strong performance, even without planning. We evaluate a simple model-free algorithm, MR.Q, coupled with auxiliary predictive objectives into a scalable actor-critic architecture. This approach outperforms a recent world-model-based method and a range of deep RL baselines across a diverse suite of multitask continuous control tasks, while significantly reducing computational overhead and improving wall-clock efficiency. We observe consistent improvements with increased model capacity and show through ablations that predictive representation learning is critical for performance.

2606.05553 2026-06-05 cs.CL cs.AI

ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?

ArcANE:角色扮演语言代理是否在正确的时间保持角色?

Woojung Song, Nalim Kim, Sangjun Song, Chaewon Heo, Jongwon Lim, Yohan Jo

发表机构 * Graduate School of Data Science, Seoul National University(首尔国立大学数据科学研究生院)

AI总结 提出ArcANE基准,通过角色弧将叙事分段,评估角色扮演语言代理在不同阶段是否与角色心理轨迹一致,实验表明基于角色弧的上下文策略最优,尤其在源文本外场景。

详情
AI中文摘要

角色扮演语言代理(RPLAs)应扮演其价值观和行为随故事发展而演变的角色,而非保持固定人格。现有基准衡量给定章节的事实回忆,而非回应是否与角色的心理轨迹一致,尤其是在源文本从未探索的场景中。我们引入ArcANE(弧感知叙事评估),一个自动构建的基准,涵盖17部小说和80个主要角色。角色弧将叙事沿心理轴分段,每个探针在多个阶段提出相同场景,涵盖源文本内和源文本外情境。在六个模型和六种上下文模式下,基于角色弧的条件在每项模型上均优于所有其他上下文策略,且在源文本外场景(检索无法找到信息)中差距最大。我们进一步在同一数据上微调开放权重模型,得到ArcANE-8B/32B,在源文本外场景中进一步扩大了弧优势。

英文摘要

Role-playing language agents (RPLAs) should play characters whose values and behavior evolve as the story progresses, not maintain a fixed persona. Existing benchmarks measure factual recall at a given chapter, not whether responses align with the character's psychological trajectory, especially in scenarios the source text never explores. We introduce ArcANE (Arc-Aware Narrative Evaluation), an automatically constructed benchmark spanning 17 novels and 80 principal characters. A Character Arc segments the narrative into phases along a psychological axis, and each probe poses the same scenario across phases, spanning both situations within the source text and situations beyond it. Across six models and six context modes, conditioning on the Character Arc tops every other context strategy on every model, and the gap is largest on scenarios outside the source text where retrieval has nothing to find. We further fine-tune open-weight models on the same data to obtain ArcANE-8B/32B, which widen the Arc advantage even more on scenarios outside the source text.

2606.05552 2026-06-05 cs.LG cs.AI cs.GR

Balancing Image Compression and Generation with Bootstrapped Tokenization

平衡图像压缩与生成:自引导分词

Haozhe Chi, Jinghan Li, Hao Jiang, Wu Sheng, Yi Ma, Jing Wang, Yadong Mu

发表机构 * Peking University(北京大学) Central Media Technology Institute, Huawei(华为中央媒体技术研究所)

AI总结 提出SelfBootTok方法,通过自引导学习将图像信息分解为全局和局部标记组,使生成器仅依赖全局标记,减少40%计算量并提升重建与生成质量,以64个标记实现1.56的gFID新纪录。

详情
AI中文摘要

尽管图像分词取得了进展,但标准方法通过在每个标记中混合所有粒度来编码冗余信息,因此标记之间仍存在冗余。不同粒度信息的混合也增加了生成器训练的复杂性。本文介绍了SelfBootTok,一种通过将信息干净地分解为全局和局部标记组来解决此问题的方法。通过自引导学习,模型仅从全局标记预测局部细节,将视觉细节的负担从生成器转移到分词器。因此,我们的生成器效率更高,仅需全局标记,计算量减少约40%,同时提供更优的重建和生成。此外,该范式优雅地扩展:通过利用更多数据或参数来自监督局部表示学习,SelfBootTok仅使用64个标记就实现了1.56的最优gFID分数。

英文摘要

Despite progress in image tokenization, standard methods encode redundant information by mixing all granularities within each token, thus redundancy persists between tokens. The mix of information of different granularity also complicates the training of generators. This paper introduces SelfBootTok, a method that resolves this by cleanly decomposing information into global and local token groups. Through self-bootstrapped learning, the model predicts local details exclusively from global tokens, shifting the burden of visual details from the generator to the tokenizer. Consequently, our generator is far more efficient, requiring only global tokens and reducing computation by approximately 40%, while delivering superior reconstruction and generation. Moreover, this paradigm scales elegantly: by leveraging more data or parameters to self-supervise local representation learning, SelfBootTok achieves a new state-of-the-art gFID score of 1.56 using only 64 tokens.

2606.05545 2026-06-05 cs.CL

Multilingual Detection of Alzheimer's Disease from Speech: A Cross-Linguistic Transfer Learning Approach

基于语音的多语言阿尔茨海默病检测:跨语言迁移学习方法

Nadine Yasser Abdelhalim, Emmanuel Akinrintoyo, Nicole Salomons

发表机构 * Imperial College London(帝国理工学院伦敦分校)

AI总结 提出跨语言训练方法,利用英语、中文、阿拉伯语和印地语数据集开发基于Transformer的模型,实现多语言阿尔茨海默病检测,F1分数达82%,推理时间0.5秒,支持实时筛查。

Comments 5 pages

详情
AI中文摘要

由于特定语言模型训练的资源密集性和耗时性,多语言阿尔茨海默病痴呆(AD)检测模型的开发面临重大挑战。我们提出了一种新颖的解决方案,使用跨语言训练来检测训练模型所用语言之外的语言中的AD。本研究调查了用于跨不同语言和认知障碍水平检测AD的多语言深度学习模型。使用英语、中文、阿拉伯语和印地语的数据集,我们开发了基于Transformer的模型用于二元AD分类。我们的方法在所有语言中实现了82%的F1分数,展示了强大的跨语言泛化能力。快速推理时间(0.5秒)支持潜在的实时筛查应用,而跨语言的一致性能表明全球部署的可行性。

英文摘要

The development of multilingual Alzheimer's Disease Dementia (AD) detection models presents significant challenges due to the resource-intensive and time-consuming nature of language-specific model training. We propose a novel solution using cross-language training to detect AD in languages beyond those used for model training. This study investigates multilingual deep learning models for detecting AD across different languages and cognitive impairment levels. Using datasets in English, Chinese, Arabic, and Hindi, we developed transformer-based models for binary AD classification. Our approach achieved F1 scores of 82\% across all languages, demonstrating strong cross-linguistic generalization. The rapid inference time (0.5 seconds) supports potential real-time screening applications, while consistent performance across languages indicates feasibility for global deployment.

2606.05544 2026-06-05 cs.SD eess.AS

Probing Spatial Structure in Pretrained Audio Representations

探究预训练音频表示中的空间结构

Chuyang Chen, Sivan Ding, Adrian S. Roman, Juan Pablo Bello

发表机构 * Music and Audio Research Laboratory, New York University, USA(音乐与音频研究实验室,纽约大学,美国)

AI总结 通过提出SARL基准,系统评估预训练音频模型对空间信息的编码能力,发现源因素比房间因素更易解码,且不同编码器对空间变化响应存在异质性。

Comments Accepted to Interspeech 2026

详情
AI中文摘要

预训练空间音频编码器越来越多地被用作感知任务的通用表示,但其空间编码能力仍知之甚少。我们引入了空间音频表示学习(SARL)基准,这是一个用于评估预训练音频模型中空间信息的受控框架。SARL探测源级因素(方位角、仰角、距离、类别)和房间级因素(RT60、体积、形状)。跨多种编码器的实验揭示了三种模式:输入配置和训练范式塑造空间编码;源因素始终比房间因素更容易解码;在受控扰动下的敏感性分析显示了对源和房间变化的异质性响应。这些结果揭示了当前预训练音频表示中的系统性偏差。SARL作为开源基准发布,用于可重复评估空间音频表示。

英文摘要

Pretrained spatial audio encoders are increasingly used as general-purpose representations for perceptual tasks, yet their spatial encoding capabilities remain poorly understood. We introduce the Spatial Audio Representation Learning (SARL) benchmark, a controlled framework for evaluating spatial information in pretrained audio models. SARL probes source-level factors (azimuth, elevation, distance, class) and room-level factors (RT60, volume, shape). Experiments across diverse encoders reveal three patterns: input configuration and training paradigm shape spatial encoding; source factors are consistently easier to decode than room factors; and sensitivity analysis under controlled perturbations shows heterogeneous responses to source and room variation. These results reveal systematic biases in current pretrained audio representations. SARL is released as an open-source benchmark for reproducible evaluation of spatial audio representations.

2606.05538 2026-06-05 cs.LG cs.CL

Less is MoE: Trimming Experts in Domain-Specialist Language Models

少即是MoE:修剪领域专家语言模型中的专家

Haoze He, Xinkai Zou, Xuan Jiang, Xingyuan Ding, Ao Qu, Juncheng Billy Li, Heather Miller

发表机构 * Carnegie Mellon University(卡内基梅隆大学) UCSD(加州大学圣地亚哥分校) MIT(麻省理工学院)

AI总结 针对MoE模型部署时参数过多的问题,提出基于Fisher重要性的中间维度修剪方法Fisher-MoE,在50%压缩比下保持模型能力,减少约45%权重内存并提升21%推理吞吐量。

详情
AI中文摘要

混合专家(MoE)模型通过条件计算实现了强大的性能,但其庞大的参数规模带来了部署挑战。先前的MoE压缩方法在常识推理之外的通用基准测试中评估时灾难性地失败。我们将这一失败归因于压缩的粒度:重要能力分布在各个专家中,但集中在FFN稀疏中间维度。为了识别这些维度,我们使用Fisher重要性,它优于基于激活、路由器得分和幅度的方法,并识别出极小的任务关键维度集:在Qwen1.5-MoE中,仅移除1.35M路由FFN中间维度中的12个就导致GSM8K准确率崩溃,同时基本保持事实知识性能。基于此,我们提出Fisher-MoE,它在FFN内部操作,移除按Fisher重要性排序的中间维度。在相同的50% MoE压缩比下,Fisher-MoE保持了模型能力,同时减少了约45%的权重内存并提高了21%的推理吞吐量。这些发现表明,中间维度粒度是MoE模型中能力集中的有效压缩和排序单元。

英文摘要

Mixture-of-Experts (MoE) models achieve strong performance through conditional computation, but their large parameter footprint poses deployment challenges. Prior MoE compression approaches catastrophically fail when evaluated on general-purpose benchmarks beyond commonsense reasoning. We trace this failure to the granularity of compression: important capabilities are distributed across experts but concentrated in FFN sparse intermediate dimensions. To identify these dimensions, we use Fisher importance which outperforms activation-, router-score-, and magnitude-based alternatives, and identifies tiny sets of task-critical dimensions: in Qwen1.5-MoE, removing as few as 12 of 1.35M routed-FFN intermediate dimensions collapses GSM8K accuracy while largely preserving factual-knowledge performance. Building on this, we propose Fisher-MoE, which operates within FFN to remove intermediate dimensions ranked by Fisher importance. At the same 50% MoE compression ratio, Fisher-MoE preserves model capability, while reducing weight memory by ~45% and improving inference throughput by 21%. These findings suggest intermediate dimension granularity is an effective unit for both compression and ranking where capability concentrates in MoE models.

2606.05536 2026-06-05 cs.CV

Dual Feature Decoupling for Fine-Grained OOD Detection

面向细粒度OOD检测的双重特征解耦

Xiaokun Li, Yaping Huang, Qingji Guan

发表机构 * School of Computer Science and Technology, Beijing Jiaotong University(计算机科学与技术学院,北京交通大学)

AI总结 提出双重特征解耦网络(DFDNet),通过空间-频率解耦和重建引导解耦模块,解决细粒度分类中因类间差异小和背景干扰导致的OOD检测难题。

详情
AI中文摘要

离群检测(OOD)是将机器学习模型应用于现实场景时不可或缺的技术。现有大多数OOD检测方法都是在类间分布差异较大的理想化假设下开发的,而很大程度上忽略了以细微变化为特征的细粒度任务,如医学图像分类和车辆识别。细粒度子类别之间的高视觉相似性,加上背景因素的干扰,使得OOD检测极具挑战性。为了解决这个问题,我们提出了一种新颖的双重特征解耦网络(DFDNet),从特征解缠的角度解决细粒度OOD检测。所提出的DFDNet包含两个关键组件:空间-频率解耦模块和重建引导解耦模块。空间-频率解耦模块旨在保留对分类有判别性的内容特征,同时抑制与任务无关的风格信息。另一方面,重建引导解耦模块引入了一种新颖的像素级对抗重建任务,以进一步去除低层、非判别性信息,并增强类别特定的高层语义表示。大量实验表明,我们的方法在多个数据集上取得了有竞争力的性能提升。

英文摘要

Out-of-distribution detection (OOD) is an indispensable technique when applying machine learning models to real-world scenarios. Most existing OOD detection methods have been developed under the idealized assumption of large inter-class distributional differences, while largely overlooking fine-grained tasks characterized by subtle variations, such as medical image classification and vehicle recognition. The high visual similarity among fine-grained subcategories, together with the interference of background factors, makes OOD detection extremely challenging. To tackle this problem, we propose a novel Dual Feature Decoupling Network (DFDNet), which addresses fine-grained OOD detection from the perspective of feature disentanglement. The proposed DFDNet comprises two key components: a spatial-frequency decoupling module and a reconstruction-guided decoupling module. The spatial-frequency decoupling module is designed to preserve content features that are discriminative for classification while suppressing task-irrelevant style information. On the other hand, the reconstruction-guided decoupling module introduces a novel pixel-level adversarial reconstruction task to further remove low-level, non-discriminative information and enhance category-specific high-level semantic representations. Extensive experiments demonstrate that our method achieves competitive performance improvements on multiple datasets.

2606.05535 2026-06-05 cs.CV cs.AI

Noise-Aware Visual Representation Learning for Medical Visual Question Answering

面向医学视觉问答的噪声感知视觉表示学习

I Putu Adi Pratama, Bahadorreza Ofoghi, Atul Sajjanhar, Shang Gao

发表机构 * Deakin University(德克萨斯大学)

AI总结 提出一种噪声感知的医学视觉问答框架,通过去噪自编码器学习鲁棒的视觉表示,并利用低秩适配高效微调,在SLAKE和PathVQA基准上提升了抗噪性和性能。

Comments 15 pages, 2 figures. Conference submission

详情
AI中文摘要

医学视觉问答(Med-VQA)通过使AI模型能够解释医学图像并回答临床相关问题,在临床决策支持方面具有巨大潜力。近期方法通常通过轻量级映射网络将现成的视觉编码器与大语言模型(LLM)连接起来,以降低计算成本。然而,这些方法往往忽视了处理视觉表示中噪声和小无关变化的重要性。为应对这些挑战,我们提出了一种噪声感知的Med-VQA框架,该框架在视觉嵌入映射到LLM输入空间之前,引入了一个去噪自编码器。去噪自编码器经过预训练,能够从被破坏的输入中重建干净的视觉嵌入,从而鼓励模型学习对噪声不敏感的鲁棒视觉表示。然后,使用多层感知器(MLP)将得到的嵌入投影到语言模型嵌入空间中,形成为LLM提供图像信息的视觉前缀令牌。为了实现无需完全重新训练的高效适配,我们采用低秩适配(LoRA)进行参数高效微调。所提出的方法在SLAKE和PathVQA基准上进行了评估。实验结果表明,该方法在多个评估标准下对噪声输入嵌入具有更强的鲁棒性,同时保持了有竞争力的干净性能。这些发现表明,学习更鲁棒的视觉表示可以提升Med-VQA的性能和鲁棒性。

英文摘要

Medical visual question answering (Med-VQA) has strong potential for clinical decision support by enabling AI models to interpret medical images and answer clinically relevant queries. Recent approaches typically connect off-the-shelf vision encoders with large language models (LLMs) through lightweight mapping networks to reduce computational cost. However, these methods often overlook the importance of handling noise and small irrelevant changes in visual representations. To address these challenges, we propose a noise-aware Med-VQA framework that incorporates a denoising autoencoder before visual embeddings are mapped into the input space of an LLM. The denoising autoencoder is pretrained to reconstruct clean visual embeddings from corrupted inputs, encouraging the model to learn robust visual representations that are less sensitive to noise. The resulting embeddings are then projected into the language model embedding space using a multi-layer perceptron (MLP), forming visual prefix tokens that provide image information to the LLM. To enable efficient adaptation without full retraining, we employ parameter-efficient fine-tuning using low-rank adaptation (LoRA). The proposed method is evaluated on the SLAKE and PathVQA benchmarks. Experimental results show improved robustness to noisy input embeddings while maintaining competitive clean performance across multiple evaluation criteria. These findings suggest that learning more robust visual representations can enhance Med-VQA performance and robustness.