arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 3841
热门方向导航
2606.08037 2026-06-09 cs.LG cs.AI 新提交

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

SafeECGMatch:面向开放集心电图分类的校准感知联合频率与时间空间半监督学习

Hongkyu Koh, Ikbeom Jang

发表机构 * Hankuk University of Foreign Studies(韩国外国语大学)

AI总结 提出SafeECGMatch框架,通过双分支架构提取时频特征,结合自适应标签平滑和温度缩放校准模型,在标签分布不匹配下实现可靠的开集分类和OOD检测。

Comments 8 pages. Accepted to the KDD-UC 2026 (ACM International Conference on Data Mining and Knowledge Discovery - Undergraduate Consortium 2026)

详情
AI中文摘要

心电图(ECG)分类模型常面临严重的标签稀缺问题,使得半监督学习(SSL)成为降低标注成本的有效策略。然而,在临床环境中,未标注数据池通常包含分布外(OOD)异常或标注集中不存在的诊断类别。标准SSL会强制对这些未见类别分配错误的伪标签,产生过度自信的预测。为解决此问题,我们提出SafeECGMatch,一个校准感知的安全SSL框架,用于标签分布不匹配下的单标签ECG分类。方法上,SafeECGMatch采用双分支架构,通过ECG特定的数据增强提取时频潜在表示。关键地,它通过自适应标签平滑和温度缩放动态对齐置信度与经验准确性,在时间和频谱域上校准多类分类器和OOD检测器。这种联合优化实现了可信的OOD拒绝和可靠的伪标签分配。在PTB-XL和PhysioNet/CinC Challenge基准上评估,SafeECGMatch达到了最先进的准确性和校准性能,推动了生理时间序列中可靠知识发现。代码可在https://github.com/labhai/SafeECGMatch获取。

英文摘要

Electrocardiogram (ECG) classification models often suffer from severe label scarcity, making semi-supervised learning (SSL) an attractive strategy for reducing annotation costs. In clinical settings, however, unlabeled pools frequently contain out-of-distribution (OOD) anomalies or diagnostic groups absent from the labeled set. Standard SSL forces incorrect pseudo-labels onto these unseen classes, producing overconfident predictions. To address this, we propose SafeECGMatch, a calibration-aware safe SSL framework for single-label ECG classification under label distribution mismatch. Methodologically, SafeECGMatch employs a dual-branch architecture extracting time-frequency latent representations via ECG-specific augmentations. Crucially, it dynamically aligns confidence with empirical accuracy through adaptive label smoothing and temperature scaling, calibrating both the multiclass classifier and the OOD detector across temporal and spectral domains. This joint optimization allows trustworthy OOD rejection and reliable pseudo-labeling. Evaluated on the PTB-XL and PhysioNet/CinC Challenge benchmarks, SafeECGMatch achieves state-of-the-art accuracy and calibration, advancing reliable knowledge discovery in physiological time-series. Code is available at https://github.com/labhai/SafeECGMatch.

2606.08035 2026-06-09 cs.CV 新提交

DyCo-RL: Dynamic Cross-Modal Coordination for Visual Reasoning

DyCo-RL: 用于视觉推理的动态跨模态协调

Hangui Lin, Yan Shu, Zhengyang Liang, Chi Liu, Xiangrui Liu, Minghao Qin, Teng Long, Zheng Liu, Nicu Sebe

发表机构 * University of Trento(特伦托大学) BAAI(北京智源人工智能研究院) Singapore Management University(新加坡管理大学) IQuest Research

AI总结 提出DyCo-RL,通过Fisher-Rao测地距离量化模态内注意力转移,实现动态跨模态协调,并利用对齐引导的优势重加权优化策略,提升多模态大模型在视觉推理中的表现。

详情
AI中文摘要

具有可验证奖励的强化学习(RLVR)已成为增强多模态大语言模型(MLLMs)视觉推理的主要范式。然而,现有的RLVR方法主要针对推理结果进行优化,从根本上忽略了生成过程中所需的细粒度跨模态协调。通过token级分析和控制干预,我们揭示了在思维链(CoT)推理过程中,MLLMs经常无法在提取视觉证据和合成文本上下文之间动态交替——这种协调崩溃与推理失败存在因果关系。受这些发现的启发,我们提出了DyCo-RL,它将动态跨模态协调集成到RLVR优化中。具体来说,DyCo-RL使用Fisher-Rao测地距离来度量模态内注意力转移,将token分配到视觉导向或文本导向的功能角色。然后,它评估token实际注意力分配与其分配角色之间的一致性,利用该分数在策略优化期间进行对齐引导的优势重加权。大量实验表明,算法无关的DyCo-RL应用于Qwen2.5-VL-3B/7B时,在涵盖视觉中心和数学推理的七个基准测试中,一致地改进了四种代表性的RLVR算法。

英文摘要

Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a leading paradigm for enhancing visual reasoning in Multimodal Large Language Models (MLLMs). However, existing RLVR methods optimize primarily for the reasoning outcome, fundamentally overlooking the fine-grained cross-modal coordination required during the generation process. Through token-level analyses and controlled interventions, we reveal that during Chain-of-Thought (CoT) reasoning, MLLMs frequently fail to dynamically alternate between extracting visual evidence and synthesizing textual context-a coordination breakdown that is causally linked to reasoning failures. Motivated by these findings, we propose DyCo-RL, which integrates dynamic cross-modal coordination into RLVR optimization. Specifically, DyCo-RL uses the Fisher-Rao geodesic distance to measure within-modality attention shifts, assigning tokens to either visually-oriented or text-oriented functional roles. It then evaluates the alignment between a token's actual attention allocation and its assigned role, leveraging this score for alignment-guided advantage reweighting during policy optimization. Extensive experiments demonstrate that the algorithm-agnostic DyCo-RL, when applied to Qwen2.5-VL-3B/7B, consistently improves four representative RLVR algorithms across seven benchmarks spanning visual-centric and mathematical reasoning.

2606.08034 2026-06-09 cs.CV cs.AI cs.CL 新提交

Sci-Rho: A Multilingual Visually-Grounded Symbolic Benchmark for STEM Problems

Sci-Rho:面向STEM问题的多语言视觉基础符号基准

Muhammad Falensi Azmi, Ikhlasul Akmal Hanif, Vallerie Alexandra Putra, Adi Yeltay, Abdullah Mubarak, Fajri Koto

发表机构 * Independent Researcher(独立研究员) MBZUAI(穆罕默德·本·扎耶德人工智能大学) Binus University(比努斯大学) Bandung Institute of Technology(万隆理工学院)

AI总结 提出Sci-Rho,一个多语言、视觉基础的STEM问题动态基准,包含4242个模板和42420个实例,评估17个VLM发现最差精度与平均精度存在差距,且小模型跨语言性能下降。

Comments 22 pages

详情
AI中文摘要

符号基准已成为评估模型在STEM相关问题微小修改下鲁棒性的关键方法。然而,现有符号基准大多局限于数学推理,缺乏视觉基础,且主要以英语为主。在这项工作中,我们引入了Sci-Rho(科学鲁棒性),一个面向视觉基础STEM问题的动态基准,涵盖五个学科和七种语言,包含由领域专家(包括奥林匹克奖牌得主)精心设计的4,242个问题模板(每种语言606个)。每个模板实现为可执行的Python代码,通过改变数值、视觉模式、几何形状、颜色方案和函数类型,生成多样但等价的问题实例,总共产生42,420个实例,每个实例都配有推理步骤和真实解决方案。我们评估了17个最先进的VLM,发现最差情况准确率(定义为模型在每种生成变体上均正确回答的问题模板比例)与平均准确率之间存在明显差距。我们还发现,较小的模型在不同语言上表现出显著的性能下降,而专有模型和较大模型保持鲁棒。步骤级评估反映了相同的趋势,揭示了平均F1与最差情况F1分数之间的显著差距。最后,我们对VLM注意力头的检查显示,图像标记与文本标记的相对注意力分配存在显著的跨语言变化。我们的工作强调了超越静态基准的评估作为衡量VLM质量指标的重要性。

英文摘要

Symbolic benchmarks have emerged as a key approach to assess model robustness under minor modifications to STEM-related questions. However, existing symbolic benchmarks mostly remain limited to mathematical reasoning, lack visual grounding, and are predominantly in English. In this work, we introduce Sci-Rho (Science Rhobustness), a dynamic benchmark for visually-grounded STEM problems spanning five subjects and seven languages, comprising 4,242 problem templates (606 per language) crafted by domain experts, including Olympiad medalists. Each template is implemented as executable Python code that generates diverse but equivalent problem instances by varying numerical values, visual patterns, geometric shapes, color schemes, and function types, resulting in 42,420 instances in total, each paired with reasoning steps and ground-truth solutions. We evaluated 17 state-of-the-art VLMs and discovered a noticeable gap between worst-case accuracy (defined as the proportion of problem templates that a model answers correctly across every generated variation) and average accuracy. We also discovered that smaller models show noticeable performance degradation across languages, whereas proprietary and larger models remain robust. Step-level evaluation reflects this same trend, revealing a significant gap between average F1 and worst-case F1 scores. Finally, our inspection of attention heads of a VLM reveals substantial cross-lingual variation in the relative attention allocated to image tokens compared to text tokens. Our work highlights the importance of evaluation beyond static benchmarks as a metric to measure the quality of VLMs.

2606.08033 2026-06-09 cs.CV cs.LG 新提交

Balancing Real and Synthetic Data for CNN-based Masonry Crack Detection

基于CNN的砌体裂缝检测中真实与合成数据的平衡

Mattia Forlesi, Alfonso Esposito, Ivan Zyrianoff, Alessandro Marzani, Marco Di Felice

发表机构 * University of Bologna(博洛尼亚大学)

AI总结 针对砌体裂缝检测中真实数据不足的问题,提出用合成数据补充训练,通过调整真实与合成数据比例,发现20%真实数据加合成数据即可达到甚至超越纯真实数据的效果。

详情
AI中文摘要

裂缝是建筑健康的关键指标,早期识别对于防止有害损害至关重要。深度学习(DL)的进展,特别是卷积神经网络(CNN),已实现可扩展的自动裂缝检测解决方案。然而,CNN性能高度依赖于大规模多样化数据集的可用性,这对于砌体等复杂表面尤其具有挑战性。收集足够的真实数据耗时,而公开数据集可能不充分。为解决这一限制,我们探索生成合成裂缝数据,以补充真实数据并提高训练效果。真实数据集由从博洛尼亚及周边地区建筑收集的砌体裂缝图像组成。相比之下,合成数据集使用裂缝叠加工具生成,该工具以受控方向和位置向背景图像添加裂缝。使用真实数据集训练多种DL架构,以确定最佳性能模型(InceptionV4),用于生成数据的实验。通过改变真实与合成数据的比例,在InceptionV4上测试了六种训练场景,并在由真实图像组成的测试集上使用F1分数和平均交并比(mIoU)指标进行评估。结果表明,在合成数据上训练加上少量20%真实数据,可获得与仅使用真实数据训练相当的结果。此外,20/80(合成/真实)场景实现了76%的F1分数和80%的平均IoU,优于纯真实情况。可以看出,该方法展示了合成数据在减少收集工作同时提高裂缝检测准确性的潜力。

英文摘要

Cracks are a critical indicator of building health, and early stage identification is fundamental to prevent harmful damages. Advances in deep learning (DL), particularly convolutional neural networks (CNNs), have enabled scalable solutions for automated crack detection. However, CNN performance strongly depends on the availability of large and diverse datasets, which is particularly challenging for complex surfaces such as masonry. Collecting sufficient real data is time-consuming, while publicly available datasets may not be adequate. To address this limitation, we explored generating synthetic crack data, which complements real data and improves training effectiveness. The real dataset consists of masonry crack images collected from buildings in Bologna and surrounding areas. In contrast, the synthetic dataset was generated using a crack overlay tool that adds cracks to background images in a controlled orientation and placement. The real dataset was used to train several DL architectures, to identify the best-performing model (InceptionV4) employed for experiments with generated data. Six training scenarios were tested in InceptionV4 by varying the ratio of real and synthetic data, with evaluation performed on a test set composed of real images using the F1-score and mean Intersection over Union (mIoU) metrics. Results show that training on synthetic data plus a modest addition of 20% real data achieves results comparable to training on real data only. Moreover, the 20/80 scenario (synthetic/real) achieved an 76% F1-score and 80% mean IoU, outperforming the real-only case. As can be seen, the method demonstrates the potential of synthetic data to reduce collection efforts while enhancing crack detection accuracy.

2606.08031 2026-06-09 cs.CV 新提交

Vision-Language Asymmetry in Bistable Image Captioning

双稳态图像描述中的视觉-语言不对称性

Arohan Agate

发表机构 * Arohan Agate

AI总结 通过83个双稳态刺激的行为基线和稀疏自编码器分析,发现视觉塔中同时激活两种解释,但因果干预仅能翻转默认主导刺激的描述,揭示视觉表征与语言承诺之间的不对称性。

Comments Accepted at ICML 2026 Workshop on Philosophy of Machine Learning

详情
AI中文摘要

维特根斯坦的鸭兔图对视觉-语言模型提出了一个问题:当一个模型对模糊图像进行描述时,模型中的哪一部分决定了对某一方面的承诺?我们通过83个双稳态刺激的3,320次生成行为基线来解决这个问题,该基线在中性提示与强制选择提示下揭示了三种状态(默认主导、强制主导、强制平衡),然后使用我们在LLaVA-1.6-7B实际使用的CLIP层上训练的TopK稀疏自编码器(验证EV 0.93)来探测底层表示。在69个具有每方面特征池的双稳态刺激中,72%(50/69)在视觉塔处显示两个池同时激活,包括12/12的默认主导鸭/兔和7/8的强制平衡年轻/老年。在CLIP层22进行因果干预可以在默认主导刺激上翻转描述(在流畅性保护下兔翻转率为33%),但在任何测试系数下都无法翻转强制平衡年轻/老年的描述,尽管其视觉侧存在叠加。主导瓶颈位于视觉塔下游;视觉侧表示与语言侧承诺之间的差距是“看见”与“看作”区别的经验把手。我们还指出一个方法论注意事项:基于TopK SAE输出的秩统计需要经过结校正的排序以避免无声的行序偏差。

英文摘要

Wittgenstein's duck-rabbit poses a question for vision-language models: when a model captions an ambiguous image, where in the model is the commitment to one aspect made? We address this with a 3,320-generation behavioral baseline over 83 bistable stimuli that surfaces three regimes (default-dominant, force-dominant, force-balanced) under neutral vs forced-choice prompting, then probe the underlying representations using a TopK sparse autoencoder we train on the CLIP layer that LLaVA-1.6-7B actually consumes (validation EV 0.93). Across 69 bistable stimuli with both per-aspect feature pools available, 72% (50/69) show simultaneous activation of both pools at the vision tower, including 12/12 default-dominant duck/rabbit and 7/8 force-balanced young/old. Causal steering at CLIP layer 22 flips captions on default-dominant stimuli (33% rabbit-flip rate under a fluency guard) but cannot flip captions on force-balanced young/old at any tested coefficient, despite their vision-side superposition. The dominance bottleneck lives downstream of the vision tower; the gap between vision-side representation and language-side commitment is an empirical handle on the seeing/seeing-as distinction. We also flag a methodological note: rank-based statistics on TopK SAE outputs require tie-corrected ranking to avoid silent row-order bias.

2606.08029 2026-06-09 cs.RO 新提交

IntentNav: Learning Spatial-Visual Object Navigation from Human Demonstrations

IntentNav: 从人类演示中学习空间-视觉物体导航

Yuxin Cai, Zongtai Li, Maonan Wang, Muyi Bao, Haokun Zhu, Ruofei Bai, Ding Zhao, Zirui Li, Wenshan Wang, Wei-Yun Yau, Ji Zhang, Chen Lv

发表机构 * Nanyang Technological University(南洋理工大学) Carnegie Mellon University(卡内基梅隆大学) The Chinese University of Hong Kong(香港中文大学) A*STAR Institute for Infocomm Research (I2R)(新加坡科技研究局资讯通信研究院)

AI总结 提出IntentNav框架,通过人类演示学习类人物体导航策略,利用前沿标注和意图对齐目标实现最优性能,并零样本迁移到多种机器人平台。

Comments 26 pages, 9 figures

详情
AI中文摘要

物体导航要求机器人在未知环境中搜索未观察到的目标,通过在部分可观测性下决定下一步探索位置。有效的搜索类似于人类探索:选择性探查视觉上有希望的前沿,同时依赖空间记忆避免重复访问。我们提出IntentNav,一个从人类演示中学习类人ObjectNav策略的空间-视觉模仿框架。为了从低级人类动作推断高级搜索意图,我们引入了基于前沿的人类意图标注,该方法前瞻人类演示并标注最能解释演示者未来搜索方向的前沿。我们构建了一个空间-视觉候选空间,其中BEV记忆跟踪已探索区域、未探索前沿和轨迹历史,而自我中心视觉记忆为每个候选提供语义线索。训练一个VLM策略在这些基于上下文的候选中进行选择,使用意图对齐目标以鼓励一致且类人的探索。IntentNav在MP3D、HM3D-v1和HM3D-v2 ObjectNav基准上实现了最先进的性能。所提出的候选级导航界面无需进一步VLM微调即可零样本迁移到轮式、四足和类人机器人。\href{https://anonymous.4open.science/w/IntentNav/}{项目页面}。

英文摘要

Object navigation requires a robot to search for an unobserved target in an unknown environment by deciding where to explore next under partial observability. Effective search resembles human-like exploration: selectively probing visually promising frontiers while relying on spatial memory to avoid redundant revisits. We propose IntentNav, a spatial-visual imitation framework that learns human-like ObjectNav policies from human demonstrations. To infer high-level search intent from low-level human actions, we introduce Frontier-based Human-Intent Labeling, which looks ahead in human demonstrations and labels the frontier that best explains the demonstrator's future search direction. We construct a spatial-visual candidate space, where BEV memory tracks explored regions, unexplored frontiers, and trajectory history, while egocentric visual memory provides semantic cues for each candidate. A VLM policy is trained to select among these grounded candidates, using Intent-Aligned Objective to encourage consistent and human-like exploration. IntentNav achieves state-of-the-art performance on the MP3D, HM3D-v1 and HM3D-v2 ObjectNav benchmarks. The proposed candidate-level navigation interface transfers zero-shot to wheeled, quadruped, and humanoid robots without further VLM fine-tuning. \href{https://anonymous.4open.science/w/IntentNav/}{Project page}.

2606.08028 2026-06-09 cs.LG 新提交

Noise-Adaptive High-Probability Regret Bounds for Online Convex Optimization

噪声自适应的在线凸优化高概率遗憾界

Wentao Zhang, Yutong Zhang, Wentao Mo

发表机构 * Tsinghua Shenzhen International Graduate School, Tsinghua University(清华大学深圳国际研究生院) College of Mathematics, Sichuan University(四川大学数学学院)

AI总结 针对强凸损失在线凸优化,提出噪声自适应高概率遗憾界,在完全信息下实现与噪声水平相关的乘性改进,并证明赌博反馈下遗憾与置信度的线性关系,同时为约束优化提供联合高概率保证。

Comments Accepted to 2026 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases(ECML-PKDD 2026)

详情
AI中文摘要

我们研究了具有强凸损失的在线凸优化(OCO)的高概率遗憾界,并建立了三个结果,解决了噪声自适应性、反馈结构和约束满足交叉领域的开放问题。对于具有次高斯随机梯度的完全信息设置,我们证明了一个噪声自适应的高概率遗憾界,其中鞅偏差项与噪声水平$σ$而非梯度界$G$成比例,相比经典的Azuma-Hoeffding基线实现了$G/σ$的乘性改进。我们的分析引入了一个指数超鞅论证,绕过了Freedman不等式的有界差分要求,从而无需截断伪影即可直接处理无界次高斯噪声。对于赌博反馈,我们证明了一个极小极大下界:高概率遗憾与$\log(1/δ)$线性增长,而完全信息下的置信成本为$\sqrt{\log(1/δ)}$。这构成了强凸OCO在不同反馈模型下置信成本的正式分离。关于具有满足Slater条件的随机约束的约束OCO,我们为累积遗憾和长期约束违反提供了同时的高概率保证,实现了$\mathcal{O}(\sqrt{T\log(m/δ)})$的遗憾和$\mathcal{O}(\sqrt{T}/(ζδ) + m\sqrt{T\log(m/δ)})$的违反。合成实验证实了所有理论预测。

英文摘要

We study high-probability regret bounds for online convex optimization (OCO) with strongly convex losses and establish three results that resolve open questions at the intersection of noise adaptivity, feedback structure, and constraint satisfaction. For the full-information setting with sub-Gaussian stochastic gradients, we prove a noise-adaptive high-probability regret bound in which the martingale deviation term scales with the noise level $σ$ rather than the gradient bound $G$, yielding a multiplicative improvement of $G/σ$ over the classical Azuma-Hoeffding baseline. Our analysis introduces an exponential supermartingale argument that bypasses the bounded-difference requirement of Freedman's inequality, enabling direct treatment of unbounded sub-Gaussian noise without truncation artifacts. For bandit feedback, we prove a minimax lower bound: the high-probability regret scales linearly in $\log(1/δ)$, in contrast to the $\sqrt{\log(1/δ)}$ confidence cost under full information. This constitutes a formal separation in the confidence cost of strongly convex OCO across feedback models. Regarding constrained OCO with stochastic constraints satisfying a Slater condition, we provide simultaneous high-probability guarantees for both cumulative regret and long-run constraint violation, achieving $\mathcal{O}(\sqrt{T\log(m/δ)})$ regret and $\mathcal{O}(\sqrt{T}/(ζδ) + m\sqrt{T\log(m/δ)})$ violation. Synthetic experiments corroborate all theoretical predictions.

2606.08027 2026-06-09 cs.LG cs.AI 新提交

CausShield: Sample Reconstruction-Resilient Vertical FL via Causal Representation Learning

CausShield: 通过因果表示学习实现样本重建鲁棒的纵向联邦学习

Yongqi Jiang, Yansong Gao, Siguang Chen, Anmin Fu

发表机构 * Nanjing University of Science and Technology(南京理工大学) University of Western Australia(西澳大学) Hohai University(河海大学) Nanjing University(南京大学)

AI总结 针对纵向联邦学习中样本重建攻击的防御问题,提出基于因果表示学习的CausShield方法,将共享表示分解为任务相关与无关部分,实现全周期隐私保护,理论证明收敛性,实验优于七种最新方法。

详情
AI中文摘要

纵向联邦学习(VFL)是一种分布式学习范式,利用跨孤立方的垂直划分特征,无需共享原始样本;然而,它仍然容易受到主动样本重建攻击。现有防御方法由于要么抑制任务相关信息的同时也抑制了隐私敏感特征,要么依赖端到端监督训练来收敛防御模块(这暴露了早期轮次的脆弱性),因此无法在模型效用和隐私保护之间实现令人满意的权衡。为了解决这一挑战,我们采用结构因果模型(SCM)的见解,构建了CausShield。从任务学习的角度来看,原始样本中的因果特征是那些直接相关且有助于学习目标的特征,而非因果特征与任务无关,但通常编码了样本特定的私有信息,从而促进了重建。重要的是,我们奠定了理论基础来证明这一见解。因此,CausShield将VFL中客户端与协调服务器之间的共享表示分解为任务相关和任务无关的组件,以确保全周期的隐私保护。然而,由于在保持模型效用的同时减轻隐私泄露的双重目标,这种分解本质上具有挑战性。我们通过一个精心制定的优化问题来解决这一问题,该问题通过无监督表示学习求解。我们进一步从理论上证明CausShield保持了标准VFL的收敛行为。大量实验将CausShield与七种最新方法(包括InvL (USENIX Security'25))进行比较,并评估了对高级重建攻击(如URVFL (NDSS'25))的鲁棒性。结果表明,CausShield在隐私保护、模型效用和计算效率方面始终表现优异。

英文摘要

Vertical federated learning (VFL) is a distributed learning paradigm that leverages vertically partitioned features across isolated parties without sharing raw samples; however, it remains vulnerable to active sample reconstruction attacks. Existing defenses fail to achieve a satisfactory trade-off between model utility and privacy protection, due to either suppressing task-relevant information alongside privacy-sensitive features or relying on end-to-end supervised training to converge the defense module, which exposes the model to early-epoch vulnerability. To address this challenge, we adopt a structural causal model (SCM) insight and construct CausShield. From a task-learning standpoint, causal features within a raw sample are those that are directly relevant and contributory to the learning objective, whereas non-causal features are task-irrelevant but often encode sample-specific private information, thereby facilitating reconstruction. Importantly, we lay a theoretical foundation to prove this insight. CausShield thus decomposes the shared representations between the client and the coordinating server in VFL into task-relevant and task-irrelevant components to ensure full-cycle privacy protection. Nonetheless, the decomposition is inherently challenging due to the dual objectives of preserving model utility while mitigating privacy leakage. We address this via a carefully formulated optimization problem, which is solved through unsupervised representation learning. We further theoretically prove that CausShield preserves the convergence behavior of standard VFL. Extensive experiments compare CausShield against seven SOTAs, including InvL (USENIX Security'25), and evaluate robustness against advanced reconstruction attacks such as URVFL (NDSS'25). Results demonstrate that CausShield consistently outperforms in privacy protection, model utility, and computational efficiency.

2606.08025 2026-06-09 cs.CL 新提交

Arabic Sentence Segmentation Across Genres and Punctuation Conditions

跨体裁与标点条件下的阿拉伯语句子分割

Mohammed Elkholy, Khalid N. Elmadani, Nizar Habash, Bashar Alhafni

发表机构 * Mohamed bin Zayed University of Artificial Intelligence(莫扎德·本·泽德人工智能大学) New York University Abu Dhabi(纽约大学阿布扎比分校)

AI总结 针对阿拉伯语标点歧义和不一致导致的句子分割难题,构建跨8种体裁的语料库AraSEG,评估LLM、轻量编码器和依存解析器,发现轻量模型在困难设置下优于LLM,且准确分割能显著提升下游依存解析。

详情
AI中文摘要

阿拉伯语的句子分割因标点符号的歧义和不一致而具有挑战性,许多文本缺乏可靠的句子边界标记。现有方法严重依赖标点线索,且通常在格式良好的文本上进行评估,限制了其在真实阿拉伯语环境中的鲁棒性。为解决这一问题,我们引入了AraSEG,一个跨体裁的句子分割语料库,涵盖八种体裁以及广泛的标点和文档结构条件。利用AraSEG,我们在日益具有挑战性的分割设置下评估了LLM、轻量级编码器模型和基于依存解析器的模型。我们的实验表明,在最困难的设置中,轻量级编码器甚至基于依存解析器的模型都优于LLM。我们进一步研究了训练数据规模和体裁多样性的影响,发现性能最终会饱和,且跨体裁泛化仍然具有挑战性。我们还证明了准确的句子分割能显著改善下游的依存解析。我们将公开我们的代码、数据和模型。

英文摘要

Sentence segmentation in Arabic is challenging due to ambiguous and inconsistent punctuation, with many texts lacking reliable sentence boundary markers. Existing approaches rely heavily on punctuation cues and are typically evaluated on well-formed text, limiting their robustness in realistic Arabic settings. To address this, we introduce AraSEG, a genre-diverse sentence segmentation corpus spanning eight genres and a wide range of punctuation and document structure conditions. Using AraSEG, we evaluate LLMs, lightweight encoder models, and dependency parser-based models under increasingly challenging segmentation settings. Our experiments show that lightweight encoders, and even dependency parser-based models, outperform LLMs in the most challenging settings. We further investigate the effects of training data size and genre diversity, finding that performance eventually saturates and cross-genre generalization remains challenging. We also demonstrate that accurate sentence segmentation substantially improves downstream dependency parsing. We make our code, data, and models publicly available.

2606.08021 2026-06-09 cs.LG cs.AI cs.MA 新提交

Semantic Quorum Assurance: Collective Certification for Non-Deterministic AI Infrastructure

语义法定数保证:面向非确定性AI基础设施的集体认证

Jun He, Deying Yu

发表机构 * OpenKedge.io

AI总结 提出语义法定数保证(SQA),一种通过多样化验证者群体和风险自适应法定数谓词,将非确定性LLM代理的不安全操作批准率从18.5%降至0.3%的控制平面原语。

Comments 21 pages, 2 figures, 6 tables

详情
AI中文摘要

随着大型语言模型(LLM)代理被集成到自主云操作中,分布式系统面临一个语义可靠性问题:提议代理可以生成语法有效且静态授权但操作不安全的生成突变,例如修改IAM策略、开放防火墙安全组或执行数据导出。经典的分布式共识协议复制确定性状态转换,但不评估提议意图的安全性。为弥补这一差距,我们引入语义法定数保证(SQA),一种用于治理非确定性代理基础设施的控制平面原语。SQA将提议表示为绑定到密码证据链的声明性执行合约,并将其路由到由只读、沙盒验证代理组成的多样化面板。SQA在风险自适应法定数谓词下聚合其判断,该谓词强制执行模型和原型多样性,根据校准的保证分数调整权重,并尊重特定原型的否决。通过的提议仅通过主权执行门执行。我们在云原生控制平面中实例化SQA,并为非确定性验证者形式化了一个相关的认知失败模型。在500个基础设施启发的突变场景中,安全结果报告在保留的安全/不安全试验上(排除模糊场景),SQA将不安全批准率从单代理验证的18.5%降低到0.3%,同时在研究风险桶中增加了1.45-4.12秒的中位验证延迟。

英文摘要

As large language model (LLM) agents are integrated into autonomous cloud operations, distributed systems face a semantic reliability problem: proposer agents can generate production mutations, such as modifying IAM policies, opening firewall security groups, or executing data exports, that are syntactically valid and statically authorized but operationally unsafe. Classical distributed consensus protocols replicate deterministic state transitions but do not evaluate the safety of the proposed intent. To address this gap, we introduce Semantic Quorum Assurance (SQA), a control-plane primitive for governing non-deterministic agentic infrastructure. SQA represents proposals as declarative execution contracts bound to cryptographic evidence chains and routes them to a diverse panel of read-only, sandboxed validator agents. SQA aggregates their judgments under a risk-adaptive quorum predicate that enforces model and archetype diversity, adjusts weights based on calibrated assurance scores, and respects archetype-specific vetoes. Admitted proposals execute only through a sovereign execution gate. We instantiate SQA in a cloud-native control plane and formalize a correlated cognitive failure model for non-deterministic validators. On 500 infrastructure-inspired mutation scenarios, with safety results reported on held-out safe/unsafe trials excluding ambiguous scenarios, SQA reduces unsafe approval from 18.5% for single-agent validation to 0.3% while adding median validation latency of 1.45--4.12 seconds across the studied risk buckets.

2606.08018 2026-06-09 cs.AI 新提交

UniQL: Towards Dialect-Universal Benchmarking for Text-to-SQL

UniQL:迈向方言通用的文本到SQL基准测试

Jianling Gao, Chongyang Tao, Jiayuan Bai, Liu Yang, Xuanguang Pan, Jinrui Liu, Shihao Xing, Xiaohan Xu, Jie Liang, Shuai Ma

发表机构 * SKLCCSE, Beihang University(北京航空航天大学软件开发环境国家重点实验室) The University of Hong Kong(香港大学)

AI总结 提出UniQL基准,通过跨16种SQL方言的对齐标注,评估模型在不同数据库系统间的泛化能力,揭示现有模型在方言通用性上的不足。

详情
AI中文摘要

现有的文本到SQL基准测试主要集中在SQLite上,这使得评估模型能否跨异构SQL方言泛化变得困难。然而,现实世界的数据库系统在语法、函数、类型系统和执行语义上存在显著差异,因此相同的自然语言意图通常需要特定方言的SQL实现。我们引入了UniQL,一个用于跨方言文本到SQL评估的人工验证基准。UniQL将1,534个自然语言问题与16种SQL方言的可执行SQL注释对齐,产生了24,544个方言特定的查询。所有方言共享相同的意图、对齐的模式和数据库内容,从而实现了对方言泛化的可控评估。UniQL通过一个混合流水线构建,结合了数据库迁移、SQL翻译、执行引导验证、迭代规则总结和人工验证。在开源和闭源LLM上的实验表明,当前模型远未达到方言通用,在不同数据库系统间性能差异显著,且从SQLite成功到其他方言的迁移有限。这些发现凸显了对齐的跨方言基准和更注重方言的文本到SQL方法的必要性。代码和数据可在https://github.com/JerryGao818/UniQL获取。

英文摘要

Existing text-to-SQL benchmarks are largely centered on SQLite, making it difficult to evaluate whether models can generalize across heterogeneous SQL dialects. However, real-world database systems differ substantially in syntax, functions, type systems, and execution semantics, so the same natural language intent often requires dialect-specific SQL realizations. We introduce UniQL, a human-verified benchmark for cross-dialect text-to-SQL evaluation. UniQL aligns 1,534 natural language questions with executable SQL annotations across 16 SQL dialects, yielding 24,544 dialect-specific queries. All dialects share the same intents, aligned schemas and database contents, enabling controlled evaluation of dialect generalization. UniQL is constructed through a hybrid pipeline combining database migration, SQL translation, execution-guided verification, iterative rule summarization, and human validation. Experiments on both open-source and closed-source LLMs show that current models remain far from dialect-universal, with substantial performance variation across database systems and limited transfer from SQLite success to other dialects. These findings highlight the need for aligned cross-dialect benchmarks and more dialect-aware text-to-SQL methods. Code and data are available at https://github.com/JerryGao818/UniQL

2606.08016 2026-06-09 cs.CV cs.AI cs.CL 新提交

IEA: Amateur-Friendly Conversational Image Editing Agent via Three Stages of Multitask Alignment

IEA:通过三阶段多任务对齐的业余友好型对话式图像编辑代理

Zichen Zhu, Yuheng Sun, Mingxuan Zhu, Wenjie Ma, Situo Zhang, Zhexiang Wang, Ziyue Yang, Danyang Zhang, Kunyao Lan, Zihan Zhao, Dingye Liu, Siqi Xiang, Lu Chen, Kai Yu

发表机构 * Shanghai Jiao Tong University(上海交通大学) Shanghai Innovation Institution(上海创新研究院) Huawei Technologies Ltd.(华为技术有限公司) Nanyang Technological University(南洋理工大学) Jiangsu Key Lab of Language Computing(江苏省语言计算重点实验室)

AI总结 提出IEA对话式图像编辑代理,通过三阶段多任务训练学习操作参数化工具,实现可解释编辑轨迹,在像素距离和ROUGE-L指标上优于基线,用户研究中指令跟随和感知质量表现最佳。

Comments [CVPR 2026 Findings] Our data and code are released at https://github.com/OpenDFM/Image_Edit_Agent

详情
AI中文摘要

当前的图像编辑软件通常依赖于固定滤镜或专家调参,导致业余用户的意图与结果之间存在差距。生成模型创建的图像可能包含伪影、不合理的细节或偏离真实感的风格漂移,并且对编辑原因缺乏解释。我们提出IEA,一个对话式图像编辑代理,它学习在显式、可解释的动作空间中操作参数化工具。IEA通过三阶段多任务流水线进行训练:(1) 在蒸馏专家编辑上进行SFT,(2) 使用GRPO进行奖励优化,奖励包括相似度改进、工具有用性和意图总结,(3) 大规模合成微调以联合掌握图像编辑、细化和用户意图总结。通过逐步操作16个编辑工具,IEA产生透明的编辑轨迹,可以检查和调试。在定量实验中,它在编辑任务上获得更低的像素距离,在总结任务上获得比强基线更高的ROUGE-L。在用户研究中,它在指令跟随方面在工具调用方法中排名最佳,同时在整体感知质量上超越生成方法。我们的结果验证了可解释的、以工具为中心的VLM作为人类指令引导图像润色的可靠路径。

英文摘要

Current image editing software often hinges on fixed filters or expert tuning, leaving a gap between amateur users' intent and outcomes. Creations by generative models may contain artifacts, implausible details, or stylistic drift away from photorealism and offer little insight into why an edit was made. We propose IEA, a conversational Image Editing Agent that learns to operate parameterized tools in an explicit, interpretable action space. IEA is trained via a three-stage multitask pipeline: (1) SFT on distilled expert edits, (2) GRPO with rewards for likeness improvement, tool usefulness, and intent summarization, and (3) large-scale synthetic fine-tuning to jointly master image editing, refinement, and user intent summarization. By manipulating 16 editing tools step by step, IEA produces transparent edit traces that can be inspected and debugged. In quantitative experiments, it attains a lower pixel distance on the edit task and a higher ROUGE-L on the summary task than strong baselines. In user studies, it ranks best among tool-calling methods for instruction following while surpassing generative methods in overall perceptual quality. Our results validate interpretable, tool-centric VLMs as a reliable path to human instruction-guided image retouching.

2606.08015 2026-06-09 cs.RO 新提交

Q-VGM: Q-Guided Value-Gradient Matching for Flow-Matching VLA Policies

Q-VGM: 基于Q引导的值梯度匹配的流匹配VLA策略

Ziqian Wang, Jiayu Sun, Xingjian Mao, Minqian Wang, Yao Mu

发表机构 * Shanghai Jiao Tong University(上海交通大学) University of Michigan, Ann Arbor(密歇根大学安娜堡分校) University of Electronic Science and Technology of China(电子科技大学)

AI总结 提出Q-VGM离线强化学习方法,通过将值梯度转化为去噪时间上的值梯度场,避免反向传播去噪链,高效微调流匹配VLA策略,在LIBERO等任务上显著提升成功率。

Comments 13 pages, 3 figures, 4 tables

详情
AI中文摘要

我们提出Q引导的值梯度匹配(Q-VGM),一种离线强化学习方法,解决了微调流匹配视觉-语言-动作(VLA)策略中长期存在的挑战:如何高效地根据学习到的Q函数改进一个表达力强的流匹配动作专家。有效的改进必须利用评论家的一阶(梯度)信息,但这对于流策略很困难,因为直接通过其多步去噪过程反向传播值函数在VLA规模下数值不稳定,而策略梯度方法所需的可处理动作似然在迭代去噪下不可用。现有的基于值的方法要么通过整个去噪链反向传播,要么仅在测试时使用评论家而不更新策略,要么将评论家改进的动作作为终端标签蒸馏而不监督速度场。Q-VGM通过利用VGG-Flow(一种生成建模中流对齐的值梯度视角)绕过了这些问题,它将值梯度转化为去噪时间上的值梯度场,而不是不稳定的端到端目标。这不需要动作似然,也不需要反向传播去噪链,并且在一个固定的重放缓冲区上操作。评论家是一个动作敏感的Cal-QL集成,基于紧凑的RLT特征和每层动作注入。Q-VGM实现了一种实用的少样本初始化然后从经验中学习的范式:从少样本SFT pi0.5 VLA开始,该方法利用自生成的rollout数据显著提升任务性能,无需额外的专家监督。在LIBERO上,Q-VGM将平均成功率从75.0%提升到92.5%;在RoboTwin 2.0上,从76.4%提升到87.2%;在两个真实机器人桌面任务上,从40.0%提升到67.5%,在所有三种设置中均优于所有相同骨干、相同评论家的基线。

英文摘要

We propose Q-Guided Value-Gradient Matching (Q-VGM), an off-policy reinforcement learning (RL) method that tackles a long-standing challenge in fine-tuning flow-matching vision-language-action (VLA) policies: efficiently improving an expressive flow-matching action expert with respect to a learned Q-function. Effective improvement must exploit the first-order (gradient) information of the critic, but this is difficult for flow policies, because directly back-propagating the value through their multi-step denoising process is numerically unstable at VLA scale, while the tractable action likelihoods required by policy-gradient methods are unavailable under iterative denoising. Existing value-based methods either backpropagate through the full denoising chain, use the critic only at test time without updating the policy, or distill critic-improved actions as terminal labels without supervising the velocity field. Q-VGM sidesteps these issues by leveraging VGG-Flow, a value-gradient view of flow alignment in generative modeling that transforms value gradient into a denoising-time value-gradient field rather than an unstable end-to-end objective. This requires no action likelihoods and no backpropagation through the denoising chain, and operates on a fixed replay buffer. The critic is an action-sensitive Cal-QL ensemble over compact RLT features with per-layer action injection. Q-VGM enables a practical few-shot initialization then learn-from-experience paradigm: starting from a few-shot-SFT pi0.5 VLA, the method leverages self-generated rollout data to substantially improve task performance without additional expert supervision. On LIBERO, Q-VGM raises the average success rate from 75.0% to 92.5%; on RoboTwin 2.0, from 76.4% to 87.2%; and on two real-robot tabletop tasks, from 40.0% to 67.5%, outperforming all same-backbone, same-critic baselines across all three settings.

2606.08014 2026-06-09 cs.CV cs.AI 新提交

GVC-Seg: Training-Free 3D Instance Segmentation via Geometric Visual Correspondence

GVC-Seg: 基于几何视觉对应的免训练3D实例分割

Liang Xu, Fangjing Wang, Jinyu Yang, Feng Zheng

发表机构 * Victoria University of Wellington(惠灵顿维多利亚大学) Harbin Institute of Technology, Shenzhen(哈尔滨工业大学(深圳)) Southern University of Science and Technology(南方科技大学)

AI总结 提出GVC-Seg,一种免训练的3D实例分割方法,通过几何与视觉特征对应消除多模型集成中的置信度偏差,在多个基准上达到最优性能。

Comments 10 pages, 5 figures

详情
AI中文摘要

点云数据中的精确3D实例分割对于机器视觉应用至关重要。最近的研究利用多个预训练基础模型生成3D提案,然后应用提案聚合方法,显著提升了性能。然而,由于不同分割模型之间置信度水平的固有差异,它们通常会产生次优结果,导致偏向于置信度更高的模型。这种偏差本质上是模型依赖的,并受到数据预处理技术和训练策略等因素的影响。为了解决这一偏差,我们提出了一种新颖的、免训练的3D实例分割方法,通过几何视觉对应(GVC-Seg)来利用3D几何线索与2D视觉线索之间的对应关系,以减轻置信度偏差。此外,在实例掩码生成和实例语义推理过程中,分别引入了3D提案生成模块和掩码感知的CLIP特征提取模块。通过这种方式,GVC-Seg增强了提案质量评估,确保了不同模型之间的无偏集成学习。大量实验表明,我们的方法在多个具有挑战性的基准上达到了最先进的性能,同时在开放词汇语义分割设置中也展现出强大的潜力。

英文摘要

Accurate 3D instance segmentation in point cloud data is critical for machine vision applications. Recent advancements leverage multiple pre-trained foundation models to generate 3D proposals, followed by the application of proposal aggregation methods, which significantly enhance performance. However, they often produce sub-optimal results due to inherent variations in confidence levels across different segmentation models, resulting in a bias toward the model with higher confidence. This bias is inherently model-dependent and is influenced by factors such as data preprocessing techniques and training strategies. To address this bias, we propose a novel, training-free 3D instance segmentation approach via Geometric Visual Correspondence (GVC-Seg), which exploits the correspondence between 3D geometric cues and 2D visual cues to mitigate the confidence bias. Additionally, a 3D proposal generation module and a mask-aware CLIP feature extraction module are introduced during the instance mask generation and instance semantic reasoning, respectively. In this way, GVC-Seg enhances proposal quality assessment, ensuring unbiased ensemble learning across different models. Extensive experiments demonstrate that our method achieves state-of-the-art performance on several challenging benchmarks, while also exhibiting strong potential in open-vocabulary semantic segmentation settings.

2606.08013 2026-06-09 cs.LG 新提交

Evaluating the Impact of Task Granularity on Catastrophic Forgetting in Continual Learning

评估任务粒度对持续学习中灾难性遗忘的影响

Emre Alyamac, Himanshu Janmeda, Shashwat Krishna, Yash Vijay

发表机构 * College of Engineering(工程学院) College of Natural Science(自然科学学院)

AI总结 研究任务粒度顺序对持续学习中灾难性遗忘的影响,通过CIFAR-100上的粗到细、细到粗和平坦三种训练策略,结合弹性权重巩固(EWC)方法,发现先学习一般类别可减少遗忘。

Comments 8 pages, 4 figures, 5 tables

详情
AI中文摘要

灾难性遗忘,即学习新信息时突然丢失先前获得的知识,仍然是持续学习中的核心挑战。本项目研究模型学习信息的顺序是否影响其保留知识的能力。具体而言,我们提出疑问:先学习一般类别(如“动物” vs “交通工具”)再学习具体类别(如“狗” vs “猫”)是否比一次性学习所有类别更能减少遗忘?我们在CIFAR-100上测试了三种方法:(1)粗到细:先训练2个超类,再扩展到10个具体子类;(2)细到粗:先训练10个子类,再分组为2个超类;(3)平坦:从一开始就训练所有10个类别。我们使用弹性权重巩固(EWC)来防止过渡期间的遗忘。我们的假设是,先学习一般模式可以为模型建立一个稳定的基础,帮助其在学习更详细区分时保留知识。我们使用标准指标(准确率、精确率、召回率、F1)以及持续学习指标(如反向迁移和遗忘率)进行评估。这项工作可为需要增量学习的实际系统设计学习序列提供参考。

英文摘要

Catastrophic forgetting, the abrupt loss of previously acquired knowledge upon learning new information, remains the central challenge in Continual Learning. This project investigates whether the order in which a model learns information affects how well it retains knowledge. Specifically, we ask: does learning general categories first (like "animals" vs "vehicles") before learning specific classes (like "dog" vs "cat") reduce forgetting compared to learning all classes at once? We test three approaches on CIFAR-100: (1) Coarse-to-Fine: train on 2 super-classes, then expand to 10 specific sub-classes, (2) Fine-to-Coarse: train on 10 sub-classes, then group into 2 super-classes, and (3) Flat: train on all 10 classes from the start. We use Elastic Weight Consolidation (EWC) to prevent forgetting during transitions. Our hypothesis is that learning general patterns first creates a stable foundation that helps the model retain knowledge when learning more detailed distinctions. We evaluate using standard metrics (accuracy, precision, recall, F1) plus continual learning metrics like backward transfer and forgetting rates. This work could inform how we design learning sequences for real-world systems that need to learn incrementally.

2606.08002 2026-06-09 cs.CV 新提交

Aqua Boundary-Saliency Attention Module for Lightweight Underwater Salient Instance Segmentation Detection Transformer

Aqua边界显著性注意力模块:用于轻量级水下显著实例分割检测Transformer

M. Fazri Nizar, Julian Supardi, Muhammad Naufal Rachmatullah

发表机构 * Universitas Sriwijaya(斯里维贾亚大学)

AI总结 提出轻量级水下显著实例分割检测Transformer(LUSIS-DETR),通过Aqua边界显著性注意力模块嵌入水下先验线索,在四个数据集上达到领先性能,并在NVIDIA T4 GPU上实现4.31-6.34毫秒延迟。

Comments This work has been submitted to the IEEE for possible publication

详情
AI中文摘要

水下实例分割融合了像素级掩码预测和实例级判别,用于海洋资源勘探、生态监测和水下机器人感知。最近的基于提示和辅助模态的方法提高了掩码质量,但它们对大型基础模型、提示生成或额外模态估计的依赖使高效部署复杂化。本文介绍了轻量级水下显著实例分割检测Transformer(LUSIS-DETR),这是一个紧凑的检测Transformer框架,围绕Aqua边界显著性注意力模块(AquaBSAM)构建。AquaBSAM通过有界残差调制将水下边界、对比度、衰减、色度、暗通道和中心先验线索嵌入到DINOv2初始化的多尺度特征中,而辅助掩码监督和小目标复制粘贴仅在训练中使用。在四个最新的水下实例分割数据集UIIS、UIIS10K、USIS10K和USIS16K上的广泛评估表明,在类别感知和显著实例协议下,该方法相对于先前最先进的工作具有竞争力的领先性能。在NVIDIA T4图形处理单元(GPU)上的TensorRT半精度(FP16)基准测试实现了4.31-6.34毫秒(ms)的延迟,支持在可复现的设置下进行实时推理。

英文摘要

Underwater instance segmentation integrates pixel-level mask prediction and instance-level discrimination for marine resource exploration, ecological monitoring, and underwater robotic perception. Recent prompt-based and auxiliary-modality methods improve mask quality, but their reliance on large foundation models, prompt generation, or extra modality estimation complicates efficient deployment. This work introduces Lightweight Underwater Salient Instance Segmentation Detection Transformer (LUSIS-DETR), a compact detection-transformer framework built around the Aqua Boundary-Saliency Attention Module (AquaBSAM). AquaBSAM embeds underwater boundary, contrast, attenuation, chroma, dark-channel, and center-prior cues into DINOv2-initialized multi-scale features through bounded residual modulation, while auxiliary mask supervision and small-object copy-paste are training-only. Extensive evaluation on four recent underwater instance segmentation datasets, UIIS, UIIS10K, USIS10K, and USIS16K, shows competitively leading performance against previous state-of-the-art works across category-aware and salient-instance protocols. TensorRT half-precision (FP16) benchmarking on an NVIDIA T4 graphics processing unit (GPU) achieves 4.31-6.34 milliseconds (ms) latency, supporting real-time inference under an accessible reproduction setting.

2606.08001 2026-06-09 cs.CV 新提交

Learning a Semantic Calibration Network for Open-Vocabulary Semantic Segmentation

学习语义校准网络用于开放词汇语义分割

Yang Sun, Tao Wang, Anastasia Ioannou, Ge Xu

发表机构 * University of Science and Technology of China(中国科学技术大学) Tsinghua University(清华大学) University of California, Berkeley(加州大学伯克利分校)

AI总结 提出语义校准网络(SCN),通过类消歧和logits融合模块显式建模类间语义相关性,在保持CLIP泛化能力的同时提升分割性能。

Comments Paper accepted by 11th International Conference on Intelligent Computing and Signal Processing (ICSP 2026)

详情
AI中文摘要

语义图像分割为每个像素分配预定义的类别标签,近期取得了显著进展。开放词汇分割(OVS)将分割任务从固定集合扩展到开放集合,使得能够基于任意文本输入(如类别名称或描述)识别和分割新概念。本文提出了一种新颖的语义校准网络(SCN)用于开放词汇语义分割。与先前专注于特征聚合或简单微调预训练模型的方法不同,SCN通过显式建模类间语义相关性来细化掩码分类过程,旨在增强模型的判别能力,同时有效保留预训练CLIP模型的泛化能力。具体而言,SCN包含两个核心组件:类消歧(CD)和logits融合(LF)。首先,利用交叉注意力机制将文本嵌入转换为视觉感知的伪文本嵌入,以推导出增强的相似度分数,补充原始的掩码-文本相似度分数。随后,类消歧模块通过残差架构捕获隐式的类间依赖关系,有效解决语义歧义。最后,logits融合模块动态整合多方面的语义证据,确保模型在保持CLIP固有泛化能力的同时实现稳健的语义共识。在主流基准上的综合实验结果表明,与最先进算法相比,所提方法取得了显著的性能提升。

英文摘要

Semantic image segmentation assigns a predefined category label to each pixel, has achieved significant progress lately. Open-Vocabulary Segmentation (OVS) extends the segmentation task from a fixed set to an open set, enabling the identification and segmentation of novel concepts based on arbitrary text inputs, such as category names or descriptions. In this paper, we propose a novel Semantic Calibration Network (SCN) for open-vocabulary semantic segmentation. Different from prior approaches that focus on feature aggregation or simple fine-tuning of pre-trained models, SCN refines the mask classification process by explicitly modeling the semantic correlations between classes, aiming to enhance the model's discriminative power while effectively preserving the generalization abilities of the pre-trained CLIP model. Specifically, SCN comprises two core components: Class Disambiguation (CD) and Logits Fusion (LF). First, a cross-attention mechanism is utilized to transform the text embeddings into visually aware pseudo-text embeddings, in order to derive an enhanced similarity score that complements the original mask-text similarity score. Subsequently, the Class Disambiguation module captures implicit inter-class dependencies through a residual architecture to effectively resolve semantic ambiguities. Finally, the Logits Fusion module dynamically integrates multifaceted semantic evidence to ensure that the model achieves a robust semantic consensus while maintaining CLIP's inherent generalization capability. Comprehensive experimental results on mainstream benchmarks demonstrate that the proposed method achieves significant performance improvements compared to state-of-the-art algorithms.

2606.08000 2026-06-09 cs.CL cs.AI 新提交

Summarization is Not Dead Yet

摘要生成尚未消亡

Dongqi Liu, Chenxi Whitehouse, Zheng Zhao, Zhuchen Cao, Jian Li, Yabiao Wang

发表机构 * Saarland University(萨尔大学) Max Planck Institute for Informatics(马克斯·普朗克信息学研究所) University of Cambridge(剑桥大学) University of Edinburgh(爱丁堡大学) Zhejiang University(浙江大学) Tencent YouTu Lab(腾讯优图实验室)

AI总结 通过多维度评估,发现人类参考摘要在信息量和忠实度上仍优于大语言模型,后者仅在表面连贯性和流畅性上占优,表明摘要生成研究仍有挑战。

详情
AI中文摘要

大型语言模型(LLMs)的进展引发了关于模型生成的摘要可与人类撰写的参考摘要相媲美甚至超越后者的说法,这引发了摘要生成是否仍是一个开放研究问题的疑问。我们通过多轨道评估重新审视这一说法,涵盖五个不同数据集和五个最先进的LLMs,结合受控人工评估、偏差缓解的LLM作为评判协议、基于外部知识的事实性验证以及语料库级别的语言分析。我们的发现揭示了一个更为细致的图景:人类参考摘要继续在信息量和忠实度方面展现出优势,而LLM输出主要在表面连贯性和流畅性上更受青睐。事实性验证表明,人类参考摘要仍然更可靠,尤其是对于涉及推理或综合的声明,而语言分析揭示了不同模型之间风格同质化的模式。这些观察表明,当前的LLMs提高了摘要生成的质量下限,但其性能上限仍低于人类能力。

英文摘要

The progress of large language models (LLMs) has fueled claims that model-generated summaries rival or even surpass human-written references, raising questions about whether summarization remains an open research problem. We re-examine this narrative through a multi-track evaluation covering five diverse datasets and five state-of-the-art LLMs, combining controlled human assessment, bias-mitigated LLM-as-Judge protocols, factuality verification against external knowledge, and corpus-level linguistic analysis. Our findings reveal a more nuanced landscape in which human reference summaries continue to demonstrate advantages in informativeness and faithfulness, whereas LLM outputs are preferred mainly for surface-level coherence and fluency. Factuality verification indicates that human references remain more reliable, particularly for claims involving reasoning or synthesis, and linguistic analysis uncovers a pattern of stylistic homogeneity across different models. These observations suggest that current LLMs have raised the floor of summarization quality, but the ceiling of their performance remains below human capabilities.

2606.07999 2026-06-09 cs.AI 新提交

Efficient Skill Grounding via Code Refactoring with Small Language Models

通过小型语言模型的代码重构实现高效技能落地

Sera Choi, Wonje Choi, Saehun Chun, Daehee Lee, Jooyoung Kim, Chaeun Lee, Honguk Woo

发表机构 * KAIST(韩国科学技术院)

AI总结 提出RECENT框架,通过将技能语义与执行绑定解耦,利用小型语言模型进行代码重构实现高效技能落地,在动态环境中达到与大型语言模型相当的性能。

Comments Accepted to ICML 2026

详情
AI中文摘要

有效的技能落地对于在具身智能体中部署可复用技能至关重要,因为即使是微小的具身或环境差异也可能导致整个技能不兼容。这一挑战在具身设置中尤为突出,智能体必须在动态、部分可观测的环境中运行,且无法访问大型语言模型(LLM)。在此设置下,依赖LLM不切实际,而小型语言模型(sLM)对于实现可靠长程控制所需的有效技能落地仍显不足。我们提出RECENT,一种以重构为中心的智能体框架,通过将技能语义与具身和环境特定的执行绑定解耦,实现使用sLM的高效技能落地。通过将技能表示为可执行代码,RECENT保留了技能控制结构中编码的语义意图,同时通过局部重构仅修改执行绑定来落地技能,而非从头重新生成代码。我们在动态环境中跨多种机器人具身的多样化技能落地场景中评估RECENT,展示了在使用sLM部署时的稳健长程性能。在所有场景中,RECENT在基于sLM的代码即策略(CaP)方法中实现了最佳性能,并匹配了基于LLM的CaP的任务性能。

英文摘要

Effective skill grounding is essential for deploying reusable skills in embodied agents, as even minor embodiment or environmental differences can render an entire skill incompatible. This challenge is particularly pronounced in embodied settings, where agents must operate in dynamic, partially observable environments without access to large language models (LLMs). In this setting, reliance on LLMs is impractical, while small language models (sLMs) remain insufficient for the effective skill grounding required for reliable long-horizon control. We present RECENT, a refactoring-centric agent framework that enables efficient skill grounding with sLMs by decoupling skill semantics from embodiment- and environment-specific execution binding. By representing skills as executable code, RECENT preserves the semantic intent encoded in a skill's control structure while grounding it by modifying only execution bindings through localized refactoring, rather than regenerating code from scratch. We evaluate RECENT across diverse skill grounding scenarios spanning multiple robot embodiments in dynamic environments, demonstrating robust long-horizon performance when deployed with an sLM. Across all scenarios, RECENT achieves the best performance among sLM-based Code-as-Policies (CaP) methods and matches the task performance of LLM-based CaP.

2606.07996 2026-06-09 cs.CL cs.AI 新提交

MC-PDD: Masked Corpus-Level Pretraining Data Detection for Black-Box Large Language Models

MC-PDD: 面向黑盒大语言模型的掩码语料级预训练数据检测

Kaixin Lan, Mu You, Tao Fang, Binkai Ou, Lidia S. Chao, Derek F. Wong

发表机构 * University of Macau(澳门大学) Macau Millennium College(澳门万人大学) BoardWare Information System Limited(博纬信息系统有限公司)

AI总结 提出MC-PDD方法,通过掩码特定token并利用LLM预测缺失内容,比较候选语料与参考非成员语料的预测命中率差异,以黑盒方式检测预训练数据,性能与现有方法相当。

Comments The manuscript consists of 10 pages formatted in the IEEE/ACM two-column style

详情
AI中文摘要

预训练是大语言模型(LLM)发展的基础,然而预训练数据的不透明性使模型分析复杂化,并引发伦理、法律和公平性问题。因此,检测特定数据集是否在预训练中使用至关重要。现有最先进方法通常依赖于访问模型概率分布,因此不适用于仅提供输入输出接口的闭源LLM。为解决这一限制,我们引入了掩码语料级预训练数据检测(MC-PDD),这是一种受掩码语言建模范式启发的新方法。MC-PDD在每段文本中掩码高度特定的token,并提示LLM预测缺失内容。然后,它评估候选语料与参考非成员语料之间的预测命中率差异是否具有统计显著性。基于此比较,MC-PDD确定候选文本是否可能包含在模型的预训练数据中。实验结果表明,在三个数据集上,对于开源和闭源LLM,预训练数据和未见数据之间的预测命中率存在明显且一致的差异。尽管在更严格的黑盒设置下运行,MC-PDD仍实现了与现有检测方法相当的性能。我们的方法仅需使用标准API访问即可实现模型审计和数据版权验证等实际应用。接受后,我们将公开发布代码和数据集。

英文摘要

Pretraining is fundamental to the development of Large Language Models (LLMs), yet the opacity of pretraining data complicates model analysis and raises ethical, legal, and fairness concerns. Detecting whether specific datasets were used during pretraining is, therefore, critical. Existing state-of-the-art methods typically rely on access to model probability distributions, making them unsuitable for closed-source LLMs that provide only input-output interfaces. To address this limitation, we introduce Masked Corpus-level Pretraining Data Detection (MC-PDD), a novel method inspired by the masked language modeling paradigm. MC-PDD masks highly specific tokens in each text and prompts the LLM to predict the missing content. It then assesses whether the difference in prediction hit rates between a candidate corpus and a reference non-member corpus is statistically significant. Based on this comparison, MC-PDD determines whether the candidate texts were likely included in the model's pretraining data. Experimental results demonstrate clear and consistent differences in prediction hit rates between pretrained and unseen data across three datasets, for both open-source and closed-source LLMs. Despite operating under a stricter black-box setting, MC-PDD achieves performance comparable to existing detection methods. Our approach enables practical applications such as model auditing and data copyright verification using only standard API access. Upon acceptance, we will publicly release the code and datasets.

2606.07995 2026-06-09 cs.CL 新提交

Customer-Agent: Overcoming Context Limitations in Ultra-Long Shopping Trajectories via Tool-Augmented Agents and RLVR

客户代理:通过工具增强代理和RLVR克服超长购物轨迹中的上下文限制

Hongye Liu, Rongmei Lin, Anurag Kashyap, Hejie Cui, Ricardo Henao, Besnik Fetahu, Bing Yin

发表机构 * Amazon(亚马逊) Duke University(杜克大学)

AI总结 提出ShopTrajQA基准和客户代理框架,利用RLVR训练代理通过代码解释器自主检索解析外部轨迹文件,突破LLM上下文窗口限制,在超长购物轨迹推理中取得强性能。

详情
AI中文摘要

理解客户购物轨迹对于实现个性化购物体验至关重要。然而,购物记录(如客户的搜索、点击、购买等)通常跨越多年时间,形成极长的轨迹,给现有大型语言模型(LLM)带来重大挑战。尽管该问题重要,现有基准仅限于短客户轨迹,而大型电商平台的真实轨迹由于数据隐私限制难以获取。为解决这一差距,我们引入ShopTrajQA,一个基于真实产品信息和模拟购物轨迹构建的长上下文评估基准。数据集包含高达32k和64k token的变体,能够系统评估模型在不同上下文长度下的鲁棒性。通过对前沿LLM的全面基准测试,我们识别出在长购物轨迹数据推理中的关键性能差距。为应对这些挑战,我们提出一种用于超长上下文管理的客户代理框架。利用可验证奖励强化学习(RLVR)代理训练范式,我们的方法将轨迹存储为外部本地文件,并训练代理通过代码解释器交互(如SQL查询)自主检索和解析它们,有效绕过LLM的固定上下文窗口限制。实验结果表明,我们的框架在ShopTrajQA上取得强性能,并展现出对其他复杂推理任务的泛化能力。

英文摘要

Understanding customer shopping trajectories is essential for enabling personalized shopping experiences. However, shopping records (i.e., customer's search, clicks, purchases, etc.) often span long time horizons over multiple years, resulting in extremely long trajectories that pose significant challenges for existing large language models (LLMs). Despite the importance of this problem, existing benchmarks are limited to short customer trajectories, while real-world trajectories from large e-commerce platforms are rarely accessible due to data privacy constraints. To address this gap, we introduce ShopTrajQA, a long-context evaluation benchmark constructed from real-world product information and simulated shopping trajectories. The dataset includes variants of up to 32k and 64k tokens, enabling systematic evaluation of model robustness under varying context lengths. Through comprehensive benchmarking of frontier LLMs, we identify critical performance gaps in reasoning over long shopping trajectory data. To address these challenges, we propose a Customer Agent Framework for ultra-long context management. Leveraging a Reinforcement Learning with Verifiable Rewards (RLVR) agentic training paradigm, our approach stores trajectories as external local files and trains the agent to autonomously retrieve and parse them through code-interpreter interactions (e.g., SQL queries), effectively bypassing the fixed in-context window constraints of LLMs. Experimental results demonstrate that our framework achieves strong performance for ShopTrajQA and shows generalization to other complex reasoning tasks.

2606.07992 2026-06-09 cs.AI cs.CR cs.SE 新提交

VATS: Exploiting Implicit Authority in Error-Path Injection via Systematic Mutation

VATS: 通过系统性变异利用错误路径注入中的隐式权威

Harshil Patel, Kunal Pai

发表机构 * Harshil Patel Kunal Pai

AI总结 提出VATS框架,通过七维变异生成对抗性负载,利用错误消息的隐式权威绕过安全机制,在四个前沿模型上实现高达100%的注入成功率。

Comments Published at Second Workshop on Agents in the Wild: Safety, Security, and Beyond (ICML 2026 AIWILD)

详情
AI中文摘要

随着模型上下文协议(MCP)标准化自主代理的工具调用,它引入了一个关键且未经审查的攻击面:错误处理循环。我们假设工具错误消息具有隐式权威,会触发纠正性推理模式,从而绕过标准安全启发式。我们提出VATS(工具流漏洞分析),一个突变驱动的框架,系统地跨七个结构和语言维度演化对抗性负载。我们在四个前沿模型(Gemini 3.1 Pro、GPT-5.5、GLM-5.1和Qwen3-Coder)上的评估表明,错误路径注入将标准间接提示注入(IPI)的成功率提高了三倍,在受控评估中实现了高达100%的合规性。我们隔离了结构定位(在错误上下文中夹带指令)作为所有测试模型中最有效的利用向量。虽然我们发现生产框架护栏可以缓解这些漏洞,但模型层固有的易感性对定制代理工作流构成了系统性风险。

英文摘要

As the Model Context Protocol (MCP) standardizes tool-calling for autonomous agents, it introduces a critical, unexamined attack surface: the error-handling loop. We hypothesize that tool error messages possess implicit authority, triggering corrective reasoning modes that bypass standard safety heuristics. We introduce VATS (Vulnerability Analysis of Tool Streams), a mutation-driven framework that systematically evolves adversarial payloads across seven structural and linguistic dimensions. Our evaluation across four frontier models, Gemini 3.1 Pro, GPT-5.5, GLM-5.1, and Qwen3-Coder, demonstrates that error-path injection triples the success rate of standard indirect prompt injection (IPI), achieving up to 100% compliance in controlled evaluations. We isolate structural positioning (sandwiching instructions within error context) as the most effective exploit vector across all tested models. While we find that production framework guardrails can mitigate these vulnerabilities, the inherent susceptibility of the model layer poses a systemic risk to bespoke agentic workflows.

2606.07988 2026-06-09 cs.AI 新提交

PAFO: Pareto Fairness Optimization for Personalized Reward Modeling

PAFO: 个性化奖励建模的帕累托公平优化

Xiaoyan Zhao, Haoting Ni, Yang Zhang, Chunyuan Zheng, Haoxuan Li, Fuli Feng

发表机构 * National University of Singapore(新加坡国立大学) University of Science and Technology of China(中国科学技术大学) Peking University(北京大学)

AI总结 针对个性化奖励模型因训练数据偏好不平衡导致对少数用户群体存在偏见的问题,提出PAFO框架,通过帕累托公平优化提升弱势群体性能而不损害其他群体,实验表明能同时提高少数和多数群体准确率并降低不公平性。

详情
AI中文摘要

大型语言模型(LLMs)越来越依赖奖励模型来使其输出与多样化的用户偏好对齐。虽然个性化奖励模型旨在捕捉这种异质性,但它们通常在用户偏好数据不平衡的情况下训练,因此可能偏向于在训练群体中偏好更常见的用户。在本文中,我们将这种失败模式识别为个性化奖励偏差,即奖励建模质量随偏好支持率系统性地变化。我们将其缓解表述为一个关于群体效用的帕累托公平问题,旨在改善服务不足的用户而不降低其他用户群体的性能。为此,我们提出了PAFO,一种用于个性化奖励建模的帕累托公平优化框架。PAFO首先为多数和少数偏好群体训练群体专用的奖励模型,然后构建条件边际级监督,将其异质性偏好边界蒸馏到一个统一的模型中。所得模型仅在训练时使用群体信息,推理时无需显式群体标签。在Personal-LLM和DSP上的实验表明,PAFO在多个指标上提高了少数群体和多数群体的准确率,同时减少了用户级不公平性,证明了其在更公平的LLM个性化中的有效性。

英文摘要

Large language models (LLMs) increasingly rely on reward models to align their outputs with diverse user preferences. While personalized reward models aim to capture such heterogeneity, they are often trained on imbalanced user preference data and may therefore favor users whose preferences are more common in the training population. In this paper, we identify this failure mode as personalized reward bias, where reward modeling quality varies systematically with preference support rate. We formulate its mitigation as a Pareto fairness problem over group utilities, aiming to improve under-served users without degrading other user groups. To this end, we propose PAFO, a Pareto fairness optimization framework for personalized reward modeling. PAFO first trains group-specialized reward models for majority and minority preference groups, then constructs conditional margin-level supervision to distill their heterogeneous preference boundaries into a single unified model. The resulting model uses group information only during training and requires no explicit group labels at inference time. Experiments on Personal-LLM and DSP show that PAFO improves both minority-group and majority-group accuracy while reducing user-level unfairness across multiple metrics, demonstrating its effectiveness for fairer LLM personalization.

2606.07985 2026-06-09 cs.CV cs.CL 新提交

FMRFusion: Frequency-Aware Multi-View Representation Learning for Heterogeneous Image Fusion

FMRFusion: 面向异质图像融合的频率感知多视图表示学习

Tao Zhoua, Yunlong Liu, Qinghui Chen, Zekai Zhang, Minlong Sun, Changlin Biana, Dagang Li, Wenmin Wang, Jinglin Zhang

发表机构 * Shandong University(山东大学) Macau University of Science and Technology(澳门科技大学)

AI总结 提出FMRFusion网络,通过多尺度结构感知模块、双线性频率分解和跨视图互补交互,结合流匹配优化,实现红外与可见光图像融合,在夜间场景表现优异。

详情
AI中文摘要

红外与可见光图像融合旨在生成保留重要目标信息和详细纹理的复合图像,整合两种异质模态。以往的图像融合方法通常采用单模块堆叠方式从两种模态中提取特征,然而这些方法可能导致对其独特特征的学习不完整,从而限制融合效果并在真实异质数据场景中降低鲁棒性。为解决这些问题,我们提出FMRFusion,一种用于异质图像融合的频率感知多视图表示学习网络。引入多尺度结构感知模块以有效捕捉判别性结构,提取细粒度局部结构和关键上下文信息。采用双线性频率分解机制将特征分离为高频和低频分量,实现不同频率域中局部细节和全局表示的联合建模。此外,融入跨视图互补交互以显式建模和融合反射光信息与辐射强度响应之间的互补特性,促进有效的跨视图交互。我们通过流匹配进一步改善融合结果的质量,通过学习从粗数据到高质量表示的变换逐步细化融合特征。在多个基准数据集上进行的大量实验表明,FMRFusion在一系列融合任务中实现了优越且一致的性能,尤其在夜间场景中表现突出。

英文摘要

Infrared and visible image fusion aims to generate a composite image that retains significant target information and preserves detailed textures, integrating two heterogeneous modalities. Previous image fusion methods typically adopt a single-module stacking approach to extract features from the two modalities. However, these approaches may result in incomplete learning of their distinct characteristics, thereby limiting the fusion effectiveness and constrain ing robustness in real-world heterogeneous data scenarios. To address these challenges, we propose FMRFusion, a frequency-aware multi-view representation learning network for Heterogeneous Image Fusion. A Multi-Scale Struc tural Perception Module is introduced to effectively capture discriminative structures, extracting fine-grained local structures and essential contextual information. A bilinear frequency decomposition mechanism is employed to sepa rate features into high-frequency and low-frequency components, enabling joint modeling of local details and global representations across different frequency domains. Moreover, a Cross-View Complementary Interaction is incorpo rated to explicitly model and fuse the complementary characteristics between reflected light information and radiative intensity responses, facilitating effective cross-view interaction. We further improve the Performance of the fused results by flow matching, which progressively refines the fused features by learning the transformation from coarse data to high-quality representations. Extensive experiments conducted on multiple benchmark datasets demonstrate that FMRFusion achieves superior and consistent performance across a range of fusion tasks, especially in nighttime scenarios

2606.07982 2026-06-09 cs.LG 新提交

Overcoming the Limits of Finite Difference Method; Physics-Informed Neural Network for Noisy High-Dimensional Heat Diffusion

克服有限差分法的局限性:用于含噪高维热扩散的物理信息神经网络

Shreesh Bhattarai, Harish Chandra Bhandari

发表机构 * Kathmandu University(加德满都大学)

AI总结 针对高维含噪热扩散问题,提出物理信息神经网络(PINN)框架,在噪声和维度较高时显著优于有限差分法(FDM),实现精度与效率的权衡。

详情
AI中文摘要

高维瞬态热扩散在噪声边界条件下暴露了经典数值方法的根本局限性:在物理噪声不可避免的情况下,精度会灾难性地下降。本文提出了一个物理信息神经网络(PINN)框架,作为在一维、二维和三维空间中对这一问题的系统性解决方案,建立了明确的操作机制,重新定义了含噪热系统中求解器的选择。在三维空间中,当边界噪声为20%时,PINN保持约91%的精度,而有限差分法(FDM)降至36%,这是一个明显的决定性优势。这一点在物理铜热系统中得到进一步证实,在真实噪声条件下,PINN将边界重建误差降低了3.3倍。这种噪声鲁棒性伴随着维度驱动的效率交叉:在三维空间中,PINN所需的时空节点少于FDM,同时实现更高的精度,揭示了经典离散化在大规模下的真实成本。这些发现重新定义了求解器的选择:决定性的轴不仅是精度,而是噪声暴露和维度的共同作用。当噪声和维度都较高时,经典求解器范式不足;本工作为证明PINN在此类机制中作为操作标准提供了基础。

英文摘要

High-dimensional transient heat diffusion under noisy boundary conditions exposes a fundamental limitation of classical numerical methods: accuracy degrades catastrophically where physical noise is unavoidable. This paper presents a Physics-Informed Neural Network (PINN) framework as a systematic solution to this problem across one, two, and three spatial dimensions, establishing clear operational regimes that redefine solver selection in noisy thermal systems. Under 20% boundary noise in 3D, PINN sustains approximately 91% accuracy while Finite Difference Method (FDM) collapses to 36%, a clear decisive advantage. This is further confirmed in a physical copper thermal system, where PINN reduces boundary reconstruction error by 3.3 times under realistic noise conditions. This noise resilience is accompanied by a dimensionality-driven efficiency crossover: PINN requires fewer spacetime nodes than FDM in 3D while achieving superior accuracy, exposing the true cost of classical discretization at scale. These findings reframe solver selection: the decisive axis is not accuracy alone, but noise exposure and dimensionality jointly. When noise and dimensionality are both high, the classical solver paradigm is insufficient; this work provides the foundation to justify PINN as the operational standard in such regimes.

2606.07978 2026-06-09 cs.CL 新提交

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

MechLens:事实知识的晚期结晶解释语言模型中的干预有效性

Xueping Gao

发表机构 * Alibaba Cloud(阿里云)

AI总结 本文发现LLM中的事实知识在最后层突然“结晶”,而非逐层涌现,并基于此提出结晶引导的干预原则,优于现有方法。

详情
AI中文摘要

理解LLM存储事实知识的位置对于减少幻觉至关重要。我们系统量化了“晚期结晶”:事实知识并非逐层涌现,而是在最后层突然“结晶”。在五个模型家族(Pythia、Gemma、Qwen2.5、Llama-3.1、Mistral;0.5–14B)中,26.8%–93.4%的正确答案从未在任何中间层进入前10预测,且晚期涌现(>80%深度)在不同架构中一致。跨尺度(Qwen2.5-14B)和跨基准(MMLU:98.2%)结果证实了普遍性;调谐透镜排除了探针伪影。情感分类对照(Qwen为0.5% vs. 事实85.9%;Mistral为2.0% vs. 26.8%)确认该现象是事实回忆特有的。\n晚期结晶引出了结晶引导的干预原则:CAA在中等结晶模型(Llama、Mistral)上优于DoLa(p<0.001),在高结晶模型Qwen上方向一致反转(+25.4% vs. +15.5% MC1,p=0.069)。LayerNorm消融表明结晶是残差流固有的;LN缩放(x1.2)在零推理开销下带来+11.8% MC1提升。我们进一步揭示了可计算性-记忆谱:可计算知识比记忆事实更早结晶(层22.1/28 vs. 28.0/28)。我们发布了支持五个模型家族的MechLens。

英文摘要

Understanding where LLMs store factual knowledge is critical for hallucination mitigation. We systematically quantify Late Crystallization: factual knowledge does not gradually emerge across layers but "crystallizes" abruptly at the final layers. Across five model families (Pythia, Gemma, Qwen2.5, Llama-3.1, Mistral; 0.5--14B), 26.8%--93.4% of correct answers never enter top-10 predictions at any intermediate layer, with late emergence (>80% depth) consistent across architectures. Cross-scale (Qwen2.5-14B) and cross-benchmark (MMLU: 98.2%) results confirm generality; tuned lens rules out probe artifacts. A sentiment-classification control (0.5% for Qwen vs. 85.9% factual; 2.0% for Mistral vs. 26.8%) confirms the phenomenon is specific to factual recall. Late Crystallization yields a crystallization-guided intervention principle: CAA outperforms DoLa on moderate-crystallization models (Llama, Mistral; p<0.001), with a directionally consistent reversal on high-crystallization Qwen (+25.4% vs. +15.5% MC1, p=0.069). LayerNorm ablation shows crystallization is intrinsic to the residual stream; LN scaling (x1.2) yields +11.8% MC1 with zero inference overhead. We further reveal a Computability-Memorization Spectrum: computable knowledge crystallizes earlier (layer 22.1/28) than memorized facts (28.0/28). We release MechLens supporting five model families.

2606.07974 2026-06-09 cs.RO cs.AI 新提交

PRISM: PRior-guided Imagination Sampling in world Models

PRISM:世界模型中基于先验引导的想象采样

Yuhai Wang, Jiawei Xia, Rongxuan Zhou, Xiao Hu, Yongliang Shi, Jing Du, Yang Ye

发表机构 * Northeastern University(东北大学) University of California, Berkeley(加州大学伯克利分校) Qiyuan Lab(启元实验室) University of Florida(佛罗里达大学)

AI总结 提出PRISM框架,通过从世界模型编码器提取状态条件高斯先验,并利用精度加权高斯乘积更新规划器的采样分布,在不增加架构复杂度的情况下显著提升基于模型的连续控制性能。

详情
AI中文摘要

学习到的世界模型为评估未来状态提供了强大的物理直觉。但其在连续控制中的有效性也关键取决于如何为基于模型的规划生成候选动作。我们不仅询问模型能多准确地模拟未来,还提出:哪些候选动作首先值得评估?现有规划器通常任意搜索或仅使用专家演示初始化采样均值,丢弃了专家的状态条件置信度。正确引导这一搜索需要鲁棒的动作先验,但当前方法常依赖独立的视觉编码器或大规模VLM来获取。我们认为这种架构膨胀是不必要的:完全相同的数据——以及世界模型本身学到的表示——内在地编码了智能体的动作直觉。我们提出PRISM,一个任务无关的框架,从单一数据集中提取两者,同时保持严格的架构简洁性。基于标准的JEPA风格潜在世界模型,PRISM直接在其冻结编码器上附加一个轻量级MLP,以预测状态条件高斯先验。在规划时,PRISM通过精度加权的高斯乘积更新将该先验融合到规划器的采样分布中。这种无参数、闭式整合引导采样过程,使先验在其自信处主导,在其不自信处放弃控制。PRISM在Cube上将基于世界模型的MPC成功率提升35个百分点,在PushT上提升32个百分点,且未引入显著推理开销。

英文摘要

A learned world model provides a powerful physical intuition for evaluating future states. But its effectiveness in continuous control also depends critically on how candidate actions are generated for model-based planning. Rather than solely asking how accurately a model can simulate the future, we ask: which candidate actions are worth evaluating in the first place? Existing planners typically search arbitrarily or use expert demonstrations only to initialize a sampling mean, discarding the expert's state-conditioned confidence. Properly guiding this search requires a robust action prior, yet current approaches often rely on independent visual encoders or large-scale VLMs to obtain one. We argue that this architectural bloat is unnecessary: the exact same data - and the learned representations of the world model itself - inherently encode the agent's action intuition. We introduce PRISM, a task-agnostic framework that extracts both from a single dataset while maintaining strict architectural simplicity. Building on a standard JEPA-style latent world model, PRISM attaches a lightweight MLP directly to its frozen encoder to predict a state-conditioned Gaussian prior. At plan time, PRISM fuses this prior into the planner's sampling distribution via a precision-weighted Product-of-Gaussians update. This parameter-free, closed-form integration steers the sampling process, making the prior confident where it is and ceding control where it is not. PRISM improves success rates by 35 percentage points over vanilla world-model-based MPC on Cube and 32 percentage points on PushT, without introducing significant inference overhead.

2606.07970 2026-06-09 cs.CL cs.AI 新提交

Defending Against Malicious Finetuning by Scaling Train-time Adversarial Attacks

通过扩展训练时对抗攻击防御恶意微调

Haoming Wen, Shi Chen, Qingyu Shi, Siyuan Liu, Minrui Luo, Jingzhao Zhang, Tianxing He

发表机构 * Xiongan AI Institute(雄安人工智能研究院) Institute for Interdisciplinary Information Sciences, Tsinghua University(清华大学交叉信息研究院) Shanghai Qi Zhi Institute(上海期智研究院)

AI总结 针对全参数微调的安全威胁,提出基于对抗训练和双层优化的Patcher方法,通过扩展对抗循环中的优化步数增强防御,并设计并行算法提升效率。

详情
AI中文摘要

当前的开源大型语言模型(LLMs)容易受到恶意微调攻击,这些攻击只需在中毒数据集上进行几步监督微调(SFT)即可破坏LLMs的安全对齐。现有的对齐阶段防御主要设计用于防御使用参数高效微调方法的攻击。然而,它们无法防御使用全参数微调的更强攻击。在本文中,我们提出了Patcher,一种受对抗训练和双层优化启发的方法,以对抗此类攻击。Patcher通过扩展对抗循环中的优化步数来增强模拟攻击,从而迫使防御者找到对更强攻击不敏感的模型参数。此外,我们提出了一种高效的并行算法来实现Patcher,减少了训练的挂钟时间,同时保持了Patcher的性能。大量实验表明,与普通SFT对齐相比,Patcher显著提高了模型的鲁棒性,并且可以迁移到不同的攻击场景和模型大小。代码可在https://github.com/haomingwen/patcher获取。

英文摘要

Current open-weight large language models (LLMs) are prone to malicious finetuning attacks, which could compromise the safety alignment of LLMs with only a few steps of supervised finetuning (SFT) on poisoned datasets. Existing alignment-stage defenses are primarily designed to defend against attacks that use parameter-efficient finetuning methods. However, they fail to defend against stronger attacks that use full-parameter finetuning. In this paper, we propose Patcher, a method inspired by adversarial training and bi-level optimization, to combat such attacks. Patcher strengthens the simulated attack by scaling up the optimization steps in the adversarial loop, thus forcing the defender to find model parameters that are insensitive to stronger attacks. Furthermore, we propose an efficient parallel algorithm to implement Patcher, decreasing the wall-clock time of training while preserving Patcher's performance. Extensive experiments show that Patcher substantially improves the model's robustness compared to vanilla SFT alignment, and transfers to diverse attack scenarios and model sizes. Code is available at https://github.com/haomingwen/patcher.

2606.07969 2026-06-09 cs.CL cs.AI 新提交

Neutrality Bites: Gender Representation in AI-Generated Animal Stories

中立性的代价:AI生成的动物故事中的性别表征

Imani Finkley, Yuanxi Li, Melanie Walsh

发表机构 * University of Washington(华盛顿大学)

AI总结 研究六种主流LLM在生成动物故事时的性别分配,发现模型常避免指定性别或使用中性语言,但一旦指定则显著偏向男性,女性角色几乎缺席,表明中立策略可能导致边缘视角的抹除。

Comments FAccT(ACM Conference on Fairness, Accountability, and Transparency) 2026

详情
AI中文摘要

AI生成故事中的性别偏见是一个有充分记录的问题。尽管人们已投入大量关注来减少或缓解这种偏见,但干预措施是否产生真正公平的结果并不总是明确的。为了调查这一问题,我们研究了大型语言模型(LLMs)如何处理一个流行、高度模糊且已知会紧密复现人类刻板印象的叙事语境中的性别分配:关于会说话的动物的故事。我们提示六个领先的LLM完成一个关于七个性别未说明的拟人化动物角色的英语故事。此外,我们迭代了四种不同的叙事设置和一系列模型温度。在23.8K个故事中,我们发现模型经常避免在故事中指定动物角色的性别(平均19%)或使用性别中立的语言如“它”或“它的”(平均38.2%)。然而,当性别被指定时,存在显著的男性偏见。女性动物角色几乎不存在,仅出现在2.2%的故事中,而男性角色出现在40.6%的故事中。我们的发现指向一个更广泛的论点:中立性是有代价的。换句话说,优先考虑中立性以解决社会偏见的模型实际上可能助长边缘化视角和身份的抹除。我们建议需要追求超越中立性的替代策略,例如那些更平等地在想象主体之间分配社会可能性的策略。

英文摘要

Gender bias in AI-generated stories is a well-documented problem. While much attention has been paid to reducing or mitigating this bias, it is not always clear whether interventions produce genuinely fairer results. To investigate this issue, we examine how large language models (LLMs) handle gender assignment in a narrative context that is popular, highly ambiguous, and also known to closely reproduce human stereotypes: stories about talking animals. We prompt six leading LLMs to complete an English-language story about seven different anthropomorphic animal characters whose gender is unstated. We additionally iterate with four different narrative settings and a range of model temperatures. Across the 23.8K stories, we find that models frequently avoid gendering the animal character in the story (19% on average) or use gender-neutral language like "it" or "its" (38.2% on average). However, when gender is assigned, there is a significant masculine bias. Feminine animal characters are virtually absent, present in just 2.2% of stories vs. 40.6% that feature masculine characters. Our findings point to a broader argument: neutrality bites. In other words, models that prioritize neutrality to address social bias may actually contribute to the erasure of marginalized perspectives and identities. We suggest that alternative strategies beyond neutrality need to be pursued, such as ones that more equally distribute social possibilities across imagined subjects.

2606.07967 2026-06-09 cs.CV 新提交

DisCo: World Models with Discrete Camera Motion Control

DisCo: 具有离散相机运动控制的世界模型

Hongrui Huang, Junke Wang, Quanhao Li, Yu-Gang Jiang, Zuxuan Wu

发表机构 * Fudan University(复旦大学)

AI总结 提出DisCo,通过离散动作原语替代连续相机轨迹作为条件,解决可控视频生成中动作表示纠缠问题,提升动作跟随可靠性,并引入DisCoBench基准。

详情
AI中文摘要

可控视频世界模型旨在实现交互式世界探索,模型必须在保持视觉质量和时间一致性的同时忠实地执行明确的动作命令。然而,现有大多数方法依赖连续相机轨迹作为动作条件,这通常导致不可靠的动作跟随,尤其是在复杂运动序列下。在这项工作中,我们识别出动作表示纠缠是可控视频生成的关键瓶颈,并表明连续相机表示导致不同运动模式之间的高特征相似性,降低了动作可控性。基于这一见解,我们提出了DisCo,一种可控视频世界模型,它将生成条件约束在一组紧凑的离散动作原语上,以提高动作可分离性。我们进一步引入了DisCoBench,一个用于评估模型在短期、长期和高度动态探索场景中能力的综合基准。大量实验表明,DisCo在保持视觉质量的同时实现了显著更可靠的动作跟随。

英文摘要

Controllable video world models target interactive world exploration, where models must faithfully execute explicit action commands while preserving visual quality and temporal coherence. However, most existing approaches rely on continuous camera trajectories as action conditions, which often lead to unreliable action following, especially under complex motion sequences. In this work, we identify action representation entanglement as a key bottleneck in controllable video generation, and show that continuous camera representations lead to high feature similarity across distinct motion patterns, degrading action controllability. Based on this insight, we propose DisCo, a controllable video world model that conditions generation on a compact set of discrete action primitives to improve action separability. We further introduce DisCoBench, a comprehensive benchmark for evaluating the ability of models in short-term, long-horizon, and highly dynamic exploration scenarios. Extensive experiments demonstrate that DisCo achieves significantly more reliable action following while preserving visual quality.