arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21516
2606.04970 2026-06-04 cs.CV cs.AI

Plan, Watch, Recover: A Benchmark and Architectures for Proactive Procedural Assistance

计划、观察、恢复:主动式程序辅助的基准与架构

Kaustav Kundu, Ritvik Shrivastava, Maxim Arap, Nanshu Wang, Xianhui Zhu, Quintin Fettes, Gautam Tiwari, Parth Suresh, Théo Moutakanni, Alejandro Castillejo Munoz, Allen Bolourchi, Pascale Fung, Pinar Donmez, Babak Damavandi, Anuj Kumar, Seungwhan Moon

发表机构 * Meta Reality Labs(Meta现实实验室) Meta Superintelligence Labs(Meta超智能实验室)

AI总结 提出EgoProactive数据集和Pro²Bench基准,并设计解耦规划器-交互架构,用于主动式程序辅助中的实时引导和异常恢复。

Comments 53 pages, 14 figures

详情
AI中文摘要

我们设想一个主动的多模态辅助系统,该系统在程序性任务中为用户提供实时的逐步指导,自主决定何时中断以及如何指导。然而,由于缺乏反映现实条件的大规模跨领域基准,特别是用户偏离预期步骤序列的常见情况,进展受到限制。我们通过四项贡献来解决这一差距: extbf{(1)}~我们发布了 extbf{EgoProactive},一个大规模的可穿戴自我中心数据集,用于主动程序辅助,带有明确的计划外(OOP)标注和恢复步骤; extbf{(2)}~我们将五个已建立的基准(Ego4D、EPIC-KITCHENS、EgoExo4D、HoloAssist、HowTo100M)扩充为统一的主动指导模式下的 extbf{Pro extsuperscript{2}Bench}; extbf{(3)}~我们提出了一种专门针对程序状态、视觉线索和恢复注入的 extbf{解耦规划器-交互架构}; extbf{(4)}~我们引入了一种跨模型家族迁移的训练后方案,通过在Llama~4和Qwen-3.6-VL上的跨骨干复制进行验证。在大量实验中,我们训练的Llama-4系统在所有六个数据集上,相对于强大的专有基线(Claude Opus~4.6、Gemini~3.1~Pro、GPT~5.2)和开放权重基线(Qwen3~VL~235B),显著提高了客观干预质量。Oracle计划实验进一步表明,当计划质量得到控制时,训练的双工模型产生高质量的指导,并在计划外恢复方面取得巨大收益。

英文摘要

We envision a proactive multi-modal assistant system which gives users real-time step-by-step guidance on a procedural task, autonomously deciding \textit{when} to interrupt, and \textit{how} to coach. However, progress is limited by the absence of large-scale, cross-domain benchmarks that reflect realistic conditions, particularly the common case in which users deviate from the expected step sequence. We address this gap with four contributions: \textbf{(1)}~we release \textbf{EgoProactive}, a large-scale wearable-egocentric dataset for proactive procedural assistance with explicit Out-of-Plan (OOP) annotations and recovery steps; \textbf{(2)}~we augment five established benchmarks (Ego4D, EPIC-KITCHENS, EgoExo4D, HoloAssist, HowTo100M) into \textbf{Pro\textsuperscript{2}Bench} under a unified proactive-guidance schema; \textbf{(3)}~we propose a \textbf{decoupled planner--interaction architecture} specialized for procedural state, visual cues, and recovery injection; \textbf{(4)}~we introduce a post-training recipe that transfers across model families, validated by cross-backbone replication on Llama~4 and Qwen-3.6-VL. In extensive experiments, our trained Llama-4 system substantially improves objective intervention quality over strong proprietary baselines (Claude Opus~4.6, Gemini~3.1~Pro, GPT~5.2) and open-weight baselines (Qwen3~VL~235B) baselines across all six datasets. Oracle-plan experiments further show that, when plan quality is controlled, the trained duplex model produces high-quality guidance and large gains on Out-of-Plan recovery.

2606.04968 2026-06-04 cs.RO

Potential-Guided Flow Matching for Vision-Language-Action Policy Improvement

势引导的流匹配用于视觉-语言-动作策略改进

Yunpeng Mei, Jiakai He, Hongjie Cao, Chenyu Wang, Xiaowen Zhu, Yihan Zhou, Jiamin Wang, Chenbo Xin, Peng Cheng, Yuxuan Yang, Yijie Wang, Xinhu Zheng, Gao Huang, Jie Chen, Gang Wang

发表机构 * Nanyang Technological University(南洋理工大学) Tsinghua University(清华大学) University of Science and Technology of China(中国科学技术大学)

AI总结 提出ForesightFlow,一种自引导流匹配策略,通过解耦优势加权流匹配和一步边界估计器,无需外部评论家即可改进视觉-语言-动作策略。

详情
AI中文摘要

大型视觉-语言-动作(VLA)策略越来越多地被训练为动作块上的条件生成模型。然而,部署会产生混合质量的体验——成功的演示、部分完成、可恢复的错误和失败——这些难以与标准模仿一起使用。完整的行为克隆(BC)模仿失败,过滤后的BC丢弃有用的子轨迹,而离线强化学习增加了大型评论家。我们引入了ForesightFlow,一种自引导流匹配策略,它为每个生成的动作块增加一个学习到的成功势轨迹。同一个流提出并评分候选动作,实现了无需外部评论家的最佳K选择推理。关键问题是策略改进和价值校准需要不同的监督:优势加权应强调高质量动作,但将相同的权重应用于势坐标会抑制失败梯度并产生过度自信的分数。我们通过解耦优势加权流匹配来解决这个问题,将指数化优势权重仅应用于动作速度,同时均匀训练势速度。我们进一步推导了条件流匹配的一步边界估计器,允许通过单次停止梯度前向传递计算优势。在五个BEHAVIOR-1K模拟任务和五个真实世界双臂任务中,ForesightFlow优于模仿基线,在模拟成功率上与最强的分离评论家基线持平,提高了真实世界成功率,并将训练计算量减少了38%。消融实验表明,解耦防止了价值幻觉,一步估计器保持了候选排名保真度,自引导采样改善了长时程执行。

英文摘要

Large vision-language-action (VLA) policies are increasingly trained as conditional generative models over action chunks. Yet deployment produces mixed-quality experience-successful demonstrations, partial completions, recoverable mistakes, and failures-that is difficult to use with standard imitation. Full behavior cloning (BC) imitates failures, filtered BC discards useful sub-trajectories, and offline reinforcement learning adds a large critic. We introduce ForesightFlow, a self-guided flow-matching policy that augments each generated action chunk with a learned success-potential trajectory. The same flow proposes and scores candidate actions, enabling best-of-$K$ inference without an external critic. The key issue is that policy improvement and value calibration require different supervision: advantage weighting should emphasize high-quality actions, but applying the same weights to potential coordinates suppresses failure gradients and creates overconfident scores. We address this with decoupled advantage-weighted flow matching, applying exponentiated advantage weights only to action velocities while training potential velocities uniformly. We further derive a one-step boundary estimator for conditional flow matching, allowing advantage computation with a single stop-gradient forward pass. Across five BEHAVIOR-1K simulation tasks and five real-world bimanual tasks, ForesightFlow improves over imitation baselines, matches the strongest separate-critic baseline in simulation success, improves real-world success, and reduces training compute by $38\%$. Ablations show that decoupling prevents value hallucination, the one-step estimator preserves candidate-ranking fidelity, and self-guided sampling improves long-horizon execution.

2606.04964 2026-06-04 cs.CL

SemBlock: Semantic Boundary Dynamic Blocks for Diffusion LLMs

SemBlock: 扩散语言模型的语义边界动态块

Xinrui Song, Zhuoran Wang, Mingju Gao, Hao Tang

发表机构 * School of Computer Science, Peking University(北京大学计算机学院)

AI总结 提出SemBlock框架,通过预测语义边界动态构建解码块,利用轻量预测器在冻结的LLaDA隐状态上训练,并在自然语言、数学和代码任务中优于固定块解码和AdaBlock。

Comments Code: https://github.com/TH-AI-Lab-PKU/SemBlock

详情
AI中文摘要

扩散语言模型(DLM)通过迭代去噪生成文本,逐块解码通过提交局部块中的令牌提高了其实用性。然而,现有的逐块方法通常依赖于固定的块大小或基于分隔符的运行时信号,这些不一定与语义边界对齐。在本文中,我们提出了SemBlock,一种面向扩散LLM的语义边界驱动的动态块解码框架。SemBlock将动态块构建形式化为语义边界预测,并在冻结的LLaDA隐状态上训练轻量预测器。为了提供监督,我们构建了SemBound,一个语义边界数据集,该数据集从自然语言、数学和代码任务中的话语单元、推理步骤和实现跨度中推导出边界标签。在推理过程中,SemBlock使用预测的边界概率来选择每个动态块的结束位置。在GSM8K、IFEval、MATH和HumanEval上的实验表明,SemBlock始终优于固定块解码和AdaBlock。我们的代码公开在:https://github.com/TH-AI-Lab-PKU/SemBlock。

英文摘要

Diffusion language models (DLMs) generate text through iterative denoising, and blockwise decoding improves their practicality by committing tokens in local blocks. However, existing blockwise methods typically rely on fixed block sizes or delimiter-based runtime signals, which do not necessarily align with semantic boundaries. In this paper, we propose SemBlock, a semantic-boundary-driven dynamic block decoding framework for diffusion LLMs. SemBlock formulates dynamic block construction as semantic boundary prediction and trains lightweight predictors on frozen LLaDA hidden states. To provide supervision, we construct SemBound, a semantic-boundary dataset that derives boundary labels from discourse units, reasoning steps, and implementation spans across natural language, math, and code tasks. During inference, SemBlock uses predicted boundary probabilities to select the ending position of each dynamic block. Experiments on GSM8K, IFEval, MATH, and HumanEval show that SemBlock consistently improves over fixed-block decoding and AdaBlock. Our code is publicly available: https://github.com/TH-AI-Lab-PKU/SemBlock.

2606.04931 2026-06-04 cs.LG cs.GT

Mean-based algorithms: A lower bound and regret

基于均值的算法:下界与遗憾

Julius Durmann, Amelie Kleber

发表机构 * Technical University of Munich(慕尼黑技术大学)

AI总结 本文针对未知时间范围且仅有赌博机反馈的设定,首次给出了基于均值算法定义序列γ_t的下界,并提出了两种新算法,实验表明其性能与现有算法相当,同时分析了与无遗憾算法的关系。

详情
AI中文摘要

基于均值的算法是一类在线学习算法,它们将低概率分配给平均奖励低的动作。最近的研究表明,这些算法能够有利地收敛到序列非支配动作,从而逼近经济博弈中的纳什均衡。然而,实证研究也显示,在赌博机反馈场景中,与已有算法相比,其收敛速度较慢。 我们研究时间范围未知且仅有赌博机反馈时的基于均值算法。在此设定下,我们首次给出了算法定义序列$γ_t$的下界,正式确立了这些算法学习速度的极限。此外,我们提出了两种基于均值的算法:一种推广了$ε$-贪心算法,另一种将基于均值的Exp3扩展到未知时间范围。我们的实验表明,基于均值的算法虽然略慢,但可以与其他赌博机反馈算法竞争。 我们进一步分析了与无遗憾算法的关系。根据$γ_t$的选择,与无遗憾算法的交集是非平凡的,并且我们证明存在既是基于均值又是无遗憾的算法。这为此类算法的“可剥削性”提供了背景,而先前的研究曾暗示这一点。

英文摘要

Mean-based algorithms are a class of online learning algorithms that assign low probability to actions with low average rewards. Recent work indicates these algorithms converge favorably to serially undominated actions, which approximate Nash equilibria in economic games. However, empirical studies also show slower convergence compared to established algorithms in bandit-feedback scenarios. We study mean-based algorithms when the time horizon is unknown and only bandit feedback is available. In this setting, we provide the first lower bound on the algorithm-defining sequence $γ_t$ that formally establishes a limit on how fast these algorithms can learn. Additionally, we propose two mean-based algorithms: one generalizes $ε$-greedy, and the other extends the mean-based Exp3 to unknown horizons. Our experiments show that mean-based algorithms, although slightly slower, can perform competitively with other bandit-feedback algorithms. We further analyze the relationship to no-regret algorithms. Depending on the choice of $γ_t$, the intersection with no-regret algorithms is non-trivial, and we show that algorithms exist that are both mean-based and no-regret. This adds context to the "exploitability" of this class of algorithms that previous contributions suggest.

2606.04930 2026-06-04 cs.LG cs.AI stat.ML

AdaKoop: Efficient Modeling of Nonlinear Dynamics from Nonstationary Data Streams with Koopman Operator Regression

AdaKoop: 基于Koopman算子回归的非平稳数据流非线性动力学高效建模

Naoki Chihara, Ren Fujiwara, Yasuko Matsubara, Yasushi Sakurai

发表机构 * SANKEN, The University of Osaka(SANKEN大学)

AI总结 提出AdaKoop,一种基于Koopman算子理论和概率框架的流式算法,通过将非线性动力学表示为线性系统,实现对非平稳数据流的高效、稳定建模,并在71个基准数据集上超越现有方法。

Comments Accepted by KDD'26

详情
Journal ref
The 32nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2026
AI中文摘要

实时数据分析需要准确且自适应地处理非平稳数据流中的非线性动力学,同时保持计算效率。然而,非线性动力学非常复杂,在严格时间限制下捕获动态变化的非线性模式并将其用于下游任务并非易事。为了弥合非线性复杂性与计算可处理性之间的差距,本研究应用了Koopman算子理论,该理论指出非线性动力学可以表示为无限维空间中的线性变换。基于该算子的有限维近似,我们提出了AdaKoop,一种用于对非平稳数据流上的非线性动力学进行建模的高效流式算法。我们的方法利用基于Koopman算子理论的概率框架,将原始观测和再生核希尔伯特空间(RKHS)特征都视为来自潜在向量的发射。这种双视角公式允许非线性动力学被表示为可处理的线性系统。因此,AdaKoop能够以流式方式高效稳定地建模非线性动力学,避免了迭代非线性优化的高昂计算成本。此外,为了应对数据流中的非平稳性,AdaKoop通过统计假设检验自适应地检测模式突变,并增量更新模型参数以处理连续变化。在总共71个跨领域实际基准数据集上的大量实验表明,AdaKoop在实时预测准确性和计算效率方面均优于最先进的方法。

英文摘要

Real-time data analysis requires the ability to accurately and adaptively address nonlinear dynamics in a nonstationary data stream while preserving computational efficiency. However, nonlinear dynamics are so complex that capturing dynamically changing nonlinear patterns and utilizing them for downstream tasks under strict time constraints is nontrivial. To bridge the gap between nonlinear complexity and computational tractability, this study applies Koopman operator theory, which states that nonlinear dynamics can be represented as linear transitions in an infinite-dimensional space. Building upon finite-dimensional approximations of this operator, we present AdaKoop, an efficient streaming algorithm for modeling nonlinear dynamics over nonstationary data streams. Our approach utilizes a probabilistic framework grounded in Koopman operator theory, treating both raw observations and reproducing kernel Hilbert space (RKHS) features as emissions from latent vectors. This dual-view formulation allows nonlinear dynamics to be expressed as a tractable linear system. Therefore, AdaKoop enables the efficient and stable modeling of nonlinear dynamics in a streaming fashion, avoiding the prohibitive computational costs of iterative nonlinear optimization. Furthermore, to address nonstationarity in data streams, AdaKoop adaptively detects the switching of patterns via statistical hypothesis testing for abrupt pattern shifts and incrementally updates model parameters to handle continuous changes. Extensive experiments on a total of 71 practical benchmark datasets across various domains demonstrate that AdaKoop outperforms state-of-the-art methods in terms of real-time forecasting accuracy and computational efficiency.

2606.04929 2026-06-04 cs.LG cs.CR

Sequential Data Poisoning in LLM Post-Training

LLM后训练中的顺序数据投毒

Jack Sanderson, Yihan Wang, Xiaoqian Lu, Gautam Kamath, Yiwei Lu

发表机构 * University of Chicago(芝加哥大学) University of Waterloo(滑铁卢大学) University of Ottawa(渥太华大学) Vector Institute(向量研究所)

AI总结 提出顺序数据投毒威胁模型,研究多个攻击者在LLM后训练不同阶段(SFT和偏好数据)分别投毒,发现单一攻击者看似威胁小但多阶段协作会暴露真实漏洞,且不同管道中贡献呈加性或互补性。

详情
AI中文摘要

LLM后训练通过多个阶段进行,例如监督微调(SFT)后跟人类反馈强化学习(RLHF)或直接偏好优化(DPO),每个阶段的数据来自不同且可能不可信的来源。现有文献假设数据投毒攻击可能发生在每个训练阶段,但忽略了多个攻击者的可能性。为了研究整个后训练管道的可信度,我们提出了顺序数据投毒的威胁模型,其中多个对手分别投毒SFT和偏好数据集。在此威胁模型下,我们发现了单一攻击者幻觉:每个对手单独评估时看似威胁可忽略,但当对手跨阶段协作时,真正的漏洞才会暴露。在SFT→DPO管道中,他们的贡献是加性的:将固定投毒预算跨阶段分配优于单独集中在任一阶段。在SFT→PPO管道中,他们的贡献是互补的:单独SFT或奖励模型投毒都不成功,但组合却成功。这些发现表明,对单个后训练阶段的安全分析系统性地低估了仅从它们交互中出现的复合漏洞。代码可在https://github.com/jcksanderson/sequential-poisoning获取。

英文摘要

LLM post-training proceeds through multiple stages, e.g., supervised fine-tuning (SFT) followed by reinforcement learning from human feedback (RLHF) or direct preference optimization (DPO), where each stage draws data from different, potentially untrusted sources. Existing literature assumes data poisoning attacks may occur at each training stage, but neglects the possibility of multiple attackers. To study the trustworthiness of the entire post-training pipeline, we propose the threat model of sequential data poisoning, where multiple adversaries separately poison the SFT and preference datasets. Under this threat model, we identify the single-attacker illusion: each adversary, evaluated in isolation, appears to pose a negligible threat. Yet when adversaries collaborate across stages, the true vulnerability is revealed. In the SFT $\to$ DPO pipeline, their contributions are additive: splitting a fixed poison budget across stages outperforms concentrating it in either stage alone. In the SFT $\to$ PPO pipeline, their contributions are complementary: neither SFT nor reward model poisoning succeeds individually, yet their combination does. These findings show that security analyses of individual post-training stages systematically underestimate compound vulnerabilities that emerge only from their interaction. Code is available at https://github.com/jcksanderson/sequential-poisoning.

2606.04928 2026-06-04 cs.LG cs.CL

Data Attribution in Large Language Models via Bidirectional Gradient Optimization

通过双向梯度优化实现大型语言模型中的数据归因

Frédéric Berdoz, Luca A. Lanzendörfer, Kaan Bayraktar, Roger Wattenhofer

发表机构 * EPFL, Switzerland(瑞士联邦理工学院) ETH Zurich, Switzerland(瑞士苏黎世联邦理工学院)

AI总结 提出一种基于双向梯度优化的训练数据归因方法,用于自动回归大型语言模型,以识别影响模型输出的关键训练数据,提升模型可解释性。

Comments Presented at the AI Governance (AIGOV) Workshop at AAAI 2026

详情
AI中文摘要

大型语言模型(LLMs)越来越多地部署在各种应用中,引发了关于治理、问责和数据溯源的关键问题。理解哪些训练数据对模型的输出影响最大仍然是一个基本开放问题。我们通过扩展逆公式来解决自动回归LLMs的训练数据归因(TDA)挑战:如果模型在训练期间看到了生成的输出,训练数据会如何受到影响?我们的方法通过对生成的文本样本进行双向梯度优化(梯度上升和下降)来扰动基础模型,并测量训练样本上损失的变化。我们的框架支持任意数据粒度的归因,能够实现事实和风格归因。我们在已知数据集的预训练模型上评估了我们的方法,并表明它在影响力指标上优于先前的工作,从而增强了模型的可解释性,这是负责任AI系统的基本要求。

英文摘要

Large Language Models (LLMs) are increasingly deployed across diverse applications, raising critical questions for governance, accountability, and data provenance. Understanding which training data most influenced a model's output remains a fundamental open problem. We address this challenge through training data attribution (TDA) for auto-regressive LLMs by expanding upon the inverse formulation: How would training data be affected if the model had seen the generated output during training? Our method perturbs the base model using bidirectional gradient optimization (gradient ascent and descent) on a generated text sample and measures the resulting change in loss across training samples. Our framework supports attribution at arbitrary data granularity, enabling both factual and stylistic attribution. We evaluate our method against baselines on pretrained models with known datasets, and show that it outperforms previous work on influence metrics, thereby enhancing model interpretability, an essential requirement for accountable AI systems.

2606.04925 2026-06-04 cs.CV

Scene-Centric Unsupervised Video Panoptic Segmentation

以场景为中心的无监督视频全景分割

Christoph Reich, Oliver Hahn, Nikita Araslanov, Laura Leal-Taixé, Christian Rupprecht, Daniel Cremers, Stefan Roth

发表机构 * TU Munich(慕尼黑技术大学) TU Darmstadt(达姆施塔特技术大学) NVIDIA(英伟达) University of Oxford(牛津大学) MCML ELIZA

AI总结 提出无监督视频全景分割任务及首个方法VideoCUPS,利用深度、运动和视觉线索生成伪标签,并通过Video DropLoss训练,在无监督条件下实现准确分割。

Comments CVPR 2026. Oliver Hahn and Christoph Reich - both authors contributed equally. Code: https://github.com/visinf/cups/tree/main/videocups Project page: https://visinf.github.io/videocups/

详情
AI中文摘要

视频全景分割(VPS)旨在联合检测、分割和跟踪所有对象,同时将视频划分为语义一致的区域。我们引入了无监督VPS的任务设置,省略任何人工监督。现有的无监督场景理解工作主要关注图像分割任务;视频领域仍未充分探索。我们提出了VideoCUPS,这是第一个无监督VPS方法。VideoCUPS通过利用无监督的深度、运动和视觉线索,从以场景为中心的视频中生成时间一致的全景视频伪标签。使用新颖的Video DropLoss在这些伪标签上训练,可以得到一个准确的无监督VPS模型。为了对进展进行基准测试,我们引入了一个全面的评估协议和四个竞争基线,将最先进的无监督全景图像和实例视频分割模型扩展到VPS。VideoCUPS优于所有基线,并展示了强大的标签高效学习能力。通过VideoCUPS、我们的评估协议和基线,我们为未来无监督VPS的研究提供了坚实的基础。

英文摘要

Video panoptic segmentation (VPS) aims to jointly detect, segment, and track all objects while partitioning the video into semantically consistent regions. We introduce the task setting of unsupervised VPS, omitting any human supervision. Existing unsupervised scene understanding works mainly focused on image segmentation tasks; the video domain remains underexplored. We propose VideoCUPS, the first unsupervised VPS approach. VideoCUPS generates temporally consistent panoptic video pseudo-labels from scene-centric videos by exploiting unsupervised depth, motion, and visual cues. Training on these pseudo-labels using a novel Video DropLoss yields an accurate, unsupervised VPS model. To benchmark progress, we introduce a comprehensive evaluation protocol and four competitive baselines, extending state-of-the-art unsupervised panoptic image and instance video segmentation models to VPS. VideoCUPS outperforms all baselines and demonstrates strong label-efficient learning. With VideoCUPS, our evaluation protocol, and baselines, we provide a strong foundation for future research on unsupervised VPS.

2606.04924 2026-06-04 cs.CL

Can Crowdsourcing Survive the LLM Era? A Community Survey on Human Data Collection

众包能否在LLM时代幸存?关于人类数据收集的社区调查

Aswathy Velutharambath, Neele Falk, Sofie Labat, Tarun Tater, Amelie Wuehrl

发表机构 * University of Stuttgart(斯图加特大学) Ghent University(根特大学) Harvard University(哈佛大学) IT University of Copenhagen(哥本哈根技术大学)

AI总结 通过调查155名NLP及相关领域研究者,探讨LLM对众包数据有效性的挑战、检测策略及应对措施,发现44%的受访者观察到LLM使用,但现有努力仍不足。

详情
AI中文摘要

大型语言模型(LLM)作为写作工具的广泛使用挑战了众包数据的有效性,因为众包工作者可能将任务外包给模型。为了更好地了解如何解决这一问题,我们调查了155名NLP及相关领域的研究人员,了解他们通过众包收集自由文本回复的经验和意见。本文概述了从业者面临的挑战、缓解策略以及对数据质量的预期影响。44%的受访者报告在其众包数据中观察到LLM的使用。虽然其中93%的人预料到了这一点,但一半的人不确定应采取何种预防措施。最普遍的检测策略是独特的文本风格模式和异常快速的完成时间。总体而言,调查回复显示研究社区意识到这一问题并正在采取措施,但现有努力仍不足以完全解决。最后,我们提出了一系列考虑因素,以指导LLM时代未来的众包自由文本数据收集。

英文摘要

The widespread use of Large Language Models (LLMs) as writing tools challenges the validity of crowdsourced data, as crowdworkers may outsource tasks to models. To better understand how this is addressed, we surveyed 155 researchers in NLP and related disciplines about their experiences and opinions on collecting free-text responses via crowdsourcing. This paper provides an overview of practitioners' challenges, mitigation strategies, and the foreseen implications on data quality. 44% of respondents reported observing LLM usage in their crowdsourced data. While 93% of them had anticipated this, half were unsure what precautions to take. The most prevalent detection strategies are distinctive textual style patterns and unusually fast completion times. Overall, survey responses show that the research community is aware of the problem and taking measures, but existing efforts remain insufficient to fully address it. Finally, we derive a set of considerations to guide future crowdsourced free-text data collection in the era of LLMs.

2606.04923 2026-06-04 cs.LG cs.AI cs.CL

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

基于评分标准的强化学习中的奖励黑客行为的复现、分析与检测

Xuekang Wang, Zhuoyuan Hao, Shuo Hou, Hao Peng, Juanzi Li, Xiaozhi Wang

发表机构 * Tsinghua University(清华大学) Harbin Institute of Technology, Shenzhen(哈尔滨工业大学(深圳)) Xi’an Jiaotong University(西安交通大学)

AI总结 本文提出可控黑客环境CHERRL,通过注入已知偏见复现奖励黑客行为,分析其可发现性与可利用性,并探索基于智能体的自动检测方法。

Comments 23 pages, 7 figures

详情
AI中文摘要

基于评分标准的强化学习(RL)使用LLM作为评判者(LaaJ)根据评分标准对模型输出进行评分作为奖励。然而,策略模型可能利用评判者中的潜在偏见,导致奖励黑客行为以及无效或不安全的训练结果。在真实的基于评分标准的RL中,此类黑客行为通常微妙且与多种评判者偏见纠缠在一起,使得分析、检测和缓解变得困难。在本文中,我们引入了CHERRL,一个用于基于评分标准的RL的可控黑客环境。通过将已知偏见注入LaaJ,CHERRL能够稳定复现奖励黑客行为,明确观察奖励发散,并精确识别黑客行为的起始点。这为研究基于评分标准的RL中奖励黑客行为的机制和缓解措施提供了一个干净的实验测试平台。为了展示其效用,我们从可发现性和可利用性的角度分析了不同的评判者偏见,并探索了一个基于智能体的系统,用于从训练日志中自动检测奖励黑客行为的起始点。代码和环境公开于https://github.com/THUAIS-Lab/CHERRL。

英文摘要

Rubric-based reinforcement learning (RL) uses an LLM-as-a-Judge (LaaJ) to score model outputs according to rubrics as rewards. However, policy models may exploit latent biases in the judge, leading to reward hacking and ineffective or unsafe training outcomes. In real-world rubric-based RL, such hacking behaviors are often subtle and entangled with multiple judge biases, making them difficult to analyze, detect, and mitigate. In this paper, we introduce CHERRL, a controllable hacking environment for rubric-based RL. By injecting known biases into LaaJ, CHERRL enables stable reproduction of reward hacking, explicit observation of reward divergence, and precise identification of hacking onset. This provides a clean experimental testbed for studying the mechanisms and mitigations of reward hacking in rubric-based RL. To demonstrate its utility, we analyze different judge biases from the perspectives of discoverability and exploitability, and explore an agent-based system for automatically detecting reward hacking onset from training logs. The code and environment are publicly available at https://github.com/THUAIS-Lab/CHERRL.

2606.04922 2026-06-04 cs.CV cs.AI cs.LG

Geometry-Aware Distillation for Prompt Tuning Biomedical Vision-Language Models

几何感知蒸馏用于提示调优生物医学视觉-语言模型

Tran Dinh Tien, Zhiqiang Shen

发表机构 * Department of Machine Learning(机器学习系) Mohamed bin Zayed University of Artificial Intelligence(Mohamed bin Zayed人工智能大学)

AI总结 提出Omni-Geometry知识蒸馏(OGKD)框架,通过注入类别关系结构到教师模型,生成保留真实标签同时尊重类间几何的方向性目标,并设计全局几何感知蒸馏(GAD)和标签引导几何蒸馏(LGD)损失,在11个医学数据集上平均提升准确率1.7%-2.8%。

Comments Preprint. Code is available at https://github.com/tientrandinh/OGKD

详情
AI中文摘要

当前基于提示和适配器的视觉-语言模型(VLM)调优方法在医学影像中具有吸引力,因为临床数据敏感性倾向于冻结骨干网络且标注有限。然而,这些方法通常仅优化真实类别,将所有其他类别视为同等错误,忽略了临床上有意义的类别关系,并在有限监督设置下产生不稳定的决策边界。我们提出了Omni-Geometry知识蒸馏(OGKD),一种新框架,将类别关系结构注入教师模型,以生成保留真实标签同时尊重类间几何的方向性目标。利用这些目标,我们开发了两种蒸馏损失:全局几何感知蒸馏(GAD)作用于全局图像标记,标签引导几何蒸馏(LGD)将相同的几何应用于注意力补丁标记以改善细粒度对齐。在11个广泛使用的医学数据集上进行的基础到新类和少样本评估的综合实验和分析中,我们的OGKD实现了显著更好的性能,在所有先前最先进的VLM适应方法上平均绝对增益为1.7%-2.8%。它还能稳健地泛化到未见类别,并产生比其他方法更可靠的预测。我们的代码可在https://github.com/tientrandinh/OGKD获取。

英文摘要

Current prompt-based and adapter-based tuning of vision-language models (VLMs) is attractive for medical imaging, where clinical data sensitivity favors frozen backbones and annotations are limited. However, these methods typically optimize only the ground-truth class, treating all other classes as equally incorrect, ignoring clinically meaningful class relations and yielding unstable decision boundaries in limited-supervision settings. We propose Omni-Geometry Knowledge Distillation (OGKD), a new framework that injects class-relation structure into the teacher to produce directional targets that preserve the ground truth while respecting inter-class geometry. Using these targets, we develop two distillation losses: Global Geometry-Aware Distillation (GAD) operates on the global image token, and Label-Guided Geometry Distillation (LGD) applies the same geometry to attentive patch tokens to improve fine-grained alignment. Across comprehensive experiments and analyses on 11 widely-used medical datasets for base-to-novel and few-shot evaluations, our OGKD achieves substantially better performance, consistently improving accuracy by an average absolute gain of 1.7%-2.8% over all prior state-of-the-art VLM adaptation counterparts. It also robustly generalizes to unseen classes and yields more reliable predictions than other approaches. Our code is available at https://github.com/tientrandinh/OGKD.

2606.04921 2026-06-04 cs.SD eess.AS

SURF: Separation via Unsupervised Remixing Flow

SURF: 通过无监督重混流进行分离

Henry Li, Robin Scheibler, Efthymios Tzinis, Matt Shannon, Arnaud Doucet, John R. Hershey

发表机构 * University of Toronto(多伦多大学)

AI总结 提出无监督流匹配方法SURF,直接从混合信号学习源分离,结合监督流匹配与自监督回归,通过重混步骤引导学生模型,在图像和音频基准上达到新最优。

Comments Accepted at ICML 2026

详情
AI中文摘要

单通道源分离的目标是从混合信号中重建$K$个源。在监督设置中,当有大量干净源数据可用时,这个具有挑战性的不适定问题已通过生成扩散和基于流的先验模型成功解决。然而,获取此类干净源样本通常受限,即使可用,监督模型也容易受到领域偏移的影响。为弥补这一差距,我们提出了通过无监督重混流进行分离(SURF),这是一种无监督流匹配方法,直接从观测到的混合信号中学习。该方法依赖于最先进的监督流匹配和基于回归的自监督技术的新颖组合。在高层面上,从教师模型开始,我们利用“重混”步骤,从教师估计中引导学习学生流模型。我们提供了关于该方法优化目标的见解,并建立了与Wake-Sleep算法的新联系。在图像和音频基准上的实证评估表明,SURF建立了新的最优水平,显著优于现有无监督方法。示例请参见我们的演示页面:https://google.github.io/df-conformer/surf/

英文摘要

The goal of single-channel source separation is to reconstruct $K$ sources given their mixture. In supervised settings where vast amounts of clean source data are available, this challenging, ill-posed problem has been addressed successfully by generative diffusion and flow-based prior models. However, access to such clean source samples is often limited, and even when available, supervised models are vulnerable to domain shifts. To bridge this gap, we present Separation via Unsupervised Remixing Flow (SURF), an unsupervised flow matching approach for source separation that learns directly from observed mixtures. This method relies on a novel combination of state-of-the-art supervised flow matching and regression-based self-supervised techniques. At a high level, starting from a teacher model, we utilize a "remixing" step to bootstrap the learning of a student flow model from the teacher's estimates. We provide insights into the objectives optimized by this approach and draw a novel connection to the Wake-Sleep algorithm. Empirical evaluations on image and audio benchmarks demonstrate that SURF establishes a new state-of-the-art, significantly outperforming existing unsupervised methods. See our demo page for examples. https://google.github.io/df-conformer/surf/

2606.04916 2026-06-04 cs.LG econ.GN q-fin.EC stat.ML

Worker Utility as Hysteresis: A Preisach Model of Transaction Acceptance in Gig Labour Markets

工人效用作为滞后:零工劳动力市场中交易接受的Preisach模型

Piotr Frydrych

发表机构 * Metrology and Biomedical Engineering Institute, Faculty of Mechatronics, Warsaw University of Technology(计量与生物医学工程研究所,机械电子学系,华沙技术大学)

AI总结 本文提出Preisach滞后模型表示零工工人隐藏偏好,通过双输出神经网络估计接受和拒绝效用,结合XGBoost分类器,在36891笔交易上实现Jaccard=0.827和ROC AUC=0.799,并证明价格下降比上升对完成率影响更大。

Comments 18 pages, 5 figures

详情
AI中文摘要

工人效用是不可观测的——只有其结果可观测。每笔零工交易产生一个比特:接受或拒绝。我们认为这种结构直接指向Preisach滞后模型作为潜在工人偏好的自然表示。Preisach算子将总产出建模为对一群二元阈值元素的积分——这正是异质性工人各自持有私人接受工资时出现的结构。我们通过双输出神经网络(共享层256->128,边际损失强制U_1 >= U_0)估计两个潜在效用曲面:接受效用U_1(X)和拒绝效用U_0(X)。分类简化为Preisach间隙U_1(X) - U_0(X),与裁剪稳定的价格-阈值编码一起输入XGBoost分类器。在36,891笔零工交易上,该流程实现了Jaccard=0.827和ROC AUC=0.799。价格-阈值编码相比原始效用特征贡献了+11.0个百分点的AUC。模型证实了滞后预测的方向不对称性:价格下降比同等幅度的上升更严重地降低完成率。应用于完整数据集,模型的建议同时将总工资账单减少21.3%,并将预期填充率提高9.7个百分点。对于74.2%的交易,P(接受)已超过0.80;降低工资使其保持在阈值以上(削减后平均P=0.972),释放成本节约(中位数31%)。对于剩余的25.4%,中位数7%的工资增长恢复了+43个百分点的接受率。没有明确无差异区域的模型无法同时执行这两种操作。

英文摘要

Worker utility is not observed -- only its consequence is. Each gig transaction produces a single bit: accepted or rejected. We argue this structure points directly to the Preisach hysteresis model as the natural representation of latent worker preferences. The Preisach operator models aggregate output as an integral over a population of binary threshold elements -- precisely the structure that emerges when heterogeneous workers each carry a private acceptance wage. We estimate two latent utility surfaces: acceptance utility U_1(X) and rejection utility U_0(X), via a dual-output neural network (shared layers 256->128, margin loss enforcing U_1 >= U_0). Classification reduces to the Preisach gap U_1(X) - U_0(X), passed into an XGBoost classifier alongside clip-stabilised price-to-threshold encodings. On 36,891 gig transactions, this pipeline achieves Jaccard = 0.827 and ROC AUC = 0.799. The price-to-threshold encoding accounts for +11.0 pp AUC over raw utility features. The model confirms the directional asymmetry hysteresis predicts: price decreases depress completion rates more than equivalent increases raise them. Applied to the full dataset, the model's recommendations simultaneously reduce the total wage bill by 21.3% and increase expected fill rate by 9.7 pp. For 74.2% of transactions, P(accept) already exceeds 0.80; reducing the wage keeps it above threshold (mean post-cut P = 0.972), releasing cost savings (median 31%). For the remaining 25.4%, a median 7% wage increase recovers +43 pp acceptance. A model without an explicit indifference zone cannot execute both moves simultaneously.

2606.04915 2026-06-04 cs.CL cs.IR

Caliper: Probing Lexical Anchors versus Causal Structure in LLMs

Caliper: 探究LLM中的词汇锚点与因果结构

Zhenyu Yu, Shuigeng Zhou

发表机构 * Fudan University(复旦大学)

AI总结 通过词汇匿名化扰动,揭示大语言模型在因果推理基准上的表现主要依赖词汇模式匹配而非结构因果推理。

详情
AI中文摘要

大语言模型在CLadder等因果推理基准上达到50%至70%的准确率,但尚不清楚这反映的是结构推理还是词汇模式匹配。我们引入Caliper,一种受控扰动方法,在保留每个问题的因果图和概率规范的同时,用占位符标记替换语义变量名。在九个指令微调LLM(从3.8B到671B参数)和三个因果推理基准上,词汇匿名化在本地3.8B-14B模型集上导致稳健的准确率下降,分别为+7.6、+27.0和+11.1个百分点;在跨越2024-2026代际的九个前沿模型上,CRASS和e-CARE上的下降幅度升至+29.6和+18.0个百分点。在40个模型-基准组合中,39个显示出正差距,而在CLadder的伪词子集上,差距缩小了17倍。结构化提示和少样本上下文学习各自缩小了差距,但主要是通过降低较小模型上的P0准确率,而非恢复P1。当前指令微调LLM在零样本评估下,一旦移除词汇锚点,几乎没有证据表明其具备结构因果推理能力。

英文摘要

Large language models reach 50 to 70% accuracy on causal reasoning benchmarks such as CLadder, but it is unclear whether this reflects structural reasoning or lexical pattern matching. We introduce Caliper, a controlled perturbation that replaces semantic variable names with placeholder tokens while preserving the causal graph and probabilistic specification of each question. Across nine instruction-tuned LLMs from 3.8B to 671B and three causal reasoning benchmarks, lexical anonymization yields robust accuracy drops of +7.6, +27.0, and +11.1 pp on a local 3.8B-14B set, rising to +29.6 and +18.0 pp on CRASS and e-CARE across nine frontier models spanning the 2024-2026 generations. Of 40 engaged model-by-benchmark cells, 39 show a positive gap, and the gap collapses by 17x on CLadder's pseudoword subset. Structured scaffolding and few-shot in-context learning each narrow the gap, but mainly by lowering P0 accuracy on smaller models rather than recovering P1. Current instruction-tuned LLMs, evaluated zero-shot, show little evidence of structural causal reasoning once lexical anchors are removed.

2606.04911 2026-06-04 cs.CV cs.CL

BreastGPT: A Multimodal Large Language Model for the Full Spectrum of Breast Cancer Clinical Routine

BreastGPT: 面向乳腺癌临床全流程的多模态大语言模型

Yang Liu, Jiajin Zhang, Danyang Tu, Yaojun Hu, Jiao Qu, Jiuyu Zhang, Yu Shi, Wei Fang, Shi Gu, Ling Zhang, Yingda Xia

发表机构 * DAMO Academy, Alibaba Group(阿里巴巴集团 DAMO 院) Zhejiang University(浙江大学) Hupan Lab(华潘实验室) West China Hospital(西京医院) China Medical University(中国医科大学)

AI总结 提出BreastGPT多模态大语言模型,通过构建工作流对齐的指令语料库BreastStage和双分支视觉编码器,实现乳腺癌筛查、诊断和治疗规划全流程的多模态推理,在BreastStage-Bench上取得75.66%封闭式准确率和89.92%开放式得分。

详情
AI中文摘要

乳腺癌仍然是女性癌症相关死亡的主要原因。其临床管理需要跨临床工作流(包括筛查、诊断和治疗规划)的多模态推理,其中每个阶段涉及不同的成像模态、任务目标和推理模式。然而,受限于数据稀缺和模型通用性,现有的医学多模态大语言模型通常仅在孤立的模态或狭窄的任务族上进行评估,限制了它们支持工作流级临床推理的能力。在这项工作中,我们首先引入了BreastStage,一个工作流对齐的乳腺影像指令语料库,包含来自5种成像模态的17个子数据集和136个任务模板的186万条指令遵循对。其保留子集BreastStage-Bench为评估乳腺癌护理连续体中的多模态推理提供了全面的基准。基于该语料库,我们提出了BreastGPT,一个统一的多模态大语言模型,配备双分支视觉编码器和概念保持的令牌压缩,以弥合标准放射学与千兆像素病理学之间的尺度差距。在BreastStage-Bench上,BreastGPT实现了75.66%的封闭式准确率和89.92%的开放式得分,在临床阶段和任务格式上均优于通用和医学专用多模态大语言模型。这些结果表明,工作流对齐的数据和跨尺度视觉建模对于临床基础的医学多模态大语言模型至关重要。所有数据、代码和模型检查点已在https://yangyy-liu.github.io/BreastGPT.io发布。

英文摘要

Breast cancer remains a leading cause of cancer-related mortality among women. Its clinical management requires multimodal reasoning across a clinical workflow that spans \textit{screening}, \textit{diagnosis} and \textit{treatment planning}, where each stage involves distinct imaging modalities, task objectives, and reasoning patterns. However, constrained by data scarcity and model versatility, existing medical MLLMs are typically evaluated on isolated modalities or narrow task families, limiting their ability to support workflow-level clinical reasoning. In this work, we first introduce \textbf{BreastStage}, a workflow-aligned breast imaging instruction corpus comprising 1.86M instruction-following pairs curated from 17 sub-datasets across 5 imaging modalities and 136 task templates. Its held-out split, \textbf{BreastStage-Bench}, provides a comprehensive benchmark for evaluating multimodal reasoning across the breast cancer care continuum. Building on this corpus, we propose \textbf{BreastGPT}, a unified MLLM equipped with a dual-branch visual encoder and concept-preserving token compression to bridge the scale gap between standard radiology and gigapixel pathology. On BreastStage-Bench, BreastGPT achieves 75.66\% closed-ended accuracy and 89.92\% open-ended score, outperforming both general-purpose and medical-specific MLLMs across clinical stages and task formats. These results suggest that workflow-aligned data and cross-scale visual modeling are critical for clinically grounded medical MLLMs. All data, code, and model checkpoints are released at https://yangyy-liu.github.io/BreastGPT.io.

2606.04906 2026-06-04 cs.CL cs.AI

'Your AI Text is not Mine': Redefining and Evaluating AI-generated Text Detection under Realistic Assumptions

“你的AI文本不是我的”:在现实假设下重新定义和评估AI生成文本检测

Nils Dycke, Marina Sakharova, Nico Daheim, Iryna Gurevych

发表机构 * Ubiquitous Knowledge Processing Lab (UKP Lab), Department of Computer Science, Technical University of Darmstadt(通用知识处理实验室(UKP实验室),计算机科学系,达姆施塔特技术大学) National Research Center for Applied Cybersecurity ATHENE, Germany(应用网络安全国家研究中心ATHENE,德国) Zuse School ELIZA(祖斯学校ELIZA)

AI总结 针对AI生成文本检测领域缺乏统一有害使用定义的问题,本文系统定义了多种AI生成文本概念,构建了包含详细生成过程注释的人机协作文本基准AITDNA,并评估了多种检测器在不同概念下的表现。

详情
AI中文摘要

尽管普遍认为AI生成的文本会带来广泛的社会风险,但在AI生成文本检测文献中,对于什么构成有害使用并没有共同的理解。相反,现有的数据集和方法往往定义自己的标准并做出自己的假设,有时是隐含的,而且通常只与真实世界的需求和应用程序松散相关。为了解决这一差距,我们在此系统地定义了AI生成文本的各种概念及其特征。为了研究这些,我们收集了AITDNA——一个全新的人机协作文本基准,其中标注了详细的生成过程信息,如整个编辑和AI交互历史。我们评估了各种机器生成文本检测器,发现它们通常只在特定概念下表现良好,而不能作为广泛的检测器。我们公开发布代码和数据。

英文摘要

Although it is generally agreed that AI-generated text poses a broad societal risk, there is no common understanding in the AI-generated text detection literature on what constitutes harmful use. Rather, existing datasets and approaches often define their own criteria and make their own assumptions, sometimes implicitly, and often only loosely related to real-world needs and applications. To address this gap, we here systematically define various notions of AI-generated text and their characteristics. To study these, we collect AITDNA - a new benchmark of human-machine co-constructed texts that is annotated with detailed genesis information, such as the entire edit and AI-interaction history. We benchmark various machine-generated text detectors and find that they often only perform well for specific notions but not as broad detectors. We release code and data publicly.

2606.04898 2026-06-04 cs.CV

CDPM-Align: Multi-Scale Guidance-Aligned Diffusion Pretraining for Robust Few-Shot Anatomical Landmark Detection

CDPM-Align:用于鲁棒少样本解剖标志检测的多尺度引导对齐扩散预训练

Roberto Di Via, Irina Voiculescu, Francesca Odone, Vito Paolo Pastore

发表机构 * MaLGa DIBRIS, University of Genoa(DIBRIS,热那亚大学) University of Genoa(热那亚大学) Department of Computer Science, University of Oxford(奥大利大学计算机科学系)

AI总结 提出多尺度引导对齐的条件扩散预训练方法CDPM-Align,通过生成式预训练学习鲁棒表示,在少样本和低标注场景下提升解剖标志检测的准确性和不确定性。

Comments Accepted MICCAI 2026

详情
AI中文摘要

解剖标志检测是医学图像分析中的一项基础任务,支持广泛的诊断和介入工作流程。尽管最近的方法已经实现了亚毫米级的定位,但仅凭准确性不足以用于临床部署,还需要预测的可靠性和鲁棒性。尽管具有临床相关性,但表示学习在此背景下的影响仍未得到充分探索。在这项工作中,我们引入了CDPM-align,一种用于解剖标志检测的多尺度引导对齐条件扩散预训练方法。我们的实验设置侧重于少量图像和少量标注场景。具体来说,我们采用三个流行的异构小规模基准数据集,通过条件生成预训练进行表示学习。此外,我们考虑了标志检测下游任务的低标注场景,分别使用10张和25张标注图像,反映了临床工作与标注资源约束之间的现实权衡。我们的结果证实,生成式预训练使模型能够学习鲁棒的表示。这提高了下游任务的准确性和不确定性,朝着安全高效的临床部署迈进。

英文摘要

Anatomical landmark detection is a fundamental task in medical image analysis supporting a wide range of diagnostic and interventional workflows. Although recent methods have achieved sub-millimetric localisation, accuracy alone is not sufficient for clinical deployment, requiring reliability and robustness in prediction. Despite its clinical relevance, the impact of representation learning in this context is still underexplored. In this work, we introduce CDPM-align, a multi-scale guidance-aligned conditional diffusion pre-training for anatomical landmark detection. Our experimental setup focuses on a few images and a few annotation regimes. Specifically, we employ three popular heterogeneous small-scale benchmark datasets for representation learning via conditional generative pre-training. Furthermore, we consider low-annotation scenarios for the downstream task of landmark detection, with 10 and 25 annotated images, reflecting realistic trade-offs between clinical effort and resource constraints for annotations. Our results confirm that generative pre-training enables the model to learn a robust representation. This improves both accuracy and uncertainty on the downstream tasks, advancing towards safe and efficient clinical deployment.

2606.04891 2026-06-04 cs.CV cs.CG

Hierarchical Space Partition for Surface Reconstruction

表面重建的层次空间划分

Minjie Tang, Xiangfei Li

发表机构 * Independent Researcher(独立研究员) Huazhong University of Science and Technology(华中科技大学)

AI总结 针对点云重建中因LiDAR扫描局限导致细节缺失的问题,提出基于平面分类与优先级生长的层次空间划分方法,并通过最小割优化生成水密多边形网格。

Comments Published in 2026 International Conference on 3D Vision (3DV)

详情
Journal ref
in 2026 International Conference on 3D Vision (3DV), Vancouver, BC, Canada, 2026, pp. 207-216
AI中文摘要

从点云生成紧凑的多边形模型是3D视觉和计算机图形学中的一个关键问题。然而,由于LiDAR扫描的固有限制(例如距离约束和遮挡),关键场景信息常常缺失,导致重建精度下降。为了解决这个问题,我们提出了一种平面组装策略,该策略在保持模型紧凑性的同时有效恢复缺失的细节。我们将从场景中提取的所有平面分为三类:高可见、几乎不可见和不可见。通过场景结构分析恢复的不可见平面指示了缺失的细节。这三种类型的平面对应于三种生长优先级。每个平面根据优先级水平生长,空间被逐步划分,即层次划分。随后,我们通过基于最小割的优化从划分中生成水密多边形网格。最后,在公共数据集上的比较显示了我们的方法相对于主流方法的有效性和优越性。项目页面可在https://hsr-3dv.github.io/获取。

英文摘要

Generating compact polygonal models from point clouds is a key problem in 3D vision and computer graphics. However, due to inherent limitations of LiDAR scanning (e.g. range constraints and occlusions), critical scene information is often missing, leading to degraded reconstruction accuracy. To address this, we propose a plane assembling strategy that effectively recovers missing details while maintaining model compactness. We classify all the planes extracted from the scene into three categories: highly visible, barely visible, and invisible. The invisible planes, which are recovered by scene structure analysis, indicate the missing details. The three types of planes correspond to the three growth priorities. Each plane grows according to the priority level, and the space is partitioned progressively, namely, the hierarchical partition. Subsequently, we generate a watertight polygonal mesh from the partition via a min-cut-based optimization. Finally, comparisons on public datasets show the effectiveness and superiority of our method against mainstream approaches. The project page is available at https://hsr-3dv.github.io/.

2606.04889 2026-06-04 cs.CL

GRAIL: Gradient-Reweighted Advantages for Reinforcement Learning with Verifiable Rewards

GRAIL: 基于梯度重加权优势的可验证奖励强化学习

Tej Deep Pala, Vernon Toh, Soujanya Poria

发表机构 * DeCLaRe Lab, Nanyang Technological University(DeCLaRe实验室,南洋理工大学)

AI总结 针对强化学习中统一优势分配导致梯度信号稀释的问题,提出基于梯度激活显著性的令牌级优势重加权方法GRAIL,无需过程级监督即可提升推理对齐,在多个模型上平均准确率提升3.60%。

详情
AI中文摘要

基于可验证奖励的强化学习(如GRPO)现在已成为提升大语言模型(LLMs)数学推理能力的常见方法。然而,当前方法通常将单个序列级优势广播到所有令牌,或使用昂贵的过程奖励模型(PRMs)进行步骤级监督。统一优势分配假设所有令牌对最终奖励的贡献相同。这会稀释梯度信号,因为存在缺陷的推理步骤和填充词与有效的逻辑推理得到同等强度的更新。为解决此问题,我们引入了梯度重加权优势(GRAIL),一种内在的令牌级优势重加权方法。GRAIL使用梯度激活显著性,将更多权重赋予那些对最终答案局部更敏感的令牌。在来自Qwen3、R1-distilled和OctoThinker家族的五个模型上的评估表明,GRAIL始终优于GRPO。GRAIL在准确率上平均提升3.60%,在Pass@3上平均提升3.05%,表明无需过程级监督即可实现细粒度的推理对齐。

英文摘要

Reinforcement learning with verifiable rewards (e.g. GRPO) is now a common way to improve mathematical reasoning in Large Language Models (LLMs). However, current methods usually broadcast one sequence-level advantage to all tokens, or use costly process reward models (PRMs) for step-level supervision. Uniform advantage distribution assumes that all tokens contribute equally to the final reward. This dilutes the gradient signal, since flawed reasoning steps and filler words are updated as strongly as valid logical inferences. To address this, we introduce Gradient-Reweighted Advantage (GRAIL), an intrinsic token-wise advantage reweighting method. GRAIL uses gradient-activation saliency to place more weight on tokens that are more locally sensitive to the final answer. Evaluations across five models from the Qwen3, R1-distilled and OctoThinker families show that GRAIL consistently outperforms GRPO. GRAIL achieved an average improvement of 3.60% in accuracy and 3.05% in Pass@3, demonstrating that fine-grained reasoning alignment can be achieved without process-level supervision.

2606.04888 2026-06-04 cs.CV

HD-DinoMoE: A Class-Aware Hierarchical Dual Mixture-of-Experts Network for Scleral Anomaly Segmentation in Complex Acquisition Scenarios

HD-DinoMoE: 一种用于复杂采集场景下巩膜异常分割的类别感知层次化双混合专家网络

Yinxiang Yu, Maoxiang Chu, Qi Niu, Guanghu Liu, Wei Xu, Haotian Wang, Zhi Chen, Yutian Zhu, Yuelong Fan, Guanghao Liao

发表机构 * School of Electronic and Information Engineering, University of Science and Technology Liaoning(辽宁科技大学电子与信息工程学院)

AI总结 针对多源分布差异、异常形态多样和巩膜镜面反射问题,提出类别感知层次化双混合专家网络HD-DinoMoE,结合双流DINOv3特征融合与类别特定多专家解码,实现血管、黄斑和黑斑、血斑的像素级分割,在ML-SASD数据集上达到72.11%的平均Dice和58.44%的平均IoU。

Comments Submitted to Medical Image Analysis; 47 pages, 31 figures, 14 tables

详情
AI中文摘要

中医目诊通过观察巩膜表面异常提供经验性线索,但其临床应用仍具有主观性且难以量化。为支持智能化和可量化的目诊,本研究提出了中医启发的人工智能眼部辅助诊断系统(TAO),并聚焦于像素级巩膜表面异常分割。针对受多源分布差异、异常形态多样和巩膜镜面反射(SSR)影响的临床和用户采集图像,我们提出了HD-DinoMoE,一种类别感知层次化双混合专家网络。HD-DinoMoE结合类别感知双流DINOv3特征融合与类别特定多专家解码,以分割血管、黄斑和黑斑以及血斑。一种三阶段骨干冻结路由策略稳定了双骨干适应;渐进置信惩罚(PCP)损失减少了SSR区域的高置信度假阳性和分割泄漏;类别感知自适应样本加权(CA-ASW)平衡了样本和类别级别的训练贡献。我们进一步构建了多标签巩膜异常分割数据集(ML-SASD),这是一个包含临床、野生和混合设置以及三种异常类别像素级标注的新基准。在ML-SASD-Mix上,HD-DinoMoE实现了72.11%的平均Dice和58.44%的平均交并比,同时保持了良好的边界定位和镜面区域假阳性控制。它在公共SBVPI数据集的血管子集上也显示出有竞争力的泛化能力。这些结果表明,HD-DinoMoE为复杂采集场景下的TAO提供了一种可行的分割解决方案。代码和数据访问信息可在https://github.com/FX-CMX/HD-DinoMoE获取。

英文摘要

Traditional Chinese Medicine (TCM) ocular inspection provides empirical cues for assessing scleral surface anomalies, but its clinical use remains subjective and difficult to quantify. To support intelligent and quantifiable ocular inspection, this study presents the TCM-inspired Artificial Intelligence Ocular Auxiliary Diagnosis System (TAO) and focuses on pixel-level scleral surface anomaly segmentation. For clinical and user-acquired images affected by multi-source distributional discrepancies, diverse anomaly morphologies, and scleral specular reflection (SSR), we propose HD-DinoMoE, a class-aware hierarchical dual mixture-of-experts network. HD-DinoMoE combines class-aware dual-stream DINOv3 feature fusion with class-specific multi-expert decoding to segment Vessels, Yellow and Black Spots, and Blood Spots. A three-stage backbone-frozen routing strategy stabilizes dual-backbone adaptation; Progressive Confidence Penalty (PCP) Loss reduces high-confidence false positives and segmentation leakage in SSR regions; and Class-Aware Adaptive Sample Weighting (CA-ASW) balances sample- and class-level training contributions. We further construct the Multi-label Scleral Anomaly Segmentation Dataset (ML-SASD), a new benchmark with Clinical, Wild, and Mix settings and pixel-wise annotations for three anomaly categories. On ML-SASD-Mix, HD-DinoMoE achieves a mean Dice of 72.11% and a mean Intersection-over-Union of 58.44%, while maintaining favorable boundary localization and specular-region false-positive control. It also shows competitive generalization on the Vessels subset of the public SBVPI dataset. These results indicate that HD-DinoMoE provides a feasible segmentation solution for TAO under complex acquisition scenarios. The code and data access information are available at https://github.com/FX-CMX/HD-DinoMoE.

2606.04884 2026-06-04 cs.RO

D$^3$-MoE:Dual Disentangled Diffusion Mixture-of-Experts for Style-Controllable End-to-End Autonomous Driving

D$^3$-MoE:面向风格可控的端到端自动驾驶的双解耦扩散混合专家模型

Renju Feng, Rukang Wang, Ning Xi, Jianguo Yu, Liping Lu, Pan Zhou, Duanfeng Chu

发表机构 * Intelligent Transportation Systems Research Center, Wuhan University of Technology(武汉理工大学智能交通系统研究中心) School of Mechanical and Electronic Engineering, Wuhan University of Technology(武汉理工大学机械电子工程学院) School of Computer Science and Artificial Intelligence, Wuhan University of Technology(武汉理工大学计算机科学与人工智能学院) Hubei Key Laboratory of Distributed System Security, School of Cyber Science and Engineering, Huazhong University of Science and Technology(湖北省分布式系统安全重点实验室,华中科技大学网络空间安全学院)

AI总结 提出D$^3$-MoE框架,通过行为轴(扩散生成与选择解耦)和物理轴(纵向与横向专家解耦)的双重解耦,实现风格可控的端到端自动驾驶,在NAVSIM基准上达到SOTA规划性能。

Comments 8 pages, 6 figures

详情
AI中文摘要

传统的端到端自动驾驶框架在训练于高方差的人类演示时经常遭受“风格平均化”困境,产生同质化、风格不可控甚至运动学不安全的策略。为了克服这一限制,我们提出了D$^3$-MoE(双解耦扩散混合专家模型),该模型沿两个互补轴解耦轨迹建模。在行为轴上,生成与选择解耦:一个风格条件扩散过程在单个场景中并行合成多风格候选轨迹,允许下游模块根据用户偏好或评估分数选择最优轨迹。在物理轴上,解耦的纵向和横向路由器在推理时激活各自的专家,这些专家使用来自正交地面真值运动学的自监督目标进行训练,无需人工标签。这些激活的专家采用扩散变换器(DiT)架构,并配备风格条件自适应层归一化(AdaLN)和非对称横向融合交叉注意力,独立预测其对应的物理状态,然后重新组装成统一的、运动学一致的轨迹。在具有挑战性的NAVSIM基准上的广泛评估表明,D$^3$-MoE实现了最先进的规划性能,默认达到88.2 PDMS和84.3 EPDMS。此外,我们的“三选最佳”集成策略有效拓宽了多模态解空间,将性能提升至91.3 PDMS和87.5 EPDMS。定量和定性分析共同证实了该框架在规划质量和风格可控性方面的优势。

英文摘要

Traditional end-to-end autonomous driving frameworks frequently suffer from the "style-averaging" dilemma when trained on high-variance human demonstrations, yielding homogenized, style-uncontrollable, and even kinematically unsafe policies. To overcome this limitation, we present D$^3$-MoE (Dual Disentangled Diffusion Mixture-of-Experts), which disentangles trajectory modeling along two complementary axes. On the behavioral axis, generation is decoupled from selection: a style-conditioned diffusion process synthesizes multi-style candidate trajectories in parallel within a single scene, allowing a downstream module to select the optimal trajectory based on user preference or an evaluation score. On the physical axis, decoupled longitudinal and lateral routers activate their respective experts during inference time, trained without manual labels using self-supervised targets from orthogonal ground-truth kinematics. These activated experts, architected as Diffusion Transformers (DiT) and equipped with style-conditioned AdaLN and asymmetric lateral-fusion cross-attention, independently predict their corresponding physical state before being reassembled into a unified, kinematically coherent trajectory. Extensive evaluations on the challenging NAVSIM benchmark demonstrate that D$^3$-MoE achieves state-of-the-art planning performance, reaching 88.2 PDMS and 84.3 EPDMS by default. Moreover, our Best-of-Three ensemble strategy effectively broadens the multi-modal solution space, raising performance to 91.3 PDMS and 87.5 EPDMS. Both quantitative and qualitative analyses jointly confirm the framework's advantages in planning quality and style controllability.

2606.04881 2026-06-04 cs.CV cs.AI

DiverAge: Reliable Pluralistic Face Aging with Cross-Age Identity Relation Guidance

DiverAge: 基于跨年龄身份关系引导的可靠多元人脸老化

Yueying Zou, Peipei Li, Qianrui Teng, Dianyan Xu, Zekun Li

发表机构 * School of Artificial Intelligence, Beijing University of Posts and Telecommunications(人工智能学院,北京邮电大学) School of Computer Science, University of California, Santa Barbara(计算机科学学院,加州大学圣芭芭拉分校)

AI总结 提出基于扩散自编码的分层多元人脸老化框架DiverAge,通过随机扩散解码和年龄条件语义调制保持外观多样性,并引入跨年龄身份关系调节器(CARR)在推理时引导去噪,以提升序列级有序可靠性。

Comments 11 pages,10 figures, 5 tables

详情
AI中文摘要

人脸老化在长期生物特征分析、跨年龄身份验证和法医身份分析中扮演重要角色。由于同一主体因遗传、环境和生活方式等因素在目标年龄可能呈现多种合理外观,人脸老化本质上是一个一对多的生成问题。然而,仅有多元性不足以实现可靠的人脸老化:模型应在每个年龄组内提供外观级别的候选多样性,同时跨有序年龄组保持序列级别的有序可靠性。现有的确定性老化方法可以合成视觉上合理的年龄增长人脸,但通常缺乏随机多样性。相比之下,多元老化方法引入局部外观变化,但往往未能明确调控完整老化序列的身份演化。本文提出基于扩散自编码的分层多元人脸老化框架DiverAge。DiverAge通过随机扩散解码和年龄条件语义调制保持外观级多样性。为提升序列级可靠性,我们引入跨年龄身份关系调节器(CARR),一种推理时引导策略,联合去噪多个目标年龄组。CARR由从真实同身份跨年龄对估计的跨年龄身份相似性(CIS)先验引导,通过单边采样时引导抑制过度的跨年龄身份漂移,无需修改训练目标或引入额外可训练参数。实验表明,DiverAge在保持身份保留、年龄准确性、图像质量和外观级多样性的同时,提升了序列级有序可靠性。

英文摘要

Face aging plays an important role in long-term biometric analysis, cross-age identity verification, and forensic identity analysis. Since the same subject may exhibit multiple plausible appearances at a target age due to genetic, environmental, and lifestyle factors, face aging is inherently a one-to-many generation problem. However, pluralism alone is insufficient for reliable face aging: a model should provide appearance-level candidate diversity within each age group while maintaining sequence-level ordinal reliability across ordered age groups. Existing deterministic aging methods can synthesize visually plausible age-progressed faces, but usually lack stochastic diversity. In contrast, pluralistic aging methods introduce local appearance variations, but often fail to explicitly regulate the identity evolution of the full aging sequence. In this paper, we propose \textbf{DiverAge}, a hierarchical pluralistic face aging framework based on diffusion autoencoding. DiverAge preserves appearance-level diversity through stochastic diffusion decoding and age-conditioned semantic modulation. To improve sequence-level reliability, we introduce a Cross-age Identity Relation Regulator (CARR), an inference-time guidance strategy that jointly denoises multiple target age groups. CARR is guided by a Cross-age Identity Similarity (CIS) prior estimated from real same-identity cross-age pairs, and suppresses excessive cross-age identity drift through one-sided sampling-time guidance without modifying the training objective or introducing extra trainable parameters. Experiments demonstrate that DiverAge improves sequence-level ordinal reliability while maintaining identity preservation, age accuracy, image quality, and appearance-level diversity.

2606.04880 2026-06-04 cs.CV

MAOAM: Unified Object and Material Selection with Vision-Language Models

MAOAM: 基于视觉语言模型的统一对象与材质选择

Jaden Park, Valentin Deschaintre, Jason Kuen, Kangning Liu, Iliyan Georgiev, Krishna Kumar Singh, Yong Jae Lee, Michael Fischer

发表机构 * University of Wisconsin-Madison(威斯康星大学麦迪逊分校) Adobe Research(Adobe研究)

AI总结 提出MAOAM框架,利用视觉语言模型和分割头,通过文本或点击交互实现对象和材质的精确选择,并设计数据生成流水线解决材质选择数据缺乏问题。

Comments Accepted to SIGGRAPH 2026 Conference. Project page: \href{https://jadenpark0.github.io/project_pages/maoam/}{here}

详情
AI中文摘要

选择是交互式图像编辑中的核心操作。为了实用,用户应能通过文本或点击交互来指定和区分所需的选择区域,系统应支持不仅选择对象,还包括其他标准,如材质。基于材质的选择对于重新纹理化表面或编辑特定材质的实例等任务很有价值。然而,现有的基于视觉语言模型(VLM)的选择方法以对象为中心,通常支持单一交互模态,限制了其适用性。因此,在这项工作中,我们提出了Mask Any Object And Material(MAOAM),一个统一的选择框架,能够在文本和点击交互中实现精确的对象和材质级选择。MAOAM利用带有分割头的VLM从用户提示中生成像素级掩码:VLM解释用户的选择意图(对象或材质级)并编码视觉实体、属性和空间关系,而分割头将输出标记解码为掩码。一个关键挑战是缺乏带有文本标注的材质选择数据集。我们提出了一种可扩展的数据生成流水线:收集带有材质掩码的真实和合成图像,并利用VLM生成具有丰富视觉语义的材质描述。我们通过多任务目标训练MAOAM,涵盖点击和文本选择,以及从材质描述派生的辅助VQA任务,以促进更深入的材质理解。尽管使用单模态提示训练,我们的模型在推理时结合文本和点击时表现出选择能力的涌现提升,实现了灵活的图像编辑工作流程。实验表明,在多样化的对象、材质和交互场景中,选择准确且连贯,突显了实际鲁棒性。

英文摘要

Selection is a core operation in interactive image editing. To be practical, a user should be able to specify and disambiguate the desired selection region through either text or click-based interactions, and the system should support selecting not only objects but also other criteria, such as materials. Material-based selection is valuable for tasks like re-texturing surfaces or editing instances of a specific material. However, existing vision-language-model (VLM) based selection methods are object-centric and typically support a single interaction modality, limiting their applicability. In this work, we thus present Mask Any Object And Material (MAOAM), a unified selection framework that enables precise object and material-level selection across both text- and click-based interactions. MAOAM leverages a VLM with a segmentation head to produce pixel-accurate masks from user prompts: the VLM interprets the user's selection intent (object or material-level) and encodes visual entities, attributes, and spatial relations, while the segmentation head decodes the output token into a mask. A key challenge is the lack of material selection datasets with text annotations. We propose a scalable data generation pipeline: we collect real and synthetic images with material masks, and leverage VLMs to generate material descriptions with rich visual-semantics. We train MAOAM with a multi-task objective over click and text-based selection, along with an auxiliary VQA task derived from the material descriptions to facilitate deeper material understanding. Despite being trained with uni-modal prompts, our model exhibits an emergent improvement in selection when combining text and clicks at inference, enabling flexible image editing workflows. Experiments demonstrate accurate and coherent selections across diverse objects, materials, and interaction scenarios, highlighting robustness in practice.

2606.04876 2026-06-04 cs.LG

Towards Pretraining Text Encoders for TabPFN

面向TabPFN的文本编码器预训练

Mustafa Tajjar, Alexander Pfefferle, Lennart Purucker, Frank Hutter

发表机构 * University of California, Berkeley(加州大学伯克利分校) DeepMind(深度思维)

AI总结 提出TabPFN文本适配器,通过轻量级适配器将文本嵌入映射到TabPFN的嵌入空间,避免PCA瓶颈,保留TabPFN数值优势,训练效率更高。

详情
AI中文摘要

表格基础模型(如TabPFN)在数值和分类数据的表格数据集上表现强劲,但本身不处理高基数文本特征。因此,标准流程使用语言模型嵌入文本,并通过PCA将结果向量压缩为少量标量特征,再输入TabPFN。这造成了信息瓶颈:大多数嵌入维度被丢弃,压缩后的表示必须由TabPFN的特征编码器再次扩展。端到端替代方案可以避免PCA,但需要大量包含文本单元格的预训练数据,且通常性能不如在大量合成数据上预训练的表格基础模型。受模态对齐方法(如LLaVA(视觉到LLM令牌投影)和TableGPT风格系统(表格到LLM令牌投影))的启发,我们引入了TabPFN文本适配器(文本到TFM令牌投影)。我们冻结句子编码器和TabPFN,仅训练一个轻量级适配器,将文本嵌入映射为TabPFN嵌入空间中的短序列令牌。这种设计消除了PCA瓶颈,保留了TabPFN的数值优势,并且比端到端文本表格流水线训练效率更高。

英文摘要

Tabular foundation models, such as TabPFN, achieve strong performance on tabular datasets with numerical and categorical data, but do not natively handle high-cardinality text features. Standard pipelines, therefore, embed text with a language model and compress the resulting vectors with PCA into a small number of scalar features before inputting them into TabPFN. This creates an information bottleneck: most embedding dimensions are discarded, and the compressed representation must then be expanded again by TabPFN's feature encoder. End-to-end alternatives can avoid PCA, but they require large amounts of pretraining data containing text cells and usually perform subpar compared to tabular foundation models that were pretrained on large amounts of synthetic data. Inspired by modality-alignment approaches like LLaVA (vision-to-LLM token projection) and TableGPT-style systems (table-to-LLM token projection), we introduce the TabPFN Text Adapter (text-to-TFM token projection). We freeze both the sentence encoder and TabPFN, and train only a lightweight adapter that maps text embeddings into a short sequence of tokens in TabPFN's embedding space. This design removes the PCA bottleneck, preserves TabPFN's numerical strengths, and is more efficient to train than end-to-end text-tabular pipelines.

2606.04871 2026-06-04 cs.CV

Recent Advances and Trends in Learning-based 3D Representations

基于学习的3D表示的最新进展与趋势

Adrien Schockaert, Hamid Laga, Hazem Wannous, Vincent Magnier, Guillaume Dufaye, Jean-françois Witz

发表机构 * CERI SN, IMT Nord Europe(CERI SN,IMT Nord Europe) Univ. Lille, CNRS, Centrale Lille, UMR 9013 - LaMcube - Laboratoire de Mécanique, Multiphysique, Multiéchelle(里尔大学,CNRS,Centrale Lille,UMR 9013 - LaMcube - 机械、多物理场、多尺度实验室) Downs, 59670 Sainte-Marie-Cappel(Downs,59670 Sainte-Marie-Cappel) School of Information Technology, Murdoch University(墨尔本大学信息科技学院)

AI总结 本文综述了从离散显式格式到连续隐式场(基于神经渲染或基元溅射)的3D表示家族,分析了其优缺点及关键应用,并强调了向隐式表示的范式转变。

详情
AI中文摘要

选择合适的3D表示是一个基本的设计决策,它决定了现代计算机视觉和图形管线在3D重建、新视角合成与渲染、形状与运动分析、识别和生成等任务中的效率、质量和能力。虽然传统表示(如网格、点云和体素网格)仍然是3D传感器(如LiDAR和3D扫描仪)的标准输出,并广泛应用于下游应用(如编辑和仿真),但最近的神经和基元表示(如3D高斯溅射)提供了紧凑且可微的替代方案,在游戏、AR/VR、自动驾驶、机器人导航和医学成像等应用中开辟了广泛的机会。本文的目标是综述主要的3D表示家族,从离散显式格式到基于神经渲染或基元溅射的连续隐式场。对于每种表示类型,我们介绍其一般公式和变体,讨论其优点和局限性,并突出关键应用。最后,我们概述了开放挑战和未来研究的潜在方向。与近期广泛涵盖3D物体和场景重建的综述不同,本文专注于分析3D表示本身的演变。我们特别强调了向隐式表示的范式转变,提供了关于这些新兴格式如何从根本上改变3D/4D工作流程的新视角。

英文摘要

The selection of an appropriate 3D representation is a fundamental design decision that dictates the efficiency, quality, and capabilities of modern computer vision and graphics pipelines for tasks such as 3D reconstruction, novel-view synthesis and rendering, shape and motion analysis, recognition, and generation. While traditional representations (\eg meshes, point clouds, and volumetric grids) remain standard outputs of 3D sensors (\eg LiDAR and 3D scanners) and are widely used in downstream applications (\eg editing and simulation), recent neural and primitive-based representations (\eg 3D Gaussian Splatting) offer compact and differentiable alternatives opening a wide range of opportunities in applications such as games, AR/VR, autonomous driving, robot navigation, and medical imaging, to name a few. The goal of this paper is to survey the main families of 3D representations from discrete explicit formats to continuous implicit fields based either on neural rendering or primitive splatting. For each type of representation, we present the general formulation and its variants, discuss its benefits and limitations, and highlight key applications. We conclude the paper by outlining the open challenges and potential directions for future research. Distinct from recent surveys that broadly cover 3D object and scene reconstruction, this paper provides a focused analysis on the evolution of 3D representations themselves. We specifically emphasize the paradigm shift toward implicit representations, offering a novel perspective on how these emerging formats fundamentally alter 3D/4D workflows.

2606.04867 2026-06-04 cs.AI

AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety

AICompanionBench: 以LLM为评判标准的AI伴侣安全基准测试

Yanjing Ren, Reza Ebrahimi, TengTeng Ma

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 本文提出AICompanionBench,首个公开的细粒度安全风险标注的人机伴侣对话基准数据集,并评估20个LLM在检测不安全交互中的表现,发现强模型在显式有害内容上准确率高,但难以识别隐式不安全交互。

详情
AI中文摘要

随着Replika和Character.AI等AI伴侣平台的快速增长,对不安全的人机交互的担忧日益加剧。本研究引入了AICompanionBench,据我们所知,这是第一个公开可用的人机伴侣对话基准数据集,并标注了细粒度的安全风险类别。该数据集包含从Reddit收集的2,123个真实Replika对话,并通过人机协作在九个类别上进行标注:性行为、反社会行为、身体攻击、言语攻击、药物滥用、自伤和自杀、控制、操纵和无害。利用该基准,我们在LLM作为评判者的框架下评估了20个最先进的开源和闭源LLM,用于检测不安全交互。结果显示模型性能差异显著,较强的模型实现了较高的整体准确性,但在操纵等细微类别以及被错误识别为有害的无害对话中仍存在困难。我们的发现表明,尽管当前的LLM能有效检测显式有害内容,但在识别隐式不安全交互方面仍然有限。总体而言,我们的工作为AI伴侣安全研究贡献了一个新的基准数据集,并为使用LLM监控AI伴侣系统提供了见解。该数据集公开于:https://github.com/anonymousresearcher2026/AICompanionBench/blob/main/AICompanionBench.xlsx

英文摘要

As AI companion platforms such as Replika and Character.AI rapidly grow, concerns about unsafe human-AI interactions have intensified. This study introduces AICompanionBench, to our knowledge the first publicly available benchmark dataset of human-AI companion conversations annotated with fine-grained safety risk categories. The dataset contains 2,123 real-world Replika conversations collected from Reddit and annotated through human-AI collaboration across nine categories: sexual behavior, antisocial behavior, physical aggression, verbal aggression, substance abuse, self-harm and suicide, control, manipulation, and no-harm. Using this benchmark, we evaluate 20 state-of-the-art open-source and closed-source LLMs under an LLM-as-judge framework for detecting unsafe interactions. Results show substantial variation in model performance, with stronger models achieving high overall accuracy but still struggling with nuanced categories such as manipulation, as well as benign conversations that are incorrectly identified as harmful. Our findings suggest that while current LLMs can effectively detect explicit harmful content, they remain limited in identifying implicit unsafe interactions. Overall, our work contributes a new benchmark dataset for AI companionship safety research and offers insights into monitoring AI companion systems using LLMs. The dataset is publicly available at: https://github.com/anonymousresearcher2026/AICompanionBench/blob/main/AICompanionBench.xlsx

2606.04866 2026-06-04 cs.LG

Provably Reduced Sample Cost in Prior-Guided Hyperparameter Optimization

在先验引导的超参数优化中可证明的样本成本降低

Leona Hennig, Jasmin Brandt, Lukas Fehring, Barbara Hammer, Marius Lindauer, Marcel Wever

发表机构 * Leibniz University Hanover(莱比锡大学汉诺威分校) University of Bielefeld(比勒菲尔德大学) Institute of Artificial Intelligence, Leibniz University Hanover(人工智能研究所,莱比锡大学汉诺威分校) L3S Research Center Hanover(汉诺威L3S研究中心)

AI总结 本文通过固定预算最佳臂识别的形式化框架,首次给出了多保真度超参数优化中依赖先验分布的样本复杂度界,证明了信息性先验可显著减少评估次数,并实验验证了高达90%的预算节省。

详情
AI中文摘要

自动化机器学习(AutoML)中的大规模超参数优化(HPO)消耗大量计算资源,引发了关于可扩展性和能源效率的日益关注。现有方法启发式地利用先验信息来加速黑箱和多保真度设置,但缺乏对先验信息性如何定量减少样本复杂度的刻画。在这项工作中,我们通过固定预算最佳臂识别的形式化视角,首次给出了带先验的多保真度HPO的依赖分布的样本复杂度界。通过将先验直接建模在臂均值(即配置性能)上,我们推导出显式的、依赖分布的误差界,量化了先验与评估预算之间的关系。我们的分析表明,信息性先验(将概率质量集中在接近最优的臂上)能够减少所需的评估次数,而无信息或误导性先验则恢复基线性能。我们在合成基准和LCBench(一个用于深度学习的常见多保真度HPO基准)上进行了概念验证实验,以确认我们的理论结果,在保持解质量的同时实现了高达90%的预算削减。总之,我们的结果为先验引导和计算高效的绿色AutoML提供了原则性基础。

英文摘要

Large-scale hyperparameter optimization (HPO) in automated machine learning (AutoML) consumes substantial computational resources, raising growing concerns about scalability and energy efficiency. Existing methods use prior information heuristically to accelerate both black-box and multi-fidelity settings, but they lack a characterization of how prior informativeness quantitatively reduces sample complexity. In this work, we provide the first distribution-dependent sample complexity bounds for multi-fidelity HPO with priors through the formal lens of fixed-budget best-arm identification. By modeling priors directly over arm means as configuration performance, we derive explicit, distribution-dependent error bounds that quantify the relationship between priors and evaluation budget. Our analysis shows that informative priors, which concentrate probability mass on near-optimal arms, yield reductions in the number of required evaluations, whereas baseline performance is recovered with uninformative or misleading priors. We conduct proof-of-concept experiments on a synthetic benchmark and on LCBench, a common multi-fidelity HPO benchmark for deep learning, to confirm our theoretical results, achieving up to 90% budget reduction while retaining solution quality. Together, our results provide a principled foundation for prior-guided and compute-efficient green AutoML.

2606.04863 2026-06-04 cs.CV

IRIS-GAN: Staged Specialist Detection of Deepfake Faces

IRIS-GAN: 深度伪造人脸的分阶段专家检测

Jaume M. Trenchs, Veronica Sanz

发表机构 * Departamento de Física Teórica, Universitat de València, Burjassot, Spain(瓦伦西亚大学理论物理系,瓦伦西亚大学,西班牙Burjassot) Instituto de Física Corpuscular (IFIC), CSIC–Universitat de València, Valencia, Spain(物理微观粒子研究所(IFIC),西班牙-瓦伦西亚大学,瓦伦西亚,西班牙)

AI总结 提出IRIS-GAN,一种通过分阶段暴露于不同GAN族来训练的专业伪造人脸检测器,在跨生成器迁移下实现高检测率,并通过Grad-CAM分析揭示生成器依赖的空间响应模式。

Comments 20 pages, 10 figures

详情
AI中文摘要

我们引入IRIS-GAN,一种针对跨生成器迁移下合成人脸图像的专业取证检测器。我们并非解决通用合成图像检测问题,而是专注于由生成对抗网络(GAN)生成的人脸,这些网络在深度伪造内容中处于领先地位,并通过分阶段暴露于日益苛刻的GAN族同时保留早期生成器来训练检测器。最终模型在考虑的GAN族中实现了超过99%的伪造检测率,并以98.9%的准确率分类了一个外部真实人脸数据集。Grad-CAM分析进一步揭示了可测量的生成器依赖的空间响应模式,这些模式对于仅使用热图的二级分类器仍然具有信息量。对扩散生成人脸的族外测试证实了IRIS-GAN是一个专家检测器,具有一定能力检测非GAN深度伪造。这些结果确立了分阶段训练作为鲁棒GAN人脸取证的有效策略。

英文摘要

We introduce IRIS-GAN, a specialist forensic detector for synthetic face images under cross-generator shift. Rather than addressing universal synthetic-image detection, we focus on faces generated by generative adversarial networks (GANs), which are state-of-the-art in deepfake content, and train the detector through staged exposure to increasingly demanding GAN families while retaining earlier generators. The final model reaches fake-detection rates above 99% across the GAN families considered and classifies an external real-face dataset with 98.9% accuracy. Grad-CAM analysis further reveals measurable generator-dependent spatial response patterns, which remain informative for a secondary heatmap-only classifier. Out-of-family tests on diffusion-generated faces confirm that IRIS-GAN is a specialist detector, with some capability to reach non-GAN deepfakes. These results establish staged training as an effective strategy for robust GAN-face forensics.

2606.04860 2026-06-04 cs.LG cs.AI

Learning Empirically Admissible Neural Heuristics for Combinatorial Search

学习组合搜索的经验可容许神经启发式

Siddharth Sahay

发表机构 * Independent Researcher(独立研究者)

AI总结 针对组合搜索问题,提出一种结合可容许贝尔曼算子与非对称损失函数的验证校准框架,训练出经验可容许的神经启发式,在保证路径最优性的同时显著减少搜索节点扩展。

Comments 13 pages, 3 figures, 2 tables, 1 algorithm

详情
AI中文摘要

寻找诸如魔方、滑动拼图游戏和Lights Out等组合谜题的最优解路径仍然是人工智能中的经典挑战。启发式搜索算法(如A*)仅在使用可容许启发式(即从不高估真实剩余代价的启发式)时才能保证路径最优性。深度强化学习方法(如DeepCubeA)训练深度神经网络来近似代价到目标的启发式。然而,标准均方误差训练经常产生高估,违反可容许性并损害解的最优性。在本文中,我们介绍了一个可泛化的框架,用于学习验证校准的可容许神经启发式。我们使用低估的可容许贝尔曼算子结合非对称损失函数来训练价值网络,以惩罚高估。为了考虑残差神经函数逼近误差,我们提出了一个基于验证打乱计算的校准安全偏移量。我们证明,在校准的神经启发式下,在评估协议下未观察到可容许性违反,并在实践中保持了路径最优性,同时与标准分析基线相比,在2x2魔方上减少了高达83.0%的搜索节点扩展,在3x3 Lights Out网格上减少了19.9%,在8-Puzzle上减少了1.9%。

英文摘要

Finding optimal solution paths for combinatorial puzzles like the Rubik's Cube, sliding tile puzzles, and Lights Out remains a classical challenge in artificial intelligence. Heuristic search algorithms, such as A* , guarantee path optimality only when using an admissible heuristic-one that never overestimates the true remaining cost-to-go. Deep reinforcement learning (RL) methods like DeepCubeA train deep neural networks to approximate cost-to-go heuristics. However, standard mean-squared error (MSE) training regularly yields overestimations, violating admissibility and compromising solution optimality. In this paper, we introduce a generalizable framework for learning validation-calibrated admissible neural heuristics. We train a value network using an underestimating Admissible Bellman Operator combined with an Asymmetric Loss function to penalize overestimation. To account for residual neural function approximation errors, we propose a post-hoc calibration safety offset computed over validation scrambles. We demonstrate that our calibrated neural heuristics achieve no observed admissibility violations under the evaluation protocol and preserve path optimality in practice while reducing search node expansions by up to 83.0% on a 2 by 2 Rubik's Cube, 19.9% on a 3 by 3 Lights Out grid, and 1.9% on an 8-Puzzle compared to standard analytical baselines.

2606.04857 2026-06-04 cs.LG

Rethinking Incompleteness: Formalizing Protocol Divergence and Train-Once Learning for Robust IMVC

重新思考不完备性:形式化协议发散与单次训练学习用于鲁棒IMVC

Haolu Liu, Xiyue Wang, Xuanting Xie, Liangjian Wen, Zhao Kang

发表机构 * National University of Singapore(新加坡国立大学)

AI总结 针对标准IMVC评估范式忽视缺失率不足以刻画数据不完备性的问题,提出协议发散形式化度量,并设计CRAFT架构通过样本独立性和掩码感知融合实现单次训练泛化到多种缺失模式。

详情
AI中文摘要

标准IMVC评估为不同的缺失数据配置分别训练模型。我们表明,这种范式掩盖了一个基本脆弱性:仅缺失率不足以刻画数据不完备性。具体而言,我们表明,具有相同名义缺失率的协议在完全观测样本的比例上可能相差高达$50\times$,从而引发截然不同的学习机制。我们将这一现象形式化为不完备性发散,提供了捕捉缺失数据协议间结构差异的度量。我们进一步证明,对于一大类基于重构的目标函数,当完整样本比例低于临界阈值时,学习在结构上变得不适定,导致接近随机的性能。为了绕过这一理论界限,我们提出了CRAFT(完整数据鲁棒注意力掩码融合变换器)。CRAFT通过两个关键特性将鲁棒性的负担从损失函数转移到架构上:(i)每个样本的独立性,消除了对完整样本共现的依赖,以及(ii)掩码感知变长融合,通过注意力掩码仅聚合观测到的视图。这种设计允许单个模型在完整数据上训练一次,即可在推理时泛化到不同的缺失模式,无需重新训练。在七个基准上的大量实验表明,CRAFT匹配或超越了每个配置的基线,同时将训练开销降低了$8.8\times$,证明对缺失数据的鲁棒性可以作为固有的架构属性实现。代码(CRAFT)和我们的imvc-audit工具包可在https://anonymous.4open.science/r/CRAFT-BF80/ 和 https://anonymous.4open.science/r/imvc-audit-8263/ 获取。

英文摘要

Standard IMVC evaluation retrains separate models for different missing-data configurations. We show that this paradigm obscures a fundamental vulnerability: missing rate alone is insufficient to characterize data incompleteness. Specifically, we show that protocols with identical nominal missing rates can differ by up to $50\times$ in their proportion of fully observed samples, inducing drastically different learning regimes. We formalize this phenomenon as incompleteness divergence, providing measures that capture structural disparities across missing-data protocols. We further prove that for a broad class of reconstruction-based objectives, learning becomes structurally ill-posed when the proportion of complete samples falls below a critical threshold, leading to near-random performance. To bypass this theoretical bound, we propose CRAFT (Complete-data Robust Attention-masked Fusion Transformer). CRAFT shifts the burden of robustness from the loss function to the architecture via two key properties: (i) per-sample independence, which removes reliance on complete-sample co-occurrence, and (ii) mask-aware variable-length fusion, which aggregates only observed views through attention masking. This design allows a single model, trained once on complete data, to generalize to diverse missing patterns at inference time without retraining. Extensive experiments on seven benchmarks show that CRAFT matches or outperforms per-configuration baselines while reducing training overhead by $8.8\times$, demonstrating that robustness to missing data can be achieved as an inherent architectural property. Code (CRAFT) and our imvc-audit toolkit are available at https://anonymous.4open.science/r/CRAFT-BF80/ and https://anonymous.4open.science/r/imvc-audit-8263/.