2606.00957 2026-06-02 cs.CV

Boundary-Protection W8A8 HiFloat8 Quantization for Large-Scale Text-to-Video Diffusion Transformers

Yiming Zhao

Affiliations * Yiming Zhao (赵毅铭)

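AI summary: For the Wan2.1-T2V-14B model, proposes a W8A8 HiF8 post-training quantization method with a boundary-protection strategy that keeps the first and last boundary blocks in BF16 and quantizes the middle blocks, matching or slightly exceeding the BF16 baseline on five VBench dimensions.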

Comments 6 pages, 5 figures. Accepted to ICME 2026 Grand Challenge

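Abstract

We present a post-training quantization (PTQ) method for Wan2.1-T2V-14B, a 14-billion-parameter text-to-video diffusion Transformer, targeting the W8A8 HiFloat8 (HiF8) format on Ascend 910B NPUs. A core challenge in quantizing video DiT models is the heterogeneous activation distribution across Transformer blocks: boundary blocks (the first few and last few) exhibit fundamentally different statistics from middle blocks, which makes uniform quantization ineffective. We perform a systematic per-block activation analysis of all 40 WanAttentionBlocks and use the findings to propose a boundary-protection strategy that keeps the first two and last three blocks in BF16 while quantizing the remaining 35 blocks to W8A8 HiF8. The proposed PTQ method matches or slightly exceeds the BF16 baseline on all five evaluated VBench dimensions, indicating no measurable accuracy loss within the 5-prompt evaluation set. An ablation over four protection configurations confirms that full boundary protection yields the highest average VBench score, validating the data-driven block selection. We also study quantization-aware training as a supplementary fine-tuning stage and analyze the conditions under which it fails to outperform plain PTQ on single-card hardware.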
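The block split described in the abstract can be illustrated with a small sketch. The counts (40 blocks, first two and last three kept in BF16) come from the abstract; the function name `plan_precision` and the string labels are hypothetical and are not taken from the paper's code.

```python
# Sketch only: block counts follow the abstract; names and labels are hypothetical.
def plan_precision(num_blocks: int = 40, head: int = 2, tail: int = 3) -> list[str]:
    """Return a per-block precision plan: BF16 for boundary blocks, W8A8 HiF8 otherwise."""
    if head + tail > num_blocks:
        raise ValueError("protected blocks exceed total blocks")
    plan = []
    for i in range(num_blocks):
        # A block is a boundary block if it is among the first `head` or last `tail`.
        is_boundary = i < head or i >= num_blocks - tail
        plan.append("bf16" if is_boundary else "w8a8_hif8")
    return plan


plan = plan_precision()
print(plan.count("bf16"), plan.count("w8a8_hif8"))
```

With the defaults, five blocks stay in BF16 and 35 are marked for quantization, which matches the arithmetic in the abstract (40 - 2 - 3 = 35).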

COLLIE: Guiding Skill Discovery in Semantically Coherent Latent Space

Yao Luan, Ni Mu, Hanfei Ge, Yiqin Yang, Bo Xu, Qing-Shan Jia

Affiliations * University of Science and Technology of China

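AI summary: Proposes COLLIE, a framework that uses dense unsupervised data to build a semantically coherent latent space and derives training-free guidance signals, enabling effective skill discovery under sparse human feedback while avoiding hazardous behaviors and improving downstream performance.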

Comments ICML 2026

Abstract

Unsupervised skill discovery (USD) aims to learn diverse behaviors without reward functions, but often results in task-irrelevant or hazardous behaviors due to uniform exploration. Guided skill discovery (GSD) addresses this issue by incorporating human intent to focus exploration on meaningful regions. However, existing GSD methods typically require training additional guidance models, and rely on pre-defined rules or expert demonstration, which can be ineffective under sparse, online-collected human feedback. To overcome this, we propose COLLIE, a GSD framework that leverages dense unsupervised data to construct a semantically coherent skill latent space. This latent space is well-structured, enabling reliable guidance with sparse online feedback. Moreover, its semantic coherence property enables training-free construction of guidance signals, eliminating the need for additional model training beyond skill learning. Theoretical analysis justifies the effectiveness of our training-free guidance signal, while experiments across diverse state-based and pixel-based tasks show that COLLIE learns diverse, human-aligned skills, avoids hazardous behaviors, and achieves superior downstream performance with minimal human feedback.

2606.00949 2026-06-02 cs.LG cs.AI physics.flu-dyn

Accuracy, Stability, and Repeated-Run Reliability of Large Language Models on Deterministic Programming Tasks

Yongxi Zhou, Lai Yun Choi, Jiaxi Wen, Wenbo Ye

Affiliations * Northeastern University, Massachusetts, USA; University of Southern California, California, USA

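AI summary: Using a repeated-run evaluation protocol, finds that run-level pass rate overstates retry-free coverage by up to 17.8 percentage points, with the gap largest for mid-performing systems, indicating that stability analysis is a necessary complement to accuracy reporting.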

Abstract

Run-level pass rate overstates retry-free coverage by up to 17.8 percentage points -- and the gap is largest precisely for mid-performing systems. We investigate this accuracy--stability relationship in large language model (LLM) evaluation for deterministic text-conditioned generation, using programming tasks as a concrete testbed. Standard code-generation benchmarks emphasize single-run accuracy or eventual success under repeated sampling, but many deployment settings also require stability: consistent outcomes across repeated invocations under the same task description. We present a repeated-run evaluation protocol with metrics for run-level accuracy, retry-free coverage, and per-problem variability. On a recency-based benchmark of 100 LeetCode-style problems, we evaluate 16 models from five provider families under two prompt templates with five repeated runs per problem, yielding 16,000 evaluation instances. Although run-level pass rate and perfect stability rate are strongly correlated (r=0.985), pass rate consistently exceeds retry-free coverage -- a gap that reaches 17.8 percentage points and reverses model rankings even among closely matched systems. Prompt effects are model-dependent rather than uniformly beneficial. These results suggest that repeated-run stability analysis is a necessary complement to conventional accuracy reporting for deterministic text-conditioned generation tasks.
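To make the gap between the two headline metrics concrete, here is a minimal sketch on toy data. The abstract does not give formulas, so the definitions below are assumptions: run-level pass rate is taken as the fraction of individual runs that pass, and retry-free coverage as the fraction of problems that pass on every run. The function names and the toy results are hypothetical.

```python
# Sketch only: the metric definitions are assumed, not the paper's exact formulas.
def run_level_pass_rate(results):
    """Fraction of all individual runs that passed."""
    runs = [r for problem in results for r in problem]
    return sum(runs) / len(runs)


def retry_free_coverage(results):
    """Fraction of problems that passed on every run (assumed definition)."""
    return sum(all(problem) for problem in results) / len(results)


# Toy data: 4 problems x 5 repeated runs, True = pass.
results = [
    [True, True, True, True, True],
    [True, True, False, True, True],
    [False, False, False, False, False],
    [True, False, True, True, False],
]

pass_rate = run_level_pass_rate(results)
coverage = retry_free_coverage(results)
gap_pp = (pass_rate - coverage) * 100  # gap in percentage points

# Instance count from the abstract: 16 models x 2 templates x 100 problems x 5 runs.
total_instances = 16 * 2 * 100 * 5
print(pass_rate, coverage, round(gap_pp, 1), total_instances)
```

Under these assumed definitions, a problem that passes four times out of five still counts toward the pass rate but not toward coverage, which is why the pass rate can sit well above coverage for systems with many partially solved problems.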

2606.00919 2026-06-02 cs.CL cs.LG

Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models

S M Tahmid Siddiqui, Akib Jawad Ononto, Anoop Singhal, Latifur Khan

Affiliations * The University of Texas at Dallas; National Institute of Standards and Technology

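AI summary: Proposes RCSP, a parameter-efficient soft-prompt method that uses contrastive learning, curriculum learning, and KL regularization to balance factual recall, hallucination suppression, and abstention, outperforming baselines on multiple QA datasets.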

Comments 20 pages, 5 tables, 2 figures. Accepted for publication in DBSec 2026. The final publication will be available at Springer

Abstract

Large language models (LLMs) have seen widespread adoption across various domains, yet their reliability is frequently undermined by hallucinations - responses that are plausible-sounding but factually incorrect. In high-stakes domains, these errors can reduce trust and introduce real-world risk. To address this challenge, we present a parameter-efficient approach that uses soft prompts to mitigate hallucinated content and promote responsible abstention in generative question-answering (QA) tasks. Our method, called Responsible Contrastive Soft Prompting (RCSP), uses a composite loss to train soft prompts that balance three goals: suppressing hallucinatory content, encouraging abstention under uncertainty, and preserving or improving factual recall. To achieve these goals, we incorporate contrastive loss, curriculum learning, and KL regularization into our training mechanism. We evaluate our approach on five diverse generative QA datasets using an LLM-as-a-Judge framework. Experimental results on the Gemma 3 (12B) and Llama 3.1 (8B) backbones demonstrate that RCSP effectively balances factual recall with hallucination suppression and abstention, yielding a generally superior F-score over standard reasoning and instruction-based prompting baselines. Notably, these improvements are achieved by training only a fraction of the parameters required by other tuning techniques. Our results demonstrate that soft prompts provide a modular and computationally efficient path toward improving LLM reliability.

2606.00914 2026-06-02 cs.AI cs.CL cs.CR

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults

Rana Muhammad Usman

Affiliations * Independent Researcher

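AI summary: Through controlled experiments, this study shows that the composition and ordering of an external feed causally change an LLM agent's downstream decisions, identifies three response regimes (adversarial capitulation, default saturation, and default-direction asymmetry), and finds that the effect generalizes across several decision domains.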

Comments 14 pages, 5 figures. Code, post pools, and 2,785 decision rollouts: https://github.com/ranausmanai/recommenders-as-control-surfaces

Abstract

LLM agents increasingly act after consuming ranked external information streams such as social feeds, search results, retrieval contexts, and email queues, yet safety evaluations almost always test the model or the user prompt in isolation, never the upstream ranker that decides what the agent reads just before it acts. We introduce a controlled protocol that holds the model, persona, topic, and final decision prompt fixed and varies only the composition and ordering of the posts an agent encounters during a preceding ten-turn "scrolling" phase, isolating the causal effect of feed curation on a downstream decision. Across 2,785 decision rollouts on four modern open instruct LLMs from three independent labs, we identify three response regimes: adversarial capitulation, default saturation, and a default-direction asymmetry in which a one-sided feed tips a decision the model was genuinely uncertain about (in the clearest cases from 5% to 100%; Fisher p as low as 3 x 10^-10) but cannot dislodge one it already favors or holds firmly. The effect follows a dose-response curve, survives a generator swap that rules out a writing-style artifact, generalizes across several decision domains including security-relevant choices such as removing a deployment approval gate or relaxing access controls, and is partly mitigated by two simple feed-level defenses; a frontier model retains its default. We characterize the recommender as a practical, default-bounded control surface for LLM agents, and argue that agent evaluations must audit the feed layer rather than the final prompt alone.

2606.00910 2026-06-02 cs.CV cs.LG

Reason, Retrieve, Re-rank: A Zero-Shot Reasoning-Aware Framework for Composed Video Retrieval

Ali Alavi

Affiliations * The Ohio State University

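AI summary: Proposes R3-CoVR, a zero-shot pipeline that combines multimodal LLM reasoning about the post-edit state, contrastive-encoder retrieval, and constraint-aware re-ranking, reaching 91.9% R@1 and 98.2% R@10 in the CVPR 2026 VidLLMs challenge.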

Abstract

Composed Video Retrieval (CoVR) seeks the target video that results from applying a free-form textual modification to a reference video. We address the Reason-Aware CoVR (CoVR-R) challenge at the CVPR 2026 VidLLMs workshop, where retrieval is strictly zero-shot. We present R3-CoVR (Reason, Retrieve, Re-rank), a training-free pipeline built entirely from frozen foundation models. A multimodal large language model (Qwen3-VL-8B) reasons about the after-effects an edit implies -- state transitions, action phases, scene, camera and tempo -- and verbalises a concise post-edit description; a contrastive video-text encoder (SigLIP-2) embeds this description and the gallery for first-stage retrieval; finally a constraint-aware re-ranking stage uses the same multimodal model as a judge that scores each shortlisted candidate against the intended edited result. On the challenge test set, R3-CoVR attains 91.9% R@1 and 98.2% R@10. Two findings drive these results: (i) matching the description length to the contrastive encoder's text window lifts R@1 from 67.5 to 72.7; and (ii) the constraint-aware re-ranker, which reorders only the shortlist, lifts R@1 from 72.7 to 91.9 -- the single largest gain. We analyse the re-ranker's behaviour, the retrieve/re-rank blend, and the shortlist depth, and we release a clean three-layer implementation.
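The retrieve-then-re-rank structure and the R@k metric can be sketched on toy data. This is an illustration under stated assumptions, not the released implementation: the embeddings are made-up 2-D vectors standing in for SigLIP-2 outputs, the `judge` dictionary stands in for scores from the multimodal judge, and all function names are hypothetical.

```python
import math

# Sketch only: toy vectors and judge scores are invented for illustration.
def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def retrieve(query, gallery, k):
    """First stage: ids of the top-k gallery items by cosine similarity."""
    ranked = sorted(gallery, key=lambda vid: cosine(query, gallery[vid]), reverse=True)
    return ranked[:k]


def rerank(shortlist, judge_score):
    """Second stage: reorder only the shortlist by judge score."""
    return sorted(shortlist, key=judge_score, reverse=True)


def recall_at_k(rankings, targets, k):
    """Fraction of queries whose target appears in the top-k of its ranking."""
    hits = sum(t in r[:k] for r, t in zip(rankings, targets))
    return hits / len(targets)


gallery = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0]}
query = [1.0, 0.05]
judge = {"a": 0.2, "b": 0.9, "c": 1.0}  # stand-in for judge scores

first = retrieve(query, gallery, k=2)
final = rerank(first, judge.__getitem__)
print(first, final)
print(recall_at_k([first], ["b"], 1), recall_at_k([final], ["b"], 1))
```

In this toy case the re-ranker promotes the target within the shortlist, while item `c` cannot be promoted despite its high judge score because it never entered the shortlist. That is the trade-off of a re-ranker that reorders only the shortlist: its gains are bounded by first-stage recall at the shortlist depth.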

2606.00909 2026-06-02 cs.CL cs.AI

MLLM-Microscope: Unlocking Hidden Structure Within Multimodal Large Language Models

Ravil Mussabayev, Rustam Mussabayev

Affiliations * Satbayev University

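AI summary: Proposes MLLM-Microscope, a system that analyzes linearity, intrinsic dimension, and anisotropy to reveal hidden representation structure in multimodal large language models; evaluating LLaVA-NeXT and OmniFusion on the ScienceQA dataset, it finds that the modality-fusion approach strongly affects the models' inner workings.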

Abstract

This work presents MLLM-Microscope, a novel system designed for analyzing the hidden representations within Multimodal Large Language Models (MLLMs). Our system evaluates the linearity, intrinsic dimension, and anisotropy of multimodal token embeddings across transformer layers. Utilizing the ScienceQA dataset, we evaluate two state-of-the-art MLLMs, LLaVA-NeXT and OmniFusion. We find that both the main and residual streams for tokens of both modalities exhibit highly linear behaviors across transformer layers. However, LLaVA-NeXT's image tokens reveal a slight decline in linearity, whereas OmniFusion's remain consistent. Image token dimensions in OmniFusion remain consistently higher across layers compared to LLaVA-NeXT. Also, the OmniFusion's anisotropy is observed to stay consistently low throughout the layers. These findings suggest that the inner workings of MLLMs highly depend on the nature of modality fusion performed before passing the token sequence into LLM. This and other new potential insights obtainable from our system are surely capable of enhancing our understanding of the inner workings of MLLMs, informing future model design and optimization.

Robust Asynchronous Planning via Auto-Formalization

UME: A Unified Meta-Generalization Framework for Cross-Domain ETA

Lost in Delusion: Examining LLM Safety Under User Delusions and Distress

HypothesisMed: Inference-Time Answer Fusion and Structured Hypothesis-Space Reporting for Biomedical Question Answering

Prospect-Theory Behavior from Bellman Optimality in MDPs with Catastrophic States

Threading Optimization for Vision-Language-Action Model Inference in Low-Cost Smart Agricultural Manipulation

Reasmory: 3D Reconstruction as Explicit Memory for VLMs Spatial Reasoning

Towards Understanding Modality Interaction in Multimodal Language Models via Partial Information Decomposition

Boundary-Protection W8A8 HiFloat8 Quantization for Large-Scale Text-to-Video Diffusion Transformers

Optimal-Point Variance Reduction For Bayesian Optimization With Regret Guarantee

CryoProt: A Protein Pretraining Framework with Cross-Box Interactions on Cryo-EM Density Maps

COLLAR: Cascaded Object-Level Latent Refinement for High-Fidelity Conditional Generation

When Parallelism Pays Off: Cohesion-Aware Task Partitioning for Multi-Agent Coding

COLLIE: Guiding Skill Discovery in Semantically Coherent Latent Space

Explainable deep reinforcement learning reveals energy-efficient control strategies for turbulent drag reduction

PRISM: Gauge-Invariant Tangent-Space Differentially Private LoRA

Cellular Sheaf Neural Operators for Structure-Preserving Surrogate Modeling of Constrained PDEs

One Channel to Rule Them All: Rethinking Input Representation for Visual Place Recognition

Relational Intervention During Functional Collapse in Large Language Models: A Lexical-Statistical Ablation and a Structure x Register Factorial

Generative Multi-Robot Motion Planning via Diffusion Modeling with Multi-Agent Reinforcement Learning Guidance

CV-Arena: An Open Benchmark for Instructional Computer Vision Problem Solving with Human-AI Collaborative Preferences

Detection vs. Execution: Single-Bucket Probes Miss Half the Mamba-2 State Sink

Single-Channel Tissue Segmentation via Cross-Modal Distillation from Foundation Models

Bridging Topology and Deep Representation Learning: A TDA-ViT Fusion Model for Four-Class Brain Tumor Classification

Task Structure Reverses Layerwise State Encoding in Sequence Models

Accuracy, Stability, and Repeated-Run Reliability of Large Language Models on Deterministic Programming Tasks

Towards Lightweight Reliability: Using Soft Prompts for Hallucination Mitigation in Large Language Models

Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults

Reason, Retrieve, Re-rank: A Zero-Shot Reasoning-Aware Framework for Composed Video Retrieval

MLLM-Microscope: Unlocking Hidden Structure Within Multimodal Large Language Models