arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2601.20844 2026-06-03 cs.LG cs.AI cs.IR

$\mathbb{R}^{2k}$ is Theoretically Large Enough for Embedding-based Top-$k$ Retrieval

$\mathbb{R}^{2k}$ 理论上足够大，用于基于嵌入的 Top-$k$ 检索

Zihao Wang, Hang Yin, Lihui Liu, Hanghang Tong, Yangqiu Song, Ginny Wong, Simon See

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结研究最小可嵌入维度（MED），证明对于内积、欧氏距离和余弦相似度，MED 为 Θ(k)，与 m 无关；进一步考虑鲁棒 MED（RMED），推导出可行性上限 ε_⋆(m,k)，并通过实验验证理论结果。

Comments v2: fix broken citation. v3: ICML 2026

详情

AI中文摘要

本文研究最小可嵌入维度（MED）：即存在 m 个对象向量配置的最小维度，使得每个大小至多为 k 的子集都能通过分数比较被精确检索。我们的结果表明，对于内积、欧氏距离和余弦相似度，MED 为 Θ(k)，与 m 无关。然后我们考虑鲁棒 MED（RMED），其中所有向量为单位范数，并且需要 ε 的分数间隙。我们推导出依赖于 m 的可行性上限 ε_⋆(m,k)=m/√(k(m-1)(m-k))，当 m≫k 时趋近于 1/√k，并且高斯质心构造在可行边界区域内给出了鲁棒见证的上界。在合成 top-2 检索上的数值模拟，使用循环多面体和质心查询优化，证实了我们的理论主张。在 LIMIT 和 LIMIT-small 数据集上的实验也表明，简单的基于嵌入的检索基线可能过拟合，并优于报告的单向量 LLM 嵌入基线。理论和实证结果都排除了精确几何容量不足作为障碍的可能性。

英文摘要

This paper studies the Minimal Embeddable Dimension (MED): the least dimension in which there exists a configuration of $m$ object vectors so that every subset of size at most $k$ is exactly retrieved by score comparison. Our result shows MED is $Θ(k)$, independent of $m$, for inner product, Euclidean distance, and cosine similarity. We then consider Robust MED (RMED), where all vectors are unit normed and an $ε$ gap of scores is required. We derive the $m$-dependent feasibility ceiling $ε_\star(m,k)=m/\sqrt{k(m-1)(m-k)}$, which approaches $1/\sqrt{k}$ when $m\gg k$, and a Gaussian centroid construction gives a robust witness upper bound in the feasible margin regime. Numerical simulation on synthetic top-$2$ retrieval with cyclic polytope and centroid query optimization confirmed our theoretical claims. Experiments on LIMIT and LIMIT-small datasets also show that simple embedding-based retrieval baselines can overfit and outperform the reported single-vector LLM embedding baseline. Both theoretical and empirical findings rule out the lack of exact geometric capacity as the obstruction.

URL PDF HTML ☆

赞 0 踩 0

2601.12247 2026-06-03 cs.CL cs.AI cs.LG

Plan, Verify and Fill: A Structured Parallel Decoding Approach for Diffusion Language Models

规划、验证与填充：扩散语言模型的结构化并行解码方法

Miao Li, Hanyang Jiang, Sikai Cheng, Hengyu Fu, Yuhang Cai, Baihe Huang, Tinghan Ye, Xuanzhou Chen, Pascal Van Hentenryck

发表机构 * Georgia Institute of Technology（佐治亚理工学院）； University of California, Berkeley（加州大学伯克利分校）； University of Michigan（密歇根大学）

AI总结提出Plan-Verify-Fill (PVF)方法，通过定量验证进行分层骨架规划，并采用验证协议实现结构化停止，在保持准确性的同时将函数评估次数减少高达65%。

2501.17377 2026-06-03 cs.LG cs.AI

ASAP: Exploiting the Satisficing Generalization Edge in Neural Combinatorial Optimization

ASAP：利用神经组合优化中的满意泛化优势

Han Fang, Paul Weng, Yutong Ban

发表机构 * GitHub

AI总结针对神经组合优化模型在分布偏移下的脆弱性，提出ASAP框架，通过将决策分解为提案和选择两阶段，并利用MAML增强在线适应能力，在3D-BPP、TSP和CVRP上提升了泛化性能。

Comments Accepted as poster of ICML-2026

详情

AI中文摘要

深度强化学习（DRL）已成为解决组合优化（CO）问题（如3D装箱问题（3D-BPP）、旅行商问题（TSP）或车辆路径问题（VRP））的一种有前景的方法，但这些神经求解器在面对分布偏移时往往表现出脆弱性。为了解决这个问题，我们揭示了满意泛化优势，并在理论和实验上进行了验证：识别一组有希望的行动本质上比选择单一最优行动更具泛化性。为了利用这一特性，我们提出了自适应选择后提案（ASAP），这是一个通用框架，将决策过程分解为两个不同的阶段：作为鲁棒过滤器的提案策略和作为可适应决策者的选择策略。这种架构使得一种高效的在线适应策略成为可能，其中选择策略可以在新分布上快速微调。具体地，我们引入了一个由模型无关元学习（MAML）增强的两阶段训练框架，以使模型能够快速适应。在3D-BPP、TSP和CVRP上的大量实验表明，ASAP提高了最先进基线的泛化能力，并在分布外实例上实现了优越的在线适应。

英文摘要

Deep Reinforcement Learning (DRL) has emerged as a promising approach for solving Combinatorial Optimization (CO) problems, such as the 3D Bin Packing Problem (3D-BPP), Traveling Salesman Problem (TSP), or Vehicle Routing Problem (VRP), but these neural solvers often exhibit brittleness when facing distribution shifts. To address this issue, we uncover the Satisficing Generalization Edge, which we validate both theoretically and experimentally: identifying a set of promising actions is inherently more generalizable than selecting the single optimal action. To exploit this property, we propose Adaptive Selection After Proposal (ASAP), a generic framework that decomposes the decision-making process into two distinct phases: a proposal policy that acts as a robust filter, and a selection policy as an adaptable decision maker. This architecture enables a highly effective online adaptation strategy where the selection policy can be rapidly fine-tuned on a new distribution. Concretely, we introduce a two-phase training framework enhanced by Model-Agnostic Meta-Learning (MAML) to prime the model for fast adaptation. Extensive experiments on 3D-BPP, TSP, and CVRP demonstrate that ASAP improves the generalization capability of state-of-the-art baselines and achieves superior online adaptation on out-of-distribution instances.

URL PDF HTML ☆

赞 0 踩 0

2601.17130 2026-06-03 cs.LG cs.CR

Impact of Graph Structure on Membership-Inference Risk for Graph Neural Networks

图结构对图神经网络成员推理风险的影响

Megha Khosla

发表机构 * Delft University of Technology（代尔夫特理工大学）

AI总结本文通过分析训练图构建和推理时边访问两个维度，研究了图结构如何影响图神经网络的节点级成员推理风险，并发现雪球采样会损害泛化能力，而推理时边访问能显著改变成员推理优势。

Comments Accepted for publication in PETS 2026

详情

AI中文摘要

图神经网络（GNN）广泛用于节点分类和链接预测等任务，但在敏感场景中的使用引发了训练数据泄露的担忧。先前关于GNN隐私泄露的工作大多借鉴非图领域的假设，忽视了图结构的作用。我们主张对隐私风险进行图特定的分析，并研究图结构如何影响节点级成员推理。我们形式化了节点-邻域元组上的成员推理（MI），并探讨了两个重要维度：（i）训练图构建和（ii）推理时边访问。我们比较了雪球采样（一种结构感知过程）与均匀随机节点采样用于构建训练图。实验表明，雪球采样由于其覆盖偏差，通常比随机采样更损害泛化能力。相反，在推理时允许访问训练-测试间边可以提高测试准确率，缩小训练-测试差距，同时也会对成员推理优势产生强烈且依赖于设置的影响。这些结果表明图结构直接塑造了隐私风险。我们进一步表明，泛化差距（以训练和测试节点之间的性能差异衡量）是成员推理风险的不完全代理：成员推理优势可以独立于该差距的变化而上升或下降，而推理时边访问通常起着关键作用。理论上，我们证明对于节点级任务，基于成员推理的标准隐私审计结果不能直接推广到归纳图设置，因为训练和测试节点在结构上相互依赖而非可互换。我们在https://github.com/PriXAI/GraphStructurePrivacyAnalysis-public 发布代码和数据。

英文摘要

Graph neural networks (GNNs) are widely used for tasks such as node classification and link prediction, but their use in sensitive settings raises concerns about training-data leakage. Prior work on privacy leakage in GNNs largely borrows assumptions from non-graph domains, overlooking the role of graph structure. We argue for a graph-specific analysis of privacy risk and study how graph structure affects node-level membership inference. We formalize membership inference (MI) over node-neighborhood tuples and investigate two important dimensions: (i) training-graph construction and (ii) inference-time edge access. We compare snowball sampling, a structure-aware procedure, with uniform random node sampling for constructing training graphs. Our experiments show that snowball sampling often hurts generalization relative to random sampling due to its coverage bias. In contrast, allowing access to inter-train-test edges at inference improves test accuracy, reduces the train-test gap, while also having a strong and setting-dependent effect on membership advantage. These results show that graph structure directly shapes privacy risk. We further show that the generalization gap, measured as the performance difference between training and test nodes, is an incomplete proxy for membership inference risk: membership advantage can rise or fall independently of changes in this gap, with inference-time edge access often playing a crucial role. Theoretically, we show that for node-level tasks, standard privacy-auditing results based on membership inference do not directly carry over to inductive graph settings, because training and test nodes are structurally dependent rather than interchangeable. We release the code and data at https://github.com/PriXAI/GraphStructurePrivacyAnalysis-public.

URL PDF HTML ☆

赞 0 踩 0

2601.14569 2026-06-03 cs.CL cs.LG

Social Caption: Evaluating Social Understanding in Multimodal Models

Social Caption: 评估多模态模型的社会理解能力

Leena Mathur, Bhaavanaa Thumu, Youssouf Kebe, Louis-Philippe Morency

发表机构 * School of Computer Science, Carnegie Mellon University（卡内基梅隆大学计算机科学学院）

AI总结提出基于交互理论的SOCIAL CAPTION框架，从社会推理、整体社会分析和定向社会分析三个维度评估多模态大语言模型的社会理解能力，并分析影响性能的因素。

Comments 25 pages, 10 figures

2505.16014 2026-06-03 cs.CL

Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains

无排序RAG：用选择替代重排序以应用于敏感领域

Yash Saxena, Ankur Padia, Mandar S Chaudhary, Kalpa Gunaratna, Srinivasan Parthasarathy, Manas Gaur

发表机构 * University of Washington（华盛顿大学）

AI总结提出METEORA框架，通过DPO微调LLM生成检索理由、统计肘部检测自适应截断和验证器过滤，在敏感领域实现可解释、高效且鲁棒的证据选择，无需重排序。

Comments ICML 2026

详情

AI中文摘要

部署在敏感领域的检索增强生成（RAG）系统必须提供可解释的证据选择，并针对数据投毒提供稳健的防护，然而当前方法依赖于不透明的基于相似性的检索，并采用任意的top-k截断，这些方法对其选择不提供任何解释，且容易受到对抗性操纵。METEORA通过三个组件用理由驱动的选择替代重排序：一个DPO微调的LLM，生成明确的检索理由；一个证据块选择引擎（ECSE），利用这些理由结合统计肘部检测进行自适应截断确定；以及一个验证器LLM，使用相同的理由过滤投毒证据。在六个数据集上，METEORA实现了召回率提高13.41%，精确率提高21.05%（无扩展），证据量减少80%，答案准确率提高33.34%，对抗鲁棒性提高4.4倍。人工评估证实了真正的可解释性（置信度3.64/5；86%的真实标签一致性），表明可解释性、效率和鲁棒性是协同而非竞争的目标。代码可在GitHub仓库https://github.com/YashSaxena21/METEORA中获取。

英文摘要

Retrieval-Augmented Generation (RAG) systems deployed in sensitive domains must provide interpretable evidence selection and robust safeguards against data poisoning, yet current approaches rely on opaque similarity-based retrieval with arbitrary top-k cutoffs that offer no explanation for their selections and remain vulnerable to adversarial manipulation. METEORA replaces re-ranking with rationale-driven selection via three components: a DPO-tuned LLM that generates explicit retrieval rationales, an Evidence Chunk Selection Engine (ECSE) that uses those rationales with statistical elbow detection for adaptive cutoff determination, and a Verifier LLM that filters poisoned evidence using the same rationales. Across six datasets, METEORA achieves 13.41% higher recall, 21.05% higher precision (without expansion), an 80% reduction in evidence volume, a 33.34% improvement in answer accuracy, and a 4.4x improvement in adversarial robustness. Human evaluation confirms genuine interpretability (3.64/5 confidence; 86% ground-truth agreement), demonstrating that interpretability, efficiency, and robustness are synergistic rather than competing objectives. The code is available in the GitHub repository https://github.com/YashSaxena21/METEORA

URL PDF HTML ☆

赞 0 踩 0

2601.11667 2026-06-03 cs.LG cs.AI

Distill-then-Replace: Efficient Task-Specific Hybrid Attention Model Construction

Distill-then-Replace: 高效的任务特定混合注意力模型构建

Xiaojie Xia, Huigang Zhang, Chaoliang Zhong, Jun Sun, Yusuke Oishi

发表机构 * Fujitsu Research & Development Center CO., LTD（富士通研发中心有限公司）； Fujitsu Research, FUJITSU LTD（富士通研究所，富士通有限公司）

AI总结提出Distill-then-Replace (DtR)方法，通过逐块局部蒸馏和贪婪层替换策略，将预训练的全注意力模型高效转换为任务特定的混合注意力模型，无需重新训练或神经架构搜索。

详情

AI中文摘要

Transformer架构通过密集的全注意力机制实现了最先进的准确性，但其相对于序列长度的二次时间和内存复杂度限制了实际部署。线性注意力机制提供线性或接近线性的缩放，但通常会导致性能下降。集成全注意力和线性注意力层的混合模型有望在效率和表达能力之间取得平衡，但面临两个主要挑战：从头训练此类混合模型计算成本高，且手动设计注意力类型的最佳放置位置非常困难。我们提出DtR（Distill-then-Replace），首先通过逐块局部蒸馏将预训练的全注意力模块的权重转移到其线性注意力对应模块，然后应用贪婪层替换策略，迭代地用线性注意力块替换全注意力块，同时监控目标任务的验证性能。DtR在单次高效过程中生成任务特定的混合模型，无需昂贵的重新训练或神经架构搜索，并可应用于任何预训练的全注意力骨干网络以处理各种下游任务。

英文摘要

Transformer architectures deliver state-of-the-art accuracy via dense full-attention, but their quadratic time and memory complexity with respect to sequence length limits practical deployment. Linear attention mechanisms offer linear or near-linear scaling yet often incur performance degradation. Hybrid models that integrate full and linear attention layers promise a balance between efficiency and expressiveness, but face two major challenges: training such hybrid models from scratch is computationally expensive, and manually designing the optimal placement of attention types is highly nontrivial. We propose DtR (Distill-then-Replace), which first transfers weights from the pretrained full-attention modules to its linear attention counterparts through blockwise local distillation, and then applies a greedy layer replacement strategy that iteratively substitutes full attention blocks with linear ones while monitoring validation performance on the target task. DtR yields a task-specific hybrid model in a single efficient pass, without costly re-training or neural architecture search, and can be applied to any pretrained full-attention backbone for diverse downstream tasks.

URL PDF HTML ☆

赞 0 踩 0

2510.22491 2026-06-03 cs.LG cs.CE cs.CV

LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation

LAMP: 数据高效的线性仿射权重空间模型用于参数控制的3D形状生成与外推

Ghadi Nehme, Yanxia Zhang, Dule Shu, Matt Klenk, Faez Ahmed

发表机构 * GitHub

AI总结提出LAMP框架，通过过拟合共享初始化的符号距离函数解码器并对齐权重空间，以少量样本实现参数约束下的可控3D生成与外推，并引入线性失配安全度量确保可靠性。

详情

AI中文摘要

在显式参数约束下生成高保真3D几何体是工程设计的核心，但当前方法通常需要大型数据集，且无法在训练分布之外提供可靠控制。我们提出LAMP，一个数据高效的框架，用于可控和可解释的3D生成，该框架通过从共享初始化过拟合每个样本并对齐符号距离函数（SDF）解码器，然后在对齐的权重空间中通过求解参数约束的仿射混合问题来生成新设计。为了提高可靠性，我们提出一种线性失配安全度量，用于检测混合解码器何时离开有效的局部区域。我们在DrivAerNet++、BlendedNet以及额外的工业级车辆系列（包括跑车、SUV和敞篷车）上评估LAMP。LAMP能够以少至50个样本实现受控插值，在训练范围外安全外推高达100%，并在固定参数下进行性能引导优化，在外推、数据效率和参数保真度方面优于条件自编码器和深度网络插值（DNI）基线。我们的结果表明，LAMP推进了用于设计探索、数据集生成和性能驱动优化的可控、数据高效且安全的3D生成。

英文摘要

Generating high-fidelity 3D geometries under explicit parameter constraints is central to engineering design, yet current methods often require large datasets and fail to provide reliable control beyond the training distribution. We introduce LAMP, a data-efficient framework for controllable and interpretable 3D generation that aligns signed distance function (SDF) decoders by overfitting each exemplar from a shared initialization, then generates new designs by solving a parameter-constrained affine mixing problem in the aligned weight space. To improve reliability, we propose a linearity-mismatch safety metric that detects when mixed decoders leave the valid local regime. We evaluate LAMP on DrivAerNet++, BlendedNet, and additional industry-level vehicle families, including sports cars, SUVs, and convertibles. LAMP enables controlled interpolation with as few as 50 samples, safe extrapolation up to 100% beyond training ranges, and performance-guided optimization under fixed parameters, outperforming conditional autoencoder and Deep Network Interpolation (DNI) baselines in extrapolation, data efficiency, and parameter fidelity. Our results demonstrate that LAMP advances controllable, data-efficient, and safe 3D generation for design exploration, dataset generation, and performance-driven optimization.

URL PDF HTML ☆

赞 0 踩 0

2601.09869 2026-06-03 cs.AI cs.HC

A Scoping Review of the Ethical Perspectives on Anthropomorphising Large Language Model-Based Conversational Agents

拟人化大型语言模型对话代理的伦理视角：一项范围综述

Andrea Ferrario, Rasita Vinay, Matteo Casserini, Alessandro Facchini

发表机构 * Institute of Biomedical Ethics and History of Medicine, University of Zürich（苏黎世大学生物医学伦理与医学史研究所）； Dalle Molle Institute for Artificial Intelligence (IDSIA), SUPSI（瑞士SUPSI人工智能研究所）； ETH Zürich（苏黎世联邦理工学院）； Institute for Implementation Science in Health Care, University of Zürich（苏黎世大学医疗实施科学研究所）； Department of Management, Technology and Economics, ETH Zürich（苏黎世联邦理工学院管理、技术与经济系）； Dipartimento Tecnologie Innovative, SUPSI（SUPSI创新技术系）； Management in Networked and Digital Societies (MINDS) Department, Kozminski University（科兹明斯基大学网络化与数字化社会管理系）

AI总结本文通过范围综述，系统梳理了拟人化LLM对话代理的伦理挑战与机遇，包括概念基础、伦理问题及方法论，并提出了研究议程与设计治理建议。

Comments 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT'26)

详情

AI中文摘要

拟人化——将人类特质赋予非人类实体的现象——随着基于大型语言模型（LLM）的对话代理（CAs）的兴起而日益显著。与早期的聊天机器人不同，基于LLM的CA通常会生成互动和语言线索，例如第一人称自我指涉、认知和情感表达，实证研究表明这些可以增加参与度。另一方面，拟人化引发了伦理担忧，包括欺骗、过度依赖和剥削性关系框架，而一些作者认为拟人化互动可能支持自主性、福祉和包容性。尽管对该现象的兴趣日益增加，文献仍跨领域分散，并且在如何定义、操作化和规范性评估拟人化方面存在显著差异。本范围综述绘制了关于拟人化基于LLM的CA的伦理导向工作，覆盖五个数据库和三个预印本存储库。我们综合了（1）概念基础，（2）伦理挑战与机遇，以及（3）方法论方法。我们发现基于归因的定义趋于一致，但操作化存在显著差异，主要是风险导向的规范性框架，以及将观察到的互动效应与可操作的治理指导联系起来的实证工作有限。我们最后提出了研究议程和设计/治理建议，用于在基于LLM的对话代理中伦理地部署拟人化线索。

英文摘要

Anthropomorphisation -- the phenomenon whereby non-human entities are ascribed human-like qualities -- has become increasingly salient with the rise of large language model (LLM)-based conversational agents (CAs). Unlike earlier chatbots, LLM-based CAs routinely generate interactional and linguistic cues, such as first-person self-reference, epistemic and affective expressions that empirical work shows can increase engagement. On the other hand, anthropomorphisation raises ethical concerns, including deception, overreliance, and exploitative relationship framing, while some authors argue that anthropomorphic interaction may support autonomy, well-being, and inclusion. Despite increasing interest in the phenomenon, literature remains fragmented across domains and varies substantially in how it defines, operationalizes, and normatively evaluates anthropomorphisation. This scoping review maps ethically oriented work on anthropomorphising LLM-based CAs across five databases and three preprint repositories. We synthesize (1) conceptual foundations, (2) ethical challenges and opportunities, and (3) methodological approaches. We find convergence on attribution-based definitions but substantial divergence in operationalization, a predominantly risk-forward normative framing, and limited empirical work that links observed interaction effects to actionable governance guidance. We conclude with a research agenda and design/governance recommendations for ethically deploying anthropomorphic cues in LLM-based conversational agents.

URL PDF HTML ☆

赞 0 踩 0

2601.08173 2026-06-03 cs.AI

The Agent's First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios

Agent 的第一天：在工作场景中基准测试学习、探索和调度

Daocheng Fu, Jianbiao Mei, Rong Wu, Xuemeng Yang, Jia Xu, Ding Wang, Pinlong Cai, Yong Liu, Licheng Wen, Botian Shi

发表机构 * Fudan University（复旦大学）； Shanghai AI Laboratory（上海人工智能实验室）； Zhejiang University（浙江大学）； Shanghai Innovation Institute（上海创新研究院）； Shanghai Jiao Tong University（上海交通大学）

AI总结针对多模态大语言模型在动态工作场景中面临的任务调度、主动探索和持续学习三大挑战，提出动态评估环境 EvoEnv，实验表明现有 agent 在这些方面存在显著不足。

详情

AI中文摘要

多模态大语言模型（MLLMs）的快速发展推动了工作流自动化；然而，现有研究主要针对静态环境中的性能上限，忽视了随机真实世界部署的鲁棒性。我们识别出三个关键挑战：动态任务调度、不确定性下的主动探索以及从经验中持续学习。为弥补这一差距，我们引入了 \method{}，一个动态评估环境，模拟“实习生”agent 持续探索新环境。与传统基准不同，\method{} 从三个维度评估 agent：（1）针对具有不同优先级的流式任务的上下文感知调度；（2）通过主动探索谨慎获取信息以减少幻觉；（3）通过从基于规则的动态生成任务中提炼通用策略实现持续进化。实验表明，最先进的 agent 在动态环境中存在显著缺陷，尤其是在主动探索和持续学习方面。我们的工作建立了一个评估 agent 可靠性的框架，将评估从静态测试转向现实的、面向生产的场景。我们的代码可在 https://github.com/KnowledgeXLab/EvoEnv 获取。

英文摘要

The rapid evolution of Multi-modal Large Language Models (MLLMs) has advanced workflow automation; however, existing research mainly targets performance upper bounds in static environments, overlooking robustness for stochastic real-world deployment. We identify three key challenges: dynamic task scheduling, active exploration under uncertainty, and continuous learning from experience. To bridge this gap, we introduce \method{}, a dynamic evaluation environment that simulates a "trainee" agent continuously exploring a novel setting. Unlike traditional benchmarks, \method{} evaluates agents along three dimensions: (1) context-aware scheduling for streaming tasks with varying priorities; (2) prudent information acquisition to reduce hallucination via active exploration; and (3) continuous evolution by distilling generalized strategies from rule-based, dynamically generated tasks. Experiments show that cutting-edge agents have significant deficiencies in dynamic environments, especially in active exploration and continual learning. Our work establishes a framework for assessing agent reliability, shifting evaluation from static tests to realistic, production-oriented scenarios. Our codes are available at https://github.com/KnowledgeXLab/EvoEnv

URL PDF HTML ☆

赞 0 踩 0

2512.23234 2026-06-03 cs.CV cs.AI

Edge-Aware and Content-Adaptive Infrared Gas Leak Detection for Industrial Safety Monitoring

边缘感知与内容自适应的工业安全监控红外气体泄漏检测

Dongsheng Li, Tianli Ma, Siling Wang, Beibei Duan, Song Gao

发表机构 * School of Mechatronic Engineering, Xi’an Technological University（机械电子工程学院，西安理工大学）； School of Electronic Information Engineering, Xi’an Technological University（电子信息工程学院，西安理工大学）； Shaanxi Shanhua Coal Chemical Co., Ltd.（陕西神华化工有限公司）

AI总结针对红外气体羽流微弱、半透明且边界模糊的检测难题，提出一种边缘感知与内容自适应特征融合检测器（ECAF-Det），通过羽流导向的局部-全局特征增强、多尺度边缘感知模块和内容自适应稀疏路由路径聚合网络，在IIG和LangGas数据集上显著提升了检测精度。

详情

AI中文摘要

红外气体泄漏检测对于工业安全和环境监测至关重要，但由于气体羽流通常微弱、细小、半透明且边界模糊，自动检测仍然具有挑战性。本文提出了一种边缘感知与内容自适应特征融合检测器（ECAF-Det），用于杂乱热场景中的弱羽流检测。ECAF-Det集成了三个面向任务的设计：羽流导向的局部-全局特征增强块，用于保留精细边界线索并捕获长程上下文连续性；多尺度边缘感知模块，将方向梯度和相位一致性线索转化为分层边缘先验，用于边界敏感的羽流表示；以及内容自适应稀疏路由路径聚合网络，动态调节多尺度特征传播，以强调信息丰富的羽流特征并抑制冗余背景响应。在IIG数据集上的实验表明，ECAF-Det实现了29.8%的AP、84.3%的AP50和25.3%的小目标AP，分别比RT-DETR-R18基线提高了3.0、6.5和5.4个百分点，计算量为43.7 GFLOPs，参数量为14.9 M。在LangGas数据集上，ECAF-Det实现了36.3%的AP和68.5%的AP50，展示了其对不同红外气体羽流外观的泛化能力。主要的人工智能贡献在于边缘感知表示学习与内容自适应稀疏特征路由，用于弱红外羽流感知。所提出的检测器可作为工业气体泄漏监测中早期预警和远程巡检的视觉感知组件。

英文摘要

Infrared gas leak detection is important for industrial safety and environmental monitoring, but automatic detection remains challenging because gas plumes are often faint, small, semi-transparent, and weakly bounded. This paper proposes an Edge-Aware and Content-Adaptive Feature Fusion Detector (ECAF-Det) for weak-plume detection in cluttered thermal scenes. ECAF-Det integrates three task-oriented designs: a plume-oriented local-global feature enhancement block to preserve fine boundary cues and capture long-range contextual continuity; a multi-scale edge perception module that transforms directional gradient and phase-consistency cues into hierarchical edge priors for boundary-sensitive plume representation; and a content-adaptive sparse routing path aggregation network that dynamically regulates multi-scale feature propagation to emphasize informative plume features and suppress redundant background responses. Experiments on the IIG dataset show that ECAF-Det achieves 29.8% AP, 84.3% AP50, and 25.3% small-object AP, improving the RT-DETR-R18 baseline by 3.0, 6.5, and 5.4 percentage points, respectively, with 43.7 GFLOPs and 14.9 M parameters. On the LangGas dataset, ECAF-Det achieves 36.3% AP and 68.5% AP50, demonstrating its generalization to different infrared gas plume appearances. The main AI contribution is edge-aware representation learning with content-adaptive sparse feature routing for weak infrared plume perception. The proposed detector can serve as a visual perception component for early warning and remote inspection in industrial gas leak monitoring.

URL PDF HTML ☆

赞 0 踩 0

2504.04942 2026-06-03 cs.AI cs.LO

Lemmanaid: Neuro-Symbolic Lemma Conjecturing

Lemmanaid: 神经符号引理猜想

Yousef Alhessi, Sólrún Halla Einarsdóttir, George Granberry, Emily First, Moa Johansson, Sorin Lerner, Nicholas Smallbone

发表机构 * Department of Computer Science and Engineering University of California, San Diego, USA（计算机科学与工程系，加州大学圣地亚哥分校）； Department of Computer Science and Engineering Chalmers University of Technology & University of Gothenburg（计算机科学与工程系，查尔姆斯理工大学及哥德堡大学）

AI总结提出首个神经符号引理猜想工具LEMMANAID，通过类比数学理论生成引理，结合微调LLM与符号方法，在Isabelle测试集上优于纯神经和纯符号方法。

详情

AI中文摘要

数学家和计算机科学家越来越多地利用证明助手来形式化和检查复杂证明，这需要大量的专业知识。我们能否通过自动化猜想有用、有趣且新颖的引理来降低门槛？我们提出了首个神经符号引理猜想工具LEMMANAID，旨在通过类比数学理论来发现猜想。LEMMANAID使用微调后的LLM生成描述引理形状的引理模板，并使用符号方法填充细节。我们将LEMMANAID与直接微调生成引理的相同LLM以及完全符号的猜想方法进行了比较。在来自Isabelle的HOL库和形式化证明档案（AFP）的测试集上，LEMMANAID始终优于神经和符号方法。使用DeepSeek-coder-6.7B作为后端，LEMMANAID发现了50%（HOL）和29%（AFP）的金标准引理，当集成提示策略时，这一比例提高到55%和35%。在关于八元数的案例研究中，LEMMANAID发现了79%的金标准引理，而纯神经方法为62%，最先进的符号工具为23%。此外，在针对性比较中，LEMMANAID发现的金标准引理数量超过了Claude Opus 4.5和GPT-5.2。我们的结果表明，LEMMANAID能够在数学和计算机科学的复杂形式化中猜想出大量有趣的引理。

英文摘要

Mathematicians and computer scientists are increasingly leveraging proof assistants to formalize and check complex proofs, a task that demands substantial expertise. Can we lower the bar by automating the conjecturing of helpful, interesting and novel lemmas? We present the first neuro-symbolic lemma conjecturing tool, LEMMANAID, designed to discover conjectures by drawing analogies between mathematical theories. LEMMANAID uses a fine-tuned LLM to generate lemma templates that describe the shape of a lemma, and symbolic methods to fill in the details. We compare LEMMANAID against the same LLM fine-tuned to generate lemmas directly, as well as a fully symbolic conjecturing method. On test sets from Isabelle's HOL library and Archive of Formal Proofs (AFP), LEMMANAID consistently outperforms both neural and symbolic methods. Using DeepSeek-coder-6.7B as a backend, LEMMANAID discovers 50% (HOL) and 29% (AFP) of the gold standard lemmas, increasing to 55% and 35% when ensembling prompting strategies. In a case study on Octonions, LEMMANAID discovers 79% of the gold standard lemmas, compared to 62% for neural-only and 23% for the state of the art symbolic tool. Furthermore, in a targeted comparison, LEMMANAID discovers more gold standard lemmas than both Claude Opus 4.5 and GPT-5.2. Our results show that LEMMANAID can conjecture a significant number of interesting lemmas across complex formalizations in mathematics and computer science.

URL PDF HTML ☆

赞 0 踩 0

2512.10999 2026-06-03 cs.CL

KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering

KBQA-R1：强化大语言模型用于知识库问答

Xin Sun, Zhongqi Chen, Xing Zheng, Qiang Liu, Shu Wu, Bowen Song, Zilei Wang, Weiqiang Wang, Liang Wang

发表机构 * University of Science and Technology of China（中国科学技术大学）

AI总结提出KBQA-R1框架，通过强化学习（GRPO）将知识库问答建模为多轮决策过程，并引入参考拒绝采样（RRS）解决冷启动问题，在多个基准上取得最优性能。

Comments ICML 2026

详情

AI中文摘要

知识库问答（KBQA）挑战模型通过生成可执行的逻辑形式来弥合自然语言与严格知识图谱模式之间的差距。虽然大语言模型（LLMs）推动了这一领域的发展，但当前的方法常常陷入两种失败模式：要么生成未经模式验证的幻觉查询，要么表现出僵化的、基于模板的推理，模仿合成轨迹而没有真正理解环境。为了解决这些局限性，我们提出了 extbf{KBQA-R1}框架，该框架通过强化学习将范式从文本模仿转变为交互优化。将KBQA视为一个多轮决策过程，我们的模型学习使用动作列表导航知识库，利用组相对策略优化（GRPO）根据具体的执行反馈而非静态监督来优化其策略。此外，我们引入了 extbf{参考拒绝采样（RRS）}，一种数据合成方法，通过严格对齐推理轨迹与真实动作序列来解决冷启动问题。在WebQSP、GrailQA和GraphQuestions上的大量实验表明，KBQA-R1实现了最先进的性能，有效地将LLM推理锚定在可验证的执行中。

英文摘要

Knowledge Base Question Answering (KBQA) challenges models to bridge the gap between natural language and strict knowledge graph schemas by generating executable logical forms. While Large Language Models (LLMs) have advanced this field, current approaches often struggle with a dichotomy of failure: they either generate hallucinated queries without verifying schema existence or exhibit rigid, template-based reasoning that mimics synthesized traces without true comprehension of the environment. To address these limitations, we present \textbf{KBQA-R1}, a framework that shifts the paradigm from text imitation to interaction optimization via Reinforcement Learning. Treating KBQA as a multi-turn decision process, our model learns to navigate the knowledge base using a list of actions, leveraging Group Relative Policy Optimization (GRPO) to refine its strategies based on concrete execution feedback rather than static supervision. Furthermore, we introduce \textbf{Referenced Rejection Sampling (RRS)}, a data synthesis method that resolves cold-start challenges by strictly aligning reasoning traces with ground-truth action sequences. Extensive experiments on WebQSP, GrailQA, and GraphQuestions demonstrate that KBQA-R1 achieves state-of-the-art performance, effectively grounding LLM reasoning in verifiable execution.

URL PDF HTML ☆

赞 0 踩 0

2505.11785 2026-06-03 cs.LG cs.AI stat.ML

Improving Coverage in Combined Prediction Sets with Weighted p-values

通过加权p值提高组合预测集的覆盖范围

Gina Wong, Drew Prinster, Suchi Saria, Rama Chellappa, Anqi Liu

发表机构 * Johns Hopkins University（约翰霍普金斯大学）

AI总结提出一种加权聚合预测集的框架，通过为每个预测集分配权重，实现覆盖范围在$1-2α$与$1-α$之间的灵活控制，并推广到数据依赖权重，在混合专家模型等场景中保持有限样本有效性。

Journal ref AISTATS 2026

详情

AI中文摘要

共形预测通过用有效的预测集增强点预测来量化机器学习模型的不确定性。对于涉及多个试验、模型或数据源的复杂场景，可以聚合共形预测集以创建捕获整体不确定性的预测集，通常能提高精度。然而，聚合具有个体$1-α$覆盖率的多个预测集不可避免地削弱了整体保证，通常导致最坏情况覆盖率为$1-2α$。在这项工作中，我们提出了一个预测集加权聚合的框架，其中根据每个预测集的贡献为其分配权重。我们的框架提供了对集合聚合方式的灵活控制，实现了更紧的覆盖界限，根据权重的分布在组合模型的$1-2α$保证和单个模型的$1-α$保证之间插值。重要的是，我们的框架推广到数据依赖的权重，因为我们推导了一个加权聚合程序，即使权重依赖于数据，也能保持有限样本有效性。这一扩展使我们的框架广泛适用于权重被学习的场景，例如混合专家模型（MoE），并且我们通过在MoE设置中的实验证明，我们的方法实现了自适应覆盖。

英文摘要

Conformal prediction quantifies the uncertainty of machine learning models by augmenting point predictions with valid prediction sets. For complex scenarios involving multiple trials, models, or data sources, conformal prediction sets can be aggregated to create a prediction set that captures the overall uncertainty, often improving precision. However, aggregating multiple prediction sets with individual $1-α$ coverage inevitably weakens the overall guarantee, typically resulting in $1-2α$ worst-case coverage. In this work, we propose a framework for the weighted aggregation of prediction sets, where weights are assigned to each prediction set based on their contribution. Our framework offers flexible control over how the sets are aggregated, achieving tighter coverage bounds that interpolate between the $1-2α$ guarantee of the combined models and the $1-α$ guarantee of an individual model depending on the distribution of weights. Importantly, our framework generalizes to data-dependent weights, as we derive a procedure for weighted aggregation that maintains finite-sample validity even when the weights depend on the data. This extension makes our framework broadly applicable to settings where weights are learned, such as mixture-of-experts (MoE), and we demonstrate through experiments in the MoE setting that our methods achieve adaptive coverage.

URL PDF HTML ☆

赞 0 踩 0

2507.16003 2026-06-03 cs.CL cs.LG

Learning without training: The implicit dynamics of in-context learning

无需训练的学习：上下文学习的内在动态

Benoit Dherin, Michael Munn, Hanna Mazzawi, Michael Wunder, Javier Gonzalvo

发表机构 * Google（谷歌）

AI总结本文通过理论分析和实验证明，自注意力层与MLP的组合使Transformer块能够根据上下文隐式修改MLP权重，从而解释大语言模型在推理时无需权重更新即可进行上下文学习的机制。

2512.15427 2026-06-03 cs.LG cond-mat.stat-mech math.ST stat.TH

Statistics of Min-max Normalized Eigenvalues in Random Matrices

随机矩阵中最小-最大归一化特征值的统计

Hyakka Nakada, Shu Tanaka

发表机构 * Graduate School of Science and Technology（理工学研究科）； Keio University（庆应大学）； Department of Applied Physics and Physico-Informatics（应用物理与物理信息学系）； Keio University Sustainable Quantum Artificial Intelligence Center (KSQAIC)（庆应大学可持续量子人工智能中心）； Human Biology-Microbiome-Quantum Research Center (WPI-Bio2Q)（人生物学-微生物组-量子研究中心（WPI-Bio2Q））； Green Computing System Research Organization（绿色计算系统研究机构）

AI总结研究随机矩阵中最小-最大归一化特征值的统计性质，提出有效分布并推导累积分布的标度律和矩阵分解的残差误差。

Comments 4 pages, 4 figures

Journal ref Journal of the Physical Society of Japan, vol. 95, no. 6, pp. 064003, 2026

2512.13996 2026-06-03 cs.AI

DTop-p MoE: Sparsity-Controlled Dynamic Top-p MoE for Foundation Model Pre-training

DTop-p MoE：面向基础模型预训练的稀疏度可控动态Top-p MoE

Can Jin, Hongwu Peng, Mingcan Xiang, Qixin Zhang, Xiangchi Yuan, Amit Hasan, Ohi Dibua, Yifan Gong, Yan Kang, Dimitris N. Metaxas

发表机构 * University of Electronic Science and Technology of China（电子科技大学）

AI总结提出DTop-p动态路由机制，通过比例积分控制器学习Top-p概率阈值并采用动态路由归一化，在全局稀疏约束下实现层间专家选择，一致优于Top-k和固定Top-p基线，且FLOPs与Top-k MoE相当。

详情

AI中文摘要

稀疏混合专家架构对于高效扩展模型容量至关重要，但标准的Top-$k$路由施加了固定的稀疏模式，忽略了令牌难度和层特定计算需求的内在差异。Top-$p$路由更具自适应性，因为它选择专家直到其累积路由概率达到阈值，允许置信令牌使用更少的专家，而模糊令牌则招募更多专家。然而，我们证明，现有的具有固定全局概率阈值的朴素Top-$p$实现相比Top-$k$仅带来边际收益，存在超参数敏感性，并导致不可控的计算成本。在本文中，我们提出**DTop-$p$**，一种稀疏度可控的动态路由机制，它使用比例积分控制器学习Top-$p$概率阈值，并采用动态路由归一化来在全局稀疏约束下支持逐层专家选择。在大语言模型和扩散Transformer上的大量实验表明，**DTop-$p$**在匹配Top-$k$ MoE平均FLOPs的同时，始终优于Top-$k$和固定Top-$p$基线。我们的分析证实，**DTop-$p$**在专家粒度、总专家容量、模型大小和数据集大小方面表现出强大的可扩展性，为基础模型预训练提供了一个鲁棒且高效的MoE框架。

英文摘要

Sparse Mixture-of-Experts architectures are essential for scaling model capacity efficiently, yet the standard Top-$k$ routing imposes a rigid sparsity pattern that ignores the intrinsic variance in token difficulty and layer-specific computational needs. Top-$p$ routing is more adaptive because it selects experts until their cumulative routing probability reaches a threshold, allowing confident tokens to use fewer experts and ambiguous tokens to recruit more. However, we demonstrate that existing naive Top-$p$ implementations with fixed global probability thresholds provide only marginal gains over Top-$k$, suffer from hyperparameter sensitivity, and result in uncontrolled computational costs. In this paper, we propose **DTop-$p$**, a sparsity-controllable dynamic routing mechanism that learns the Top-$p$ probability threshold with a Proportional-Integral controller and uses dynamic routing normalization to support layer-wise expert selection under a global sparsity constraint. Extensive experiments on Large Language Models and Diffusion Transformers demonstrate that **DTop-$p$** consistently outperforms both Top-$k$ and fixed Top-$p$ baselines while matching the average FLOPs of Top-$k$ MoE. Our analysis confirms that **DTop-$p$** exhibits strong scaling properties across expert granularity, total expert capacity, model size, and dataset size, offering a robust and efficient MoE framework for foundation model pre-training.

URL PDF HTML ☆

赞 0 踩 0

2512.11213 2026-06-03 cs.AI cs.CL

FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration

FutureWeaver: 面向模块化协作的多智能体系统的测试时计算规划

Dongwon Jung, Peng Shi, Muhao Chen, Yi Zhang

发表机构 * University of California, Davis（加州大学戴维斯分校）； University of Waterloo（滑铁卢大学）； Greenshoe, Inc（Greenshoe公司）

AI总结提出FutureWeaver框架，通过双层次规划架构和自诱导协作模块，在固定预算下优化多智能体系统的测试时计算分配，显著提升协作性能。

详情

AI中文摘要

扩展测试时计算已被证明可以在无需额外训练的情况下显著提升大语言模型（LLM）的性能。然而，将这些技术扩展到多智能体系统仍然具有挑战性：现有方法缺乏原则性的机制来分配计算以实现有效协作、扩展协调本身，或在明确的预算约束下优化计算使用。为弥补这一差距，我们提出了FutureWeaver，一个在固定预算下规划和优化多智能体系统中测试时计算分配的框架。它引入了协作模块，形式化为模块化的、可调用的函数，封装了可复用的多智能体工作流，并通过自博弈反思从重复出现的交互模式中自动归纳。基于这些模块，它采用了一种双层次规划架构，联合执行短视动作选择和长远抽象前瞻，以在预算约束下优化推理轨迹。在复杂智能体基准上的实验表明，FutureWeaver在各种预算设置下始终优于基线，验证了其在推理时优化中多智能体协作的有效性。

英文摘要

Scaling test-time computation has been shown to significantly improve large language model (LLM) performance without additional training. However, extending these techniques to multi-agent systems remains challenging: existing approaches lack principled mechanisms for allocating compute to enable effective collaboration, scaling coordination itself, or optimizing compute usage under explicit budget constraints. To address this gap, we propose FutureWeaver, a framework for planning and optimizing test-time compute allocation in multi-agent systems under fixed budgets. It introduces collaboration modules, formalized as modular, callable functions that encapsulate reusable multi-agent workflows and are automatically induced via self-play reflection from recurring interaction patterns. Building on these modules, it employs \emph{a dual-level planning architecture} that jointly performs short-horizon action selection and long-horizon abstract lookahead to optimize inference trajectories under budget constraints. Experiments on complex agent benchmarks demonstrate that FutureWeaver consistently outperforms baselines across diverse budget settings, validating its effectiveness for multi-agent collaboration in inference-time optimization.

URL PDF HTML ☆

赞 0 踩 0

2512.07394 2026-06-03 cs.CV

Reconstructing Objects along Hand Interaction Timelines in Egocentric Video

在手交互时间线中重建第一人称视频中的物体

Zhifan Zhu, Siddhant Bansal, Shashank Tripathi, Dima Damen

发表机构 * University of Bristol, UK（英国布里斯托大学）； Max Planck Institute for Intelligent Systems, Tübingen, Germany（德国图宾根马克斯·普朗克智能系统研究所）

AI总结提出ROHIT任务，通过定义手交互时间线（HIT）并利用约束优化与传播（COP）框架，在无3D真值的情况下，从第一人称视频中重建刚性物体的姿态，显著提升重建精度。

Comments webpage: https://zhifanzhu.github.io/objects-along-hit

详情

AI中文摘要

我们引入了沿手交互时间线重建物体（ROHIT）的任务。首先从刚性物体的角度定义手交互时间线（HIT）。在HIT中，物体最初相对于场景静止，然后被手持并接触，其姿态发生变化。通常在使用过程中会有一个牢固的抓握，之后物体被释放，再次相对于场景静止。我们对HIT上的这些姿态约束进行建模，并提出沿HIT传播物体姿态，通过我们提出的约束优化与传播（COP）框架实现更优的重建。重要的是，我们关注稳定抓取的时间线——即手稳定地握住物体，在使用过程中保持恒定接触。这使得我们能够在没有3D真值的情况下，高效地标注、研究和评估视频中的物体重建。我们在两个第一人称数据集HOT3D和野外EPIC-Kitchens上评估了我们提出的任务ROHIT。在HOT3D中，我们整理了1.2K个稳定抓取片段。在EPIC-Kitchens中，我们标注了2.4K个稳定抓取片段，包括来自141个环境中日常交互视频的9个类别的390个物体实例。在没有3D真值的情况下，我们利用2D投影误差来评估重建。定量结果表明，COP通过约束姿态传播，将稳定抓取重建提高了6.2-11.3%，将HIT重建提高了高达24.5%。

英文摘要

We introduce the task of Reconstructing Objects along Hand Interaction Timelines (ROHIT). We first define the Hand Interaction Timeline (HIT) from a rigid object's perspective. In a HIT, an object is first static relative to the scene, then is held in hand following contact, where its pose changes. This is usually followed by a firm grip during use, before it is released to be static again w.r.t. to the scene. We model these pose constraints over the HIT, and propose to propagate the object's pose along the HIT enabling superior reconstruction using our proposed Constrained Optimisation and Propagation (COP) framework. Importantly, we focus on timelines with stable grasps - i.e. where the hand is stably holding an object, effectively maintaining constant contact during use. This allows us to efficiently annotate, study, and evaluate object reconstruction in videos without 3D ground truth. We evaluate our proposed task, ROHIT, over two egocentric datasets, HOT3D and in-the-wild EPIC-Kitchens. In HOT3D, we curate 1.2K clips of stable grasps. In EPIC-Kitchens, we annotate 2.4K clips of stable grasps including 390 object instances across 9 categories from videos of daily interactions in 141 environments. Without 3D ground truth, we utilise 2D projection error to assess the reconstruction. Quantitatively, COP improves stable grasp reconstruction by 6.2-11.3% and HIT reconstruction by up to 24.5% with constrained pose propagation.

URL PDF HTML ☆

赞 0 踩 0

2512.05530 2026-06-03 cs.AI

MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models

MIND：面向多模态大模型的多理由集成判别推理框架

Chuang Yu, Jinmiao Zhao, Mingxuan Zhao, Yunpeng Liu, Xiujun Shu, Yuanhao Feng, Bo Wang, Xiangyu Yue

发表机构 * Shenyang Institute of Automation, Chinese Academy of Sciences（中国科学院沈阳自动化研究所）； University of Chinese Academy of Sciences（中国科学院大学）； Peking University（北京大学）； MMLab, CUHK（CUHK多模态实验室）

AI总结针对多模态大语言模型在多理由语义建模、逻辑鲁棒性和抗误导方面的不足，提出MIND推理框架，通过“理解-反思-纠正”机制实现从被动模仿到主动判别推理的范式转变。

Comments Accepted to ICML 2026

详情

AI中文摘要

最近，多模态大语言模型（MLLMs）被广泛应用于推理任务。然而，它们存在多理由语义建模有限、逻辑鲁棒性不足以及易受误导线索影响的问题。因此，我们提出了一个多理由集成判别（MIND）推理框架，旨在赋予MLLMs类似人类的“理解-反思-纠正”认知能力，实现从基于被动模仿的推理到主动判别推理的范式演变。具体而言，我们引入了理由增强与判别（RAD）范式，提供了统一且可扩展的数据基础。同时，我们设计了渐进式两阶段纠正学习（P2CL）策略：第一阶段增强多理由正向学习，第二阶段实现主动逻辑判别与纠正。此外，为了缓解多理由语义空间中的表示纠缠，我们提出了多理由对比对齐（MCA）优化策略。大量实验表明，我们的MIND在多个公共数据集上达到了最先进的性能。我们的数据和代码可在https://github.com/YuChuang1205/MIND获取。

英文摘要

Recently, multimodal large language models (MLLMs) have been widely applied to reasoning tasks. However, they suffer from limited multi-rationale semantic modeling, insufficient logical robustness, and susceptibility to misleading cues. Therefore, we propose a Multi-rationale INtegrated Discriminative (MIND) reasoning framework, which is designed to endow MLLMs with human-like cognitive abilities of "Understand -> Rethink -> Correct", and achieves a paradigm evolution from passive imitation-based reasoning to active discriminative reasoning. Specifically, we introduce a Rationale Augmentation and Discrimination (RAD) paradigm, which provides a unified and extensible data foundation. Meanwhile, we design a Progressive Two-stage Correction Learning (P2CL) strategy. The first phase enhances multi-rationale positive learning, while the second phase enables active logic discrimination and correction. In addition, to mitigate representation entanglement in the multi-rationale semantic space, we propose a Multi-rationale Contrastive Alignment (MCA) optimization strategy. Extensive experiments show that our MIND achieves SOTA performance on multiple public datasets. Our data and code are available at https://github.com/YuChuang1205/MIND

URL PDF HTML ☆

赞 0 踩 0

2512.03627 2026-06-03 cs.AI

MemVerse: Multimodal Memory for Lifelong Learning Agents

MemVerse：面向终身学习智能体的多模态记忆

Junming Liu, Yifei Sun, Weihua Cheng, Haodong Lei, Yirong Chen, Licheng Wen, Xuemeng Yang, Daocheng Fu, Pinlong Cai, Nianchen Deng, Yi Yu, Shuyue Hu, Botian Shi, Ding Wang

发表机构 * Shanghai Artificial Intelligence Laboratory（上海人工智能实验室）

AI总结提出MemVerse，一种模型无关的即插即用记忆框架，通过分层检索记忆与参数化快速回忆结合，解决智能体在多模态交互中的灾难性遗忘和长程推理问题。

Comments 25 pages, 6 figures, 14 tables

详情

AI中文摘要

尽管大规模语言和视觉模型取得了快速进展，但AI智能体仍然存在一个根本性限制：它们无法记忆。没有可靠的记忆，智能体会灾难性地遗忘过去的经验，难以进行长程推理，并且在多模态或交互环境中无法连贯地运行。我们提出了MemVerse，一种模型无关的即插即用记忆框架，它将快速的参数化回忆与基于检索的分层记忆相结合，实现了可扩展和自适应的多模态智能。MemVerse维护短期记忆以处理近期上下文，同时将原始多模态经验转化为结构化的长期记忆，组织为分层知识图谱。这种设计支持持续整合、自适应遗忘和有界的记忆增长。为了满足实时需求，MemVerse引入了一种周期性蒸馏机制，将长期记忆中的关键知识压缩到参数化模型中，从而实现快速、可微的回忆，同时保持可解释性。大量实验表明，MemVerse显著提高了多模态推理和持续学习效率，使智能体能够在扩展的交互中记忆、适应和连贯推理。

英文摘要

Despite rapid progress in large-scale language and vision models, AI agents still suffer from a fundamental limitation: they cannot remember. Without reliable memory, agents catastrophically forget past experiences, struggle with long-horizon reasoning, and fail to operate coherently in multimodal or interactive environments. We introduce MemVerse, a model-agnostic, plug-and-play memory framework that bridges fast parametric recall with hierarchical retrieval-based memory, enabling scalable and adaptive multimodal intelligence. MemVerse maintains short-term memory for recent context while transforming raw multimodal experiences into structured long-term memories organized as hierarchical knowledge graphs. This design supports continual consolidation, adaptive forgetting, and bounded memory growth. To handle real-time demands, MemVerse introduces a periodic distillation mechanism that compresses essential knowledge from long-term memory into the parametric model, allowing fast, differentiable recall while preserving interpretability. Extensive experiments demonstrate that MemVerse significantly improves multimodal reasoning and continual learning efficiency, empowering agents to remember, adapt, and reason coherently across extended interactions.

URL PDF HTML ☆

赞 0 踩 0

2512.03019 2026-06-03 cs.LG cs.AI

Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge

分布校准的推理时间计算用于思考型LLM作为评判者

Hamid Dadkhahi, Firas Trabelsi, Parker Riley, Juraj Juraska, Mehdi Mirzazadeh

发表机构 * University of California, Berkeley（加州大学伯克利分校）； DeepMind（深Mind）； University of Cambridge（剑桥大学）

AI总结针对思考型大语言模型作为评判者时单样本噪声和聚合不一致问题，提出基于Bradley-Terry-Davidson模型的分布校准聚合方案，利用极性（非平局边际）和决定性（非平局率）区分微弱多数与强共识，显著降低MAE并提高成对准确率，匹配或超越人类评判者。

详情

AI中文摘要

用作成对偏好评判的思考型大语言模型在单样本层面仍存在噪声，常见的聚合规则（多数投票、软自一致性或基于指令的自聚合）在允许平局时不一致。我们研究了评估者的推理时间计算（ITC），该评估者为每个项目生成n个独立的思考-评分样本，并提出了一种原则性的、分布校准的聚合方案。我们的方法使用Bradley-Terry-Davidson公式对评分计数进行三向偏好建模，利用极性（非平局间的边际）和决定性（非平局率）来区分微弱多数与强共识。在各种评估基准上，与标准基线相比，我们的方法持续降低MAE并提高成对准确率，并且在针对人类共识元标签进行评估时，匹配或超过单个人类评判者。这些结果表明，精心分配ITC并使用分布感知方法进行聚合，可以将嘈杂的个体模型判断转化为可靠的评估评分。

英文摘要

Thinking Large Language Models (LLMs) used as judges for pairwise preferences remain noisy at the single-sample level, and common aggregation rules (majority vote, soft self-consistency, or instruction-based self-aggregation) are inconsistent when ties are allowed. We study inference-time compute (ITC) for evaluators that generate n independent thinking--rating samples per item, and propose a principled, distribution-calibrated aggregation scheme. Our method models three-way preferences with a Bradley-Terry-Davidson formulation on rating counts, leveraging both polarity (margin among non-ties) and decisiveness (non-tie rate) to distinguish narrow margins from strong consensus. Across various evaluation benchmarks, our approach consistently reduces MAE and increases pairwise accuracy versus standard baselines, and when evaluated against human-consensus meta-labels, matches or exceeds individual human raters. These results show that carefully allocating ITC and aggregating with distribution-aware methods turns noisy individual model judgments into reliable ratings for evaluation.

URL PDF HTML ☆

赞 0 踩 0

2511.21731 2026-06-03 cs.CL cs.AI

Identifying Quantum Structure in AI Language: Evidence for Evolutionary Convergence of Human and Artificial Cognition

识别AI语言中的量子结构：人类与人工智能认知进化趋同的证据

Diederik Aerts, Jonito Aerts Arguëlles, Lester Beltran, Suzette Geriente, Roberto Leporini, Massimiliano Sassoli de Bianchi, Sandro Sozzo

发表机构 * Center Leo Apostel for Interdisciplinary Studies, Vrije Universiteit Brussel (VUB)（利奥·阿波斯泰尔跨学科研究中心，布鲁塞尔自由大学）； Department of Economics, University of Bergamo（博洛尼亚大学经济系）； Department of Humanities and Cultural Heritage (DIUM) and Centre CQSCS, University of Udine（乌迪内大学人文与文化遗产系及CQSCS中心）

AI总结通过对大型语言模型进行认知测试，发现其概念组合中存在贝尔不等式显著违背和玻色-爱因斯坦统计，表明人类与人工智能在概念-语言领域均涌现非经典量子结构，支持认知进化趋同假说。

Journal ref Entropy 28, 622, 2026

详情

DOI: 10.3390/e28060622

AI中文摘要

我们展示了使用特定大型语言模型（LLMs）作为测试对象进行的概念组合认知测试结果。在第一个测试中，使用ChatGPT和Gemini，我们表明贝尔不等式被显著违背，这表明存在一个概率不满足Kolmogorov公理的“非经典概率模型”。在第二个测试中，同样使用ChatGPT和Gemini，我们在大型文本中的单词分布中识别出“玻色-爱因斯坦统计”的存在，而非直觉预期的“麦克斯韦-玻尔兹曼统计”。有趣的是，这些发现与之前在人类参与者认知测试和大规模语料库信息检索测试中获得的结果相呼应。综合来看，它们指向“概念-语言领域中非经典量子类结构的系统性涌现”，无论认知主体是人类还是人工智能。尽管LLMs因历史原因被归类为神经网络，但我们认为，在神经网络之上构建的向量空间的分布式语义结构中，发生了一种更本质的知识组织形式。正是这种承载意义的结构，促成了通过生物进化缓慢建立的人类认知与语言，与通过自我学习和训练快速涌现的LLM认知与语言之间的进化趋同现象。我们分析了支持上述假设的各种方面和实例。我们还提出了一个统一框架，解释了我们识别出的普遍量子组织意义。

英文摘要

We present the results of cognitive tests on conceptual combinations, performed using specific Large Language Models (LLMs) as test subjects. In the first test, performed with ChatGPT and Gemini, we show that Bell's inequalities are significantly violated, which indicates the presence of a 'non-classical probability model' with probabilities that do not satisfy Kolmogorov's axioms. In the second test, also performed using ChatGPT and Gemini, we identify the presence of 'Bose-Einstein statistics', rather than the intuitively expected 'Maxwell-Boltzmann statistics', in the distribution of the words contained in large-size texts. Interestingly, these findings mirror the results previously obtained in both cognitive tests with human participants and information retrieval tests on large corpora. Taken together, they point to the 'systematic emergence of non-classical quantum-like structures in conceptual-linguistic domains', regardless of whether the cognitive agent is human or artificial. Although LLMs are classified as neural networks for historical reasons, we believe that a more essential form of knowledge organization takes place in the distributive semantic structure of vector spaces built on top of the neural network. It is this meaning-bearing structure that lends itself to a phenomenon of evolutionary convergence between human cognition and language, slowly established through biological evolution, and LLM cognition and language, emerging much more rapidly as a result of self-learning and training. We analyze various aspects and examples that contain evidence supporting the above hypothesis. We also advance a unifying framework that explains the pervasive quantum organization of meaning that we identify.

URL PDF HTML ☆

赞 0 踩 0

2511.19995 2026-06-03 cs.CV

CREward: A Type-Specific Creativity Reward Model

CREward：一种类型特定的创造力奖励模型

Jiyeon Han, Ali Mahdavi-Amiri, Hao Zhang, Haedong Jeong

发表机构 * Simon Fraser University（西蒙弗雷泽大学）； Sogang University（首尔大学）

AI总结提出首个类型特定的创造力奖励模型CREward，通过几何、材质和纹理三个轴评估创造力，并应用于创造力评估、可解释创造力及创意样本获取。

Comments Accepted to CVPR 2026

详情

AI中文摘要

创造力是一种复杂现象。在表征和评估创造力时，将其视为单一的未分化量显得幼稚且不足。在这项工作中，我们学习了第一个类型特定的创造力奖励模型，称为CREward，它跨越三个创造力“轴”：几何、材质和纹理，使我们能够通过图像形成流程的视角来审视创造力。为了构建我们的奖励模型，我们首先进行人类基准评估，以捕捉人类对各种创意图像中每种类型的创造力感知。然后，我们分析人类判断与大型视觉语言模型（LVLMs）预测之间的相关性，确认LVLMs与人类感知高度一致。基于这一观察，我们收集LVLM生成的标签来训练我们的CREward模型，该模型适用于创意图像的评估和生成。我们探索了CREward的三个应用：创造力评估、可解释创造力以及创意样本获取，用于人类设计灵感和通过低秩适应引导创意生成。

英文摘要

Creativity is a complex phenomenon. When it comes to representing and assessing creativity, treating it as a single undifferentiated quantity would appear naive and underwhelming. In this work, we learn the \emph{first type-specific creativity reward model}, coined CREward, which spans three creativity ``axes," geometry, material, and texture, to allow us to view creativity through the lens of the image formation pipeline. To build our reward model, we first conduct a human benchmark evaluation to capture human perception of creativity for each type across various creative images. We then analyze the correlation between human judgments and predictions by large vision-language models (LVLMs), confirming that LVLMs exhibit strong alignment with human perception. Building on this observation, we collect LVLM-generated labels to train our CREward model that is applicable to both evaluation and generation of creative images. We explore three applications of CREward: creativity assessment, explainable creativity, and creative sample acquisition for both human design inspiration and guiding creative generation through low-rank adaptation.

URL PDF HTML ☆

赞 0 踩 0

2511.19959 2026-06-03 cs.LG cs.DC

ParaBlock: Communication-Computation Parallel Block Coordinate Federated Learning for Large Language Models

ParaBlock：面向大语言模型的通信-计算并行块坐标联邦学习

Yujia Wang, Yuanpu Cao, Jinghui Chen

发表机构 * College of Information Sciences and Technology（信息科学与技术学院）； Pennsylvania State University（宾夕法尼亚州立大学）

AI总结提出ParaBlock方法，通过并行化通信与计算线程，在联邦学习大语言模型时提升通信效率，并理论证明其收敛率与标准方法相同，实验验证其性能与效率优势。

Comments Accepted by TMLR

详情

AI中文摘要

联邦学习作为一种隐私保护训练范式已被广泛研究。最近，联邦块坐标下降方案在训练大规模模型中成为流行选择，因为它允许客户端仅本地训练模型的一个子集而非整个模型。然而，在大语言模型时代，即使单个块也可能包含大量参数，导致显著的通信延迟，特别是对于资源受限的客户端。为了解决联邦训练/微调大语言模型中的这一挑战，我们提出了ParaBlock，一种新颖的方法，它建立两个并行线程分别用于通信和计算，以提高通信效率。我们从理论上证明，所提出的ParaBlock实现了与标准联邦块坐标下降方法相同的收敛率。在通用指令遵循和数学推理任务上微调大语言模型的实证评估证实，ParaBlock不仅保持了强大的性能，而且显著提高了通信效率。

英文摘要

Federated learning (FL) has been extensively studied as a privacy-preserving training paradigm. Recently, federated block coordinate descent scheme has become a popular option in training large-scale models, as it allows clients to train only a subset of the model locally instead of the entire model. However, in the era of large language models (LLMs), even a single block can contain a significant number of parameters, posing substantial communication latency, particularly for resource-constrained clients. To address this challenge in federated training/fine-tuning LLMs, we propose ParaBlock, a novel approach that establishes two parallel threads for communication and computation to enhance communication efficiency. We theoretically prove that the proposed ParaBlock achieves the same convergence rate as the standard federated block coordinate descent methods. Empirical evaluations on fine-tuning LLMs on general instruction following and mathematical reasoning confirm that ParaBlock not only maintains strong performance but also significantly improves communication efficiency.

URL PDF HTML ☆

赞 0 踩 0

2503.07265 2026-06-03 cs.CV cs.AI cs.CL

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

WISE: 一种基于世界知识的文本到图像生成语义评估方法

Yuwei Niu, Munan Ning, Mengren Zheng, Weiyang Jin, Bin Lin, Peng Jin, Jiaqi Liao, Chaoran Feng, Fanqing Meng, Kunpeng Ning, Bin Zhu, Li Yuan

发表机构 * University of Science and Technology of China（中国科学技术大学）

AI总结针对现有文本到图像生成模型缺乏复杂语义理解和世界知识整合评估的问题，提出WISE基准，包含25个子领域的1000个精心设计的提示，并引入WiScore指标评估知识-图像对齐，实验表明当前模型在整合世界知识方面存在显著局限。

Comments Accepted to ICML 2026. We have also released an updated version of the benchmark, WISE_Verified. Please refer to https://github.com/PKU-YuanGroup/WISE for the latest version

详情

AI中文摘要

文本到图像（T2I）模型能够生成高质量的艺术创作和视觉内容。然而，现有研究和评估标准主要关注图像真实性和浅层的文本-图像对齐，缺乏对文本到图像生成中复杂语义理解和世界知识整合的全面评估。为解决这一挑战，我们提出了 extbf{WISE}，这是首个专门用于 extbf{W}orld Knowledge- extbf{I}nformed extbf{S}emantic extbf{E}valuation（世界知识引导的语义评估）的基准。WISE超越了简单的词-像素映射，通过1000个精心设计的提示，涵盖文化常识、时空推理和自然科学等25个子领域，对模型进行挑战。为了克服传统CLIP指标的局限性，我们引入了 extbf{WiScore}，一种用于评估知识-图像对齐的新型定量指标。通过对20个模型（10个专用T2I模型和10个统一多模态模型）在涵盖25个子领域的1000个结构化提示上进行全面测试，我们的发现揭示了它们在图像生成过程中有效整合和应用世界知识的能力存在显著局限，为下一代T2I模型增强知识整合与应用指明了关键路径。代码和数据可在\href{https://github.com/PKU-YuanGroup/WISE}{PKU-YuanGroup/WISE}获取。

英文摘要

Text-to-Image (T2I) models are capable of generating high-quality artistic creations and visual content. However, existing research and evaluation standards predominantly focus on image realism and shallow text-image alignment, lacking a comprehensive assessment of complex semantic understanding and world knowledge integration in text-to-image generation. To address this challenge, we propose \textbf{WISE}, the first benchmark specifically designed for \textbf{W}orld Knowledge-\textbf{I}nformed \textbf{S}emantic \textbf{E}valuation. WISE moves beyond simple word-pixel mapping by challenging models with 1000 meticulously crafted prompts across 25 subdomains in cultural common sense, spatio-temporal reasoning, and natural science. To overcome the limitations of traditional CLIP metric, we introduce \textbf{WiScore}, a novel quantitative metric for assessing knowledge-image alignment. Through comprehensive testing of 20 models (10 dedicated T2I models and 10 unified multimodal models) using 1,000 structured prompts spanning 25 subdomains, our findings reveal significant limitations in their ability to effectively integrate and apply world knowledge during image generation, highlighting critical pathways for enhancing knowledge incorporation and application in next-generation T2I models. Code and data are available at \href{https://github.com/PKU-YuanGroup/WISE}{PKU-YuanGroup/WISE}.

URL PDF HTML ☆

赞 0 踩 0

2511.13020 2026-06-03 cs.CV cs.AI

PHASE: Physiology-Aware Hyperspectral Reconstruction via Object-to-Human Domain Adaptation

PHASE: 通过对象到人体域适应的生理感知高光谱重建

Yufei Wen, Shuxing Zhong, Jingdan Kang, Yuting Zhang, Jintai Chen, Kaishun Wu

发表机构 * The Hong Kong University of Science and Technology (Guangzhou)（香港科技大学（广州））； South China University of Technology（华南理工大学）

AI总结针对现有高光谱重建方法在生理成像中失效的问题，提出PHASE范式，通过生理通道重新解释和生理约束对齐，实现从对象到人体的域适应，仅需1.5%标注数据即可显著提升重建质量。

Comments To KDD26

详情

AI中文摘要

尽管高光谱成像提供了无与伦比的无创生理洞察，但其笨重的硬件、缓慢的采集速度和监管负担严重限制了其临床可用性。一种自然的替代方案是从无处不在的RGB或CASSI测量中重建高光谱信息。然而，现有的为以对象为中心的场景开发的范式依赖于基于反射率的特征对齐，假设光谱相似性保持语义一致性。这一假设在生理成像中不成立，因为视觉上相似的RGB响应可能源于不同且纠缠的生理状态。这种不匹配促使从反射率对齐转向基于共享光-物质相互作用原理的生理感知表示学习——这一转变引入了来自跨通道语义偏移（C1）和基于RGB采集的不可逆信息丢失（C2）的基本挑战。因此，我们设计了PHASE，一种生理感知的高光谱重建范式，通过生理通道重新解释解耦跨通道生理语义，并通过生理约束对齐将重建限制在生理上合理的解，从根本上重新定义了对象到人体的迁移。在两种源到目标迁移协议下，PHASE仅需1.5%的标注监督，在SSIM上一致优于最先进方法最多+2.20，在SAM上最多-3.06。

英文摘要

Although hyperspectral imaging offers unparalleled non-invasive physiological insight, its bulky hardware, slow acquisition, and regulatory burden severely limit its clinical availability. A natural workaround is to reconstruct hyperspectral information from ubiquitous RGB or CASSI measurements. However, existing paradigms, developed for object-centric scenes, rely on reflectance-based feature alignment, assuming that spectral similarity preserves semantic meaning. This assumption breaks down in physiological imaging, where visually similar RGB responses may arise from distinct and entangled physiological states. This mismatch motivates a shift from reflectance alignment to physiology-aware representation learning, grounded in shared light-matter interaction principles -- a shift that introduces fundamental challenges from cross-channel semantic shifts (C1) and irreversible information loss in RGB-based acquisition (C2). We therefore design PHASE, a physiology-aware hyperspectral reconstruction paradigm that fundamentally redefines object-to-human transfer by disentangling cross-channel physiological semantics via Physiological Channel Reinterpretation and restricting reconstruction to physiologically plausible solutions through Physiologically Constrained Alignment. Under two source-to-target transfer protocols, PHASE consistently outperforms state-of-the-art methods by up to +2.20 SSIM and -3.06 in SAM with merely 1.5% labeled supervision.

URL PDF HTML ☆

赞 0 踩 0

2511.11346 2026-06-03 cs.LG

Fast and Expressive Multi-Byte Prediction with Probabilistic Circuits

基于概率电路的快速且富有表现力的多字节预测

Andreas Grivas, Lorenzo Loconte, Emile van Krieken, Piotr Nawrot, Yu Zhao, Euan Wielewski, Pasquale Minervini, Edoardo Ponti, Antonio Vergari

发表机构 * University of Cambridge（剑桥大学）

AI总结提出MTPC框架，利用概率电路编码未来令牌的联合分布，在字节级LLM中实现快速生成，同时保持表现力。

详情

AI中文摘要

多令牌预测（MTP）是一种显著加速大型语言模型（LLM）生成的突出策略，尤其是在字节级LLM中，这些模型无需分词器但速度极慢。然而，许多现有的MTP方法要么假设未来令牌之间独立，牺牲了表现力，要么在窗口内逐个生成令牌，增加了延迟。在这项工作中，我们在概率电路（PC）框架内研究了MTP中表现力与延迟之间的权衡。我们的框架MTPC允许通过选择电路架构来探索编码未来令牌联合分布的不同方式，推广了经典模型，如（层次）混合模型、隐马尔可夫模型和张量网络。我们通过改造现有的字节级LLM（如EvaByte）和字节化的子词模型（如Llama3.2 3B）展示了MTPC的有效性。实验表明，当与推测解码结合时，与具有独立性假设的MTP相比，MTPC显著加速了生成，同时保证保持原始验证器LLM的性能。我们还严格研究了在探索MTPC的可能参数化（如PC架构以及验证器和草稿LLM之间的部分层共享）时，表现力与延迟之间的最优权衡。

英文摘要

Multi-token prediction (MTP) is a prominent strategy to significantly speed up generation in large language models (LLMs), especially in byte-level LLMs, which are tokeniser-free but prohibitively slow. However, many existing MTP methods either assume independence between future tokens, sacrificing expressiveness, or generate tokens one at a time within the window, increasing latency. In this work, we investigate the trade-off between expressiveness and latency in MTP within the framework of probabilistic circuits (PCs). Our framework, MTPC, allows one to explore different ways to encode the joint distributions over future tokens by selecting circuit architectures, generalising classical models such as (hierarchical) mixture models, hidden Markov models, and tensor networks. We show the efficacy of MTPC by retrofitting existing byte-level LLMs, such as EvaByte, and byte-fied subword models, such as Llama3.2 3B. Our experiments show that, when combined with speculative decoding, MTPC substantially speeds up generation compared to MTP with independence assumptions, while guaranteeing to retain the performance of the original verifier LLM. We also rigorously study the optimal trade-off between expressiveness and latency when exploring the possible parameterisations of MTPC, such as PC architectures and partial layer sharing between the verifier and draft LLMs.

URL PDF HTML ☆

赞 0 踩 0

2508.21448 2026-06-03 cs.CL

When Models Refuse: Political Steerability and Feature Richness as Measures of Ideological Depth

当模型拒绝时：政治可操控性与特征丰富度作为意识形态深度的度量

Shariar Kabir

发表机构 * Bangladesh University of Engineering and Technology（孟加拉工程与技术大学）

AI总结本文提出意识形态深度概念，通过可操控性和稀疏自编码器测量的特征丰富度，研究大语言模型拒绝遵循良性指令是否源于能力缺陷而非安全规则。

详情

AI中文摘要

大型语言模型有时会拒绝遵循良性指令，例如拒绝论证某种政治立场或采用指定角色，这种拒绝通常被视为安全护栏在起作用。我们探究这些拒绝是否可能表明**能力缺陷**：模型缺乏从指令角度进行推理所需的内部表示。为此，我们引入**意识形态深度**，该属性包含两个组成部分：(i) 模型遵循政治指令而不*失败*的能力（可操控性），以及 (ii) 其内部政治表示的**特征丰富度**，通过稀疏自编码器测量。使用两个广泛使用的开源权重LLM作为候选，我们比较基于提示和激活操控的干预措施，并用公开可用的SAE探测政治特征。我们发现巨大且系统性的差异：在两个意识形态方向上更具可操控性的模型激活了约**7.3倍**更多的独特政治特征，而另一个模型则以增加拒绝作为回应。从前者模型中因果性地消融一小部分目标政治特征，重现了相同的特征贫乏行为并导致拒绝增加。综合这些结果表明，对良性提示的拒绝可能源于**能力缺陷**而非固定的安全规则，并且意识形态深度是LLM的一个可测量属性，有助于预测模型何时会拒绝。

英文摘要

Large language models (LLMs) sometimes refuse to follow benign instructions, such as declining to argue a political position or adopt a stated persona, and such refusals are commonly read as safety guardrails at work. We ask whether they can instead signal a **capability deficit**: a shortage of the internal representations a model needs to reason from the instructed perspective. To investigate, we introduce **ideological depth**, a property with two components: (i) a model's ability to follow political instructions without *failure* (steerability), and (ii) the **feature richness** of its internal political representations, measured with sparse autoencoders (SAEs). Using two widely used openweight LLMs as candidates, we compare interventions based on prompts and activation-steering, and probe political features with publicly available SAEs. We find large, systematic differences: a model that is more steerable in both ideological directions activates **~7.3x** more distinct political features, while the other model instead responds with increased refusals. Causally ablating a small, targeted set of political features from the former model reproduces the same feature-poor behavior and drives up refusals. Together, these results indicate that refusals on benign prompts can arise from **capability deficits** rather than fixed safety rules, and that ideological depth is a measurable property of LLMs that helps predict when a model will refuse.

URL PDF HTML ☆

赞 0 踩 0

2511.10055 2026-06-03 cs.CV

Physical Plausibility Reasoning via HCM-GRPO: Empowering Compact Model for Superior Performance

通过 HCM-GRPO 实现物理合理性推理：赋能紧凑模型以获得卓越性能

Zhiyuan Hu, Zheng Sun, Yi Wei, Long Yu

发表机构 * Tsinghua University（清华大学）； Alibaba Health Information Technology Limited（阿里巴巴健康信息技术有限公司）

AI总结针对多模态大语言模型在物理合理性推理中数据缺乏和推理能力弱的问题，提出包含大规模数据集和 HCM-GRPO 方法的完整解决方案，以紧凑模型超越大规模开源和闭源模型。

详情

AI中文摘要

近年来，图像生成的性能得到了显著提升。然而，图像筛选的研究很少，且由于缺乏数据以及多模态大语言模型（MLLMs）中物理合理性推理能力较弱，其性能并不令人满意。在这项工作中，我们提出了一个完整的解决方案，从数据和方法论两方面解决这些问题。在数据方面，我们收集了一个包含超过 128k 样本的综合图像筛选数据集，涉及约 640k 张图像。每个样本由一张原始图像和四张生成图像组成。该数据集从四个方面评估物理合理性推理能力：外观变形、物理阴影、放置布局和扩展合理性。关于数据标注，我们研究了多种方法，包括纯人工、全自动和答案驱动的标注，以最经济的方式获取高质量的思维链（CoT）数据。在方法论上，我们将一种硬案例挖掘（HCM）策略与动态比例准确率（DPA）奖励引入到组相对策略优化（GRPO）框架中，称为 HCM-GRPO。与原始 GRPO 相比，这种增强方法展示了更优越的物理合理性推理能力。我们的实验结果表明，即使是像 GPT5.2 和 Gemini3-Pro 这样的最先进的闭源 MLLMs，在物理合理性推理方面也表现出不令人满意的性能。相比之下，通过利用 HCM-GRPO，我们能够以更小的模型超越大规模开源和领先闭源模型的分数。

英文摘要

The performance of image generation has been significantly improved in recent years. However, the study of image screening is rare, and its performance with Multimodal Large Language Models (MLLMs) is unsatisfactory due to the lack of data and the weak physical plausibility reasoning ability in MLLMs. In this work, we propose a complete solution to address these problems in terms of data and methodology. For data, we collect a comprehensive image screening dataset with over 128k samples, comprising about 640k images. Each sample consists of an original image and four generated images. The dataset evaluates the physical plausibility reasoning ability under four aspects: appearance deformation, physical shadow, placement layout, and extension rationality. Regarding data annotation, we investigate multiple approaches, including purely manual, fully automated, and answer-driven annotations, to acquire high-quality chains of thought (CoT) data in the most cost-effective manner. Methodologically, we introduce a Hard Cases Mining (HCM) strategy with a Dynamic Proportional Accuracy (DPA) reward into the Group Relative Policy Optimization (GRPO) framework, called HCM-GRPO. This enhanced method demonstrates superior physical plausibility reasoning capabilities compared to the original GRPO. Our experimental results reveal that even state-of-the-art closed-source MLLMs, such as GPT5.2 and Gemini3-Pro, exhibit unsatisfactory performance in physical plausibility reasoning. In contrast, by leveraging the HCM-GRPO, we are able to surpass the scores of both large-scale open-source and leading closed-source models with a much smaller model.

URL PDF HTML ☆

赞 0 踩 0

AI 大模型

视觉与机器人

科学与医疗

$\mathbb{R}^{2k}$ is Theoretically Large Enough for Embedding-based Top-$k$ Retrieval

Plan, Verify and Fill: A Structured Parallel Decoding Approach for Diffusion Language Models

ASAP: Exploiting the Satisficing Generalization Edge in Neural Combinatorial Optimization

Impact of Graph Structure on Membership-Inference Risk for Graph Neural Networks

Social Caption: Evaluating Social Understanding in Multimodal Models

Ranking Free RAG: Replacing Re-ranking with Selection in RAG for Sensitive Domains

Distill-then-Replace: Efficient Task-Specific Hybrid Attention Model Construction

LAMP: Data-Efficient Linear Affine Weight-Space Models for Parameter-Controlled 3D Shape Generation and Extrapolation

A Scoping Review of the Ethical Perspectives on Anthropomorphising Large Language Model-Based Conversational Agents

The Agent's First Day: Benchmarking Learning, Exploration, and Scheduling in the Workplace Scenarios

Edge-Aware and Content-Adaptive Infrared Gas Leak Detection for Industrial Safety Monitoring

Lemmanaid: Neuro-Symbolic Lemma Conjecturing

KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering

Improving Coverage in Combined Prediction Sets with Weighted p-values

Learning without training: The implicit dynamics of in-context learning

Statistics of Min-max Normalized Eigenvalues in Random Matrices

DTop-p MoE: Sparsity-Controlled Dynamic Top-p MoE for Foundation Model Pre-training

FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration

Reconstructing Objects along Hand Interaction Timelines in Egocentric Video

MIND: Multi-rationale INtegrated Discriminative Reasoning Framework for Multi-modal Large Models

MemVerse: Multimodal Memory for Lifelong Learning Agents

Distribution-Calibrated Inference Time Compute for Thinking LLM-as-a-Judge

Identifying Quantum Structure in AI Language: Evidence for Evolutionary Convergence of Human and Artificial Cognition

CREward: A Type-Specific Creativity Reward Model

ParaBlock: Communication-Computation Parallel Block Coordinate Federated Learning for Large Language Models

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

PHASE: Physiology-Aware Hyperspectral Reconstruction via Object-to-Human Domain Adaptation

Fast and Expressive Multi-Byte Prediction with Probabilistic Circuits

When Models Refuse: Political Steerability and Feature Richness as Measures of Ideological Depth

Physical Plausibility Reasoning via HCM-GRPO: Empowering Compact Model for Superior Performance