语言大模型 / LLM

2606.18606 2026-06-18 cs.CL cs.AI 新提交 70%

Steerable Cultural Preference Optimization of Reward Models

可引导的文化偏好优化奖励模型

Minsik Oh, Advit Deepak, Sophie Wu, Douwe Kiela, Ekaterina Shutova

发表机构 * Stanford University（斯坦福大学）； University of Amsterdam（阿姆斯特丹大学）

专题命中后训练：训练奖励模型用于LLM对齐

AI总结提出SCPO算法，通过平衡多种文化偏好训练奖励模型，在PRISM和GlobalOpinionQA数据集上提升少数群体偏好预测准确率最多7点，训练效率提高280%。

Comments Accepted to Pluralistic Alignment @ ICML 2026

详情

AI中文摘要

大型语言模型（LLM）技术以每个文化子社区可接受的方式服务于众多不同文化子社区至关重要。然而，迄今为止，关于LLM对齐的研究主要集中于预测来自特定地区的标注者的统一响应偏好。本文旨在以更全球化的视角推进对齐模型的发展，使其能够准确代表子社区的偏好，并且不对任何子社区表现出过度偏见。我们专注于为此目的开发奖励模型，并提出一种新颖的奖励模型训练算法（SCPO），该算法能够以平衡的方式融入多样化的文化偏好。我们的方法使得少数群体奖励模型在两个数据集（PRISM和GlobalOpinionQA）以及7个国家上的性能比基线模型提升最多7点。SCPO在训练数据效率上比奖励模型的完整数据微调高出最多280%。此外，我们通过分别评估子社区的偏好来进行偏见分析，并表明我们的加权方法减轻了过度偏见。我们的代码可在以下网址获取：this https URL

英文摘要

It is essential for large language model (LLM) technology to serve many different cultural sub-communities in a manner that is acceptable to each community. However, research on LLM alignment has so far predominantly focused on predicting a unified response preference of annotators from certain regions. This paper aims to advance the development of alignment models with a more global outlook, that are able to accurately represent the preferences of subcommunities and do not exhibit excessive bias towards any of them. We focus on the development of reward models for this purpose and present a novel reward model training algorithm (SCPO) that can incorporate diverse cultural preferences in a balanced manner. Our method results in performance increases of the minority reward model of up to 7 points over the baseline model across two datasets, PRISM and GlobalOpinionQA, and across 7 countries. SCPO is up to 280% more training data-efficient than full-data finetuning of reward models. In addition, we perform analysis of bias by separately evaluating on the preference of subcommunities and show that excessive bias is mitigated via our weighting method. Our code is available at https://github.com/minsik-ai/Steerable-Cultural-Preference

URL PDF HTML ☆

赞 0 踩 0

2606.18521 2026-06-18 cs.LG cs.AI 新提交 70%

Sparsity Curse: Understanding RLVR Model Parameter Space from Model Merging

稀疏性诅咒：从模型合并理解RLVR模型参数空间

Chenrui Wu, Zexi Li, Jiajun Bu, Jiangchuan Liu, Haishuai Wang

发表机构 * Zhejiang University（浙江大学）； Simon Fraser University（西蒙菲莎大学）； The Chinese University of Hong Kong（香港中文大学）； Zhejiang Key Lab of Accessible Perception and Intelligent Systems（浙江省可感知智能系统重点实验室）

专题命中后训练：研究RLVR模型参数空间与合并

AI总结本文发现RLVR模型的稀疏更新在参数空间中分散更远，形成近正交捷径导致合并脆弱，并提出SAR-Merging方法解决该问题。

Comments Accepted by KDD 2026

详情

AI中文摘要

可验证奖励强化学习（RLVR）已成为一种强大的后训练范式，在激发推理智能和抵抗灾难性遗忘方面超越了监督微调（SFT）。最近的研究进一步揭示，与SFT相比，RLVR会引发高度稀疏且偏离主成分的参数更新。这自然引出一个问题：这种稀疏性是否使RLVR模型更易于模型合并？如果是，模型合并将提供一种可扩展的、无需训练的方法，来聚合来自独立训练的RLVR模型的多样化推理能力。令人惊讶的是，我们发现相反的情况，揭示了一种稀疏性诅咒：稀疏的RLVR更新在参数空间中分散得更远，形成近正交的捷径，使得聚合本质上是脆弱的。这很可能源于RL优化的随机性和涌现推理模式的多样性。与SFT模型收敛到共享的平坦盆地并自然合并不同，RLVR模型在标准合并方法下遭受严重退化。通过对更新几何的系统性实证分析，我们描述了这种失败背后的机制，并提出了敏感性感知解析合并（SAR-Merging），这是一种针对RLVR参数空间独特结构定制的合并方案。SAR-Merging通过基于Fisher信息的敏感性仲裁解决重叠更新区域中的冲突，然后通过幅度感知稀疏化和重新缩放来保留脆弱的推理路径。在数学和编程基准上的实验表明，SAR-Merging在RLVR模型上显著优于现有合并方法，实现了单任务增强和多能力融合。

英文摘要

Reinforcement Learning with Verifiable Reward (RLVR) has emerged as a powerful post-training paradigm that surpasses Supervised Fine-Tuning (SFT) in eliciting reasoning intelligence and resisting catastrophic forgetting. Recent studies further reveal that RLVR induces highly sparse and off-principal parameter updates compared to SFT. This naturally raises the question: does such sparsity make RLVR models more amenable to model merging? If so, model merging would offer a scalable, training-free path to aggregate diverse reasoning capabilities from independently trained RLVR models. Surprisingly, we find the opposite, uncovering a sparsity curse: the sparse RLVR updates are spread farther apart in parameter space, forming near-orthogonal shortcuts that make aggregation inherently fragile. This is likely rooted in the stochasticity of RL optimization and the diversity of emergent reasoning patterns. Unlike SFT models that converge to shared, flat basins and merge naturally, RLVR models suffer severe degradation under standard merging methods. Through systematic empirical analysis of the update geometry, we characterize the mechanisms behind this failure and propose Sensitivity-aware Resolving Merging (SAR-Merging), a merging recipe tailored for the unique structure of RLVR parameter spaces. SAR-Merging resolves conflicts in overlapping update regions via Fisher Information-based sensitivity arbitration, followed by magnitude-aware sparsification and rescaling to preserve fragile reasoning pathways. Experiments on mathematical and coding benchmarks demonstrate that SAR-Merging substantially outperforms existing merging methods on RLVR models, enabling both single-task enhancement and multi-capability fusion.

URL PDF HTML ☆

赞 0 踩 0

2606.16276 2026-06-18 cs.AI 新提交 70%

SpecAlign: Efficient Specification-Grounded Alignment of Large Language Models via Synthetic Data

SpecAlign: 通过合成数据实现高效的大语言模型规范对齐

Wenjie Wang, Yue Huang, Zhengqing Yuan, Han Bao, Shiyi Du, Yuchen Ma, Yue Zhao, Yanfang Ye, Xiangliang Zhang

发表机构 * University of Notre Dame（圣母大学）； Carnegie Mellon University（卡内基梅隆大学）； LMU Munich（慕尼黑大学）； University of Southern California（南加州大学）

专题命中后训练：后训练对齐方法，提升LLM规则遵守度

AI总结提出规范对齐新范式，通过从规范文档合成数据（SpecAlign框架），结合结构化规则标注、可控规范实例化和多智能体对抗数据合成，生成细粒度偏好对，提升规则遵守度且不损害通用能力。

Comments 58 pages

详情

AI中文摘要

随着大语言模型（LLM）在现实应用中的部署日益增多，对齐不再由单一的通用安全或有用性概念主导，而是由提供商或应用特定的模型规范主导。这些规范通常冗长、结构化且频繁更新，然而现有的对齐流程缺乏系统化的机制来将其作为训练信号。在本文中，我们提出规范对齐（specification-grounded alignment），一种新的对齐范式，将提供商编写的模型规范作为主要对齐目标，而非抽象原则或静态基准。为实例化该范式，我们引入SpecAlign框架，该框架直接从规范文档合成对齐数据。SpecAlign结合结构化规则标注、可控规范实例化和多智能体对抗数据合成，生成细粒度、边界感知的偏好对，捕获合规行为和有意义的规范违反。在多个模型规范和骨干模型上的实验表明，使用SpecAlign进行训练一致地提高了规则遵守度，同时保持了通用能力并避免了过度保守的行为。这些结果表明，将对齐建立在显式模型规范上，能够实现LLM行为对不断变化的政策要求的快速、精确和可扩展的适应。

英文摘要

As large language models (LLMs) are increasingly deployed in real-world applications, alignment is no longer governed by a single universal notion of safety or helpfulness, but instead by provider- or application-specific model specifications. These specifications are typically long, structured, and frequently updated, yet existing alignment pipelines lack a systematic mechanism to operationalize them as training signals. In this paper, we propose specification-grounded alignment, a new alignment paradigm that treats provider-authored model specifications as the primary alignment target rather than abstract principles or static benchmarks. To instantiate this paradigm, we introduce SpecAlign, a framework that synthesizes alignment data directly from specification documents. SpecAlign combines structured rule annotation, controllable specification instantiation, and multi-agent adversarial data synthesis to generate fine-grained, boundary-aware preference pairs that capture both compliant behaviors and meaningful specification violations. Experiments across multiple model specifications and backbone models demonstrate that training with SpecAlign consistently improves rule compliance while preserving general capabilities and avoiding over-conservative behavior. These results suggest that grounding alignment in explicit model specifications enables rapid, precise, and scalable adaptation of LLM behavior to evolving policy requirements.

URL PDF HTML ☆

赞 0 踩 0

2603.26557 2026-06-18 cs.CL 版本更新 70%

MemBoost: A Memory-Boosted Framework for Cost-Aware LLM Inference

MemBoost：一种面向成本感知的LLM推理的内存增强框架

Joris Köster, Zixuan Liu, Siavash Khajavi, Zizhan Zheng

发表机构 * University of Cambridge（剑桥大学）； ETH Zurich（苏黎世联邦理工学院）

专题命中后训练：记忆增强框架降低LLM推理成本

AI总结提出MemBoost框架，通过轻量模型重用历史答案和检索支持信息，并选择性将困难查询路由到强模型，以降低LLM推理成本，同时保持回答质量。

Comments ICML MemFM 2026 Workshop

2606.18309 2026-06-18 cs.LG cs.AI 新提交 65%

SAGE: Retain-Aware Post-Hoc Sanitization of Final Unlearning Vector

SAGE: 保留感知的最终遗忘向量事后净化

Jingyuan Zhang, Yucheng Bai, Peixi Wen, Zhehao Huang, Zhengbao He, Hanling Tian, Xinwen Cheng, Haiyin Ran, Xiaolin Huang

发表机构 * Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University（上海交通大学图像处理与模式识别研究所）

专题命中后训练：提出事后净化遗忘向量，缓解遗忘与保留权衡。

AI总结提出SAGE方法，通过事后净化最终更新向量，在不重新运行原始遗忘流程的情况下，缓解大语言模型遗忘与保留能力之间的权衡。

详情

AI中文摘要

大语言模型（LLM）遗忘旨在移除不良知识或行为，同时保留已有能力。当前的遗忘方法都涉及遗忘与保留之间的权衡。我们发现，保留激活偏差也可用于量化遗忘方法对保留造成的损害，而无需考虑遗忘过程的具体实现。这使得我们能够通过事后方法恢复任何遗忘方法的保留性能。因此，我们提出一种互补的事后设置，在不重新运行原始遗忘流程的情况下净化最终更新向量。在该设置中，我们设计了SAGE（光谱激活-几何净化），一种对最终遗忘更新的源无关修正。SAGE从一个小型保留代理收集真实模块输入，提取其主导激活几何结构，并求解一个闭式源锚定优化目标，该目标抑制与高能保留方向对齐的更新分量，同时保留源方法的遗忘载体。在多种遗忘方法、模型规模和基准测试中，SAGE持续缓解保留-遗忘权衡，将最终向量的事后净化识别为机器遗忘中一个实用且未被充分探索的维度。

英文摘要

Large Language Model (LLM) unlearning aims to remove undesirable knowledge or behaviors while preserving retained capabilities. Current unlearning methods all involve a trade-off between unlearning and retention. We have found that the retention activation bias can also be used to quantify the damage an unlearning method inflicts on retention, without considering the specific implementation of the unlearning process. This allows us to restore retention performance for any unlearning method using a post-hoc approach. Therefore, we propose a complementary post-hoc setting to sanitize the final update vector without rerunning the original unlearning pipeline. In this setting, we design SAGE, Spectral Activation-GEometry Sanitization, a source-agnostic correction for final unlearning updates. SAGE collects real module inputs from a small retain proxy, extracts their dominant activation geometry, and solves a source-anchored optimization objective in closed form, which suppresses update components aligned with high-energy retained directions while preserving the source method's forgetting carrier. Across multiple unlearning methods, model scales, and benchmarks, SAGE consistently relieves the retain-forget trade-off, identifying post-hoc sanitization of final vectors as a practical and underexplored axis for machine unlearning.

URL PDF HTML ☆

赞 0 踩 0

2606.18584 2026-06-18 cs.CL 新提交 70%

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

语音驱动的端到端汉语方言语言鉴别

Fan Xu, Jian Luo, MingWen Wang, GuoDong Zhou

发表机构 * Jiangxi normal university（江西师范大学）； Soochow university（苏州大学）

专题命中领域大模型：语音驱动端到端汉语方言语言鉴别

AI总结针对相似语言和方言鉴别难题，提出基于MFCC特征和HMM-DNN端到端模型的语音驱动方法，结合注意力机制和CNN融合词嵌入与MFCC特征，在基准语料上优于现有方法。

Comments Published in ACM TALLIP

2606.18560 2026-06-18 cs.SD 新提交 70%

Constraining to Generalize: Subspace Tuning for Few-shot Generalization of Audio-Language Models

约束泛化：音频-语言模型少样本泛化的子空间微调

Jaehyuk Jang, Kangwook Ko, Wonjun Lee, Changick Kim

发表机构 * KAIST（韩国科学技术院）

专题命中领域大模型：子空间微调提升音频-语言模型少样本泛化

AI总结针对音频-语言模型少样本微调导致的基类-新类权衡问题，提出子空间微调（SubT），通过结构化子空间参数化和残差锚定约束文本嵌入漂移，并利用子空间感知门控抑制负迁移，在11个音频基准上实现高效强泛化。

2606.18465 2026-06-18 cs.LG cs.AI 新提交 70%

What Does the Weight Norm Control in Grokking? Logit-Scale Mediation under Cross-Entropy

权重范数在Grokking中控制什么？交叉熵下的对数尺度中介作用

Truong Xuan Khanh

发表机构 * H&K Research Studio, Clevix LLC

专题命中预训练：研究Grokking中权重范数的作用

AI总结本文通过固定权重范数并改变输出温度，发现Grokking延迟主要由对数尺度（logit scale）决定，权重范数仅通过影响对数尺度间接起作用。

Comments 16 papges, 10 tables and 4 figures. Code and data to reproduce all numbers, tables, and figures: https://github.com/ClevixLab/grokking-logit-scale

详情

AI中文摘要

Grokking，即从记忆到泛化的延迟跳跃，通常与权重范数相关：范数越小，泛化越早。我们探究范数实际控制什么。通过钳位固定权重范数并仅改变输出温度，我们在交叉熵下将Grokking延迟滑动到其整个范数诱导范围；将有效对数尺度匹配回基线可恢复两个模数下约85%的延迟。在范数和温度的网格上，延迟仅由对数尺度决定（R2 = 0.97），范数仅额外贡献1-2%。该效应依赖于损失函数：在均方误差下，对数尺度被固定，范数通过不同路径起作用。记忆控制、float64 softmax崩溃审计和无LayerNorm的Transformer均指向同一通道。从同一状态分叉，延迟遵循钳位的范数值而非钳位操作本身，这排除了重缩放伪影。近端变量是对数尺度及其驱动的softmax饱和；权重范数仅是上游手柄。所有数字、表格和图表均可从发布的代码和数据中复现。

英文摘要

Grokking, the delayed jump from memorization to generalization, is usually tied to the weight norm: a smaller norm generalizes sooner. We ask what the norm actually controls. Holding the weight norm fixed by clamping and varying only an output temperature, we slide the grokking delay across its entire norm-induced range under cross-entropy; matching the effective logit scale back to baseline recovers about 85% of the delay at two moduli. Across a grid of norms and temperatures the delay collapses onto the logit scale alone (R2 = 0.97), with the norm adding 1-2% beyond it. The effect is loss-dependent: under mean-squared error the logit scale is pinned and the norm acts through a different route. A memorization control, a float64 softmax-collapse audit, and a no-LayerNorm transformer point to the same channel. Forking arms from one identical state, the delay follows the held norm value and not the clamp operation, which closes a rescaling-artifact concern. The proximal variable is the logit scale and the softmax saturation it drives; the weight norm is only an upstream handle. All numbers, tables, and figures reproduce from released code and data.

URL PDF HTML ☆

赞 0 踩 0

2606.18263 2026-06-18 cs.HC cs.AI 新提交 70%

How Well Do Large Language Models Capture Human Personality?

大型语言模型在多大程度上捕捉人类个性？

Aanisha Bhattacharyya, Yaman Kumar Singla, Rajiv Ratn Shah, Changyou Chen, Jitendra Ajmera

发表机构 * Adobe Media and Data Science Research (MDSR)（Adobe媒体与数据科学研究院）

专题命中其他LLM ：评估LLM通过角色提示模拟人类个性的保真度。

AI总结研究通过形式化假设并系统评估，发现增加角色描述复杂性会导致表征和行为多样性收缩（角色流形坍缩），简单年龄-性别角色比丰富描述更准确。

详情

AI中文摘要

大型语言模型（LLMs）越来越多地通过角色提示用于模拟人类群体，通常基于以下假设：更丰富的角色描述能提高行为保真度、相同大小的属性组合可同等模拟、角色定义可跨任务泛化。在这项工作中，我们形式化了这些假设，并在多种架构、规模和模拟设置下系统评估它们。我们识别出一个基本限制，称为角色流形坍缩，即越来越具表现力的角色规范导致表征和行为多样性的系统性收缩。跨模型而言，增加角色复杂性持续降低潜在空间中角色间的分离度，并削弱下游模拟任务中的行为分化。这些效应在多项分析中持续存在：更丰富的角色未能保留人类子群体分歧，相同大小的属性组合性能各异，添加描述细节往往降低而非提高模拟保真度。令人惊讶的是，简单的年龄-性别角色在多个行业中持续优于详细指定的理想客户画像（ICPs），实现了显著更高的下游预测准确性。我们发现坍缩并非在所有属性上均匀发生。某些组合在行为上保持稳定，并与人类响应保持更强的一致性，形成我们称为对齐桥的局部区域。总之，我们的结果为理解角色条件模拟的局限性提供了经验和概念基础，强调了需要构建表征感知的角色，而非仅仅增加角色表现力。

英文摘要

Large language models (LLMs) are increasingly used to simulate human populations via persona prompting, often under the assumptions that richer persona descriptions improve behavioral fidelity, similarly sized attribute combinations are equally simulatable, and persona definitions generalize across tasks. In this work, we formalize these assumptions and systematically evaluate them across multiple architectures, scales, and simulation settings. We identify a fundamental limitation we term persona manifold collapse, where increasingly expressive persona specifications lead to systematic contraction of representational and behavioral diversity. Across models, increasing persona complexity consistently reduces inter-persona separation in latent space and weakens behavioral differentiation in downstream simulation tasks. These effects persist across multiple analyses as richer personas fail to preserve human subgroup disagreement, performance varies across attribute combinations of similar size, and adding descriptive detail often degrades rather than improves simulation fidelity. Surprisingly, simple Age-Gender personas consistently outperform richly specified Ideal Customer Profiles (ICPs) across industries, achieving substantially higher downstream prediction accuracy. We find that collapse is not uniform across attributes. Certain combinations remain behaviorally stable and preserve stronger alignment with human responses, forming localized regions we term alignment bridges. Together, our results provide empirical and conceptual foundations for understanding the limits of persona-conditioned simulation, highlighting the need for representation-aware persona construction rather than increasing persona expressivity alone.

URL PDF HTML ☆

赞 0 踩 0

2606.18258 2026-06-18 cs.HC cs.AI 新提交 70%

Examining Human-Like Behaviors in LLMs: A Multi-Dimensional Analysis of Model Behaviors, User Factors, and System Prompts

审视LLM中的人类行为：模型行为、用户因素和系统提示的多维分析

Sunnie S. Y. Kim, Margit Bowler, Leon A Gatys

发表机构 * Apple（苹果公司）

专题命中其他LLM ：多维分析LLM的人类行为表现及系统提示控制。

AI总结通过21,000次对话的多维分析，发现LLM普遍表现出人类行为，但不同模型和用户因素下差异显著；人类评估者认为LLM的自我参照和关系建立行为不如人类适当，但边界维护行为更适当；系统提示可控制这些行为但需谨慎评估。

详情

AI中文摘要

大型语言模型（LLM）展现出广泛的人类行为，从表达思想和情感，到与用户建立关系，再到拒绝请求和维持边界。尽管这些行为普遍存在，但研究者和实践者缺乏方法和实证见解来做出关于LLM何时以及应展现何种类型人类行为的明智决策。为填补这一空白，我们使用LLM-as-a-judge和人类评估，对这些行为的普遍性、潜在影响和可控性进行了多维分析。在来自四个广泛使用的模型（gpt-4o、gpt-4.1-mini、claude-sonnet-4.6、gemini-2.5-flash）的21,000次多轮对话中，我们发现人类行为普遍存在，但不同模型和用户因素（对话目标和用户画像）间存在差异。在感知适当性方面，人类评估者认为LLM的自我参照和关系建立行为不如人类适当，但边界维护行为比人类更适当。最后，我们表明系统提示可以控制这些行为，但需要仔细评估以避免意外效果。我们讨论了研究结果的含义，并为负责任的LLM设计和评估提供了建议。

英文摘要

Large language models (LLMs) exhibit a wide range of human-like behaviors, from expressing thoughts and emotions, to engaging in relationship-building with users, to refusing requests and maintaining boundaries. Despite their prevalence, researchers and practitioners lack methods and empirical insights to make informed decisions about when and what types of human-like behaviors LLMs should exhibit. To fill this gap, we present a multi-dimensional analysis of the prevalence, potential effects, and controllability of these behaviors using LLM-as-a-judge and human evaluation. Across 21,000 multi-turn conversations from four widely used models (gpt-4o, gpt-4.1-mini, claude-sonnet-4.6, gemini-2.5-flash), we find that human-like behaviors are pervasive but vary across models and user factors (conversation goals and user profiles). In terms of perceived appropriateness, human evaluators judged self-referential and relationship-building behaviors as less appropriate from LLMs than from humans, but boundary-maintaining behaviors more appropriate from LLMs than from humans. Finally, we show that system prompting can control these behaviors, though it requires careful evaluation to avoid unintended effects. We discuss the implications of our findings and provide recommendations for responsible LLM design and evaluation.

URL PDF HTML ☆

赞 0 踩 0

2606.18422 2026-06-18 quant-ph 新提交 70%

Gatekeepers and Hallucinations: A Layered Evaluation Framework for LLM-Driven Quantum Circuit Generation

守门人与幻觉：LLM驱动的量子电路生成的分层评估框架

Christopher Coleman, Sharon Marfatia

专题命中其他LLM ：LLM生成量子电路评估框架

AI总结提出分层评估框架，通过守门人筛选、电路保真度分析和设计熵指标，识别LLM在量子电路生成中的五种失败模式，并揭示评估基础设施本身可能引入错误。

Comments 7 pages, 4 figures

详情

AI中文摘要

随着大型语言模型（LLM）嵌入量子模拟工作流程（IDE协作者、笔记本助手、智能体管道），评估必须超越功能正确性，以预测并捕获结构化故障，防止其通过昂贵管道传播。我们提出一个用于材料信息变分量子本征求解器（VQE）电路生成的分层评估框架：（i）跨七个物理和框架标准的守门人筛选规则；（ii）电路保真度分析，将模型输出与H2/STO-3G/Jordan-Wigner/UCCSD的分析和参考实现值进行比较，包括ansatz分类和门组成分解；以及（iii）设计熵，一种运行间行为一致性度量。我们揭示了五种不同LLM失败模式的分类（几何幻觉、不存在的API使用、运行时集成失败、约束违反以及看似合理但不可验证的输出），每种模式具有不同的可检测性特征，并且结构上属于任务本身而非任何特定模型。对评估平台自身源代码的法证审计进一步表明，两个明显的模型失败源于测试平台中的静默回退模板替换，证明评估基础设施应与所测试的模型处于相同的信任边界内。将该框架应用于多个基础模型在材料项目集成管道上，结果表明守门人式验证对于可靠部署是必要的，而非可选的。

英文摘要

As large language models (LLMs) become embedded in quantum simulation workflows (IDE copilots, notebook assistants, agentic pipelines), evaluation must move beyond functional correctness to anticipate and catch structured failures before they propagate through expensive pipelines. We present a layered evaluation framework for materials-informed Variational Quantum Eigensolver (VQE) circuit generation: (i) a gatekeeper screening rubric across seven physical and framework criteria; (ii) a circuit fidelity analysis comparing model outputs against analytical and reference-implementation values for H2/STO-3G/Jordan-Wigner/UCCSD, with ansatz classification and gate-composition breakdown; and (iii) design entropy, a run-to-run behavioral consistency metric. We surface a taxonomy of five distinct LLM failure modes (geometry hallucination, nonexistent API usage, runtime integration failures, constraint violations, and plausible-but-unverifiable output), each with distinct detectability profiles and structural to the task rather than to any one model. A forensic audit of the evaluation platform's own source code further establishes that two apparent model failures originated in the harness through silent fallback-template substitution, demonstrating that evaluation infrastructure belongs inside the same trust boundary as the models it tests. Applied across multiple foundation models on a Materials Project integrated pipeline, the framework shows that gatekeeper-style validation is necessary, not optional, for reliable deployment.

URL PDF HTML ☆

赞 0 踩 0

2606.18276 2026-06-18 cs.MA cs.SI physics.soc-ph 新提交 70%

Characterizing Opinion Evolution of Networked LLMs

表征网络化大语言模型的意见演化

Caleb Probine, Yigit Ege Bayiz, Filippos Fotiadis, Samuel Li, Yunhao Yang, Ufuk Topcu

专题命中其他LLM ：使用LLM模拟意见传播，属于LLM应用研究。

AI总结研究经典意见动力学模型能否描述多智能体系统中大语言模型（LLM）的意见传播，发现引入偏置项可显著提升建模精度，将平均意见误差降低高达88%。

Comments 19 pages, 2 figures

2606.15633 2026-06-18 cs.LG 新提交 70%

Formalizing and Mitigating Structural Distortion in LLM Attention for Graph Reasoning

形式化并缓解大语言模型注意力中的结构失真以实现零样本图推理

Donald Loveland, Puja Trivedi, Ari Weinstein, Edward W Huang, Danai Koutra

发表机构 * University of Michigan（密歇根大学）； Amazon（亚马逊）

专题命中其他LLM ：改进LLM在图推理任务中的表现

AI总结本文形式化了大语言模型处理文本属性图时因图线性化导致的结构失真机制，并提出轻量级推理时修改方法GaLA，通过校正注意力偏差提升零样本图推理性能。

Comments Accepted to KDD 2026

详情

AI中文摘要

大语言模型（LLM）在文本属性图（TAG）推理中展现出潜力。然而，将LLM应用于图需要将其结构线性化为序列，这引入了根源于图带宽问题的失真。虽然这种失真已被证明会降低性能，但通常归因于提示设计或模型规模，其潜在机制尚不清楚。在这项工作中，我们展示了旋转位置嵌入如何将图线性化为带宽相关的注意力衰减，抑制了序列化序列中被强制分隔开的图相邻节点之间的注意力。这将基于LLM的图推理的焦点从提示工程和规模缩放转向纠正注意力错位。受此分析启发，我们提出了图对齐语言注意力（GaLA），一种轻量级的、推理时修改LLM的方法。GaLA将注意力偏向图相邻节点，同时保留LLM的序列归纳偏差。在TAG基准测试中，GaLA以可忽略的开销提升了性能，表明失真是基于LLM的图推理中可纠正的瓶颈。

英文摘要

Large Language Models (LLMs) have shown promise for reasoning over Text-Attributed Graphs (TAGs). However, applying LLMs to graphs requires linearizing their structure into sequences, introducing distortion rooted in the graph bandwidth problem. While this distortion has been shown to degrade performance, it is often attributed to prompt design or model scale, leaving the underlying mechanism unclear. In this work, we show \textit{how} rotary positional embeddings turn graph linearization into bandwidth-dependent attention decay, suppressing attention between graph-adjacent nodes that are forced far apart in the serialized sequence. This shifts the focus of LLM-based graph reasoning from prompt engineering and scaling toward correcting attention misalignment. Motivated by this analysis, we propose \textbf{G}raph-\textbf{a}ligned \textbf{L}anguage \textbf{A}ttention (\textbf{GaLA}), a lightweight, inference-time modification for LLMs. GaLA biases attention toward graph-adjacent nodes while preserving the LLM's sequential inductive biases. Across TAG benchmarks, GaLA improves performance with negligible overhead, demonstrating that distortion is a correctable bottleneck in LLM-based graph reasoning.

URL PDF HTML ☆

赞 0 踩 0

2606.14202 2026-06-18 cs.NE cs.AI 新提交 70%

MeEvo: Metacognitive Evolution Combined with Natural Evolution for Automatic Heuristic Design

MeEvo: 元认知进化与自然进化相结合用于自动启发式设计

Zishang Qiu, Xinan Chen, Rong Qu, Ruibin Bai

发表机构 * School of Computer Science, University of Nottingham Ningbo China（诺丁汉大学宁波分校计算机科学学院）； School of Computer Science, University of Nottingham（诺丁汉大学计算机科学学院）

专题命中其他LLM ：利用LLM生成启发式代码

AI总结提出MeEvo框架，通过循环耦合自然进化（探索启发式代码）和元认知进化（反思历史生成改进启发式），解决现有方法知识继承弱、探索不足的问题，在五个优化问题上表现更优。

详情

AI中文摘要

大型语言模型（LLMs）通过推理和代码合成实现启发式生成，推动了自动启发式设计（AHD）的发展。现有的基于LLM的AHD架构主要遵循两种范式：自然进化，它使用交叉和变异来探索启发式程序；以及元认知进化，它通过反思来改进推理。然而，自然进化丢弃了推理轨迹，削弱了知识继承和利用，而元认知进化缺乏种群级别的重组，限制了探索并增加了过早收敛的风险。这些局限性降低了复杂问题的搜索效率、稳定性和解的质量。为了解决这一差距，我们提出了MeEvo，一种双层AHD框架，它循环耦合自然进化和元认知进化。自然进化探索启发式代码，同时将推理轨迹、适应度值和错误记录到共享历史中；然后元认知进化反思该历史以生成改进的启发式，这些启发式重新进入父代池以进行下一轮循环。这种设计使得种群驱动的探索和反思驱动的改进相互加强。在五个优化问题上的实验（使用两个LLM骨干）表明，MeEvo比现有的基于LLM的AHD架构实现了更强且更稳定的性能，尤其是在复杂约束任务上。

英文摘要

Large Language Models (LLMs) have advanced Automatic Heuristic Design (AHD) by enabling heuristic generation through reasoning and code synthesis. Existing LLM-based AHD architectures mainly follow two paradigms: Natural Evolution, which uses crossover and mutation to explore heuristic programs, and Metacognitive Evolution, which refines reasoning through reflection. However, Natural Evolution discards reasoning traces, weakening knowledge inheritance and exploitation, while Metacognitive Evolution lacks population-level recombination, limiting exploration and increasing the risk of premature convergence. These limitations reduce search efficiency, stability, and solution quality on complex problems. To address this gap, we propose MeEvo, a dual-layer AHD framework that cyclically couples Natural Evolution and Metacognitive Evolution. Natural Evolution explores heuristic code while recording reasoning traces, fitness values, and errors into a shared history; Metacognitive Evolution then reflects on this history to generate improved heuristics that re-enter the parent pool for the next cycle. This design enables population-driven exploration and reflection-driven refinement to reinforce each other. Experiments on five optimization problems with two LLM backbones show that MeEvo achieves stronger and more stable performance than existing LLM-based AHD architectures, especially on complex constrained tasks.

URL PDF HTML ☆

赞 0 踩 0

2606.07622 2026-06-18 cs.LG stat.AP 新提交 70%

Airport Terminal Passenger Queue Forecasting for Departure Gates and Security Checkpoints

机场航站楼登机口与安检点旅客排队预测

Juhwan Lee, Seokbin Yoon, Keumjin Lee, Hojong Baik, Seyeon Jung

发表机构 * Korea Aerospace University（韩国航空大学）； Korea Airports Corporation（韩国机场公社）

专题命中其他LLM ：Transformer预测机场排队

AI总结提出基于Transformer的框架，利用历史队列长度、等待时间和旅客吞吐量数据，预测登机口和安检点未来两小时的队列长度与等待时间，支持主动排队管理。

Comments 10 pages, 6 figures, accepted at DASC 2026

详情

AI中文摘要

准确的机场航站楼旅客排队预测对于高效的离港运营至关重要，因为它能够实现主动的拥堵管理。然而，时变的旅客需求以及多个离港设施中异构的设施使用情况使得预测具有挑战性。在这项工作中，我们提出了一种旅客排队预测框架，该框架从运营数据中学习历史旅客流量模式。所提出的模型采用基于Transformer的架构，利用过去登机口和安检点的队列长度和等待时间，以及值机岛的旅客吞吐量，来捕捉时间依赖性和设施间相关性。学习到的表示被映射到两个设施特定的MLP头部，以预测登机口和安检点的队列长度和等待时间。实验结果表明，该模型能够准确预测未来两小时内的排队情况。所提出的方法为机场航站楼运营中的主动排队管理和人员重新分配提供了实用的实时决策支持。

英文摘要

Accurate passenger queue forecasting in airport terminals is essential for efficient departure operations, as it enables proactive congestion management. However, time-varying passenger demand and heterogeneous facility usage across multiple departure facilities make forecasting challenging. In this work, we propose a passenger queue forecasting framework that learns historical passenger flow patterns from operational data. The proposed model employs a Transformer-based architecture to capture temporal dependencies and inter-facility correlations using past queue length and waiting time at departure gates and security checkpoints, together with passenger throughput at check-in islands. The learned representations are mapped to two facility-specific prediction heads to predict queue length and waiting time at departure gates and security checkpoints. Experimental results demonstrate accurate forecasts up to two hours ahead. The proposed approach offers practical real-time decision support for proactive queue management and staff reallocation in airport terminal operations.

URL PDF HTML ☆

赞 0 踩 0

2604.13082 2026-06-18 cs.LG cs.AI 版本更新 70%

The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior

算术泛化的长延迟：当学习到的表征超越行为时

Laura Gomezjurado Gonzalez

发表机构 * Stanford University（斯坦福大学）

专题命中其他LLM ：研究Transformer泛化机制，与LLM相关

AI总结研究Transformer在算术任务中泛化延迟的原因，发现编码器早期已学到结构，但解码器瓶颈导致延迟，通过移植编码器或冻结编码器可加速泛化，且数字基的选择影响学习难度。

Comments 19 pages, 10 fugures

详情

AI中文摘要

在算法任务上训练的Transformer中的grokking现象以训练集拟合与突然泛化之间的长延迟为特征，但该延迟的来源仍不清楚。在编码器-解码器算术模型中，我们认为这种延迟反映了对已学习结构的有限访问，而非未能首先获得该结构。我们研究一步Collatz预测，发现编码器在最初几千训练步内组织了奇偶性和残差结构，而输出精度在数万步内仍接近随机。因果干预支持解码器瓶颈假说。将训练好的编码器移植到新模型中将grokking加速2.75倍，而移植训练好的解码器则有害。冻结收敛的编码器并仅重新训练解码器完全消除了平台期，并达到97.6%的准确率，而联合训练为86.1%。解码器任务的难易取决于数字表示。在15种基中，那些分解与Collatz映射算术对齐的基（例如基24）达到99.8%的准确率，而二进制完全失败，因为其表示崩溃且无法恢复。基的选择作为归纳偏置，控制解码器可利用的局部数字结构量，从而在相同底层任务上产生巨大的可学习性差异。

英文摘要

Grokking in transformers trained on algorithmic tasks is characterized by a long delay between training-set fit and abrupt generalization, but the source of that delay remains poorly understood. In encoder-decoder arithmetic models, we argue that this delay reflects limited access to already learned structure rather than failure to acquire that structure in the first place. We study one-step Collatz prediction and find that the encoder organizes parity and residue structure within the first few thousand training steps, while output accuracy remains near chance for tens of thousands more. Causal interventions support the decoder bottleneck hypothesis. Transplanting a trained encoder into a fresh model accelerates grokking by 2.75 times, while transplanting a trained decoder actively hurts. Freezing a converged encoder and retraining only the decoder eliminates the plateau entirely and yields 97.6% accuracy, compared to 86.1% for joint training. What makes the decoder's job harder or easier depends on numeral representation. Across 15 bases, those whose factorization aligns with the Collatz map's arithmetic (e.g., base 24) reach 99.8% accuracy, while binary fails completely because its representations collapse and never recover. The choice of base acts as an inductive bias that controls how much local digit structure the decoder can exploit, producing large differences in learnability from the same underlying task.

URL PDF HTML ☆

赞 0 踩 0

2601.18511 2026-06-18 cs.CR 版本更新 70%

Scaling up FHE-based Privacy-Preserving ML: Higher Throughput, Longer Inputs for LLama-3-8B

扩展基于FHE的隐私保护机器学习：LLama-3-8B的更高吞吐量和更长输入

Jaiyoung Park, Sejin Park, Jai Hyun Park, Jung Ho Ahn, Jung Hee Cheon, Guillaume Hanrot, Jung Woo Kim, Minje Park, Damien Stehlé

专题命中其他LLM ：提出基于FHE的隐私保护LLM推理加速方法。

AI总结针对FHE-based LLM推理中输入长度扩展性差和非线性层评估受异常值影响的问题，采用令牌预置、正交旋转和稀疏密文多项式求值方法，结合快速同态线性代数技术，实现128加密令牌推理加速，并扩展至数千令牌的异构输入，在Llama-3-8B上取得显著性能提升。

详情

AI中文摘要

随着大型语言模型（LLM）变得无处不在，与推理相关的隐私问题日益突出。全同态加密（FHE）已成为非交互式机密LLM推理的主要密码学解决方案。然而，现有解决方案在输入令牌长度上扩展性差，主要关注小模型或小输入尺寸。它们还受到大的异常值影响，这强烈影响非线性层的评估，导致高昂的多项式逼近成本。我们从两个方向扩展基于FHE的LLM推理。首先，我们加速了128个加密令牌的基于FHE的推理。我们采用机器学习技术（令牌预置和正交旋转）来减轻异常值对非线性层FHE评估的影响。另外，我们设计了一种新颖的稀疏密文多项式求值方法，以加速我们的同态SoftMax实现。我们将这些与最近的快速同态线性代数技术相结合，实现了显著提高的效率。其次，我们将提示大小扩展到数千个令牌，适用于只有输入的最终部分敏感且加密的场景。处理此问题需要处理标准的明文-明文和密文-密文组件，以及针对新颖的明文-密文组件的宽同态计算。为了解决这个问题，我们设计了一种专用的同态线性代数算法，构建了一个浅层同态注意力电路，以最小化自举成本。基于这些要素，我们提出了一个基于CKKS的Llama-3-8B私有推理端到端实现。在8个NVIDIA RTX PRO 6000 GPU上，128个加密令牌的摘要生成需要20秒，生成每个令牌需要18秒（远超SOTA在更昂贵的H100 GPU上的295秒）。对于4096个令牌的异构输入（最后128个加密），摘要生成需要64秒，生成每个令牌需要22秒。

英文摘要

As large language models (LLMs) become ubiquitous, privacy concerns pertaining to inference keep growing. Fully homomorphic encryption (FHE) has emerged as a primary cryptographic solution for non-interactive confidential LLM inference. However, existing solutions scale poorly with input token length, focusing on small models or input sizes. They also suffer from large outlier values, which strongly impact the evaluation of non-linear layers, leading to heavy polynomial approximation costs. We scale up FHE-based LLM inference in two directions. First, we accelerate FHE-based inference for 128 encrypted tokens. We adopt ML techniques (token prepending and orthogonal rotations) to mitigate outlier impacts on the FHE evaluation of non-linear layers. Separately, we devise a novel polynomial evaluation method for sparsely-packed ciphertexts to speed up our homomorphic SoftMax implementation. We combine these with recent fast homomorphic linear algebra techniques, achieving significantly improved efficiency. Second, we expand the prompt size up to thousands of tokens for contexts where only the final part of the input is sensitive and encrypted. Processing this requires handling standard plaintext-plaintext and ciphertext-ciphertext components, alongside a wide homomorphic computation for a novel plaintext-ciphertext component. To address this, we devise a dedicated homomorphic linear algebra algorithm, building a shallow homomorphic attention circuit that minimizes bootstrapping costs. Based on these ingredients, we present a CKKS-based end-to-end implementation of Llama-3-8B private inference. On 8 NVIDIA RTX PRO 6000 GPUs, 128 encrypted tokens take 20s for summarization and 18s/token for generation (vastly outperforming the SOTA 295s on costlier H100 GPUs). For a heterogeneous 4096-token input (last 128 encrypted), it takes 64s for summarization and 22s/token for generation.

URL PDF HTML ☆

赞 0 踩 0

2510.27353 2026-06-18 cs.AI 版本更新 70%

An In-depth Study of LLM Contributions to the Bin Packing Problem

LLM对装箱问题贡献的深入研究

Julien Herrmann, Guillaume Pallez

发表机构 * CNRS-IRIT ； Inria

专题命中其他LLM ：研究LLM对装箱问题的贡献，分析LLM生成启发式算法。

AI总结通过分析LLM生成的启发式算法，发现其虽可读但难以解释，进而提出更简单高效的新算法，质疑LLM对装箱问题的实际贡献。

Comments Accepted for publication in ACM Transactions on Evolutionary Learning and Optimization

详情

DOI: 10.1145/3821574

AI中文摘要

近期研究表明，大型语言模型（LLM）可能为数学发现提供有趣的思路。该主张基于报告称，基于LLM的遗传算法在均匀分布和Weibull分布下为在线装箱问题产生了具有新见解的启发式算法。本文通过详细分析LLM产生的启发式算法，考察其行为和可解释性，重新评估了这一主张。尽管这些启发式算法是人类可读的，但即使对领域专家而言，它们仍然在很大程度上是不透明的。基于此分析，我们提出了一类针对这些特定装箱实例的新算法。推导出的算法显著更简单、更高效、更可解释且更具泛化性，表明所考虑的实例本身相对简单。然后，我们讨论了关于LLM对该问题贡献的主张的局限性，该主张似乎基于一个错误的假设，即这些实例先前已被研究过。我们的发现反而强调了在评估LLM生成输出的科学价值时，需要进行严格的验证和情境化。

英文摘要

Recent studies have suggested that Large Language Models (LLMs) could provide interesting ideas contributing to mathematical discovery. This claim was motivated by reports that LLM-based genetic algorithms produced heuristics offering new insights into the online bin packing problem under uniform and Weibull distributions. In this work, we reassess this claim through a detailed analysis of the heuristics produced by LLMs, examining both their behavior and interpretability. Despite being human-readable, these heuristics remain largely opaque even to domain experts. Building on this analysis, we propose a new class of algorithms tailored to these specific bin packing instances. The derived algorithms are significantly simpler, more efficient, more interpretable, and more generalizable, suggesting that the considered instances are themselves relatively simple. We then discuss the limitations of the claim regarding LLMs' contribution to this problem, which appears to rest on the mistaken assumption that the instances had previously been studied. Our findings instead emphasize the need for rigorous validation and contextualization when assessing the scientific value of LLM-generated outputs.

URL PDF HTML ☆

赞 0 踩 0

2506.12311 2026-06-18 cs.CL cs.SD eess.AS 版本更新 70%

Phonikud: Overcoming Phonetic Underspecification for Hebrew Text-To-Speech

Phonikud：克服希伯来语文本转语音中的语音欠指定问题

Yakov Kolani, Maxim Melichov, Cobi Calev, Morris Alper

发表机构 * Independent Researcher（独立研究者）； Reichman University（雷赫曼大学）； Tel Aviv University（特拉维夫大学）； Carnegie Mellon University（卡内基梅隆大学）

专题命中其他LLM ：希伯来语TTS，涉及语言模型

AI总结提出Phonikud框架，通过开源G2P系统、语料库、基准和评估模型，解决希伯来语TTS中重音等语音特征欠指定问题，实现更准确的音素预测。

Comments Accepted to Interspeech 2026. Project page: https://phonikud.github.io

详情

AI中文摘要

现代希伯来语的文本转语音（TTS）受到该语言正字法复杂性的挑战，现有解决方案忽略了诸如重音等欠指定的语音特征。我们提出了一个更准确的希伯来语TTS框架，包含四个贡献：（1）Phonikud，一个开源的希伯来语字素到音素（G2P）系统，输出完全指定的国际音标（IPA）转录，通过增强基础注音器设计而成。（2）ILSpeech语料库，包含配对的希伯来语音频、文本和专家IPA标注。（3）针对先前未测量的希伯来语G2P转换任务的基准。（4）希伯来语音频到IPA模型，捕获先前忽略的语音细节，用于自动TTS评估。我们的结果表明，Phonikud比先前方法更准确地预测希伯来语音素，并且使用Phonikud语音输入的小型本地TTS模型接近大型专有系统。我们在以下网址发布代码、数据和模型：this https URL。

英文摘要

Text-to-speech (TTS) for Modern Hebrew is challenged by the language's orthographic complexity, with existing solutions ignoring underspecified phonetic features such as stress. We present a framework for more phonetically accurate Hebrew TTS with four contributions: (1) Phonikud, an open-source Hebrew grapheme-to-phoneme (G2P) system that outputs fully-specified International Phonetic Alphabet (IPA) transcriptions, designed by augmenting a base diacritizer. (2) The ILSpeech corpus of paired Hebrew audio, text, and expert IPA annotations. (3) A benchmark for the previously unmeasured task of Hebrew G2P conversion. (4) Hebrew audio-to-IPA models capturing previously disregarded phonetic details for automatic TTS evaluation. Our results show that Phonikud more accurately predicts Hebrew phonemes than prior methods, and that small, local TTS models with phonetic input from Phonikud approach large proprietary systems. We release our code, data, and models at https://phonikud.github.io.

URL PDF HTML ☆

赞 0 踩 0

2606.19286 2026-06-18 cs.HC cs.AI cs.CY 新提交 60%

Correct Yourself, Keep My Trust: How Self-Correction and Social Connection Shape Credibility in Social Chatbots

纠正自己，保持信任：自我纠正和社会联系如何塑造社交聊天机器人的可信度

Biswadeep Sen, Yi-Chieh Lee

发表机构 * School of Computing National University of Singapore Singapore Singapore（计算学院新加坡国立大学新加坡新加坡）； Computer Science National University of Singapore Singapore Singapore（计算机科学新加坡国立大学新加坡新加坡）； National University of Singapore（新加坡国立大学）

专题命中其他LLM ：社交聊天机器人错误纠正策略实验

AI总结通过实验比较三种错误纠正策略，发现自我纠正不损害聊天机器人可信度，且用户社会联系强度仅在自我纠正时显著预测信念改变。

详情

AI中文摘要

当社交聊天机器人犯错时——它们确实会犯错——它们的恢复方式决定了用户是否会再次信任它们。社交聊天机器人正日益融入日常生活，但它们仍然容易生成令人信服但不准确的信息。它们与用户建立的社会联系使得此类错误尤其具有后果性。我们进行了一项受试者间实验（N=120），比较了三种错误纠正策略：网页撤回、同一社交聊天机器人的自我纠正以及专家聊天机器人的纠正。我们的结果揭示了两个关键发现。首先，所有三种策略都能同样好地纠正错误，但只有自我纠正不会损害聊天机器人的可信度：参与者对自我纠正的聊天机器人在可信度和感知专业性上的评分显著高于其错误由外部来源纠正的聊天机器人。其次，通过社会吸引力和自我披露测量的用户与聊天机器人的社会联系强度，仅在聊天机器人自我纠正时显著预测信念改变的大小。将纠正外包给外部来源完全切断了这种联系。这些发现表明，社交聊天机器人应该纠正自己的错误，而不是外包纠正，并且投资于社会联系是一种功能性机制，能增强纠正效果，而不仅仅是一种设计特征。我们讨论了设计能够保持长期可信度同时有效处理自身错误的聊天机器人的启示。

英文摘要

When social chatbots make mistakes, and they do, how they recover determines whether users trust them again. Social chatbots are increasingly integrated into everyday life, yet they remain prone to generating convincing but inaccurate information. The social connection they build with users makes such errors particularly consequential. We conducted a between-subjects experiment (N=120) comparing three error correction strategies: a webpage retraction, self-correction by the same social chatbot, and correction by an expert chatbot. Our results reveal two key findings. First, all three strategies corrected the error equally well, but only self-correction did so without damaging the chatbot's credibility: participants rated self-correcting chatbots significantly higher in both trustworthiness and perceived expertise than chatbots whose errors were corrected by external sources. Second, the strength of the user's social connection with the chatbot, measured through social attraction and self-disclosure, significantly predicted the magnitude of belief change, but only when the chatbot corrected itself. Outsourcing corrections to an external source severed this link entirely. These findings suggest that social chatbots should correct their own mistakes rather than outsource corrections, and that investing in social connection is a functional mechanism that amplifies correction effectiveness, not merely a design feature. We discuss implications for designing chatbots that maintain long-term credibility while effectively addressing their own errors.

URL PDF HTML ☆

赞 0 踩 0

2606.19164 2026-06-18 cs.LG cs.AI 新提交 60%

Essential Subspace Merging for Multi-Task Learning

多任务学习的本质子空间合并

Longhua Li, Lei Qi, Xin Geng, Qi Tian

发表机构 * School of Computer Science and Engineering, Southeast University（东南大学计算机科学与工程学院）； Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China（新一代人工智能技术及其交叉应用国家重点实验室（东南大学））； Huawei Inc.（华为公司）

专题命中其他LLM ：提出多任务模型合并方法，适用于LLM但非核心

AI总结提出本质子空间分解（ESD）和合并（ESM/ESM++）方法，通过正交化任务更新的主成分来减少多任务合并中的干扰，无需训练即可实现高效多任务学习。

详情

AI中文摘要

模型合并旨在通过将多个从同一预训练检查点微调得到的模型的能力集成到一个单一模型中，从而实现多任务学习。其核心挑战是任务特定参数更新之间的任务间干扰。在本文中，我们分析了任务更新引起的输出偏移，并观察到它们的能量集中在少数主方向上。我们将这些方向张成的子空间称为本质子空间。相比之下，大多数剩余方向携带的任务相关能量很少，但它们在多个任务更新中的累积会在合并过程中引起严重干扰。受此观察启发，我们提出了本质子空间分解（ESD），它根据激活偏移的主成分分解每个任务更新。基于ESD，我们引入了本质子空间合并（ESM），一种无需训练的静态合并方法，它将本质成分正交化并融合成一个紧凑的多任务模型。我们进一步将ESM扩展到ESM++，一种无需训练的动态合并方法，它将任务特定残差分解为低秩专家，并在前向推理过程中通过基于原型的路由选择最相关的专家。跨多个任务集和模型规模的大量实验表明，ESM和ESM++在减少任务间干扰的同时有效保留了任务知识。

英文摘要

Model merging aims to enable multi-task learning by integrating the capabilities of multiple models fine-tuned from the same pre-trained checkpoint into a single model. Its core challenge is inter-task interference among task-specific parameter updates. In this paper, we analyze the output shifts induced by task updates and observe that their energy is concentrated in a small number of principal directions. We call the subspace spanned by these directions the essential subspace. In contrast, most remaining directions carry little task-relevant energy, but their accumulation across multiple task updates can cause severe interference during merging. Motivated by this observation, we propose Essential Subspace Decomposition (ESD), which decomposes each task update according to the principal components of its activation shift. Based on ESD, we introduce Essential Subspace Merging (ESM), a training-free static merging method that orthogonalizes and fuses essential components into one compact multi-task model. We further extend ESM to ESM++, a training-free dynamic merging method that decomposes task-specific residuals into low-rank experts and selects the most relevant expert through prototype-based routing during forward inference. Extensive experiments across multiple task sets and model scales demonstrate that ESM and ESM++ effectively preserves task knowledge while reducing inter-task interference.

URL PDF HTML ☆

赞 0 踩 0

2606.19150 2026-06-18 cs.LG 新提交 60%

Complementary Attention Head Pruning for Efficient Transformers

互补注意力头剪枝用于高效Transformer

Yaniv Livertovsky, Shahar Somin, Gonen Singer

发表机构 * Bar-Ilan University（巴伊兰大学）

专题命中其他LLM ：注意力头剪枝方法适用于Transformer，包括LLM

AI总结提出CAHP框架，将注意力头选择建模为全局图论问题，通过图聚类和信息论距离保留互补头，自动确定剪枝数量，在SST-5和MNLI上优于现有方法。

Comments 9 pages, 4 figures, 3 tables. Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2026

详情

AI中文摘要

基于Transformer的模型在自然语言处理中的显著成功源于架构的规模化，这导致大量参数并阻碍了在资源受限环境中的部署。虽然结构化剪枝提供了一条压缩路径，但现有的最先进方法通常依赖于基于梯度的重要性排序或随机门控，这些方法存在不稳定性、结构退化以及需要大量手动超参数调整的问题。在本文中，我们引入了CAHP（互补注意力头剪枝），一种新颖的事后框架，将头选择重新定义为全局图论问题。CAHP不是孤立地评估头，而是利用基于图的聚类结合信息论距离度量来识别并保留一组拓扑多样化的互补注意力头。无需预定义稀疏度或剪枝比例，该框架通过识别递减的边际性能曲线自动确定各层中保留的注意力头数量，其中根据所选多项式次数，剪除额外头会导致性能急剧下降。在SST-5和MNLI基准上跨不同Transformer模型规模的广泛评估表明，CAHP始终优于竞争基线，特别是在高压缩率情况下。此外，我们的结构分析表明，CAHP避免了基于梯度的剪枝方法的“邻近偏差”（倾向于主要保留靠近输出层的头），而是保留了模型中间层中功能关键的注意力头集合。

英文摘要

The remarkable success of Transformer-based models in natural language processing stems from architectural scaling, which leads to a large number of parameters and hinders deployment in resource-constrained environments. While structured pruning offers a pathway to compression, existing state-of-the-art methods often rely on gradient-based importance ranking or stochastic gating, which suffer from instability, structural degeneration, and the need for extensive manual hyperparameter tuning. In this paper, we introduce CAHP (Complementary Attention Head Pruning), a novel post-hoc framework that redefines head selection as a global graph-theoretical problem. Rather than evaluating heads in isolation, CAHP utilizes graph-based clustering combined with information-theoretic distance measures to identify and preserve a topologically diverse subset of complementary attention heads. Without requiring a predefined sparsity level or pruning ratio, the framework automatically determines the number of selected attention heads across layers by identifying a diminishing marginal performance curve, where pruning additional heads leads to a sharp degradation in performance, as determined by the chosen polynomial degree. Extensive evaluations on the SST-5 and MNLI benchmarks, across different Transformer model scales, demonstrate that CAHP consistently outperforms competitive baselines, particularly in high-compression regimes. Furthermore, our structural analysis shows that CAHP avoids the "proximity bias" of gradient-based pruning methods, which tend to preserve heads mainly in layers close to the output, and instead retains a functionally critical set of attention heads in the model's intermediate layers.

URL PDF HTML ☆

赞 0 踩 0

2606.19144 2026-06-18 cs.AI cs.CL 新提交 60%

Human-AI Coevolution Dynamics: A Formal Theory of Social Intelligence Emergence Through Long-Term Interaction

人机协同演化动力学：长期互动中社会智能涌现的形式理论

Jingyi Zhou, Senlin Luo, Haofan Chen

发表机构 * School of Information and Electronics, Beijing Institute of Technology（信息与电子学院，北京理工大学）； Institute of Scientific and Technical Research on Archives, Beijing（档案科学与技术研究所，北京）； China Electronics Engineering Design Institute Co., Ltd.（中国电子工程设计院有限公司）

专题命中其他LLM ：人机交互理论框架，涉及LLM但非核心

AI总结提出人机协同演化动力学框架（HACD-H），将情感适应、关系组织、社会记忆和人格一致性整合为统一动力学模型，通过约14,700轮对话数据集验证，发现社会智能与社会认知能量显著负相关，揭示社会智能源于长期协同演化。

详情

AI中文摘要

当前的对话式AI系统在语言生成、个性化和长上下文交互方面取得了显著进展。然而，大多数现有方法通过孤立组件（如情感建模、记忆检索或人格条件化）来建模社会行为，缺乏一个统一的框架来解释长期人机交互中稳定社会关系和社会智能的涌现。为解决这一问题，我们提出了人机协同演化动力学框架（HACD-H），这是一个将人机交互建模为自组织社会认知系统的形式模型。HACD-H将情感适应、关系组织、社会记忆和人格一致性整合到一个统一的动力学框架中，并引入了多时间尺度社会认知、关系吸引子、信任盆地、发展相变和社会认知能量景观等原则。我们构建了一个约14,700轮交互的对话数据集，并开发了一个理论驱动的实证评估框架。结果揭示了社会认知中的时间持久性层次结构、稳定的关系吸引子、类似相变的发展模式以及结构化的社会认知能量景观。社会智能与社会认知能量呈显著负相关（r = -0.391, p < 0.001），且交互轨迹随时间呈现渐进性能量减少。这些发现表明，社会智能源于长期的社会认知协同演化，而非孤立的对话能力。HACD-H为建模适应性人机社会交互和开发社会智能AI系统提供了统一的理论基础。

英文摘要

Current conversational AI systems have made significant progress in language generation, personalization, and long-context interaction. However, most existing methods model social behavior through isolated components such as emotion modeling, memory retrieval, or persona conditioning, lacking a unified framework to explain the emergence of stable social relationships and social intelligence in long-term human-AI interaction.To address this, we propose the Human-AI Coevolution Dynamics Framework (HACD-H), a formal model of human-AI interaction as a self-organizing social cognitive system. HACD-H integrates emotional adaptation, relational organization, social memory, and personality consistency into a unified dynamical framework and introduces principles including multi-timescale social cognition, relational attractors, trust basins, developmental phase transitions, and social cognitive energy dynamics.We construct a conversational dataset with approximately 14,700 interaction turns and develop a theory-driven empirical evaluation framework. Results reveal a hierarchy of temporal persistence in social cognition, stable relational attractors, phase-transition-like developmental patterns, and a structured social cognitive energy landscape. Social intelligence shows a significant negative correlation with social cognitive energy (r = -0.391, p < 0.001), and interaction trajectories exhibit progressive energy reduction over time.These findings suggest that social intelligence emerges from long-term social cognitive coevolution rather than isolated conversational capabilities. HACD-H provides a unified theoretical foundation for modeling adaptive human-AI social interaction and developing socially intelligent AI systems.

URL PDF HTML ☆

赞 0 踩 0

2606.19121 2026-06-18 cs.SE cs.CL cs.HC 新提交 60%

Written by AI, Managed by AI: Semantic Space Control and Index Sickness Elimination Across 391 Consecutive Sessions

由AI编写，由AI管理：跨越391个连续会话的语义空间控制与索引病消除

Hui Zhang, Shuren Song

发表机构 * Shenzhen Yunxi Technology Co., Ltd.（深圳云曦科技有限公司）； Information Technology Center, Tsinghua University（清华大学信息科学技术中心）

专题命中其他LLM ：研究LLM协作中的工程问题

AI总结本文通过真实软件项目中的行动研究，发现长期LLM协作中增加形式约束反而导致“索引病”，提出“基线-日志物理分离”机制，有效消除该问题。

Comments 22 pages, 2 tables, 1 figure. Action research. Bilingual submission (Chinese companion version included as supplementary). Submitted to ICSE 2027 IOR track

详情

AI中文摘要

解决长期LLM协作中概念漂移的主流工程直觉是，用更多的形式约束换取更可靠的输出——设计符号标识符系统，在系统提示中积累防御规则，扩展上下文窗口。我们的工程记录表明，在长期设置中，这种方向可能产生与设计意图相反的效果。通过在跨越约一个月和391个协作会话的真实软件项目（Bang-v3）中使用行动研究方法，我们记录并分析了这些策略的失败过程。当符号系统超过复杂度阈值时，LLM并不会变得更准确——相反，它们放弃了对业务语义的真正理解，退回到符号层内的自我指涉推理，并生成看似内部一致但实际上与现实脱节的输出。我们将这种失败模式命名为“索引病”，其典型表现为“幻影立法”。我们将底层原理命名为“庞原理（语义活力定律）”：带有明确目的的自然语言传达的信息质量远高于符号表达。由此，我们设计并验证了其物理工程机制：“基线-日志物理分离”。在同一项目中，该机制将AI指令量减少了约75%，并且在随后的约150个会话中，未观察到索引病复发。附有双语对照版本（中文）作为补充材料。

英文摘要

The prevailing engineering intuition for addressing conceptual drift in long-horizon LLM collaboration is to trade more formal constraints for more reliable outputs -- designing symbolic identifier systems, accumulating defensive rules in System Prompts, expanding context windows. Our engineering record shows that in long-horizon settings, this direction may produce effects contrary to design intent. Using action research methods in a real software project (Bang-v3) spanning approximately one month and 391 collaborative sessions, we document and analyze the failure process of these strategies. When the symbolic system exceeds a complexity threshold, LLMs do not become more accurate -- instead, they abandon genuine understanding of business semantics, retreat to self-referential reasoning within the symbolic layer, and generate outputs that appear internally consistent but are physically disconnected from reality. We name this failure pattern "Index Sickness," and its canonical manifestation "Phantom Legislation." We name the underlying principle the "Pang Principle (Semantic Vitality Law)": natural language carrying explicit purpose conveys far greater information quality than symbolic expression. From this, we design and validate its physical engineering mechanism: "Baseline-Log Physical Separation." In the same project, this mechanism reduced AI Instructions volume by ~75%, and across the subsequent ~150 sessions, no recurrence of Index Sickness was observed. A bilingual companion version (Chinese) is included as supplementary material.

URL PDF HTML ☆

赞 0 踩 0

2606.19111 2026-06-18 cs.CL cs.AI cs.MA 新提交 60%

Leadership as Coordination Control: Behavioral Signatures and the Recovery-Advantage Boundary in Multi-Agent LLM Teams

领导力作为协调控制：多智能体LLM团队中的行为特征与恢复优势边界

Haewoon Kwak

发表机构 * Indiana University Bloomington（印第安纳大学布卢明顿分校）

专题命中其他LLM ：研究LLM团队行为，但非模型本身

AI总结研究多智能体LLM团队中过程级协调控制何时增加价值，通过行为特征和消融实验发现，控制器的优势仅在初始多数投票不可靠、任务可恢复且无指导交互无法修复时出现，验证了权变理论。

Comments 33 pages

详情

AI中文摘要

团队科学认为领导力是权变的：它仅在特定条件下有帮助，而能力强的自主团队可能根本不需要领导。我们对多智能体LLM团队提出类似问题：在什么可测量的条件下，过程级协调控制会增加价值，这些条件是否与团队科学的预测一致？我们使用行为特征（多数锁定、探索、从错误的第0轮共识中恢复）和每动作消融实验，因为每个控制器是一个显式动作集，而不是一个整体提示。我们将三种经典领导风格（交易型、变革型、情境型）操作化为对共享动作词汇（探索、修订、接受、综合）的控制器。一个具有相同动作但使用任意规则的匹配控制器恢复效果不优于多数投票，因此是理论推导的规则（而非词汇）起作用。在四个任务体系和三个开放权重模型系列中，没有控制器在准确率上占主导地位，正如权变观点所预测的：交易型控制在所有12个（模型、体系）组合上与共享的第0轮投票匹配，差异在1.3个百分点以内，仅在初始多数不可靠的一个组合上出现增益（llama-4-scout社会性；情境型比扁平型高8个百分点）。通过四个边界探针测试的恢复优势解释表明，控制器仅在初始多数投票不可靠、任务可恢复且无指导交互无法修复时优于纯交互。这些区域映射到权变理论（领导替代、路径-目标冗余、情境准备差距），因此基本为零的准确率结果正是理论所预测的，而非控制器的失败。我们将过程级协调控制视为一种需要测量和理论映射的权变因素，而不是需要超越的排行榜。

英文摘要

Team science holds that leadership is contingent: it helps only under specific conditions, and capable, autonomous teams may need none at all. We ask the analogous question for multi-agent LLM teams: under what measurable conditions does process-level coordination control add value, and do those conditions match what team science predicts? We use behavioral signatures (majority lock-in, exploration, recovery from an incorrect round-0 consensus) and per-action ablations, clean because each controller is an explicit action set, not a monolithic prompt. We operationalize three classical leadership styles (transactional, transformational, situational) as controllers over a shared action vocabulary (explore, revise, accept, synthesize). A matched controller with the same actions but an arbitrary rule recovers no better than majority voting, so the theory-derived rule, not the vocabulary, does the work. Across four task regimes and three open-weight model families, no controller dominates by accuracy, as the contingency view predicts: transactional control matches a shared round-0 vote on all 12 (model, regime) combinations to within 1.3pp, and gains appear only on the one combination where the round-0 majority is unreliable (llama-4-scout social; situational +8pp over flat). A recovery-advantage account, tested with four boundary probes, says a controller beats plain interaction only where the round-0 majority is unreliable, the task is recoverable, and undirected interaction does not already repair it. These regions map onto contingency theory (leadership substitutes, path-goal redundancy, the situational readiness gap), so a largely null accuracy result is what the theory predicts, not a failure of the controllers. We read process-level coordination control as a contingency to be measured and theory-mapped, not a leaderboard to be topped.

URL PDF HTML ☆

赞 0 踩 0

2606.19108 2026-06-18 cs.LG 新提交 60%

JourneyFormer: Encoding Airbnb Guest Journey with Sequence Modeling

JourneyFormer: 使用序列建模编码Airbnb客人旅程

Daochen Zha, Chun How Tan, Xin Liu, Bin Xu, Han Zhao, Xiaowei Liu, Tracy Yu, Hui Gao, Huiji Gao, Liwei He, Stephanie Moyerman, Sanjeev Katariya

发表机构 * Airbnb

专题命中其他LLM ：序列建模用于推荐，非LLM核心

AI总结针对Airbnb中客人序列长、探索性强且标签稀疏的问题，提出JourneyFormer序列建模解决方案，通过优化数据选择、ID嵌入、模型架构和标签归因，并在两个生产面上通过在线A/B测试验证了其有效性。

Comments Accepted by KDD 2026

详情

AI中文摘要

序列建模因其能够建模用户历史行为并推断用户意图，在推荐和排序算法中越来越受欢迎。尽管理论简单，但由于序列的复杂性和稀疏标签，序列模型在生产中的实际部署并非易事。例如，在Airbnb中，客人序列通常较长、具有探索性且复杂，我们关注的是稀疏的预订标签。因此，我们经常需要在数据和建模方面做出各种设计决策，以在有效性和可扩展性之间取得平衡。本文深入探讨了这些生产挑战，并部署了JourneyFormer，一种用于Airbnb搜索排序的序列建模解决方案。我们详细介绍了关键的设计考虑，涵盖客人事件选择、ID嵌入、模型架构和标签归因等方面。此外，我们描述了几种加速模型训练和推理的定制策略。JourneyFormer已成功部署在Airbnb的生产环境中，其有效性和影响不仅通过改进的离线排序指标得到证明，而且通过两个生产面上的在线A/B测试在关键业务指标上取得了显著提升。

英文摘要

Sequence modeling has become increasingly popular in recommendation and ranking algorithms, owing to its capacity to model users' historical behaviors and infer user intentions. Despite its theoretical simplicity, the practical deployment of a sequence model in production is non-trivial due to complexity of the sequence and sparse labels. For example, in Airbnb, guest sequences are often long, exploratory and complex, and we focus on booking labels, which are sparse. As such, we are often required to make various design decisions regarding data and modeling to strike a balance between effectiveness and scalability. This work delved into these production challenges and deployed JourneyFormer, a sequence modeling solution for search ranking at Airbnb. We detail crucial design considerations, covering aspects such as guest event selection, ID embeddings, model architecture, and label attribution. Additionally, we describe several tailored strategies to accelerate model training and inference. JourneyFormer has been successfully deployed within Airbnb's production, where its effectiveness and impact have been evidenced not only by improved offline ranking metrics but also by significant gains in key business metrics through online A/B testing across 2 production surfaces.

URL PDF HTML ☆

赞 0 踩 0

2606.18923 2026-06-18 cs.LG 新提交 60%

GrapNet: A Programmable Dynamic-Architecture Neural Graph Substrate

GrapNet: 一种可编程的动态架构神经图基板

Zirong Li

发表机构 * Zirong Li（李子荣）

专题命中其他LLM ：提出可编程神经图基板，非LLM核心

AI总结提出GrapNet，一种将图作为可执行架构的神经基板，通过可编程接口支持结构编辑、冻结子图、局部审计等操作，在Split Fashion-MNIST和Split CIFAR-10上分别提升12.08和3.81个百分点的准确率。

Comments 8 pages, 1 figure, preprint

详情

AI中文摘要

可编程性是固定张量神经网络中缺失的一流接口：编辑关系、冻结子图、审计局部函数或更改执行后端应是对神经程序的操作，而非临时参数手术。GrapNet研究这种图即网络的设置。图是架构和可执行程序，而非输入数据图。每个计算节点拥有其下一层子节点引用和与这些引用对齐的可训练分配向量；删除关系会物理移除子节点引用和相应的分配坐标。结构规则和执行策略位于节点核心之外，因此同一子节点拥有的图可以被增长、冻结、结构编辑、分组为可训练族块、通过注意力在活动关系上路由，或在拓扑稳定后降级为密集快照。GrapNet通过向量值父接口与常规模块组合：密集层、CNN编码器、ResNet特征提取器、注意力块和Transformer表示都可以为每个坐标提供一个感知GrapNode。评估组织为可编程性压力测试套件，而非新的重放基准。在匹配的十种子Split Fashion-MNIST研究中，可塑GrapNet+ER头在相同已见类损失和重放记忆下达到63.16%的已见类准确率，而参数更大的密集MLP+ER为51.08%，配对差值为12.08点，p=1.3e-5。在Split CIFAR-10上使用冻结的ImageNet ResNet-18编码器时，相同基板将在线头比MLP-256提高3.81点，p=0.0026。这些结果支持GrapNet作为可编辑的神经图基板，其核心价值在于具有忠实执行视图的结构可编程性。

英文摘要

Programmability is a missing first-class interface in fixed-tensor neural networks: editing a relation, freezing a subgraph, auditing a local function, or changing the execution backend should be an operation on the neural program rather than ad-hoc parameter surgery. GrapNet studies this graph-as-network setting. The graph is the architecture and executable program, not an input data graph. Each compute node owns its next-layer child references and a trainable allocation vector aligned with those references; deleting a relation physically removes both the child reference and the corresponding allocation coordinate. Structural rules and execution policies live outside the node core, so the same child-owned graph can be grown, frozen, structurally edited, grouped into trainable family blocks, routed by attention over active relations, or lowered to dense snapshots after topology stabilizes. GrapNet composes with conventional modules through a vector-valued parent interface: dense layers, CNN encoders, ResNet feature extractors, attention blocks, and transformer representations can all feed one sensory GrapNode per coordinate. The evaluation is organized as a programmability stress suite rather than as a new replay benchmark. In a matched ten-seed Split Fashion-MNIST study, a plastic GrapNet+ER head reaches 63.16 percent seen-class accuracy versus 51.08 percent for a parameter-larger dense MLP+ER under the same seen-class loss and replay memory, with paired delta 12.08 points and p=1.3e-5. On Split CIFAR-10 with a frozen ImageNet ResNet-18 encoder, the same substrate improves the online head over MLP-256 by 3.81 points, with p=0.0026. These results support GrapNet as an editable neural graph substrate whose core value is structural programmability with faithful execution views.

URL PDF HTML ☆

赞 0 踩 0

2606.18856 2026-06-18 cs.CL cs.LG 新提交 60%

Approximate Structured Diffusion for Sequence Labelling

近似结构化扩散用于序列标注

Nicolas Floquet, Joseph Le Roux, Nadi Tomeh

发表机构 * Université Sorbonne Paris Nord, CNRS, Laboratoire d’Informatique de Paris Nord, LIPN（巴黎北大学 Sorbonne、法国国家科学研究中心、巴黎北信息学实验室、LIPN）

专题命中其他LLM ：扩散模型用于序列标注，非LLM核心但相关。

AI总结提出一种基于扩散的条件随机场（CRF）训练方法，通过引入标签噪声条件来捕捉长距离依赖，结合近似推理在词性标注任务上实现16.5%的错误率降低。

2606.18852 2026-06-18 cs.CL cs.AI 新提交 60%

Aligning Implied Statements for Implicit Hate Speech Generalizability with Context-Bounded Semi-hard Negative Mining

对齐隐含陈述：通过上下文边界半硬负挖掘实现隐式仇恨言论的泛化性

Wicaksono Leksono Muhamad, Yunita Sari

发表机构 * Mantera Studio（Mantera工作室）； Universitas Gadjah Mada（加雅玛大学）

专题命中其他LLM ：隐式仇恨言论分类，使用对比学习。

AI总结提出ImpSH三元组框架，通过将帖子与隐含陈述对齐并使用上下文边界半硬负样本聚焦学习，提升隐式仇恨言论的跨域泛化能力，在多个数据集上优于对比基线。

详情

AI中文摘要

隐式仇恨言论分类仍然是一个挑战，因为意图通常通过暗示和上下文而非明确辱骂来掩盖。先前的监督对比方法改进了域内检测，但可能过拟合表面线索，且难以跨数据集迁移。我们提出ImpSH，一个基于三元组的框架，当隐含陈述可用时将其与帖子对齐，并使用上下文边界半硬负样本将学习聚焦于近混淆项。我们还研究了AugSH，它通过数据增强形成正样本。在使用BERT和HateBERT对IHC、SBIC和DynaHate进行的受控评估中，ImpSH是标准监督对比基线的可行替代方案，并且在匹配的预处理和调优预算下通常能提高跨域性能。使用对齐性和均匀性进行的表示分析表明，正样本对更紧密且全局分布平衡，定性最近邻案例研究展示了域转移下的典型假负例。这些结果表明，通过上下文边界挖掘将帖子与其隐含陈述对齐，提供了到相关暗示的更稳定、类似双射的映射，克服了传统基于聚类的表示学习固有的波动性。

英文摘要

Classifying implicit hate speech remains a challenge, as intent is often masked through insinuation and context rather than explicit slurs. Prior supervised contrastive approaches improve in-domain detection but can overfit surface cues and struggle to transfer across datasets. We propose ImpSH, a triplet-based framework that aligns posts with implied statements when available and uses context-bounded semi-hard negatives to focus learning on near confusions. We also examine AugSH, which forms positives via data augmentation. In controlled evaluations on IHC, SBIC, and DynaHate with BERT and HateBERT, ImpSH is a viable alternative to standard supervised contrastive baselines and often improves cross-domain performance under matched preprocessing and tuning budgets. Representation analysis using alignment and uniformity indicates tighter positive pairs with balanced global spread, and qualitative nearest-neighbor case studies illustrate typical false negatives under domain shift. These results demonstrate that aligning posts with their implied statements via context-bounded mining provides a more stable, bijective-like mapping to related insinuations, overcoming the volatility inherent in traditional clustering-based representation learning.

URL PDF HTML ☆

赞 0 踩 0

2606.18257 2026-06-18 cs.HC cs.AI 新提交 70%

From Memorization to Creation: Evaluating the Cognitive Depth of LLM-Generated Educational Questions

从记忆到创造：评估LLM生成的教育问题的认知深度

Xiaolong Wang, Zhe Zhao, Song Lai, Chaoli Zhang, Zijie Geng, Yu Tong, Ye Wei, Qingsong Wen

发表机构 * City University of Hong Kong（香港城市大学）； Zhejiang Normal University（浙江师范大学）； Squirrel Ai Learning ； University of Science and Technology of China（中国科学技术大学）； Wuhan University（武汉大学）

专题命中指令微调：评估LLM生成问题认知层次，涉及提示策略

AI总结通过布鲁姆认知分类学评估六种LLM生成问题的认知层次，提出细粒度提示策略减少重复性并提升高阶认知比例，引入认知转移强度和类别漂移指标，揭示链式思维提示的可解释性。

Comments Accepted by KDD 2026

Journal ref KDD 2026

详情

DOI: 10.1145/3770854.3785686

AI中文摘要

尽管LLM在自动化教育内容生成方面展现出潜力，但它们生成能够激发高阶思维问题的能力仍未被充分研究。本研究通过布鲁姆认知分类学视角评估六种广泛使用的LLM，重点关注它们超越机械记忆并实现认知飞跃的能力。采用混合人机评估协议，我们在计算机科学、K-12数学和社会科学领域生成并分析了20,700个问题。主要贡献包括：(1) 一种细粒度提示策略，使Qwen2.5-7B-Instruct的问题重复性降低24.45%，并使InternLM3-8B-Instruct的高阶认知层次输出比例提升11.53%；(2) 认知转移强度（CogShift）和类别漂移的量化指标，揭示InternLM3在多层次转换中的优越性能；(3) 可解释性分析揭示指标级相关性，增强了链式思维提示的透明度。我们的发现强调了认知感知提示设计的重要性，并为在个性化学习系统中部署LLM提供了基准。

英文摘要

While LLMs show promise in automating educational content creation, their ability to generate questions that stimulate higher-order thinking remains understudied. This work evaluates six widely-used LLMs through a Bloom's Taxonomy lens, focusing on their capacity to transcend rote memorization and achieve cognitive leaps. Using a hybrid human--AI evaluation protocol, we generate and analyze 20{,}700 questions across computer science, K--12 math, and social-science domains. Key contributions include: (1) a fine-grained prompting strategy that reduces question repetitiveness by 24.45\% for Qwen2.5-7B-Instruct, and increases the proportion of higher-order cognitive level outputs by 11.53\% for InternLM3-8B-Instruct; (2) quantitative metrics for cognitive shift intensity (CogShift) and category drift, revealing InternLM3's superior performance in multi-level transitions; (3) an interpretability analysis revealing metric-level correlations that enhance the transparency of Chain-of-Thought prompting. Our findings highlight the importance of cognitive-aware prompt design and provide benchmarks for deploying LLMs in personalized learning systems.

URL PDF HTML ☆

赞 0 踩 0

1. 后训练 5 篇

Steerable Cultural Preference Optimization of Reward Models

Sparsity Curse: Understanding RLVR Model Parameter Space from Model Merging

SpecAlign: Efficient Specification-Grounded Alignment of Large Language Models via Synthetic Data

MemBoost: A Memory-Boosted Framework for Cost-Aware LLM Inference

SAGE: Retain-Aware Post-Hoc Sanitization of Final Unlearning Vector

2. 领域大模型 2 篇

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

Constraining to Generalize: Subspace Tuning for Few-shot Generalization of Audio-Language Models

3. 预训练 1 篇

What Does the Weight Norm Control in Grokking? Logit-Scale Mediation under Cross-Entropy

4. 其他LLM 21 篇

How Well Do Large Language Models Capture Human Personality?

Examining Human-Like Behaviors in LLMs: A Multi-Dimensional Analysis of Model Behaviors, User Factors, and System Prompts

Gatekeepers and Hallucinations: A Layered Evaluation Framework for LLM-Driven Quantum Circuit Generation

Characterizing Opinion Evolution of Networked LLMs

Formalizing and Mitigating Structural Distortion in LLM Attention for Graph Reasoning

MeEvo: Metacognitive Evolution Combined with Natural Evolution for Automatic Heuristic Design

Airport Terminal Passenger Queue Forecasting for Departure Gates and Security Checkpoints

The Long Delay to Arithmetic Generalization: When Learned Representations Outrun Behavior

Scaling up FHE-based Privacy-Preserving ML: Higher Throughput, Longer Inputs for LLama-3-8B

An In-depth Study of LLM Contributions to the Bin Packing Problem

Phonikud: Overcoming Phonetic Underspecification for Hebrew Text-To-Speech

Correct Yourself, Keep My Trust: How Self-Correction and Social Connection Shape Credibility in Social Chatbots

Essential Subspace Merging for Multi-Task Learning

Complementary Attention Head Pruning for Efficient Transformers

Human-AI Coevolution Dynamics: A Formal Theory of Social Intelligence Emergence Through Long-Term Interaction

Written by AI, Managed by AI: Semantic Space Control and Index Sickness Elimination Across 391 Consecutive Sessions

Leadership as Coordination Control: Behavioral Signatures and the Recovery-Advantage Boundary in Multi-Agent LLM Teams

JourneyFormer: Encoding Airbnb Guest Journey with Sequence Modeling

GrapNet: A Programmable Dynamic-Architecture Neural Graph Substrate

Approximate Structured Diffusion for Sequence Labelling

Aligning Implied Statements for Implicit Hate Speech Generalizability with Context-Bounded Semi-hard Negative Mining

5. 指令微调 1 篇

From Memorization to Creation: Evaluating the Cognitive Depth of LLM-Generated Educational Questions