语言大模型 / LLM - arXivDaily 专题

2606.19286 2026-06-18 cs.HC cs.AI cs.CY 新提交 60%

Correct Yourself, Keep My Trust: How Self-Correction and Social Connection Shape Credibility in Social Chatbots

纠正自己，保持信任：自我纠正和社会联系如何塑造社交聊天机器人的可信度

Biswadeep Sen, Yi-Chieh Lee

发表机构 * School of Computing National University of Singapore Singapore Singapore（计算学院新加坡国立大学新加坡新加坡）； Computer Science National University of Singapore Singapore Singapore（计算机科学新加坡国立大学新加坡新加坡）； National University of Singapore（新加坡国立大学）

专题命中其他LLM ：社交聊天机器人错误纠正策略实验

AI总结通过实验比较三种错误纠正策略，发现自我纠正不损害聊天机器人可信度，且用户社会联系强度仅在自我纠正时显著预测信念改变。

详情

AI中文摘要

当社交聊天机器人犯错时——它们确实会犯错——它们的恢复方式决定了用户是否会再次信任它们。社交聊天机器人正日益融入日常生活，但它们仍然容易生成令人信服但不准确的信息。它们与用户建立的社会联系使得此类错误尤其具有后果性。我们进行了一项受试者间实验（N=120），比较了三种错误纠正策略：网页撤回、同一社交聊天机器人的自我纠正以及专家聊天机器人的纠正。我们的结果揭示了两个关键发现。首先，所有三种策略都能同样好地纠正错误，但只有自我纠正不会损害聊天机器人的可信度：参与者对自我纠正的聊天机器人在可信度和感知专业性上的评分显著高于其错误由外部来源纠正的聊天机器人。其次，通过社会吸引力和自我披露测量的用户与聊天机器人的社会联系强度，仅在聊天机器人自我纠正时显著预测信念改变的大小。将纠正外包给外部来源完全切断了这种联系。这些发现表明，社交聊天机器人应该纠正自己的错误，而不是外包纠正，并且投资于社会联系是一种功能性机制，能增强纠正效果，而不仅仅是一种设计特征。我们讨论了设计能够保持长期可信度同时有效处理自身错误的聊天机器人的启示。

英文摘要

When social chatbots make mistakes, and they do, how they recover determines whether users trust them again. Social chatbots are increasingly integrated into everyday life, yet they remain prone to generating convincing but inaccurate information. The social connection they build with users makes such errors particularly consequential. We conducted a between-subjects experiment (N=120) comparing three error correction strategies: a webpage retraction, self-correction by the same social chatbot, and correction by an expert chatbot. Our results reveal two key findings. First, all three strategies corrected the error equally well, but only self-correction did so without damaging the chatbot's credibility: participants rated self-correcting chatbots significantly higher in both trustworthiness and perceived expertise than chatbots whose errors were corrected by external sources. Second, the strength of the user's social connection with the chatbot, measured through social attraction and self-disclosure, significantly predicted the magnitude of belief change, but only when the chatbot corrected itself. Outsourcing corrections to an external source severed this link entirely. These findings suggest that social chatbots should correct their own mistakes rather than outsource corrections, and that investing in social connection is a functional mechanism that amplifies correction effectiveness, not merely a design feature. We discuss implications for designing chatbots that maintain long-term credibility while effectively addressing their own errors.

URL PDF HTML ☆

赞 0 踩 0

2606.19164 2026-06-18 cs.LG cs.AI 新提交 60%

Essential Subspace Merging for Multi-Task Learning

多任务学习的本质子空间合并

Longhua Li, Lei Qi, Xin Geng, Qi Tian

发表机构 * School of Computer Science and Engineering, Southeast University（东南大学计算机科学与工程学院）； Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education, China（新一代人工智能技术及其交叉应用国家重点实验室（东南大学））； Huawei Inc.（华为公司）

专题命中其他LLM ：提出多任务模型合并方法，适用于LLM但非核心

AI总结提出本质子空间分解（ESD）和合并（ESM/ESM++）方法，通过正交化任务更新的主成分来减少多任务合并中的干扰，无需训练即可实现高效多任务学习。

详情

AI中文摘要

模型合并旨在通过将多个从同一预训练检查点微调得到的模型的能力集成到一个单一模型中，从而实现多任务学习。其核心挑战是任务特定参数更新之间的任务间干扰。在本文中，我们分析了任务更新引起的输出偏移，并观察到它们的能量集中在少数主方向上。我们将这些方向张成的子空间称为本质子空间。相比之下，大多数剩余方向携带的任务相关能量很少，但它们在多个任务更新中的累积会在合并过程中引起严重干扰。受此观察启发，我们提出了本质子空间分解（ESD），它根据激活偏移的主成分分解每个任务更新。基于ESD，我们引入了本质子空间合并（ESM），一种无需训练的静态合并方法，它将本质成分正交化并融合成一个紧凑的多任务模型。我们进一步将ESM扩展到ESM++，一种无需训练的动态合并方法，它将任务特定残差分解为低秩专家，并在前向推理过程中通过基于原型的路由选择最相关的专家。跨多个任务集和模型规模的大量实验表明，ESM和ESM++在减少任务间干扰的同时有效保留了任务知识。

英文摘要

Model merging aims to enable multi-task learning by integrating the capabilities of multiple models fine-tuned from the same pre-trained checkpoint into a single model. Its core challenge is inter-task interference among task-specific parameter updates. In this paper, we analyze the output shifts induced by task updates and observe that their energy is concentrated in a small number of principal directions. We call the subspace spanned by these directions the essential subspace. In contrast, most remaining directions carry little task-relevant energy, but their accumulation across multiple task updates can cause severe interference during merging. Motivated by this observation, we propose Essential Subspace Decomposition (ESD), which decomposes each task update according to the principal components of its activation shift. Based on ESD, we introduce Essential Subspace Merging (ESM), a training-free static merging method that orthogonalizes and fuses essential components into one compact multi-task model. We further extend ESM to ESM++, a training-free dynamic merging method that decomposes task-specific residuals into low-rank experts and selects the most relevant expert through prototype-based routing during forward inference. Extensive experiments across multiple task sets and model scales demonstrate that ESM and ESM++ effectively preserves task knowledge while reducing inter-task interference.

URL PDF HTML ☆

赞 0 踩 0

2606.19150 2026-06-18 cs.LG 新提交 60%

Complementary Attention Head Pruning for Efficient Transformers

互补注意力头剪枝用于高效Transformer

Yaniv Livertovsky, Shahar Somin, Gonen Singer

发表机构 * Bar-Ilan University（巴伊兰大学）

专题命中其他LLM ：注意力头剪枝方法适用于Transformer，包括LLM

AI总结提出CAHP框架，将注意力头选择建模为全局图论问题，通过图聚类和信息论距离保留互补头，自动确定剪枝数量，在SST-5和MNLI上优于现有方法。

Comments 9 pages, 4 figures, 3 tables. Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2026

详情

AI中文摘要

基于Transformer的模型在自然语言处理中的显著成功源于架构的规模化，这导致大量参数并阻碍了在资源受限环境中的部署。虽然结构化剪枝提供了一条压缩路径，但现有的最先进方法通常依赖于基于梯度的重要性排序或随机门控，这些方法存在不稳定性、结构退化以及需要大量手动超参数调整的问题。在本文中，我们引入了CAHP（互补注意力头剪枝），一种新颖的事后框架，将头选择重新定义为全局图论问题。CAHP不是孤立地评估头，而是利用基于图的聚类结合信息论距离度量来识别并保留一组拓扑多样化的互补注意力头。无需预定义稀疏度或剪枝比例，该框架通过识别递减的边际性能曲线自动确定各层中保留的注意力头数量，其中根据所选多项式次数，剪除额外头会导致性能急剧下降。在SST-5和MNLI基准上跨不同Transformer模型规模的广泛评估表明，CAHP始终优于竞争基线，特别是在高压缩率情况下。此外，我们的结构分析表明，CAHP避免了基于梯度的剪枝方法的“邻近偏差”（倾向于主要保留靠近输出层的头），而是保留了模型中间层中功能关键的注意力头集合。

英文摘要

The remarkable success of Transformer-based models in natural language processing stems from architectural scaling, which leads to a large number of parameters and hinders deployment in resource-constrained environments. While structured pruning offers a pathway to compression, existing state-of-the-art methods often rely on gradient-based importance ranking or stochastic gating, which suffer from instability, structural degeneration, and the need for extensive manual hyperparameter tuning. In this paper, we introduce CAHP (Complementary Attention Head Pruning), a novel post-hoc framework that redefines head selection as a global graph-theoretical problem. Rather than evaluating heads in isolation, CAHP utilizes graph-based clustering combined with information-theoretic distance measures to identify and preserve a topologically diverse subset of complementary attention heads. Without requiring a predefined sparsity level or pruning ratio, the framework automatically determines the number of selected attention heads across layers by identifying a diminishing marginal performance curve, where pruning additional heads leads to a sharp degradation in performance, as determined by the chosen polynomial degree. Extensive evaluations on the SST-5 and MNLI benchmarks, across different Transformer model scales, demonstrate that CAHP consistently outperforms competitive baselines, particularly in high-compression regimes. Furthermore, our structural analysis shows that CAHP avoids the "proximity bias" of gradient-based pruning methods, which tend to preserve heads mainly in layers close to the output, and instead retains a functionally critical set of attention heads in the model's intermediate layers.

URL PDF HTML ☆

赞 0 踩 0

2606.19144 2026-06-18 cs.AI cs.CL 新提交 60%

Human-AI Coevolution Dynamics: A Formal Theory of Social Intelligence Emergence Through Long-Term Interaction

人机协同演化动力学：长期互动中社会智能涌现的形式理论

Jingyi Zhou, Senlin Luo, Haofan Chen

发表机构 * School of Information and Electronics, Beijing Institute of Technology（信息与电子学院，北京理工大学）； Institute of Scientific and Technical Research on Archives, Beijing（档案科学与技术研究所，北京）； China Electronics Engineering Design Institute Co., Ltd.（中国电子工程设计院有限公司）

专题命中其他LLM ：人机交互理论框架，涉及LLM但非核心

AI总结提出人机协同演化动力学框架（HACD-H），将情感适应、关系组织、社会记忆和人格一致性整合为统一动力学模型，通过约14,700轮对话数据集验证，发现社会智能与社会认知能量显著负相关，揭示社会智能源于长期协同演化。

详情

AI中文摘要

当前的对话式AI系统在语言生成、个性化和长上下文交互方面取得了显著进展。然而，大多数现有方法通过孤立组件（如情感建模、记忆检索或人格条件化）来建模社会行为，缺乏一个统一的框架来解释长期人机交互中稳定社会关系和社会智能的涌现。为解决这一问题，我们提出了人机协同演化动力学框架（HACD-H），这是一个将人机交互建模为自组织社会认知系统的形式模型。HACD-H将情感适应、关系组织、社会记忆和人格一致性整合到一个统一的动力学框架中，并引入了多时间尺度社会认知、关系吸引子、信任盆地、发展相变和社会认知能量景观等原则。我们构建了一个约14,700轮交互的对话数据集，并开发了一个理论驱动的实证评估框架。结果揭示了社会认知中的时间持久性层次结构、稳定的关系吸引子、类似相变的发展模式以及结构化的社会认知能量景观。社会智能与社会认知能量呈显著负相关（r = -0.391, p < 0.001），且交互轨迹随时间呈现渐进性能量减少。这些发现表明，社会智能源于长期的社会认知协同演化，而非孤立的对话能力。HACD-H为建模适应性人机社会交互和开发社会智能AI系统提供了统一的理论基础。

英文摘要

Current conversational AI systems have made significant progress in language generation, personalization, and long-context interaction. However, most existing methods model social behavior through isolated components such as emotion modeling, memory retrieval, or persona conditioning, lacking a unified framework to explain the emergence of stable social relationships and social intelligence in long-term human-AI interaction.To address this, we propose the Human-AI Coevolution Dynamics Framework (HACD-H), a formal model of human-AI interaction as a self-organizing social cognitive system. HACD-H integrates emotional adaptation, relational organization, social memory, and personality consistency into a unified dynamical framework and introduces principles including multi-timescale social cognition, relational attractors, trust basins, developmental phase transitions, and social cognitive energy dynamics.We construct a conversational dataset with approximately 14,700 interaction turns and develop a theory-driven empirical evaluation framework. Results reveal a hierarchy of temporal persistence in social cognition, stable relational attractors, phase-transition-like developmental patterns, and a structured social cognitive energy landscape. Social intelligence shows a significant negative correlation with social cognitive energy (r = -0.391, p < 0.001), and interaction trajectories exhibit progressive energy reduction over time.These findings suggest that social intelligence emerges from long-term social cognitive coevolution rather than isolated conversational capabilities. HACD-H provides a unified theoretical foundation for modeling adaptive human-AI social interaction and developing socially intelligent AI systems.

URL PDF HTML ☆

赞 0 踩 0

2606.19121 2026-06-18 cs.SE cs.CL cs.HC 新提交 60%

Written by AI, Managed by AI: Semantic Space Control and Index Sickness Elimination Across 391 Consecutive Sessions

由AI编写，由AI管理：跨越391个连续会话的语义空间控制与索引病消除

Hui Zhang, Shuren Song

发表机构 * Shenzhen Yunxi Technology Co., Ltd.（深圳云曦科技有限公司）； Information Technology Center, Tsinghua University（清华大学信息科学技术中心）

专题命中其他LLM ：研究LLM协作中的工程问题

AI总结本文通过真实软件项目中的行动研究，发现长期LLM协作中增加形式约束反而导致“索引病”，提出“基线-日志物理分离”机制，有效消除该问题。

Comments 22 pages, 2 tables, 1 figure. Action research. Bilingual submission (Chinese companion version included as supplementary). Submitted to ICSE 2027 IOR track

详情

AI中文摘要

解决长期LLM协作中概念漂移的主流工程直觉是，用更多的形式约束换取更可靠的输出——设计符号标识符系统，在系统提示中积累防御规则，扩展上下文窗口。我们的工程记录表明，在长期设置中，这种方向可能产生与设计意图相反的效果。通过在跨越约一个月和391个协作会话的真实软件项目（Bang-v3）中使用行动研究方法，我们记录并分析了这些策略的失败过程。当符号系统超过复杂度阈值时，LLM并不会变得更准确——相反，它们放弃了对业务语义的真正理解，退回到符号层内的自我指涉推理，并生成看似内部一致但实际上与现实脱节的输出。我们将这种失败模式命名为“索引病”，其典型表现为“幻影立法”。我们将底层原理命名为“庞原理（语义活力定律）”：带有明确目的的自然语言传达的信息质量远高于符号表达。由此，我们设计并验证了其物理工程机制：“基线-日志物理分离”。在同一项目中，该机制将AI指令量减少了约75%，并且在随后的约150个会话中，未观察到索引病复发。附有双语对照版本（中文）作为补充材料。

英文摘要

The prevailing engineering intuition for addressing conceptual drift in long-horizon LLM collaboration is to trade more formal constraints for more reliable outputs -- designing symbolic identifier systems, accumulating defensive rules in System Prompts, expanding context windows. Our engineering record shows that in long-horizon settings, this direction may produce effects contrary to design intent. Using action research methods in a real software project (Bang-v3) spanning approximately one month and 391 collaborative sessions, we document and analyze the failure process of these strategies. When the symbolic system exceeds a complexity threshold, LLMs do not become more accurate -- instead, they abandon genuine understanding of business semantics, retreat to self-referential reasoning within the symbolic layer, and generate outputs that appear internally consistent but are physically disconnected from reality. We name this failure pattern "Index Sickness," and its canonical manifestation "Phantom Legislation." We name the underlying principle the "Pang Principle (Semantic Vitality Law)": natural language carrying explicit purpose conveys far greater information quality than symbolic expression. From this, we design and validate its physical engineering mechanism: "Baseline-Log Physical Separation." In the same project, this mechanism reduced AI Instructions volume by ~75%, and across the subsequent ~150 sessions, no recurrence of Index Sickness was observed. A bilingual companion version (Chinese) is included as supplementary material.

URL PDF HTML ☆

赞 0 踩 0

2606.19111 2026-06-18 cs.CL cs.AI cs.MA 新提交 60%

Leadership as Coordination Control: Behavioral Signatures and the Recovery-Advantage Boundary in Multi-Agent LLM Teams

领导力作为协调控制：多智能体LLM团队中的行为特征与恢复优势边界

Haewoon Kwak

发表机构 * Indiana University Bloomington（印第安纳大学布卢明顿分校）

专题命中其他LLM ：研究LLM团队行为，但非模型本身

AI总结研究多智能体LLM团队中过程级协调控制何时增加价值，通过行为特征和消融实验发现，控制器的优势仅在初始多数投票不可靠、任务可恢复且无指导交互无法修复时出现，验证了权变理论。

Comments 33 pages

详情

AI中文摘要

团队科学认为领导力是权变的：它仅在特定条件下有帮助，而能力强的自主团队可能根本不需要领导。我们对多智能体LLM团队提出类似问题：在什么可测量的条件下，过程级协调控制会增加价值，这些条件是否与团队科学的预测一致？我们使用行为特征（多数锁定、探索、从错误的第0轮共识中恢复）和每动作消融实验，因为每个控制器是一个显式动作集，而不是一个整体提示。我们将三种经典领导风格（交易型、变革型、情境型）操作化为对共享动作词汇（探索、修订、接受、综合）的控制器。一个具有相同动作但使用任意规则的匹配控制器恢复效果不优于多数投票，因此是理论推导的规则（而非词汇）起作用。在四个任务体系和三个开放权重模型系列中，没有控制器在准确率上占主导地位，正如权变观点所预测的：交易型控制在所有12个（模型、体系）组合上与共享的第0轮投票匹配，差异在1.3个百分点以内，仅在初始多数不可靠的一个组合上出现增益（llama-4-scout社会性；情境型比扁平型高8个百分点）。通过四个边界探针测试的恢复优势解释表明，控制器仅在初始多数投票不可靠、任务可恢复且无指导交互无法修复时优于纯交互。这些区域映射到权变理论（领导替代、路径-目标冗余、情境准备差距），因此基本为零的准确率结果正是理论所预测的，而非控制器的失败。我们将过程级协调控制视为一种需要测量和理论映射的权变因素，而不是需要超越的排行榜。

英文摘要

Team science holds that leadership is contingent: it helps only under specific conditions, and capable, autonomous teams may need none at all. We ask the analogous question for multi-agent LLM teams: under what measurable conditions does process-level coordination control add value, and do those conditions match what team science predicts? We use behavioral signatures (majority lock-in, exploration, recovery from an incorrect round-0 consensus) and per-action ablations, clean because each controller is an explicit action set, not a monolithic prompt. We operationalize three classical leadership styles (transactional, transformational, situational) as controllers over a shared action vocabulary (explore, revise, accept, synthesize). A matched controller with the same actions but an arbitrary rule recovers no better than majority voting, so the theory-derived rule, not the vocabulary, does the work. Across four task regimes and three open-weight model families, no controller dominates by accuracy, as the contingency view predicts: transactional control matches a shared round-0 vote on all 12 (model, regime) combinations to within 1.3pp, and gains appear only on the one combination where the round-0 majority is unreliable (llama-4-scout social; situational +8pp over flat). A recovery-advantage account, tested with four boundary probes, says a controller beats plain interaction only where the round-0 majority is unreliable, the task is recoverable, and undirected interaction does not already repair it. These regions map onto contingency theory (leadership substitutes, path-goal redundancy, the situational readiness gap), so a largely null accuracy result is what the theory predicts, not a failure of the controllers. We read process-level coordination control as a contingency to be measured and theory-mapped, not a leaderboard to be topped.

URL PDF HTML ☆

赞 0 踩 0

2606.19108 2026-06-18 cs.LG 新提交 60%

JourneyFormer: Encoding Airbnb Guest Journey with Sequence Modeling

JourneyFormer: 使用序列建模编码Airbnb客人旅程

Daochen Zha, Chun How Tan, Xin Liu, Bin Xu, Han Zhao, Xiaowei Liu, Tracy Yu, Hui Gao, Huiji Gao, Liwei He, Stephanie Moyerman, Sanjeev Katariya

发表机构 * Airbnb

专题命中其他LLM ：序列建模用于推荐，非LLM核心

AI总结针对Airbnb中客人序列长、探索性强且标签稀疏的问题，提出JourneyFormer序列建模解决方案，通过优化数据选择、ID嵌入、模型架构和标签归因，并在两个生产面上通过在线A/B测试验证了其有效性。

Comments Accepted by KDD 2026

详情

AI中文摘要

序列建模因其能够建模用户历史行为并推断用户意图，在推荐和排序算法中越来越受欢迎。尽管理论简单，但由于序列的复杂性和稀疏标签，序列模型在生产中的实际部署并非易事。例如，在Airbnb中，客人序列通常较长、具有探索性且复杂，我们关注的是稀疏的预订标签。因此，我们经常需要在数据和建模方面做出各种设计决策，以在有效性和可扩展性之间取得平衡。本文深入探讨了这些生产挑战，并部署了JourneyFormer，一种用于Airbnb搜索排序的序列建模解决方案。我们详细介绍了关键的设计考虑，涵盖客人事件选择、ID嵌入、模型架构和标签归因等方面。此外，我们描述了几种加速模型训练和推理的定制策略。JourneyFormer已成功部署在Airbnb的生产环境中，其有效性和影响不仅通过改进的离线排序指标得到证明，而且通过两个生产面上的在线A/B测试在关键业务指标上取得了显著提升。

英文摘要

Sequence modeling has become increasingly popular in recommendation and ranking algorithms, owing to its capacity to model users' historical behaviors and infer user intentions. Despite its theoretical simplicity, the practical deployment of a sequence model in production is non-trivial due to complexity of the sequence and sparse labels. For example, in Airbnb, guest sequences are often long, exploratory and complex, and we focus on booking labels, which are sparse. As such, we are often required to make various design decisions regarding data and modeling to strike a balance between effectiveness and scalability. This work delved into these production challenges and deployed JourneyFormer, a sequence modeling solution for search ranking at Airbnb. We detail crucial design considerations, covering aspects such as guest event selection, ID embeddings, model architecture, and label attribution. Additionally, we describe several tailored strategies to accelerate model training and inference. JourneyFormer has been successfully deployed within Airbnb's production, where its effectiveness and impact have been evidenced not only by improved offline ranking metrics but also by significant gains in key business metrics through online A/B testing across 2 production surfaces.

URL PDF HTML ☆

赞 0 踩 0

2606.18923 2026-06-18 cs.LG 新提交 60%

GrapNet: A Programmable Dynamic-Architecture Neural Graph Substrate

GrapNet: 一种可编程的动态架构神经图基板

Zirong Li

发表机构 * Zirong Li（李子荣）

专题命中其他LLM ：提出可编程神经图基板，非LLM核心

AI总结提出GrapNet，一种将图作为可执行架构的神经基板，通过可编程接口支持结构编辑、冻结子图、局部审计等操作，在Split Fashion-MNIST和Split CIFAR-10上分别提升12.08和3.81个百分点的准确率。

Comments 8 pages, 1 figure, preprint

详情

AI中文摘要

可编程性是固定张量神经网络中缺失的一流接口：编辑关系、冻结子图、审计局部函数或更改执行后端应是对神经程序的操作，而非临时参数手术。GrapNet研究这种图即网络的设置。图是架构和可执行程序，而非输入数据图。每个计算节点拥有其下一层子节点引用和与这些引用对齐的可训练分配向量；删除关系会物理移除子节点引用和相应的分配坐标。结构规则和执行策略位于节点核心之外，因此同一子节点拥有的图可以被增长、冻结、结构编辑、分组为可训练族块、通过注意力在活动关系上路由，或在拓扑稳定后降级为密集快照。GrapNet通过向量值父接口与常规模块组合：密集层、CNN编码器、ResNet特征提取器、注意力块和Transformer表示都可以为每个坐标提供一个感知GrapNode。评估组织为可编程性压力测试套件，而非新的重放基准。在匹配的十种子Split Fashion-MNIST研究中，可塑GrapNet+ER头在相同已见类损失和重放记忆下达到63.16%的已见类准确率，而参数更大的密集MLP+ER为51.08%，配对差值为12.08点，p=1.3e-5。在Split CIFAR-10上使用冻结的ImageNet ResNet-18编码器时，相同基板将在线头比MLP-256提高3.81点，p=0.0026。这些结果支持GrapNet作为可编辑的神经图基板，其核心价值在于具有忠实执行视图的结构可编程性。

英文摘要

Programmability is a missing first-class interface in fixed-tensor neural networks: editing a relation, freezing a subgraph, auditing a local function, or changing the execution backend should be an operation on the neural program rather than ad-hoc parameter surgery. GrapNet studies this graph-as-network setting. The graph is the architecture and executable program, not an input data graph. Each compute node owns its next-layer child references and a trainable allocation vector aligned with those references; deleting a relation physically removes both the child reference and the corresponding allocation coordinate. Structural rules and execution policies live outside the node core, so the same child-owned graph can be grown, frozen, structurally edited, grouped into trainable family blocks, routed by attention over active relations, or lowered to dense snapshots after topology stabilizes. GrapNet composes with conventional modules through a vector-valued parent interface: dense layers, CNN encoders, ResNet feature extractors, attention blocks, and transformer representations can all feed one sensory GrapNode per coordinate. The evaluation is organized as a programmability stress suite rather than as a new replay benchmark. In a matched ten-seed Split Fashion-MNIST study, a plastic GrapNet+ER head reaches 63.16 percent seen-class accuracy versus 51.08 percent for a parameter-larger dense MLP+ER under the same seen-class loss and replay memory, with paired delta 12.08 points and p=1.3e-5. On Split CIFAR-10 with a frozen ImageNet ResNet-18 encoder, the same substrate improves the online head over MLP-256 by 3.81 points, with p=0.0026. These results support GrapNet as an editable neural graph substrate whose core value is structural programmability with faithful execution views.

URL PDF HTML ☆

赞 0 踩 0

2606.18856 2026-06-18 cs.CL cs.LG 新提交 60%

Approximate Structured Diffusion for Sequence Labelling

近似结构化扩散用于序列标注

Nicolas Floquet, Joseph Le Roux, Nadi Tomeh

发表机构 * Université Sorbonne Paris Nord, CNRS, Laboratoire d’Informatique de Paris Nord, LIPN（巴黎北大学 Sorbonne、法国国家科学研究中心、巴黎北信息学实验室、LIPN）

专题命中其他LLM ：扩散模型用于序列标注，非LLM核心但相关。

AI总结提出一种基于扩散的条件随机场（CRF）训练方法，通过引入标签噪声条件来捕捉长距离依赖，结合近似推理在词性标注任务上实现16.5%的错误率降低。

2606.18852 2026-06-18 cs.CL cs.AI 新提交 60%

Aligning Implied Statements for Implicit Hate Speech Generalizability with Context-Bounded Semi-hard Negative Mining

对齐隐含陈述：通过上下文边界半硬负挖掘实现隐式仇恨言论的泛化性

Wicaksono Leksono Muhamad, Yunita Sari

发表机构 * Mantera Studio（Mantera工作室）； Universitas Gadjah Mada（加雅玛大学）

专题命中其他LLM ：隐式仇恨言论分类，使用对比学习。

AI总结提出ImpSH三元组框架，通过将帖子与隐含陈述对齐并使用上下文边界半硬负样本聚焦学习，提升隐式仇恨言论的跨域泛化能力，在多个数据集上优于对比基线。

详情

AI中文摘要

隐式仇恨言论分类仍然是一个挑战，因为意图通常通过暗示和上下文而非明确辱骂来掩盖。先前的监督对比方法改进了域内检测，但可能过拟合表面线索，且难以跨数据集迁移。我们提出ImpSH，一个基于三元组的框架，当隐含陈述可用时将其与帖子对齐，并使用上下文边界半硬负样本将学习聚焦于近混淆项。我们还研究了AugSH，它通过数据增强形成正样本。在使用BERT和HateBERT对IHC、SBIC和DynaHate进行的受控评估中，ImpSH是标准监督对比基线的可行替代方案，并且在匹配的预处理和调优预算下通常能提高跨域性能。使用对齐性和均匀性进行的表示分析表明，正样本对更紧密且全局分布平衡，定性最近邻案例研究展示了域转移下的典型假负例。这些结果表明，通过上下文边界挖掘将帖子与其隐含陈述对齐，提供了到相关暗示的更稳定、类似双射的映射，克服了传统基于聚类的表示学习固有的波动性。

英文摘要

Classifying implicit hate speech remains a challenge, as intent is often masked through insinuation and context rather than explicit slurs. Prior supervised contrastive approaches improve in-domain detection but can overfit surface cues and struggle to transfer across datasets. We propose ImpSH, a triplet-based framework that aligns posts with implied statements when available and uses context-bounded semi-hard negatives to focus learning on near confusions. We also examine AugSH, which forms positives via data augmentation. In controlled evaluations on IHC, SBIC, and DynaHate with BERT and HateBERT, ImpSH is a viable alternative to standard supervised contrastive baselines and often improves cross-domain performance under matched preprocessing and tuning budgets. Representation analysis using alignment and uniformity indicates tighter positive pairs with balanced global spread, and qualitative nearest-neighbor case studies illustrate typical false negatives under domain shift. These results demonstrate that aligning posts with their implied statements via context-bounded mining provides a more stable, bijective-like mapping to related insinuations, overcoming the volatility inherent in traditional clustering-based representation learning.

URL PDF HTML ☆

赞 0 踩 0

2606.18820 2026-06-18 cs.LG cs.AI 新提交 60%

Maturing Markov Decision Processes: Decision Making under Increasing Information and Shrinking Action Sets

成熟马尔可夫决策过程：信息增加与动作集缩小下的决策制定

Jiaxi Liu, Aiping Yang, Yuhang Yang, Shuqi Zhang, Zewei Dong, Jiangming Yang, Xuebin Chen

发表机构 * Ant International（蚂蚁国际）； School of Economics, Sichuan University（四川大学经济学院）； School of Economics, Fudan University（复旦大学经济学院）

专题命中其他LLM ：提出MMDP框架，结构感知强化学习，与LLM弱相关

AI总结针对决策过程中信息增加与动作集缩小的不对称性，提出成熟马尔可夫决策过程（MMDP）框架，并基于过期动作优先级原则开发结构感知强化学习方法，实验证明其能提升学习效率。

Comments 25 pages, 9 figures

详情

AI中文摘要

序列决策问题通常表现出信息和决策灵活性的不对称演化：随着决策周期的展开，智能体获得更丰富的信息，而由于操作截止、承诺或资源约束，可行动作逐渐过期。标准的MDP公式通常将这种结构扁平化为阶段相关的状态描述和动作掩码，从而掩盖了嵌套的信息-动作不对称性，而这种不对称性决定了哪些决策是紧急的、哪些可以推迟。我们引入了成熟马尔可夫决策过程（MMDP），这是一种围绕这种信息-动作不对称性构建的公式。我们通过一个过期动作优先级原则来刻画其关键后果之一，该原则识别出必须在下一阶段之前解决的动作。受此结构启发，我们开发了一个结构感知的强化学习框架，包括阶段感知的策略设计、过期动作抽象以及带有蒸馏的搜索增强学习。在受控的多供应商补货问题、复杂度递增的简化现金管理环境以及生产级模拟器上的实验表明，显式建模这种不对称性可以提高学习效率，并且随着决策问题的规模扩大，其价值日益增加。

英文摘要

Sequential decision problems often exhibit an asymmetric evolution of information and decision flexibility: as a decision cycle unfolds, the agent receives richer information while feasible actions expire due to operational cutoffs, commitments, or resource constraints. Standard MDP formulations typically flatten this structure into stage-dependent state descriptions and action masks, thereby obscuring the nested information--action asymmetry that determines which decisions are urgent and which can be deferred. We introduce Maturing Markov Decision Processes (MMDPs), a formulation built around this information--action asymmetry. We characterize one of its key consequences through an expiring-action priority principle, which identifies the actions that must be resolved before the next stage. Motivated by this structure, we develop a structure-aware reinforcement learning framework with stage-aware policy design, expiring-action abstraction, and search-augmented learning with distillation. Experiments on a controlled multi-supplier replenishment problem, simplified cash-management environments of increasing complexity, and a production-scale simulator show that explicitly modeling this asymmetry improves learning efficiency and becomes increasingly valuable as decision problems scale.

URL PDF HTML ☆

赞 0 踩 0

2606.18790 2026-06-18 cs.SD cs.AI cs.LG 新提交 60%

Closing the Loop: PID Feedback Control for Interpretable Activation Steering in Symbolic Music Generation

闭环：用于符号音乐生成中可解释激活引导的PID反馈控制

Ioannis Prokopiou, Pantelis Vikatos, Maximos Kaliakatsos-Papakostas, Theodoros Giannakopoulos, Themos Stafylakis

发表机构 * Athens University of Economics and Business（雅典经济与商业大学）； Orfium Research（Orfium 研究）； Hellenic Mediterranean University（希腊地中海大学）； Archimedes / Athena Research Center（阿基米德/雅典娜研究中心）

专题命中其他LLM ：符号音乐生成中的激活引导，与LLM弱相关

AI总结提出基于PID反馈控制的推理时激活引导框架，通过差分均值法提取音高和时长潜在方向，并利用Gram-Schmidt正交化解耦多属性引导，实现符号音乐生成中细粒度、可解释的属性调制。

Comments Accepted at Learning to Listen: ICML 2026 Workshop on Machine Learning for Audio (43rd International Conference on Machine Learning - ICMLMLA26), 4 pages main (11 total), 2 figures

详情

AI中文摘要

基于Transformer的架构在生成复杂符号序列方面取得了显著进展，但在实现对离散信号属性的细粒度、可解释控制方面仍存在明显差距。本文研究了多轨音乐Transformer（MMT）的机制可解释性，并提出了一种无需重新训练即可通过推理时激活引导实现确定性属性调制的框架。利用差分均值（DiffMean）方法，我们在残差流中分离出信号属性（特别是音高和时长）的潜在方向。我们验证了该领域的线性表示假设，实现了引导幅度与属性偏移之间的高相关性。为了解决多属性引导中固有的特征纠缠问题，我们引入了一种利用Gram-Schmidt正交化的双引导框架。实验结果表明，与朴素向量加法相比，这种几何解耦减少了概念干扰和信号退化，即使在强自回归条件下也能实现独立的确定性控制。

英文摘要

Transformer-based architectures have significantly advanced the generation of complex symbolic sequences, yet a significant gap remains in achieving fine-grained, interpretable control over discrete signal attributes. This paper investigates the mechanistic interpretability of the Multitrack Music Transformer (MMT) and proposes a framework for deterministic attribute modulation without retraining to bridge this gap via inference-time activation steering. Utilizing the Difference-in-Means (DiffMean) methodology, we isolate latent directions for signal attributes, specifically Pitch and Duration, within the residual stream. We validate the Linear Representation Hypothesis in this domain, achieving high correlation between steering magnitude and attribute shift. To address the inherent feature entanglement in multi-attribute steering, we introduce a Dual Steering framework utilizing Gram-Schmidt Orthogonalization. Experimental results demonstrate that this geometric decoupling reduces conceptual interference and signal degradation compared to naive vector addition, enabling independent deterministic control even against strong autoregressive conditioning.

URL PDF HTML ☆

赞 0 踩 0

2606.18548 2026-06-18 cs.CY cs.AI 新提交 60%

Engagement Intensity as a Learner-Modeling Signal for Adaptive AI Ethics Instruction

参与强度作为自适应AI伦理教学的学习者建模信号

Yongkyung Oh, Lynn Talton, Alex Bui

发表机构 * University of California, Los Angeles (UCLA)（加州大学洛杉矶分校）

专题命中其他LLM ：研究LLM使用频率与AI感知关系

AI总结本研究比较了三种学习者特征（使用频率、自评熟悉度、先前AI教育）与AI感知结果的关系，发现使用频率与所有五项结果显著相关，为自适应AI伦理教学提供了简单的入学者建模信号。

详情

AI中文摘要

在研究生研究训练中，自适应AI伦理教学受益于反映先前LLM经验差异的入学者测量指标。先前的课程或研讨会参与是一个明显的候选指标，但尚不清楚它是否与关键AI感知项目的教学前评分相关。我们比较了三种候选入学者特征：自我报告的使用频率、自评LLM熟悉度和先前AI教育，针对93名参加必修研究伦理课程的生命科学研究生和博士后学员的五项基线感知结果。使用频率与所有五项结果显示出Holm校正的关联，自评熟悉度与三项结果相关，而先前AI教育与任何结果均无关联。在量表低端呈现阈值模式，在训练兴趣和准确性信任方面最为明显，而非在所有五项结果上呈现均匀梯度。在简短的入学者调查中，报告的LLM使用比先前的课程或研讨会更一致地与这些感知相关，自评熟悉度作为次要指标。这些结果表明，简单的教学前行为信号可以为自适应AI伦理教育的轻量级入学者画像提供信息。

英文摘要

Adaptive AI ethics instruction in graduate research training benefits from intake measures that reflect differences in prior LLM experience. Prior coursework or workshop attendance is an obvious candidate, but it is not clear whether it is associated with pre-instruction ratings on key AI perception items. We compare three candidate intake features, self-reported usage frequency, self-rated LLM familiarity, and prior AI education, across five baseline perception outcomes in 93 bioscience graduate and postdoctoral trainees enrolled in a required research ethics course. Usage frequency shows Holm-corrected associations with all five outcomes, self-rated familiarity with three, and prior AI education with none. A threshold-like pattern at the lower end of the scale is most visible for training interest and accuracy trust rather than appearing as a uniform gradient across all five outcomes. In a short intake survey, reported LLM use is more consistently associated with these perceptions than prior coursework or workshops, with self-rated familiarity serving as a secondary indicator. These results suggest that simple pre-instruction behavioral signals can inform lightweight intake profiling for adaptive AI ethics education.

URL PDF HTML ☆

赞 0 踩 0

2606.18539 2026-06-18 cs.LG stat.ML 新提交 60%

TS-Fault: Benchmarking Time Series Forecasters Against Structural Faults

TS-Fault: 针对结构性故障的时间序列预测器基准测试

Yuyang Zhao, Lian Xu, Hao Miao, Chenxi Liu, Hao Xue

发表机构 * Ray-zyy

专题命中其他LLM ：评估时间序列预测模型鲁棒性

AI总结提出TS-Fault基准，通过参数化故障场景（沿观测/机制、单变量/多变量两轴）评估时间序列预测模型鲁棒性，发现干净数据准确性与鲁棒性负相关、机制级故障重排排名、基础模型最脆弱。

详情

AI中文摘要

时间序列预测（TSF）支撑着能源、交通、金融和医疗等领域的关键决策，然而TSF模型几乎普遍通过在干净保留数据上的单一数字（如平均误差）进行排名，隐含假设该数字能预测部署可靠性。但实际故障并非独立同分布噪声，而是具有时间形状的结构化事件、断裂的跨变量依赖、伴随缺失的机制变化以及跨传感管道的因果传播。将TSF鲁棒性视为数据质量问题，我们提出TS-Fault，一个在显式、参数化且具有可控语义难度的故障场景下评估预测模型的基准。TS-Fault将重复出现的故障沿两个正交轴（观测级 vs 机制级；单变量 vs 多变量）组织为四种模式，并通过统一重要性评分将每种故障注入最关键的预测窗口。该设计使得鲁棒性能够针对模型实际依赖的结构进行测试，而非简化为通用噪声敏感性。我们在6个数据集、4种模式和5个难度级别上，采用配对干净/损坏协议评估了21个模型。结果揭示了三个与常见排行榜直觉相悖的发现：（i）干净数据准确性与鲁棒性负相关；（ii）干净排名在观测级故障下保持不变，但在机制级故障下重新洗牌；（iii）所有灾难性故障均发生在机制级故障下，基础模型在干净数据上准确率最高但表现出最大的脆弱性。代码已公开于该URL。

英文摘要

Time series forecasting (TSF) underpins consequential decisions in energy, transportation, finance, and healthcare, yet TSF models are almost universally ranked by a single number (e.g., average error) on clean held-out data, under the implicit assumption that it predicts deployed reliability. However, real faults are not i.i.d noise but structured events with temporal shape, broken cross-variable dependencies, regime change coupled with missingness, and causal propagation across a sensing pipeline. Treating TSF robustness as a data-quality problem, we present TS-Fault, a benchmark that evaluates forecasting models under explicit, parameterized fault scenarios with controllable semantic difficulty. TS-Fault organizes recurring failures into four modes along two orthogonal axes (observation- vs mechanism-level; univariate vs multivariate) and injects each fault into the most prediction-critical window via a unified importance score. This design enables robustness to be tested against the structures models actually rely on, rather than reduced to generic noise sensitivity. We evaluate 21 models across 6 datasets, 4 modes, and 5 difficulty levels under a paired clean/corrupt protocol. The results reveal three findings that contradict common leaderboard intuition: (i) clean-data accuracy anti-correlates with robustness; (ii) clean rankings are preserved under observation-level faults but reshuffled under mechanism-level faults; and (iii) all catastrophic failures occur under mechanism-level faults, with foundation models achieving the highest clean-data accuracy yet exhibiting the greatest fragility. The code is publicly available at https://github.com/Ray-zyy/TS-Fault.

URL PDF HTML ☆

赞 0 踩 0

2606.18525 2026-06-18 cs.LG 新提交 60%

Hierarchical Attention via Domain Decomposition

基于区域分解的层次注意力机制

Stephan Köhler, Oliver Rheinbach

发表机构 * Faculty of Mathematics and Computer Science（数学与计算机科学系）

专题命中其他LLM ：提出层次注意力机制改进Transformer

AI总结提出一种基于两水平重叠Schwarz区域分解的层次注意力机制，通过局部低秩注意力块与粗网格注意力块结合，在少参数下实现更快训练和更高精度。

Comments 20 pages, 10 figures

详情

AI中文摘要

我们提出了一种基于两水平重叠Schwarz区域分解的层次注意力机制。该方法的动机源于观察到两水平Schwarz区域分解方法将局部子域校正与一个传达全局、长程信息的粗水平相结合。我们在一个具有齐次Dirichlet边界条件的一维扩散问题背景下，测试了其在有限维算子学习中的实用性。尽管该问题简单，但它提供了一个受控的序列到序列设置，其中精确的非局部解算子已知。离散化后，学习解算子相当于逼近一个对称正定矩阵的逆。作为基线，我们使用一个全局无softmax的低秩注意力算子，形式为$QK^T$。所提出的构造将这个密集的全局分解替换为一个两水平加性结构：重叠子域上的局部低秩注意力块与一个粗注意力块相结合。得到的算子形式为$$M_{\theta}^{-1} = \Phi Q_0 K_0^T \Phi^T + \sum_{i=1}^{N} R_i^T D_i^{1/2} Q_i K_i^T D_i^{1/2} R_i.$$ 这里$R_i$限制到重叠子域，$D_i$是单位划分权重，$\Phi$是粗插值（或延拓）矩阵。针对合成Fourier右端项的数值实验表明，区域分解注意力算子能够比全局低秩注意力基线训练更快，并在使用显著更少参数的情况下提供更精确的逼近。

英文摘要

We propose a hierarchical attention mechanism based on two-level overlapping Schwarz domain decomposition. The method is motivated by the observation that two-level Schwarz domain decomposition methods combine local subdomain corrections with a coarse level that communicates global, long-range information. We test its usefulness in the context of finite-dimensional operator learning using a simple, one-dimensional diffusion problem with homogeneous Dirichlet boundary conditions. Although elementary, this problem provides a controlled sequence-to-sequence setting in which the exact nonlocal solution operator is known. After discretization, learning the solution operator amounts to approximating the inverse of a symmetric positive definite matrix. As a baseline, we use a global softmax-free low-rank attention operator of the form $QK^T$. The proposed construction replaces this dense global factorization by a two-level additive structure: local low-rank attention blocks on overlapping subdomains are combined with a coarse attention block. The resulting operator has the form $$M_θ^{-1} = ΦQ_0 K_0^T Φ^T + \sum_{i=1}^{N} R_i^T D_i^{1/2} Q_i K_i^T D_i^{1/2} R_i.$$ Here $R_i$ restricts to an overlapping subdomain, $D_i$ is a partition-of-unity weight, and $Φ$ is a coarse interpolation (or prolongation) matrix. Numerical experiments for synthetic Fourier right-hand sides indicate that the domain-decomposition attention operator is able to train faster and can give more accurate approximations than a global low-rank attention baseline while using significantly fewer parameters.

URL PDF HTML ☆

赞 0 踩 0

2606.18519 2026-06-18 cs.RO cs.AI 新提交 60%

As You Wish: Mission Planning with Formal Verification using LLMs in Precision Agriculture

如您所愿：利用LLM在精准农业中进行形式化验证的任务规划

Marcos Abel Zuzuárregui, Stefano Carpin

发表机构 * University of California, Merced（加州大学默塞德分校）

专题命中其他LLM ：利用LLM进行形式化验证的任务规划

AI总结针对自然语言歧义性，提出基于线性时序逻辑（LTL）反馈循环的LLM任务规划系统，通过双LLM分工实现规范生成与验证，提升精准农业任务规划的可靠性。

Journal ref Published in Proceedings of 2026 International Conference on Robotics and Automation (ICRA)

详情

AI中文摘要

尽管机器人系统现已商业化并部署于各行各业，但许多系统高度专业化，通常需要高级技能才能操作并确保其按指令执行。为缓解这一问题，我们近期引入了一个任务规划器，利用大语言模型（LLM）根据自然语言描述的任务描述合成精准农业中的任务计划。虽然该系统表现出色，但也存在自然语言固有的歧义性。本文通过引入多个基于线性时序逻辑（LTL）的反馈循环来扩展我们的系统，以确保任务规划系统满足用户制定的规范，同时仍使用自然语言。为减轻潜在偏差，我们使用两个不同的商业LLM分别负责规范生成和验证子任务。通过大量实验，我们强调了将任务验证集成到全自主流水线中的优势与局限，特别是关于LLM生成有效LTL公式的能力，并展示了我们的实现如何应对和解决这些挑战。

英文摘要

Though robotic systems are now being commercialized and deployed in various industries, many of these systems are highly specialized and often require an advanced skill set to operate and ensure they perform as instructed. To mitigate this problem, we recently introduced a mission planner leveraging LLMs to synthesize mission plans in precision agriculture based on mission descriptions provided in natural language. While the system demonstrates impressive performance, it also suffers from the inherent ambiguities of natural language. In this paper, we extend our system to address this issue by introducing multiple feedback loops in the planning architecture that leverage linear temporal logic (LTL) to ensure the mission planning system meets the specifications formulated by the user while still using natural language. To mitigate potential bias, this is achieved by using two different commercial LLMs in charge of the specification and verification subtasks. Through extensive experiments, we highlight the strengths and limitations of integrating mission verification into a fully autonomous pipeline, particularly regarding an LLM's ability to generate valuable LTL formulas, and show how our proposed implementation addresses and solves these challenges.

URL PDF HTML ☆

赞 0 踩 0

2606.18312 2026-06-18 cs.CR cs.DC cs.LG 新提交 60%

TIGER: Inverting Transformer Gradients via Embedding-Subspace Distance Optimization

TIGER：通过嵌入子空间距离优化反转Transformer梯度

William Kalikman, Ivo Petrov, Dimitar I. Dimitrov, Martin Vechev

发表机构 * ETH Zürich（苏黎世联邦理工学院）； INSAIT, Sofia University "St. Kliment Ohridski"（索菲亚大学"圣克莱门特·奥赫里茨基"）

专题命中其他LLM ：提出Transformer梯度反转攻击TIGER。

AI总结提出TIGER攻击，通过将子空间信号转化为可微目标，直接优化令牌嵌入以最小化到子空间的距离，在编码器模型上提升重建质量和速度，在解码器模型上增强对差分隐私的鲁棒性。

Comments 16 pages, 13 pages main text,

详情

AI中文摘要

联邦学习允许多个客户端通过向中央服务器发送梯度更新来联合训练共享模型，同时保持原始输入在本地。然而，先前的梯度反转攻击表明，这些更新可以泄露足够的信息来重建客户端输入。现有的针对Transformer的攻击要么优化虚拟输入以匹配真实的客户端更新，这对于现代模型来说成本高昂且不稳定；要么利用注意力梯度的低秩性来识别包含真实层嵌入的子空间，然后对候选令牌进行离散成员测试。然而，这种令牌测试在数值噪声（例如来自量化或差分隐私）下很脆弱，并且对于具有非因果注意力的编码器模型扩展性差。我们引入了TIGER，一种连续的梯度反转攻击，它将这种子空间信号转化为可微目标。TIGER不是搜索令牌或匹配完整梯度，而是直接优化令牌嵌入以最小化它们到子空间的距离。我们的实验表明，在仅编码器模型上，TIGER在重建质量和运行时间上均显著优于现有攻击；而在解码器模型上，TIGER比先前基于子空间的攻击更鲁棒，从而在受差分隐私保护的联邦学习设置中实现了首次成功的重建。

英文摘要

Federated learning allows multiple clients to jointly train a shared model by sending gradient updates to a central server while keeping raw inputs local. However, prior gradient inversion attacks show that these updates can reveal enough information to reconstruct client inputs. Existing attacks on transformers either optimize dummy inputs to match the true client updates, which is costly and unstable for modern models, or exploit the low rank of attention gradients to identify a subspace containing the true layer embeddings, followed by a discrete membership test for candidate tokens. However, this token test is brittle under numerical noise, i.e., from quantization or Differential Privacy (DP), and scales poorly for encoder models with non-causal attention. We introduce TIGER, a continuous gradient inversion attack that turns this subspace signal into a differentiable objective. Instead of searching over tokens or matching full gradients, TIGER directly optimizes token embeddings to minimize their distance to the subspace. Our experiments demonstrate that on encoder-only models, TIGER substantially improves both reconstruction quality and runtime over existing attacks, while on decoder models, TIGER is more robust than prior subspace-based attacks, enabling the first successful reconstructions in DP-defended federated learning settings.

URL PDF HTML ☆

赞 0 踩 0

2606.16214 2026-06-18 cs.LG cs.AI 新提交 60%

Calibrated Sampling-Free Uncertainty Estimation in Bayesian Deep Learning

贝叶斯深度学习中的校准无采样不确定性估计

Tobias Jan Wieczorek, Leon de Andrade, Thomas Möllenhoff, Marcus Rohrbach

发表机构 * TU Darmstadt & hessian.AI, Darmstadt, Germany（达姆施塔特工业大学 & hessian.AI，德国达姆施塔特）； RIKEN Center for Advanced Intelligence Project, Tokyo, Japan（日本理化学研究所革新智能研究中心，日本东京）

专题命中其他LLM ：贝叶斯深度学习不确定性估计，可应用于LLM

AI总结提出校准方差传播（CVP），通过新型归一化层传播方法、激活函数处理技术及轻量校准步骤，在单次前向传播中高效估计不确定性，在Transformer和CNN上达到与MC采样相当的精度，成本显著降低。

详情

AI中文摘要

现代深度学习模型仍然以过度自信而闻名，限制了它们在高风险应用中的可靠性。贝叶斯方法通过学习模型参数的分布来应对这一问题，最近的进展使得在大规模架构上以与AdamW相当的成本实现这一目标成为可能。然而，测试时仍存在一个挑战：预测必须对从后验中采样的权重进行多次前向传播的平均，这代价高昂。方差传播提供了一种高效的替代方案，在单次前向传播中计算每层不确定性的解析近似。虽然此类技术对MLP有效，但由于现代架构的深度增加和层类型多样性，其扩展仍然具有挑战性。为填补这一空白，我们提出了校准方差传播（CVP），它引入了一种新的归一化层传播方法，结合了处理激活函数的近期技术，并通过轻量校准步骤吸收残差误差。CVP在Transformer和CNN上产生与MC采样相当准确的不确定性估计，而成本仅为极小部分。与先前的方差传播工作相比，CVP在BEiT-3上对视觉推理（NLVR2）的$0.5\%$风险覆盖率从$8.2\%$提高到$14.6\%$，在ViLT上对VQAv2从$2.6\%$提高到$10.8\%$，且增益扩展到卷积架构。

英文摘要

Modern deep learning models remain notoriously prone to overconfidence, limiting their reliability in high-stakes applications. Bayesian methods aim to counter this by learning a distribution over model parameters, and recent advances now make this feasible for large-scale architectures at costs comparable to AdamW. However, a challenge remains at test time: predictions must be averaged across many forward passes with weights sampled from the posterior, which is prohibitively expensive. Variance propagation offers an efficient alternative, computing layer-wise analytical approximations of uncertainty in a single forward pass. While such techniques are effective for MLPs, their extension to modern architectures remains challenging, due to increased depth and diversity of layer types. To fill this gap, we propose Calibrated Variance Propagation (CVP), which introduces a new propagation method for normalization layers, combines it with recent techniques for handling activation functions, and absorbs residual error through a light calibration step. CVP yields comparably accurate uncertainty estimates to MC sampling across transformers and CNNs, at a fraction of the cost. Against prior variance propagation work, CVP improves coverage at $0.5\%$ risk from $8.2\%$ to $14.6\%$ with BEiT-3 on Visual Reasoning (NLVR2) and from $2.6\%$ to $10.8\%$ with ViLT on VQAv2, with gains extending to convolutional architectures.

URL PDF HTML ☆

赞 0 踩 0

2605.26903 2026-06-18 cs.CR cs.AI 版本更新 60%

Practical Anonymous Two-Party Gradient Boosting Decision Tree

实用的匿名两方梯度提升决策树

Chenyu Huang, Fan Zhang, Minxin Du, Sherman S. M. Chow, Huangxun Chen, Huaming Rao, Danqing Huang, Bo Qian, Peng Chen

发表机构 * Tencent（腾讯）； Hong Kong Polytechnic University（香港理工大学）； Chinese University of Hong Kong（香港中文大学）； HKUST-GZ

专题命中其他LLM ：梯度提升决策树安全训练，非LLM但涉及AI安全

AI总结针对两方垂直分割数据上的梯度提升决策树训练，提出一种基于双电路隐私集合求交和遗忘可编程伪随机函数的匿名协议，在隐藏记录标识符的同时保持效率。

Comments 19 pages; 2026 IEEE Symposium on Security and Privacy (SP)

Journal ref 2026 IEEE Symposium on Security and Privacy (SP)

详情

DOI: 10.1109/SP63933.2026.00084

AI中文摘要

梯度提升决策树（GBDT）擅长处理结构化数据，通常用于在互不信任的各方之间垂直分割的特征上进行训练。高速和可解释性使得GBDT在金融和医疗领域广受欢迎，而神经网络在这些领域可能表现不佳。为GBDT启用安全计算带来了独特的挑战，需要安全的记录对齐以进行比较。依赖隐私集合求交（PSI）是一种事实上的方法。将PSI误认为是安全措施实际上会暴露数据集中哪些记录标识符（ID）是共享的。尽管电路PSI可以提供帮助，但对于通用用途来说成本高昂。需要新的思路来在“黑暗森林”中高效训练。为了隐藏ID，我们启动了对两方持有的分割数据上的匿名GBDT训练的研究。我们设计中的双电路PSI让双方交替作为接收者，对本地特征执行“选取后求和”。通过遗忘可编程伪随机函数，我们将电路PSI的输出作为共享状态在运行之间传播。避免通用对齐，我们解决了被忽视的困境：隐藏ID会带来与域大小成比例的成本。接下来，我们将用于将单指令多数据同态加密从（环）学习误差转换的密文打包成本减半，相比之前的安全GBDT（Usenix Security' 23）和相关安全机器学习计算。对比实验表明，我们的协议在效率上与有泄漏的方法相比仍具有竞争力。通过启用隐藏ID的聚合，我们的技术可以扩展到其他垂直分割的分析场景。

英文摘要

Structured data is well handled by gradient-boosted decision trees (GBDT), which are usually trained on vertically partitioned features across mutually distrustful parties. High speed and interpretability make GBDTs popular in finance and healthcare, where neural networks may fall short. Enabling secure computation for GBDTs poses unique challenges, requiring secure record alignment for comparison. Relying on private set intersection (PSI) is a de facto approach. Mistaking PSI for a safety measure actually exposes which record identifiers (IDs) are shared between the datasets. Although circuit-PSI could help, it is costly for generic uses. New ideas are needed to efficiently train in a "dark forest". Aiming to hide the IDs, we initiate the study of anonymous GBDT training on split data held by two parties. Dual circuit-PSI in our design lets the parties alternate as receiver to run pick-then-sum over local features. Via oblivious programmable pseudorandom functions, we propagate circuit-PSI outputs as shared state across runs. Avoiding universal alignment, we resolve the neglected dilemma that ID hiding incurs a cost that scales with domain size. Next, we halve the cost of ciphertext packing used to convert single-instruction multiple-data homomorphic encryption from (ring) learning with errors in prior secure GBDT (Usenix Security' 23) and related secure machine-learning computations. Comparative experiments show our protocol remains competitive with leaky approaches in efficiency. Enabling ID-hiding aggregation, our techniques can extend to other vertically partitioned analytics.

URL PDF HTML ☆

赞 0 踩 0

2605.07036 2026-06-18 physics.ed-ph 版本更新 60%

Using Large Language Models to Analyze Engagement in Computational Thinking via Computational Physics Essays

使用大型语言模型通过计算物理论文分析计算思维中的参与度

Sean Savage, Amir Bralin, Paul Hur, N. Sanjay Rebello

专题命中其他LLM ：利用LLM自动评估学生计算物理论文中的计算思维。

AI总结本研究利用多模态大型语言模型自动评估100篇学生计算物理论文中的计算思维参与度，在明确子任务上达到84%的准确率，但主观整体质量评估准确率仅71%。

Comments 13 pages, 3 figures, 3 tables. Submitted to Physical Review Physics Education Research

详情

AI中文摘要

随着计算思维（CT）在物理教育中日益重要，对真实、基于项目的评估的需求也在增长。虽然开放式多模态作业（如计算物理论文，CPEs）有助于捕捉学生的推理并鼓励主动学习，但它们引入了显著的评估瓶颈。手动根据复杂的计算实践分类法对这些复杂笔记本进行评分是资源密集型的，并限制了大规模课程的扩展性。在本研究中，我们调查了使用多模态大型语言模型（LLM）自动评估100篇学生生成的CPEs的可行性。使用人工编码的基线，我们系统评估了模型在20个不同的CT子实践和整体质量评分中检测学生参与度的能力。结果表明，LLM在明确定义的任务上表现非常好，在二元子实践上达到了84%的精确一致率。然而，更主观的构念被证明具有挑战性，模型在整体质量分析中仅达到71%的一致率。我们的发现表明，虽然LLM可以可靠地自动化检测特定的计算实践，但主观评估仍然是一个障碍。

英文摘要

As computational thinking (CT) becomes increasingly important to physics education, the need for authentic, project-based assessments has grown. While open-ended multimodal assignments, such as Computational Physics Essays (CPEs), help capture student reasoning and encourage active learning, they introduce a significant evaluation bottleneck. Manually grading these complex notebooks across a complex taxonomy of computational practices is resource-intensive and limits scalability in large-enrollment courses. In this study, we investigated the viability of using a multimodal Large Language Model (LLM) to automate the evaluation of 100 student-generated CPEs. Using a human-coded baseline, we systematically evaluated the model's capacity to detect student engagement across 20 distinct CT sub-practices and a holistic overall quality score. The results showed that the LLM performs very well on clearly defined tasks, achieving an 84% exact agreement with human raters on the binary sub-practices. However, more subjective constructs proved challenging, with the model reaching only a 71% agreement for the holistic quality analysis. Our findings demonstrated that while LLMs can reliably automate the detection of specific computational practices, subjective evaluation remains a hurdle.

URL PDF HTML ☆

赞 0 踩 0

2604.04342 2026-06-18 cs.LG stat.ML 版本更新 60%

Generative models for decision-making under distributional shift

分布偏移下决策的生成模型

Xiuyuan Cheng, Yunqin Zhu, Yao Xie

发表机构 * Department of Mathematics, Duke University（杜克大学数学系）； H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology（佐治亚理工学院H. Milton Stewart工业与系统工程学院）

专题命中其他LLM ：生成模型用于决策，与LLM弱相关

AI总结本文提出基于流和分数生成模型的统一框架，通过传输映射、速度场等工具处理分布偏移下的决策问题，实现鲁棒性、条件分布生成及不确定性量化。

Comments INFORMS TutORials in Operations Research, 2026

详情

AI中文摘要

许多数据驱动的决策问题使用从历史数据估计的名义分布来制定，而性能最终由可能发生偏移、依赖于上下文、部分观测或由压力引起的部署分布决定。本教程介绍了现代生成模型，特别是基于流和分数的方法，作为构建决策相关分布的数学工具。从运筹学的角度来看，它们的主要价值不在于无约束的样本合成，而在于通过传输映射、速度场、分数场和引导随机动力学来表示和变换分布。我们提出了一个基于前推映射、连续性、Fokker-Planck方程、Wasserstein几何和概率空间优化的统一框架。在此框架内，生成模型可用于学习名义不确定性、构建用于鲁棒性的受压或最不利分布，以及在侧信息和部分观测下生成条件或后验分布。我们还强调了代表性的理论保证，包括迭代流模型的前向-反向收敛、传输映射空间中的一阶极小极大分析，以及具有生成先验的后验采样的误差传递界。本教程为在分布偏移下使用生成模型进行场景生成、鲁棒决策、不确定性量化及相关问题提供了原则性的介绍。

英文摘要

Many data-driven decision problems are formulated using a nominal distribution estimated from historical data, while performance is ultimately determined by a deployment distribution that may be shifted, context-dependent, partially observed, or stress-induced. This tutorial presents modern generative models, particularly flow- and score-based methods, as mathematical tools for constructing decision-relevant distributions. From an operations research perspective, their primary value lies not in unconstrained sample synthesis but in representing and transforming distributions through transport maps, velocity fields, score fields, and guided stochastic dynamics. We present a unified framework based on pushforward maps, continuity, Fokker-Planck equations, Wasserstein geometry, and optimization in probability space. Within this framework, generative models can be used to learn nominal uncertainty, construct stressed or least-favorable distributions for robustness, and produce conditional or posterior distributions under side information and partial observation. We also highlight representative theoretical guarantees, including forward-reverse convergence for iterative flow models, first-order minimax analysis in transport-map space, and error-transfer bounds for posterior sampling with generative priors. The tutorial provides a principled introduction to using generative models for scenario generation, robust decision-making, uncertainty quantification, and related problems under distributional shift.

URL PDF HTML ☆

赞 0 踩 0

2602.17187 2026-06-18 stat.ML cs.LG 版本更新 60%

Anti-causal domain generalization: Leveraging unlabeled data

反因果域泛化：利用无标签数据

Sorawit Saengkyongam, Juan L. Gamella, Andrew C. Miller, Jonas Peters, Nicolai Meinshausen, Christina Heinze-Deml

发表机构 * Apple（苹果公司）； ETH Zürich（苏黎世联邦理工学院）

专题命中其他LLM ：域泛化方法，可应用于LLM但非核心

AI总结针对反因果设置下的域泛化问题，提出利用无标签数据估计环境扰动方向，通过惩罚模型对协变量均值和协方差变化的敏感性实现鲁棒性，并提供最坏情况最优性保证。

Comments Accepted at the International Conference on Machine Learning (ICML) 2026

详情

AI中文摘要

域泛化问题关注的是学习在部署到新的、未见过的环境时对分布变化具有鲁棒性的预测模型。现有方法通常需要来自多个训练环境的标记数据，这在标记数据稀缺时限制了它们的适用性。在这项工作中，我们研究了反因果设置下的域泛化，其中结果导致观察到的协变量。在这种结构下，影响协变量的环境扰动不会传播到结果，这促使我们对模型对这些扰动的敏感性进行正则化。关键在于，估计这些扰动方向不需要标签，使我们能够利用来自多个环境的无标签数据。我们提出了两种方法，分别惩罚模型对跨环境协变量均值和协方差变化的敏感性，并证明这些方法在特定环境类别下具有最坏情况最优性保证。最后，我们在一个受控物理系统和一个生理信号数据集上展示了我们方法的实证性能。

英文摘要

The problem of domain generalization concerns learning predictive models that are robust to distribution shifts when deployed in new, previously unseen environments. Existing methods typically require labeled data from multiple training environments, limiting their applicability when labeled data are scarce. In this work, we study domain generalization in an anti-causal setting, where the outcome causes the observed covariates. Under this structure, environment perturbations that affect the covariates do not propagate to the outcome, which motivates regularizing the model's sensitivity to these perturbations. Crucially, estimating these perturbation directions does not require labels, enabling us to leverage unlabeled data from multiple environments. We propose two methods that penalize the model's sensitivity to variations in the mean and covariance of the covariates across environments, respectively, and prove that these methods have worst-case optimality guarantees under certain classes of environments. Finally, we demonstrate the empirical performance of our approach on a controlled physical system and a physiological signal dataset.

URL PDF HTML ☆

赞 0 踩 0

2602.14789 2026-06-18 cs.LG stat.ML 版本更新 60%

On the Stability of Nonlinear Dynamics in GD and SGD: Beyond Quadratic Potentials

关于GD和SGD中非线性动力学的稳定性：超越二次势能

Rotem Mulayoff, Sebastian U. Stich

发表机构 * CISPA Helmholtz Center for Information Security（CISPA赫尔姆霍兹信息安全中心）

专题命中其他LLM ：优化算法稳定性分析，与LLM训练相关但非核心

AI总结研究梯度下降和随机梯度下降中非线性项对动力学稳定性的影响，推导了多元设置下稳定振荡的精确条件，并发现SGD的稳定性由单个不稳定批次决定。

Comments Accepted to COLT 2026

详情

AI中文摘要

训练过程中迭代的动力稳定性在确定优化算法所获得的极小值方面起着关键作用。例如，梯度下降（GD）的稳定解对应于平坦极小值，而平坦极小值被认为具有有利特征。虽然先前的工作通常依赖线性化来确定稳定性，但线性化动力学是否忠实捕捉完整的非线性行为仍不清楚。最近的研究表明，GD可能在线性不稳定的极小值附近稳定振荡，并在步长衰减后收敛，这表明线性分析可能具有误导性。在这项工作中，我们明确研究了非线性项的影响。具体而言，我们在多元设置下推导了GD在极小值附近稳定振荡的精确准则。我们的条件依赖于高阶导数，推广了现有结果。将分析扩展到随机梯度下降（SGD），我们表明即使单个批次不稳定，非线性动力学也可能在期望上发散。这意味着稳定性可能由单个不稳定振荡的批次决定，而非线性分析所暗示的平均效应。最后，我们证明如果所有批次都是线性稳定的，则SGD的非线性动力学在期望上是稳定的。

英文摘要

The dynamical stability of the iterates during training plays a key role in determining the minima obtained by optimization algorithms. For example, stable solutions of gradient descent (GD) correspond to flat minima, which have been associated with favorable features. While prior work often relies on linearization to determine stability, it remains unclear whether linearized dynamics faithfully capture the full nonlinear behavior. Recent work has shown that GD may stably oscillate near a linearly unstable minimum and still converge once the step size decays, indicating that linear analysis can be misleading. In this work, we explicitly study the effect of nonlinear terms. Specifically, we derive an exact criterion for stable oscillations of GD near minima in the multivariate setting. Our condition depends on high-order derivatives, generalizing existing results. Extending the analysis to stochastic gradient descent (SGD), we show that nonlinear dynamics can diverge in expectation even if a single batch is unstable. This implies that stability can be dictated by a single batch that oscillates unstably, rather than an average effect, as linear analysis suggests. Finally, we prove that if all batches are linearly stable, the nonlinear dynamics of SGD are stable in expectation.

URL PDF HTML ☆

赞 0 踩 0

2602.11557 2026-06-18 cs.LG stat.ML 交叉投稿 60%

The Implicit Bias of Steepest Descent with Mini-batch Stochastic Gradient

小批量随机梯度下降的隐式偏差

Jichu Li, Xuan Tang, Difan Zou

专题命中其他LLM ：小批量随机梯度下降隐式偏差理论

AI总结研究小批量随机最陡下降在多类分类中的隐式偏差，揭示批大小、动量和方差缩减对最大间隔行为和收敛率的影响，并证明动量可实现小批量收敛，方差缩减可恢复全批量隐式偏差。

详情

AI中文摘要

多种广泛使用的优化方法，如SignSGD和Muon，可以被解释为在不同范数诱导几何下的最陡下降实例。在这项工作中，我们研究了多类分类中小批量随机最陡下降的隐式偏差，刻画了批大小、动量和方差缩减如何在一般逐项和Schatten-$p$范数下塑造极限最大间隔行为和收敛率。我们证明，在没有动量时，最坏情况下的收敛和成功分类只能通过全批量梯度保证。相反，动量通过批量-动量权衡使得小批量收敛到近似最大间隔解成为可能，尽管会减慢收敛速度。该方法提供了完全显式、与维度无关的收敛率，优于先前的结果。此外，我们证明方差缩减可以恢复任意批大小下的精确全批量隐式偏差，尽管收敛速度较慢。最后，我们进一步研究了无动量的单批量最陡下降，并通过一个具体数据示例揭示了其收敛到根本不同偏差的特性，这揭示了纯随机更新的一个关键局限性。总体而言，我们的统一分析阐明了随机优化何时与全批量行为一致，并为更深入地探索随机梯度最陡下降算法的训练行为铺平了道路。

英文摘要

A variety of widely used optimization methods like SignSGD and Muon can be interpreted as instances of steepest descent under different norm-induced geometries. In this work, we study the implicit bias of mini-batch stochastic steepest descent in multi-class classification, characterizing how batch size, momentum, and variance reduction shape the limiting max-margin behavior and convergence rates under general entry-wise and Schatten-$p$ norms. We show that, without momentum, worst-case convergence and successful classification can only be guaranteed with full-batch gradient. In contrast, momentum enables small-batch convergence to an approximate max-margin solution through a batch-momentum trade-off, though it slows convergence. This approach provides fully explicit, dimension-free rates that improve upon prior results. Moreover, we prove that variance reduction can recover the exact full-batch implicit bias for any batch size, albeit at a slower convergence rate. Finally, we further investigate the batch-size-one steepest descent without momentum, and reveal its convergence to a fundamentally different bias via a concrete data example, which reveals a key limitation of purely stochastic updates. Overall, our unified analysis clarifies when stochastic optimization aligns with full-batch behavior, and paves the way for perform deeper explorations of the training behavior of stochastic gradient steepest descent algorithms.

URL PDF HTML ☆

赞 0 踩 0

2602.09234 2026-06-18 cs.LG cs.AI 版本更新 60%

Do Neural Networks Lose Plasticity in a Gradually Changing World?

神经网络在渐变世界中会失去可塑性吗？

Tianhui Liu, Lili Mou

发表机构 * Dept. Computing Science \& Alberta Machine Intelligence Institute (Amii), University of Alberta ； Canada CIFAR AI Chair

专题命中其他LLM ：神经网络可塑性损失，持续学习

AI总结研究任务转换的突然性对神经网络可塑性损失的影响，通过输入/输出插值和任务采样模拟渐变环境，理论和实验表明可塑性损失严重程度与任务转换突然性密切相关，渐变环境下可显著减轻。

2601.23018 2026-06-18 cs.HC cs.AI cs.LG 交叉投稿 60%

Integrating Multi-Label Classification and Generative AI for Scalable Analysis of User Feedback

整合多标签分类与生成式AI实现用户反馈的可扩展分析

Sandra Loop, Erik Bertram, Sebastian Juhl, Martin Schrepp

发表机构 * SAP SE（SAP公司）； Hochschule Fresenius Heidelberg（弗赖辛大学海德堡分校）； University of Missouri（密苏里大学）

专题命中其他LLM ：使用生成式AI分析用户反馈，属于LLM应用。

AI总结提出结合监督多标签分类与生成式AI的方法，高效处理大量用户评论，自动分配主题标签并生成摘要，同时发现情感分析不能可靠反映产品满意度。

Comments 8 pages, 2 figures, submitted to Springer Nature

详情

AI中文摘要

在高度竞争的软件市场中，用户体验（UX）评估对于确保软件质量和促进产品长期成功至关重要。此类UX评估通常将标准化问卷的定量指标与通过开放式问题收集的定性反馈相结合。虽然开放式反馈为改进提供了有价值的见解，并有助于解释定量结果，但分析大量用户评论具有挑战性且耗时。在本文中，我们介绍了一家大型软件公司在长期UX测量项目中开发的技术，以高效处理和解释大量用户评论。为了提供收集到的评论的高层概述，我们采用监督机器学习方法，为每条评论分配有意义的预定义主题标签。此外，我们展示了如何利用生成式AI（GenAI）创建简洁且信息丰富的用户反馈摘要，促进向组织尤其是高层管理人员有效传达发现。最后，我们研究了用户评论中表达的情感是否可以作为整体产品满意度的指标。我们的结果表明，仅凭情感分析并不能可靠地反映用户满意度。相反，产品满意度需要在调查中明确评估，以衡量用户对产品的感知。

英文摘要

In highly competitive software markets, user experience (UX) evaluation is crucial for ensuring software quality and fostering long-term product success. Such UX evaluations typically combine quantitative metrics from standardized questionnaires with qualitative feedback collected through open-ended questions. While open-ended feedback offers valuable insights for improvement and helps explain quantitative results, analyzing large volumes of user comments is challenging and time-consuming. In this paper, we present techniques developed during a long-term UX measurement project at a major software company to efficiently process and interpret extensive volumes of user comments. To provide a high-level overview of the collected comments, we employ a supervised machine learning approach that assigns meaningful, pre-defined topic labels to each comment. Additionally, we demonstrate how generative AI (GenAI) can be leveraged to create concise and informative summaries of user feedback, facilitating effective communication of findings to the organization and especially upper management. Finally, we investigate whether the sentiment expressed in user comments can serve as an indicator for overall product satisfaction. Our results show that sentiment analysis alone does not reliably reflect user satisfaction. Instead, product satisfaction needs to be assessed explicitly in surveys to measure the user's perception of the product.

URL PDF HTML ☆

赞 0 踩 0

2502.02904 2026-06-18 cs.HC cs.CL q-bio.NC 版本更新 60%

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

ScholaWrite: 端到端学术写作过程数据集

Khanh Chi Le, Linghe Wang, Minhwa Lee, Ross Volkov, Luan Tuyen Chau, Dongyeop Kang

发表机构 * University of Minnesota（明尼苏达大学）

专题命中其他LLM ：数据集涉及LLM辅助写作，但非核心

AI总结提出ScholaWrite数据集，通过Chrome扩展记录Overleaf上的按键，捕捉从初稿到终稿的多月写作过程，包含5篇计算机科学预印本的近6.2万次文本修改及认知写作意图标注，揭示人类写作与LLM辅助之间的差距。

Comments Equal contribution: Khanh Chi Le, Linghe Wang, Minhwa Lee | project page: https://minnesotanlp.github.io/scholawrite/

详情

AI中文摘要

写作是一项认知要求高的活动，需要持续决策、高度依赖工作记忆，并在不同目标的任务之间频繁切换。为了构建与作者认知真正一致的写作助手，我们必须捕捉并解码作者将想法转化为最终文本背后的完整思维过程。我们提出了ScholaWrite，这是第一个端到端学术写作数据集，追踪从初稿到最终手稿的多月历程。我们贡献了三个关键进展：（1）一个Chrome扩展，可无干扰地记录Overleaf上的按键，从而能够收集真实、现场写作数据；（2）一个新颖的完整学术手稿语料库，附有认知写作意图的细粒度标注。该数据集包含基于LaTeX的五篇计算机科学预印本的编辑，捕捉了四个月内近6.2万次文本更改；（3）对学术写作微观动态的分析和见解，突出了人类写作过程与大型语言模型（LLM）在提供有意义帮助方面的当前能力之间的差距。ScholaWrite强调了捕获端到端写作数据以开发未来写作助手的重要性，这些助手支持而非取代科学家的认知工作。

英文摘要

Writing is a cognitively demanding activity that requires constant decision-making, heavy reliance on working memory, and frequent shifts between tasks of different goals. To build writing assistants that truly align with writers' cognition, we must capture and decode the complete thought process behind how writers transform ideas into final texts. We present ScholaWrite, the first dataset of end-to-end scholarly writing, tracing the multi-month journey from initial drafts to final manuscripts. We contribute three key advances: (1) a Chrome extension that unobtrusively records keystrokes on Overleaf, enabling the collection of realistic, in-situ writing data; (2) a novel corpus of full scholarly manuscripts, enriched with fine-grained annotations of cognitive writing intentions. The dataset includes \LaTeX-based edits from five computer science preprints, capturing nearly 62K text changes over four months; and (3) analyses and insights into the micro-dynamics of scholarly writing, highlighting gaps between human writing processes and the current capabilities of large language models (LLMs) in providing meaningful assistance. ScholaWrite underscores the value of capturing end-to-end writing data to develop future writing assistants that support, not replace, the cognitive work of scientists.

URL PDF HTML ☆

赞 0 踩 0

2502.17748 2026-06-18 cs.LG cs.CR 版本更新 60%

FinP: Fairness-in-Privacy in Federated Learning by Addressing Disparities in Privacy Risk

FinP：联邦学习中通过解决隐私风险差异实现隐私公平性

Tianyu Zhao, Mahmoud Srewa, Salma Elmalaki

发表机构 * University of California, Irvine（加州大学尔湾分校）

专题命中其他LLM ：联邦学习隐私，与LLM弱相关

AI总结针对联邦学习中隐私风险分布不均的问题，提出FinP框架，通过服务器端自适应聚合和客户端正则化技术，减轻源推理攻击风险，将隐私暴露差异降低57.14%，同时保持模型效用与基线相当。

Comments To appear in PoPETS 2026 Issue 4. Privacy Enhancing Technology Symposium (PETS) 2026

详情

AI中文摘要

联邦学习（FL）固有地缓解了大规模数据集中化风险；然而，其隐私保护并非均匀分布——使得脆弱个体不成比例地暴露于复杂的隐私攻击之下。关键的是，以人为中心的FL环境中的统计异质性常常导致隐私风险的不公平分布，尤其影响那些敏感属性或行为使其成为异常值的个体。为解决这一关键差距，我们引入了FinP，这是一个新颖的框架，旨在通过减轻客户端对源推理攻击（SIA）的过度脆弱性来形式化和实施隐私公平性。FinP实施了一种双管齐下的防御策略，同时解决隐私差异的症状和根本原因，确保没有一组客户端承担过度的隐私负担。它结合了服务器端自适应聚合机制（根据客户端的估计隐私风险动态加权其贡献）和客户端正则化技术（抑制导致独特数据记忆的局部过拟合）。在FEMNIST、人类活动识别（HAR）和CIFAR-10数据集上的广泛实证评估表明，FinP有效地将隐私公平性与主要任务效用对齐。值得注意的是，FinP成功减轻了SIA风险并减少了隐私暴露差异，证明了强大的隐私公平性保证无需牺牲模型效用。最终，FinP通过将脆弱性差异降低高达57.14%，同时将全局模型效用保持在标准联邦基线±1.75%的微小范围内，建立了公平的隐私保护。

英文摘要

Federated Learning (FL) inherently mitigates mass data centralization risks; however, its privacy protections are not equally distributed - leaving vulnerable individuals disproportionately exposed to sophisticated privacy attacks. Crucially, statistical heterogeneity in human-centric FL environments often results in an inequitable distribution of privacy risks, particularly affecting those whose sensitive attributes or behaviors make them outliers. To address this critical gap, we introduce FinP, a novel framework designed to formalize and enforce fairness-in-privacy by mitigating disproportionate client vulnerability to Source Inference Attacks (SIA). FinP operationalizes a two-pronged defense strategy that tackles both the symptoms and root causes of privacy disparity, ensuring that no group of clients bears an excessive privacy burden. It combines a server-side adaptive aggregation mechanism, which dynamically weights client contributions based on their estimated privacy risk, with a client-side regularization technique to curb localized overfitting that drives unique data memorization. Extensive empirical evaluations on FEMNIST, Human Activity Recognition (HAR), and CIFAR-10 datasets demonstrate that FinP effectively aligns privacy fairness with primary task utility. Notably, FinP successfully mitigates SIA risks and reduces disparities in privacy exposure, establishing that strong fairness-in-privacy guarantees need not compromise model utility. Ultimately, FinP establishes equitable privacy protections by reducing vulnerability disparities by up to 57.14%, while preserving global model utility within a marginal +/- 1.75% of standard federated baselines.

URL PDF HTML ☆

赞 0 踩 0

2505.23851 2026-06-18 cs.CL cs.AI cs.SC 版本更新 60%

ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark

ASyMOB：代数符号数学运算基准

Michael Shalyt, Rotem Elimelech, Ido Kaminer

发表机构 * MIT（麻省理工学院）； Technion - Israel Institute of Technology（技术学院-以色列理工学院）

专题命中其他LLM ：涉及大模型在符号数学上的表现评估

AI总结提出ASyMOB基准，包含35,368个符号数学问题，通过扰动测试揭示大模型在符号数学推理中的鲁棒性不足，并发现LLM与CAS的互补潜力。

Comments Published in ICML2026: https://icml.cc/virtual/2026/poster/63549 Code repository: https://github.com/RamanujanMachine/ASyMOB Complete benchmark dataset: https://huggingface.co/datasets/Shalyt/ASyMOB-Algebraic_Symbolic_Mathematical_Operations_Benchmark

详情

AI中文摘要

大型语言模型（LLM）越来越多地应用于符号数学，然而现有评估常常混淆模式记忆与真正推理。为弥补这一空白，我们提出\textbf{ASyMOB}，一个包含\textit{35,368}个经过验证的符号数学问题的高分辨率数据集，涵盖积分、极限、微分方程、级数和超几何函数。与以往基准不同，\textbf{ASyMOB}通过符号、数值和等价保持变换系统地扰动每个种子问题，从而实现对泛化能力的细粒度评估。我们的评估揭示了三个关键发现：（1）大多数模型的性能在微小扰动下崩溃，而顶级系统表现出明显的鲁棒性\textit{机制转变}；（2）集成代码工具稳定了性能，尤其对较弱模型；（3）我们识别出计算机代数系统（CAS）失败而LLM成功的例子，以及仅通过LLM-CAS混合方法解决的问题，突显了有前景的集成前沿。\textbf{ASyMOB}作为一个原则性诊断工具，用于衡量和加速构建可验证、可信赖的AI以促进科学发现。

英文摘要

Large language models (LLMs) are increasingly applied to symbolic mathematics, yet existing evaluations often conflate pattern memorization with genuine reasoning. To address this gap, we present ASyMOB, a high-resolution dataset of 35,368 validated symbolic math problems spanning integration, limits, differential equations, series, and hypergeometrics. Unlike prior benchmarks, ASyMOB systematically perturbs each seed problem using symbolic, numeric, and equivalence-preserving transformations, enabling a fine-grained assessment of generalization. Our evaluation reveals three key findings: (1) most models' performance collapses under minor perturbations, while top systems exhibit an apparent regime shift in robustness; (2) integrated code tools stabilize performance, particularly for weaker models; and (3) we identify examples where Computer Algebra Systems (CAS) fail while LLMs succeed, as well as problems solved only via a hybrid LLM-CAS approach, highlighting a promising integration frontier. ASyMOB serves as a principled diagnostic tool for measuring and accelerating progress toward building verifiable, trustworthy AI for scientific discovery.

URL PDF HTML ☆

赞 0 踩 0

2606.18918 2026-06-18 cs.LG cs.CC 新提交 55%

Some Complexity Results for Robustness Verification for Binarized Neural Networks

二值化神经网络鲁棒性验证的一些复杂性结果

Harshit Goyal, Sudakshina Dutta

发表机构 * Indian Institute of Technology Goa（印度理工学院Goa）

专题命中其他LLM ：二值化神经网络鲁棒性验证，非LLM

AI总结本文通过从布尔可满足性问题归约证明二值化神经网络的可满足性是NP完全的，并利用均匀遮挡导致的网络输出分段常数结构，提出多项式时间鲁棒性检查算法。