语言大模型 / LLM

2606.19379 2026-06-19 cs.LG cs.AI cs.CL 新提交 70%

How Linear Is a Transformer Feed-Forward Block? Per-Block Linear Recoverability Is Learned, Not Architectural

Transformer 前馈块有多线性？逐块线性可恢复性是学习得到的，而非架构决定的

Stuart Whipp

发表机构 * Independent Research（独立研究）

专题命中预训练：分析Transformer前馈块的线性度，与模型架构相关。

AI总结通过精确最小二乘线性近似，测量训练后 Transformer 各前馈块的线性可恢复性，发现其高度异质且非单调，是学习得到的属性而非架构决定，并可用于压缩和诊断。

Comments 14 pages, 5 figures

详情

AI中文摘要

Transformer 前馈网络（FFN）通常被视为非线性的计算存储单元，但训练后的 FFN 块实际非线性程度很少被测量。我们将每个 FFN 视为位置级的输入-输出映射，并将其分解为精确的最小二乘线性近似加上残差。闭式线性映射解释的留出方差定义了一个块的线性可恢复性（R^2_lin），这是一种无需优化器的线性度量。在 GPT-2、Pythia-160m 和 llama-160m 的所有十二个块中，R^2_lin 高度异质且随深度非单调变化，相邻块之间范围从近线性（>0.99）到强非线性（<0.3），且并非由激活函数决定：相同宽度的 GELU 模型 GPT-2 和 Pythia-160m 具有截然不同的轮廓，因此可恢复性是单个训练块的学习属性，而非架构属性。残差的低秩双线性探针仅恢复少量 R^2 点，且增益与残差非线性不相关：未恢复的计算不是单个位置级乘积，而是高阶或分布式结构。该测量还作为有针对性的压缩信号：可恢复块允许大的单层替换（GPT-2 的早期 FFN 参数减少 8 倍，困惑度增加 +0.77），而低可恢复性块标记了这不安全的情况。它还暴露了一个方法论陷阱：训练后的线性基线可能在病态条件的 Transformer 激活上严重欠收敛，因此我们报告了整个过程中精确的闭式最小二乘上限。

英文摘要

Transformer feed-forward networks (FFNs) are often treated as nonlinear stores of computation, yet how nonlinear a trained FFN block actually is has rarely been measured. We treat each FFN as a position-wise input-to-output map and split it into the exact least-squares linear approximation plus a residual. The held-out variance the closed-form linear map explains defines a block's linear recoverability (R^2_lin), an optimiser-free measure of its linearity. Across all twelve blocks of GPT-2, Pythia-160m, and llama-160m, R^2_lin is highly heterogeneous and non-monotone with depth, ranging from near-linear (>0.99) to strongly nonlinear (<0.3) between adjacent blocks, and is not set by the activation function: same-width GELU models GPT-2 and Pythia-160m have sharply different profiles, so recoverability is a learned property of individual trained blocks, not an architectural one. A low-rank bilinear probe of the residual recovers only a few points of R^2, with gain uncorrelated with residual nonlinearity: the unrecovered computation is not a single position-wise product but higher-order or distributed structure. The measurement also serves as a targeted compression signal: recoverable blocks admit large single-layer replacements (GPT-2's early FFN at 8x fewer parameters for +0.77 perplexity), while low-recoverability blocks flag where this is unsafe. It further exposes a methodological pitfall: trained linear baselines can badly under-converge on ill-conditioned transformer activations, so we report the exact closed-form least-squares ceiling throughout.

URL PDF HTML ☆

赞 0 踩 0

2606.19367 2026-06-19 cs.LG 新提交 70%

Weibull Weight-Scale Parameter Evolution under AdamW Training Dynamics

Weibull 权重尺度参数在 AdamW 训练动态下的演化

Tiexin Ding

发表机构 * Independent Researcher（独立研究员）

专题命中预训练：研究AdamW训练动态，以Pythia模型为例。

AI总结研究 AdamW 训练中 Weibull 权重尺度参数 λ 增长、过冲和松弛的原因，推导出三种力（对齐、注入、衰减）的分解，并在 Pythia-70M 模型上验证对齐力主导上升阶段，贡献 88-94%。

Comments 21 pages, 14 figures

详情

AI中文摘要

基于用于诊断变压器权重分布的双参数 Weibull 框架，我们研究了为什么在 AdamW 训练期间 Weibull 权重尺度参数 λ 会增长、过冲然后松弛。我们从 AdamW 更新中推导出平方权重范数的领先阶三力分解：一个对齐力，测量权重与自适应更新方向之间的相关性；一个注入力，来自自适应步长幅度；以及一个衰减力，来自解耦的权重衰减。在具有真实优化器矩的自训练 Pythia-70M 模型上，对齐力主导上升阶段，在四个随机种子中贡献了绝对力预算的 88-94%，并且对超权重移除具有鲁棒性。接近饱和时，对齐力和衰减力趋于平衡，解释了从权重尺度增长到松弛的转变。这些力动态直接控制 λ(t) 背后的平方范数分量；剩余的 RMS 到 Weibull 重建偏移是可测量的，并分解为桥接分量和积分分量，在密集采样区域总计约 5-6%。为了将分析扩展到无法获得优化器矩的真实模型，我们引入了一种样条位移方法，该方法从稀疏检查点以约 92-94% 的准确率恢复对齐力，大约是朴素两点基线的两倍。我们进一步观察到，在我们的实验中，λ(t) 的峰值随训练数据一致性而变化，这表明权重尺度增长存在数据依赖成分，我们将其留待后续对照研究。代码和数据可在 https://this URL 获取。

英文摘要

Building on a two-parameter Weibull framework for diagnosing transformer weight distributions, we study why the Weibull weight-scale parameter $λ$ grows, overshoots, and then relaxes during AdamW training. We derive a leading-order three-force decomposition of the squared weight norm from the AdamW update: an alignment force measuring the correlation between weights and the adaptive update direction, an injection force from adaptive step magnitude, and a decay force from decoupled weight decay. On self-trained Pythia-70M models with ground-truth optimizer moments, alignment dominates the rise phase, contributing 88-94% of the absolute force budget across four random seeds and remaining robust to super-weight removal. Near saturation, alignment and decay approach balance, explaining the transition from weight-scale growth to relaxation. These force dynamics directly govern the squared-norm component underlying $λ(t)$; the remaining RMS-to-Weibull reconstruction offset is measurable and decomposes into bridge and integration components, totaling approximately 5-6% in densely sampled regions. To extend the analysis to real models where optimizer moments are unavailable, we introduce a spline displacement method that recovers the alignment force from sparse checkpoints with approximately 92-94% accuracy, about twice the naive two-point baseline. We further observe that the peak value of $λ(t)$ varies with training-data coherence in our experiments, suggesting a data-dependent component of weight-scale growth that we leave to a controlled follow-up study. Code and data are available at https://github.com/tiexinding/NPM-Weibull-public.

URL PDF HTML ☆

赞 0 踩 0

2606.18812 2026-06-19 cs.LG cs.AI 新提交 70%

Reinforcement Learning Foundation Models Should Already Be A Thing

强化学习基础模型本应已经存在

Abdelrahman Zighem, Jill-Jênn Vie

发表机构 * École normale supérieure de Paris, PSL University, Paris, France（巴黎高等师范学院，PSL大学，法国巴黎）； Soda team, Inria Saclay, Palaiseau, France（Soda团队，法国国家信息与自动化研究所萨克雷中心，法国帕莱索）

专题命中其他LLM ：提出强化学习基础模型概念

AI总结提出通过合成MDP构建强化学习基础模型，利用固定大小的充分统计量使注意力架构适用，在线和离线实验均优于传统算法。

详情

AI中文摘要

语言和视觉的基础模型由互联网规模的数据驱动，而结构化领域（表格预测、时间序列预测、图学习、强化学习）则不然。替代方案是合成数据，它将负担从收集转移到先验设计。这种先验已经存在于许多结构化任务中：TabPFN及其后续工作通过一个在合成贝叶斯先验上预训练的Transformer解决表格分类问题。我们提出两点。\textbf{首先}，强化学习是明显的空白：采样一个合成MDP与采样一个合成表格数据集一样可行，然而没有上下文强化学习工作将先验设计作为主要目标。\textbf{其次}，MDP允许一个固定大小的充分统计量，独立于观察到的回合且形状为表格形式，这使得它们直接适用于用于表格基础模型的基于注意力的架构，只需将策略头替换监督目标。这些共同定义了强化学习基础模型的议程。作为概念验证，我们完全在合成MDP上训练一个模型，并表明，无需任务特定的调优，它就能在上下文中解决留出的表格基准，包括在线和离线：在线时，使用比UCB-VI和表格Q-learning少得多的回合；离线时，与VI-LCB竞争。

英文摘要

Foundation models for language and vision are powered by internet-scale data, while structured domains such as tabular prediction are powered by synthetic data. This substitute shifts the challenge from collection to prior design. Such priors already exist for many structured tasks: TabPFN and its successors solve tabular classification with a transformer pretrained on a synthetic Bayesian prior. We make two points. \textbf{First}, reinforcement learning is the conspicuous gap: sampling a synthetic MDP is as feasible as sampling a synthetic tabular dataset, yet no in-context RL work treats prior design as a primary objective. \textbf{Second}, MDPs admit a fixed-size sufficient statistic, independent of the episodes observed and tabular in shape, which makes them directly amenable to the attention-based architectures used for tabular foundation models, with a policy head replacing the supervised target. Together these define the agenda for an RL foundation model. As a proof of concept, we train a Graph Attention Network entirely on synthetic MDPs and show that, with no task-specific tuning, it solves held-out tabular benchmarks in context, both online and offline: online, in far fewer episodes than UCB-VI and tabular Q-learning, and offline, competitively with VI-LCB.

URL PDF HTML ☆

赞 0 踩 0

2606.14784 2026-06-19 cs.SD cs.LG eess.AS 新提交 70%

LLM-Based Synthetic Ground Truth Generation for Audio-Based Emotion Classification via In-Context Learning

基于上下文学习的音频情感分类的LLM合成真实标签生成

Qing Huang, Pooja Pol, Jianing Zhang

发表机构 * School of Business, Technical University of Applied Sciences Augsburg（应用技术大学阿沙芬堡商学院）； Data Science und Autonome Systeme Technologietransferzentrum (TTZ)（数据科学与自主系统技术转移中心（TTZ））

专题命中其他LLM ：LLM生成音频情感标签

AI总结提出利用大语言模型（LLM）和上下文学习（ICL）从多用户VR环境的流式语音数据中自动生成情感相关合成真实标签，解决团队协作状态标注难题。

Comments https://icaiit.org/paper.php?paper=14th_ICAIIT_2/3_9

详情

AI中文摘要

理解人类状态和交互动态是人机交互（HCI）的核心目标。随着交互范式变得更加沉浸，虚拟现实（VR）已成为研究协作工作的强大平台。在此类环境中，评估团队协作状态（包括团队表现和团队韧性）需要从多模态传感器数据（如语音信号）中连续可靠地推断潜在的团队级认知和情感状态。然而，由于传感器噪声、上下文变异性和稀疏的专家标注，为这些潜在状态生成真实标签仍然具有挑战性。传统的自我报告方法仅提供静态和延迟的测量，因此不足以捕捉连续语音数据中反映的动态团队过程。在这项工作中，我们提出了一种由大语言模型（LLM）驱动的、基于代理的推理工作流，用于从多用户VR环境中的流式语音数据自动生成情感相关的合成真实标签。利用LLM的泛化能力，我们使用上下文学习（ICL）和少量配对的音频样本及其对应转录的演示。ICL倾向于实现与模型微调相当的任务适应，同时避免了参数更新的计算开销。为了构建信息丰富且鲁棒的上下文提示，我们采用基于检索的选择策略，根据声学特征空间中的相似性动态识别相关的音频演示。

英文摘要

Understanding human states and interaction dynamics is a core goal of human-computer interaction (HCI). As interaction paradigms become more immersive, virtual reality (VR) has emerged as a powerful platform for studying collaborative work. In such settings, evaluating team collaboration states, including team performance and team resilience, requires continuous and reliable inference of latent team-level cognitive and affective states from multi-modal sensor data, such as speech signals. However, generating ground truth labels for these latent states remains challenging due to sensor-induced noise, contextual variability, and sparse expert annotations. Traditional self-reporting approaches provide only static and delayed measurements and are therefore insufficient for capturing dynamic team processes reflected in continuous speech data. In this work, we propose a large language model (LLM)-driven, agentic inference workflow for automated emotion-related synthetic ground truth generation from streaming speech data in multi-user VR environments. Leveraging the generalization capabilities of LLMs, we use In-Context Learning (ICL) with few-shot demonstrations of paired audio-based samples and their corresponding transcriptions. ICL tends to achieve task adaptation comparable to model fine-tuning while circumventing the computational overhead of parameter updates. To construct informative and robust in-context prompts, we adopt a retrieval-based selection strategy that dynamically identifies relevant audio demonstrations based on similarity in the acoustic feature space.

URL PDF HTML ☆

赞 0 踩 0

2606.10616 2026-06-19 cs.AI 新提交 70%

Learning What to Remember: Observability-Safe Memory Retention via Constrained Optimization for Long-Horizon Language Agents

学习记住什么：通过约束优化实现长时域语言代理的观测安全记忆保留

Qingcan Kang, Liu Mingyang, Shixiong Kai, Kaichao Liang, Tao Zhong, Mingxuan Yuan

发表机构 * Huawei Noah's Ark Lab（华为诺亚方舟实验室）； Department of Computer Science, City University of Hong Kong（香港城市大学计算机科学系）

专题命中其他LLM ：针对语言代理的上下文窗口限制提出方法

AI总结针对长时域语言代理的有限上下文窗口，提出OSL-MR框架，将记忆保留建模为约束随机优化问题，通过在线可观测特征与离线监督的严格分离学习查询条件化的证据价值，实验表明在严格预算下优于现有方法。

详情

AI中文摘要

长时域语言代理积累的观测、推理轨迹和检索事实会超出其有限的上下文窗口，使得记忆保留成为一个基本的资源分配问题。现有记忆系统通过启发式评分、检索优化或学习压缩来改进管理，但大多将保留视为局部决策问题，并未在现实观测约束下显式建模其长期后果。为填补这一空白，我们将记忆保留建模为一个约束随机优化问题，具有明确的预算可行性、证据效用以及延迟成本（包括遗漏惩罚、重新获取延迟和过时信息风险）。随后，我们提出OSL-MR（观测安全记忆保留学习），这是一个新颖的框架，强制执行在线可观测特征与离线可用监督（OAS）之间的严格分离。OSL-MR结合了一个从实现的证据监督中训练的证据学习器和一个混合评分启发式，该启发式既作为可部署的在线安全基线，又作为结构化的归纳先验用于学习。由此产生的策略直接从交互数据中学习查询条件化的证据价值，同时在同一观测约束下保持可部署性。在LOCOMO和LongMemEval上的实验表明，OSL-MR在严格记忆预算下持续优于基于最近性的方法、生成式代理风格评分和其他启发式基线。混合评分先验在保持召回率的同时进一步提高了精确度，敏感性分析表明其在广泛的成本配置下具有鲁棒性。

英文摘要

Long-horizon language agents accumulate observations, reasoning traces, and retrieved facts exceeding context windows, making memory retention a fundamental resource-allocation problem. Existing systems treat retention as local and do not model long-term consequences under observability constraints. To fill this gap, we formulate memory retention as a constrained stochastic optimization with budget feasibility, evidence utility, and delayed costs including miss, reacquisition, and stale penalties. We show this multi-step problem is NP-hard, making exact solution intractable. Moreover, deployment decisions must be made under partial observability. To address these challenges, we propose OSL-MR (Observability-Safe Learning for Memory Retention), a learning-augmented framework that enforces a strict separation between online-observable features and offline-available supervision. OSL-MR combines an evidence learner trained from realized evidence with a Mixed-Score heuristic that serves as a deployable online-safe baseline and an inductive prior. The policy learns query-conditioned evidence from interaction data and remains deployable under the same constraints. Experiments on LoCoMo and LongMemEval show OSL-MR outperforms recency-based, Generative Agents-style, and other heuristic baselines, especially under tight budgets. The Mixed-Score prior improves precision and recall, and sensitivity analysis shows robustness across cost settings. On small solvable instances, single-step optimization is insufficient to anticipate future demand shifts, while OSL-MR stays significantly closer to the dynamic-programming optimum, confirming the necessity of the sequential formulation and reinforcing our learning-guided approximation. These results establish constrained stochastic optimization and optimization-guided learning as a principled foundation for memory management in long-horizon agents.

URL PDF HTML ☆

赞 0 踩 0

2606.06971 2026-06-19 cs.MA cs.SI 版本更新 70%

Modeling U.S. Attitudes Toward China via an Event-Steered Multi-Agent Simulator

通过事件驱动的多智能体模拟器建模美国对华态度

Chenxu Zhu, Hantao Yao, Wu Liu, Junbo Guo, Yongdong Zhang

专题命中其他LLM ：基于LLM的多智能体模拟，驱动舆论演化

AI总结提出事件驱动多智能体模拟器（ES-MAS），利用CURE数据集和双流数据集成引擎（DSDIE）及新闻驱动动态交互模块（NDDI），模拟美国对华舆论的动态演化，实验表明优于现有模型。

详情

AI中文摘要

理解舆论的动态演化，如美国公众对中国的态度，对于评估地缘政治风险至关重要。然而，现有的基于LLM的多智能体模拟器主要依赖静态规则和固定数据集，限制了其捕捉现实世界中宏观层面舆论转变的动态、事件驱动特性的能力。为解决这一限制，我们提出了一种事件驱动的多智能体模拟器（ES-MAS），其中重大事件和日常新闻通过智能体之间的动态交互持续驱动舆论演化。我们首先构建了中美关系演化（CURE）数据集，涵盖2021年至2025年的20个季度，包括258个重大事件和超过14,000篇日常新闻文章，为建模舆论动态提供了全面的时间基础。基于CURE数据集，我们提出了双流数据集成引擎（DSDIE），该引擎通过宏观层面事件将模拟与历史时间线对齐，同时基于个体智能体画像和上下文信号实现个性化信息暴露。此外，我们设计了新闻驱动的动态交互（NDDI）模块，该模块自适应地将具有共同新闻兴趣的智能体分组到局部交互上下文中，促进自下而上的共识形成，同时降低孤立信息茧房的风险。在CURE数据集上的实验结果表明，ES-MAS在复现真实世界历史趋势方面显著优于现有模拟器，为建模动态舆论演化提供了一个可扩展且有效的框架。

英文摘要

Understanding the dynamic evolution of opinions, such as U.S. public attitudes toward China, is essential for assessing geopolitical risks. However, existing LLM-based multiagent simulators predominantly rely on static rules and fixed datasets, limiting their ability to capture the dynamic, event-driven nature of macro-level opinion shifts in real-world settings. To address this limitation, we propose an Event-Steered Multi-Agent Simulator (ES-MAS), in which significant events and daily news continuously drive opinion evolution through dynamic interactions among agents. We first construct the China-U.S. Relation Evolution (CURE) dataset, covering 20 quarters from 2021 to 2025, including 258 major events and over 14,000 daily news articles, and providing a comprehensive temporal foundation for modeling opinion dynamics. Building upon the CURE dataset, we propose a Dual-Stream Data Integration Engine (DSDIE) that aligns simulations with historical timelines via macro-level events while enabling personalized information exposure based on individual agent profiles and contextual signals. Furthermore, we design a News-Driven Dynamic Interaction (NDDI) module, which adaptively groups agents with shared news interests into localized interaction contexts, facilitating bottom-up consensus formation while mitigating the risk of isolated information cocoons. Experimental results on the CURE dataset demonstrate that ES-MAS substantially outperforms existing simulators in reproducing real-world historical trends, offering a scalable and effective framework for modeling dynamic opinion evolution.

URL PDF HTML ☆

赞 0 踩 0

2604.07593 2026-06-19 cs.AI 版本更新 70%

Too long; didn't solve

太长；没解决

Lucía M. Cabrera, Isaac Saxton-Knight, Jocelyn D'Arcy

发表机构 * Instituto Balseiro（巴塞罗那研究所）； Poindexter Labs（波因迪克斯实验室）

专题命中其他LLM ：提示长度与数学推理性能关系研究

AI总结研究提示长度和解答长度与大型语言模型在数学问题上的性能关系，发现两者与模型失败率正相关。

2604.01955 2026-06-19 cs.CY 版本更新 70%

Teaching Students to Question the Machine: An AI Literacy Intervention Improves Students' Regulation of LLM Use in a Science Task

教导学生质疑机器：一项AI素养干预措施提升学生在科学任务中调节LLM使用的能力

O. Clerc, R. Abdelghani, C. Desvaux, E. Poisson, P. Y. Oudeyer, H. Sauzéon

专题命中其他LLM ：AI素养干预提升学生LLM使用能力

AI总结本研究通过两小时的AI素养工作坊，训练中学生（8-9年级）在科学问题解决中更有效地使用大语言模型，减少盲目依赖并提高答案质量。

Comments Workshop paper accepted at ALIT4ALL 2026: 2nd International Workshop on AI Literacy Education For All, co-located with AIED 2026

详情

AI中文摘要

生成式人工智能（GenAI）在学校中的快速普及引发了人们对学生不加批判地依赖其输出的担忧。有效使用大语言模型（LLM）不仅需要技术知识，还需要监控、评估和调节与系统交互的能力，这些过程与元认知调节密切相关。这些技能在中学阶段仍在发展中，使得学生特别容易过度信任和过早接受AI输出。由于课堂时间和教师培训资源有限，迫切需要开发和评估可在现实学校条件下实施的AI素养干预措施。我们报告了一项受控的课堂研究，考察两小时的AI素养工作坊是否能改善学生在LLM支持的科学问题解决中的交互策略和最终答案质量。共有116名学生（8-9年级；13-15岁）使用生成式AI系统完成了六项科学调查任务。两天前，干预组参加了工作坊，该工作坊结合了关于LLM如何工作及失败的信息，以及关于提示和响应评估的实用指导；对照组未接受培训。受过训练的学生表现出更少的盲目依赖：他们更频繁地重新表述查询、提出后续问题，并更准确地判断响应正确性，从而获得更好的表现。相比之下，GenAI和元认知自我报告分数不能预测表现，这表明有效使用生成式AI较少依赖于自我报告测量，而更多依赖于交互调节的明确训练。总体而言，结果表明，简短、可扩展的AI素养教学可以显著改善中学生在校本学习活动中使用生成式AI的方式。

英文摘要

The rapid adoption of generative artificial intelligence (GenAI) in schools raises concerns about students' uncritical reliance on its outputs. Effective use of large language models (LLMs) requires not only technical knowledge but also the ability to monitor, evaluate, and regulate one's interaction with the system, processes closely tied to metacognitive regulation. These skills are still developing in middle school, making students particularly vulnerable to over-trust and premature acceptance of AI outputs. Because classroom time and teacher training resources are constrained, there is a pressing need to develop and evaluate AI literacy interventions that can be implemented under realistic school conditions. We report a controlled classroom study examining whether a two-hour AI literacy workshop improves students' interaction strategies and quality of final answers in LLM-supported science problem solving. A total of 116 students (grades 8-9; ages 13-15) completed six science investigation tasks using a generative AI system. Two days prior, the intervention group attended the workshop, which combined information about how LLMs work and fail with practical guidance on prompting and response evaluation; the control group received no training. Trained students showed less uncritical reliance on the system: they more often reformulated queries, asked follow-up questions, and more accurately judged response correctness, leading to better performance. In contrast, GenAI and metacognitive self-report scores did not predict performance, suggesting that effective use of generative AI depends less on self-reported measures and more on explicit training in interaction regulation. Overall, the results show that brief, scalable AI literacy instruction can meaningfully improve how middle-school students use generative AI in school-like learning activities.

URL PDF HTML ☆

赞 0 踩 0

2603.16941 2026-06-19 eess.AS cs.CL cs.SD 版本更新 70%

The Voice Behind the Words: Quantifying Intersectional Bias in SpeechLLMs

言语背后的声音：量化语音大语言模型中的交叉偏见

Shree Harsha Bokkahalli Satish, Christoph Minixhofer, Maria Teleki, James Caverlee, Ondřej Klejch, Peter Bell, Gustav Eje Henter, Éva Székely

发表机构 * 1 Department of Speech, Music ； Hearing, KTH Royal Institute of Technology, Sweden 2 Centre for Speech Technology Research, University of Edinburgh, UK 3 Texas A\&M University, USA

专题命中其他LLM ：语音大语言模型中的交叉偏见量化

AI总结本研究通过2880次受控交互，评估三种语音大语言模型在六种英语口音和两种性别呈现中的口音与性别交叉偏见，发现东欧口音（尤其女性）获得更低有用性评分，且人类评估者比LLM评判更敏感。

Comments 5 pages, 3 figures, 1 table, Accepted to Interspeech 2026

详情

AI中文摘要

语音大语言模型直接处理语音输入，保留了之前级联管道中去除的口音和感知性别等线索，这导致了依赖于说话者身份的反应差异。我们使用2880次受控交互（涵盖六种英语口音和两种性别呈现，通过语音克隆保持语言内容不变），对三种语音大语言模型中的口音和性别偏见进行了大规模交叉评估。通过逐点LLM评判评分、成对比较以及经过人工验证的最佳-最差缩放，我们检测到反复出现的定向差异。东欧口音的语音获得较低的有用性评分，尤其是女性呈现的语音。反应保持礼貌但在有用性上存在差异。虽然LLM评判捕捉到了这些偏见的定向趋势，但人类评估者表现出显著更高的敏感性，显示出更强的口音级别对比。

英文摘要

Speech Large Language Models (SpeechLLMs) process spoken input directly, retaining cues such as accent and perceived gender that were previously removed in cascaded pipelines. This introduces speaker identity dependent variation in responses. We present a large-scale intersectional evaluation of accent and gender bias in three SpeechLLMs using 2,880 controlled interactions across six English accents and two gender presentations, keeping linguistic content constant through voice cloning. Using pointwise LLM-judge ratings, pairwise comparisons, and Best-Worst Scaling with human validation, we detect recurring directional disparities. Eastern European-accented speech receives lower helpfulness scores, particularly for female-presenting voices. Responses remain polite but differ in helpfulness. While LLM judges capture the directional trend of these biases, human evaluators exhibit significantly higher sensitivity, showing stronger accent-level contrasts.

URL PDF HTML ☆

赞 0 踩 0

2603.16357 2026-06-19 cs.CY cs.SE 版本更新 70%

Beyond Grading Accuracy: Exploring Alignment of TAs and LLMs

超越评分准确性：探索助教与LLMs的一致性

Matthijs Jansen op de Haar, Nacir Bouali, Faizan Ahmed

专题命中其他LLM ：开源LLM用于UML类图评分评估

AI总结本文提出一个评估管道，通过定量研究92个UML类图，比较助教与六个开源LLMs在单个评分标准上的表现，发现开源LLMs在评分准确性上接近助教，为混合主动评分系统提供了可能。

Comments 7 pages, 3 figures

详情

AI中文摘要

在本文中，我们研究了开源大型语言模型（LLMs）在评分统一建模语言（UML）类图方面的潜力。与现有主要评估专有LLMs的工作不同，我们专注于非专有模型，使得我们的方法适用于对透明度和成本敏感的大学。此外，现有研究评估的是完整图表而非单个标准的性能，对自动评分与人类评估的一致性提供的见解有限。为解决这些差距，我们提出一个评分管道，其中学生生成的UML类图由助教（TAs）和LLMs独立评估，然后在单个标准级别比较评分。我们通过一项对软件设计课程中92个UML类图的定量研究来评估该管道，将助教评分与六个开源LLMs产生的评估进行比较。性能在单个标准上测量，突出LLMs与人类评分者存在差异的领域。我们的结果显示，每个标准的准确率高达88.56%，皮尔逊相关系数高达0.78，仅使用开源模型就比先前工作有显著改进。这些模型的性能接近助教，表明了一条通往混合主动评分系统的可能路径，其中助教在评分中得到辅助。我们的发现表明，开源LLMs可以通过明确识别与评分标准的一致性来有效支持UML类图评分。所提出的管道提供了一种实用方法，以应对随着学生人数增长而增加的工作量。

英文摘要

In this paper, we investigate the potential of open-source Large Language Models (LLMs) for grading Unified Modeling Language (UML) class diagrams. In contrast to existing work, which primarily evaluates proprietary LLMs, we focus on non-proprietary models, making our approach suitable for universities where transparency and cost are critical. Additionally, existing studies assess performance over complete diagrams rather than individual criteria, offering limited insight into how automated grading aligns with human evaluation. To address these gaps, we propose a grading pipeline in which student-generated UML class diagrams are independently evaluated by both teaching assistants (TAs) and LLMs. Grades are then compared at the level of individual criteria. We evaluate this pipeline through a quantitative study of 92 UML class diagrams from a software design course, comparing TA grades against assessments produced by six open-source LLMs. Performance is measured across individual criteria, highlighting areas where LLMs diverge from human graders. Our results show per-criterion accuracy of up to 88.56\% and a Pearson correlation coefficient of up to 0.78, representing a substantial improvement over previous work while using only open-source models. The models achieve performance close to that of a TA, suggesting a possible path toward a mixed-initiative grading system, where TAs are aided in their grading. Our findings demonstrate that open-source LLMs can effectively support UML class diagram grading by explicitly identifying alignment with grading criteria. The proposed pipeline provides a practical approach to managing increasing workloads with growing student counts.

URL PDF HTML ☆

赞 0 踩 0

2502.19193 2026-06-19 cs.SI cs.AI cs.NE 版本更新 70%

Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms

受监管社交媒体平台下的语言演化模拟：大语言模型与遗传算法的协同方法

Jinyu Cai, Yusei Ishimizu, Mingyue Zhang, Munan Li, Jialong Li, Kenji Tei

专题命中其他LLM ：用LLM模拟语言演化，结合遗传算法

AI总结提出基于大语言模型的多智能体框架，结合遗传算法模拟用户语言策略在监管下的迭代演化，实验表明对话轮次增加可提升信息传递准确性和对话持续性。

Comments The manuscript has been accepted to IEEE Transactions on Computational Social Systems

详情

AI中文摘要

社交媒体平台经常实施限制性政策来调节用户内容，从而催生出创造性的规避语言策略。本文提出了一个基于大语言模型（LLMs）的多智能体框架，用于模拟在监管约束下语言策略的迭代演化。在该框架中，参与者智能体作为社交媒体用户，不断演化其语言表达，而监管智能体通过评估政策违规来模拟平台级别的监管。为了实现更逼真的模拟，我们采用了语言策略的双重设计（约束和表达）来区分冲突目标，并利用LLM驱动的遗传算法（GA）进行语言策略的选择、变异和交叉。该框架使用两种不同的场景进行评估：一个抽象的密码游戏和一个逼真的模拟非法宠物交易场景。实验结果表明，随着对话轮次的增加，不间断对话轮次的数量和信息传输的准确性都显著提高。此外，一项包含40名参与者的用户研究验证了生成对话和策略的现实相关性。消融研究也验证了GA的重要性，强调了其对长期适应性和整体结果改善的贡献。

英文摘要

Social media platforms frequently impose restrictive policies to moderate user content, prompting the emergence of creative evasion language strategies. This paper presents a multi-agent framework based on Large Language Models (LLMs) to simulate the iterative evolution of language strategies under regulatory constraints. In this framework, participant agents, as social media users, continuously evolve their language expression, while supervisory agents emulate platform-level regulation by assessing policy violations. To achieve a more faithful simulation, we employ a dual design of language strategies (constraint and expression) to differentiate conflicting goals and utilize an LLM-driven GA (Genetic Algorithm) for the selection, mutation, and crossover of language strategies. The framework is evaluated using two distinct scenarios: an abstract password game and a realistic simulated illegal pet trade scenario. Experimental results demonstrate that as the number of dialogue rounds increases, both the number of uninterrupted dialogue turns and the accuracy of information transmission improve significantly. Furthermore, a user study with 40 participants validates the real-world relevance of the generated dialogues and strategies. Moreover, ablation studies validate the importance of the GA, emphasizing its contribution to long-term adaptability and improved overall results.

URL PDF HTML ☆

赞 0 踩 0

2606.20537 2026-06-19 cs.LG cs.DC 新提交 65%

Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving

执行状态胶囊：面向低延迟、小批量、设备端物理AI服务的图绑定执行状态检查点与恢复

Liang Su

发表机构 * GitHub

专题命中其他LLM ：面向LLM服务的执行状态检查点与恢复机制

AI总结针对低延迟、小批量、设备端物理AI服务场景，提出执行状态胶囊机制，通过图绑定检查点与恢复完整可恢复状态，在RTX 5090上实现亚毫秒级恢复，TTFT加速比达3.9倍至27倍。

Comments 27 pages, 9 figures

详情

AI中文摘要

主流LLM服务系统主要通过分页或基数键值（KV）缓存重用前缀工作。这对于高吞吐量、高并发服务非常有效，但它只管理执行状态的一个位置片段：KV缓存。我们研究相反的场景：低延迟、小批量、设备端物理AI服务，其中交互式LLM代理、语音系统和机器人策略在严格的响应预算下频繁分支、重置、中断和重新进入。我们引入执行状态胶囊，一种图绑定的检查点和恢复机制，用于在提交边界处保存完整的可恢复状态。FlashRT是一个白盒、后端内核运行时，其评估的NVIDIA CUDA后端在连续的静态缓冲区上运行捕获的图计划，无需块表间接寻址。由于活动状态是一组命名的封闭缓冲区，胶囊可以快照、恢复、分叉或回滚整个执行边界，包括KV、循环状态、卷积状态、MTP状态和元数据。这将重用从令牌寻址的KV片段转移到图绑定的执行状态边界。在RTX 5090上，胶囊恢复在存储状态级别是字节精确的，在贪婪解码下是令牌一致的。仅KV的消融实验出现分歧，表明循环状态是承载负载的。GPU驻留的快照和恢复是亚毫秒级的，TTFT相对于冷预填充的加速比从2k令牌时的3.9倍增长到16k令牌时的27倍。在Jetson AGX Thor和DGX Spark上，相同的正确性和结构属性成立。胶囊不是高吞吐量KV缓存服务的替代品；它们定义了一个互补的以延迟为先的服务点，用于显式执行状态重用。

英文摘要

Mainstream LLM serving systems reuse prefix work mainly through paged or radix key-value (KV) caches. This is highly effective for high-throughput, high-concurrency serving, but it manages only one positional fragment of execution state: the KV cache. We study the opposite regime: low-latency, small-batch, on-device physical-AI serving, where interactive LLM agents, speech systems, and robot policies repeatedly branch, reset, interrupt, and re-enter under tight responsiveness budgets. We introduce execution-state capsules, a graph-bound checkpoint and restore mechanism for the complete restorable state at a committed boundary. FlashRT is a white-box, backend-facing kernel runtime whose evaluated NVIDIA CUDA backend runs captured graph plans over contiguous static buffers with no block-table indirection. Because the live state is a closed set of named buffers, a capsule can snapshot, restore, fork, or roll back the whole execution boundary, including KV, recurrent state, convolution state, MTP state, and metadata. This moves reuse from token-addressed KV fragments to graph-bound execution-state boundaries. On an RTX 5090, capsule restore is byte-exact at the stored-state level and token-identical under greedy decode. A KV-only ablation diverges, showing that recurrent state is load-bearing. GPU-resident snapshot and restore are sub-millisecond, and TTFT speedup over cold prefill grows from 3.9x at 2k tokens to 27x at 16k tokens. On Jetson AGX Thor and DGX Spark, the same correctness and structural properties hold. Capsules are not a replacement for high-throughput KV-cache serving; they define a complementary latency-first serving point for explicit execution-state reuse.

URL PDF HTML ☆

赞 0 踩 0

2606.19850 2026-06-19 cs.LG cs.AI 新提交 65%

Neural Additive and Basis Models with Feature Selection and Interactions

具有特征选择和交互的神经加性模型与神经基础模型

Yasutoshi Kishimoto, Kota Yamanishi, Takuya Matsuda, Shinichi Shirakawa

发表机构 * Yokohama National University（横滨国立大学）

专题命中其他LLM ：提出在神经加性模型中引入特征选择机制，属于可解释机器学习方法，与LLM无直接关系。

AI总结提出在神经加性模型和神经基础模型中引入特征选择机制，通过特征选择层减少计算开销，并支持高维数据中的特征交互学习，性能优于或持平于现有GAM方法。

Comments Accepted at PAKDD 2024. Code is available at https://github.com/shiralab/NAM-FS

详情

DOI: 10.1007/978-981-97-2259-4_1

AI中文摘要

深度神经网络（DNN）在各个领域表现出色，但通常可解释性较低。神经加性模型（NAM）及其变体神经基础模型（NBM）在广义加性模型（GAM）中使用神经网络（NN）作为非线性形状函数。这两种模型具有高度可解释性，并且在NN训练中表现出良好的性能和灵活性。NAM和NBM基于GAM架构，可以提供并可视化每个特征对预测的贡献。然而，当使用双输入NN来考虑特征交互或将其应用于高维数据集时，由于所需计算资源的增加，训练NAM和NBM变得棘手。本文提出将特征选择机制融入NAM和NBM以解决计算瓶颈。我们在两种模型中引入特征选择层，并在训练过程中更新选择权重。我们的方法简单，与原始NAM和NBM相比，可以降低计算成本和模型大小。此外，它使我们即使在数据维度很高的情况下也能使用双输入NN并捕获特征交互。我们证明，所提出的模型与原始NAM和NBM相比计算效率更高，并且与最先进的GAM相比表现出更好或相当的性能。

英文摘要

Deep neural networks (DNNs) exhibit attractive performance in various fields but often suffer from low interpretability. The neural additive model (NAM) and its variant called the neural basis model (NBM) use neural networks (NNs) as nonlinear shape functions in generalized additive models (GAMs). Both models are highly interpretable and exhibit good performance and flexibility for NN training. NAM and NBM can provide and visualize the contribution of each feature to the prediction owing to GAM-based architectures. However, when using two-input NNs to consider feature interactions or when applying them to high-dimensional datasets, training NAM and NBM becomes intractable due to the increase in the computational resources required. This paper proposes incorporating the feature selection mechanism into NAM and NBM to resolve computational bottlenecks. We introduce the feature selection layer in both models and update the selection weights during training. Our method is simple and can reduce computational costs and model sizes compared to vanilla NAM and NBM. In addition, it enables us to use two-input NNs even in high-dimensional datasets and capture feature interactions. We demonstrate that the proposed models are computationally efficient compared to vanilla NAM and NBM, and they exhibit better or comparable performance with state-of-the-art GAMs.

URL PDF HTML ☆

赞 0 踩 0

2606.19819 2026-06-19 cs.CL cs.AI 新提交 65%

CREDENCE: Claim Reduction for Decomposition & Enhanced Credibility -- Semantic Metrics and Convergence Analysis

CREDENCE: 面向分解与增强可信度的声明缩减——语义度量与收敛性分析

Phuong Huu Vu Tran, Thuan Duc Mai, Bach Xuan Le

发表机构 * Vietnamese-German University（越南德国大学）； Ho Chi Minh University of Technology（胡志明市理工大学）

专题命中其他LLM ：声明分解和修复框架，用于事实核查，与LLM应用相关。

AI总结提出CREDENCE框架，通过语义F1度量解决Jaccard度量对释义声明的低估问题，并形式化分析修复管道的收敛性，实验表明语义F1比Jaccard F1提升15-32个百分点，规则修复将原子性违反率降低47-100%。

Comments 40 pages, 6 figures, 19 tables. Submitted to Language Resources and Evaluation

详情

AI中文摘要

将复合句分解为原子化的、可验证的声明是可靠自动化事实核查的前提。先前工作依赖基于词重叠（Jaccard）的度量，系统性地低估了释义声明的分解质量，并且缺乏对修复循环的形式化终止分析。我们提出CREDENCE，一个改进的声明分解与评估框架，解决了这两个缺陷。我们的贡献包括：(1) 语义F1：我们使用BGE-large余弦相似度保真度度量，解决了Jaccard的惩罚问题，并提高了下游事实核查的准确性；(2) 收敛定理：我们形式化地表征了修复管道的四个性质，确立了在预言解析器假设下基于规则的修复是单调且有限终止的；基于LLM的自修复被证明是非单调的，需要早期退出保护；(3) 三个评估基准，涵盖社交媒体、百科全书和新闻领域，用于跨领域泛化度量；(4) 跨四个分解器模型（3.8B-12B）和一个封闭API模型的多模型基准测试。在SocialClaimSplit、WikiSplitBench和ClaimDecompBench上的实验表明，语义F1比Jaccard F1提升15-32个百分点。在SocialClaimSplit和WikiSplitBench上，EPR范围为0.94至1.00，而ClaimDecompBench由于更难的新闻领域构造，包含较低的基线EPR情况（低至0.824），规则修复相对于基线模型将原子性违反率（AVR）降低了47-100%，且不降低保真度。

英文摘要

Decomposing compound sentences into atomic, verifiable claims is a prerequisite for reliable automated fact-checking. Prior work has relied on token-overlap (Jaccard) metrics that systematically underestimate decomposition quality for paraphrastic claims, and has lacked formal termination analysis for the repair loop. We present Credence, a revised claim decomposition and evaluation framework addressing both shortcomings. Our contributions are: (1) Semantic-F1: we use BGE-large cosine similarity fidelity metric that resolves Jaccard's penalisation and improves downstream fact-checking accuracy; (2) Convergence theorems: we formally characterise four properties of the repair pipeline, establishing that rule-based repair is monotone and finitely terminating under an oracle parser assumption; LLM-based self-repair is provably non-monotone and requires an early-exit guard; (3) Three evaluation benchmarks spanning social-media, encyclopaedic, and news domains for cross-domain generalisation measurement; (4) Multi-model benchmarking across four decomposer models (3.8B-12B) and a closed API model. Experiments on SocialClaimSplit, WikiSplitBench, and ClaimDecompBench show that Semantic-F1 outperforms Jaccard-F1 by +15-32pp. EPR ranges from 0.94 to 1.00 on SocialClaimSplit and WikiSplitBench, while ClaimDecompBench includes lower base EPR cases (down to 0.824) due to harder news-domain constructions, and rule-repair reduces the Atomicity Violation Rate (AVR) by 47-100% relative to the base model without degrading fidelity.

URL PDF HTML ☆

赞 0 踩 0

2606.19721 2026-06-19 cs.LG cs.AI 新提交 65%

OnDeFog: Online Decision Transformer under Frame Dropping

OnDeFog：帧丢失下的在线决策变压器

Daiki Yotsufuji, Kenta Nishihara, Shoma Shimizu, Kento Uchida, Shinichi Shirakawa

发表机构 * Yokohama National University（横滨国立大学）

专题命中其他LLM ：提出在线决策变压器处理帧丢失问题。

AI总结针对帧丢失导致性能下降的问题，提出OnDeFog，将DeFog机制与在线决策变压器结合，通过直接环境交互学习策略，在高丢帧率环境下优于ODT，在低奖励数据集上优于DeFog。

Comments Accepted to PRICAI 2025

详情

DOI: 10.1007/978-981-95-7072-0_10

AI中文摘要

在具有挑战性的现实世界强化学习应用中，通信延迟或传感器故障经常导致帧丢失，此时智能体无法接收丢失的状态及相关奖励。为了解决帧丢失导致的性能下降问题，通过将额外机制引入决策变压器以处理帧丢失，开发了随机帧丢失下的决策变压器（DeFog）。尽管DeFog可以缓解帧丢失环境中的性能下降，但由于DeFog是一种离线学习方法，它难以有效泛化到训练数据集中未充分表示的新状态。在本研究中，我们提出OnDeFog，它将DeFog中的机制与在线决策变压器（ODT）相结合，ODT是一种通过直接环境交互学习策略的在线强化学习方法。全面的实验评估表明，我们提出的OnDeFog在高丢帧率环境下相比ODT取得了更优的性能，并且在包含大量低奖励数据的数据集上优于DeFog。

英文摘要

In challenging real-world reinforcement learning applications, communication delays or sensor failures often cause frame dropping, in which the agent cannot receive the dropped states and associated rewards. To address the performance degradation caused by frame dropping, the Decision Transformer under Random Frame Dropping (DeFog) was developed by incorporating additional mechanisms into the decision transformer to tackle frame dropping. Although DeFog can mitigate performance degradation in frame-dropping environments, since DeFog is an offline learning method, it struggles to effectively generalize to novel states not adequately represented in the training dataset. In this study, we propose OnDeFog, which integrates the mechanisms in DeFog with the online decision transformer (ODT), an online reinforcement learning method that learns policies through direct environmental interaction. Comprehensive experimental evaluation demonstrates that our proposed OnDeFog achieves superior performance compared to ODT in environments characterized by high dropping frame rate and outperforms DeFog on datasets containing a large amount of low-reward data.

URL PDF HTML ☆

赞 0 踩 0

2606.19587 2026-06-19 stat.ML cs.LG 新提交 60%

A Solver-Free Training Method for Predict-then-Optimize

一种无求解器的预测后优化训练方法

Beichen Wan, Mo Liu

发表机构 * Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, NC, USA（统计与运筹学系，北卡罗来纳大学教堂山分校）

专题命中其他LLM ：提出无求解器训练方法，优化预测模型，属于LLM应用

AI总结提出一种基于测度变换的决策聚焦学习管道，通过无求解器代理损失实现预测后优化中预测模型的高效训练，理论保证Fisher一致性，训练时间降低数个数量级。

Comments Accepted by ICML 2026

详情

AI中文摘要

我们提出了一种可扩展的方法，用于在预测后优化范式中训练预测（机器学习）模型，其中模型输出作为后续线性优化任务的系数。直接最小化经验决策遗憾对于线性规划和组合优化是不可行的，因为决策映射是分段常数，且梯度几乎处处为零。虽然现有方法通过平滑微分过程来解决这一问题，但它们存在可扩展性问题，因为每次梯度评估都需要调用计算昂贵的求解器。为了解决这个问题，我们提出了一种基于测度变换原理的决策聚焦学习管道，该管道在训练期间产生一个完全无优化求解器的新代理损失。我们建立了理论保证，包括Fisher一致性和超额风险界。实验上，我们的方法在实现与最先进方法相当的决策质量的同时，将训练时间减少了数个数量级。

英文摘要

We propose a scalable method for training prediction (machine learning) models in the predict-then-optimize paradigm, where model outputs serve as coefficients for a subsequent linear optimization task. Directly minimizing the empirical decision regret is intractable for linear programming and combinatorial optimization since the decision mapping is piecewise constant, and the gradients are zero almost everywhere. While existing methods address this by smoothing the differentiation process, they suffer from scalability issues, since a computationally expensive solver call is required for every gradient evaluation. To address this, we propose a decision-focused learning pipeline based on a measure transformation principle, which yields a new surrogate loss that is completely optimization-solver-free during training. We establish theoretical guarantees, including Fisher consistency and excess risk bounds. Empirically, our method achieves decision quality competitive with state-of-the-art methods while reducing training time by orders of magnitude.

URL PDF HTML ☆

赞 0 踩 0

2606.19410 2026-06-19 stat.ML cs.LG 新提交 60%

The Representational Limit of Scalar Interactions: An Interventional Decomposition

标量交互的表征限制：一种干预分解

Potito Aghilar, Sabino Roccotelli, Stanislao Fidanza, Vito Walter Anelli, Sebastiano Stramaglia, Tommaso Di Noia

发表机构 * Polytechnic University of Bari（巴里理工学院）； University of Bari Aldo Moro（巴里大学Aldo Moro）

专题命中其他LLM ：提出特征交互分解方法，可用于模型解释

AI总结本文证明标量交互指标混淆了唯一性、冗余性和协同性，并提出Stochastic Hi-Fi方法，通过干预掩码推理分解每个特征的U/R/S轮廓，在表格和图像任务中恢复被标量基线遗漏的结构。

详情

AI中文摘要

有符号的成对交互指标从根本上混淆了唯一性（U）、冗余性（R）和协同性（S）。我们在一个最小的3路XOR结构因果模型上证明了这一点：忠实的指标如Shapley-Taylor对每对返回零，而投影指标如Shapley Interaction将三阶效应扩散到混淆三种机制的成对标量中。我们引入了Stochastic Hi-Fi，一种事后、无需重新训练的可预测性分解方法，通过干预掩码推理估计每个特征的U/R/S轮廓。该估计器提供精确的干预语义、有限样本蒙特卡洛界限、耦合菱形采样带来的严格方差减少以及均匀的有限词汇收敛。在表格SCM上，Stochastic Hi-Fi恢复了被标量基线遗漏的结构（交互幅度恢复比高达411倍）。它还在GPT-2 IOI电路中分离了冗余和协同头。在NIH ChestX-ray14上，Stochastic Hi-Fi在Pointing Game中匹配GradCAM，并在Deletion AUC上显著改进。

英文摘要

Signed pairwise interaction scores fundamentally conflate uniqueness (U), redundancy (R), and synergy (S). We prove this on a minimal 3-way XOR structural causal model: faithful indices such as Shapley-Taylor return zero per pair, whereas projective indices such as Shapley Interaction spread the third-order effect into pair scalars that conflate the three mechanisms. We introduce Stochastic Hi-Fi, a post-hoc, retraining-free predictability decomposition that estimates per-feature U/R/S profiles by interventional masked inference. The estimator provides exact interventional semantics, finite-sample Monte Carlo bounds, strict variance reduction from coupled diamond sampling, and uniform finite-vocabulary convergence. Across tabular SCMs, Stochastic Hi-Fi recovers structure missed by scalar baselines (up to 411x larger interaction-magnitude recovery ratios). It also separates redundant and synergistic heads in the GPT-2 IOI circuit. On NIH ChestX-ray14, Stochastic Hi-Fi matches GradCAM on Pointing Game and improves substantially on Deletion AUC.

URL PDF HTML ☆

赞 0 踩 0

2606.20518 2026-06-19 cs.AI 新提交 60%

FlowEdit: Associative Memory for Lifelong Pronunciation Adaptation in Flow-Matching TTS

FlowEdit: 流匹配TTS中终身发音适应的联想记忆

Harshit Singh, Ayush Pratap Singh, Nityanand Mathur

发表机构 * University Of Maryland（马里兰大学）； TU Darmstadt（达姆施塔特工业大学）； Smallest AI

专题命中其他LLM ：流匹配TTS的终身发音适应

AI总结针对流匹配TTS部署后无法纠正专有名词发音错误的问题，提出FlowEdit框架，通过潜在条件编辑而非权重更新学习发音修正，并利用现代Hopfield网络存储和检索修正，在312个多语言专有名词基准上将音素错误率降低92.7%。

详情

AI中文摘要

流匹配文本到语音系统在零样本场景下表现出色，但部署后保持静态：除非重新训练模型，否则对词汇表外的专有名词的发音错误会持续存在。我们提出FlowEdit，一个用于冻结的流匹配TTS的终身适应框架，它将发音修正学习为潜在条件编辑而非权重更新。当提供纠正性反馈时，FlowEdit优化文本嵌入空间中的令牌级扰动，然后将修正存储在作为内容可寻址情景记忆的现代Hopfield网络中。在推理时，通过具有相似性门控的软注意力检索修正，实现模糊形态匹配。在我们整理的涵盖18个语系的312个多语言专有名词基准上，FlowEdit相对于零样本基线将目标词音素错误率降低了92.7%，同时保持相同的通用语音质量。修正过程在单个GPU上大约15秒完成。

英文摘要

Flow-matching text-to-speech systems achieve remarkable zero-shot quality but remain static after deployment: pronunciation errors on out-of-vocabulary proper nouns persist unless the model is retrained. We introduce FlowEdit, a life-long adaptation framework for frozen flow-matching TTS that learns pronunciation corrections as latent conditioning edits rather than weight updates. When corrective feedback is provided, FlowEdit optimizes a token-level perturbation in the text embedding space, then stores the correction in a Modern Hopfield Network serving as content-addressable episodic memory. At inference, corrections are retrieved via soft attention with a similarity gate, enabling fuzzy morphological matching. On our curated benchmark of 312 multilingual proper nouns across 18 language families, FlowEdit reduces target-word Phoneme Error Rate by 92.7% relative to the zero-shot baseline while maintaining identical general-speech quality. Corrections complete in approximately 15 seconds on a single GPU.

URL PDF HTML ☆

赞 0 踩 0

2606.20431 2026-06-19 cs.LG 新提交 60%

Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning

稀疏性、叠加与遗忘：持续学习中表示保持的机制研究

Jan Wasilewski, Jędrzej Kozal, Michał Woźniak, Bartosz Krawczyk

发表机构 * Rochester Institute of Technology（罗切斯特理工学院）； Wrocław University of Science and Technology（弗罗茨瓦夫科技大学）

专题命中其他LLM ：研究持续学习中的遗忘机制，与LLM相关

AI总结通过可控玩具框架研究持续学习中的遗忘机制，发现叠加随时间增加但任务边界处有瞬降，高稀疏性增加叠加但不必然导致遗忘，任务级有效秩随稀疏性增长。

详情

AI中文摘要

持续学习（CL）系统常常遗忘先前获得的知识，但由于真实数据集纠缠了许多因素，遗忘的机制在实践中难以孤立。我们提出了一个可控的玩具世界框架，使这些机制可观察和可测试。使用合成生成器-分离器流水线，我们定义了真实潜在特征，构建了具有可调稀疏性和重叠的任务，并引入了表示强度和叠加（特征间的方向重叠）的可测量量。然后，我们通过拟合保留、叠加和暴露历史之间的稀疏动态关系（通过SINDy）来研究保留动态——表示强度的时间变化。基于有效秩的互补任务级分析表征了表示能力如何在任务间分配。我们的受控实验得出三个要点。（1）叠加随时间增加，在任务边界处有瞬降，表明边界特定的干扰而非稳定漂移。（2）更高的特征稀疏性导致更多叠加，但不必然引起遗忘；当表示保持强时，尽管重叠，遗忘可以减少。（3）任务级有效秩随稀疏性增长，表明在稀疏机制下更广泛的能力使用。这些结果共同细化了常见直觉——更多叠加导致更多遗忘，通过显示重叠与表示强度和能力分配相互作用。我们的玩具分析为CL提供了可证伪的假设和诊断工具。

英文摘要

Continual learning (CL) systems often forget previously acquired knowledge, yet the mechanisms driving forgetting remain hard to isolate in practice because real datasets entangle many factors. We present a controlled, toy-world framework that makes these mechanisms observable and testable. Using a synthetic generator-separator pipeline, we define ground-truth latent features, build tasks with tunable sparsity and overlap, and introduce measurable quantities for representation strength and superposition (directional overlap among features). We then study retention dynamics-the temporal change of representation strength by fitting sparse dynamical relations (via SINDy) between retention, superposition, and exposure history. A complementary task-level analysis based on effective rank characterizes how representational capacity is allocated across tasks. Our controlled experiments yield three takeaways. (1) Superposition tends to increase over time with transient dips at task boundaries, suggesting boundary-specific interference rather than steady drift. (2) Higher feature sparsity induces more superposition yet does not inevitably cause forgetting; when representations remain strong, forgetting can be reduced despite overlap. (3) Task-level effective rank grows with sparsity, indicating broader capacity usage under sparse regimes. Together, these results nuance the common intuition that more superposition leads to more forgetting by showing that overlap interacts with representation strength and capacity allocation. Our toy analysis provides falsifiable hypotheses and diagnostic tools for CL.

URL PDF HTML ☆

赞 0 踩 0

2606.20254 2026-06-19 cs.CR 新提交 60%

Quantization as a Malicious Task: Removing Quantization-Conditioned Backdoors via Task Arithmetic

量化作为恶意任务：通过任务算术移除量化条件后门

Kaihsun Yang, Min-Yan Tsai, Chia-Mu Yu

专题命中其他LLM ：防御量化后门，涉及模型安全

AI总结提出QVec方法，通过将量化引起的权重变化视为恶意任务向量，在部署前进行参数校正，无需重训练或触发样本即可防御量化条件后门。

详情

AI中文摘要

模型量化被广泛采用，以在资源受限设备上部署深度神经网络时减少内存使用和推理成本。然而，最近的研究揭示了一种新的安全威胁，称为量化条件后门（QCBs），其中模型在全精度下行为正常，但仅在量化后激活恶意行为。现有的防御通常修改量化过程或校正激活统计，往往引入额外的计算开销或依赖特定的量化设置。在这里，我们提出QVec，一种从参数空间角度防御QCBs的方法。我们观察到，全精度模型与其量化版本之间的权重差异编码了一种结构化的行为偏移，可以解释为恶意任务向量，而非随机量化噪声。基于这一见解，QVec通过在部署前进行受控的参数校正来抵消这一恶意方向。QVec无需重新训练，无需触发样本，仅需一次量化传递来估计参数偏移，以及轻量级的超参数搜索。在图像分类基准和多个大型语言模型（LLM）攻击场景中的大量实验表明，QVec在保持干净性能的同时，持续抑制后门激活。

英文摘要

Model quantization is widely adopted to reduce memory usage and inference cost when deploying deep neural networks on resource-constrained devices. However, recent studies have revealed a new security threat known as Quantization-Conditioned Backdoors (QCBs), where a model behaves normally in full precision but activates malicious behavior only after quantization. Existing defenses typically modify quantization procedures or correct activation statistics, often introducing additional computational overhead or relying on specific quantization settings. Here, we present QVec, a parameter-space perspective for defending against QCBs. We observe that the weight difference between a full-precision model and its quantized counterpart encodes a structured behavioral shift, which can be interpreted as a malicious task vector rather than random quantization noise. Based on this insight, QVec counteracts this malicious direction through controlled parameter correction prior to deployment. QVec requires no retraining, no trigger samples, and only a single quantization pass to estimate the parameter shift, together with a lightweight hyperparameter search. Extensive experiments across image classification benchmarks and multiple Large Language Model (LLM) attack scenarios demonstrate that QVec consistently suppresses backdoor activation while preserving clean performance.

URL PDF HTML ☆

赞 0 踩 0

2606.19910 2026-06-19 cs.CL cs.SD eess.AS 新提交 60%

Light-weight Pronunciation Assessment via Discrete Speech Token Surprisal

轻量级发音评估：基于离散语音标记的意外度

Syeda Faiza Ahmed Sara, Shammur Absar Chowdhury

发表机构 * Qatar Computing Research Institute, Doha, Qatar（卡塔尔计算研究所，多哈，卡塔尔）

专题命中其他LLM ：使用语言模型计算语音标记意外度进行发音评估。

AI总结提出仅使用母语语音资源训练的轻量级发音评估框架，通过离散化语音标记和语言模型计算意外度，结合文本引导对齐特征，在无监督或少量校准下达到接近监督方法的性能。

Comments Accepted to Interspeech 2026

详情

AI中文摘要

训练自动发音评估通常依赖于标记的学习者错误或非母语语料库，这些语料库收集成本高昂。我们提出一个轻量级框架，仅使用母语语音资源训练，以无监督或通过少量评分话语进行轻量校准的方式运行。在推理时，学习者语音通过SSL编码器和K-means码本进行离散化。一个在母语序列上训练的标记语言模型计算意外度，其中较高的意外度表示音位偏差。我们添加了一个转录引导的Text2DUnit--DTW模块，该模块从参考文本预测母语标记序列，并将其与声学标记对齐以推导出错误敏感特征。意外度和对齐特征通过简单回归融合。在SpeechOcean762上，PCC从0.60提升到0.66（带转录引导），接近监督基线。在L2-ARCTIC上的跨数据集评估显示了一致的提升。

英文摘要

Training automated pronunciation assessment often relies on labeled learner errors or non-native corpora that are costly to collect. We propose a lightweight framework trained only on native speech resources, operating unsupervised or lightly calibrated with a small set of scored utterances. At inference, learner speech is discretized with an SSL encoder and a K-means codebook. A token language model trained on native sequences computes surprisal where higher surprisal indicates phonotactic deviation. We add a transcript-guided Text2DUnit--DTW module that predicts native token sequences from reference text and aligns them to acoustic tokens to derive error-sensitive features. Surprisal and alignment features are fused via simple regression. On SpeechOcean762, PCC improves from 0.60 to 0.66 with transcript guidance, near supervised baselines. Cross-dataset evaluation on L2-ARCTIC shows consistent gains.

URL PDF HTML ☆

赞 0 踩 0

2606.19734 2026-06-19 cs.LG 新提交 60%

Federated Bilevel Performative Prediction

联邦双层执行预测

Liangxin Qian, Chang Liu, Xuanyu Cao, Jun Zhao, Kwok-Yan Lam

发表机构 * Nanyang Technological University（南洋理工大学）； Zhejiang University（浙江大学）； Washington State University（华盛顿州立大学）

专题命中其他LLM ：研究联邦学习中的双层优化，涉及分布偏移。

AI总结研究联邦学习中客户端数据分布受决策影响的双层优化问题，提出联邦双层执行稳定点概念及两种求解方法，实验验证了稳定性阈值和元泛化提升。

Comments Accepted by ICML 2026

详情

AI中文摘要

联邦双层优化广泛用于跨分布式客户端的嵌套学习问题，例如在隐私和通信约束下的联邦超参数调整和元学习。大多数现有公式假设客户端数据分布固定，但执行性可能违反这一假设，其中部署的决策会重塑客户端行为和数据收集，导致客户端特定的、决策依赖的分布偏移。我们研究联邦双层执行预测，其中上层（UL）和下层（LL）目标都在客户端依赖、决策依赖的分布下进行评估。我们在解耦风险视角下形式化联邦双层执行稳定（FBPS）点，并给出其存在性和唯一性的充分条件。然后，我们开发两种联邦方法来计算FBPS解：FBi-RRM，在收缩条件下线性收敛；以及FBi-SGD，一种基于联邦超梯度估计的通信高效随机方法，在步长递减且敏感性足够小时具有收敛保证。在策略回归和元策略分类上的实验验证了预测的稳定性阈值，并展示了相对于非执行基线的元泛化改进，基于CNN的分类进一步证明了所提方法在非凸神经网络设置中的实际有效性。

英文摘要

Federated bilevel optimization is widely used for nested learning problems across distributed clients, such as federated hyperparameter tuning and meta-learning under privacy and communication constraints. Most existing formulations assume fixed client data distributions, which can be violated by performativity, where deployed decisions reshape client behavior and data collection, inducing client-specific, decision-dependent distribution shift. We study federated bilevel performative prediction, where both upper-level (UL) and lower-level (LL) objectives are evaluated under client-dependent, decision-dependent distributions. We formalize the federated bilevel performatively stable (FBPS) point under a decoupled-risk perspective and provide sufficient conditions for its existence and uniqueness. We then develop two federated methods to compute the FBPS solution: FBi-RRM, which converges linearly under a contraction condition, and FBi-SGD, a communication-efficient stochastic method based on federated hypergradient estimation with convergence guarantees under diminishing step sizes when sensitivities are sufficiently small. Experiments on strategic regression and meta strategic classification validate the predicted stability thresholds and demonstrate improved meta-generalization over non-performative baselines, and CNN-based classification further demonstrates the practical effectiveness of the proposed methods in nonconvex neural network settings.

URL PDF HTML ☆

赞 0 踩 0

2606.19603 2026-06-19 cs.LG 新提交 60%

Comparing Linear Probes with Mahalanobis Cosine Similarity

比较线性探针与马氏余弦相似度

Zhuofan Josh Ying, Peter Hase, Nikolaus Kriegeskorte

发表机构 * Columbia University（哥伦比亚大学）； Stanford University（斯坦福大学）； Schmidt Sciences（施密特科学）

专题命中其他LLM ：研究线性探针比较方法，与LLM可解释性相关

AI总结研究证明马氏余弦相似度与OOD AUROC存在线性关系，提供理论解释并验证其作为线性探针比较指标的有效性。

Comments 16 pages, 10 figures

详情

AI中文摘要

线性探针广泛用于可解释性研究，并常通过余弦相似度进行比较。两个方向之间的马氏余弦相似度（MCS）通过测试数据协方差重新加权内积，是一种自然的任务感知改进。Ying等人（2026）报告称，探针与在分布外（OOD）数据上训练的参考探针的MCS近乎完美地线性预测了该探针的OOD AUROC（R^2 = 0.98）。在这里，我们将这一实证发现扩展到不同模型、层和概念领域，并以封闭形式证明了这一普遍现象：对于投影为高斯分布的平衡类别，OOD AUROC与参考探针的MCS是线性的，因为两者都是探针在测试数据上信噪比（SNR）的S形函数。该理论还预测了这种线性何时失效，我们通过实验验证了这一点。MCS为比较线性探针提供了有理论依据且经验有效的替代方案，优于欧几里得余弦相似度。

英文摘要

Linear probes are widely used in interpretability research and often compared by cosine similarity. The Mahalanobis cosine similarity (MCS) between two directions, which reweights the inner product by test data covariance, is a natural task-aware refinement. Ying et al. (2026) report that a probe's MCS to a reference probe trained on the out-of-distribution (OOD) data near-perfectly linearly predicts the probe's OOD AUROC (R^2 = 0.98). Here, we extend this empirical finding across models, layers, and concept domains, and prove this general phenomenon in closed form: For balanced classes whose projections are Gaussian, OOD AUROC and MCS to the reference probe are linear because both are sigmoid-shaped functions of the probe's signal-to-noise ratio (SNR) on the test data. The theory also predicts when this linearity fails, which we verify empirically. MCS offers a theoretically grounded and empirically effective alternative to Euclidean cosine similarity for comparing linear probes.

URL PDF HTML ☆

赞 0 踩 0

2606.19411 2026-06-19 cs.LG 新提交 60%

Spectral DPPs via NEPv: A Scalable Continuous Relaxation of Determinantal MAP for Diversity-Aware Data Selection

通过NEPv的谱DPP：用于多样性感知数据选择的确定性点过程MAP的可扩展连续松弛

Richard Yi Da Xu

发表机构 * Hong Kong Baptist University（香港浸会大学）； TadReamk Limited（TadReamk有限公司）

专题命中其他LLM ：多样性感知数据选择，可应用于LLM数据筛选。

AI总结提出将NP难的DPP-MAP选择问题转化为Stiefel流形上的连续优化，通过非线性特征值问题（NEPv）的自洽场迭代实现近线性时间求解，适用于大规模数据选择。

详情

AI中文摘要

从海量候选池中选择一个小的、多样化的、高质量的子集是现代机器学习中的一个常见原语——用于训练和微调大型模型的数据整理和核心集选择、主动学习批次获取、上下文学习的提示和示例选择、检索多样化以及实验设计。确定性点过程（DPP）为此任务提供了原则性的、良好校准的多样性概念，但其MAP目标——选择大小为$k$的子集$S$最大化$\log\det(L_S)$——是NP难的，并且标准的贪心和采样算法在候选集大小$n$上具有超线性复杂度。这种成本在多样性最重要的数据为中心的场景中尤其高昂，其中$n$范围从数百万到数十亿的候选示例、特征或嵌入。我们将DPP-MAP重新表述为Stiefel流形上的连续优化问题，并证明其最优性条件构成一个先前未研究形式的具有特征向量依赖性的非线性特征值问题（NEPv）。该NEPv允许自洽场（SCF）迭代，具有基于谱间隙的局部收缩保证，从而提供了一个原则性的迭代求解器，其中多样性目标驱动一个特征向量依赖的算子。由此产生的算法OurMethod仅需要与核的矩阵-向量乘积，运行时间为$O\!\big((ndk+nk^2)\,t\big)$，其中迭代次数$t$很小，在$n$上接近线性，并直接与机器学习中常见的低秩和特征映射核集成。本文重点介绍松弛、求解器和扩展分析；完整的真实数据基准测试留给计划中的实证研究。

英文摘要

Selecting a small, diverse, high-quality subset from a massive pool of candidates is a recurring primitive in modern machine learning -- data curation and coreset selection for training and fine-tuning large models, active-learning batch acquisition, prompt and exemplar selection for in-context learning, retrieval diversification, and experimental design. Determinantal Point Processes (\DPP s) give a principled, well-calibrated notion of diversity for this task, but their \emph{MAP} objective -- pick a size-$k$ subset $S$ maximizing $\logdet(L_S)$ -- is NP-hard, and the standard greedy and sampling algorithms scale superlinearly in the ground-set size $n$. This cost is prohibitive precisely in the data-centric regime where diversity matters most, where $n$ ranges over millions to billions of candidate examples, features, or embeddings. We recast \DPP-MAP as a continuous optimization problem over the Stiefel manifold, and show that its first-order optimality conditions form a \emph{Nonlinear Eigenvalue Problem with eigenvector dependency} (\NEPv) of a previously unstudied form. This \NEPv\ admits a self-consistent field (\SCF) iteration with a spectral-gap-based local contraction guarantee, giving a principled iterative solver where the diversity objective drives an eigenvector-dependent operator. The resulting algorithm, \OurMethod, requires only matrix-vector products with the kernel and runs in time $O\!\big((ndk+nk^2)\,t\big)$ for a small number of iterations $t$, scaling near-linearly in $n$ and integrating directly with low-rank and feature-map kernels common in ML. This paper focuses on the relaxation, solver, and scaling analysis; full real-data benchmarking is left to a planned empirical study.

URL PDF HTML ☆

赞 0 踩 0

2605.31393 2026-06-19 cs.CL cs.AI 版本更新 70%

Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models

面向手语翻译的大语言模型目标端释义增强

Pedro Dal Bianco, Jean Paul Nunes Reinhold, Oscar Stanchi, Facundo Quiroga, Franco Ronchetti, Ulisses Brisolara Corrêa

发表机构 * III-LIDI Universidad Nacional de La Plata（III-LIDI国立拉普拉塔大学）； CDTEC, Federal University of Pelotas（CDTEC，联邦 Pelotas 大学）； CONICET III-LIDI ； Comision de Investigaciones Cientificas Universidad Nacional de La Plata（科学委员会国立拉普拉塔大学）； Universidade Federal de Pelotas（联邦 Pelotas 大学）

专题命中指令微调：使用GPT-4o生成释义增强手语翻译。

AI总结针对手语翻译中平行语料稀缺和目标词汇长尾分布的问题，提出利用GPT-4o生成参考句子的受控释义变体进行目标端增强，并在三种手语数据集上验证了方法的有效性。

Comments Accepted at GenSign @ CVPR 2026. Non-Proceedings Track (https://genai4sl.github.io/)

详情

AI中文摘要

手语翻译（SLT）仍然受到有限的配对手语视频/文本语料库和长尾目标词汇的限制。我们研究了目标端增强方法，其中GPT-4o生成参考句子的受控释义变体，而手语输入保持不变。采用基于Signformer姿态的Transformer，在两阶段调度下进行训练：先在增强语料库上预训练，然后在原始参考句子上微调。我们在三个具有互补挑战的数据集上进行了评估：PHOENIX14T（德国手语），具有适度的词汇多样性；GSL（希腊手语），具有高度受控、重复的录制；以及LSA-T（阿根廷手语），具有严重的长尾稀疏性。在PHOENIX14T上，增强将BLEU-4从9.56提高到10.33。接近饱和的GSL基线和极其稀疏的LSA-T设置揭示了该方法的局限性。据我们所知，这是第一项将LLM生成的目标端释义和LLM作为评估者应用于手语翻译的研究。语义评估揭示了词汇重叠指标低估的忠实度提升。

英文摘要

Sign language translation (SLT) remains constrained by the limited availability of paired sign-video/text corpora and by the heavy-tailed vocabularies typical of real-world datasets. We study a target-side augmentation strategy in which a large language model (LLM) generates controlled paraphrase variants of the reference spoken-language sentence while the sign input remains unchanged. Concretely, we use GPT-4o to produce semantically faithful variants of the training targets and train a Signformer-style pose-based Transformer under a two-stage schedule: pre-training on the augmented corpus followed by fine-tuning on the original references. We evaluate this strategy on three datasets that span complementary challenges: PHOENIX14T (German Sign Language), a real-world corpus with moderate lexical diversity; the Greek Sign Language Dataset with highly controlled, repetitive recordings; and LSA-T (Argentinian Sign Language), a naturalistic corpus with a large vocabulary and severe long-tail sparsity. This range allows us to characterize precisely when and why target-side augmentation is beneficial. On PHOENIX14T, augmentation improves BLEU-4 from 9.56 to 10.33, demonstrating that paraphrastic exposure helps the decoder generalize beyond memorized reference phrasing. The near-saturated GSL baseline and the extremely sparse LSA-T setting reveal the limits of the approach: in both cases, single-reference lexical overlap metrics are insufficient to capture the full picture, motivating a complementary semantic evaluation. To our knowledge, this is the first study to examine LLM-generated target-side paraphrases as an augmentation mechanism for SLT, and the first to apply an LLM-as-a-Judge evaluation protocol to SLT. This complementary evaluation reveals gains in semantic fidelity that lexical overlap metrics understate.

URL PDF HTML ☆

赞 0 踩 0

2604.23938 2026-06-19 cs.CL 版本更新 70%

TSAssistant: A Human-in-the-Loop Agentic Framework for Automated Target Safety Assessment

TSAssistant: 一种人在回路中的自动化靶点安全性评估智能体框架

Xiaochen Zheng, Zhiwen Jiang, David Tokar, Yexiang Cheng, Alvaro Serra, Melanie Guerard, Klas Hatje, Tatyana Doktorova

发表机构 * Computational Sciences Center of Excellence（计算科学卓越中心）

专题命中领域大模型：利用LLM进行生物医学文献检索与综合

AI总结提出TSAssistant多智能体框架，通过分层指令架构和交互式优化循环，将靶点安全性评估报告生成分解为专业子任务，实现高可重复性和证据溯源。

Comments Updated with quantitative and expert evaluations

详情

AI中文摘要

靶点安全性评估（TSA）需要系统整合遗传、转录组、靶点同源性、药理学和临床数据，以评估治疗靶点的潜在安全性风险。该过程劳动密集且依赖专家，在可扩展性和可重复性方面面临挑战。我们提出TSAssistant，一种人在回路中的多智能体框架，将TSA报告生成分解为专门子智能体的工作流：研究子智能体各自基于并引用单个TSA领域，合成子智能体整合跨领域发现。子智能体通过标准化工具接口从精选生物医学来源检索和综合证据，生成可单独引用、基于证据的章节，其行为由分层指令架构塑造，该架构将协调逻辑与领域专业知识和用户意图分离。为补充这些软约束，程序化执行钩子和持久记忆存储在整个工作流中强制执行硬约束，而交互式优化循环允许专家在完全保留跨迭代对话上下文的情况下审查和修订各个章节。我们不是进行单一的整体比较，而是将报告质量分解为可重复性、证据基础、任务级准确性和专家监督下的可控性，发现高可重复性和证据基础、与人类参考高度一致以及专家驱动的净正面改进。

英文摘要

Target Safety Assessment (TSA) requires systematic integration of genetic, transcriptomic, target homology, pharmacological, and clinical data to evaluate potential safety liabilities of therapeutic targets. This process is labor-intensive and expert-dependent, posing challenges in scalability and reproducibility. We present TSAssistant, a human-in-the-loop multi-agent framework that decomposes TSA report generation into a workflow of specialized subagents: Research Subagents that each ground and cite a single TSA domain, and Synthesis Subagents that integrate findings across domains. Subagents retrieve and synthesize evidence from curated biomedical sources through standardized tool interfaces and produce individually citable, evidence-grounded sections, with behavior shaped by a hierarchical instruction architecture that separates coordination logic from domain expertise and user intent. To complement these soft constraints, programmatic execution hooks and persistent memory stores enforce hard constraints across the workflow, while an interactive refinement loop allows experts to review and revise individual sections with full conversational context preserved across iterations. Rather than a single holistic comparison, we decompose report quality into reproducibility, evidential grounding, task-level accuracy, and controllability under expert oversight, finding high reproducibility and grounding, substantial agreement with the human reference, and net-positive expert-driven refinement.

URL PDF HTML ☆

赞 0 踩 0

2402.14035 2026-06-19 cs.LG cs.AI 版本更新 70%

Wisdom of Committee: Diverse Distillation from Large Foundation Models and Domain Experts

委员会智慧：来自大型基础模型和领域专家的多样化蒸馏

Zichang Liu, Qingyun Liu, Yuening Li, Liang Liu, Anshumali Shrivastava, Shuchao Bi, Lichan Hong, Ed H. Chi, Zhe Zhao

发表机构 * Rice University（Rice大学）； Google DeepMind（谷歌DeepMind）； Google Inc（谷歌公司）； University of California, Davis（加州大学戴维斯分校）

专题命中领域大模型：蒸馏基础模型到紧凑领域模型，涉及推荐和视觉

AI总结针对基础模型向紧凑领域模型蒸馏时能力、架构和模态差异大的问题，提出DiverseDistill框架，通过可学习的问答机制和对齐异构教师输出，在推荐和视觉任务上恢复73-114%的性能差距。

Comments Accepted at the 1st Workshop on Resource-Efficient Learning and Knowledge Discovery (RelKD), KDD 2026

Journal ref Proceedings of the RelKD Workshop at KDD 2026

详情

AI中文摘要

从基础模型向紧凑领域模型进行知识蒸馏因能力、架构和模态的巨大差异而具有挑战性。例如，在我们的实验中，从7600万参数的语言模型蒸馏到200万参数的推荐模型仅能弥补未蒸馏学生与教师之间不到40%的性能差距。我们表明，引入与基础模型共享学生架构特征的领域专家作为多样化教师委员会，能显著改善迁移效果。然而，标准的多教师方法未能利用这种多样性：简单组合异构教师可能使性能低于单教师蒸馏。为此，我们提出DiverseDistill，一种交互式蒸馏框架，采用可学习的问答机制生成教师条件查询，并将异构教师输出对齐到学生的表示空间。与需要基于梯度的协同优化或修改教师架构的方法不同，DiverseDistill在冻结教师的情况下仅通过其中间层的前向推理运行：无需参数更新、无需协同训练、无需架构修改。动态教师重要性机制通过过滤每个样本中低相关性的教师（例如，在推荐任务中减少约30%的前向传播且无质量损失）进一步降低训练成本，而整个蒸馏模块在训练后被丢弃，推理时零开销。在推荐（38倍压缩）和视觉（3.6倍压缩）任务上的评估表明，DiverseDistill恢复了73-114%的师生性能差距，持续优于所有单教师和多教师基线方法。

英文摘要

Knowledge distillation from foundation models to compact domain models is challenging due to substantial gaps in capacity, architecture, and modality. For example, in our experiments, distilling from a 76M-parameter language model to a 2M-parameter recommender closes less than 40% of the performance gap between the undistilled student and the teacher. We show that introducing domain-specific experts -- which share the student's architectural characteristics -- alongside the foundation model as a diverse teacher committee significantly improves transfer. However, standard multi-teacher methods fail to exploit this diversity: naively combining heterogeneous teachers can degrade performance below single-teacher distillation. To address this, we propose DiverseDistill, an interactive distillation framework that employs a learnable Question-Answer mechanism to generate teacher-conditioned queries and align heterogeneous teacher outputs into the student's representation space. Unlike methods requiring gradient-based co-optimization or architectural modification of teachers, DiverseDistill operates with frozen teachers using only forward-pass inference through their intermediate layers: no parameter updates, no co-training, and no architectural surgery. A dynamic teacher importance mechanism further reduces training cost by filtering low-relevance teachers per sample (e.g., ~30% fewer forward passes with no quality loss for recommendation tasks), while the entire Distillation Module is discarded after training, adding zero inference overhead. Evaluations on recommendation (38x compression) and vision (3.6x compression) tasks demonstrate that DiverseDistill recovers 73-114% of the teacher-student performance gap, consistently outperforming all single- and multi-teacher baselines.

URL PDF HTML ☆

赞 0 踩 0

2606.20554 2026-06-19 cs.IR cs.AI 新提交 60%

Structuring and Tokenizing Distributed User Interest Context for Generative Recommendation

结构化与分词化分布式用户兴趣上下文以支持生成式推荐

Ruizhong Qiu, Yinglong Xia, Dongqi Fu, Hanqing Zeng, Ren Chen, Xiangjun Fan, Hong Li, Hong Yan, Hanghang Tong

发表机构 * University of Illinois Urbana--Champaign（伊利诺伊大学厄巴纳-香槟分校）； Meta MRS

专题命中领域大模型：生成式推荐，涉及语言模型与用户兴趣建模。

AI总结提出G2Rec框架，通过统一图建模与语义分词，实现工业级生成式推荐中用户兴趣上下文的全面准确建模。

详情

AI中文摘要

生成式推荐是一种新兴范式，在工业推荐系统中展现出前景，旨在从用户历史行为中预测其下一次交互。生成式推荐的核心是物品分词，它连接了物品语义与推荐模型。然而，现有方法往往难以同时有效地组织和注入复杂的用户行为与物品语义上下文。一方面，现有的基于图的集成方法，如图序列化和图神经网络，要么存在可扩展性问题，要么仅利用局部图信息。另一方面，现有的语义分词方法通常依赖启发式规则且缺乏明确的监督信号，可能导致不准确或次优的语义表示。为解决用户兴趣上下文建模中的这些局限性，我们提出G2Rec，一个可扩展的框架，将基于图的整体用户共同参与建模与语义分词统一起来，用于工业级生成式推荐。总体而言，G2Rec使推荐模型能够捕捉整体且基于语义的用户兴趣原型，而无需真实用户兴趣，从而在工业序列推荐中提供更全面、更准确的用户行为上下文建模。跨产品表面的在线部署和在公开数据集上的大量实验证明了G2Rec相对于现有方法的优越性。

英文摘要

Generative recommendation is an emerging paradigm that has shown promise in industrial recommendation systems, aiming to predict users' next interactions from their historical behaviors. At the core of generative recommendation lies item tokenization, which bridges item semantics and recommendation models. However, existing methods often struggle to effectively organize and inject complex user-behavioral and item-semantic contexts into recommendation models simultaneously. On the one hand, existing graph-based integration methods, such as graph serialization and graph neural networks, either suffer from scalability issues or exploit only local graph information. On the other hand, existing semantic tokenization methods typically rely on heuristics and lack explicit supervision signals, which may lead to inaccurate or suboptimal semantic representations. To address these limitations in user interest context modeling, we propose G2Rec, a scalable framework that unifies holistic graph-based user co-engagement modeling with semantic tokenization for industrial-scale generative recommendation. Overall, G2Rec enables recommendation models to capture holistic and semantically grounded user interest prototypes without requiring ground-truth user interests, thereby providing more comprehensive and accurate modeling of user behavior contexts in industrial sequential recommendation. Online deployment across product surfaces and extensive experiments on public datasets demonstrate the superiority of G2Rec over existing methods.

URL PDF HTML ☆

赞 0 踩 0

2606.20475 2026-06-19 cs.LG 新提交 65%

Marginal Advantage Accumulation for Memory-Driven Agent Self-Evolution

边际优势累积用于记忆驱动智能体自我进化

Mingyu Yang, Keye Zheng, Congchao Cheng, Yujie Liu, Xingkang Lu, Fan Jiang, Yefei Zheng

发表机构 * Alibaba International Digital Commerce Group（阿里巴巴国际数字商业集团）

专题命中后训练：涉及语言模型轨迹蒸馏，但非核心贡献。

AI总结针对批量式轨迹蒸馏中跨批次证据缺失问题，提出边际优势累积（MAA）方法，通过差分信号构造、指数移动平均累积和语义身份合并，在16个设置中14个取得最佳结果，优化阶段token消耗减少约75%。

Comments 26 pages, 4 figures, 10 tables, 42 references

2606.20553 2026-06-19 cs.CR 新提交 60%

From Efficiency to Leakage -- Privacy Backdoor in Federated Language Model Fine-Tuning

从效率到泄露——联邦语言模型微调中的隐私后门

Shanghao Shi, Chaoyu Zhang, Heng Jin, Yang Xiao, Yevgeniy Vorobeychik, William Yeoh, Ning Zhang, Y. Thomas Hou, Wenjing Lou

专题命中后训练：涉及语言模型微调中的隐私泄露。

AI总结提出NeuroImprint攻击，恶意参数服务器在参数高效微调中植入隐私后门，通过为每个样本分配独立神经元并限制单次更新，实现高保真重建训练文本。

详情

AI中文摘要

联邦学习（FL）使多方能够协作微调语言模型以完成特定领域任务，而无需共享原始数据。由于完整模型微调对FL客户端而言通常过于昂贵，参数高效微调（PEFT）已成为实践中的事实标准，它冻结基础模型，仅训练少量适配器。在本文中，我们表明恶意参数服务器可以隐秘地将PEFT适配器破坏为隐私后门，该后门隐式记忆客户端的训练样本，作为存储在独立神经元中的隔离的每样本参数更新，而不降低模型效用。具体来说，我们的攻击NeuroImprint为每个训练样本分配一个专用的记忆神经元，并约束每个神经元在局部微调轨迹中最多更新一次。这种设计减轻了语言模型微调中由大批量和状态优化器（如Adam/AdamW）引入的跨样本碰撞和跨步混合。微调后，得到的隔离的每样本更新可以通过闭式解析逆变换恢复文本嵌入，然后确定性地映射回令牌序列。为了理解我们方法的通用性，我们在多个语言模型（BERT、GPT-2、Qwen2和Llama3.2）上实现了NeuroImprint，并在涵盖不同领域的四个微调数据集上进行了评估。结果表明，我们的攻击能够以高语义保真度重建59%至79%的所有微调样本。

英文摘要

Federated learning (FL) enables multiple parties to collaboratively fine-tune language models for domain-specific tasks without sharing raw data. Since full model fine-tuning is often prohibitively expensive for FL clients, parameter-efficient fine-tuning (PEFT) has become the de facto approach in practice, freezing the base model and training only a small set of adapters. In this paper, we show that a malicious parameter server can stealthily corrupt a PEFT adapter into a privacy backdoor that implicitly memorizes the client's training samples as isolated per-sample parameter updates stored in separate neurons, without degrading model utility. Concretely, our attack, NeuroImprint, assigns a dedicated memorization neuron to each training sample and constrains that each neuron is updated at most once along the local fine-tuning trajectory. This design mitigates both cross-sample collisions and cross-step mixing introduced by large local batches and stateful optimizers (e.g., Adam/AdamW) in language-model fine-tuning. After fine-tuning, the resulting isolated per-sample updates can be analytically inverted in closed form to recover text embeddings, which are then deterministically mapped back to token sequences. To understand the generality of our method, we implemented NeuroImprint on multiple language models (BERT, GPT-2, Qwen2, and Llama3.2) and evaluated it across four fine-tuning datasets spanning diverse domains. The results demonstrate that our attack can reconstruct 59% to 79% of all finetuning samples with high semantic fidelity.

URL PDF HTML ☆

赞 0 踩 0

1. 预训练 2 篇

How Linear Is a Transformer Feed-Forward Block? Per-Block Linear Recoverability Is Learned, Not Architectural

Weibull Weight-Scale Parameter Evolution under AdamW Training Dynamics

2. 其他LLM 22 篇

Reinforcement Learning Foundation Models Should Already Be A Thing

LLM-Based Synthetic Ground Truth Generation for Audio-Based Emotion Classification via In-Context Learning

Learning What to Remember: Observability-Safe Memory Retention via Constrained Optimization for Long-Horizon Language Agents

Modeling U.S. Attitudes Toward China via an Event-Steered Multi-Agent Simulator

Too long; didn't solve

Teaching Students to Question the Machine: An AI Literacy Intervention Improves Students' Regulation of LLM Use in a Science Task

The Voice Behind the Words: Quantifying Intersectional Bias in SpeechLLMs

Beyond Grading Accuracy: Exploring Alignment of TAs and LLMs

Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms

Execution-State Capsules: Graph-Bound Execution-State Checkpoint and Restore for Low-Latency, Small-Batch, On-Device Physical-AI Serving

Neural Additive and Basis Models with Feature Selection and Interactions

CREDENCE: Claim Reduction for Decomposition & Enhanced Credibility -- Semantic Metrics and Convergence Analysis

OnDeFog: Online Decision Transformer under Frame Dropping

A Solver-Free Training Method for Predict-then-Optimize

The Representational Limit of Scalar Interactions: An Interventional Decomposition

FlowEdit: Associative Memory for Lifelong Pronunciation Adaptation in Flow-Matching TTS

Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning

Quantization as a Malicious Task: Removing Quantization-Conditioned Backdoors via Task Arithmetic

Light-weight Pronunciation Assessment via Discrete Speech Token Surprisal

Federated Bilevel Performative Prediction

Comparing Linear Probes with Mahalanobis Cosine Similarity

Spectral DPPs via NEPv: A Scalable Continuous Relaxation of Determinantal MAP for Diversity-Aware Data Selection

3. 指令微调 1 篇

Target-Side Paraphrase Augmentation for Sign Language Translation with Large Language Models

4. 领域大模型 3 篇

TSAssistant: A Human-in-the-Loop Agentic Framework for Automated Target Safety Assessment

Wisdom of Committee: Diverse Distillation from Large Foundation Models and Domain Experts

Structuring and Tokenizing Distributed User Interest Context for Generative Recommendation

5. 后训练 2 篇

Marginal Advantage Accumulation for Memory-Driven Agent Self-Evolution

From Efficiency to Leakage -- Privacy Backdoor in Federated Language Model Fine-Tuning