arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1709
专题追踪
2606.10813 2026-06-15 cs.CR cs.CL 版本更新

RedAct: Redacting Agent Capability Traces for Procedural Skill Protection

RedAct: 为程序技能保护而编辑智能体能力痕迹

Shuwen Xu, Zhitao He, Yi R. Fung

发表机构 * Hong Kong University of Science and Technology(香港理工大学) University of Chinese Academy of Sciences(中国科学院大学)

AI总结 提出RedAct框架,通过定位保护关键信息、重写痕迹并嵌入行为水印,将技能转移率降至无技能基线以下,同时保留审计证据。

详情
AI中文摘要

用户依赖执行痕迹来观察智能体行为、诊断故障并确保问责。这些痕迹包含丰富的程序细节,包括工具调用、中间决策和错误恢复逻辑。然而,这些细节可能暴露私有的程序技能,使下游方法能够在没有模型权重或技能文件的情况下恢复关键公式、阈值和策略。为了量化这种风险并评估保护措施,我们构建了\textsc{CapTraceBench},一个包含75个专业长时任务和154个跨七个领域精选技能的基准。我们还引入了\textsc{RedAct}(https://github.com/...),一个受保护的痕迹发布框架,该框架定位受保护的关键信息,重写痕迹同时保留验证者关键证据,并嵌入行为水印用于下游溯源分析。在代表性的痕迹重用方法中,\textsc{RedAct}将归一化技能转移(NST)从原始痕迹的44.7--67.1%降至低于无技能基线,同时保留审计证据。其独立的行为水印实现了93.6--100.0%的真实检测率,误报率最多为1.9%。这些结果将公共智能体痕迹视为安全接口,并表明选择性编辑可以在不删除审计证据的情况下减少程序能力泄露。

英文摘要

Users rely on execution traces to observe agent behavior, diagnose failures, and ensure accountability. These traces contain rich procedural detail, including tool invocations, intermediate decisions, and error-recovery logic. Yet this detail can expose private procedural skills, allowing downstream methods to recover key formulas, thresholds, and strategies without access to model weights or skill files. To quantify this risk and evaluate protection, we construct CapTraceBench, a benchmark of 75 specialized long-horizon tasks and 154 curated skills across seven domains. We also introduce RedAct, a protected trace release framework that localizes protected key information, rewrites traces while preserving verifier-critical evidence, and embeds behavioral watermarks for downstream provenance analysis. Across representative trace reuse methods, RedAct reduces normalized skill transfer (NST) from 44.7-67.1% on raw traces to below the no-skill baseline, while preserving audit evidence. Its standalone behavioral watermarks reach 93.6-100.0% true detection with a false alarm rate of at most 1.9%. These results frame public agent traces as security interfaces and show that selective redaction can reduce procedural capability leakage without removing audit evidence.

2606.02995 2026-06-15 cs.CR cs.AI cs.IR cs.LG 版本更新

Patcher: Post-Hoc Patching of Backdoored Large Language Models

Patcher: 后门大型语言模型的事后修补

Anjun Gao, Yueyang Quan, Yufei Xia, Zhuqing Liu, Minghong Fang

发表机构 * University of Louisville(路易斯维尔大学) University of North Texas(北得克萨斯大学)

AI总结 提出Patcher框架,仅利用单个失败案例和模型参数,通过基于梯度的显著性定位后门触发器,并采用约束微调消除触发-响应关联,同时保持模型效用。

Comments To appear in the USENIX Security Symposium, 2026

详情
AI中文摘要

大型语言模型仍然容易受到越狱后门攻击,其中对手污染安全对齐数据以嵌入隐藏触发器,从而绕过安全机制。现有防御通常需要全面的攻击信息或多个触发示例,使得当防御者仅观察到单个报告失败案例而不知道其源于后门攻击还是自然对齐错误时,这些防御不切实际。本文提出Patcher,一个事后防御框架,仅使用单个报告失败案例和模型参数来修复后门语言模型。Patcher分两个阶段运行。首先,通过计算基于响应的梯度显著性分数并应用自适应聚类将触发器与良性上下文分离来定位后门触发器。其次,通过约束微调目标修补模型,该目标打破触发-响应关联,同时通过KL散度约束保持良性任务效用和对非触发越狱攻击的鲁棒性。我们在多种后门攻击策略下进行了广泛评估,并证明Patcher成功定位触发器并中和后门,同时保持模型效用。我们进一步展示了针对旨在规避我们防御的自适应攻击的鲁棒性。这项工作代表了向部署语言模型中训练时攻击的实际防御迈出的重要一步。

英文摘要

Large language models remain vulnerable to jailbreak backdoor attacks, where adversaries poison safety alignment data to embed hidden triggers that bypass safety mechanisms. Existing defenses often require comprehensive attack information or multiple triggered examples, making them impractical when defenders only observe a single reported failure case without knowing whether it stems from a backdoor attack or a natural alignment bug. This paper presents Patcher, a post-hoc defense framework that repairs backdoored language models using only a single reported failure case and the model parameters. Patcher operates in two stages. First, it localizes backdoor triggers by computing response-conditioned gradient-based saliency scores and applying adaptive clustering to separate triggers from benign context. Second, it patches the model through a constrained fine-tuning objective that breaks the trigger-response association while preserving benign-task utility and robustness to non-triggered jailbreak attacks through KL-divergence constraints. We conduct extensive evaluations across multiple backdoor attack strategies and demonstrate that Patcher successfully localizes triggers and neutralizes backdoors while maintaining model utility. We further show robustness against adaptive attacks designed to evade our defense. This work represents a significant step toward practical defenses against training-time attacks in deployed language models.

2606.02231 2026-06-15 stat.ML cs.LG stat.ME 版本更新

Identifiable Markov Switching Models with Instantaneous Effects and Exponential Families

具有瞬时效应和指数族的可识别马尔可夫切换模型

Roel Hulsman, Carles Balsells-Rodas, Sara Magliacane

发表机构 * University of Amsterdam(阿姆斯特丹大学)

AI总结 针对非平稳时间序列,提出在指数族噪声下具有瞬时效应的马尔可夫切换模型的可识别性理论,并开发FlowMSM框架用于检测隐状态和恢复因果结构。

Comments International Conference on Machine Learning (ICML) 2026

详情
AI中文摘要

时间系统通常表现出非平稳行为,例如季节性气候变化或1型糖尿病患者的血糖波动。对非平稳性建模的一种方法是通过离散隐状态,即时间的平稳片段。此类系统诱导出马尔可夫切换模型(MSM),这是一类隐马尔可夫模型,其中隐状态和观测变量之间存在自回归依赖关系。在存在频繁状态切换以及非线性和非高斯动态的情况下,特别是在变量之间存在瞬时效应(例如由于测量速率较慢)时,识别隐状态具有挑战性。在这项工作中,我们建立了在时间状态依赖、非线性滞后和瞬时效应以及来自指数族的独立噪声下,隐状态和状态依赖因果结构的可识别性。我们的可识别性理论涵盖了因果模型的非时间混合。此外,我们引入了FlowMSM,这是一个状态检测框架,可与任何平稳因果发现方法配对,以恢复状态依赖的因果结构。在合成基准和金融经济学数据集上的实验证明了我们的方法在检测隐状态和从非平稳时间序列中发现因果结构方面的有效性。

英文摘要

Temporal systems often exhibit non-stationary behaviour, such as seasonal climate variation or glucose fluctuations in patients with type-1 diabetes. One way to model non-stationarity is through discrete latent regimes, i.e., stationary segments of time. Such systems induce a Markov Switching Model (MSM), a class of Hidden Markov Models with autoregressive dependencies among latent regimes and observed variables. Identifying latent regimes is challenging in the presence of frequent regime switches and nonlinear and non-Gaussian dynamics, particularly when there are instantaneous effects between the variables, e.g., due to slow rates of measurements. In this work, we establish the identifiability of both latent regimes and regime-dependent causal structures under temporal regime dependencies, nonlinear lagged and instantaneous effects, and independent noise from the exponential family. Our identifiability theory subsumes non-temporal mixtures of causal models. Furthermore, we introduce FlowMSM, a regime detection framework that can be paired with any stationary causal discovery method to recover regime-dependent causal structures. Experiments on synthetic benchmarks and a financial economics dataset demonstrate the effectiveness of our approach to detect latent regimes and discover causal structures from non-stationary time series.

2605.24795 2026-06-15 math.OC cs.LG cs.RO cs.SY eess.SY 版本更新

Lifted Schrödinger Bridges for Gaussian Mixture Endpoints: Projection Gaps and Path-Space Obstructions

提升的Schrödinger桥用于高斯混合端点:投影间隙与路径空间障碍

Siddhartha Ganguly, George Rapakoulias, Panagiotis Tsiotras

发表机构 * Daniel Guggenheim School of Aerospace, Georgia Institute of Technology(丹尼尔·加金吉姆航空航天学院,佐治亚理工学院)

AI总结 针对高斯混合端点分布下的随机密度控制问题,提出一种提升路径空间构造,将问题分解为高斯分量间的显式Schrödinger桥与有限维熵耦合,并分析投影后的标签信息间隙及路径空间障碍。

Comments 35 pages. Submitted to a journal; comments are welcome

详情
AI中文摘要

我们研究了布朗先验动力学下高斯混合端点分布之间的随机密度控制。由于高斯混合之间的直接Schrödinger桥通常没有闭式解,我们引入了一种提升路径空间构造,其中每条轨迹都增加了一个源-目标分量标签。因此,问题分解为具有显式边际、漂移和成本公式的高斯分量间Schrödinger桥,而混合级分配简化为具有Sinkhorn缩放形式的有限维熵耦合问题。然后,我们分析了通过丢弃或遗忘标签得到的投影。通过构造,投影律满足原始高斯混合端点约束,但其相对熵通常与提升相对熵相差一个非负的条件标签信息间隙。这个间隙揭示了一个路径空间障碍:提升优化器在投影后通常不能等同于直接的无标签Schrödinger桥。我们还推导了与投影边际流相关的后验平均马尔可夫漂移,证明了动能上界,并识别了一个公共路径势条件,在该条件下投影间隙消失。为了自包含的阐述,记录了几个显示密度和形状控制的数值示例。

英文摘要

We study stochastic density control between Gaussian-mixture endpoint distributions under Brownian prior dynamics. Since the direct Schrödinger bridge between Gaussian mixtures is generally not available in closed form, we introduce a lifted path-space construction in which each trajectory is augmented with a source--target component label. Consequently, the problem decomposes into Gaussian component-to-component Schrödinger bridges with explicit marginal, drift, and cost formulas, while the mixture-level assignment reduces to a finite-dimensional entropic coupling problem with a Sinkhorn scaling form. We then analyze the projection obtained by discarding or forgetting the label. By construction, the projected law satisfies the original Gaussian-mixture endpoint constraints, but its relative entropy generally differs from the lifted relative entropy by a nonnegative conditional label-information gap. This gap reveals a path-space obstruction: the lifted optimizer cannot, in general, be identified with the direct unlabeled Schrödinger bridge after projection. We also derive the posterior-averaged Markov drift associated with the projected marginal flow, prove a kinetic-energy upper bound, and identify a common path-potential condition under which the projection gap vanishes. Several numerical illustrations showing density and shape control are recorded for a self-contained exposition.

2605.24609 2026-06-15 physics.med-ph cs.AI cs.CV 版本更新

Catching magnetic resonance imaging outliers in artificial intelligence-supported radiotherapy workflows: unsupervised detection and localization of image anomalies using deep learning

捕捉MRI异常:使用深度学习无监督检测和定位MRI伪影及临床异常

Mustafa Kadhim, Viktor Rogowski, Emilia Persson, Camila Gonzalez, André Haraldsson, Sofie Ceberg, Mikael Nilsson, Malin Kügele, Sven Bäck, Christian Jamtheim Gustafsson

发表机构 * Physics and Imaging in Radiation Oncology (phiRO)(物理与放射治疗成像(phiRO))

AI总结 提出一种两阶段无监督异常检测框架,通过离散令牌压缩和令牌惊奇度评分,在盆腔和脑部MRI上实现高精度异常检测与定位,支持放疗工作流自动化质量控制。

Comments This paper has been submitted to Physics and Imaging in Radiation Oncology (phiRO)

详情
AI中文摘要

人工智能越来越多地集成到放射治疗工作流程中,然而此类流程仍然容易受到分布外图像数据的影响,这些数据可能在临床任务中引入意外行为。基于深度学习的盆腔磁共振成像(MRI)异常检测在很大程度上仍未探索,对其全自动化可行性的透明评估有限。我们开发并评估了一个完全自动化的、无监督的盆腔和脑部MRI异常检测框架。一个两阶段框架在来自公共数据集的参考图像上训练:盆腔MRI使用LUND-PROBE,脑部MRI使用IXI、fastMRI和fastMRI+。在第一阶段,MRI切片被压缩成离散令牌;在第二阶段,对正常令牌的分布进行建模。通过结合感知图像差异和基于负对数似然的令牌惊奇度评分来估计异常证据。在具有合成全局异常和真实临床异常的盆腔MRI上,以及具有临床注释的fastMRI+异常的脑部MRI上,评估了自动检测。评估了敏感性、特异性、受试者工作特征曲线下面积(AUC)以及在保留的正常病例中的假阳性行为。该框架在隐藏评估队列中实现了稳健的检测,盆腔和脑部MRI的AUC分别为0.97(95% CI, 0.95-0.98)和0.81(95% CI, 0.74-0.87)。热图分析显示检测到的异常与真实位置之间具有很强的空间一致性,支持定位准确性和可解释性。这些结果支持无监督异常检测作为放射治疗工作流程中自动化MRI质量控制层的潜力,并透明地可视化可能危及下游基于AI任务的图像区域。

英文摘要

Artificial intelligence is increasingly integrated into radiotherapy workflows, yet such pipelines remain vulnerable to out-of-distribution image data that may introduce unexpected behavior in clinical tasks. Deep learning-based anomaly detection for pelvic magnetic resonance imaging (MRI) remains largely unexplored, and transparent evaluation of its feasibility for full automation is limited. We developed and evaluated a fully automated, unsupervised anomaly-detection framework for pelvic and brain MRI. A two-stage framework was trained on reference images from public datasets: LUND-PROBE for pelvic MRI, and IXI, fastMRI, and fastMRI+ for brain MRI. In the first stage, MRI slices were compressed into discrete tokens; in the second, the distribution of normal tokens was modeled. Anomaly evidence was estimated by combining perceptual image differences with token-surprisal scores based on negative log-likelihood. Automated detection was evaluated on pelvic MRI with synthetic global and real clinical anomalies, and on brain MRI with clinically annotated fastMRI+ abnormalities. Sensitivity, specificity, area under the receiver operating characteristic curve (AUC), and false-positive behavior in held-out normal cases were assessed. The framework achieved robust detection across hidden evaluation cohorts, with AUCs of 0.97 (95% CI, 0.95-0.98) and 0.81 (95% CI, 0.74-0.87) for pelvic and brain MRI, respectively. Heatmap analysis showed strong spatial agreement between detected anomalies and ground-truth locations, supporting localization accuracy and interpretability. These results support the potential of unsupervised anomaly detection as an automated MRI quality-control layer for radiotherapy workflows, with transparent visualization of image regions likely to compromise downstream AI-based tasks.

2605.18784 2026-06-15 q-fin.RM cs.AI cs.CR cs.CY econ.GN q-fin.EC 版本更新

The Insurability Frontier of AI Risk: Mapping Threats to Affirmative Coverage, Silent Exposures, and Exclusions

AI风险的可保险边界:将威胁映射到积极保险、沉默暴露和排除

Alex Leung, Rex Zhang, Ervin Ling, Kentaroh Toyoda, SiewMei Loh

发表机构 * Munich Re(慕尼黑再保险) Armilla Tokio Marine Kiln(东京海上日赤保险) CFC Apollo ibott Coalition

AI总结 本文研究了AI风险在商业保险中的可保险性边界,通过分析55类AI威胁与26种保险产品和排除制度,揭示了四个层次的可保险性前沿:积极保险的风险、沉默AI暴露、主动排除的风险以及传统私人保险结构之外的风险。

Comments Version 2

详情
AI中文摘要

代理AI的快速扩散为商业保险创造了一个新的覆盖问题:一些AI中介的损失现在被积极保险,一些在传统网络安全、技术错误与遗漏(E&O)、董事与高管(D&O)、雇佣实践责任(EPLI)、犯罪和媒体政策下产生沉默AI暴露,而其他则被积极排除。本文通过编码55类AI威胁与26种保险产品、保证和排除制度,利用公开承运商材料和OWASP/MITRE威胁目录,确定了四个层次的可保险性前沿:积极保险的风险、沉默AI暴露、主动排除的风险以及传统私人保险结构之外的风险。我们的编码测量公开声明的定位,而非执行合同的措辞;头条统计数据描述承运商公开声明的覆盖情况,而非任何具体索赔将支付什么。三个模式显现。首先,积极AI覆盖开始通过主要风险重点进行区分:公开材料通常将慕尼黑再保险定位在模型性能和漂移,Armilla和 Lloyd's 市场部分围绕幻觉和更广泛的AI责任,Tokio Marine Kiln和CFC围绕知识产权和技术E&O关注,Apollo ibott围绕新兴自主系统责任,Coalition围绕深度伪造和AI增强的网络安全响应。其次,传统业务线在AI作为工具而非损失法律原因的情况下保留沉默AI暴露。第三,基础模型集中是清晰的真正新型可保险性前沿,因为上游模型失败可以一次关联多个被保险人损失;相关市场设计问题是每个候选结构放松了哪些可保险性约束,而不是仅仅存在哪种系统性风险模板。

英文摘要

The rapid diffusion of agentic AI has created a new coverage problem for commercial insurance: some AI-mediated losses are now affirmatively insured, some create silent-AI exposure under legacy cyber, technology errors-and-omissions (E&O), directors-and-officers (D&O), employment practices liability (EPLI), crime, and media policies, and others are being actively excluded. This paper maps that emerging boundary by coding 55 AI threat classes against 26 insurance products, endorsements, and exclusion regimes using public carrier materials and OWASP/MITRE threat catalogs. We identify a four-tier insurability frontier: affirmatively insured perils, silent-AI exposures, actively excluded perils, and perils outside conventional private insurance structures. Our coding measures publicly claimed positioning rather than executed contract wording; the headline statistics describe what carriers publicly state about coverage, not what would be paid in any specific claim. Three patterns emerge. First, affirmative AI coverage is beginning to differentiate by primary risk emphasis: public materials often position Munich Re around model performance and drift, Armilla and parts of the Lloyd's market around hallucination and broader AI liability, Tokio Marine Kiln and CFC around IP and technology E&O concerns, Apollo ibott around emerging autonomous system liability, and Coalition around deepfake and AI-enabled cyber response. Second, legacy lines retain silent-AI exposure where AI is an instrumentality rather than the legal cause of loss. Third, foundation model concentration is the clearest genuinely novel insurability frontier because upstream model failure can correlate losses across many cedents at once; the relevant market design question is which insurability constraint each candidate structure relaxes, not merely which systemic risk template exists.

2511.08639 2026-06-15 cs.CY cs.AI cs.DL 版本更新

The Journal of Prompt-Engineered (Moral) Philosophy Or: Why AI-Assisted Ethics Research Requires Process Transparency

提示工程(道德)哲学杂志或:为何AI辅助伦理研究需要过程透明

Michele Loi

发表机构 * University of Milan(米兰大学)

AI总结 本文探讨了AI辅助伦理研究中过程透明的必要性,提出透明义务应基于主体完整性,通过五个透明元素构建文档充分性框架,以实现未来规范性判断的可能。

Comments 21 pages Transparency material documenting LLM usage available at: https://github.com/MicheleLoi/JPEP/tree/main/transparency/Canonical_MD

详情
AI中文摘要

现有学术中的AI披露要求要求报告AI辅助,但未明确透明的哲学含义:它们固定了义务,但未解释该义务服务于什么。我们主张伦理探究在两个独立层面本质上存在争议——关于它是什么,以及它对探究者的要求是什么,从而推翻了仅输出评估和福利经济对透明问题的忽视,并由此推翻了从实证科学引入的可重复性框架。透明义务基于主体完整性:在探究社区之前,作者哲学表达所构成身份承诺的可理解性。由于评估此类工作的标准未被共同确定,透明性的可实现目标不是根据 agreed criteria 评估,而是追踪——积累证据记录,使每个传统能按自身术语评估工作,并使未来规范性判断成为可能。我们开发了一个文档充分性框架,通过五个透明元素——声明、导航、文档账户、过程文档和开发记录——来操作化有意义的人类控制,本文本身展示了该框架,其完整的文档记录已存档在持久标识符中。该框架是第一版,有待修订,而非确定标准。

英文摘要

Existing AI disclosure mandates in scholarship require that AI assistance be reported but leave transparency philosophically unspecified: they fix the duty without explaining what the duty serves. We argue that ethical inquiry is essentially contested at two independent levels -- about what it is, and about what it demands of the inquirer -- defeating output-only evaluation and welfare-economic dismissal of the transparency question, and, by extension, reproducibility framings imported from the empirical sciences. The transparency duty is grounded instead in agent-integrity: the legibility, before a community of inquiry, of the identity-constituting commitments that the author's mode of philosophising expresses. Because the standards for evaluating such work are not communally settled, the achievable goal for transparency is not evaluation against agreed criteria but tracking -- accumulating the evidentiary record that lets each tradition assess the work on its own terms and makes future normative judgments possible. We develop a documentation-adequacy framework that operationalises Meaningful Human Control through five transparency elements -- declaration, navigation, documentation account, process documentation, and development records -- demonstrated by the paper itself, whose full documentation record is archived at a persistent identifier. The framework is a first iteration subject to revision, not a settled standard.

2604.23336 2026-06-15 cs.IR cs.CL cs.LG 版本更新

Efficient Rationale-based Retrieval: On-policy Distillation from Generative Rerankers based on JEPA

高效基于理由的检索:基于JEPA的生成重排序器的在线蒸馏

Teng Chen, Sheng Xu, Feixiang Guo, Xiaoyu Wang, Qingqing Gu, Hongyan Li, Luo Ji

发表机构 * Geely AI Lab(吉利人工智能实验室)

AI总结 本文提出Rabtriever,通过在线蒸馏从生成重排序器中学习,将查询和文档独立编码,提升检索效率,同时在多个任务中表现优异。

Comments 11 pages, 8 figures. ICMR 2026 (https://youtu.be/apDcrzEVwq4)

详情
AI中文摘要

不同于传统基于事实的检索,基于理由的检索通常需要使用大语言模型对查询-文档对进行跨编码,造成显著的计算成本。为解决这一限制,我们提出了Rabtriever,它独立编码查询和文档,同时提供与重排序器相当的跨查询-文档理解能力。我们从训练一个基于LLM的生成重排序器开始,该重排序器将文档置于查询之前,并提示LLM通过对数概率生成相关性分数。然后将其作为在线蒸馏框架的教师,Rabtriever作为学生重建教师的上下文感知查询嵌入。为此,Rabtriever首先从教师中初始化,参数冻结。然后采用联合嵌入预测架构(JEPA)范式,该范式在LLM层和头部之间集成一个轻量级、可训练的预测器,将查询嵌入投影到新的隐藏空间,文档嵌入作为潜在向量。JEPA然后最小化此投影嵌入与教师嵌入的分布差异。为了增强在线蒸馏的采样效率,我们还添加了对LLM日志几率的反向KL的辅助损失,以重塑学生的日志几率分布。Rabtriever将教师在文档长度上的二次复杂度优化为线性,经理论和实验证实。实验表明,Rabtriever在多种基于理由的任务中优于不同的检索器基线,包括共情对话和机器人操作,且从重排序器中仅有微小的准确率下降。Rabtriever在传统检索基准如MS MARCO和BEIR上也表现良好,性能与最佳检索器基线相当。

英文摘要

Unlike traditional fact-based retrieval, rationale-based retrieval typically necessitates cross-encoding of query-document pairs using large language models, incurring substantial computational costs. To address this limitation, we propose Rabtriever, which independently encodes queries and documents, while providing comparable cross query-document comprehension capabilities to rerankers. We start from training a LLM-based generative reranker, which puts the document prior to the query and prompts the LLM to generate the relevance score by log probabilities. We then employ it as the teacher of an on-policy distillation framework, with Rabtriever as the student to reconstruct the teacher's contextual-aware query embedding. To achieve this effect, Rabtriever is first initialized from the teacher, with parameters frozen. The Joint-Embedding Predictive Architecture (JEPA) paradigm is then adopted, which integrates a lightweight, trainable predictor between LLM layers and heads, projecting the query embedding into a new hidden space, with the document embedding as the latent vector. JEPA then minimizes the distribution difference between this projected embedding and the teacher embedding. To strengthen the sampling efficiency of on-policy distillation, we also add an auxiliary loss on the reverse KL of LLM logits, to reshape the student's logit distribution. Rabtriever optimizes the teacher's quadratic complexity on the document length to linear, verified both theoretically and empirically. Experiments show that Rabtriever outperforms different retriever baselines across diverse rationale-based tasks, including empathetic conversations and robotic manipulations, with minor accuracy degradation from the reranker. Rabtriever also generalizes well on traditional retrieval benchmarks such as MS MARCO and BEIR, with comparable performance to the best retriever baseline.

2509.06697 2026-06-15 econ.EM cs.LG stat.AP stat.ML 版本更新

Neural ARFIMA model for forecasting BRIC exchange rates with long memory

具有长期记忆的神经ARFIMA模型用于预测BRIC汇率

Donia Besher, Madhurima Panja, Shovon Sengupta, Tanujit Chakraborty

发表机构 * SAFIR, Sorbonne University Abu Dhabi(SAFIR,索邦大学阿布扎赫德分校) Sorbonne Center for Artificial Intelligence, Sorbonne University(索邦人工智能中心,索邦大学)

AI总结 本文提出神经ARFIMA模型,结合ARFIMA的长期记忆结构和神经网络非线性能力,以提高BRIC汇率预测精度。

详情
AI中文摘要

准确预测汇率仍是一个持续挑战,特别是对于新兴经济体如巴西、俄罗斯、印度和中国(BRIC)。这些序列表现出长期记忆和非线性,传统时间序列模型难以捕捉。汇率动态还受全球经济政策不确定性、美国股市波动性、美国货币政策不确定性、油价增长率和短期利率等因素影响。本文提出神经自回归分数积分移动平均(NARFIMA)模型,结合ARFIMA的长期记忆结构和神经网络的非线性学习能力,并纳入外生变量。我们建立了NARFIMA的渐近平稳性,并利用符合预测区间量化预测不确定性。实证结果表明,NARFIMA在预测BRIC汇率方面始终优于基准方法。

英文摘要

Exchange rate forecasting remains a challenging problem, particularly for emerging economies, where the observed time series exhibit pronounced long-memory dependence, nonlinear dynamics, and sensitivity to macro-financial drivers. Classical models such as ARFIMA capture long-range persistence but fail to adequately represent nonlinear relationships, while modern machine learning approaches often neglect the underlying long-memory structure in macroeconomic series. To address this gap, we propose a Neural AutoRegressive Fractionally Integrated Moving Average (NARFIMA) model that integrates ARFIMA-based long-memory modeling with neural networks for nonlinear function approximation, while incorporating exogenous macroeconomic and uncertainty indicators. The framework provides a unified approach for capturing persistence, nonlinear dynamics, and external shocks. We establish asymptotic stationarity of the NARFIMA process and develop conformal prediction intervals for distribution-free uncertainty quantification. Empirical results for BRIC exchange rates show that NARFIMA consistently outperforms a broad range of forecasting benchmarks across multiple horizons, underscoring the importance of explicitly modeling long-memory dependence in exchange rate dynamics. The `narfima' R package provides an implementation of our approach.

2512.02318 2026-06-15 cs.CR cs.AI 版本更新

COGNITION: From Evaluation to Defense against Multimodal LLM CAPTCHA Solvers

认知:从评估到对抗多模态大语言模型CAPTCHA求解器

Junyu Wang, Changjia Zhu, Yuanbo Zhou, Lingyao Li, Xu He, Mingkui Wei, Junjie Xiong

发表机构 * Missouri University of Science and Technology(密苏里科技大学) University of South Florida(佛罗里达州立大学) Visa Inc.(Visa公司) George Mason University(乔治·马歇尔大学)

AI总结 本文研究多模态大语言模型如何削弱视觉CAPTCHA的安全性,评估7种主流MLLM在18种CAPTCHA任务中的表现,揭示其解决能力及防御策略。

Comments Accepted by USENIX Sec'26

详情
AI中文摘要

本文研究多模态大语言模型(MLLMs)如何削弱视觉CAPTCHA的安全性。我们识别出攻击面,评估7种主流商业和开源MLLM在18种真实CAPTCHA任务中的性能,测量单次准确率、有限重试下的成功率、端到端延迟和每解成本。进一步分析任务特定提示工程和少样本演示对求解器效果的影响。我们发现MLLMs能以人类成本和延迟可靠解决识别导向和低交互CAPTCHA任务,而需要细粒度定位、多步骤空间推理或跨帧一致性任务对当前模型仍显著困难。通过分析此类MLLM的推理轨迹,我们探讨模型在特定CAPTCHA谜题中成功或失败的机制,并利用这些见解推导出防御导向的CAPTCHA任务选择和强化指南。通过案例研究验证这些原则,我们通过我们的指南加固一个易受攻击的CAPTCHA类型。我们证明,加入细粒度定位和隐含计数将最先进的MLLM的成功率从超过95%降低到0%,确认结构变化可以有效缓解威胁。最后讨论平台运营商在滥用缓解流程中部署CAPTCHA的含义。

英文摘要

This paper studies how multimodal large language models (MLLMs) undermine the security guarantees of visual CAPTCHA. We identify the attack surface where an adversary can cheaply automate CAPTCHA solving using off-the-shelf models. We evaluate 7 representative MLLMs on 18 real-world CAPTCHA task types, measuring single-shot accuracy, success under limited retries, end-to-end latency, and per-solve cost. We further validate our findings through a supplemental external dataset and an adaptive-attacker setting with session memory, while also analyzing the impact of task-specific prompt engineering and few-shot demonstrations on solver effectiveness. We reveal that MLLMs can reliably solve recognition-oriented and low-interaction CAPTCHA tasks at human-like cost and latency, whereas tasks requiring fine-grained localization, multi-step spatial reasoning, or cross-frame consistency remain significantly harder for current models. By examining the reasoning traces of such MLLMs, we investigate the underlying mechanisms of why models succeed/fail on specific CAPTCHA puzzles and use these insights to derive defense-oriented guidelines for selecting and strengthening CAPTCHA tasks. To validate these principles, we present a proof-of-concept by hardening a vulnerable CAPTCHA type using our guidelines. We demonstrate that incorporating fine-grained localization and implicit counting reduces the success rate of state-of-the-art MLLMs from over 95\% to 0\%, confirming that structural changes can effectively mitigate the threat. We conclude by emphasizing the urgent need for CAPTCHA redesign as MLLM capabilities increasingly threaten existing defenses. Code Availability (https://doi.org/10.5281/zenodo.20406852).

2505.11577 2026-06-15 cs.CY cs.AI 版本更新

The Accountability Paradox: How Platform API Restrictions Undermine AI Transparency Mandates

问责悖论:平台API限制如何削弱AI透明度要求

Florian A. D. Burnat, Brittany I. Davidson

发表机构 * University of Bath(巴斯大学)

AI总结 本文研究平台API限制与欧盟数字服务法案之间的矛盾,提出审计框架揭示平台内容审核和算法放大不可验证的盲区,指出AI依赖与问责限制的悖论,建议采用联邦访问模型和加强监管执行。

Comments Accepted at ACM Conference on Fairness, Accountability, and Transparency (FAccT '26)

详情
AI中文摘要

近期主要社交媒体平台对应用程序编程接口(API)的限制挑战了遵守欧盟数字服务法案[20]的要求,该法案要求数据访问以实现算法透明度。我们开发了一个结构化的审计框架来评估监管要求与平台实施之间的日益增长的不一致。我们对X/Twitter、Reddit、TikTok和Meta的比较分析识别出关键的『审计盲区』,其中平台内容审核和算法放大仍然无法被独立验证。我们的发现揭示了『问责悖论』:随着平台越来越多地依赖AI系统,它们同时限制了独立监督的能力。我们建议与国家标准技术研究院[80]的AI风险管理框架相一致的有针对性的政策干预,强调联邦访问模型和增强的监管执行。

英文摘要

Recent application programming interface (API) restrictions on major social media platforms challenge compliance with the EU Digital Services Act [20], which mandates data access for algorithmic transparency. We develop a structured audit framework to assess the growing misalignment between regulatory requirements and platform implementations. Our comparative analysis of X/Twitter, Reddit, TikTok, and Meta identifies critical ``audit blind-spots'' where platform content moderation and algorithmic amplification remain inaccessible to independent verification. Our findings reveal an ``accountability paradox'': as platforms increasingly rely on AI systems, they simultaneously restrict the capacity for independent oversight. We propose targeted policy interventions aligned with the AI Risk Management Framework of the National Institute of Standards and Technology [80], emphasizing federated access models and enhanced regulatory enforcement.

2604.20462 2026-06-15 cs.SE cs.CL cs.IR 版本更新

Deja Vu at Scale: Paraphrase-Robust Detection of Duplicate Gherkin Steps in Behaviour-Driven Software Testing with Sentence-Transformer Embeddings and a 1.1M-Step Open Benchmark

大规模既视感:基于Sentence-Transformer嵌入和行为驱动软件测试中重复Gherkin步骤的释义鲁棒检测与110万步骤开放基准

Ali Hassaan Mughal, Noor Fatima, Muhammad Bilal

发表机构 * work(工作) seecs.edu.pk(SEECs大学) tum.de(图腾大学)

AI总结 针对BDD测试中Gherkin步骤重复问题,提出结合精确哈希、归一化Levenshtein、句子Transformer余弦和Levenshtein带混合的四种策略检测器,并构建跨组织语料库和标注基准,F1达0.906。

Comments 28 pages, 2 figures, 4 tables. Submitted to Information and Software Technology (Elsevier). Tool, corpus, labelled benchmark, and rubric released at https://github.com/amughalbscs16/cukereuse-release under Apache-2.0

详情
AI中文摘要

背景。行为驱动开发(BDD)套件中的Gherkin步骤文本重复累积,带来已知的维护成本。现有检测器要么需要可运行测试,要么是单组织的,存在空白:一个静态的、对释义鲁棒的步骤级检测器,以及一个用于校准的公共基准。目标。我们发布了(i)迄今为止最大的跨组织BDD步骤语料库,(ii)一个标注的对级校准基准,以及(iii)一个四策略检测器,附带一个将聚类与ISO/IEC 25010可维护性子特征关联的合并节省模型。方法。语料库包含347个公共GitHub仓库、23,667个.feature文件和1,113,616个Gherkin步骤,带有SPDX标签。检测器分层使用精确哈希、归一化Levenshtein、句子Transformer余弦以及Levenshtein带混合。校准使用1,020个手动标注的步骤对,依据发布的规则(60对重叠,Fleiss kappa = 0.84)。我们报告了在主规则下和免评分重新标注下的精确率、召回率和F1,附带bootstrap 95%置信区间,并与SourcererCC风格和NiCad风格的词汇基线进行比较。结果。步骤加权精确重复率为80.2%;中位数仓库重复率为58.6%(Spearman rho = 0.51)。顶级混合聚类在2,245个文件中出现20,737次。近似精确在免评分标签上达到F1 = 0.822;语义在主规则下F1 = 0.906,反映了已披露的分层伪影。词汇基线达到F1 = 0.761和0.799。节省模型估计整个语料库可消除893,357个步骤出现;在中位数仓库中,62.5%的步骤行可消除。

英文摘要

Context. Behaviour-Driven Development (BDD) suites in Gherkin accumulate step-text duplication with documented maintenance cost. Prior detectors either require runnable tests or are single-organisation, leaving a gap: a static, paraphrase-robust, step-level detector and a public benchmark to calibrate it. Objective. We release (i) the largest cross-organisational BDD step corpus to date, (ii) a labelled pair-level calibration benchmark, and (iii) a four-strategy detector with a consolidation-savings model linking clusters to ISO/IEC 25010 maintainability sub-characteristics. Method. The corpus contains 347 public GitHub repositories, 23,667 .feature files, and 1,113,616 Gherkin steps, SPDX-tagged. The detector layers exact hashing, normalised Levenshtein, sentence-transformer cosine, and a Levenshtein-banded hybrid. Calibration uses 1,020 manually labelled step pairs under a released rubric (60-pair overlap, Fleiss kappa = 0.84). We report precision, recall, and F1 with bootstrap 95% CIs under the primary rubric and a score-free relabelling, and benchmark against SourcererCC-style and NiCad-style lexical baselines. Results. Step-weighted exact-duplicate rate is 80.2%; median-repository rate is 58.6% (Spearman rho = 0.51). The top hybrid cluster has 20,737 occurrences across 2,245 files. Near-exact reaches F1 = 0.822 on score-free labels; semantic F1 = 0.906 under the primary rubric reflects a disclosed stratification artefact. Lexical baselines reach F1 = 0.761 and 0.799. The savings model estimates 893,357 corpus-wide eliminable step occurrences; on the median repository 62.5% of step lines are eliminable.

2405.03063 2026-06-15 math.ST cs.IT cs.LG math.IT stat.ME stat.ML stat.TH 版本更新

Stability of a Generalized Debiased Lasso with Applications to Resampling-Based Variable Selection

广义去偏Lasso的稳定性及其在基于重抽样的变量选择中的应用

Jingbo Liu

发表机构 * Department of Statistics, University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校统计系) Department of Electrical and Computer Engineering, the Grainger College of Engineering(格拉inger工程学院电子与计算机工程系)

AI总结 提出基于稳定性原理的广义去偏Lasso估计量,通过设计矩阵单列扰动下的简单更新公式,在比例增长机制下实现渐近精确近似,显著降低重抽样变量选择的计算成本。

Comments to appear in Bernoulli

详情
AI中文摘要

我们提出了一种基于稳定性原理的广义去偏Lasso估计量。当设计矩阵的单列被扰动时,该估计量允许一个简单的更新公式,可以从原始解计算得出。在具有良好条件协方差的次高斯设计下,这种近似在比例增长机制下对于除消失比例坐标外的所有坐标是渐近精确的。证明依赖于集中和反集中论证来控制误差项和符号变化。相比之下,在类似假设下建立可比较的分布极限(例如高斯性)仍然是开放的。作为一个应用,我们表明该近似显著降低了基于重抽样的变量选择过程的计算成本,包括条件随机化测试和局部knockoff滤波器。

英文摘要

We propose a generalized debiased Lasso estimator based on a stability principle. When a single column of the design matrix is perturbed, the estimator admits a simple update formula that can be computed from the original solution. Under sub-Gaussian designs with well-conditioned covariance, this approximation is asymptotically accurate for all but a vanishing fraction of coordinates in the proportional growth regime. The proof relies on concentration and anti-concentration arguments to control error terms and sign changes. In contrast, establishing comparable distributional limits (e.g., Gaussianity) under similar assumptions remains open. As an application, we show that the approximation significantly reduces the computational cost of resampling-based variable selection procedures, including the conditional randomization test and a local knockoff filter.

2604.07530 2026-06-15 cs.DL cs.AI cs.CY cs.SI 版本更新

The Shrinking Lifespan of LLMs in Science

科学领域中LLM的生命周期缩短

Ana Trišović

发表机构 * Computer Science & Artificial Intelligence Laboratory(计算机科学与人工智能实验室) Massachusetts Institute of Technology(麻省理工学院)

AI总结 本研究通过分析62个LLM在超过10万篇引用论文中的科学采纳轨迹,发现模型的生命周期主要由发布年份决定,且每个后续发布年份的峰值时间和寿命分别缩短27%和23%。

详情
AI中文摘要

缩放定律描述了语言模型能力如何随计算和数据增长,但未说明模型发布后能持续多久。我们引入峰值时间和寿命作为模型过时的度量,并利用它们刻画62个LLM在超过10万篇引用论文(2019-2025年)中的科学采纳轨迹,将主动采纳与背景引用分离,以恢复引用计数无法解析的每个模型轨迹。我们发现,模型的寿命更多地由其发布时间而非特征决定:发布年份比架构、开放性或规模更能预测峰值时间和寿命。LLM的采纳遵循倒U型曲线(发布后上升、达到峰值然后下降),但这种模式正在迅速压缩。每个后续发布年份与峰值时间缩短27%和寿命缩短23%相关(p < 0.001),这一结果对最小年龄阈值和模型规模控制具有稳健性。这些采纳侧动态对缩放定律不可见,表明专注于任何单一模型可能是一项贬值的投资,其成本落在可重复性和迁移上。

英文摘要

Scaling laws describe how language model capabilities grow with compute and data, but say nothing about how long a model matters once released. We introduce time-to-peak and lifespan as measures of model obsolescence and use them to characterize the scientific adoption trajectories of 62 LLMs across more than 108k citing papers (2019-2025), separating active adoption from background citation to recover per-model trajectories that citation counts cannot resolve. We find that a model's longevity is shaped more by when it was released than by its characteristics: release year predicts time-to-peak and lifespan more strongly than architecture, openness, or scale. LLM adoption follows an inverted-U curve (rising after release, peaking, and then declining), but this pattern is rapidly compressing. Each successive release year is associated with a 27% shorter time-to-peak and a 23% shorter lifespan ($p < 0.001$), robust to minimum-age thresholds and controls for model size. These adoption-side dynamics are invisible to scaling laws and suggest that specialization on any single model may be a depreciating investment, with costs falling on reproducibility and migration.

2603.24596 2026-06-15 eess.AS cs.AI cs.CL 版本更新

X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs

X-OPD:面向语音大语言模型能力对齐的跨模态在策略蒸馏

Di Cao, Dongjie Fu, Hai Yu, Siqi Zheng, Xu Tan, Tao Jin

发表机构 * Tencent Hunyuan(腾讯文心) Zhejiang University(浙江大学)

AI总结 提出X-OPD框架,通过跨模态在策略蒸馏对齐语音LLM与文本LLM的能力,利用文本教师模型评估语音模型的轨迹并提供令牌级反馈,显著缩小复杂任务性能差距。

Comments Accepted by Interspeech 2026

详情
AI中文摘要

虽然从级联对话系统转向端到端(E2E)语音大语言模型(LLMs)改善了延迟和副语言建模,但E2E模型通常表现出与其文本对应模型相比显著的性能下降。标准的监督微调(SFT)和强化学习(RL)训练方法无法弥合这一差距。为了解决这个问题,我们提出了X-OPD,一种新颖的跨模态在策略蒸馏框架,旨在系统地将语音LLM的能力与其文本对应模型对齐。X-OPD通过在线策略展开使语音LLM探索其自身分布,其中基于文本的教师模型评估这些轨迹并提供令牌级反馈,从而有效地将教师的能力蒸馏到学生的多模态表示中。在多个基准上的大量实验表明,X-OPD在保留模型固有能力的同时,显著缩小了复杂任务中的差距。

英文摘要

While the shift from cascaded dialogue systems to end-to-end (E2E) speech Large Language Models (LLMs) improves latency and paralinguistic modeling, E2E models often exhibit a significant performance degradation compared to their text-based counterparts. The standard Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) training methods fail to close this gap. To address this, we propose X-OPD, a novel Cross-Modal On-Policy Distillation framework designed to systematically align the capabilities of Speech LLMs to their text-based counterparts. X-OPD enables the Speech LLM to explore its own distribution via on-policy rollouts, where a text-based teacher model evaluates these trajectories and provides token-level feedback, effectively distilling teacher's capabilities into student's multi-modal representations. Extensive experiments across multiple benchmarks demonstrate that X-OPD significantly narrows the gap in complex tasks while preserving the model's inherent capabilities.

2603.12400 2026-06-15 math.CO cs.CV 版本更新

Generation of Maximal Snake Polyominoes Using a Deep Neural Network

使用深度神经网络生成最大蛇形多联骨牌

Benjamin Gauthier, Alain Goupil, Fadel Toure

发表机构 * Université du Québec à Trois-Rivières(魁北克大学三河分校)

AI总结 提出结构化像素空间扩散模型,从数据驱动学习生成最大蛇形多联骨牌,无需显式编码约束,能泛化到更大矩形并接近当前计算极限。

Comments In Proceedings GASCom 2026, arXiv:2606.09910

Journal ref EPTCS 445, 2026, pp. 104-113

详情
AI中文摘要

最大蛇形多联骨牌在大矩形中难以数值研究,因为计算它们需要对特定矩形大小的所有蛇形进行完全枚举,这相当于暴力算法。这阻碍了对更大矩形中最大蛇形的研究。此外,大多数可枚举的蛇形位于小矩形中,掩盖了大尺度模式。在本文中,我们研究了深度神经网络在基于数据驱动的训练中生成最大蛇形多联骨牌的贡献,其中最大性和邻接约束不是显式编码的,而是学习的。为此,我们实验了一种去噪扩散模型,我们称之为结构化像素空间扩散(SPS Diffusion)。我们发现SPS Diffusion从小矩形泛化到大矩形,生成有效的蛇形直至28x28方格,并在接近当前计算极限的方格上产生最大蛇形候选。然而,该模型容易出错,例如分支、循环或多个蛇形组件。总体而言,扩散模型是有前景的,表明深度神经网络可以理解复杂的组合对象,这对其研究是有用的。

英文摘要

Maximal snake polyominoes are difficult to study numerically in large rectangles, as computing them requires the complete enumeration of all snakes for a specific rectangle size, which corresponds to a brute force algorithm. This hinders the study of maximal snakes in larger rectangles. Moreover, most enumerable snakes lie in small rectangles, obscuring large-scale patterns. In this paper, we investigate the contribution of a deep neural network to the generation of maximal snake polyominoes from a data-driven training, where the maximality and adjacency constraints are not encoded explicitly, but learned. To this extent, we experiment with a denoising diffusion model, which we referred as Structured Pixel Space Diffusion (SPS Diffusion). We find that SPS Diffusion generalizes from small rectangles to larger ones, generating valid snakes up to 28x28 squares and producing maximal snake candidates on squares close to the current computational limit. The model is, however, prone to errors such as branching, cycles, or multiple snake components. Overall, the diffusion model is promising and suggests that complex combinatorial objects can be understood by deep neural networks, which is useful in their investigation.

2601.04646 2026-06-15 cs.IR cs.AI cs.CL cs.LG 版本更新

Succeeding at Scale: Enterprise Retrieval Benchmark Construction and Index-Preserving Query Adaptation for Multi-Tenant Search

规模化成功:面向多租户搜索的企业检索基准构建与索引保持查询适配

Prateek Jain, Shabari S Nair, Ritesh Goru, Prakhar Agarwal, Ajay Yadav, Yoga Sri Varshan Varadharajan, Constantine Caramanis

发表机构 * Prateek Jain Shabari S Nair Ritesh Goru Prakhar Agarwal Ajay Yadav Yoga Sri Varshan Varadharajan Constantine Caramanis

AI总结 针对多租户检索系统中标注数据匮乏和模型更新成本高的问题,提出全自动构建基准DevRev-Search,并研究仅微调查询编码器而保持文档索引不变的索引保持查询适配策略,实现质量与效率的平衡。

详情
AI中文摘要

大规模多租户检索系统生成大量查询日志,但缺乏用于有效领域适应的精心策划的相关性标签,导致大量“暗数据”未被充分利用。模型更新的高成本加剧了这一挑战,因为联合微调查询和文档编码器需要完整的语料库重新索引,这在拥有数千个独立索引的多租户环境中是不切实际的。我们引入了DevRev-Search,这是一个通过完全自动化管道构建的技术客户支持段落检索基准。候选生成使用跨多种稀疏和密集检索器的融合,随后使用LLM作为评判器进行一致性过滤和相关性标记。我们进一步研究并系统评估了索引保持查询适配策略,该策略仅微调查询编码器,同时保持文档索引固定。在DevRev-Search、SciFact和FiQA-2018上的实验表明,参数高效的查询编码器微调提供了显著的质量-效率权衡,实现了可扩展且实用的企业多租户检索。

英文摘要

Large-scale multi-tenant retrieval systems generate extensive query logs but lack curated relevance labels for effective domain adaptation, resulting in substantial underutilized "dark data." This challenge is compounded by the high cost of model updates, as jointly fine-tuning query and document encoders requires full corpus re-indexing, which is impractical in multi-tenant settings with thousands of isolated indices. We introduce DevRev-Search, a passage retrieval benchmark for technical customer support built via a fully automated pipeline. Candidate generation uses fusion across diverse sparse and dense retrievers, followed by an LLM-as-a-Judge for consistency filtering and relevance labeling. We further study and systematically evaluate index-preserving query-only adaptation strategies that fine-tune only the query-encoder while keeping the document indices fixed. Experiments on DevRev-Search, SciFact, and FiQA-2018 show that parameter-efficient fine-tuning of the query encoder delivers a remarkable quality-efficiency trade-off, enabling scalable and practical enterprise multi-tenant retrieval.

2602.16835 2026-06-15 cs.CR cs.LG 版本更新

NeST: Neuron Selective Tuning for LLM Safety

NeST: 面向LLM安全的神经元选择性调优

Sasha Behrouzi, Lichao Wu, Mohamadreza Rostami, Ahmad-Reza Sadeghi

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出NeST框架,通过激活探测识别安全相关前馈神经元并训练共享簇级更新,仅用普通恶意提示即可泛化防御多种越狱攻击,在14个模型上以极少参数实现接近全微调的鲁棒性。

详情
AI中文摘要

安全对齐对于大型语言模型(LLM)的负责任部署至关重要。然而,现有方法通常依赖于重量级的微调,这在跨模型家族更新、审计和维护时成本高昂。全微调会产生大量的计算和存储开销,而参数高效方法(如低秩适应LoRA)则牺牲效率换取不一致的安全增益和对设计选择的敏感性。安全干预机制在不修改模型权重的情况下减少不安全输出,但无法直接塑造或保留控制安全行为的内部表示。我们提出NeST,一种用于高效事后安全对齐的神经元选择性调优框架。NeST通过对普通有害和无害提示进行激活探测来识别安全相关的前馈神经元,聚类具有相似激活模式的神经元,并训练共享的簇级更新,同时冻结模型的其余部分。重要的是,NeST仅使用普通恶意提示进行训练,不使用越狱特定的攻击数据,但能稳健地泛化到多种越狱攻击。学习到的更新随后被折叠到原始权重中,不产生推理时开销。在14个开源权重语言和多模态模型上的评估表明,NeST优于轻量级基线,并以显著更少的可训练参数接近全微调的鲁棒性。在纯文本模型上,NeST将平均越狱攻击成功率从44.5%降至1.1%,平均仅训练0.4M参数。在多模态设置中,它将ASR从55.3%降至1.1%,对于下游微调变体,通过将ASR从53.8%降至0.8%来恢复安全性。这些结果表明,通过将适应集中在局部、功能连贯的安全结构上,可以实现鲁棒、可维护的安全对齐。

英文摘要

Safety alignment is essential for the responsible deployment of Large Language Models (LLMs). Yet, existing approaches often rely on heavyweight fine-tuning that is costly to update, audit, and maintain across model families. Full fine-tuning incurs substantial computational and storage overhead, while parameter-efficient methods, e.g., Low-Rank Adaptation (LoRA), trade efficiency for inconsistent safety gains and sensitivity to design choices. Safety intervention mechanisms reduce unsafe outputs without modifying model weights, but do not directly shape or preserve the internal representations that govern safety behavior. We present NeST, a Neuron-Selective Tuning framework for efficient post-hoc safety alignment. NeST identifies safety-relevant feed-forward neurons via activation probing on vanilla harmful and benign prompts, clusters neurons with similar activation profiles, and trains shared cluster-level updates while freezing the rest of the model. Importantly, NeST is trained only on vanilla malicious prompts, without using jailbreak-specific attack data, yet generalizes robustly to diverse jailbreaks. The learned updates are then folded into the original weights, incurring no inference-time overhead. Evaluated on 14 open-weight language and multimodal models, NeST outperforms lightweight baselines and approaches full fine-tuning robustness with significantly fewer trainable parameters. On text-only models, NeST reduces average jailbreak attack success rate from 44.5% to 1.1% while training only 0.4M parameters on average. Across multimodal settings, it reduces ASR from 55.3% to 1.1%, and for downstream fine-tuned variants, it restores safety by reducing ASR from 53.8% to 0.8%. These results show that robust, maintainable safety alignment can be achieved by concentrating adaptation on localized, functionally coherent safety structures.

2602.06142 2026-06-15 cs.PL cs.AI cs.CL cs.LG cs.PF 版本更新

Protean Compiler: An Agile Framework to Drive Fine-grain Phase Ordering

Protean Compiler: 一种驱动细粒度阶段排序的敏捷框架

Amir H. Ashouri, Shayan Shirahmad Gale Bagi, Kavin Satheeskumar, Tejas Srikanth, Jonathan Zhao, Ibrahim Saidoun, Ziwen Wang, Bryan Chan, Tomasz S. Czajkowski

发表机构 * Huawei Technologies Canada(华为技术加拿大)

AI总结 提出Protean Compiler框架,在LLVM中内置细粒度阶段排序能力,通过140多种静态特征收集方法和机器学习优化,平均加速4.1%,最高15.7%。

Comments Version 3: Preprint version of the accepted work at ACM TACO 2026

详情
AI中文摘要

阶段排序问题自20世纪70年代末以来一直是一个长期挑战,但由于其优化空间巨大且具有无界性,至今仍是一个开放问题,没有有限解。传统上,这种局部优化决策由手工编码的算法针对少量基准测试进行调整,当基准测试套件变化时,通常需要大量精力重新调整。过去20年中,机器学习被用于构建性能模型以改进编译器优化的选择和排序,但这些方法并未无缝集成到编译器中,也从未在细粒度的代码段范围内实现。本文提出Protean Compiler:一种敏捷框架,使LLVM在细粒度范围内具备内置的阶段排序能力。该框架还包含一个完整的库,包含140多种在不同范围内手工设计的静态特征收集方法,实验结果表明,相对于LLVM的O3,在Cbench应用程序上仅需增加几秒构建时间,平均加速可达4.1%,最高可达15.7%。此外,Protean编译器易于与第三方ML框架和其他大型语言模型集成,两步优化的两个应用在CBench的Susan和Jpeg应用程序上相对于-O3分别获得10.1%和8.5%的加速。Protean编译器无缝集成到LLVM中,可作为新的、增强的、全功能的编译器使用。我们计划在不久的将来将该项目发布到开源社区。

英文摘要

The phase ordering problem has been a long-standing challenge since the late 1970s, yet it remains an open problem due to having a vast optimization space and an unbounded nature, making it an open-ended problem without a finite solution, one can limit the scope by reducing the number and the length of optimizations. Traditionally, such locally optimized decisions are made by hand-coded algorithms tuned for a small number of benchmarks, often requiring significant effort to be retuned when the benchmark suite changes. In the past 20 years, Machine Learning has been employed to construct performance models to improve the selection and ordering of compiler optimizations, however, the approaches are not baked into the compiler seamlessly and never materialized to be leveraged at a fine-grained scope of code segments. This paper presents Protean Compiler: An agile framework to enable LLVM with built-in phase-ordering capabilities at a fine-grained scope. The framework also comprises a complete library of more than 140 handcrafted static feature collection methods at varying scopes, and the experimental results showcase speedup gains of up to 4.1% on average and up to 15.7% on select Cbench applications wrt LLVM's O3 by just incurring a few extra seconds of build time on Cbench. Additionally, Protean compiler allows for an easy integration with third-party ML frameworks and other Large Language Models, and two applications of this two-step optimization show a gain of 10.1\% and 8.5\% speedup w.r.t. -O3 on CBench's Susan and Jpeg applications. Protean compiler is seamlessly integrated into LLVM and can be used as a new, enhanced, full-fledged compiler. We plan to release the project to the open-source community in the near future.

2602.13421 2026-06-15 stat.ML cs.AI q-bio.NC 版本更新

Metabolic cost of information processing in Poisson variational autoencoders

泊松变分自编码器中信息处理的代谢成本

Hadi Vafaii, Jacob L. Yates

发表机构 * Redwood Center for Theoretical Neuroscience(理论神经科学红木中心) UC Berkeley(伯克利大学)

AI总结 通过泊松变分自编码器,发现KL散度项与先验发放率成正比,产生代谢成本项,从而在编码保真度和能量消耗之间实现权衡。

Comments Published in CCN 2026 Proceedings: https://doi.org/10.32470/6ff31r0

详情
AI中文摘要

生物系统中的计算从根本上受到能量约束,但标准的计算理论将能量视为自由可用。在这里,我们认为在泊松假设下的变分自由能最小化为能量感知的计算理论提供了一条有原则的路径。我们的关键观察是,泊松自由能目标中的Kullback-Leibler(KL)散度项与模型神经元的先验发放率成正比,产生了一个惩罚高基线活动的涌现代谢成本项。这种结构将抽象的信息论量——*编码率*——与具体的生物物理变量——*发放率*——耦合起来,从而能够在编码保真度和能量消耗之间进行权衡。这种耦合自然地出现在泊松变分自编码器(P-VAE)中——一种受大脑启发的生成模型,它将输入编码为离散的尖峰计数,并作为特例恢复出尖峰形式的*稀疏编码*——但在标准高斯VAE中不存在。为了证明这种代谢成本结构是泊松公式所独有的,我们将P-VAE与Grelu-VAE(一种对潜在样本应用ReLU整流的高斯VAE,用于控制非负约束)进行比较。通过对KL项权重系数$\eta$和潜在维度的系统扫描,我们发现增加$\eta$会单调地增加P-VAE中的稀疏性并降低平均尖峰活动。相比之下,Grelu-VAE的表示保持不变,证实了该效应是泊松统计所特有的,而非非负表示的副产品。这些结果确立了泊松变分推理作为资源受限计算理论的一个有前景的基础。

英文摘要

Computation in biological systems is fundamentally energy-constrained, yet standard theories of computation treat energy as freely available. Here, we argue that variational free energy minimization under a Poisson assumption offers a principled path toward an energy-aware theory of computation. Our key observation is that the Kullback-Leibler (KL) divergence term in the Poisson free energy objective becomes proportional to the prior firing rates of model neurons, yielding an emergent metabolic cost term that penalizes high baseline activity. This structure couples an abstract information-theoretic quantity -- the *coding rate* -- to a concrete biophysical variable -- the *firing rate* -- which enables a trade-off between coding fidelity and energy expenditure. Such a coupling arises naturally in the Poisson variational autoencoder (P-VAE) -- a brain-inspired generative model that encodes inputs as discrete spike counts and recovers a spiking form of *sparse coding* as a special case -- but is absent from standard Gaussian VAEs. To demonstrate that this metabolic cost structure is unique to the Poisson formulation, we compare the P-VAE against Grelu-VAE, a Gaussian VAE with ReLU rectification applied to latent samples, which controls for the non-negativity constraint. Across a systematic sweep of the KL term weighting coefficient $β$ and latent dimensionality, we find that increasing $β$ monotonically increases sparsity and reduces average spiking activity in the P-VAE. In contrast, Grelu-VAE representations remain unchanged, confirming that the effect is specific to Poisson statistics rather than a byproduct of non-negative representations. These results establish Poisson variational inference as a promising foundation for a resource-constrained theory of computation.

2602.09161 2026-06-15 stat.ML cs.LG 版本更新

Minimum Distance Summaries for Robust Neural Posterior Estimation

最小距离摘要用于鲁棒神经后验估计

Sherman Khoo, Dennis Prangle, Song Liu, Mark Beaumont

发表机构 * University of Cambridge(剑桥大学)

AI总结 提出最小距离摘要方法,通过最大均值差异(MMD)在测试时自适应调整摘要统计量,在不修改预训练神经后验估计器的情况下实现鲁棒推断,理论保证鲁棒性并实验验证。

详情
AI中文摘要

基于模拟的推断(SBI)通过首先在先验-模拟器对上训练神经后验估计器(NPE),通常使用低维摘要统计量,实现摊销贝叶斯推断,然后可以在新测试观测上查询以廉价地重复用于快速推断。由于NPE是在训练数据分布下估计的,当观测偏离训练分布时,它容易受到误指定的影响。许多鲁棒SBI方法通过修改NPE训练或引入误差模型来解决这个问题,将鲁棒性与推断网络耦合,损害了摊销和模块化。我们引入了最小距离摘要,一种即插即用的鲁棒NPE方法,独立于预训练NPE自适应调整测试时的摘要统计量。利用最大均值差异(MMD)作为观测数据与摘要条件预测分布之间的距离,自适应摘要从MMD继承了强鲁棒性属性。我们证明该算法可以通过随机傅里叶特征近似高效实现,产生轻量级、无模型的测试时自适应过程。我们为算法的鲁棒性提供了理论保证,并在各种合成和真实世界任务上进行了实证评估,表明在最小额外开销下实现了显著的鲁棒性提升。

英文摘要

Simulation-based inference (SBI) enables amortized Bayesian inference by first training a neural posterior estimator (NPE) on prior-simulator pairs, typically through low-dimensional summary statistics, which can then be cheaply reused for fast inference by querying it on new test observations. Because NPE is estimated under the training data distribution, it is susceptible to misspecification when observations deviate from the training distribution. Many robust SBI approaches address this by modifying NPE training or introducing error models, coupling robustness to the inference network and compromising amortization and modularity. We introduce minimum-distance summaries, a plug-in robust NPE method that adapts queried test-time summaries independently of the pretrained NPE. Leveraging the maximum mean discrepancy (MMD) as a distance between observed data and a summary-conditional predictive distribution, the adapted summary inherits strong robustness properties from the MMD. We demonstrate that the algorithm can be implemented efficiently with random Fourier feature approximations, yielding a lightweight, model-free test-time adaptation procedure. We provide theoretical guarantees for the robustness of our algorithm and empirically evaluate it on a range of synthetic and real-world tasks, demonstrating substantial robustness gains with minimal additional overhead.

2602.05413 2026-06-15 cs.IR cs.CL 版本更新

SciDef: Datasets and Tools for Automated Definition Extraction from Scientific Literature with LLMs

SciDef:基于LLM的科学文献自动定义提取数据集与工具

Filip Kučera, Christoph Mandl, Isao Echizen, Radu Timofte, Timo Spinde

发表机构 * National Institute of Informatics (NII)(国立信息研究所) University of Würzburg(乌尔姆大学) University of Passau(帕萨乌大学) University of Würzburg (JMU)(乌尔姆大学)

AI总结 提出SciDef资源套件,包含人工验证的定义基准DefExtra、相似度判断DefSim及基于LLM的提取流程,通过16个语言模型评估,发现NLI匹配指标与人类判断高度一致,但相关性过滤仍是自动提取的关键瓶颈。

Comments Under Review - Submitted to CIKM 2026 Resources Track;

详情
AI中文摘要

科学概念在不同论文中常被不一致地定义,使得比较发现、复用术语和构建可靠的下游资源变得困难。我们提出SciDef,一个用于科学定义提取的资源套件。该套件包含DefExtra,一个包含来自75篇学术论文的268个人工验证的作者定义基准;DefSim,60个人工标注的定义对相似度判断;以及一个基于LLM的开放流程,用于PDF预处理、分块、定义提取、提示优化和评估。我们通过跨提示策略和分块方案对16个语言模型进行基准测试来验证资源。最强集合级配置得分为0.397,而最高覆盖配置至少匹配86.4%的金标准定义,但过度生成了候选定义。我们进一步表明,基于NLI的匹配指标与人类DefSim判断高度一致。这些结果将SciDef定位为以定义为中心的文献分析的可复用基准和工具层,同时突出相关性感知过滤作为全自动定义提取的关键瓶颈。代码和数据集可在https://this URL获取。

英文摘要

Scientific concepts are often defined inconsistently across papers, making it difficult to compare findings, reuse terminology, and build reliable downstream resources. We present SciDef, a resource suite for scientific definition extraction. The suite contains DefExtra, a benchmark of 268 human-validated author-stated definitions from 75 academic papers; DefSim, 60 human-labeled definition-pair similarity judgments; and an open LLM-based pipeline for PDF preprocessing, chunking, definition extraction, prompt optimization, and evaluation. We validate the resources by benchmarking 16 language models across prompting strategies and chunking schemes. The strongest set-level configuration achieves a score of 0.397, while the highest-coverage configuration matches at least one prediction to 86.4% of gold definitions but over-generates candidate definitions. We further show that an NLI-based matching metric agrees strongly with human DefSim judgments. These results position SciDef as a reusable benchmark and tooling layer for definition-centric literature analysis, while highlighting relevance-aware filtering as the key bottleneck for fully automatic definition extraction. Code & datasets are available at https://github.com/Media-Bias-Group/SciDef.

2505.17961 2026-06-15 stat.ME cs.AI math.ST stat.AP stat.TH 版本更新

Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation

基于倾向得分聚合的多中心观测数据联邦因果推断

Rémi Khellaf, Aurélien Bellet, Julie Josse

发表机构 * University of Technology, CNRS, France(法国技术大学、国家科学研究中心)

AI总结 提出通过联邦学习聚合各站点倾向得分,利用成员权重估计平均处理效应,解决多中心观测数据因隐私限制无法集中的因果推断问题。

详情
AI中文摘要

因果推断通常假设可以集中访问个体层面数据。然而,在实践中,数据往往分散在多个站点,由于隐私、后勤或法律限制,集中化不可行。我们通过联邦学习方法从分散的观测数据中估计平均处理效应来解决这个问题,允许通过交换聚合统计量而非个体层面数据进行推断。我们提出了一种新方法,使用成员权重(定义为给定协变量条件下站点成员的概率)通过联邦加权平均局部得分来估计倾向得分。成员权重可以使用标准联邦学习算法通过参数或非参数分类模型灵活估计。得到的倾向得分用于构建联邦逆概率加权和增强逆概率加权估计量。与元分析方法(当任何站点违反积极性时失败)相比,我们的方法利用跨站点处理分配的异质性来改善重叠。我们表明,在站点层面的样本量、处理机制和协变量分布异质性下,联邦逆概率加权和增强逆概率加权表现良好。理论分析以及在模拟和真实数据上的实验证明了相对于元分析及相关方法的明显优势。

英文摘要

Causal inference typically assumes centralized access to individual-level data. Yet, in practice, data are often decentralized across multiple sites, making centralization infeasible due to privacy, logistical, or legal constraints. We address this problem by estimating the Average Treatment Effect (ATE) from decentralized observational data via a Federated Learning (FL) approach, allowing inference through the exchange of aggregate statistics rather than individual-level data. We propose a novel method to estimate propensity scores via a federated weighted average of local scores using Membership Weights (MW), defined as probabilities of site membership conditional on covariates. MW can be flexibly estimated with parametric or non-parametric classification models using standard FL algorithms. The resulting propensity scores are used to construct Federated Inverse Propensity Weighting (Fed-IPW) and Augmented IPW (Fed-AIPW) estimators. In contrast to meta-analysis methods, which fail when any site violates positivity, our approach exploits heterogeneity in treatment assignment across sites to improve overlap. We show that Fed-IPW and Fed-AIPW perform well under site-level heterogeneity in sample sizes, treatment mechanisms, and covariate distributions. Theoretical analysis and experiments on simulated and real-world data demonstrate clear advantages over meta-analysis and related approaches.

2512.18021 2026-06-15 quant-ph cs.ET cs.LG 版本更新

Shuttling Compiler for Trapped-Ion Quantum Computers Based on Large Language Models

基于大型语言模型的离子阱量子计算机穿梭编译器

Fabian Kreppel, Reza Salkhordeh, Ferdinand Schmidt-Kaler, André Brinkmann

发表机构 * Institute of Computer Science, Johannes Gutenberg University(计算机科学研究所,约翰内斯·古特堡大学) Institute of Physics, Johannes Gutenberg University(物理研究所,约翰内斯·古特堡大学) Department of Computer Science, Saarland University(计算机科学系,萨尔兰大学)

AI总结 提出首个基于大语言模型的离子阱量子计算机穿梭编译器,通过微调预训练模型生成有效调度,减少穿梭开销达15%。

Comments 18 pages, 6 figures, 2 tables

详情
AI中文摘要

我们提出了首个基于大型语言模型(LLMs)的离子阱量子计算机穿梭编译器,其中量子比特在段之间穿梭以进行门执行和量子比特存储。我们在线性和分支一维穿梭架构的示例上微调预训练LLMs。因此,我们获得了一种与布局无关的编译策略,直接从数据中学习所需的穿梭操作。使用多达16个量子比特的基准电路,这些微调后的LLMs现在可以为穿梭架构生成有效的调度。值得注意的是,我们还为以前未见过的四路交叉布局获得了有效调度。这表明训练后的LLMs可以泛化到训练期间未遇到的布局。对于各种架构,基于LLM的调度改进了最先进的基线编译器结果,将穿梭开销减少了高达15%。

英文摘要

We present the first shuttling compiler based on large language models (LLMs) for trapped-ion quantum computers, where qubits are shuttled between segments for gate execution and qubit storage. We fine-tune pre-trained LLMs on examples from linear and branched one-dimensional shuttling architectures. Thus, we obtain a layout-independent compilation strategy that learns the required shuttling operations directly from data. Using benchmark circuits with up to 16 qubits, such fine-tuned LLMs can now generate valid schedules for shuttling architectures. Notably, we also obtain a valid schedule for a previously unseen four-way junction layout. This demonstrates that trained LLMs can generalize to layouts not encountered during training. For various architectures, LLM-based schedules improve upon state-of-the-art baseline compiler results, reducing the shuttling effort by up to 15%.

2601.11626 2026-06-15 math.NA cs.LG cs.NA 版本更新

Concatenated Matrix SVD: Compression Bounds, Incremental Approximation, and Error-Constrained Clustering

拼接矩阵SVD:压缩界限、增量近似与误差约束聚类

Maksym Shamrai

发表机构 * Institute of Mathematics of NAS of Ukraine(乌克兰国家科学院数学研究所) MacPaw Research(MacPaw研究)

AI总结 针对拼接后截断SVD压缩中哪些矩阵可安全合并的问题,提出基于谱界和增量SVD的聚类框架,实现显式误差约束下的压缩感知矩阵分组。

Comments Published in Transactions on Machine Learning Research (06/2026)

Journal ref Transactions on Machine Learning Research (2026)

详情
AI中文摘要

现代机器学习、信号处理和科学计算中出现了大量矩阵集合,通常通过拼接后截断奇异值分解(SVD)进行压缩。这种策略实现了参数共享和高效重构,已被广泛应用于多视图学习、信号处理到神经网络压缩等领域。然而,它留下了一个基本问题未解答:在显式重构误差约束下,哪些矩阵可以安全地拼接并压缩在一起?现有方法依赖于启发式或特定于架构的分组,并且对所得的SVD近似误差没有提供原则性保证。在本工作中,我们引入了一个理论驱动的框架,用于在SVD压缩约束下进行矩阵的压缩感知聚类。我们的分析建立了水平拼接矩阵的新谱界,从奇异值增长的下界推导出最优秩-$r$ SVD重构误差的全局上界。第一个界遵循Weyl型块扩展下的单调性,而第二个界利用增量残差的奇异值提供更紧的逐块保证。我们进一步开发了一种基于增量截断SVD的高效近似估计器,无需形成完整的拼接矩阵即可跟踪主导奇异值。因此,我们提出了三种聚类算法,仅当预测的联合SVD压缩误差低于用户指定阈值时才合并矩阵。这些算法在速度、可证明准确性和可扩展性之间权衡,实现了具有显式误差控制的压缩感知聚类。

英文摘要

Large collections of matrices arise throughout modern machine learning, signal processing, and scientific computing, where they are commonly compressed by concatenation followed by truncated singular value decomposition (SVD). This strategy enables parameter sharing and efficient reconstruction and has been widely adopted across domains ranging from multi-view learning and signal processing to neural network compression. However, it leaves a fundamental question unanswered: which matrices can be safely concatenated and compressed together under explicit reconstruction error constraints? Existing approaches rely on heuristic or architecture-specific grouping and provide no principled guarantees on the resulting SVD approximation error. In the present work, we introduce a theory-driven framework for compression-aware clustering of matrices under SVD compression constraints. Our analysis establishes new spectral bounds for horizontally concatenated matrices, deriving global upper bounds on the optimal rank-$r$ SVD reconstruction error from lower bounds on singular value growth. The first bound follows from Weyl-type monotonicity under blockwise extensions, while the second leverages singular values of incremental residuals to yield tighter, per-block guarantees. We further develop an efficient approximate estimator based on incremental truncated SVD that tracks dominant singular values without forming the full concatenated matrix. Therefore, we propose three clustering algorithms that merge matrices only when their predicted joint SVD compression error remains below a user-specified threshold. The algorithms span a trade-off between speed, provable accuracy, and scalability, enabling compression-aware clustering with explicit error control.

2512.23847 2026-06-15 q-fin.GN cs.LG q-fin.TR 版本更新

Detecting Lookahead Bias in LLM Forecasts

检测LLM预测中的前瞻偏差

Zhenyu Gao, Wenxi Jiang, Yutong Yan

发表机构 * Department of Finance, CUHK Business School(CUHK商学院金融系)

AI总结 提出统计程序检测大语言模型经济预测中的前瞻偏差,通过日期回忆查询估计前瞻倾向(LAP),并验证LAP与预测交互项在精度回归中的显著性,应用于新闻标题和财报电话会议预测任务。

详情
AI中文摘要

我们开发了一种统计程序,用于检测大语言模型(LLM)生成的经济预测中的前瞻偏差。通过对公司-日期对进行仅日期回忆查询,我们估计LLM已内化已实现结果信息的概率,这一统计量称为前瞻倾向(LAP)。LAP在整个样本期内显著为正,并在训练数据截止点后几乎降至零。我们表明,在精度回归中,LAP与LLM预测之间的正向交互表明存在前瞻偏差污染,并将该测试应用于两个预测任务:预测股票收益的新闻标题和预测资本支出的财报电话会议记录。在两个应用中,LLM预测的预测能力在高LAP的公司-日期对上被放大,而交互项在训练截止后的样本上失去显著性。我们的测试为评估LLM生成预测的有效性和可靠性提供了一种经济高效的诊断工具。

英文摘要

We develop a statistical procedure to detect lookahead bias in economic forecasts generated by large language models (LLMs). Using a date-only recall query for a firm-date pair, we estimate the probability that the LLM has internalized information about the realized outcome, a statistic we term Lookahead Propensity (LAP). LAP is materially positive throughout the in-sample period and collapses essentially to zero right after the training-data cutoff. We show that a positive interaction between LAP and the LLM forecast in an accuracy regression indicates lookahead-bias contamination, and apply the test to two forecasting tasks: news headlines predicting stock returns and earnings call transcripts predicting capital expenditures. In both applications, the LLM forecast's predictive power is amplified on high-LAP firm-date pairs, and the interaction loses significance on post-training-cutoff samples. Our test provides a cost-efficient, diagnostic tool for assessing the validity and reliability of LLM-generated forecasts.

2512.15947 2026-06-15 eess.IV cs.CV 版本更新

MCR-VQGAN: A Scalable and Cost-Effective Tau PET Synthesis Approach for Alzheimer's Disease Imaging

MCR-VQGAN:一种用于阿尔茨海默病成像的可扩展且经济高效的Tau PET合成方法

Jin Young Kim, Jeremy Hudson, Jeongchul Kim, Qing Lyu, Christopher T. Whitlow

发表机构 * Department of Biomedical Engineering, Wake Forest University School of Medicine(生物医学工程系,威克森林大学医学院) Department of Radiology, Wake Forest School of Medicine(放射学系,威克森林医学院) Department of Radiology and Biomedical Imaging, Yale School of Medicine(放射学与生物医学成像系,耶鲁医学院)

AI总结 提出MCR-VQGAN模型,通过多尺度卷积、ResNet块和CBAM模块改进VQGAN,从T1加权MRI合成高保真tau PET图像,在ADNI数据集上优于cGAN等方法,且合成图像保留诊断特征,区域SUVR等效分析显示与真实图像高度一致。

Comments Accepted for publication in IEEE Access. 14 pages, 5 figures, 8 tables

详情
AI中文摘要

Tau正电子发射断层扫描(PET)是阿尔茨海默病(AD)的关键诊断方式,但其广泛临床采用受到辐射暴露、可用性有限、高临床工作量和巨大财务成本的阻碍。为解决这些限制,我们提出了多尺度CBAM残差向量量化生成对抗网络(MCR-VQGAN),从结构T1加权MRI合成高保真tau PET图像。MCR-VQGAN通过三项增强改进了标准VQGAN架构:多尺度卷积、ResNet块和卷积块注意力模块(CBAM),这些共同改善了对局部和全局特征的捕获。使用来自ADNI数据库的222对T1加权MRI和tau PET扫描,我们训练并比较了MCR-VQGAN与cGAN、WGAN-GP、CycleGAN和基线VQGAN。MCR-VQGAN在所有指标上均实现了优越的图像合成性能(MSE = 0.0056 +/- 0.0061,PSNR = 30.65 +/- 4.47 dB,SSIM = 0.9263 +/- 0.0469)。在真实tau PET上训练的基于CNN的AD分类器在真实(63.64%)和合成(65.91%)图像上达到了相当的准确率,表明诊断相关特征得以保留。跨Braak定义ROI的区域SUVR等效分析进一步表明真实与合成tau PET之间高度一致(Pearson r = 0.78-0.88;ICC = 0.71-0.84),其中Braak V/VI区域一致性最强(ICC = 0.838)。这些结果共同表明,MCR-VQGAN为传统tau PET成像提供了一种有前景且可扩展的替代方案,可能改善AD研究和临床工作流程中tau生物标志物的可及性。

英文摘要

Tau positron emission tomography (PET) is a critical diagnostic modality for Alzheimer's disease (AD), but its widespread clinical adoption is hindered by radiation exposure, limited availability, high clinical workload, and substantial financial costs. To address these limitations, we propose the Multi-scale CBAM Residual Vector Quantized Generative Adversarial Network (MCR-VQGAN) to synthesize high-fidelity tau PET images from structural T1-weighted MRI. MCR-VQGAN advances the standard VQGAN architecture through three enhancements: multi-scale convolutions, ResNet blocks, and Convolutional Block Attention Modules (CBAM), which collectively improve the capture of local and global features. Using 222 paired T1-weighted MRI and tau PET scans from the ADNI database, we trained and compared MCR-VQGAN against cGAN, WGAN-GP, CycleGAN, and baseline VQGAN. MCR-VQGAN achieved superior image synthesis performance across all metrics (MSE = 0.0056 +/- 0.0061, PSNR = 30.65 +/- 4.47 dB, SSIM = 0.9263 +/- 0.0469). A CNN-based AD classifier trained on real tau PET achieved comparable accuracy on real (63.64%) and synthetic (65.91%) images, indicating that diagnostically relevant features are preserved. Regional SUVR-equivalent analysis across Braak-defined ROIs further indicated strong agreement between real and synthetic tau PET (Pearson r = 0.78-0.88; ICC = 0.71-0.84), with the strongest agreement in Braak V/VI (ICC = 0.838). Together, these results suggest that MCR-VQGAN offers a promising and scalable surrogate for conventional tau PET imaging, potentially improving the accessibility of tau biomarkers for AD research and clinical workflows.

2402.17750 2026-06-15 physics.optics cs.ET cs.LG 版本更新

Arbitrary control over multimode wave propagation for machine learning

用于机器学习的多模波传播的任意控制

Tatsuhiro Onodera, Martin M. Stein, Benjamin A. Ash, Mandar M. Sohoni, Melissa Bosch, Ryotatsu Yanagimoto, Marc Jankowski, Timothy P. McKenna, Tianyu Wang, Gennady Shvets, Maxim R. Shcherbakov, Logan G. Wright, Peter L. McMahon

发表机构 * School of Applied and Engineering Physics, Cornell University(应用与工程物理系,康奈尔大学) NTT Physics and Informatics Laboratories, NTT Research, Inc.(NTT物理与信息实验室,NTT研究公司) E. L. Ginzton Laboratory, Stanford University(E. L. Ginzton实验室,斯坦福大学) Kavli Institute at Cornell for Nanoscale Science, Cornell University(康奈尔大学纳米科学研究所) Department of Electrical and Computer Engineering, Boston University(波士顿大学电气与计算机工程系) Department of Electrical Engineering and Computer Science, University of California(加州大学电气工程与计算机科学系) Department of Applied Physics, Yale University(耶鲁大学应用物理系)

AI总结 提出一种可快速重编程折射率的二维可编程波导,通过并行电光调制实现多模波传播的任意控制,并用于单次神经网络推理,理论表明面积增长为N^1.5而非N^2。

Journal ref Nat. Phys. 22, 164-171 (2026)

详情
AI中文摘要

受控的多模波传播可以实现比基于单模波导连接分立组件的架构更节省空间的光子处理器。我们可以不定义离散元件,而是通过二维多模干涉来塑造光子处理器的连续基底以执行计算。这里我们设计并展示了一种折射率可在空间上快速重编程的器件,允许对波传播进行任意控制。该器件是一种二维可编程波导,利用对平板波导折射率的并行电光调制,具有约10^4个可编程空间自由度。我们在基准任务上实现了单次通过、无需数字预处理或后处理的神经网络推理,向量维度高达49。理论和数值分析进一步表明,二维可编程波导不仅可能提供器件面积的常数因子缩减,还可能带来缩放优势,所需面积按N^{1.5}而非N^2增长。

英文摘要

Controlled multimode wave propagation can enable more space-efficient photonic processors than architectures based on discrete components connected by single-mode waveguides. Instead of defining discrete elements, one can sculpt the continuous substrate of a photonic processor to perform computations through multimode interference in two dimensions. Here we designed and demonstrated a device with a refractive index that can be rapidly reprogrammed across space, allowing arbitrary control of wave propagation. The device, a two-dimensional programmable waveguide, uses parallel electro-optic modulation of the refractive index of a slab waveguide with about $10^4$ programmable spatial degrees of freedom. We implemented neural network inference on benchmark tasks with up to $49$-dimensional vectors in a single pass, without digital pre-processing or post-processing. Theoretical and numerical analyses further indicated that two-dimensional programmable waveguides may offer not only a constant-factor reduction in device area but also a scaling benefit, with the area required growing as $N^{1.5}$ rather than $N^2$.

2511.22246 2026-06-15 hep-ex cs.AI physics.ins-det 版本更新

An interpretable unsupervised representation learning for high precision measurement in particle physics

一种可解释的无监督表示学习用于粒子物理中的高精度测量

Xing-Jian Lv, De-Xing Miao, Zi-Jun Xu, Jian-Chun Wang

发表机构 * Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China(中国科学院高能物理研究所) University of Chinese Academy of Sciences, Beijing 100049, China(中国科学院大学)

AI总结 提出Histogram AutoEncoder(HistoAE),通过自定义直方图损失强制物理结构化的潜在空间,实现可解释的无监督学习,在硅微条探测器数据上达到电荷分辨率0.25e和位置分辨率3μm,媲美传统方法。

Comments 8 pages, 7 figures

详情
AI中文摘要

无监督学习已广泛应用于粒子物理的各种任务。然而,现有模型缺乏对其学习表示的精确控制,限制了物理可解释性,并阻碍了其用于精确测量。我们提出了直方图自编码器(HistoAE),一种无监督表示学习网络,具有自定义的基于直方图的损失函数,强制实现物理结构化的潜在空间。应用于硅微条探测器,HistoAE学习了一个可解释的二维潜在空间,对应于粒子的电荷和撞击位置。经过简单的后处理,它在束流测试数据上实现了$0.25\,e$的电荷分辨率和$3\,\mu\mathrm{m}$的位置分辨率,与传统方法相当。这些结果表明,无监督深度学习模型能够实现物理上有意义且定量精确的测量。此外,HistoAE的生成能力使其能够直接扩展到快速探测器模拟。

英文摘要

Unsupervised learning has been widely applied to various tasks in particle physics. However, existing models lack precise control over their learned representations, limiting physical interpretability and hindering their use for accurate measurements. We propose the Histogram AutoEncoder (HistoAE), an unsupervised representation learning network featuring a custom histogram-based loss that enforces a physically structured latent space. Applied to silicon microstrip detectors, HistoAE learns an interpretable two-dimensional latent space corresponding to the particle's charge and impact position. After simple post-processing, it achieves a charge resolution of $0.25\,e$ and a position resolution of $3\,μ\mathrm{m}$ on beam-test data, comparable to the conventional approach. These results demonstrate that unsupervised deep learning models can enable physically meaningful and quantitatively precise measurements. Moreover, the generative capacity of HistoAE enables straightforward extensions to fast detector simulations.

2509.24710 2026-06-15 stat.ML cs.LG cs.NA math.NA 版本更新

MAD: Manifold Attracted Diffusion

MAD: 流形吸引扩散

Dennis Elbrächter, Giovanni S. Alberti, Matteo Santacesaria

发表机构 * Department of Mathematics, University of Vienna(维也纳大学数学系) MaLGa Center, Department of Mathematics, University of Genoa(热那亚大学数学系MaLGa中心)

AI总结 提出流形吸引扩散方法,利用流形假设通过扩展得分函数在推理阶段去除噪声,生成无噪声样本,在玩具问题、合成数据和真实数据上验证有效性。

Journal ref Forty-third International Conference on Machine Learning, 2026

详情
AI中文摘要

基于得分的扩散模型是从图像分布中生成样本的一种高效方法。我们考虑训练数据来自目标分布的有噪声版本的情况,并提出一种可高效实现的推理过程修改,以生成无噪声样本。我们的方法受流形假设启发,该假设认为有意义的数据集中在高维环境空间的某个低维流形周围。核心思想是,噪声表现为离流形方向上的低幅度变化,而目标分布的相关变化主要限于流形方向。我们引入了扩展得分概念,并表明在简化设置中,它可以将小变化减少为零,同时基本保持大变化不变。我们描述了如何从标准得分的近似中高效计算其近似,并在玩具问题、合成数据和真实数据上展示了其有效性。

英文摘要

Score-based diffusion models are a highly effective method for generating samples from a distribution of images. We consider scenarios where the training data comes from a noisy version of the target distribution, and present an efficiently implementable modification of the inference procedure to generate noiseless samples. Our approach is motivated by the manifold hypothesis, according to which meaningful data is concentrated around some low-dimensional manifold of a high-dimensional ambient space. The central idea is that noise manifests as low magnitude variation in off-manifold directions in contrast to the relevant variation of the desired distribution which is mostly confined to on-manifold directions. We introduce the notion of an extended score and show that, in a simplified setting, it can be used to reduce small variations to zero, while leaving large variations mostly unchanged. We describe how its approximation can be computed efficiently from an approximation to the standard score and demonstrate its efficacy on toy problems, synthetic data, and real data.