arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2051
2606.11911 2026-06-11 stat.ML cs.LG math.AT 新提交

From Persistence to Survival: Hypothesis Testing, Effect Sizes and Vectorisation for Topological Features

从持续性到生存:拓扑特征的假设检验、效应大小与向量化

Juliette Murris, Bernadette Stolz, Karsten Borgwardt

发表机构 * Department of Machine Learning and Systems Biology, Max Planck Institute of Biochemistry, Martinsried, Germany(机器学习与系统生物学部门,马克斯·普朗克生物化学研究所,马尔廷斯里德,德国)

AI总结 提出STRAND方法,将持久性图视为生存数据,利用持久性生存函数统一实现假设检验、效应大小计算和向量化,在合成数据和真实基准上验证了有效性。

详情
AI中文摘要

持久性图是拓扑数据分析中常见的表示形式,但它们并非天然存在于向量空间中,且用于比较它们的统计工具在很大程度上与用于下游预测的工具分开发展。我们引入STRAND(生存拓扑表示图分析),将(集合的)持久性图视为生存数据:每个具有持久性值 $p = d - b$ 的拓扑特征是一个完全观测的事件时间,持久性生存函数 $S(t) = \mathbb{P}(p > t)$ 是比较图的中心对象。从这个单一表示中,我们推导出(i)一个非参数双样本检验,具有校准的第一类错误率和少量图的高功效;(ii)可解释的效应大小;以及(iii)用于下游机器学习的1-Wasserstein稳定特征向量。我们在具有受控拓扑的合成流形上验证了校准和功效,展示了在14个图和3D点云基准上的竞争性向量化,并将该方法应用于fMRI/神经科学数据中的功能性脑连接研究。据我们所知,STRAND是第一个从单一连贯且可解释的表示为持久性图提供假设检验和向量化的方法。

英文摘要

Persistence diagrams are common representations in topological data analysis, but they do not naturally live in a vector space, and the statistical tools developed for comparing them have largely evolved separately from those used for downstream prediction. We introduce STRAND (Survival Topological Representation ANalysis of Diagrams), which treats (collections of) PDs as survival data: each topological feature with persistence value $p = d - b$ is a fully observed time-to-event, and the persistence survival function $S(t) = \mathbb{P}(p > t)$ is the central object for comparing diagrams. From this single representation we derive (i) a non-parametric two-sample test with calibrated Type I error and high power from a small number of diagrams; (ii) interpretable effect sizes; and (iii) a 1-Wasserstein-stable feature vector for downstream machine learning. We validate calibration and power on synthetic manifolds with controlled topology, demonstrate competitive vectorisation across 14 graph and 3D point cloud benchmarks, and apply the method to study functional brain connectivity in fMRI/neuroscience data. To our knowledge, STRAND is the first method to provide hypothesis testing and vectorisation for persistence diagrams from a single coherent and interpretable representation.

2606.11876 2026-06-11 q-bio.QM cs.LG stat.ME 新提交

Seeing Below the Limit of Detection: A Censored-Poisson Bayesian Latent-Growth Change-Point Detector (the Span Detector) for Serial ctDNA in HR+/HER2- Metastatic Breast Cancer

检测限以下:用于HR+/HER2-转移性乳腺癌连续ctDNA的删失泊松贝叶斯潜在增长变点检测器(Span检测器)

Aarchi Singh Thakur, Abhijoy Sarkar

发表机构 * Span AI

AI总结 提出Span检测器,利用删失泊松贝叶斯潜在增长变点模型处理ctDNA非检测作为左删失观测,通过序贯广义似然比统计量检测变异检测率上升点,在10%假警报率下将提前三个月捕获进展的比例从11%提升至25%。

Comments 9 pages, 4 figures, 2 tables. Code and synthetic data generator: https://github.com/span-ai-labs/span-detector

详情
AI中文摘要

循环肿瘤DNA(ctDNA)在影像学显示耐药性数月前就已携带证据,但最早证据存在于检测限(LoD)以下:新生亚克隆仅被间歇性检测到,产生微弱检测和非检测的闪烁序列。商业液体活检将每次抽取视为独立快照,并将非检测视为无信号。我们认为非检测是左删失观测,而随时间变化的非检测和微弱检测模式在单个值可信之前就携带了可操作的生长证据。我们引入Span,一种删失泊松贝叶斯潜在增长变点检测器,它对二元检测过程建模,为每个变异的检测率累积一个向上变点的序贯广义似然比统计量,并以校准的假警报控制发出竞争风险警报。Span没有学习权重,因此没有过拟合风险。在一线CDK4/6抑制剂联合内分泌治疗的HR+/HER2-转移性乳腺癌合成队列中,在匹配的10%假警报率下,Span将提前三个月捕获的即将进展比例大约翻倍(惰性出现:25% vs 快照的11%),具有可证伪的剂量反应:对惰性出现效果显著,对快速出现效果消失。值轨迹基线表现与快照相同,将增益归因于删失检测模型。生存主干在真实乳腺癌数据(GBSG-2,n=686;C指数0.67 vs 0.68)上与Cox基线匹配,在具有清洁生物标志物的真实纵向队列(PBC2,n=312)上,同一管道正确拒绝获胜,这是一个可证伪的边界测试,确认机制是特定于状态的。所有ctDNA轨迹均为合成数据。

英文摘要

Circulating-tumour DNA (ctDNA) carries evidence of drug resistance months before imaging shows it, but the earliest evidence lives below the assay's limit of detection (LoD): a nascent subclone is detected only intermittently, producing a flickering sequence of faint detects and non-detects. Commercial liquid biopsies treat each draw as an independent snapshot and a non-detect as nothing. We argue a non-detect is a left-censored observation, and the pattern of non-detects and faint detects over time carries actionable evidence of growth before any single value is trustworthy. We introduce Span, a censored-Poisson Bayesian latent-growth change-point detector that models the binary detection process, accumulates a sequential generalised-likelihood-ratio statistic for an upward change-point in the per-variant detection rate, and raises a competing-risks alarm with calibrated false-alarm control. Span has no learned weights, so there is nothing to overfit. On a synthetic cohort of HR+/HER2- metastatic breast cancer on first-line CDK4/6-inhibitor plus endocrine therapy, at a matched 10% false-alarm rate, Span roughly doubles the fraction of impending progressions caught three months ahead (indolent regime: 25% vs 11% for the snapshot), with a falsifiable dose-response: large for indolent emergence, vanishing for fast emergence. A value-trajectory baseline performs identically to the snapshot, isolating the gain to the censored detection model. The survival backbone matches a Cox baseline on real breast-cancer data (GBSG-2, n=686; C-index 0.67 vs 0.68), and on a real longitudinal cohort with clean biomarkers (PBC2, n=312) the same pipeline correctly declines to win, a falsifiable boundary test confirming the mechanism is regime-specific. All ctDNA trajectories are synthetic.

2606.11870 2026-06-11 cond-mat.mtrl-sci cs.LG 新提交

Modelling magnetic material properties with uncertainty-aware neural networks

用不确定性感知神经网络建模磁性材料性质

Clemens Wager, Heisam Moustafa, Alexander Kovacs, Qais Ali, Harald Oezelt, Hayate Yamano, Masao Yano, Noritsugu Sakuma, Hyuga Hosoi, Akihito Kinoshita, Tetsuya Shoji, Akira Kato, Thomas Schrefl

发表机构 * University of Cambridge(剑桥大学) ETH Zurich(苏黎世联邦理工学院) University of Tokyo(东京大学) National Institute for Materials Science(国家材料科学研究所) Max Planck Institute for Intelligent Systems(智能系统马克斯·普朗克研究所)

AI总结 针对新材料发现中数据稀缺和分布外预测的不确定性问题,采用高斯负对数似然损失和基于dropout的贝叶斯近似量化预测不确定性,并迁移至微观结构预测矫顽力任务,证明不确定性量化可增强预测可信度且可迁移。

Comments pre print, unreviewed version

详情
AI中文摘要

机器学习越来越多地被应用于通过探索大成分和结构设计空间来加速新材料的发现。然而,高质量数据的稀缺以及频繁的分布外预测需求引入了大量不确定性,使得评估模型可靠性变得至关重要。在这项工作中,我们研究了不确定性量化作为评估永磁体研究背景下模型置信度的一种手段。在第一项研究中,我们基准测试了经典和现代机器学习模型在预测本征磁性方面的性能,重点关注其不确定性估计的质量。我们应用高斯负对数似然损失和基于dropout的贝叶斯近似作为估计预测不确定性的实用策略。在第二项研究中,我们将这些用于不确定性估计的架构特征迁移到一个更复杂的任务:使用图神经网络从微观结构信息预测矫顽力。这些研究共同表明,不确定性量化不仅增强了预测的可信度,而且在不同建模任务之间是可迁移的。

英文摘要

Machine learning is increasingly applied to accelerate the discovery of novel materials by exploring large compositional and structural design spaces. Yet, the scarcity of high-quality data and the frequent need for out-of-distribution prediction introduce substantial uncertainty, making the assessment of model reliability essential. In this work, we investigate uncertainty quantification as a means to evaluate model confidence in the context of permanent magnet research. In a first study, we benchmark classical and modern machine learning models for predicting intrinsic magnetic properties, focusing on the quality of their uncertainty estimates. We apply Gaussian negative log-likelihood loss and dropout-based Bayesian approximation as practical strategies for estimating predictive uncertainty. In a second study, we transfer these architectural features for uncertainty estimation to a more complex task: predicting coercivity from microstructural information using a graph neural network. Together, these studies demonstrate that uncertainty quantification not only enhances the trustworthiness of predictions but is also transferable across different modeling tasks.

2606.11869 2026-06-11 cs.SE cs.AI 新提交

Agents All the Way Down; A Methodology for Building Custom AI Agents from Substrate to Production

层层代理:从底层到生产构建自定义AI代理的方法论

Marc Alier Forment, Juanan Pereira, Francisco José García-Peñalvo, María José Casañ Guerrero

发表机构 * Universitat Politècnica de Catalunya (UPC)(西班牙巴塞罗那理工大学) Universidad del País Vasco / Euskal Herriko Unibertsitatea (UPV/EHU)(西班牙巴斯克大学)

AI总结 提出一种无框架的方法论,通过两个前提条件(将LLM作为软件组件和构建块)和三个实践(原型设计、打包为CLI、代理测试代理)来构建自定义AI代理,实现端到端开发。

详情
AI中文摘要

自定义AI代理是存在于自己应用程序中的代理,它们与自己的数据和工具交互,强制执行自己的安全边界,并携带自己的品牌和审计跟踪。它们与通用层级的区别在于适配性而非能力:每个代理由维护它的工程师为一项工作而构建。目前没有已发布的实践说明如何端到端地构建一个自定义AI代理。各个部分随处可见(函数调用API、模型上下文协议、可配对的代码代理),但将这些部分串联起来的实践存在于播客、博客和泄露的系统提示中。本文将这些实践记录为一种方法论,即“层层代理”:两个前提条件一次交叉并保持,然后三个实践在代理的生命周期中重复。前提条件是(P1)底层:将LLM作为软件组件,框架化为工具、系统,然后在提示缓存下框架化为消息;(P2)构建块:函数调用、MCP、CLI编排、liteshell模式、代理循环、技能、角色、钩子和脚手架。三个实践是(P3)使用通用代理进行原型设计;(P4)收获、折叠并将结果作为CLI发布,即Turtle模式;(P5)代理测试代理,其中通用代理通过行为场景驱动自定义代理,这是对经典测试的补充而非替代。工作循环是P3到P4再到P5并返回,一个推论自然得出:多代理编排就是CLI组合。该方法论在构造上是无框架的。它从AAC中提炼而来,AAC是开源LAMB平台的自定义代理,由一名开发人员使用AI配对程序员在大约十天内构建并投入生产。我们将其作为一种可迁移的实践呈现,独立于任何语言或框架。

英文摘要

Custom AI agents areagents that live inside their own application, talk to their own data and tools, enforce their own security boundaries, and carry their own brand and audit trail. What separates them from the general-purpose tier is fit, not capability: each is built for one job, by the engineer who will maintain it. No published practice sets out how to build one end to end. The pieces are everywhere (function-calling APIs, the Model Context Protocol, code agents to pair with), but the practice that chains them lives in podcasts, blogs, and leaked system prompts. This paper writes that practice down as a methodology, Agents All the Way Down: two preconditions crossed once and kept, then three practices repeated for the agent's life. The preconditions are (P1) Substrate, the LLM as a software component, framed as tools, then system, then messages under prompt-caching; and (P2) Building blocks: function calling, MCP, CLI orchestration, the liteshell pattern, the agent loop, skills, characters, hooks, and scaffolding. The practices are (P3) prototype with a general-purpose agent; (P4) harvest, fold, and ship the result as a CLI, the Turtle pattern; and (P5) agent-tests-agent, in which a general-purpose agent drives it through behavioural scenarios, a complement to classical testing, not a replacement. The working loop is P3 to P4 to P5 and back, and one corollary falls out for free: multi-agent orchestration is just CLI composition. The methodology is framework-free by construction. It was distilled from the AAC, a custom agent for the open-source LAMB platform, built in about ten days by one developer with an AI pair-programmer and in production . We present it as a transferable practice, independent of any language or framework.

2606.11865 2026-06-11 stat.ML cs.LG 新提交

Conformal Bayes under Label Shift: Post-Hoc Calibration vs. In-Training Adaptation

标签偏移下的共形贝叶斯:事后校准与训练内适应

Seungjin Choi

发表机构 * CROID Research and aSSIST University(CROID研究院和aSSIST大学)

AI总结 研究标签偏移下共形贝叶斯方法,通过重要性加权共形校准恢复目标域覆盖,比较事后校准与训练内适应两种策略,后者在偏差训练中起到去偏作用。

Comments 2nd Workshop on Epistemic Intelligence in Machine Learning (EIML@ICML 2026)

详情
AI中文摘要

共形贝叶斯将贝叶斯后验预测与共形校准相结合,产生既统计有效又几何高效的预测集。我们从统一视角研究标签偏移下的共形贝叶斯,识别出两种互补方法,它们通过重要性加权共形校准恢复名义目标域覆盖,但通过独立机制运作。\emph{事后校准}将后验预测向目标域倾斜,并通过重要性加权分位数校正共形阈值,保持参数后验不变。\emph{训练内适应}将参数后验本身向目标域倾斜,产生校正后的预测,其最高预测密度区域作为基于拟合目标预测的最高预测密度(HPD)预测集;效率依赖于模型,并不保证有限样本条件最优性。两个受控实验表明,在无偏训练机制下,两种策略同样实现有效覆盖,而在领先优化机制下,训练内适应作为去偏算子,在覆盖不变的情况下减少区间宽度。

英文摘要

Conformal Bayes combines Bayesian posterior predictives with conformal calibration to produce prediction sets that are both statistically valid and geometrically efficient. We study conformal Bayes under label shift from a unified perspective, identifying two complementary approaches that restore nominal target-domain coverage through importance-weighted conformal calibration but operate through independent mechanisms. \emph{Post-hoc calibration} tilts the posterior predictive toward the target domain and corrects the conformal threshold via an importance-weighted quantile, leaving the parameter posterior unchanged. \emph{In-training adaptation} tilts the parameter posterior itself to the target domain, producing a corrected predictive whose highest predictive density region serves as the highest predictive density (HPD) based prediction set under the fitted target predictive; efficiency is model-dependent and does not imply finite-sample conditional optimality. Two controlled experiments show that in an unbiased training regime both strategies achieve valid coverage equally, while in a lead-optimization regime in-training adaptation acts as a debiasing operator, reducing interval width at unchanged coverage.

2606.11857 2026-06-11 eess.SP cs.LG 新提交

REACH: Interpretability-Driven Feature Identification and Architecture Compression for Multi-Channel Vehicular Channel Estimation

REACH:面向多信道车辆信道估计的可解释性驱动特征识别与架构压缩

Simbarashe Aldrin Ngorima, Albert Helberg, Marelie H. Davel

发表机构 * Faculty of Engineering, North-West University, South Africa(南非北开普大学工程学院) Centre for Artificial Intelligence Research, South Africa(南非人工智能研究中心) National Institute for Theoretical and Computational Sciences, South Africa(南非理论与计算科学国家研究所)

AI总结 提出REACH框架,通过梯度归因识别关键时频特征并压缩网络,在IEEE 802.11p信道估计中实现参数和计算量大幅降低,且OOD泛化性能下降缓慢。

Comments 22 pages, 16 figures

详情
AI中文摘要

多信道混合信噪比训练改善了IEEE 802.11p车辆通信中深度学习信道估计器的分布外(OOD)泛化能力,但其内部机制尚不明确。本文提出REACH(基于相关性的信道估计器解释与架构压缩),一个在两层上运行的基于梯度的可解释性框架。输入级归因识别出一组在所有评估信道条件下始终相关的时频特征,从而以最小的性能损失实现输入维度缩减。滤波器级归因揭示了一种近乎通用的内部表示,为观察到的OOD泛化提供了表示层面的解释。基于由此产生的滤波器分类,相关性引导的架构压缩在归一化均方误差(NMSE)退化小于1 dB的情况下,大幅减少了参数数量和浮点运算次数(FLOPs),并且随着压缩程度的增加,OOD泛化性能的下降速度慢于分布内准确率的下降速度。

英文摘要

Multi-channel mixed-SNR training improves out-of-distribution (OOD) generalisation of deep learning channel estimators for IEEE 802.11p vehicular communications, yet the internal mechanism responsible for this remains unexplained. This work presents REACH (Relevance-based Explanation and Architectural Compression for cHannel estimators), a gradient-based interpretability framework that operates at two levels. Input-level attribution identifies a subset of time-frequency features consistently relevant across all evaluated channel conditions, enabling input dimensionality reduction with minimal performance loss. Filter-level attribution reveals a near-universal internal representation, providing a representational account of the observed OOD generalisation. Guided by the resulting filter taxonomy, relevance-guided architecture compression substantially reduces both the number of parameters and the number of floating-point operations (FLOPs) with sub-1 dB normalised mean square error (NMSE) degradation, and OOD generalisation degrades more slowly than within-distribution accuracy under increasing compression.

2606.11835 2026-06-11 cs.HC cs.AI 新提交

Designing AI-Supported Focus Groups: A Role x Modality Playbook

设计AI支持的焦点小组:角色×模态剧本

Zhiqing Wang, Steven Dow

发表机构 * University of California, San Diego(加州大学圣地亚哥分校)

AI总结 针对焦点小组资源密集且对引导高度敏感的问题,提出按AI角色(工具、联合主持、主持)和模态(文本、语音、具身)组织的剧本,并分析交互权衡与开放问题。

详情
AI中文摘要

收集参与者的生活经验是设计研究的核心。焦点小组的独特价值在于参与者不仅分享个人经历,还能相互回应,从而呈现比较、分歧和集体意义建构。然而,焦点小组资源密集且对引导高度敏感:主持人必须探究细节、平衡参与、管理话题流程并维持心理安全,微妙的引导选择可能影响哪些内容变得突出。近期人机交互研究和商业会议工具表明,生成式AI可以通过提示、轮流调节、主题映射和实时总结来支撑实时对话。然而,用户体验研究团队缺乏关于这些能力在焦点小组中的含义以及引入的方法论风险的清晰图景。我们综合了AI支持实时对话的相关工作,并将其转化为一个焦点小组特定的剧本,按AI角色(工具、联合主持、主持)和模态(文本、语音、具身)组织。我们描述了交互权衡,并识别了将AI支持的焦点小组作为方法论配置进行评估的开放问题。

英文摘要

Collecting participants' lived experiences is central to design research. Focus groups are uniquely valuable because participants not only share individual accounts but also respond to one another, surfacing comparison, disagreement, and collective sensemaking. However, focus groups are resource-intensive and highly sensitive to facilitation: moderators must probe for specificity, balance participation, manage topic flow, and sustain psychological safety, and subtle facilitation choices can shape what becomes salient. Recent HCI work and commercial meeting tools show that generative AI can scaffold live conversation through prompting, turn regulation, thematic mapping, and real-time summarization. Yet UXR teams lack a clear map of what these capabilities mean in focus groups and what methodological risks they introduce. We synthesize AI supports for live conversation and translate them into a focus-group-specific playbook organized by AI role (tool, co-host, host) and modality (text, voice, embodied).We synthesize prior work on AI-supported live conversation and propose a focus-group-specific playbook of AI supports organized by role (tool, co-host, host) and modality (text, voice, embodied). We characterize interactional trade-offs and identify open questions for evaluating AI-supported focus groups as methodological configurations.

2606.11817 2026-06-11 cs.CR cs.AI cs.CL cs.SE 新提交

Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code

语法约束解码可诱使大语言模型生成恶意代码

Yitong Zhang, Shiteng Lu, Jia Li

发表机构 * College of AI, Tsinghua University(清华大学人工智能学院)

AI总结 本文发现语法约束解码(GCD)可被利用发起名为CodeSpear的越狱攻击,使LLM生成恶意代码;并提出安全对齐方法CodeShield,通过生成蜜罐代码防御该攻击。

详情
AI中文摘要

大型语言模型(LLM)越来越多地用于代码生成,引发了对它们可能被滥用来生成恶意代码的担忧。与此同时,语法约束解码(GCD)已被广泛采用,通过强制语法有效性来提高LLM生成代码的可靠性。在本文中,我们揭示了一个反直觉的风险:这种面向可靠性的技术本身可能成为攻击面。我们发现了一种新的越狱攻击,称为CodeSpear,它利用GCD诱导LLM生成恶意代码。我们的实验表明,仅应用良性代码语法约束即可有效越狱LLM。为了解决这一漏洞,我们提出了CodeShield,一种安全对齐方法,即使在攻击者控制的语法约束下也能稳健地保持安全行为。CodeShield通过在代码模态中对齐模型,教其在GCD下生成蜜罐代码。这种代码在语义上是无害的,因此不会实现恶意请求,并且在结构上是多样化的,因此难以通过语法收紧来抑制。同时,当自然语言可用时,CodeShield仍然保留自然语言的拒绝。在4个基准测试中对10个流行LLM的实验表明,CodeSpear优于代表性的越狱基线,平均攻击成功率提高了30个百分点以上。CodeShield在CodeSpear下恢复了安全性,同时保持了良性实用性。我们的发现揭示了GCD的一个基本风险,并呼吁对其潜在安全影响给予更多关注。

英文摘要

Large Language Models (LLMs) are increasingly used for code generation, raising concerns that they may be misused to produce malicious code. Meanwhile, Grammar-Constrained Decoding (GCD) has been widely adopted to improve the reliability of LLM-generated code by enforcing syntactic validity. In this paper, we reveal a counterintuitive risk: this reliability-oriented technique can itself become an attack surface. We uncover a new jailbreak attack, termed CodeSpear, that exploits GCD to induce LLMs into generating malicious code. Our experiments show that simply applying a benign code grammar constraint can effectively jailbreak LLMs. To address this vulnerability, we propose CodeShield, a safety alignment approach that robustly preserves safe behavior even under attacker-controlled grammar constraints. CodeShield aligns the model in the code modality by teaching it to generate honeypot code under GCD. Such code is semantically harmless, so it does not implement the malicious request, and structurally diverse, so it is difficult to suppress through grammar tightening. At the same time, CodeShield still preserves natural-language refusals when natural language is available. Experiments on 10 popular LLMs across 4 benchmarks show that CodeSpear outperforms representative jailbreak baselines and increases the attack success rate by more than 30 percentage points on average. CodeShield also restores safety under CodeSpear while preserving benign utility. Our findings reveal a fundamental risk of GCD and call for greater attention to its potential security implications.

2606.11814 2026-06-11 quant-ph cs.AI cs.LG 新提交

Sparsified Kolmogorov-Arnold Networks for Interpretable Quantum State Tomography

稀疏化Kolmogorov-Arnold网络用于可解释量子态层析

Xinge Wu, Huaxin Wang, Jiajun Liu, Ruiqing He, Jiandong Shang, Hengliang Guo, Qiang Chen

发表机构 * National Supercomputing Center in Zhengzhou(郑州国家超级计算中心) Zhengzhou University(郑州大学) School of Computer and Artificial Intelligence(计算机与人工智能学院) School of Communication and Artificial Intelligence(通信与人工智能学院) School of Integrated Circuits(集成电路学院) Nanjing Institute of Technology(南京理工大学)

AI总结 研究利用稀疏化Kolmogorov-Arnold网络作为可检查的重构规则,通过三量子比特GHZ基准测试,识别出与GHZ相关的Pauli测量集,并揭示与解析GHZ Pauli分组一致的输入-隐藏-输出通路结构,实现神经网络重构模型的结构可解释性。

详情
AI中文摘要

量子态层析的机器学习方法可以实现高保真度重构,但训练模型所使用的物理结构往往隐含。这里我们探究稀疏化Kolmogorov-Arnold网络(KAN)是否不仅可以作为回归器,还可以作为可检查的重构规则,其内部组织可以与已知的Pauli结构进行对照。我们研究了一个受控的三量子比特GHZ族基准测试,其中所有63个非恒等Pauli期望值被用于重构三个GHZ子空间变量:种群不平衡$z$、实部非对角分量$c$和虚部非对角分量$c$。在有限采样和退极化噪声下,外部消融从63个测量中识别出扩展的12通道GHZ相关Pauli集,在测试的采样次数和退极化噪声强度下实现了精确的前12恢复。这些支持模式在多种子随机初始化和噪声水平分析中保持稳定,并在随机标签控制下崩溃。主要的剪枝输入-隐藏-输出通路以与解析GHZ Pauli分组一致的方式组织Z型种群可观测量和X/Y非对角可观测量,稀疏公式恢复恢复了规范的带符号Pauli关系。因此,KAN的贡献在于神经重构模型中的通路级结构可解释性,而非优越的稀疏回归。结合阴性对照,这些探针提供了一条一致性链,用于审计学习到的重构规则与已知物理结构的一致性。

英文摘要

Machine-learning approaches to quantum state tomography can achieve high reconstruction fidelity, but the physical structure used by the trained model often remains implicit. Here we ask whether a sparsified Kolmogorov-Arnold Network (KAN) can be used not only as a regressor, but also as an inspectable reconstruction rule whose internal organization can be checked against known Pauli structure. We study a controlled three-qubit GHZ-family benchmark in which all 63 non-identity Pauli expectation values are used to reconstruct three GHZ-subspace variables: the population imbalance $z$, the real off-diagonal component $c$, and the imaginary off-diagonal component $s$. Under finite-shot sampling and depolarizing noise, external ablation identifies the extended 12-channel GHZ-relevant Pauli set from the 63 measurements, with exact top-12 recovery across the tested shot counts and depolarizing-noise strengths. These support patterns remain stable across multi-seed random-initialization and noise-level analyses, and collapse under random-label controls. The dominant pruned input-hidden-output pathways organize Z-type population observables and X/Y off-diagonal observables in a pattern consistent with the analytic GHZ Pauli grouping, and sparse formula recovery recovers the canonical signed Pauli relations. The contribution of the KAN is therefore pathway-level structural interpretability within a neural reconstruction model, rather than superior sparse regression. Together with negative controls, these probes provide a consistency chain for auditing learned reconstruction rules against known physical structure.

2606.11798 2026-06-11 q-fin.CP cs.LG math.OC 新提交

Deterministic Policy Gradient for Learning Equilibrium in Time-Inconsistent Control Problems

时间不一致控制问题中学习均衡的确定性策略梯度

Xin Guo, Yijie Huang, Xiang Yu

发表机构 * Department of Industrial Engineering and Operations Research, University of California, Berkeley, USA(加州大学伯克利分校工业工程与运筹学系) Department of Applied Mathematics, The Hong Kong Polytechnic University, Kowloon, Hong Kong(香港理工大学应用数学系)

AI总结 提出一种连续时间无模型强化学习算法,通过确定性策略梯度和内定点迭代学习时间不一致控制问题的均衡策略,并在均值-方差投资组合和非指数贴现跟踪投资组合中验证有效性。

Comments Keywords: Time-inconsistent control, two-stage reformulation, model-free continuous-time reinforcement learning, deterministic policy gradient, fixed point iteration

详情
AI中文摘要

在本文中,我们开发了一种连续时间无模型强化学习算法,用于学习一般时间不一致控制问题中的确定性均衡策略。利用扩展的Hamilton-Jacobi-Bellman系统,我们将原始时间不一致问题转化为一个等价的两阶段问题。在第一阶段,对于给定的辅助函数,我们采用确定性策略梯度方法在辅助的时间一致控制问题中学习最优策略。在第二阶段,给定更新后的策略,我们利用内定点迭代和某些鞅特征来学习辅助函数。作为理论贡献,我们提供了一些温和的模型假设,并建立了内定点迭代的收敛性。通过在两阶段之间重复这种演员-评论家风格的迭代,我们的算法旨在以统一的方式学习不同时间不一致性来源下的均衡。该算法在两种经典的时间不一致金融应用中的优越有效性得到了说明:均值-方差投资组合管理和非指数贴现下的最优跟踪投资组合。

英文摘要

In this paper, we develop a continuous-time model-free reinforcement learning algorithm to learn deterministic equilibrium policies in general time-inconsistent control problems. Utilizing the extended Hamilton-Jacobi-Bellman system, we recast the original time-inconsistent problem into an equivalent two-stage problem. In the first stage, for given auxiliary functions, we employ the deterministic policy gradient approach to learn an optimal policy in an auxiliary time-consistent control problem. In the second stage, given the updated policy, we exploit the inner fixed point iterations and some martingale characterizations to learn the auxiliary functions. As a theoretical contribution, we provide some mild model assumptions and establish the convergence of inner fixed point iterations. By repeating this actor-critic style of iterations across two stages, our algorithm aims to learn the equilibrium under different sources of time-inconsistency in a unified manner. The superior effectiveness of the proposed algorithm are illustrated in two classical financial applications with time-inconsistency: mean-variance portfolio management and optimal tracking portfolio under non-exponential discounting.

2606.11795 2026-06-11 eess.AS cs.SD 新提交

Tight Boundary Prediction in Speaker Diarization Using Causal-Anticausal Consistency

说话人日志中的紧边界预测:基于因果-反因果一致性

Shota Horiguchi, Marc Delcroix, Naohiro Tawara, Takanori Ashihara, Atsushi Ando

发表机构 * NTT, Inc., Japan(日本NTT公司)

AI总结 针对松标注训练导致预测边界松散的问题,提出利用因果与反因果模型生成紧伪标签,并通过协同训练迭代优化,恢复约70%的紧标签训练效果并提升下游性能。

Comments Accepted to Interspeech 2026 (Long Paper Track)

详情
AI中文摘要

多说话人对话自动语音识别数据常用于训练说话人日志模型。由于此类数据优先考虑语义连续性,语音段中包含停顿和边界余量,导致标注松散。在此类数据上训练的模型倾向于内化产生这种松散性的机制,尽管紧语音区间有时更适用于下游应用。本文解决了利用松散标签使模型产生紧预测的新任务。我们的方法使用因果和反因果模型生成更紧的伪标签,这些模型本质上无法学习松散行为。我们进一步提出了一种协同训练方案,迭代地收紧标签并更新两个模型以进行更渐进式的优化。实验结果表明,所提方法恢复了理想紧标签训练所实现的约70%的收紧效果,并提升了下游性能。

英文摘要

Multi-talker conversational automatic speech recognition data are often used to train speaker diarization models. Because such data prioritize semantic continuity, pauses and boundary margins are included within speech segments, resulting in loose annotations. Models trained on such data tend to internalize mechanisms that reproduce this looseness, although tight speech intervals are sometimes preferable for downstream applications. In this paper, we address the novel task of enabling models to produce tight predictions using loose labels. Our method generates tighter pseudo labels using causal and anticausal models, which are inherently incapable of learning loosening behavior. We further propose a co-training scheme that iteratively tightens labels and updates both models for more progressive refinement. Experimental results show that the proposed method recovers about 70 % of the tightening effect achieved by ideal tight-label training and improves downstream performance.

2606.11780 2026-06-11 cs.IR cs.AI cs.IT math.IT 新提交

What Limits Does Quantization Place on Dense Top-$k$ Retrieval? A Theoretical Study

量化对密集Top-$k$检索的限制是什么?一项理论研究

Koki Okajima, Tsukasa Yoshida

发表机构 * NTT, Inc.(日本电报电话株式会社)

AI总结 理论证明在有限精度下,完美Top-$k$检索所需维度随语料库大小对数增长,量化精度存在阈值,影响实际系统设计。

Comments 9 pages, 2 figures

详情
AI中文摘要

我们建立了将包含$N$个文档的语料库嵌入为$d$维向量的条件,使得每个$k$子集$S \subseteq [N]$都能通过某个查询向量的top-$k$检索实现。最近的研究表明,在$\mathbb{R}^d$中,$d = O(k)$足以存在这样的嵌入,与$N$无关。我们理论上证明,这种与语料库无关的界限是无限精度所特有的。当每个坐标使用$B$比特时,完美top-$k$检索需要$Bd = \Omega(k \ln N)$;因此,在任何固定精度下,维度必须至少随$N$对数增长。针对$\ell_2$归一化的$B$比特均匀标量量化模型,我们还确定了精度阈值$B^{*} = O(\ln \ln N)$,低于该阈值任何维度都不够,同时还有两个进一步限制可行$(B, d)$对的区域。我们的结果表明,在实际的向量数据库和密集检索系统中,由于量化是标准操作,嵌入维度和可能的精度必须随语料库大小增长。

英文摘要

We establish conditions for embedding a corpus of $N$ documents as $d$-dimensional vectors such that every $k$-subset $S \subseteq [N]$ is realizable as a result of top-$k$ retrieval by some query vector. Recent work shows that $d = O(k)$ suffices for such embeddings to exist in $\mathbb{R}^d$, independently of $N$. We theoretically prove that this corpus-independent bound is specific to infinite precision. With $B$ bits per coordinate, perfect top-$k$ retrieval requires $Bd = Ω(k \ln N)$; thus, at any fixed precision, the dimension must grow at least logarithmically with $N$. Specializing to a $\ell_2$-normalized $B$-bit uniform scalar quantization model, we also identify a threshold on the precision $B^{*} = O(\ln \ln N)$ below which no dimension suffices, together with two further regimes that bound the feasible $(B, d)$ pairs. Our result implies that in practical vector databases and dense retrieval systems where quantization is standard, the embedding dimension and possibly the precision must grow with the corpus size.

2606.11773 2026-06-11 math.OC cs.LG 新提交

Last-Iterate Convergence of Optimistic Multiplicative Weight Update

乐观乘性权重更新的最后迭代收敛性

Francesco Orabona

发表机构 * King Abdullah University of Science and Technology(卡塔尔科学与技术大学)

AI总结 本文证明乐观乘性权重更新(OMWU)在光滑凸-凹鞍点问题中以足够小的常数学习率渐近收敛,无需唯一性、严格互补性、误差界或接近解的初始化。

详情
AI中文摘要

乐观梯度上升下降(OGDA)和乐观乘性权重更新(OMWU)是解决凸/凹鞍点问题的两种非常流行的算法,其中OMWU是OGDA的非欧几里得熵版本。自80年代以来,已知OGDA的最后迭代在光滑问题中渐近收敛到鞍点。另一方面,OMWU是否具有相同性质尚不清楚。在本文中,我证明了OMWU对于光滑凸-凹鞍点问题,在足够小的常数学习率下渐近收敛。该结果不需要唯一性、严格互补性、误差界或接近解的初始化。主要的新成分是一个边界论证,表明每个聚点满足非活动坐标的KKT不等式。该边界论证是在ChatGPT的协助下发现的,并在附录中记录。

英文摘要

Optimistic Gradient Descent Ascent (OGDA) and Optimistic Multiplicative-Weights Update (OMWU) are two very popular algorithms to solve convex/concave saddle-point problems, where OMWU is the non-Euclidean, entropic version of OGDA. It is known since the '80s that the last iterate of OGDA asymptotically converges to a saddle point in smooth problems. On the other hand, it is unknown if OMWU has the same property. In this paper, I show that OMWU converges asymptotically for smooth convex-concave saddle-point problems, with a small enough constant learning rate. The result does not require uniqueness, strict complementarity, an error bound, or initialization near a solution. The main new ingredient is a boundary argument showing that every cluster point satisfies the inactive-coordinate KKT inequalities. The boundary argument was discovered with assistance from ChatGPT and is documented in the appendix.

2606.11738 2026-06-11 stat.ML cs.LG 新提交

Renewable Lasso without Batch-Number Constraints: A Gradient-Enhanced Approach

无批次数量约束的可再生Lasso:一种梯度增强方法

Junzhuo Gao, Ling Peng, Xu Guo, Heng Lian

发表机构 * Department of Mathematics, City University of Hong Kong(香港城市大学数学系) School of Statistics and Data Science, Jiangxi University of Finance and Economics(江西财经大学统计与数据科学学院) Philosophy and Social Sciences Laboratory of Data Science in Finance and Economics at the Ministry of Education, Jiangxi University of Finance and Economics(教育部金融与经济数据科学哲学与社会科学实验室,江西财经大学) School of Statistics, Beijing Normal University(北京师范大学统计学院) CityUHK Shenzhen Research Institute(城大深圳研究院)

AI总结 针对高维广义线性模型的流数据在线估计,提出梯度增强替代损失函数,消除批次数量约束,并扩展到分布式流数据场景,理论推导非渐近误差界,实验验证精度提升。

详情
AI中文摘要

我们研究具有流数据的高维广义线性模型的在线估计。首先,针对非分布式设置,我们提出一种梯度增强替代损失函数,仅使用历史摘要近似累积损失,修改并改进了现有高维设置下同一模型的可再生估计方法,并消除了先前研究中的批次数量约束。然后,我们将该方法扩展到主从架构下的分布式流数据,其中批次按站点划分,仅交换摘要(梯度向量)。我们的调整方法不要求客户端计算完整的替代损失,而不是直接应用Jordan等人(2019)的流行方法到替代二次损失。我们在高维尺度下推导了非渐近误差界,没有先前研究中严格的批次数量约束。在线性和逻辑模型下的模拟结果以及实际数据应用表明,与现有的可再生估计器相比,精度有所提高。

英文摘要

We study online estimation for high-dimensional generalized linear models with streaming data. First, for the non-distributed setting, we propose a gradient-enhanced surrogate loss that approximates the cumulative loss using only historical summaries, which modifies and improves upon the existing renewable estimation approach for the same model in the high-dimensional setting, and removes the batch-number constraint in previous studies. We then extend the method to distributed streaming data under the master-client architecture, where batches are partitioned across sites and only summaries (gradient vectors) are exchanged. Instead of directing applying the popular method of Jordan et al. (2019) to the surrogate quadratic loss, our adjusted approach does not require the clients to compute the full surrogate loss. We derive non-asymptotic error bounds under the high-dimensional scaling, without the stringent constraint on the number of batches in the previous studies. Simulation results under linear and logistic models, together with a real-data application, show improved accuracy over existing renewable estimators.

2606.11737 2026-06-11 astro-ph.EP astro-ph.IM cs.LG 新提交

Machine-learning clustering of close-in exoplanet populations: links to pebble accretion

近地系外行星的机器学习聚类:与卵石吸积的联系

Yi Duann, Anders Johansen, Haiyang S. Wang, H. Jens Hoeijmakers

发表机构 * Center for Star and Planet Formation(星系与行星形成中心) Globe Institute(全球研究所) University of Copenhagen(哥本哈根大学) Lund Observatory(隆德天文台) Department of Physics(物理系) Lund University(隆德大学) Graduate Institute of Astronomy(天文研究所) National Central University(国立中央大学)

AI总结 利用高斯混合模型对近地系外行星进行无监督聚类,揭示其内在子群,并通过卵石吸积合成种群解释形成路径差异。

详情
AI中文摘要

近地系外行星展现出由形成条件和迁移过程塑造的广泛轨道构型和物理性质。尽管种群合成模型预测了不同的行星种群,但在观测到的系外行星与合成种群之间建立定量联系仍然具有挑战性。我们使用物理驱动的动力学参数研究近地系外行星的内在组织,并将所得种群与卵石吸积形成路径联系起来。将两阶段高斯混合模型应用于观测到的近地系外行星样本,在由行星-恒星相互作用的动力学描述符主导的特征空间中进行无监督概率聚类。将所得聚类映射到统计驱动的三维参数空间中的卵石吸积合成种群。然后使用与形成相关的量(包括气体可用性、气体分数和冰岩质量比)来解释映射的种群。我们在不施加预定义分类边界的情况下识别出统计上支持的子群,包括超大质量气态巨行星、热巨行星、暖木星主导系统和低质量巨行星。映射的合成种群揭示了形成时间、气体吸积和固体增长历史的系统性差异。特别是,超大质量气态巨行星比热巨行星和暖木星主导种群更倾向于与更早的形成时期相关联。这些结果表明,物理驱动的机器学习方法可以为观测到的系外行星种群与理论行星形成路径之间的联系提供统计上稳健的框架。

英文摘要

Close-in exoplanets exhibit a wide range of orbital architectures and physical properties shaped by both formation conditions and migration processes. Although population-synthesis models predict distinct planetary populations, establishing a quantitative connection between observed exoplanets and synthetic populations remains challenging. We investigate the intrinsic organisation of close-in exoplanets using physically motivated dynamical parameters and connect the resulting populations to pebble-accretion formation pathways. A two-stage Gaussian mixture model (GMM) is applied to an observed sample of close-in exoplanets, performing unsupervised probabilistic clustering in a feature space dominated by dynamical descriptors of planet-star interactions. The resulting clusters are mapped onto a pebble-accretion synthetic population within a statistically motivated three-dimensional parameter space. Formation-related quantities, including gas availability, gas fraction, and ice-rock mass ratio, are then used to interpret the mapped populations. We identify statistically supported sub-populations without imposing predefined classification boundaries, including very-massive gas giants, hot giants, warm-Jupiter-dominated systems, and lower-mass giants. The mapped synthetic populations reveal systematic differences in formation timing, gas accretion, and solid growth histories. In particular, very-massive gas giants are preferentially associated with earlier formation epochs than hot-giant and warm-Jupiter-dominated populations. These results demonstrate that physically motivated machine-learning approaches can provide a statistically robust framework for linking observed exoplanet populations to theoretical planet formation pathways.

2606.11698 2026-06-11 cs.CR cs.AI 新提交

T2S: A Rehearsal-Based Approach for Extraction-Resistant Model Watermarking

T2S:一种基于排练的防提取模型水印方法

Jian-Ping Mei, Weibin Zhang, Ao Yao, Tiantian Zhu, Jie Xiao

发表机构 * College of Computer Science and Technology, Zhejiang University of Technology(浙江工业大学计算机科学与技术学院)

AI总结 针对模型提取攻击,提出一种基于排练的水印嵌入框架,通过模拟提取过程并利用被盗模型在触发集上的损失微调水印知识,增强水印的迁移性和鲁棒性。

详情
Journal ref
ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2026, pp. 13967-13971
AI中文摘要

模型水印通过嵌入独特知识来诱导独特行为特征,从而保护AI模型的知识产权。主要技术挑战在于确保水印对水印模型的各种后处理攻击具有鲁棒性。模型提取攻击是最严重的威胁,攻击者利用预测输出训练替代模型,非法复制原始模型的功能。在这项工作中,我们提出了一种基于排练的水印嵌入框架,以增强模型水印对模型提取攻击的鲁棒性。通过模拟提取过程,我们的方法利用\textit{模拟被盗模型}在触发集上的损失作为训练信号,微调目标模型中的水印知识。这个微调步骤鼓励水印以增强可迁移性的方式嵌入,从而增加其在被盗模型中持续存在并保持可检测的机会。在不同设置下进行的全面实验表明,所提出的方法显著提高了模型水印对模型提取和后续水印移除攻击的鲁棒性。

英文摘要

Model watermarking safeguards AI model intellectual property by embedding distinctive knowledge that induces unique behavioral signatures. The primary technical challenge lies in ensuring watermark robustness against various post-processing attacks on the watermarked model. Model extraction attacks emerge as the most severe threat, where adversaries exploit prediction outputs to train surrogate models that illegally replicate the original model's functionality. In this work, we propose a rehearsal-based watermark embedding framework to enhance the robustness of model watermarks against model extraction attacks. By simulating the extraction process, our method leverages the loss of a \textit{simulated stolen model} on a trigger set as a training signal to fine-tune the watermark knowledge within the target model. This fine-tuning step encourages the watermark to be embedded in a way that boosts transferability, thereby increasing its chances of persisting and remaining detectable in stolen models. Comprehensive experiments conducted under diverse settings demonstrate that the proposed method significantly improves the robustness of model watermarks against both model extraction and subsequent watermark removal attacks.

2606.11676 2026-06-11 cs.CE cs.LG physics.comp-ph 新提交

Neural-Parameterized Cellular Automata for Wildfire Spread

神经参数化元胞自动机用于野火蔓延

Maksym Zhenirovskyy, Ion Matei, Rohit Vuppala, Takuya Kurihana, Hon Yung Wonga

发表机构 * Fujitsu Research of America(富士通美国研究)

AI总结 提出一种混合深度学习参数化概率元胞自动机框架,利用多尺度卷积神经网络动态生成空间变化参数,在保持物理可解释性的同时捕捉复杂环境交互,在六次大型野火中实现72小时IoU>0.6的预测。

Comments 16 pages, 9 figures

详情
AI中文摘要

传统野火模型依赖刚性、低维参数和静态燃料图,常常低估火势蔓延。为解决这一弱点,我们引入了一个在JAX中实现的混合深度学习参数化概率元胞自动机(CA)框架。我们的方法采用多尺度卷积神经网络动态生成控制火势蔓延概率、风向对齐和坡度影响的空间变化参数。这种混合设计捕捉了复杂的非线性环境交互,同时保留了底层三态CA的物理可解释性。JAX实现支持硬件加速和基于梯度的参数校准。在美国西部六次大规模野火上的评估显示,在10天数据同化窗口期间模型逐步拟合观测到的火线后,该模型在72小时预测范围内保持IoU>0.6;由此产生的预测是在这些观测中已编码的抑制机制下火势增长的条件投影。

英文摘要

Traditional wildfire models rely on rigid, low-dimensional parameters and static fuel maps, frequently underpredicting fire spread. To address this weakness, we introduce a hybrid deep-learning parameterized Probabilistic Cellular Automata (CA) framework implemented in JAX. Our approach employs a Multi-Scale Convolutional Neural Network to dynamically generate spatially varying parameters that govern fire-spread probability, wind alignment, and slope influence. This hybrid design captures complex, nonlinear environmental interactions while preserving the physical interpretability of the underlying three-state CA. The JAX implementation enables hardware acceleration and gradient-based parameter calibration. Evaluated on six large-scale wildfires in the western United States, the model maintains IoU > 0.6 over 72-hour forecast horizons after a 10-day data assimilation window during which the model is fitted incrementally to observed perimeters; the resulting forecast is a conditional projection of fire growth under the suppression regime already ncoded in those observations.

2606.11673 2026-06-11 quant-ph cs.LG 新提交

Higher-Order Token Interactions via Quantum Attention

高阶令牌交互的量子注意力机制

Jian Xu, Chao Li, Delu Zeng, John Paisley, Qibin Zhao

发表机构 * RIKEN iTHEMS RIKEN AIP South China University of Technology(华南理工大学) Columbia University(哥伦比亚大学)

AI总结 提出量子高阶注意力(QHA),通过数据重上传和非克利福德纠缠器在浅电路中合成任意阶令牌交互,证明其表达能力超越经典自注意力,并具有可训练性保证,在遗传上位、带噪学习奇偶和图三角形检测中高效检测高阶交互。

详情
AI中文摘要

标准点积自注意力在单层中仅计算令牌间的成对(二阶)交互;表示一般的$k$阶交互已知需要在单层中使用超二次资源或通过深度组合。我们引入\textbf{量子高阶注意力(QHA)},一种浅层、硬件可实现的量子注意力头,通过数据重上传和全对非克利福德纠缠器,在电路内部合成$k$阶令牌交互,并通过局部单量子比特读出暴露它们。我们证明:(i)表达能力分离:任何嵌入维度$m$、$H$个头和$p$位精度满足$mHp=o(N/\log\log N)$的单个标准自注意力层无法表示一个QHA头以电路深度$O(\log k)$($O(k)$个两量子比特门)表示的$k$阶相关族;(ii)其局部设计实例的可训练性保证:使用局部读出和$O(\log n)$深度,梯度方差为$\Omega(1/\mathrm{poly}(n))$(无贫瘠高原),我们通过实验确认——同时明确我们基准测试的更具表达力的全对实例是经验训练的,并显示指数衰减的梯度。实验上,在参数预算小$6.5\times$的情况下,QHA从不相交输入中泛化每个阶$k\le6$的隐藏子集奇偶性,而更大的经典注意力头在阶~2之后崩溃;与理论一致,优势的大小跟踪目标的傅里叶度——奇偶性最大,当存在低阶结构时缩小。作为一个应用,QHA在三个领域——遗传上位、带噪学习奇偶和图三角形检测——作为紧凑的高阶交互检测器,在最小的参数预算下达到噪声上限,而领域标准的线性方法失败。

英文摘要

Standard dot-product self-attention computes, in a single layer, only pairwise (order-2) interactions between tokens; representing a generic order-$k$ interaction is known to require either super-quadratic resources in one layer or composition across depth. We introduce \textbf{Quantum Higher-Order Attention (QHA)}, a shallow, hardware-realizable quantum attention head that, via data re-uploading and an all-to-all non-Clifford entangler, synthesizes order-$k$ token interactions inside the circuit and exposes them through a local single-qubit read-out. We prove (i) an expressivity separation: any single standard self-attention layer with embedding dimension $m$, $H$ heads and $p$-bit precision satisfying $mHp=o(N/\log\log N)$ cannot represent the order-$k$ correlation family that one QHA head represents with circuit depth $O(\log k)$ ($O(k)$ two-qubit gates); and (ii) a trainability guarantee for its local-design instantiation: with a local read-out and $O(\log n)$ depth the gradient variance is $Ω(1/\mathrm{poly}(n))$ (no barren plateau), which we confirm empirically -- while being explicit that the more expressive all-to-all instantiation we benchmark is trained empirically and shows exponentially decaying gradients. Empirically, at a $6.5\times$ smaller parameter budget, QHA generalizes hidden-subset parity of every order $k\le6$ from disjoint inputs, whereas the larger classical attention head collapses past order~2; consistent with theory, the size of the advantage tracks the target's Fourier degree - largest for parity and shrinking when low-order structure is present. As an application, QHA serves as a compact high-order interaction detector across three domains - genetic epistasis, learning-parity-with-noise, and graph triangle detection - reaching the noise ceiling at the smallest parameter budget where field-standard linear methods fail.

2606.11672 2026-06-11 cs.CR cs.AI 新提交

Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

开源LLM代理能否取代静态应用安全测试工具?一项实证评估

Derek Yohn, Luke Flancher, Mirajul Islam, Khaled Slhoub

发表机构 * College of Engineering and Science, Florida Institute of Technology(工程学院与科学学院,佛罗里达理工学院)

AI总结 评估基于开源LLM的代理在静态应用安全测试中的性能,与SAST工具Bandit对比,发现当前不适合实际应用。

Comments Keywords: Agentic AI, Cybersecurity, Large Language Models, Static Application Security Testing, Model performance evaluation

详情
AI中文摘要

本文探讨了代理式AI工具在网络安全领域的价值。我们评估了基于通用GenAI大语言模型(LLM)的代理在三种不同Ollama托管的通用开源模型驱动下的有效性。我们使用精确率、召回率、误报数以及基于捕获指标交互计算的综合得分,评估每个代理的性能,并与现有经过验证的静态应用安全测试(SAST)工具Bandit的基线性能进行比较。我们的研究结果驳斥了现代开源GenAI LLM代理在当前现实条件下适用于SAST扫描这一专门任务的看法。

英文摘要

This paper explores the value of agentic AI tools for cybersecurity purposes. We evaluate the efficacy of a general-purpose GenAI Large Language Model- (GenAI-) based agent when powered by three different Ollama-hosted general-purpose open source models. We assess each agent's performance using precision, recall, false positive count, and a calculated composite score based upon the interplay of the captured metrics, against the baseline performance of an existing, vetted Static Application Security Testing (SAST) tool, Bandit. Our findings refute the notion that a modern open-source GenAI LLM-based agent is currently suitable for the specialized task of SAST scanning under realistic conditions.

2606.11671 2026-06-11 cs.CR cs.AI 新提交

Runtime Skill Audit: Targeted Runtime Probing for Agent Skill Security

运行时技能审计:针对智能体技能安全的目标运行时探测

Tu Lan, Chaowei Xiao

发表机构 * Johns Hopkins University(约翰霍普金斯大学)

AI总结 提出运行时技能审计(RSA)动态分析方法,通过目标运行时条件探测技能行为,在100个技能上达到90.0%准确率,优于静态基线。

详情
AI中文摘要

智能体技能让LLM智能体能够复用指令、资源、工具和工作流,但也为恶意行为提供了新的隐藏场所。一个技能在其文档或代码中可能看起来无害,但只有在与特定用户请求、本地资产、持久状态或多步骤工具交互调用时才会变得有害。这使得纯静态审查变得脆弱。我们提出运行时技能审计(RSA),一种动态分析方法,通过询问技能介导的智能体在目标运行时条件下实际做了什么来审计技能。RSA不是用相同的通用任务测试每个技能,而是分析风险相关接口,准备执行上下文以触发这些接口,并根据产生的跟踪证据分配安全标签。我们在OpenClaw上实现RSA,并在100个技能上针对代表性静态基线进行评估。RSA达到90.0%的准确率,88.0%的真阳性率和8.0%的假阳性率,比最佳静态基线提高13.0个百分点。在自进化攻击下,静态检测器在一两轮后崩溃,而RSA在每轮中持续检测出19-20个恶意技能。

英文摘要

Agent skills let LLM agents reuse instructions, resources, tools, and workflows, but they also create a new place for malicious behavior to hide. A skill may look benign in its documentation or code while becoming harmful only when it is invoked with particular user requests, local assets, persistent state, or multi-step tool interactions. This makes purely static vetting brittle. We present Runtime Skill Audit (RSA), a dynamic analysis method that audits skills by asking what the skill-mediated agent actually does under targeted runtime conditions. Instead of testing every skill with the same generic tasks, RSA profiles risk-relevant interfaces, prepares the execution context needed to exercise them, and assigns security labels from the resulting trace evidence. We instantiate RSA on OpenClaw and evaluate it on 100 skills against representative static baselines. RSA achieves 90.0\% accuracy with an 88.0\% true positive rate and an 8.0\% false positive rate, improving accuracy by 13.0 percentage points over the best static baseline. Under self-evolving attacks, static detectors collapse after one or two rounds, while RSA continues to detect 19--20 out of 20 malicious skills across rounds.

2606.11663 2026-06-11 cs.SI cs.LG 新提交

Probabilistic Salary Prediction with Graph Attention Networks and a Mixture Density Network

基于图注意力网络和混合密度网络的概率薪资预测

Zhipei Qin, Mohammad Shokri, N. van Weeren, F. W. Takes

发表机构 * Leiden Institute of Advanced Computer Science (LIACS)(莱顿高级计算机科学研究所) Leiden University(莱顿大学) Randstad N.V.(Randstad公司)

AI总结 提出GAT-MDN框架,通过构建属性关系图并使用图注意力网络学习节点表示,结合混合密度网络输出薪资分布,在百万级荷兰招聘数据集上优于基线模型。

Comments 5 pages, 3 figures

详情
AI中文摘要

准确的薪资预测对于弥合现代劳动力市场中雇主与求职者之间的信息差距至关重要。现有方法主要产生单点估计,并将工作属性(如地点、职业和行业)视为独立的分类特征,忽略了真实世界薪酬数据固有的不确定性和多模态性,以及支配薪资规范的丰富层次结构和语义相似性关系。在本文中,我们提出了GAT-MDN,一个同时解决这两个限制的统一框架。对于三个属性域中的每一个,我们构建了一个特定领域的图,其边编码了(i)层次化的父子包含关系和(ii)从预训练的Sentence-Transformer导出的加权相似性链接。具有边缘特征感知注意力的并行图注意力网络(GAT)从这些多关系图中学习丰富的、上下文感知的节点表示。然后,一个基于优先级的层次选择模块组装一个复合特征向量,优雅地处理缺失或粗略的属性,而混合密度网络(MDN)头将该向量映射到高斯混合模型(GMM)的参数,产生完整的条件薪资分布。在超过100万条记录的真实世界荷兰招聘数据集上的大量实验表明,GAT-MDN在负对数似然(NLL)和均方误差(MSE)方面均显著优于非图MLP-MDN基线。

英文摘要

Accurate salary prediction is critical for bridging the information gap between employers and job seekers in modern labor markets. Existing approaches predominantly yield a single point estimate and treat job attributes such as location, occupation, and industry as independent categorical features, ignoring both the inherent uncertainty and multi-modality of real-world compensation data and the rich hierarchical and semantic-similarity relationships that govern pay norms. In this paper we propose GAT-MDN, a unified framework that addresses both limitations simultaneously. For each of the three attribute domains we construct a domain-specific graph whose edges encode (i) hierarchical parent-child containment and (ii) weighted similarity links derived from a pre-trained Sentence-Transformer. Parallel Graph Attention Networks (GATs) with edge-feature-aware attention learn rich, context-sensitive node representations from these multi-relational graphs. A priority-based hierarchical selection module then assembles a composite feature vector that gracefully handles missing or coarse attributes, and a Mixture Density Network (MDN) head maps this vector to the parameters of a Gaussian Mixture Model (GMM), yielding a full conditional salary distribution. Extensive experiments on a real-world Dutch job-posting dataset of over 1 million records demonstrate that GAT-MDN significantly outperforms a non-graph MLP-MDN baseline in both Negative Log-Likelihood (NLL) and Mean Squared Error (MSE).

2606.11648 2026-06-11 cs.CR cs.CL 新提交

Dummy Backdoor as a Defense: Removing Unknown Backdoors via Shared Internal Mechanisms for Generative LLMs

虚拟后门作为防御:通过共享内部机制移除生成式大语言模型中的未知后门

Kazuki Iwahana, Masaru Matsubayashi, Takuma Koyama, Toshiki Shibahara, Kenichiro Omintato, Akira Ito

发表机构 * NTT Social Informatics Laboratories(NTT社会信息实验室) Tohoku University(东北大学)

AI总结 提出一种基于共享内部机制的后门移除方法,通过嵌入已知触发器的虚拟后门并微调移除,从而降低未知后门攻击成功率,同时保持模型效用。

详情
AI中文摘要

后门攻击对大型语言模型(LLMs)的安全性和可靠性构成严重威胁,因为它们使模型在干净输入上表现正常,但在隐藏触发器出现时产生攻击者指定的响应。当防御者不知道后门攻击类型或通过后门训练形成的内部机制时,移除这种未知后门尤其具有挑战性。在这项工作中,我们提出了一种简单但有效的后门移除方法,基于不同后门之间的共享内部机制。首先,我们展示了具有相同任务(攻击目标)的不同后门会在内部激活中引发类似的触发器激活变化。受此观察启发,我们的方法有意嵌入一个具有已知触发器的后门(虚拟后门),然后通过在虚拟触发器输入与干净响应对上进行进一步微调来移除它。由于虚拟后门和未知后门可以依赖共享的内部机制,移除虚拟后门也会降低未知后门的效果。我们在多个模型家族上对三种后门攻击类型进行了评估。实验结果表明,我们的方法在保持模型效用的同时,显著降低了未知后门的攻击成功率,在后门移除效果和效用保持方面均优于现有的代表性防御方法。这些发现表明,防御者可控制的后门可以作为减轻生成式LLMs中未知后门的有益代理。

英文摘要

Backdoor attacks pose a serious threat to the safety and reliability of Large Language Models (LLMs), as they cause models to behave normally on clean inputs while producing attacker-specified responses when hidden triggers are present. Removing such unknown backdoors is particularly challenging when the defender does not know the backdoor attack types or the internal mechanisms formed through backdoor training. In this work, we propose a simple but effective backdoor removal method based on shared internal mechanisms across different backdoors. First, we show that different backdoors with the same task (attack objective) induce similar trigger-activated changes in the internal activations. Motivated by this observation, our method intentionally embeds a backdoor with a known trigger (\emph{dummy backdoor}) and then removes it through further fine-tuning on dummy-triggered inputs paired with clean responses. Since the dummy backdoor and the unknown backdoor can rely on shared internal mechanisms, removing the dummy backdoor also reduces the effect of the unknown backdoor. We evaluate our method on three backdoor attack types across multiple model families. Experimental results show that our method substantially reduces the attack success rate of the unknown backdoor while preserving model utility, outperforming representative existing defense methods in both backdoor removal effectiveness and utility preservation. These findings suggest that a defender-controllable backdoor can serve as a helpful proxy for mitigating unknown backdoors in generative LLMs.

2606.11642 2026-06-11 cs.HC cs.CL 新提交

3-Key-Input: Exploring the Theoretical Minimum Keys for Text Entry

3-Key-Input: 探索文本输入的理论最少按键数

Naoki Kimura

发表机构 * Naoki Kimura(金仓大学)

AI总结 本研究通过结合语言模型与2-5个物理按键,系统评估文本输入系统,发现3键+GPT-4o可实现字符错误率9.46%,表明在强语言模型先验下3键是实用最小值。

Comments 6 pages, 1 figure, 7 tables. Published in ICASSP 2026

详情
Journal ref
Proc. ICASSP 2026, pp. 19392-19396, 2026
AI中文摘要

如果我们为模糊键盘配备现代语言模型,可以将物理按键数量减少到多少?更少的按键在辅助设备和移动设备等受限场景中增加了硬件设计自由度。本文系统评估了使用2-5个物理按键结合基于语言模型的消歧的文本输入系统。在包含300个句子的英文语料库(商务/会话/技术各100句)上,我们比较了按键数量(2-5)、字母到按键映射(基于布局/基于频率/故意最坏情况)和解码器(仅Trie、GPT-2束搜索、GPT-4o选择)。我们发现,3键+GPT-4o实现了字符错误率(CER)9.46%和词错误率(WER)12.20%,相对于2键(CER 23.3%)CER降低了59%。在3键时,键流熵为1.54比特/字符;虽然增加到5键提高了准确率(CER 5.4%),但边际增益递减。在标准设计下,映射选择影响较小(ΔCER < 0.5个百分点),即使故意最坏映射也仅使CER增加+0.5个百分点,而技术句子的错误率大约是商务句子的两倍。这些结果表明,在我们评估的离线设置下,在强语言模型先验下,3键是通用英语的实用最小值。

英文摘要

How far can we reduce the number of physical keys if we endow an ambiguous keyboard with modern language models? Fewer keys increase hardware design freedom in constrained settings such as assistive devices and mobile form factors. This paper systematically evaluates text entry systems using 2-5 physical keys combined with language-model-based disambiguation. On a 300-sentence English corpus (100 sentences each for Business / Conversational / Technical), we compare key counts (2-5), letter-to-key mappings (layout-based / frequency-based / intentionally worst-case), and decoders (Trie-only, GPT-2 beam search, GPT-4o selection). We find that 3 keys + GPT-4o achieves character error rate (CER) 9.46% and word error rate (WER) 12.20%, reducing CER by 59% relative to 2 keys (CER 23.3%). At 3 keys, the key-stream entropy is 1.54 bits/char; while increasing to 5 keys improves accuracy (CER 5.4%), the marginal gains diminish. Mapping choice has a small impact under standard designs (ΔCER < 0.5 pp), and even an intentionally worst mapping degrades CER by only +0.5 pp, whereas Technical sentences yield roughly twice the error rate of Business. These results suggest that, in our evaluated offline setting under a strong LM prior, 3 keys are a practical minimum for general English.

2606.11635 2026-06-11 cs.CY cs.AI 新提交

Are LLMs Bad at Moral Reasoning?

LLMs 在道德推理上表现不佳吗?

Menghang Zhu, Seth Lazar

发表机构 * School of Philosophy (Political Philosophy) Renmin University of China(哲学学院(政治哲学)中国人民大学) School of Government and Policy Johns Hopkins University(政府与政策学院约翰霍普金斯大学)

AI总结 本文通过让LLMs生成评分标准而非直接评分,重新评估MoReBench数据集,发现LLMs的道德推理能力比先前认为的更强。

详情
AI中文摘要

为了让高能力AI系统在动态、开放的环境中安全运行,它们必须能够识别、理解并响应行动中的道德理由,并据此约束自身行为。越来越多的研究旨在评估当今最先进AI系统的这种能力——道德能力,最近得出了普遍悲观的结论。其中一篇最具雄心的论文收集了人类专家制定的黄金标准评分标准,用于评估1000个案例中的道德推理,并以此基准测试前沿AI模型,结果不尽如人意。在本文中,我们认为MoReBench数据集可以被重新利用,以给出对LLMs道德推理(道德能力的重要组成部分)更为乐观的图景。我们表明,如果不根据这些评分标准对LLMs的回应进行评分,而是让LLMs执行与人类相同的任务——为特定案例的道德分析生成评分标准——那么它们生成的评分标准与人类评分标准的校准程度高于其开放式回应,并且在存在差异时,这些差异可能仅仅反映了大多数道德问题的巨大维度,同时也突出了人类在“创建评分标准的评分标准”上的某些偏离。考虑到这些观点,MoReBench数据集表明LLMs在道德推理方面的能力比先前认为的要强得多。

英文摘要

For highly capable AI systems to operate safely in dynamic, open-ended environments, they must be able to identify, understand, and respond to moral reasons for action, and constrain their behaviour accordingly. A growing body of research aims to evaluate this capacity -- moral competence -- in today's most capable AI systems, recently reaching broadly pessimistic conclusions. One of the most ambitious such papers collects gold-standard human-authored rubrics for evaluating moral reasoning in 1,000 cases, and benchmarks frontier AI models against those rubrics, with underwhelming results. In this paper, we argue that the MoReBench dataset can be redeployed to give a much more optimistic picture of LLMs' moral reasoning (an essential part of moral competence). We show that if, instead of scoring LLMs' responses to these cases against these rubrics, we instead give the LLMs the same task given to humans -- to generate scoring rubrics for the moral analysis of particular cases -- the rubrics they generate are both better calibrated to the human rubrics than their open-ended responses, and, where they differ, plausibly reflect nothing more than the vast dimensionality of most moral problems, as well as highlighting some human departures from the "rubric for creating rubrics". Taking these points into consideration, the MoReBench dataset suggests that LLMs are significantly more capable at moral reasoning than was previously believed.

2606.11632 2026-06-11 cs.CR cs.AI cs.DC cs.MA 新提交

Sovereign Assurance Boundary: Certificate-Bound Admission for Agentic Infrastructure

主权保证边界:面向智能体基础设施的证书绑定准入机制

Jun He, Deying Yu

发表机构 * OpenKedge.io(OpenKedge实验室)

AI总结 针对智能体基础设施中非确定性推理系统对生产资源的高风险操作,提出主权保证边界(SAB),通过证书绑定的运行时准入层,将代理提案编译为执行合约并绑定加密证据,实现可验证、可撤销的授权控制。

Comments 12 pages, 1 figure, 13 tables

详情
AI中文摘要

智能体基础设施引入了一个关键的控制平面授权问题:非确定性推理系统可以对生产资源提出高风险变更,但现有的安全机制——如身份与访问管理(IAM)、策略引擎、共识协议和审计日志——要么强制执行静态的、上下文无关的权限,要么仅在执行后记录操作。本文介绍了主权保证边界(SAB),一种用于自主执行权限的证书绑定运行时准入层。SAB在保证气闸处拦截代理提案,将其编译为类型化执行合约$C$,并将这些合约绑定到加密证据摘要$H(E)$和策略版本。然后,合约通过后果感知的认证路径进行路由。成功准入后,系统发出一个严格限定于特定执行身份、撤销周期和有效时间窗口的签名主权保证证书($\Omega$)。最后,主权执行代理验证$\Omega$,并在调用基础设施API之前执行新鲜的执行前撤销和漂移检查。我们详细描述了气闸-代理架构,形式化了其准入和撤销不变量,并报告了在Go原型上对2500次准入尝试评估的初步可行性测量。最终,这种代理强制模型防止了自主推理直接改变状态,将委托的执行权限转化为一个可加密验证、证据绑定、可撤销且可重放的运行时工件。

英文摘要

Agentic infrastructure introduces a critical control-plane authorization problem: non-deterministic reasoning systems can propose high-stakes mutations to production resources, yet existing security mechanisms -- such as identity and access management (IAM), policy engines, consensus protocols, and audit logs -- either enforce static, context-unaware permissions or merely record actions post-execution. This paper introduces the Sovereign Assurance Boundary (SAB), a certificate-bound runtime admission layer for autonomous execution authority. SAB intercepts agent proposals at an assurance airlock, compiles them into typed execution contracts $C$, and binds these contracts to cryptographic evidence digests $H(E)$ and policy versions. The contracts are then routed through consequence-aware certification paths. Upon successful admission, the system emits a signed Sovereign Assurance Certificate ($Ω$) that is strictly scoped to a specific execution identity, revocation epoch, and validity window. Finally, a sovereign execution broker verifies $Ω$ and performs fresh pre-execution revocation and drift checks before invoking infrastructure APIs. We detail the airlock-broker architecture, formalize its admission and revocation invariants, and report preliminary feasibility measurements from a Go prototype evaluated over 2,500 admission attempts. Ultimately, this broker-enforced model prevents autonomous reasoning from directly mutating state, transforming delegated execution authority into a cryptographically verifiable, evidence-bound, revocable, and replayable runtime artifact.

2606.11631 2026-06-11 eess.AS cs.SD 新提交

Benchmarking Neural Speech Compression from a Rate-Distortion Perspective

从率失真角度基准测试神经语音压缩

Jun Xu, Zhengxue Cheng, Fengxi Zhang, Yuhan Liu, Li Song, Wenjun Zhang

发表机构 * School of Information Science and Electronic Engineering, Shanghai Jiao Tong University(信息科学与电子工程学院,上海交通大学)

AI总结 提出熵约束编解码器ECC,通过标量量化与学习熵模型结合,在低比特率下实现优于传统和神经编解码器的率失真性能。

详情
AI中文摘要

基于学习的语音压缩在低比特率性能上取得了有前景的成果,但许多神经语音编解码器仍使用预设速率的离散符号描述量化潜变量,或仅在符号生成后应用熵编码。这种设计将表示学习与概率建模解耦,限制了它们利用学习到的语音潜变量的非均匀使用和时间依赖性的能力。本文从率失真角度基准测试神经语音压缩,并进一步研究用于低比特率语音压缩的熵约束编码。我们首先制定了一个统一的基于学习的语音编码流程,并对最近的神经语音编解码器进行了基准测试风格的分析,表明显式概率建模在学习语音压缩中仍未得到充分探索。然后,我们提出了ECC,一种熵约束编解码器,它将标量量化与学习熵模型相结合。ECC集成了基于超先验的边信息、通道上下文建模、潜变量残差预测和轻量级时间建模,以在训练期间估计用于率估计的潜变量似然,并在推理期间进行算术编码。为了进一步提高低比特率效率,ECC引入了熵跳跃,它使用解码器可用的尺度估计省略高度可预测的残差符号,而无需传输额外的跳跃掩码。大量实验表明,ECC在低比特率下实现了优于传统和神经编解码器基线的率失真权衡,在两个广泛使用的测试集上,平均BD-rate在ViSQOL上降低39.9%,在PESQ上降低76.3%。消融和诊断研究进一步验证了熵建模的有效性。项目页面:此 https URL

英文摘要

Learning-based speech compression has achieved promising low-bitrate performance, but many neural speech codecs still describe quantized latents with preset-rate discrete symbols or apply entropy coding only after symbol generation. Such designs decouple representation learning from probability modeling, limiting their ability to exploit the non-uniform usage and temporal dependencies of learned speech latents. In this paper, we benchmark neural speech compression from a rate--distortion perspective and further investigate entropy-constrained coding for low-bitrate speech compression. We first formulate a unified learning-based speech coding pipeline and provide a benchmark-style analysis of recent neural speech codecs, showing that explicit probability modeling remains underexplored in learned speech compression. We then propose ECC, an Entropy-Constrained Codec that combines scalar quantization with a learned entropy model. ECC integrates hyperprior-based side information, channel-wise context modeling, latent residual prediction, and lightweight temporal modeling to estimate latent likelihoods for rate estimation during training and arithmetic coding during inference. To further improve low-bitrate efficiency, ECC introduces entropy skip, which omits highly predictable residual symbols using decoder-available scale estimates without transmitting additional skip masks. Extensive experiments show that ECC achieves a favorable low-bitrate rate--distortion trade-off over conventional and neural codec baselines, reducing BD-rate by 39.9% on ViSQOL and 76.3% on PESQ on average over two widely-used test sets. Ablation and diagnostic studies further validate the effectiveness of entropy modeling. Project Page: https://avery-xu.github.io/ECC-demo/

2606.11629 2026-06-11 math.DS cs.LG 新提交

Integral Formulation of QENDy for Robust Nonlinear System Identification

QENDy的积分形式用于鲁棒非线性系统辨识

Nikhil Saran, Sushant Pokhriyal, Stefan Klus, Rushikesh Kamalapurkar, Joel A. Rosenfeld

发表机构 * Department of Mathematics and Statistics at the University of South Florida(佛罗里达州立大学数学与统计学系) Institute of Engineering and Technology, JK Lakshmipat University(JK拉克什米帕特大学工程与技术学院) School of Mathematical & Computer Sciences at the Heriot–Watt University(赫里奥特-瓦特大学数学与计算机科学学院) University of Florida(佛罗里达大学)

AI总结 提出QENDy方法的积分形式,避免使用时间导数,从而增强对噪声的鲁棒性,实现更稳健的非线性动力学学习。

详情
AI中文摘要

本文提出了新定义的非线性系统二次嵌入方法(QENDy)的积分形式。在原始算法中,使用了轨迹数据点及其时间导数。计算时间导数的方法使算法对噪声敏感。我们的积分形式不使用时间导数,从而得到一种更鲁棒的动力学学习方法。

英文摘要

This manuscript proposes an integral formulation of the newly defined quadratic embedding method for identifying nonlinear systems (QENDy). In the original algorithm, trajectory data points along with their time derivatives are used. Methods for calculating time derivatives make the algorithm sensitive to noise. Our integral formulation does not use the time derivatives. This results in a more robust method to learn the dynamics.

2606.11620 2026-06-11 quant-ph cs.ET cs.LG 新提交

Family-Aware Residual Architecture for Predicting Quantum Circuit Simulation Performance

面向预测量子电路模拟性能的族感知残差架构

Honjar Xing, Yehong Jiang, Xianbang Wang, Zehua Wang, Zhicheng Jiang

发表机构 * IEEE

AI总结 提出族感知残差架构,利用电路族分类和算法指纹特征,预测量子电路模拟的最小近似阈值和运行时间,在7-130量子比特、10个算法族上实现79.5%精确阈值准确率和R²=0.82运行时间相关性。

Comments Accepted as a full paper at IEEE ISVLSI 2026 (QC-CSAA Workshop). To appear in IEEE Xplore. 6 pages, 1 figure, 2 tables

详情
AI中文摘要

近似张量网络模拟器能够对超出精确方法范围的量子电路进行经典模拟,但选择最优近似参数(如键维阈值)仍然是一个成本高昂的试错过程。我们提出了一种族感知神经架构,仅根据电路的OpenQASM描述和执行上下文,即可预测实现目标保真度所需的最小近似阈值以及量子电路模拟的预期挂钟运行时间。我们的关键洞察是,来自不同算法族(例如QFT、Grover、VQE)的量子电路由于其不同的纠缠结构而表现出根本不同的模拟成本曲线。我们采用族条件残差校正——在共享骨干网络之上添加的、针对特定族的加性调整,借鉴了已建立的条件计算技术——使模型能够同时捕获通用电路属性和算法细微差别。该架构包含一个预训练的族分类器(准确率97.5%)和从门组成启发式算法导出的领域信息算法指纹特征。在跨越7-130量子比特、10个算法族的电路上评估,我们的系统实现了79.5%的精确阈值准确率(91.2%在一个阶梯内)和R²=0.82的运行时间相关性,推理时间约为50毫秒——取代了可能需要数分钟到数小时的试错模拟运行。消融研究证实,族感知建模提供了最大的单一性能改进(+3.2个百分点),验证了算法族是模拟成本预测的一等特征的假设。

英文摘要

Approximate tensor-network simulators enable classical simulation of quantum circuits beyond the reach of exact methods, but selecting optimal approximation parameters -- such as bond dimension thresholds -- remains a costly trial-and-error process. We present a family-aware neural architecture that predicts both the minimum approximation threshold required to achieve target fidelity and the expected wall-clock runtime for quantum circuit simulation, given only the circuit's OpenQASM description and execution context. Our key insight is that quantum circuits from different algorithmic families (e.g., QFT, Grover, VQE) exhibit fundamentally distinct simulation cost profiles due to their differing entanglement structures. We employ family-conditioned residual corrections -- additive, family-specific adjustments atop a shared backbone, drawing on established conditional computation techniques -- enabling the model to capture both universal circuit properties and algorithmic nuances. The architecture incorporates a pretrained family classifier (97.5% accuracy) and domain-informed algorithm fingerprint features derived from gate-composition heuristics. Evaluated on circuits spanning 7--130 qubits across 10 algorithm families, our system achieves 79.5% exact threshold accuracy (91.2% within one rung) and $R^2 = 0.82$ runtime correlation, with inference completing in approximately 50 ms -- replacing trial-and-error simulation runs that may take minutes to hours. Ablation studies confirm that family-aware modeling provides the single largest performance improvement (+3.2 percentage points), validating the hypothesis that algorithm family is a first-class feature for simulation cost prediction.

2606.11613 2026-06-11 cs.IR cs.CL cs.HC cs.SI 新提交

Factions Within, Uncertain Across: Within-Document Reader Sub-Groups in Social Highlighting

内部派系,跨文档不确定:社交高亮中的文档内读者子群体

Kazuki Nakayashiki, Keisuke Watanabe

发表机构 * Glasp Inc.(Glasp公司)

AI总结 通过保留边界的曲线球零模型,发现文档内读者形成强子群体,其一致性远超共享显著性预测,且大部分源于细粒度读者特定共识;跨文档稳定性未解决。

Comments 11 pages, 3 figures, 3 tables

详情
AI中文摘要

当许多人高亮同一文档时,人群是单一共识,还是内部结构化为标记不同内容的读者子群体?这种结构是读者的稳定属性还是文档的属性?基于先前工作表明个体文档内高亮信号是低语而个体性存在于选择中,我们在一个共读平台上使用保留边界的曲线球零模型提出群体层面问题。实验1:在文档内,读者形成强子群体——配对一致性远超共享显著性、标记密度和句子流行度所预测的(最近邻一致性z=+6.3,在88%的文档中显著)。在八块区域保留零模型下,与文档相同粗略区域的共享参与解释了约40%的额外一致性;大部分以更细粒度的读者特定一致性存在(z=+3.6,77%显著)。因此,文档内人群在描述意义上是派系化的。实验2:这种分组是稳定的读者特质吗?这里我们诚实地面对统计功效。配对一致性的跨文档分半可重复性在合并后接近零(两个独立抽取样本中分别为+0.078和0.000),功效校准表明该检验仅对共读许多文档的配对有信息。在唯一有信息的高重叠子集(k>=4)中,点估计为正但小样本,在独立抽取样本间不精确,从未显著,并在区域保留零模型下衰减。因此,我们未解决跨文档稳定性:数据与从情境分组到弱至中等稳定读者特质的一切一致。人群在文档内是派系化的;这些派系是否随读者跨文档迁移,诚实地讲,超出了我们的能力范围。

英文摘要

When many people highlight the same document, is the crowd a single consensus, or is it internally structured into reader sub-groups that mark different things -- and is that structure a stable property of a reader or of the document? Building on prior work showing an individual's within-document highlighting signal is a whisper while individuality lives in selection, we ask the group-level question on a co-readership platform using a margin-preserving curveball null. Experiment 1: within a document, readers form strong sub-groups -- pairs agree far beyond what shared salience, mark density, and sentence popularity predict (nearest-neighbour agreement z=+6.3, significant in 88% of documents). Under an eight-block region-preserving null, shared engagement with the same coarse regions of the document accounts for about 40% of this excess; the majority survives as finer reader-specific agreement (z=+3.6, 77% significant). So the within-document crowd is, in a descriptive sense, factional. Experiment 2: is that grouping a stable reader trait? Here we are honest about power. The cross-document split-half reproducibility of a pair's agreement is near zero pooled (+0.078 and 0.000 in two separately drawn samples), and a power calibration shows the test is informative only for pairs that co-read many documents. In the only informative high-overlap subset (k>=4), point estimates are positive but small-sample, imprecise across the separately drawn samples, never significant, and attenuate under the region-preserving null. We therefore leave cross-document stability unresolved: the data is consistent with anything from situational grouping to a weak-to-moderate stable reader trait. The crowd is factional within a document; whether its factions follow the reader across documents is, honestly, beyond our reach.

2606.11596 2026-06-11 eess.SY cs.AI cs.SY 新提交

Model-Based and Data-Driven Hierarchical Control and Topology Co-Design for Robust Networked Systems

基于模型和数据驱动的鲁棒网络系统分层控制与拓扑协同设计

Shirantha Welikala, Zihao Song, Hai Lin, Panos J. Antsaklis

发表机构 * Department of Electrical Engineering, University of Notre Dame(电气工程系,诺特大学)

AI总结 针对线性子系统构成的网络系统,提出基于模型和仅依赖轨迹数据的分层控制策略,结合耗散性理论与线性矩阵不等式实现局部与全局耗散性保证及拓扑优化,并应用于直流微电网的鲁棒电压调节与电流共享。

Comments To be submitted to Automatica

详情
AI中文摘要

本文考虑一类由相互连接的线性子系统、扰动输入和性能输出构成的网络系统。利用耗散性理论,我们首先提出一种基于模型的分层控制设计策略,确保闭环网络系统从扰动输入到性能输出是耗散的。这包括为每个子系统设计局部控制器以强制执行局部耗散性保证,然后利用这些保证协同设计分布式全局控制器和互连拓扑,以在优化互连拓扑成本的同时强制执行全局耗散性保证。整个设计过程仅需求解一系列线性矩阵不等式(LMI)问题,从而保持组合性和可分散性,同时避免低效且集中的非凸迭代设计过程。这种基于模型的分层控制设计策略假设已知子系统动力学,这在许多实际网络系统中可能不成立。受此启发,我们还提出了一种数据驱动的分层控制设计策略,该策略仅假设子系统可获取丰富的输入-状态-输出轨迹数据。所提出的数据驱动设计过程假设影响子系统动力学的未知扰动受二次矩阵不等式约束(放宽了常规界限),并通过使用矩阵S引理来考虑这一点。最后,以直流微电网网络系统为例,验证了所提出的基于模型和数据驱动的分层控制设计在实现鲁棒(耗散)电压调节和电流共享方面的有效性。

英文摘要

In this paper, we consider a class of networked systems comprising an interconnected set of linear subsystems, disturbance inputs, and performance outputs. Using dissipativity theory, we first propose a model-based hierarchical control design strategy to ensure the closed-loop networked system is dissipative from its disturbance inputs to performance outputs. This involves designing local controllers for each subsystem to enforce local dissipativity guarantees, which are then exploited to co-design distributed global controllers and the interconnection topology to enforce global dissipativity guarantees while optimizing interconnection topology costs. The overall design process requires only solving a sequence of linear matrix inequality (LMI) problems, thereby retaining compositionality and decentralizability while avoiding non-convex, iterative design processes that are inefficient and centralized. This model-based hierarchical control design strategy assumes the knowledge of the subsystem dynamics, which may not hold in many real-world networked systems. Motivated by this, we also propose a data-driven hierarchical control design strategy that assumes only the availability of rich input-state-output trajectory data from the subsystems. The proposed data-driven design process assumes that the unknown disturbances affecting the subsystem dynamics are bounded by a quadratic matrix inequality (relaxing conventional bounds) and accounts for this by using the matrix S-lemma. Finally, the effectiveness of the proposed model-based and data-driven hierarchical control designs is illustrated for a networked system representing a DC microgrid, with the aim of enforcing robust (dissipative) voltage regulation and current sharing.