arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

AI for Science

科学智能、蛋白质、分子、药物、材料、气象、物理和数学 AI。

2026-06-19 至 2026-06-19 收录 478 信号源:cs.LG, q-bio, physics, cond-mat, math, stat.ML

1. 材料化学 5 篇

2508.05762 2026-06-19 cond-mat.mtrl-sci cs.LG 版本更新 90%

Evaluating Universal Machine Learning Force Fields Against Experimental Measurements

评估通用机器学习力场与实验测量的对比

Sajid Mannan, Vaibhav Bihani, Carmelo Gonzales, Kin Long Kelvin Lee, Nitya Nand Gosvami, Sayan Ranu, Santiago Miret, N M Anoop Krishnan

发表机构 * Department of Civil Engineering, Indian Institute of Technology Delhi(印度理工学院德里土木工程系) Yardi School of Artificial Intelligence, Indian Institute of Technology Delhi(印度理工学院德里人工智能学院) Intel Labs, California, USA(美国加州英特尔实验室) Department of Materials Science and Engineering, Indian Institute of Technology Delhi(印度理工学院德里材料科学与工程系) Department of Computer Science and Engineering, Indian Institute of Technology Delhi(印度理工学院德里计算机科学与工程系)

专题命中 材料化学 :评估通用机器学习力场在材料科学中的应用。

AI总结 提出UniFFBench框架和MinX数据集,系统评估六种通用机器学习力场,发现模型在计算基准上表现优异但在实验复杂性下存在显著“现实差距”,密度预测误差高于实际应用阈值。

详情
AI中文摘要

通用机器学习力场(UMLFFs)有望通过实现跨元素周期表的快速原子模拟来革新材料科学。然而,它们的评估一直局限于可能无法反映实际性能的计算基准。我们引入了UniFFBench,一个全面的评估框架,包含MinX数据集——一个涵盖85种元素、极端热力学条件(0–5000 K, 0–1000 GPa)和结构复杂性(包括部分占据和无序)的1500多种矿物系统的多样化集合。这种多样性,结合用于验证的实验参考值,使得能够评估UMLFF在化学空间和条件上的泛化能力,这些条件远超典型的训练场景。我们对六种最先进的UMLFF的系统评估揭示了一个显著的“现实差距”:在计算基准上表现令人印象深刻的模型在面对实验复杂性时常常失败。即使是最好的模型也表现出高于实际应用所需阈值的密度预测误差。我们观察到模拟稳定性和力学性能准确性之间的脱节,预测误差与训练数据表示相关,而非建模方法。

英文摘要

Universal machine learning force fields (UMLFFs) promise to revolutionize materials science by enabling rapid atomistic simulations across the periodic table. However, their evaluation has been limited to computational benchmarks that may not reflect real-world performance. We introduce UniFFBench, a comprehensive evaluation framework featuring the MinX dataset -- a diverse collection of 1,500+ mineral systems spanning 85 elements, extreme thermodynamic conditions (0--5000 K, 0--1000 GPa), and structural complexity, including partial occupancy and disorder. This diversity, combined with experimental reference values for validation, enables assessment of UMLFF generalization across chemical space and conditions substantially beyond typical training scenarios. Our systematic evaluation of six state-of-the-art UMLFFs reveals a substantial ``reality gap'': models achieving impressive performance on computational benchmarks often fail when confronted with experimental complexity. Even the best-performing models exhibit higher density prediction error than the threshold required for practical applications. We observe disconnects between simulation stability and mechanical property accuracy, with prediction errors correlating with training data representation rather than the modeling method.

2503.02710 2026-06-19 cond-mat.mtrl-sci 版本更新 90%

Four regimes of primary radiation damage in tungsten

钨中初级辐射损伤的四个区域

Jesper Byggmästar, Ville-Markus Yli-Suutala, Aslak Fellman, Jan Åström, Jan Westerholm, Fredric Granberg

专题命中 材料化学 :模拟钨中辐射损伤,用于聚变反应堆材料

AI总结 通过机器学习驱动的大规模分子动力学模拟,发现钨中初级损伤随能量变化呈现四个区域,其中高能区偏离所有现有模型,且该区域起始能量与聚变中子对钨原子的最大反冲能量一致。

详情
AI中文摘要

我们首次在硅中观察到钨初级损伤产生向线性区域的转变。作为聚变反应堆中的关键等离子体 facing 材料,钨的辐射损伤已在实验和模拟中得到广泛研究。辐照实验通常产生MeV范围内的反冲,而全原子建模仅限于几百keV。在这里,我们通过极大规模且精确的机器学习驱动的分子动力学模拟,在高达20亿原子的系统中,以高达2 MeV的反冲能量桥接了这些尺度。我们揭示了作为损伤能量函数的四个初级损伤区域,其中向高能区域的转变偏离了所有先前的模型。奇怪的是,高能区域的起始与聚变发射中子对钨原子的最高可能反冲能量(300 keV)相吻合。

英文摘要

We observe for the first time in silico the transition to a linear regime in the primary damage production in tungsten. As the critical plasma-facing material in fusion reactors, radiation damage in tungsten has been studied extensively in experiments and simulations. Irradiation experiments routinely produce recoils in the MeV range while full atomistic modelling has been limited to a few hundred keV. Here we bridge these scales with extremely large-scale and accurate machine-learning-driven molecular dynamics simulations with recoil energies up to 2 MeV in systems up to one billion atoms. We reveal four regimes of primary damage as a function of damage energy, with a transition to a high-energy regime that deviates from all previous models. Curiously, the start of the high-energy regime coincides with the highest possible recoil energy to tungsten atoms from fusion-emitted neutrons (300 keV).

2502.18859 2026-06-19 cond-mat.mtrl-sci 90%

Machine Learning a Phosphor's Excitation Band Position

机器学习发光体的激发带位置

Nakyung Lee, Małgorzata Sójka, Annie La, Syna Sharma, Seán Kavanagh, Docheon Ahn, David O. Scanlon, Jakoah Brgoch

专题命中 材料化学 :机器学习预测发光材料激发波长

AI总结 本文提出基于极端梯度提升的机器学习方法,用于预测发光材料的最长激发波长,通过实验验证了Ce³+离子取代位点的数据,成功合成新型蓝光激发绿光发射材料。

Journal ref ACS Appl Mater Interfaces 2026 18 23 32921

详情
AI中文摘要

创建高性能的稀土激活无机发光材料对推进高效LED照明和背光平板显示器至关重要。这些发光材料必须具备有效吸收/激发由蓝色InGaN LED转换为白光的能力。稀土的5d₁激发态能级,决定激发峰位置,受无机宿主结构影响,包括局部环境、晶体结构和组成,使提前预测具有挑战性。本研究引入了一种新的极端梯度提升机器学习方法,定量确定发光材料的最长(最低能量)激发波长。我们专注于Ce³+的4f→5d跃迁,因其在激发和漫反射光谱中观察到明确的5d₁能级。模型在357个Ce³+离子取代位点的实验数据上进行训练,并通过成功合成新型蓝光激发绿光发射材料Ca₂SrSc₆O₁₂:Ce³+进行实验验证。该化合物在商用蓝光LED波长下的激发与模型预测高度一致。这些结果突显了数据驱动方法在加速下一代LED照明蓝光吸收发光材料发现中的变革潜力。

英文摘要

Creating superior lanthanide-activated inorganic phosphors is pivotal for advancing energy-efficient LED lighting and backlit flat panel displays. The most fundamental property these luminescent materials must possess is effective absorption/excitation by a blue InGaN LED for practical conversion into white light. The 5$d_1$ excited state energy level of lanthanides, which determines the excitation peak position, is influenced by the inorganic host structure, including the local environment, crystal structure, and composition, making it challenging to predict in advance. This study introduces a new extreme gradient boosting machine learning method that quantitatively determines a phosphor's longest (lowest energy) excitation wavelength. We focus on the Ce$^{3+}$ 4$f$ $\rightarrow$ 5$d$ transition due to its well-defined 5$d_1$ energy level observed in excitation and diffuse reflectance spectra. The model was trained on experimental data for 357 Ce$^{3+}$ cation substitution sites sourced from literature and in-house measurements and ultimately experimentally validated through the successful synthesis of a novel, blue-excited, green-emitting phosphor: Ca$_2$SrSc$_6$O$_{12}$:Ce$^{3+}$. This compound's excitation under commercial blue LED wavelength aligned remarkably well with the model's predictions. These results highlight the transformative potential of data-driven approaches in expediting the discovery of blue-absorbing phosphors for next-generation LED lighting.

2606.19600 2026-06-19 physics.comp-ph 新提交 85%

Machine-learned prediction of carbon interstitial clusters in diamond

金刚石中碳间隙簇的机器学习预测

Xiaoya Chang, Arsalan Hashemi, Nima Ghafari Cherati, Mikko Karttunen, Ádám Gali, Tapio Ala-Nissila

专题命中 材料化学 :机器学习预测金刚石碳间隙簇,属于材料科学

AI总结 通过主动学习构建间隙数据集,并基准测试三种机器学习原子间势,发现MACE势能准确预测能量和稳定性,而分子动力学模拟揭示了新的碳间隙簇及其亚稳态机制。

详情
AI中文摘要

金刚石中承载着对量子技术至关重要的光学活性点缺陷,然而在生长和辐照过程中引入的碳自间隙原子会与它们竞争并形成新缺陷,其构型景观由于微妙的能量差异控制着竞争极小值和路径而鲜有研究。这里我们通过主动学习构建了一个以间隙为中心的数据集,并基准测试了三种机器学习原子间势——GAP、NEP和等变MACE——与密度泛函理论在能量、力和迁移势垒方面的表现。MACE再现了参考能量学和相对稳定性,而其他势可能错误排序基态。使用经过验证的势进行退火分子动力学,揭示了一系列先前未报道的碳间隙簇,从双间隙到八间隙——其中几个引入了作为色心感兴趣的带隙态——并表明它们的亚稳态由动力学可及路径而非能量排序控制。这些结果绘制了间隙缺陷景观,并加速了量子技术的缺陷发现。

英文摘要

Diamond hosts optically active point defects central to quantum technologies, yet the carbon self-interstitials introduced during growth and irradiation compete with them and form new defects whose configurational landscape is poorly charted, as subtle energy differences govern the competing minima and pathways. Here we build an interstitial-focused dataset by active learning and benchmark three machine-learning interatomic potentials -- GAP, NEP and the equivariant MACE -- against density functional theory for energies, forces and migration barriers. MACE reproduces the reference energetics and relative stabilities, whereas the others can misorder the ground states. Annealing molecular dynamics with the validated potentials uncovers a series of previously unreported carbon interstitial clusters, from di- to octa-interstitials -- several introducing in-gap states of interest as colour centres -- and shows that their metastability is governed by kinetically accessible pathways rather than energetic ordering. These results chart the interstitial defect landscape and accelerate defect discovery for quantum technologies.

2606.19557 2026-06-19 physics.comp-ph 新提交 85%

TorchNEP: Ultra-Efficient and Accurate Training of Neuroevolution Potentials

TorchNEP:神经演化势的超高效和精确训练

Yong-Chao Wu, Xiaoya Chang, Tero Mäkinen, Amin Esfandiarpour, Jian-Li Shao, Tapio Ala-Nissila, Zheyong Fan, Mikko Alava

专题命中 材料化学 :神经演化势训练加速,属于材料科学智能

AI总结 提出基于PyTorch的TorchNEP框架,通过解析梯度、自适应优化和两阶段训练策略,将NEP训练加速两个数量级以上,并提升预测精度。

详情
AI中文摘要

神经演化势(NEP)是大规模原子模拟中最有效的机器学习原子间势框架之一。然而,其原始训练策略计算需求仍然很高,限制了模型架构和训练协议的系统探索。在这里,我们提出TorchNEP,一种基于PyTorch的NEP实现,它结合了解析推导的梯度、自适应优化和两阶段训练策略。TorchNEP将训练加速两个数量级以上,同时保持与现有NEP模型的完全兼容性。我们进一步表明,预测精度的提高主要源于两阶段训练协议,而非优化算法本身。在多样化的基准数据集上,TorchNEP持续改进力和应力预测,同时保持相当或更好的能量精度。对元素和合金系统的基准评估表明,对原子构型和关键材料性能的预测性能均得到增强。此外,我们表明增加模型复杂性并不一定能提高预测性能,尽管减少了训练误差。总体而言,TorchNEP为开发更准确和鲁棒的机器学习原子间势提供了一个高效且灵活的训练框架。

英文摘要

Neuroevolution Potential (NEP) is one of the most efficient machine-learned interatomic potential frameworks for large-scale atomistic simulations. However, its original training strategy remains computationally demanding, limiting systematic exploration of model architectures and training protocols. Here, we present TorchNEP, a PyTorch-based implementation of NEP that combines analytically derived gradients, adaptive optimization, and a two-stage training strategy. TorchNEP accelerates training by more than two orders of magnitude while maintaining full compatibility with existing NEP models. We further show that the improvement in predictive accuracy primarily originates from the two-stage training protocol rather than the optimization algorithm itself. Across diverse benchmark datasets, TorchNEP consistently improves force and stress predictions while maintaining comparable or improved energy accuracy. Benchmark evaluations on elemental and alloy systems demonstrate enhanced predictive performance for both atomic configurations and key materials properties. Furthermore, we show that increasing model complexity does not necessarily improve predictive performance despite reducing training errors. Overall, TorchNEP provides an efficient and flexible training framework for developing more accurate and robust machine-learned interatomic potentials.

2. 其他科学智能 9 篇

2606.19737 2026-06-19 stat.ME stat.ML 新提交 85%

Calibration without labels in multiple testing

多重检验中的无标签校准

Adway S. Wadekar, Jake A. Soloff

专题命中 其他科学智能 :提出多重检验无标签校准方法,应用于统计和神经科学

AI总结 针对多重检验中无法观测真实标签的难题,利用有序p值间距构造伪标签,实现局部错误发现率的校准,并揭示q值在心理学和神经科学文献中可能严重失准。

详情
AI中文摘要

大规模假设检验支持对单个假设的概率性声明,如经验贝叶斯方法估计局部错误发现率。我们研究如何将这些声明解释为原假设的近似校准预测,即使在模型误设定下也能产生可解释的错误概率。我们的方法从概率预测中汲取概念灵感,但面临不同的挑战:与预测不同(标签最终可观测),在多重检验中真实情况从未揭示,因此校准必须随机评估并间接建立。我们通过构造一组伪标签来应对这一挑战,这些伪标签源自有序$p$值的间距,并以局部错误发现率作为回归目标。我们的构造解锁了现有工具,用于评估和执行多重检验中的事后校准。值得注意的是,我们在对已发表的心理学和神经科学文献的大规模实证调查中发现,基于错误发现率的流行误差度量$q$值可能严重失准。

英文摘要

Large-scale hypothesis testing supports probability claims about individual hypotheses, as in empirical Bayes methods for estimating local false discovery rates. We study how such claims can be interpreted as approximately calibrated forecasts of the null hypothesis, yielding interpretable error probabilities even under model misspecification. Our approach draws conceptual inspiration from probabilistic forecasting but addresses a different challenge: unlike forecasting, where labels are eventually observed, in multiple testing the ground truth is never revealed, so calibration must be assessed stochastically and established indirectly. We address this challenge by constructing a set of pseudo-labels, derived from the spacings of ordered $p$-values, which have the local false discovery rate as their regression target. Our construction unlocks existing tools for assessing and performing post-hoc calibration in multiple testing. Notably, we find on a large-scale empirical survey of published psychology and neuroscience literature that the $q$-value, a popular error measure based on the false discovery rate, can be severely miscalibrated.

2606.19762 2026-06-19 q-bio.MN 新提交 85%

Oscillations and Spatial Patterns in Large-Scale Stochastic Gene Regulatory Networks

大规模随机基因调控网络中的振荡与空间模式

Manuel Eduardo Hernández-García, Jorge Velázquez-Castro

专题命中 其他科学智能 :分析基因调控网络振荡与空间模式,数学建模

AI总结 研究负反馈与扩散的循环基因调控网络,通过确定性和随机方法分析其稳定性,发现随机波动可诱导图灵失稳,为理解发育中的模式形成提供新视角。

Comments 16 pages, 10 figures

详情
AI中文摘要

基因调控网络(GRNs)是细胞生长和组织形成的基础,在发育过程中协调基因表达的时空调控。这些网络固有地受到分子噪声引起的内在波动的影响,因此分析其稳定性对于理解生物体稳健的模式形成和发育动力学至关重要。在本研究中,我们分析了具有负反馈和扩散的循环GRNs的稳定性和动力学,考虑了确定性和随机方法。在确定性情况下,系统表现出稳定性与不稳定性之间的分岔,导致无扩散时的Hopf失稳和包含扩散时的Turing-Hopf失稳。观察到空间域的离散化引入了额外的不稳定模式,从而允许更广泛的模式。基于二阶矩方法的随机框架包含了内在波动,揭示了对于小系统尺寸,即使系统在无扩散时是稳定的,波动也可以主导动力学并诱导随机Turing失稳。值得注意的是,即使所有变量具有相同的扩散速率,Turing失稳也可能出现。所开发的框架提供了一种系统的方法来分析具有扩散的高维随机系统的稳定性,从而简化了Turing和Turing-Hopf失稳的预测。这些发现有助于更深入地理解GRNs中的复杂动力学和模式形成,对细胞分化和发育等生物过程具有潜在意义。

英文摘要

Gene regulatory networks (GRNs) are fundamental to cellular growth and tissue formation, orchestrating spatially and temporally regulated gene expression during development. These networks are inherently subject to intrinsic fluctuations arising from molecular noise, making the analysis of their stability essential for understanding robust pattern formation and developmental dynamics of the organism. In this study, we analyze the stability and dynamics of cyclic GRNs with negative feedback and diffusion, considering both deterministic and stochastic approaches. In the deterministic case, the system exhibits a bifurcation between stability and instability, leading to Hopf instability in the absence of diffusion and to Turing-Hopf instability when diffusion is included. It was observed that the discretization of the spatial domain introduces additional unstable modes, enabling a wider range of patterns. The stochastic framework based on the second-moment approach, which incorporates intrinsic fluctuations, reveals that for small system sizes, fluctuations can dominate the dynamics and induce stochastic Turing instability, even when the system is stable in the absence of diffusion. Notably, Turing instabilities can emerge even when all variables have the same diffusion rate. The developed framework provides a systematic method for analyzing the stability of high-dimensional stochastic systems with diffusion, thereby simplifying the prediction of Turing and Turing-Hopf instabilities. These findings contribute to a deeper understanding of the complex dynamics and pattern formation in GRNs, with potential implications for biological processes, such as cellular differentiation and development.

2606.19396 2026-06-19 q-bio.QM 新提交 85%

BioHarness: Substrate-Aware Evidence Assembly for Biomedical Question Answering across Literature, Knowledge Bases, and Biological Atlases

BioHarness:面向生物医学问答的底物感知证据组装——跨文献、知识库和生物图谱

Meng Xiao, Chuan Qin, Jinmiao Chen, Yihang Cheng, Yuanchun Zhou, Hengshu Zhu

专题命中 其他科学智能 :面向生物医学问答的检索增强生成系统

AI总结 提出BioHarness,通过级联控制机制在文献检索、知识库和生物图谱间选择性组装证据,提升生物医学问答准确率,在19,302个问答项上得分从65.9提升至71.0。

Comments 14 Pages, 11 Figures, Keywords: biomedical question answering; retrieval-augmented generation; large language models; evidence assembly; biomedical knowledge bases; biological atlases

详情
AI中文摘要

动机:生物医学问答通常需要超越主题检索文献的证据,包括基因别名解析、数据库标识符标准化以及来自图谱的生物测量值。然而,现有的检索增强生成(RAG)系统通常遵循固定工作流程,缺乏明确机制来决定何时检索文本足够、何时需要经过整理的生物医学知识、或何时应调用对结构化测量值的可执行证据组装。这激发了一种底物感知的大语言模型(LLM)框架,能够跨文献、知识库和生物图谱选择性地组装足够的证据。结果:我们引入BioHarness,一种用于分阶段生物医学证据组装的LLM框架,涵盖文献检索、经过整理的生物医学知识资源以及来自图谱的结构化测量值。BioHarness首先尝试根据重排序的文献证据回答问题,并通过基于接地级联控制,仅在当前证据不确定、接地不足或底物不匹配时升级到REPL风格的证据组装。在涵盖七种答案格式的19,302个生物医学问答项上,BioHarness将最强非预言基线的综合得分从65.9提升至71.0。消融实验、案例研究和骨干扩展分析表明,这些提升源于通过重排序、实体接地和结构化测量访问修复证据-底物不匹配,而非不加区分地调用更多推理步骤、检索更多文献或依赖特定答案模型规模。

英文摘要

Motivation: Biomedical question answering often requires evidence beyond topically retrieved literature, including gene alias resolution, database identifier normalization, and atlas-derived biological measurements. However, existing retrieval-augmented generation (RAG) systems typically follow a fixed workflow and lack an explicit mechanism for deciding when retrieved text is sufficient, when curated biomedical knowledge is required, or when executable evidence assembly over structured measurements should be invoked. This motivates a substrate-aware large language model (LLM) harness that selectively assembles sufficient evidence across literature, knowledge bases, and biological atlases. Results: We introduce BioHarness, an LLM harness for staged biomedical evidence assembly across literature retrieval, curated biomedical knowledge resources, and atlas-derived structured measurements. BioHarness first attempts to answer from reranked literature evidence and escalates through grounded cascade control to REPL-style evidence assembly only when the current evidence is uncertain, weakly grounded, or substrate-mismatched. Across 19,302 biomedical QA items spanning seven answer formats, BioHarness improves the pooled score from 65.9 to 71.0 over the strongest non-oracle baseline. Ablations, case studies, and backbone-scaling analyses show that these gains arise from repairing evidence-substrate mismatches through reranking, entity grounding, and structured measurement access, rather than from indiscriminately invoking more reasoning steps, retrieving additional literature, or relying on a particular answer-model scale.

2606.20451 2026-06-19 stat.ML cs.LG stat.AP stat.CO 新提交 85%

SSH-Net: A Deep Neural Network for Predicting Failure Time Distribution Functions under Competing Risks with Application to GPU Data

SSH-Net: 一种用于竞争风险下预测失效时间分布函数的深度神经网络及其在GPU数据上的应用

Jie Min, Yueyao Wang, Mengkun Chen

发表机构 * Department of Mathematics & Statistics, University of South Florida(佛罗里达州立大学数学与统计学系) School of Statistics and Data Science, Zhejiang Gongshang University(浙江工商大学统计与数据科学学院) Department of Statistics, Virginia Tech(弗吉尼亚理工学院统计学系)

专题命中 其他科学智能 :提出深度神经网络预测失效时间,应用于GPU数据,属于科学智能

AI总结 提出结构化分段风险深度神经网络(SSH-Net),通过将网络结构与数据结构关联,允许不同协变量组通过子网络影响预测,在竞争风险框架下预测失效时间分布函数,仿真和GPU数据验证了准确性。

详情
AI中文摘要

竞争风险在工程领域常见,当应用场景复杂时会给时间事件数据建模带来挑战。近年来,深度神经网络因其灵活性和高学习能力在竞争风险预测中受到广泛关注。然而,神经网络结构的复杂性使得基于不同数据输入的超参数调优更加困难。此外,当工程系统具有多层级的复杂物理结构时,将所有结构层级视为单一输入组可能无法捕捉关键信息。为解决这些问题,我们提出了一种结构化分段风险深度神经网络(SSH-Net),用于在特定原因竞争风险框架下预测失效时间。我们的方法将神经网络结构与数据结构相关联,并允许不同的协变量组通过分离的子网络影响失效预测。神经网络基于特定原因竞争风险模型构建。SSH-Net输出特定原因风险函数,并采用惩罚对数似然作为损失函数。通过评估Brier分数、接收者操作特征曲线下面积(AUC)和预测的特定原因累积发生函数的均方根误差(RMSE),仿真研究验证了SSH-Net的预测准确性。我们进一步使用Titan GPU失效时间数据展示了模型预测失效时间分布函数的能力。

英文摘要

Competing risks are commonly observed in engineering fields and can bring challenges to time-to-event data modeling when the application scenarios are complicated. Recently, deep neural networks have received great attention for prediction with competing risks, due to their flexibility and high learning capability. However, the complexity of neural network structure brings extra difficulty in hyperparameter tuning based on different data inputs. Additionally, when an engineered system has complex physical structures with multiple hierarchical levels, treating all structural levels as a single group of inputs may fail to capture critical information. To address the issues, we propose a Structured Segmented Hazard Deep Neural Network (SSH-Net) for failure time prediction under cause-specific competing risks framework. Our approach associates neural network structure with data structures, and allows different covariate groups to impact the failure prediction through separate sub-networks. The neural network is constructed based on a cause-specific competing risks model. The SSH-Net outputs cause-specific hazard functions, and utilizes the penalized log-likelihood as the loss function. The prediction accuracy of SSH-Net is validated through simulation studies by evaluating the Brier score, the area under receiver operating characteristic curves (AUC), and the root mean square error (RMSE) of the predicted cause-specific cumulative incident function. We further demonstrate the model's ability to predict failure time distribution functions using the Titan GPU failure time data.

2606.19643 2026-06-19 stat.ML cs.LG 新提交 85%

Variational Consensus Monte Carlo for Bayesian Mixture

变分共识蒙特卡洛用于贝叶斯混合模型

Julie Fendler, Francesca L. Crowe, Tom Marshall, Sylvia Richardson, Paul D. W. Kirk

发表机构 * MRC Biostatistics Unit, University of Cambridge(剑桥大学生物统计学单位) Institute of Applied Health Research, University of Birmingham(伯明翰大学应用健康研究学院)

专题命中 其他科学智能 :提出贝叶斯混合模型用于联邦学习,在电子健康记录数据上验证

AI总结 提出变分共识蒙特卡洛方法扩展至过拟合贝叶斯混合模型,通过新颖的聚类匹配算法和聚合策略,在联邦学习设置下推断聚类数和所有参数,并在模拟和真实电子健康记录数据上验证了有效性。

详情
AI中文摘要

受健康数据的隐私、敏感性和共享限制的驱动,我们提出了一个在联邦学习设置下(即数据无法在计算节点之间完全共享或汇集)对贝叶斯混合模型进行推断的全面流程。我们采用共识蒙特卡洛(CMC)方法,在每个数据孤岛内独立运行MCMC算法以估计局部后验分布,然后聚合这些分布以近似完整数据的后验。Rabinovich, Angelino 和 Jordan (2015) [1] 的变分CMC方法将聚合步骤视为变分推断问题,但他们应用于混合模型时假设聚类数和关键混合参数已知。我们的主要方法贡献是:(i) 将变分CMC扩展到过拟合贝叶斯混合模型,该模型推断聚类数和所有模型参数,无需共轭性;(ii) 适用于跨孤岛设置的新颖聚类匹配算法,其中并非每个聚类都出现在每个局部数据集中;(iii) 针对聚合步骤的多种推断策略,匹配不同的联邦学习约束;以及 (iv) 在实践中选择这些策略的指南。一项全面的模拟研究验证了该框架,并允许我们与最先进的联邦学习替代方法进行比较。值得注意的是,我们表明当局部数据集的组成反映了数据中的底层聚类结构时,我们的方法可以比应用于汇集数据的标准MCMC更准确地恢复小聚类。我们在大规模电子健康记录数据上展示了该框架,识别了英国老年人群中的多发病模式。

英文摘要

Motivated by the privacy, sensitivity and sharing limitations of health data, we present a comprehensive pipeline for inference of Bayesian mixture models within a federated learning setting, i.e. when data cannot be fully shared or pooled across compute nodes. We adopt a Consensus Monte Carlo (CMC) approach, in which an MCMC algorithm is run independently within each data silo to estimate local posterior distributions, which are then aggregated to approximate the posterior over the full data. The variational CMC approach of Rabinovich, Angelino and Jordan (2015) [1] frames the aggregation step as a variational inference problem, but their application to mixtures assumes the number of clusters and key mixture parameters to be known. Our main methodological contributions are: (i) an extension of variational CMC to over-fitted Bayesian mixture models that infer the number of clusters and all model parameters, without requiring conjugacy; (ii) novel cluster-matching algorithms suitable for cross-silo settings in which not every cluster appears in each local dataset; (iii) a number of inference strategies for the aggregation step, matched to different federated learning constraints; and (iv) guidelines for choosing among these in practice. A comprehensive simulation study validates the framework and allows us to compare to state-of-the-art federated learning alternatives. Notably, we show that when the composition of local datasets reflects the underlying clustering structure in the data, our approach can recover small clusters with greater accuracy than standard MCMC applied to the pooled data. We illustrate the framework on large-scale electronic health record data, identifying multi-morbidity patterns in a British geriatric population.

2606.20480 2026-06-19 math.ST stat.ML stat.TH 新提交 85%

Leveraging tails for adaptation

利用尾部进行自适应

Sergios Agapiou, Ismaël Castillo, Paul Egels

专题命中 其他科学智能 :研究非参数贝叶斯后验收缩率,应用于白噪声回归和ReLU神经网络

AI总结 研究非参数贝叶斯中基于p-指数尾先验的后验收缩率,发现p越小收缩越快,且p→0时可实现光滑性自适应,应用于白噪声回归和ReLU神经网络。

Comments 59 pages, 3 figures

详情
AI中文摘要

我们考虑非参数设定下贝叶斯后验分布的收缩,其中函数在基或字典上的系数被赋予具有$p$指数尾的先验,包括拉普拉斯尾$(p=1)$和更重的尾$(p<1)$。结果表明,随着$p$减小,收缩率提高,并且在适当的$p\to 0$范围内,可以获得对光滑性的完全自适应(达到对数因子)。作为应用,我们考虑了白噪声回归中的级数先验和随机设计回归中的浅层ReLU神经网络。特别地,我们表明过参数化的浅层ReLU网络可以适应任何正则性$0\le \beta\le 2$。通过模拟研究,我们展示了与理论预测行为的高度实证一致性。

英文摘要

We consider contraction of Bayesian posterior distributions in nonparametric settings where coefficients of a function over a basis or dictionary are given priors with $p$--exponential tails, including Laplace tails $(p=1)$ and heavier tails $(p<1)$. It is shown that contraction rates improve as $p$ decreases and that full adaptation to smoothness, up to logarithmic factors, is obtained in an appropriate $p\to 0$ regime. As applications, we consider both series priors in white noise regression and shallow ReLU neural networks in random design regression. In particular, we show that overparametrised shallow ReLU networks can adapt to any regularity $0\le β\le 2$. Through a simulation study, we show strong empirical agreement with the behavior predicted by our theory.

2606.19524 2026-06-19 physics.ed-ph hep-ph 新提交 85%

Vistas: A Visualization Interface for Particle Collision Simulations

Vistas:粒子碰撞模拟的可视化界面

Benoit Assi, Christan Bierlich, Rikab Gambhir, Philip Ilten, Tony Menzo, Stephen Mrenna, Manuel Szewc, Michael K. Wilkinson, Ahmed Youssef, Jure Zupan

专题命中 其他科学智能 :可视化粒子碰撞模拟,用于物理教育

AI总结 提出Vistas工具,利用浏览器事件显示框架Phoenix可视化Pythia模拟的高能粒子碰撞各阶段,通过交互式3D图结构展示粒子,支持旋转、缩放和筛选,适用于物理教育。

Comments 20 pages, 9 figures, public code available

详情
AI中文摘要

我们介绍Vistas,一个用于可视化由Pythia蒙特卡洛事件生成器模拟的高能粒子物理碰撞的工具。Vistas利用基于浏览器的事件显示框架Phoenix,展示高能碰撞事件模拟的不同计算阶段:硬过程、部分子簇射、强子化和粒子衰变。每个阶段产生的粒子被表示为交互式三维图结构中的线条,每条线沿其粒子三维动量矢量的方向。事件可以旋转、平移和缩放,通过选择相关粒子线可以访问每个粒子的详细信息。此外,模拟所有阶段的粒子线可以切换开关,并可以通过粒子级运动学选择要求进行过滤。这种交互式环境提供了对Pythia模拟输出的直观解释,包括颜色流、束流残余和多重部分子相互作用等详细特征,使其成为物理教育环境中的有用工具,从外展活动到研究生粒子物理课程。

英文摘要

We introduce Vistas, a tool for visualizing high-energy particle physics collisions simulated by the Pythia Monte-Carlo event generator. Vistas utilizes the browser-based event display framework Phoenix to show distinct computational stages of a high-energy collision event simulation: the hard process, parton shower, hadronization, and particle decays. Particles produced from each of these stages are represented as lines in an interactive three-dimensional graph structure, where each line is along the direction of its particle's three-momentum vector. The event can be rotated, translated and zoomed, and details for each particle can be accessed by selecting the relevant particle line. Additionally, particle lines from all stages of the simulation can be toggled on and off and can be filtered by particle-level kinematic selection requirements. This interactive environment provides an intuitive interpretation of Pythia simulation output, including detailed features such as color flow, beam remnants, and multiple parton interactions, making it a useful tool in physics education settings, from outreach activities to graduate particle-physics courses.

2604.21804 2026-06-19 physics.ins-det hep-ex hep-ph 版本更新 85%

Agentic-AI Detector Co-design and Optimization in Vertically-Integrated Differentiable Full Simulations

Agentic-AI探测器协同设计与优化在垂直集成可微分全模拟中

Wonyong Chung, Qibin Liu, Liangyu Wu, Julia Gonski

专题命中 其他科学智能 :高能物理探测器设计优化

AI总结 提出双层级优化框架,将AI智能体集成到高能物理探测器设计中,通过可微分全模拟联合优化几何、前端数字化和重建算法参数,在竞争性能指标下找到最优设计点。

Comments 7 pages, 3 figures

详情
AI中文摘要

我们首次实现了AI智能体在高能物理实验探测器设计与优化中的应用,通过一个双层级优化框架,在可微分全模拟中垂直集成探测器几何、前端数字化和高层重建算法参数。以基线分辨率为$3\\%/\sqrt{E}$的双读出分段晶体电磁量能器为例,我们研究了AI智能体在识别和减少关键探测器参数以及非线性遍历设计空间方面的能力和价值。我们发现,当前前沿的LLM推理模型,在未提供额外实验特定上下文的情况下,能够有效执行复杂工作流,并主动提出通用但相关的进一步研究或改进方向。在此,我们展示了AI智能体在三个竞争性能指标中寻找最优设计点的能力,表明将智能体有效集成到前沿研究领域的复杂工作流中,可以在减少劳动和计算的同时,提高关键物理目标的性能。本研究为未来首次完全由AI设计的探测器在科学设施中的应用奠定了基础。

英文摘要

We present the first implementation of AI agents into the design and optimization of detectors in high-energy physics experiments via a bi-level optimization framework that vertically integrates detector geometry, front-end digitization, and high-level reconstruction algorithm parameters in differentiable full simulations. Using the example of a dual-readout, segmented crystal EM calorimeter with a baseline resolution of $3\%/\sqrt{E}$, we investigate the capabilities and value propositions of AI agents in the identification and reduction of key detector parameters and in the nonlinear traversal of design space. We find that frontier LLM reasoning-models today, without being given additional experiment-specific context, are able to effectively execute complex workflows and proactively suggest generic but relevant avenues for further study or improvement. Here, we demonstrate an AI agent's ability to find an optimal design point amidst three competing performance criteria, showing that effective integration of agents into the complex workflows of frontier research areas can yield higher performance for key physics goals while reducing labor and compute. This study establishes the foundation for a future demonstration of the first fully AI-designed detector for future scientific facilities.

2606.20437 2026-06-19 hep-ex cs.LG 新提交 85%

HEPTv2: End-to-End Efficient Point Transformer for Charged Particle Reconstruction

HEPTv2:用于带电粒子重建的端到端高效点变换器

Siqi Miao, Shitij Govil, Jack P. Rodgers, Mia Liu, Javier Duarte, Shih-Chieh Hsu, Yuan-Tang Chou, Pan Li

发表机构 * School of Electrical and Computer Engineering, Georgia Institute of Technology(佐治亚理工学院电气与计算机工程学院) Department of Physics and Astronomy, Purdue University(普渡大学物理与天文学系) Department of Physics, University of California San Diego(加州大学圣地亚哥分校物理系) Department of Physics, University of Washington(华盛顿大学物理系)

专题命中 其他科学智能 :点变换器用于粒子物理轨迹重建

AI总结 提出HEPTv2,一种端到端点变换器架构,通过局部敏感哈希编码和扇区化解码,无需图构建即可从探测器击中点直接重建粒子轨迹,在TrackML上以0.8%假率实现98.6%追踪效率,延迟仅15ms。

详情
AI中文摘要

带电粒子追踪——从稀疏探测器测量中重建轨迹——是一个基础的高能物理推理问题,也是在极端组合歧义下学习的典型例子。在高亮度大型强子对撞机(HL-LHC)上,尽管碰撞密度前所未有,追踪必须保持准确和高效。图神经网络表现强劲,但图构建和处理带来了大量成本,而基于变换器的方法依赖辅助阶段,阻碍了端到端优化。为解决这一问题,我们提出了HEPTv2,一种端到端点变换器架构,在一个可训练管道中从探测器击中点重建轨迹。HEPTv2结合了局部感知点编码器和轨迹解码器,无需图构建、聚类或过滤即可预测完整轨迹。编码器在探测器坐标空间中使用局部敏感哈希,以保留追踪相关几何结构,同时实现高效的局部注意力。解码器通过扇区化解码和联合编码器-解码器监督下的直接击中到轨迹预测来消除歧义,使整个管道能够端到端优化。在TrackML上,HEPTv2以0.8%的假率实现了98.6%的双多数追踪效率,同时在NVIDIA A100 GPU上每个事件仅需约15毫秒推理时间和0.4 GB峰值内存。对于最多包含$5\ imes10^5$个击中点的事件,延迟和内存大致线性扩展。HEPTv2在精度-延迟权衡中建立了新的最先进水平,相比之前最强的变换器效率提升4.5%,相比优化的基于图管道提升1.1-2.2%,同时延迟分别降低7倍和38-52倍。这些结果表明,端到端变换器能够提供HL-LHC实时粒子重建所需的精度和效率。

英文摘要

Charged-particle tracking -- reconstructing trajectories from sparse detector measurements -- is a fundamental high-energy-physics inference problem and a canonical example of learning under extreme combinatorial ambiguity. At the High-Luminosity Large Hadron Collider (HL-LHC), tracking must remain accurate and efficient despite unprecedented collision densities. Graph neural networks perform strongly, but incur substantial costs from graph construction and processing, while transformer-based approaches rely on auxiliary stages that prevent end-to-end optimization. To address this, we present HEPTv2, an end-to-end point-transformer architecture that reconstructs tracks from detector hits in one trainable pipeline. HEPTv2 combines a locality-aware point encoder with a track decoder that predicts complete trajectories without graph-building, clustering, or filtering. The encoder uses locality-sensitive hashing in detector coordinate space to preserve tracking-relevant geometry while enabling efficient local attention. The decoder resolves ambiguities through sectorized decoding and direct hit-to-track prediction under joint encoder-decoder supervision, allowing the full pipeline to be optimized end-to-end. On TrackML, HEPTv2 achieves 98.6% double-majority tracking efficiency at a 0.8% fake rate, while requiring only $\sim$15~ms inference time and 0.4~GB peak memory per event on a NVIDIA A100 GPU. Latency and memory scale approximately linearly for events with up to $5\times10^5$ hits. HEPTv2 establishes a new state of the art in the accuracy-latency trade-off, improving efficiency by 4.5% over the strongest prior transformer and by 1.1--2.2% over optimized graph-based pipelines, while reducing latency by factors of 7 and 38--52, respectively. These results show end-to-end transformers can deliver the accuracy and efficiency required for real-time particle reconstruction at the HL-LHC.

3. 物理仿真 14 篇

2606.20417 2026-06-19 cs.LG 新提交 85%

Neural network surrogates with uncertainty quantification for inverse problems in partial differential equations

具有不确定性量化的神经网络代理模型用于偏微分方程反问题

Christian Jimenez-Beltran, Aretha L. Teckentrup, Antonio Vergari, Konstantinos C. Zygalakis

发表机构 * School of Mathematics and Maxwell Institute for Mathematical Sciences University of Edinburgh(数学学院和麦克斯韦数学科学研究所爱丁堡大学)

专题命中 物理仿真 :神经网络代理用于偏微分方程反问题,不确定性量化

AI总结 提出DeepGaLA神经网络代理模型,为微分方程求解器提供不确定性感知预测,结合延迟接受MCMC诊断,实现高效可靠的贝叶斯反演。

详情
AI中文摘要

微分方程的反问题在科学和工程中普遍存在,其目标是从噪声或不完整的观测中推断未知模型参数。传统数值方法通常计算成本高昂,尤其是在贝叶斯设置中,对于复杂正向模型和高维参数空间,评估似然函数变得非常昂贵。为了应对这一挑战,我们引入了DeepGaLA,一种用于微分方程求解器的神经网络代理模型,它提供不确定性感知的预测,在训练数据有限时减少过度自信的推断。为了在实践中评估代理诱导的后验近似的保真度,我们表明,短时间运行的延迟接受马尔可夫链蒙特卡洛可以作为有效的诊断工具。在一系列数值实验中,DeepGaLA提供的正向模型近似精度与已建立的高斯过程代理相当,同时在参数维度增加时更好地保持效率。此外,它可以纳入微分方程约束,包括非线性情况。总体而言,这些结果表明,具有不确定性量化的神经代理模型能够实现复杂系统中反问题的可扩展且可靠的贝叶斯推断。

英文摘要

Inverse problems for differential equations arise throughout science and engineering, where one seeks to infer unknown model parameters from noisy or incomplete observations. Traditional numerical methods for these problems are often computationally expensive, particularly in Bayesian settings where evaluating the likelihood becomes costly for complex forward models and high-dimensional parameter spaces. To address this challenge, we introduce DeepGaLA, a neural-network surrogate for differential equation solvers that provides uncertainty-aware predictions, reducing overconfident inference when training data are limited. To evaluate the fidelity of the surrogate-induced posterior approximations in practice, we show that a short run of delayed-acceptance Markov chain Monte Carlo can serve as an effective diagnostic. Across a range of numerical experiments, DeepGaLA delivers forward-model approximations with accuracy comparable to established Gaussian-process surrogates, while better maintaining efficiency as parameter dimension grows. Moreover, it can incorporate differential-equation constraints, including in nonlinear settings. Overall, these results indicate that uncertainty-quantified neural surrogates can enable scalable and reliable Bayesian inference for inverse problems in complex systems.

2606.19984 2026-06-19 cs.LG 新提交 85%

Kolmogorov-Arnold Reservoir Computing

Kolmogorov-Arnold 储层计算

Juntian Huang, Jurgen Kurths, Ying Tang

发表机构 * Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China(电子科技大学基础与前沿科学研究所) Potsdam Institute for Climate Impact Research(波茨坦气候影响研究所) Department of Physics, Humboldt University Berlin(柏林洪堡大学物理系) Research Institute of Intelligent Complex Systems, Fudan University(复旦大学智能复杂系统研究所) School of Physics, University of Electronic Science and Technology of China(电子科技大学物理学院) Key Laboratory of Quantum Physics and Photonic Quantum Information, Ministry of Education, University of Electronic Science and Technology of China(电子科技大学教育部量子物理与光子量子信息重点实验室) Non-classical Information Science Basic Discipline Research Center of Sichuan Province, University of Electronic Science and Technology of China(电子科技大学四川省非经典信息科学基础学科研究中心)

专题命中 物理仿真 :提出KARC用于动力系统预测

AI总结 提出Kolmogorov-Arnold储层计算(KARC),用显式基函数展开替代储层,结合KAN的表达能力和储层计算的闭式训练,在偏微分方程等基准上优于现有方法。

详情
AI中文摘要

储层计算为预测动力系统提供了轻量级框架,但由于表示能力有限,可能难以捕捉长程依赖。传统储层计算循环使用可训练储层,对超参数敏感,而下一代储层计算以特征维度快速增长为代价去除了循环。在此,我们开发了Kolmogorov-Arnold储层计算(KARC),它受Kolmogorov-Arnold表示定理启发,用显式基函数展开替代储层。我们严格证明KARC是Kolmogorov-Arnold网络(KAN)的轻量级设计,保留了KAN的潜在表达能力,同时允许储层计算的高效闭式训练。在相当的成本下,KARC在包括偏微分方程在内的挑战性基准上优于现有储层计算方法。它还可以与生成扩散模型集成用于文本到图像生成。因此,本工作建立了储层计算与KAN之间的原则性桥梁,实现了高效高保真的动力系统预测。

英文摘要

Reservoir computing offers a lightweight framework for forecasting dynamical systems but may struggle to capture long-range dependencies due to limited representational capacity. Conventional reservoir computing recurrently uses trainable reservoirs with hyperparameter sensitivity, while the next-generation reservoir computing removes recurrence at the cost of rapidly growing feature dimensions. Here, we develop Kolmogorov-Arnold Reservoir Computing (KARC), which replaces reservoirs with explicit basis-function expansions inspired by the Kolmogorov-Arnold representation theorem. We rigorously show that KARC is a lightweight design of Kolmogorov-Arnold networks (KANs), preserving the potential expressive capacity of KANs while admitting efficient closed-form training of reservoir computing. At comparable cost, KARC outperforms existing reservoir computing methods on challenging benchmarks including partial differential equations. It can also be integrated with generative diffusion models for text-to-image generation. This work thus establishes a principled bridge between reservoir computing and KANs, enabling efficient and high-fidelity dynamical system forecasting.

2606.20442 2026-06-19 cs.LG cs.NA cs.NE math.NA 新提交 85%

Evolutionary Two-Stage Hyperparameter Optimization Strategies for Physics-Informed Neural Networks

物理信息神经网络的进化两阶段超参数优化策略

Fedor Buzaev, Dmitry Efremenko, Egor Bugaev, Andrei Ermakov, Denis Derkach, Daria Pugacheva, Fedor Ratnikov

发表机构 * HSE University(高等经济大学) AXXX

专题命中 物理仿真 :进化优化物理信息神经网络超参数

AI总结 针对物理信息神经网络训练不稳定、超参数敏感的问题,提出基于进化算法的两阶段优化策略,先低保真筛选再全训练,在三个PDE问题上显著降低误差。

Comments Equal advising: Daria Pugacheva and Fedor Ratnikov. Accepted to the ICLR 2026 Workshop on AI and PDEs

详情
AI中文摘要

物理信息神经网络(PINNs)通过将物理定律嵌入神经网络训练来求解偏微分方程(PDE)。然而,由于物理信息损失的高度非凸和多项结构,其性能受到不稳定收敛、训练平台期以及对架构和优化超参数的强敏感性的影响。在这种情况下,外循环超参数搜索是一个在异构参数上的噪声黑盒优化问题,经典的局部或基于梯度的策略容易陷入次优区域。进化算法凭借其基于种群的探索能力和处理混合、不可微搜索空间的能力,为发现有前景的配置提供了更稳健的机制。我们提出并研究了一种基于进化算法的两阶段方法,该方法结合了PINNs训练的探索和利用部分,以在固定计算预算下提高解的精度和鲁棒性。在第一阶段,我们执行具有截断轮次的低保真训练运行,以快速筛选候选配置,将超参数选择视为黑盒外循环问题。在第二阶段,只有最有希望的候选者使用标准基于梯度的优化器进行完全训练以细化解。在三个流行问题(即平流方程、Klein-Gordon方程和Helmholtz方程)上评估,我们的方法一致优于标准训练,并在受限计算资源内实现了显著更低的平均误差。

英文摘要

Physics-Informed Neural Networks (PINNs) solve Partial Differential Equations (PDEs) by embedding physical laws into neural network training. However, their performance suffers from unstable convergence, training plateaus, and strong sensitivity to architectural and optimization hyperparameters due to the highly non-convex and multi-term structure of the physics-informed loss. In this setting, the outer-loop hyperparameter search is a noisy and black-box optimization problem over heterogeneous parameters, where classical local or gradient-based strategies are easily trapped in suboptimal regions. Evolutionary algorithms, with their population-based exploration and ability to handle mixed, non-differentiable search spaces, provide a more robust mechanism for discovering promising configurations. We propose and investigate a two-stage approach based on evolutionary algorithms that combines exploration and exploitation parts of PINNs training to improve solution accuracy and robustness under fixed computational budgets. In the first stage, we perform low-fidelity training runs with truncated epochs to rapidly screen candidate configurations, treating hyperparameter selection as a black-box outer-loop problem. In the second stage, only the most promising candidates are fully trained with standard gradient-based optimizers to refine the solution. Evaluated on three popular problems, namely Advection, Klein-Gordon and Helmholtz equations, our method consistently outperforms standard training and achieves significantly lower mean error within constrained computational resources.

2606.19909 2026-06-19 stat.CO math.PR stat.ME 新提交 85%

Establishing an $Ω(\sqrt{d})$ complexity lower bound for PDMP samplers and how to break it: a sub-$\sqrt{d}$ algorithm for Gaussian-tailed targets

建立 PDMP 采样器的 $\Omega(\sqrt{d})$ 复杂度下界及如何突破:针对高斯尾目标的一个亚 $\sqrt{d}$ 算法

Augustin Chevallier

专题命中 物理仿真 :提出PDMP采样器新方案,优化高斯尾目标复杂度

AI总结 本文证明分段确定性马尔可夫过程采样器在标准设置下具有 $\Omega(\sqrt{d})$ 复杂度下界,并通过放宽目标密度连续时间不变性假设,提出一种新方案,对高斯尾目标实现 $O(d^\alpha)$($\alpha\in[0.2,0.3]$)的经验复杂度。

详情
AI中文摘要

尽管分段确定性马尔可夫过程(PDMP)采样器在理论上有非可逆性的吸引力,但迄今为止,尚未开发出在计算复杂度上相对于目标维度 $d$ 优于 $\mathcal{O}(\sqrt{d})$ 的 PDMP 采样器。我们通过在标准设置中建立 PDMP 采样器算法复杂度的 $\Omega(\sqrt{d})$ 下界,证明这是一个基本限制。通过放宽目标密度必须在所有连续时间保持不变的假设,我们随后展示了如何突破这一障碍。具体来说,我们引入了一种新颖的 PDMP 采样方案,并表明它对高斯尾目标实现了 $\mathcal{O}(d^\alpha)$ 的经验复杂度,其中 $\alpha \in [0.2, 0.3]$。此外,该 PDMP 方案在轨迹长度和速度更新之间的距离上都是局部自适应的。

英文摘要

Despite the theoretical appeal of their non-reversibility, to date, no Piecewise Deterministic Markov Process (PDMP) samplers have been developed that scale better than $\mathcal{O}(\sqrt{d})$ in computational complexity with respect to the target dimension $d$. We prove that this is a fundamental limitation by establishing an $Ω(\sqrt{d})$ lower bound on the algorithmic complexity of PDMP samplers in a standard setup. By relaxing the assumption that the target density must remain invariant at all continuous times, we then demonstrate how to bypass this barrier. Specifically, we introduce a novel PDMP sampling scheme and show that it achieves an empirical complexity of $\mathcal{O}(d^α)$, where $α\in [0.2, 0.3]$ for Gaussian-tailed targets. In addition, this PDMP scheme is locally adaptive in both trajectory length and distance between velocity updates.

2606.19895 2026-06-19 math.NA cs.LG cs.NA 新提交 85%

A fast direct solver based neural network for solving PDEs

基于快速直接求解器的神经网络求解偏微分方程

Jashwanth Reddy Kadaru, Vaishnavi Gujjula

发表机构 * Department of Computer Science & Engineering, International Institute of Information Technology Bangalore (IIIT-B), India(计算机科学与工程系,国际信息学院班加罗尔(IIIT-B),印度) Department of Data Science and Artificial Intelligence, International Institute of Information Technology Bangalore (IIIT-B), India(数据科学与人工智能系,国际信息学院班加罗尔(IIIT-B),印度)

专题命中 物理仿真 :提出神经网络求解PDE,属于物理仿真

AI总结 提出一种学习HODLR矩阵逆运算的神经网络,并扩展为非线性PDE求解算子,实验表明在多种PDE上高效且泛化良好。

Comments 26 pages, 7 Figures, 5 Tables

详情
AI中文摘要

大规模$N$体问题产生的矩阵可以使用层次矩阵高效表示,其关键思想是允许跨矩阵分区层次结构的可接受非对角子矩阵可以通过低秩矩阵很好地近似。HODLR(层次非对角低秩)矩阵是层次矩阵的一个子类,其中递归二分划分的每一级的所有非对角子矩阵都是低秩的。本文提出一种神经网络,基于Ambikasaran和Darve(2013)开发的HODLR矩阵快速直接求解器,学习HODLR矩阵的逆运算。我们进一步通过将部分线性层替换为深度子网络,扩展该架构以学习与PDE相关的非线性解算子。我们通过进行一组全面的实验来展示所提出架构的性能,包括(i)求解线性问题,如第二类Fredholm积分方程,(ii)求解PDE,如非线性薛定谔方程、Burgers方程和稳态达西流方程,(iii)跨不同参数值的泛化研究,(iv)将所提出网络的推理时间与经典数值求解器的运行时间进行比较,以及(v)将所提出网络与一些现有的神经算子学习网络进行比较。

英文摘要

The matrices arising from large scale $N$-body problems can be efficiently represented using hierarchical matrices, whose key idea is that the admissible off-diagonal sub-matrices can be well approximated by low-rank matrices across a hierarchy of matrix partitions. HODLR (Hierarchical Off-Diagonal Low-Rank) matrices are a subclass of hierarchical matrices in which all off-diagonal submatrices at every level of a recursive binary partition are low-rank. In this article, we present a neural network that learns the inverse operation of HODLR matrices based on the fast direct solver for HODLR matrices developed by Ambikasaran and Darve (2013). We further extend the architecture to learn nonlinear solution operators associated with PDEs by replacing some of the linear layers with deep sub-networks. We demonstrate the performance of the proposed architecture by performing a comprehensive set of experiments that include (i) solving a linear problem such as the Fredholm integral equation of the second kind, (ii) solving PDEs such as the nonlinear Schrödinger equation, Burgers' equation, and the steady-state Darcy's flow equation, (iii) generalization study across varying parameter values, (iv) comparing the inference time of the proposed network with the run time of a classical numerical solver, and (v) comparing the proposed network with some of the existing neural operator learning networks.

2606.20432 2026-06-19 math.AG math.RA quant-ph 新提交 85%

Eigenvector Varieties

特征向量簇

Sandra Di Rocco, Bernd Sturmfels, Svala Sverrisdóttir

专题命中 物理仿真 :研究李代数和量子系统哈密顿量的特征向量簇,属于数学物理

AI总结 研究方阵线性空间的特征向量簇,系统分析李代数和量子系统哈密顿量的相关几何性质。

详情
AI中文摘要

任何方阵线性空间都有一个关联的特征向量簇。其点是该线性空间中矩阵的特征向量。我们提出了特征向量簇的系统研究,重点关注李代数和量子系统的哈密顿量。

英文摘要

Any linear space of square matrices has an associated eigenvector variety. Its points are eigenvectors of matrices from that linear space. We present a systematic study of eigenvector varieties, with focus on Lie algebras and Hamiltonians of quantum systems.

2606.19486 2026-06-19 quant-ph cs.IT cs.LG math.IT 新提交 85%

Optimal Ansatz-free Hamiltonian Learning In Situ

无假设哈密顿量的最优原位学习

Taiqi Zhou, Weiyuan Gong

发表机构 * Department of Information Engineering, The Chinese University of Hong Kong(香港中文大学信息工程系) John A. Paulson School of Engineering and Applied Sciences, Harvard University(哈佛大学约翰·A·保罗森工程与应用科学学院) California Institute of Technology(加州理工学院)

专题命中 物理仿真 :哈密顿量学习算法,量子信息科学

AI总结 提出一种无需控制、无需辅助比特的算法,仅用泡利乘积态制备和测量,以最优总演化时间学习无假设哈密顿量,适用于近中期量子实验。

Comments 51 pages, 2 figures

详情
AI中文摘要

描述控制量子系统的哈密顿量特征,是量子设备校准、信号传感和纠错的基本子程序。近期工作提出了协议,通过实时演化实现无假设哈密顿量的最优海森堡极限学习,无需完全指定相互作用结构。然而,这些协议依赖于带有交错探测和控制的深电路以及极短的时间分辨率,使其难以在近中期原位量子实验中实现。本文提出一种计算高效、无需控制、无需辅助比特的算法,仅使用泡利乘积态制备和测量,在总演化时间 $\Theta(\frac{\Lambda}{\epsilon^2}\log(\frac{\Lambda}{\epsilon}))$ 内学习无假设哈密顿量 $H$(满足 $||H||\leq\Lambda$)。该算法的演化时间成本对于任何无控制协议是最优的,因为我们进一步证明了 $\Omega(\frac{\Lambda}{\epsilon^2}\log(\frac{\Lambda}{\epsilon}))$ 的下界。技术上,我们的方法引入了一个随机采样框架,结合了带限核时间采样和用于哈密顿量结构学习的位移筛。特征探测时间分辨率仅依赖于 $\Lambda$ 而非 $\varepsilon$,这使得我们的协议在传感和校准的高精度场景中特别有吸引力。我们还表明,当哈密顿量在校准后是局域的时,该算法在存在状态制备和测量(SPAM)噪声的情况下保持相同的渐近总演化时间。我们的结果展示了实验友好型哈密顿量学习的基本成本,并为近中期量子平台的严格原位表征提供了实用途径。

英文摘要

Characterizing the features of a Hamiltonian that governs a quantum system serves as a fundamental subroutine of quantum device calibration, signal sensing, and error correction. Recent works proposed protocols have achieved the optimal Heisenberg-limited scaling learning ansatz-free Hamiltonians from their real-time evolutions without fully specifying interaction structures. However, these protocols rely on both deep circuits with interleaving probes and control, and extremely short time resolution, making them difficult to implement on near- and intermediate-term in situ quantum experiments. In this work, we propose a computationally efficient, control-free, and ancilla-free algorithm that uses only Pauli product state preparation and measurement, and learns an ansatz-free Hamiltonian $H$ with $||H||\leqΛ$ in total evolution time of $Θ(\fracΛ{ε^2}\log(\fracΛε))$. The evolution time cost of our algorithm is optimal for any control-free protocols as we further prove a lower bound of $Ω(\fracΛ{ε^2}\log(\fracΛε))$. Technically, our method introduces a randomized-sampling framework that combines band-limited kernel-based time sampling with a displacement sieve for Hamiltonian structure learning. The characteristic probe time resolution depends only on $Λ$ instead of $\varepsilon$, which makes our protocol especially appealing in the high-precision regime for sensing and calibration applications. We also show that the algorithm maintains the same asymptotic total evolution time in the presence of state-preparation-and-measurement (SPAM) noise when the Hamiltonian is local after calibration. Our results demonstrate the fundamental cost of experimentally friendly Hamiltonian learning and provide a practical route to rigorous in situ characterization of near-term quantum platforms.

2606.20330 2026-06-19 quant-ph physics.atom-ph 新提交 85%

Observation of alignment tensor effects in metastability-exchange collisions with highly polarized 3He ensembles

高度极化3He系综中亚稳态交换碰撞中排列张量效应的观测

Yida Sha, Kaiwen Yi, Xingqing Jin, Matteo Fadel, Xiang Peng

专题命中 物理仿真 :3He极化实验,原子物理与量子传感

AI总结 通过线性化平均场模型和自由感应衰减测量,实验观测到高度极化3He中亚稳态排列张量引起的弛豫和频移,理论与实验吻合,为高精度磁测和自旋压缩态生成提供应用。

Comments 12 pages, 5 figures

详情
AI中文摘要

通过亚稳态交换光泵浦(MEOP)制备的高度极化3He系综已广泛应用于精密测量和基础物理。作为MEOP基础的亚稳态交换(ME)碰撞传统上用原子取向描述,而高极化下亚稳态排列张量的显著贡献尚未被探索。本文在平均场近似下发展了一个线性化模型,研究高度极化3He中的排列张量效应,该效应源于亚稳态F=3/2能级,并通过ME诱导的弛豫和频移显现。通过自由感应衰减(FID)测量,实验观察到基态-亚稳态混合3He系综对外部磁场的响应强烈依赖于核极化。此外,在获得张量诱导现象的特征后,我们展示了实验与理论之间的良好一致性。这项工作推进了对使用MEOP的高度极化3He中核自旋动力学的理解,并进一步应用于高精度磁测的系统误差校正以及核自旋压缩态生成的最优方案。

英文摘要

Highly polarized 3He ensembles prepared by metastability-exchange optical pumping (MEOP) have been widely used in precision measurements and fundamental physics. Metastability-exchange (ME) collisions, serving as the basis of MEOP, are traditionally described in terms of atomic orientation, while the significant contributions of metastable alignment tensor at high polarization remain unexplored. In this work, we develop a linearized model under mean-field approximation to investigate alignment tensor effects in highly polarized 3He , which originate from the metastable F = 3/2 manifold and are revealed through ME-induced relaxation and frequency shift. By means of free-induction-decay (FID) measurements, a pronounced dependence on nuclear polarization is experimentally observed in the response of the ground-state-metastable hybrid 3He ensembles to the external magnetic field. Furthermore, after obtaining the characteristics of tensor-induced phenomena, we demonstrate good agreement between the experiment and the theory. This work advances the understanding of nuclear spin dynamics in highly polarized 3He using MEOP. It further provides applications in systematic error correction of high-accuracy magnetometry, as well as in optimal protocol for the generation of nuclear spin-squeezed states.

2606.20328 2026-06-19 quant-ph physics.atom-ph 新提交 85%

Effective Faraday interaction between light and Helium-3 nuclear spins in a multi-pass cell

多通池中光与氦-3核自旋的有效法拉第相互作用

Kaiwen Yi, Yida Sha, Zejia Lin, Matteo Fadel, Xiang Peng

专题命中 物理仿真 :光与核自旋相互作用,量子传感

AI总结 通过亚稳态交换碰撞在多通池中实现光与氦-3核自旋的有效法拉第相互作用,并定量表征其强度,预测测量诱导的压缩速率为0.52 s$^{-1}$。

详情
AI中文摘要

氦-3核自旋构成一个极其稳定的量子系统,具有极长的相干时间,为量子技术提供了激动人心的机会。特别是,核自旋压缩态有望提高传感任务和新物理测试的精度。所有这些应用的一个核心挑战是实现可控的光-核自旋界面。在这里,我们通过利用室温下低压氦-3气体池中的亚稳态交换碰撞,实验演示了这样一个界面。射频放电产生少量亚稳态原子,既能实现高效光泵浦,又能介导集体核自旋与光学探针之间的有效法拉第相互作用。我们定量表征了这种相互作用的强度随核极化、外加磁场和探针光束参数的变化。此外,我们展示了使用多通池通过有效增加光学深度来增强这种相互作用。外推到当前实验中使用的探针功率的十倍,我们预测测量诱导的压缩速率为0.52 s$^{-1}$。我们的结果为光学访问氦-3核自旋提供了一条实用途径,并为生成用于量子计量学的长寿命宏观核自旋压缩态开辟了前景。

英文摘要

Helium-3 nuclear spins form an exceptionally stable quantum system with extremely long coherence time, offering exciting opportunities for quantum technologies. In particular, nuclear spin-squeezed states promise enhanced precision for sensing tasks and tests of new physics. A central challenge for all these applications is the realization of a controllable light-nuclear spin interface. Here we experimentally demonstrate such an interface by exploiting metastability-exchange collisions in a low-pressure helium-3 gas cell at room temperature. A radio-frequency discharge produces a small population of metastable atoms that both enables efficient optical pumping and mediates an effective Faraday interaction between the collective nuclear spin and an optical probe. We quantitatively characterize the strength of this interaction as a function of the nuclear polarization, applied magnetic field, and probe-beam parameters. Moreover, we show that using a multi-pass cell enhances this interaction by effectively increasing the optical depth. Extrapolating to a tenfold increase of the probe power used in the present experiment, we project a measurement-induced squeezing rate of 0.52 s$^{-1}$. Our results provide a practical pathway for optical access to helium-3 nuclear spins and open prospects for generating long-lived, macroscopic nuclear spin-squeezed states for quantum metrology.

2606.20326 2026-06-19 cs.LG physics.comp-ph 新提交 85%

Quantum-classical physics-informed Kolmogorov-Arnold networks for PDEs

量子-经典物理信息Kolmogorov-Arnold网络求解偏微分方程

Xiang Rao, Yuxuan Shen

发表机构 * School of Petroleum Engineering, Yangtze University(扬州大学石油工程学院) School of Computer Science, Yangtze University(扬州大学计算机科学学院) State Key Laboratory of Low Carbon Catalysis and Carbon Dioxide Utilization (Yangtze University)(扬州大学低碳催化与二氧化碳利用国家重点实验室) Western Research Institute, Yangtze University(扬州大学西部研究院)

专题命中 物理仿真 :量子-经典PINN求解PDE,科学计算

AI总结 提出QCPIKAN,首个量子-经典物理信息Kolmogorov-Arnold网络,结合Chebyshev多项式KAN层和参数化量子电路,通过嵌入物理约束加速高频误差指数收敛并抑制数值色散,在多孔介质渗流场景中优于现有量子-经典PINN。

详情
AI中文摘要

我们开发了QCPIKAN,这是首个旨在求解偏微分方程(PDE)的量子-经典物理信息Kolmogorov-Arnold网络。该混合框架基于Chebyshev多项式KAN层和参数化量子电路构建,将物理约束嵌入训练损失中以强制执行物理一致性。我们的基于逼近论的理论研究证明,该设计将高频误差收敛加速至指数速率,并有效抑制数值色散。我们在多孔介质中的三个典型渗流场景(包括单相流、组分运移和两相流)上验证了该框架。与现有的量子-经典物理信息神经网络相比,QCPIKAN在全局预测精度、局部误差控制、动态演化跟踪和驱替前沿定位方面均实现了优越性能。这项工作为求解复杂PDE提供了一种鲁棒且高效的替代方案。

英文摘要

We develop QCPIKAN, the first quantum-classical physics-informed Kolmogorov-Arnold network designed to solve partial differential equations (PDEs). Built upon Chebyshev-polynomial KAN layers and parameterized quantum circuits, this hybrid framework embeds physical constraints into the training loss to enforce physical consistency. Our theoretical investigations grounded in approximation theory prove that this design accelerates high-frequency error convergence to an exponential rate and effectively mitigates numerical dispersion. We validate the framework across three typical seepage scenarios in porous media, including single-phase flow, component transport and two-phase flow. Compared with existing quantum-classical physics-informed neural networks, QCPIKAN achieves superior performance in global prediction accuracy, local error control, dynamic evolution tracking and displacement front localization. This work provides a robust and efficient alternative for solving complex PDEs.

2606.19912 2026-06-19 math.NA cs.LG cs.NA physics.comp-ph 新提交 85%

Structure-Oriented Randomized Neural Networks for Poisson-Nernst-Planck and Poisson-Nernst-Planck-Navier-Stokes Systems

面向结构的随机神经网络用于泊松-能斯特-普朗克和泊松-能斯特-普朗克-纳维-斯托克斯系统

Yunlong Li, Fei Wang

发表机构 * School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an, Shaanxi(西安交通大学数学与统计学院,西安,陕西)

专题命中 物理仿真 :随机神经网络求解PNP系统,科学计算

AI总结 提出结构导向随机神经网络(SO-RaNN)框架,通过解耦线性化子问题、逐点截断保持浓度正性、离散质量缩放因子和SAV后处理修正,实现PNP和PNP-NS系统的高效求解,并理论推导残差估计和收敛性。

详情
AI中文摘要

我们开发了一种面向结构的随机神经网络框架,称为SO-RaNN,用于泊松-能斯特-普朗克(PNP)系统和泊松-能斯特-普朗克-纳维-斯托克斯(PNP-NS)系统。解耦的线性化子问题通过随机神经网络在时空框架中迭代求解。对于浓度变量,使用逐点截断在数值层面强制正性,并在选定的修正时刻计算离散质量缩放因子并在时间上插值,以确保在这些时刻精确匹配质量并促进近似质量守恒。为了引入辅助离散耗散机制,我们进一步采用SAV型后处理修正,该修正使得SAV辅助变量在理想SAV更新下具有单调性。对于PNP-NS系统,使用保结构随机神经网络(SP-RaNN)处理速度场,使得速度近似通过构造满足逐点不可压缩约束。在理论方面,我们推导了线性化子问题的原始未修正RaNN求解器的残差估计,为PNP系统的原始外Picard迭代制定了条件性局部时间收敛结果,并分析了数值层面的正性修正以及质量修正和SAV后处理步骤。对于PNP-NS系统,我们建立了SP-RaNN空间的逼近结果,并给出了相应线性化Oseen型问题的条件性误差陈述。数值实验展示了源驱动制造测试中的逼近精度,并说明了预期中的数值层面正性修正、选定时刻质量匹配、基于最终规范固定势的计算自由能曲线以及基准测试中的无散度逼近。

英文摘要

We develop a structure-oriented randomized neural network framework, termed SO-RaNN, for the Poisson-Nernst-Planck (PNP) system and the Poisson-Nernst-Planck-Navier-Stokes (PNP-NS) system. The decoupled linearized subproblems are solved iteratively by randomized neural networks in a space-time framework. For the concentration variables, a pointwise cut-off is used to enforce positivity at the value level, and discrete mass-scaling factors are computed at selected correction instants and interpolated in time, so as to ensure exact mass matching at those instants and to promote approximate mass preservation between them. To introduce an auxiliary discrete dissipation mechanism, we further employ an SAV-type post-processing correction, which yields monotonicity of the SAV auxiliary variable under the ideal SAV update. For the PNP-NS system, a structure-preserving randomized neural network (SP-RaNN) is used for the velocity field, so that the velocity approximation satisfies the incompressibility constraint pointwise by construction. On the theoretical side, we derive residual-based estimates for the raw, uncorrected RaNN solvers of the linearized subproblems, formulate a conditional local-in-time convergence result for the raw outer Picard iteration of the PNP system, and analyze the value-level positivity correction together with the mass-correction and SAV post-processing steps. For the PNP-NS system, we establish an approximation result for the SP-RaNN space and provide a conditional error statement for the corresponding linearized Oseen-type problem. Numerical experiments demonstrate approximation accuracy in the source-driven manufactured tests and illustrate the intended value-level positivity correction, selected-time mass matching, computed free-energy curves based on the final gauge-fixed potential, and divergence-free approximation in benchmark tests.

2606.19562 2026-06-19 cs.LG physics.flu-dyn 新提交 85%

Advances in Scientific Machine Learning for Coupled Fluid Flow and Transport

耦合流体流动与输运的科学机器学习进展

Gabriel F. Barros, Rômulo M. Silva, Alvaro L. G. A. Coutinho

发表机构 * COPPE - Federal University of Rio de Janeiro - UFRJ(里约热内卢联邦大学COPPE学院)

专题命中 物理仿真 :科学机器学习综述,流体动力学

AI总结 综述科学机器学习在耦合流体流动与输运问题中的进展,包括基于SVD的线性降阶和PINNs、β-VAE等神经网络方法,并展示其在浊流和热对流中的应用。

详情
AI中文摘要

本章回顾了科学机器学习(SciML)在模拟由不可压缩Navier-Stokes方程和标量输运方程控制的耦合流体流动与输运现象方面的最新进展。这类系统出现在浊流和热对流等应用中,具有强非线性耦合和多尺度行为,使得高保真模拟计算成本高昂。为此,本章调查了构建高效代理模型的最新SciML方法,包括基于奇异值分解的线性降阶技术(如动态模态分解)和非线性神经网络方法(如物理信息神经网络(PINNs)和β-变分自编码器(β-VAEs))。首先介绍了作者将这些模型与高性能计算策略相结合的工作,包括自适应网格细化/粗化(AMR/C)和科学浮点数据压缩。然后提出了两个新贡献:通过PINNs对浊流进行代理建模,以及使用β-VAEs从热流中提取解缠的非线性模态。控制方程和代表性基准(包括锁交换流和Rayleigh-Bénard对流)说明了这些方法。本章篇幅较长,涵盖了耦合流体流动的数学和物理基础以及最先进建模的计算方面。总体而言,它展示了SciML如何在特定数据范围和建模假设下,实现复杂耦合系统的快速、精确近似,同时相对于全阶模拟大幅降低计算成本。实时预测和不确定性量化等更广泛的能力仍然是活跃的研究方向,其可行性在很大程度上取决于具体问题。

英文摘要

This chapter reviews recent advances in Scientific Machine Learning (SciML) for modeling coupled fluid flow and transport phenomena governed by the incompressible Navier-Stokes and scalar transport equations. Such systems, found in applications like turbidity currents and thermal convection, feature strong nonlinear coupling and multiscale behavior that make high-fidelity simulations computationally expensive. To address this, the chapter surveys state-of-the-art SciML methods for building efficient surrogate models, including linear reduced-order techniques based on Singular Value Decomposition (such as Dynamic Mode Decomposition) and nonlinear neural network approaches like Physics-Informed Neural Networks (PINNs) and $β$-Variational Autoencoders ($β$-VAEs). It first covers the authors' work combining these models with High Performance Computing strategies, including Adaptive Mesh Refinement/Coarsening (AMR/C) and scientific floating-point data compression. It then presents two new contributions: surrogate modeling of turbidity currents via PINNs, and the extraction of disentangled nonlinear modes from thermal flows using $β$-VAEs. Governing equations and representative benchmarks, including lock-exchange flows and Rayleigh-Bénard convection, illustrate these methodologies. The chapter is intentionally long, covering both the mathematical and physical foundations of coupled fluid flow and the computational aspects of state-of-the-art modeling. Overall, it demonstrates how SciML enables fast, accurate approximations of complex coupled systems within the specific data regimes and modeling assumptions considered, while substantially reducing computational cost relative to full-order simulations. Broader capabilities such as real-time prediction and uncertainty quantification remain active research directions whose feasibility depends strongly on the problem at hand.

2606.19457 2026-06-19 quant-ph physics.chem-ph 新提交 85%

Efficient classical representation and quantum state preparation of complete active space wavefunctions

完全活性空间波函数的高效经典表示与量子态制备

Hamza Jnane

专题命中 物理仿真 :量子化学波函数表示与制备,属于物理仿真

AI总结 针对强电子关联分子,提出基于量子Paldus变换的完全活性空间波函数高效经典表示(矩阵乘积态,键维O(d^2))和量子态制备方法,复杂度O(d^3),较现有方法指数级改进。

Comments 14 pages, 5 figures

详情
AI中文摘要

量子计算机有望解决大量分子的电子结构问题。然而,相关量子算法的性能取决于制备与目标本征向量有显著重叠的初始态。对于具有强电子关联的经典挑战性分子,从多参考态(如完全活性空间(CAS)波函数)出发是必要的。不幸的是,应用于此类态的最先进态制备协议的门复杂度随活性空间大小$d$呈指数增长。事实上,传统上甚至认为对化学相关系统进行CAS态的经典编码也是棘手的。在此,我们从最近引入的量子Paldus变换(QPT)中汲取见解,证明存在CAS态的高效经典表示,并设计了一种优于先前方法的新态制备程序。QPT表示从Fock基到更友好的对称性适应基的变换。我们的主要贡献在于证明:在该基下展开的CAS态可以高效地表示为矩阵乘积态(MPS),其键维缩放为$O(d^2)$。然后可以高效地将MPS加载到量子计算机上,并使用逆QPT将态变换回Fock基。此外,我们的方法可以轻松扩展到第一量子化中CAS态的高效制备,具有类似的复杂度。关键的是,我们证明了这两种态制备协议的复杂度仅以$O(d^3)$多项式增长,据我们所知,这比现有技术实现了指数级改进。

英文摘要

Quantum computers promise to solve the electronic structure problem for a large class of molecules. However, the performance of relevant quantum algorithms hinges on preparing initial states with substantial overlap with the target eigenvector. For classically challenging molecules with strong electron correlation, starting from multi-reference states, such as complete active space (CAS) wavefunctions is necessary. Unfortunately, the most advanced state preparation protocols applied to such states result in a gate complexity that scales exponentially with the active space size $d$. In fact, even encoding a CAS state classically is traditionally believed to be intractable for chemically relevant systems. Here, we draw insights from the recently introduced Quantum Paldus Transform (QPT) to show that there exists an efficient classical representation of CAS states and to design a new state preparation routine outperforming previous ones. The QPT represents a transformation from the Fock basis to a friendlier symmetry-adapted basis. Our main contribution consists in showing that CAS states expanded in this basis can efficiently be represented as a matrix product state (MPS) with a bond dimension scaling as $O(d^2)$. One can then efficiently load the MPS on a quantum computer and use the inverse QPT to transform the state to the Fock basis. Moreover, our method can easily be extended to the efficient preparation of CAS states in first quantisation with similar complexity. Crucially, we demonstrate that the complexity of both state preparation protocols only grows polynomially as $O(d^3)$ , which constitutes to the best of our knowledge an exponential improvement over the state of the art.

2606.20231 2026-06-19 cs.AI cond-mat.stat-mech cs.IT math-ph math.IT math.MP nlin.AO 新提交 85%

Thermodynamic Measure of Intelligence

智能的热力学度量

Ishanu Chattopadhyay

发表机构 * Institute for Biomedical Informatics, University of Kentucky(肯塔基大学生物医学信息学研究所) Department of Computer Science, University of Kentucky(肯塔基大学计算机科学系)

专题命中 物理仿真 :提出智能的热力学度量,属于物理与AI交叉

AI总结 提出智能是稀有但有效未来的合法放大,通过递归自模拟实现,并给出热力学度量,证明该结构对高智能必要且近乎充分。

详情
AI中文摘要

智能可以被度量吗?我们提出智能可以定义为稀有但有效未来的合法放大:一个系统增加那些在被动动力学下不太可能但在领域约束下仍然可允许的结果的概率。我们从智能系统必须建模世界及其自身在其中的位置这一前提开始。由于系统是其建模世界的一部分,这自然导致递归自模拟:系统表示其自身动作是轨迹一部分的未来。我们的核心结果给出了一个必要性陈述和一个条件性近乎充分性陈述,将该架构与稀有-有效未来的合法放大的精确热力学度量联系起来:高稀有-有效提升是不可能的,除非内部模拟以高保真度识别稀有-有效未来;反之,当稀有-有效保真度高且模拟包含有效策略时,可实现的提升接近受驱动限制的最优值。因此,递归自模拟不仅是智能的一个合理特征,而且在所述假设下,对于高热力学智能是必要且近乎充分的。由此产生的框架使智能在通用尺度上可度量,从被动物质和反馈控制器、大型语言模型、作为文本生成器的人类到麦克斯韦妖式信息引擎。

英文摘要

Can intelligence be measured? We propose that intelligence can be defined as the lawful amplification of rare but valid futures: a system increases the probability of outcomes that would be unlikely under passive dynamics but remain admissible under the constraints of the domain. We start with the premise that an intelligent system must model the world and its own place within it. Because the system is part of the world it models, this leads naturally to recursive self-simulation: the system represents futures in which its own actions are part of the trajectory. Our central results give a necessity statement and a conditional near-sufficiency statement connecting this architecture to a precise thermodynamic measure of lawful amplification of rare-valid futures: high rare-valid lift is impossible unless the internal simulation identifies rare-valid futures with high fidelity; conversely, when rare-valid fidelity is high and the simulation contains an effective policy, the achievable lift approaches the actuation-limited optimum. Thus recursive self-simulation is not merely a plausible feature of intelligence but, under the stated assumptions, is necessary and nearly sufficient for high thermodynamic intelligence. The resulting framework makes intelligence measurable on a universal scale, from passive matter and feedback controllers, large language models, and humans as text generators to Maxwell-demon-like information engines.

4. 气象气候 2 篇

2606.20165 2026-06-19 physics.ao-ph 新提交 85%

PRecover 1.0: Process Rate Recovery with Machine Learning

PRecover 1.0:基于机器学习的过程速率恢复

Miriam Simm, Tom Beucler, Corinna Hoose

专题命中 气象气候 :机器学习恢复云微物理速率,气象应用

AI总结 提出PRecover数据驱动后处理方法,利用随机森林、梯度提升和神经网络从ICON模型标准输出中恢复未存储的云微物理过程速率,采用两步分类-回归方法,成功恢复短时间累积速率并提供校准预测区间。

Comments Prepared for submission to Geoscientific Model Development (GMD)

详情
AI中文摘要

来自数值模拟的云微物理过程速率的全面信息有助于更好地理解降水形成路径和气溶胶-云相互作用。然而,资源限制通常使得将所有微物理过程速率包含在模型输出中不切实际,限制了深入分析。为了解决这一不足,我们引入了PRecover,一种数据驱动的后处理方法,用于从数值天气预报模型的标准输出中恢复运行时未存储的微物理过程速率。具体来说,我们训练随机森林、梯度提升模型和前馈神经网络,从ICosahedral非静力(ICON)模型中的双矩体微物理方案恢复微物理过程速率。我们使用云变量作为输入,这些变量来自欧洲有限区域设置下的高分辨率模拟。暖雨和冰微物理过程速率通过两步分类-回归方法恢复,包括瞬时和累积过程速率。作为基于物理的基线,我们评估是否可以直接从存储的ICON输出变量重新计算过程速率。对于增长和自收集等过程速率,可以准确重新计算,但对于自动转换、雨融化或异质冰核化速率则不行。使用PRecover,我们成功恢复了大多数在10分钟或更短输出时间步长内累积的过程速率,但对于更长累积间隔累积的速率,恢复难度增加。为了量化预测不确定性,我们通过共形分位数回归提供校准的预测区间。我们通过两个在不同区域域和训练中未见过的模拟设置下的案例研究,展示了模型的空间可迁移性。

英文摘要

Comprehensive information on cloud microphysical process rates from numerical simulations allows for better understanding of precipitation formation pathways and aerosol-cloud interactions. However, resource limitations often make it impractical to include all microphysical process rates in the model output, limiting in-depth analyses. To address this shortcoming, we introduce PRecover, a data-driven post-processing approach to recover microphysical process rates that are not stored during runtime from standard output of a numerical weather prediction model. In particular, we train random forests, gradient boosting models, and feed-forward neural networks to recover microphysical process rates from a two-moment bulk microphysics scheme in the ICOsahedral Nonhydrostatic (ICON) model. We use cloud variables as input, obtained from high-resolution simulations in a limited-area setup over Europe. Warm-rain and ice microphysical process rates are recovered with a two-step classification-regression approach for both instantaneous and accumulated process rates. As a physics-based baseline, we assess whether process rates can be directly recalculated from stored ICON output variables. Accurate recalculation is possible for process rates such as accretion and self-collection but not for the autoconversion, rain melting or heterogeneous ice nucleation rate. Using PRecover, we successfully recover most of the process rates that are accumulated over output time steps of 10 minutes or less, but the values are increasingly difficult to recover for rates accumulated over longer accumulation intervals. To quantify predictive uncertainty, we provide calibrated prediction intervals through conformalized quantile regression. We demonstrate spatial transferability of the models with two case studies over different regional domains and simulation settings unseen during training.

2606.19778 2026-06-19 physics.ao-ph 新提交 85%

A Stochastic-Thermodynamic Constraint on the Seasonal Phase Locking of the El Niño-Southern Oscillation

厄尔尼诺-南方涛动季节锁相的一个随机热力学约束

Yuki Yasuda, Tsubasa Kohyama

专题命中 气象气候 :ENSO季节锁相机制,属于气候科学智能

AI总结 通过线性随机充放电振子模型,利用热力学不确定关系量化熵产生率对SST异常方差季节变化的约束,解释ENSO冬季锁相机制。

详情
AI中文摘要

我们在线性随机充放电振子(SRO)中研究了厄尔尼诺-南方涛动(ENSO)的季节锁相,该振子是一个具有加性噪声和时变增长率的阻尼振子。锁相反映在海表温度异常(SSTA)方差的季节性上。通常,能量驱动这种变化,而熵则控制其是否发生;因此锁相同时受到基于能量和基于熵的约束。我们使用热力学不确定关系(TUR)量化了这种基于熵的约束,TUR是随机热力学中的一个基本不等式。TUR通过部分熵产生率约束SSTA方差的变化趋势,该熵产生率由正向和反向转移概率之比主导,并量化了SSTA转移的不可逆性。增长率控制这种不可逆性:其极值出现在北半球秋季和冬末,熵产生率在这两个时期达到峰值。这些峰值放松了TUR对SSTA方差趋势的约束,使得方差本身可以在北半球冬季达到峰值,这与观测到的ENSO锁相一致。相反,当不可逆性不足时,ENSO无法增长或衰减。如果这种不可逆性被解释为耗散能量,那么对ENSO增长和衰减的约束将要求这种耗散从赤道太平洋输出。需要更现实的模型来检验这一假设,并进一步探索熵与耗散能量之间的物理联系。

英文摘要

We investigate the seasonal phase locking of the El Niño-Southern Oscillation (ENSO) in a linear stochastic recharge oscillator (SRO), a damped oscillator with additive noise and a time-dependent growth rate. Phase locking is reflected in the seasonality of the variance of the sea surface temperature anomaly (SSTA). In general, energy drives such a change, whereas entropy governs whether it occurs; phase locking is thus subject to both an energy- and an entropy-based constraint. We quantify this entropy-based constraint using a thermodynamic uncertainty relation (TUR), a fundamental inequality in stochastic thermodynamics. The TUR constrains the tendency of the SSTA variance by the partial entropy production rate, which is dominated by the ratio of forward and backward transition probabilities and quantifies the irreversibility of SSTA transitions. The growth rate governs this irreversibility: its extrema occur in boreal autumn and late winter, and the entropy production rate peaks at both times. These peaks relax the TUR constraint on the tendency of the SSTA variance, so that the variance itself can peak in boreal winter, consistent with observed ENSO phase locking. Conversely, when irreversibility is insufficient, ENSO cannot grow or decay. If this irreversibility were interpreted as dissipated energy, the constraint on ENSO growth and decay would require this dissipation to be exported from the equatorial Pacific. A more realistic model is needed to test this hypothesis and to further explore the physical connection between entropy and dissipated energy.