arXivDaily arXiv每日学术速递 周一至周五更新

科学与医疗

AI for Science

科学智能、蛋白质、分子、药物、材料、气象、物理和数学 AI。

今日/当前日期收录 162 信号源:cs.LG, q-bio, physics, cond-mat, math, stat.ML
2606.18936 2026-06-18 cs.AI cs.CY 新提交 90%

SciRisk-Bench: A Risk-Dimension-Aware Benchmark for AI4Science Safety

SciRisk-Bench:面向AI4Science安全的风险维度感知基准

Linghao Feng, Yinqian Sun, Dongqi Liang, Sicheng Shen, Chenfei Yan, Yuxuan Peng, Yilin Zhao, Haibo Tong, Kai Li, FeiFei Zhao, Yi Zeng

发表机构 * Brain-inspired Cognitive Intelligence Lab, Institute of Automation, Chinese Academy of Sciences, Beijing, China(脑启发认知智能实验室,自动化研究所,中国科学院,北京,中国) School of Future Technology, University of Chinese Academy of Sciences, China(未来技术学院,中国科学院大学,中国) School of Artificial Intelligence, University of Chinese Academy of Sciences, China(人工智能学院,中国科学院大学,中国) Zhongguancun Academy, China(中关村学院,中国) Beijing Key Laboratory of Safe AI and Superalignment(北京安全人工智能与超对齐重点实验室) Gaoling School of AI, Renmin University of China(甘露人工智能学院,中国人民大学) Beijing Institute of AI Safety and Governance (Beijing-AISI)(北京人工智能安全与治理研究院(北京-AISI)) School of Humanities, University of Chinese Academy of Sciences, China(人文学院,中国科学院大学,中国)

专题命中 其他科学智能 :评估AI4Science安全,覆盖多学科风险

AI总结 提出SciRisk-Bench基准,从显式风险维度和科学学科两个角度评估AI4Science安全,覆盖7个学科、31个子学科和10个风险维度,实验揭示主流及科学大模型的安全薄弱环节。

详情
AI中文摘要

大型语言模型(LLMs)越来越多地嵌入到人工智能驱动的科学(AI4Science)工作流程中,从科学问答和文献分析到实验室规划和自主发现。这一进展迫切需要对安全基准进行评估,不仅要评估科学能力,还要评估模型是否能在高风险的科学背景下识别和避免风险。现有的AI4Science安全数据集涵盖多个学科和任务格式,但潜在的风险维度未得到充分说明。我们引入了\textbf{SciRisk-Bench},这是一个旨在从两个互补视角评估AI4Science安全的基准:显式风险维度和科学学科。SciRisk-Bench涵盖7个学科、31个子学科和10个风险维度。在实验部分,我们评估了主流LLMs和面向科学的LLMs在风险维度、学科和子学科上的表现,从而能够细粒度地诊断科学模型在哪些方面仍然不安全。

英文摘要

Large language models (LLMs) are increasingly embedded in AI for Science (AI4Science) workflows, from scientific question answering and literature analysis to laboratory planning and autonomous discovery. This progress creates an urgent need for safety benchmarks that evaluate not only scientific competence, but also whether models recognize and avoid risks in high-stakes scientific contexts. Existing AI4Science safety datasets cover several disciplines and task formats, leaving the underlying risk dimensions underspecified. We introduce \textbf{SciRisk-Bench}, a benchmark designed to evaluate AI4Science safety from two complementary perspectives: explicit risk dimensions and scientific disciplines. SciRisk-Bench covers 7 disciplines, 31 subdisciplines and 10 risk dimensions. In the experimental section, we evaluate both mainstream LLMs and science-oriented LLMs across risk dimensions, disciplines, and sub-disciplines, enabling fine-grained diagnosis of where scientific models remain unsafe.

2605.07022 2026-06-18 cs.LG 版本更新 90%

Self-Driving Datasets: From 20 Million Papers to Nuanced Biomedical Knowledge at Scale

自主驾驶数据集:从2000万篇论文到大规模精细化生物医学知识

Haydn Jones, Yimeng Zeng, Alden Rose, Li S. Yifei, Yining Huang, Kaiwen Wu, Jiaming Liang, Maggie Ziyu Huan, Yoseph Barash, Cesar de la Fuente-Nunez, Osbert Bastani, Zachary Ives, Mark Yatskar, Jacob R. Gardner

发表机构 * Department of Computer and Information Science, University of Pennsylvania(宾夕法尼亚大学计算机与信息科学系) Department of Genetics, University of Pennsylvania(宾夕法尼亚大学遗传学系) Departments of Bioengineering and Chemical and Biomolecular Engineering, University of Pennsylvania(宾夕法尼亚大学生物工程与化学与生物分子工程系)

专题命中 其他科学智能 :自动生成生物医学知识数据集,属于科学智能。

AI总结 本文提出通过PubMed自动生成结构化数据集,实现更大规模、更精细和更准确的生物医学知识,展示Starling系统在多个任务中生成大规模数据集并提升准确性。

详情
AI中文摘要

人工编纂的生物医学仓库在生物活性、基因组学和化学领域昂贵且滞后于原始文献,丢弃实验背景,掩盖了评估数据正确性和覆盖范围所需的细微差别。我们证明PubMed本身可以被自动且经济地转化为结构化数据集,这些数据集比它们取代的编纂数据库更大、更细致和更准确。我们提出了三个耦合贡献:(1)基于九个生物医学本体的LLM实体标记流水线,能够在包含2250万篇论文和2500亿个token的PubMed语料库中标记45亿个实体,跨19个类别;(2)混合稀疏密集检索支持在标记语料库上执行实体过滤的语义查询;(3)Starling,一个多代理深度研究系统,仅给定自然语言任务描述,即可设计精度和召回率目标的检索过滤器,诱导提取模式,并输出具有丰富细节字段和支持段落的结构化记录。在六个任务中——血脑屏障渗透性、口服生物利用度、急性毒性(LD50)、基因疾病关联、蛋白质亚细胞定位和化学反应——Starling生成约630万条记录(每任务91K至3M条);其中一些是目前最大的公开数据集。前沿模型对我们的提取的拒绝率在0.6-7.7%之间,远低于我们在广泛使用的编纂数据集上测量的错误率(例如,BBB_Martins为16.5%,Bioavailability_Ma为7.3%)。除了规模和准确性外,支持段落还携带了表格数据库所丢弃的细微差别——例如,口服生物利用度可能取决于进食与否的状态。共同,语料库、检索和代理为AI驱动的治疗设计建立了基础。代码和数据集:https://github.com/starling-labs/starling.

英文摘要

Manually curated biomedical repositories -- spanning bioactivity, genomics, and chemistry -- are expensive to maintain, lag behind primary literature, and discard experimental context, obscuring nuances needed to assess data correctness and coverage. We show that PubMed itself can be autonomously and cost-effectively turned into structured datasets that are larger, more nuanced, and more accurate than the curated databases they replace. We present three coupled contributions: (1) an LLM-based entity-tagging pipeline, grounded in nine biomedical ontologies, that tags 4.5B entities across 19 categories in a 22.5M-paper, 2.5T-token PubMed corpus; (2) hybrid sparse-dense retrieval supporting entity-filtered semantic queries over the tagged corpus; and (3) Starling, a multi-agent deep research system that, given only a natural-language task description, designs precision- and recall-targeted retrieval filters, induces an extraction schema, and emits structured records with nuance-rich fields and supporting passages. Across six tasks -- blood-brain barrier permeability, oral bioavailability, acute toxicity (LD50), gene-disease associations, protein subcellular localization, and chemical reactions -- Starling produces ~6.3M records (91K-3M per task); several are, to our knowledge, the largest public datasets for their property. Frontier-model rejection of our extractions is 0.6-7.7% across tasks, far below error rates we measure on widely used curated counterparts (e.g., 16.5% on BBB_Martins, 7.3% on Bioavailability_Ma). Beyond scale and accuracy, the supporting passages carry nuance tabular databases discard -- e.g., oral bioavailability may depend on fed vs. fasted state. Together, the corpus, retrieval, and agent establish a foundation for AI-driven therapeutic design. Code and datasets: https://github.com/starling-labs/starling.

2603.20019 2026-06-18 physics.ins-det 版本更新 90%

Design, construction, and operation of a 30-ton Water-based Liquid scintillator detector at Brookhaven National Laboratory

布鲁克海文国家实验室30吨水基液体闪烁体探测器的设计、建造与运行

S. Andrade, A. Baldoni, D. F. Cowen, R. Diaz Prerez, M. V. Diwan, S. Gokhale, S. Gwon, S. Hans, P. Hackspacher, J. Jerome, G. Lawley, G. D. Orebi Gann, P. Kumar, J. Park, C. Reyes, R. Rosero, N. Seberg, K. Siyeon, M. Smiley, R. Svoboda, N. Speece-Moyer, M. Vagins, B. Walsh, J. J. Wang, M. Wilking, G. Yang, D. Wooley, M. Yeh

专题命中 其他科学智能 :水基液体闪烁体探测器用于中微子探测,属于物理实验仪器

AI总结 介绍30吨水基液体闪烁体探测器的设计、安装与运行,旨在实现切伦科夫和闪烁信号的分离与调节,支持GeV和MeV中微子探测及金属负载中子标记。

Comments 32 pages, 24 figures

详情
AI中文摘要

水基液体闪烁体(WbLS)在十多年前被提出作为一种新型探测器介质,可能允许分离和调节切伦科夫信号与闪烁信号的相对比例。采用该技术的探测器可以大规模结合GeV级和MeV级中微子探测。此外,这种材料的金属负载能力使得中子标记成为可能,并允许调整有效粒子包容性。WbLS因其在大型探测器中的应用潜力以及现场修改配置的能力而具有吸引力。在布鲁克海文国家实验室(BNL),已建造了两个原型探测器,质量分别为1吨和30吨,用于理解WbLS的性质和稳定性。我们在此介绍30吨原型探测器的设计、安装和运行。未来出版物将介绍从两个探测器收集的数据分析结果。

英文摘要

Water-based Liquid Scintillator (WbLS) was proposed over a decade ago as a novel detector medium that might allow the separation and tuning of the relative ratio of the Cherenkov and Scintillation signals. A detector deploying this technology could combine GeV-scale and MeV-scale neutrino detection at scale. Furthermore, the metal-loading capability of such a material enables neutron tagging and allows the effective particle containment to be tuned. WbLS is attractive both for the potential to use it in large detectors and the ability to modify the configuration in situ. At Brookhaven National Laboratory (BNL), two prototypes have been built for understanding WbLS properties and stability, with masses of 1-ton and 30-ton, respectively. We present here the 30-ton prototype detector design, installation, and operation. Results from the analysis of data collected in the two detectors will follow in future publications.

2606.18598 2026-06-18 cs.AI cs.LG 新提交 85%

Optimizing Lithium Production Decisions under Geological, Demand, and Pricing Uncertainties: A POMDP Framework for Multi-Objective Decision Making

在地质、需求和定价不确定性下优化锂生产决策:多目标决策的POMDP框架

Anna C. Edmonds, Mansur M. Arief, Robert J. Moss, Mykel J. Kochenderfer, Jef Caers

发表机构 * Computer Science Department, Stanford University(斯坦福大学计算机科学系) Aeronautics and Astronautics Department, Stanford University(斯坦福大学航空与航天系) Earth and Planetary Sciences Department, Stanford University(斯坦福大学地球与行星科学系)

专题命中 其他科学智能 :POMDP框架优化锂矿开采决策,涉及地质与定价

AI总结 提出POMDP框架,通过信念状态规划优化锂矿开采决策,动态适应价格不确定性,实现更高需求满足和更平衡的经济环境效益。

Comments 24 pages, 14 tables, 4 figures

详情
AI中文摘要

锂生产中的决策制定具有挑战性,无论是从投资者角度还是战略生产角度。决定开采哪些矿山以及何时开采,不仅涉及地质和价格不确定性,还涉及提取方法选择的复杂性,从直接锂提取到硬岩开采。先前的工作探索了该问题的模型和优化采矿决策的不同方法;这些模型没有考虑定价不确定性、需求不确定性或提取锂的不同采矿技术。将不同的定价模型和提取技术纳入这些模型,可以制定更稳健的策略,不仅决定何时何地开采矿山,还决定采用哪种生产方法。我们将问题表述为部分可观测马尔可夫决策过程(POMDP),并使用信念状态规划方法求解以获得最优决策。在我们的研究中,我们表明POMDP求解器通过信念状态规划和显式不确定性管理,动态适应变化的锂价格机制(静态、线性、指数和随机),优于人类启发式启发法。通过优化勘探、生产和技术选择的顺序,该框架在所有不同的定价和矿床情景下,在项目生命周期内实现了更高的需求满足和更平衡的经济环境结果。

英文摘要

Decision making in lithium production is challenging, whether from an investor's perspective or a strategic production standpoint. Determining which mines to open and when to open them involves not only geological and price uncertainties, but also complexities around the choice of extraction method, from direct lithium extraction to hard rock mining. Prior work explored models of this problem and different methods to optimize mining decisions; these models did not account for uncertainty in pricing, uncertainty in demand, or different mining technologies to extract lithium. Incorporating different pricing models and extraction technology into these models enables more robust strategies for determining not only when and where to open a mine, but also which method of production to pursue. We frame the problem as a partially observable Markov decision process (POMDP) and solve using belief state planning methods to get optimal decision making. In our study, we show that POMDP solvers outperform human inspired heuristics by dynamically adapting to shifting lithium price regimes (static, linear, exponential, and stochastic) through belief state planning and explicit uncertainty management. By optimally sequencing exploration, production, and technology choice, the framework achieves higher demand fulfillment and more balanced economic environmental outcomes over the projects lifetime in all different pricing and deposit scenarios.

2606.18648 2026-06-18 physics.comp-ph 新提交 85%

Deep Research in Physical Sciences: A Multi-Agent Framework and Comprehensive Benchmark

物理科学中的深度研究:多智能体框架与综合基准

Yigeng Jiang, Tengchao Yang, Taoyong Cui, Jiaxing Wan, Yuan Wang, Weida Wang, Zhiyu Liu, Chuyi Peng, Binzhao Luo, Maoli Gao, Huaihai Huang, Yuqianer Zeng, Ziyang Zheng, Dongchen Huang, Chao Chen, Zichao Liu, Weiping Shen, Shuchen Pu, Siyu Zhou, Runmin Ma, Yusong Hu, Fei Chao, Bo Zhang, Xiawu Zheng, Zifu Wang, Lei Bai, Yunqi Cai, Shufei Zhang

专题命中 其他科学智能 :物理科学基准PhySciBench,LLM评估

AI总结 提出PhySciBench基准评估LLM在物理科学中的深度研究能力,并开发DelveAgent多智能体框架,通过自适应规划、双粒度记忆和分层反思机制提升准确率并降低推理成本。

Comments 19 pages, 5 figures, 1 table;

详情
AI中文摘要

深度研究智能体是基于大型语言模型(LLM)的系统,专为自主、多步骤的科学推理而设计,在加速物理科学研究方面具有巨大潜力。然而,目前缺乏对其在该领域能力的全面深入评估。为填补这一空白,我们引入了PhySciBench,一个与物理科学研究高度相关的基准,包含200个专家策划的问题,涵盖物理和化学,分布在反映真实科学工作流程的六个任务类别中。对最先进模型和智能体系统在PhySciBench上的评估显示性能有限;即使是最强的基线Gemini Deep Research,准确率也仅为33.5%。对失败案例的分析发现了三个反复出现的缺陷:扩展推理链的脆弱性、跨步骤的知识迁移有限以及缺乏基于物理的自验证。受这些发现启发,我们开发了DelveAgent,一个模块化的多智能体框架,配备自适应规划循环、双粒度记忆和分层物理接地反思机制。在四个科学基准上,DelveAgent将准确率提高了最多7.5个百分点,同时将推理成本降低到最强基线的大约三分之一。这些结果确立了PhySciBench作为评估物理科学中AI系统关键基准的重要性,并表明架构专业化可以有效增强自主科学研究的可靠性。

英文摘要

Deep research agents are Large Language Model (LLM)-based systems designed for autonomous, multi-step scientific reasoning, and they hold immense potential for accelerating research in the physical sciences. However, comprehensive and in-depth evaluations of their capabilities within this domain remain lacking. To address this gap, we introduce PhySciBench, a benchmark highly relevant to physical science research, comprising 200 expert-curated questions, balanced between physics and chemistry, across six task categories that reflect real-world scientific workflows. Evaluations of state-of-the-art models and agent systems on PhySciBench reveal limited performance; even the strongest baseline, Gemini Deep Research, achieves an accuracy of only 33.5%. Analysis of failure cases identifies three recurrent deficiencies: fragility in extended reasoning chains, limited knowledge transfer across steps, and a lack of physics-grounded self-verification. Motivated by these findings, we develop DelveAgent, a modular multi-agent framework equipped with an adaptive planning loop, dual-granularity memory, and a hierarchical physics-grounded reflection mechanism. Across four scientific benchmarks, DelveAgent improves accuracy by up to 7.5 percentage points while reducing inference costs to approximately one-third of the strongest baseline. These results establish the significance of PhySciBench as a critical benchmark for evaluating AI systems in the physical sciences and demonstrate that architectural specialization can effectively enhance the reliability of autonomous scientific research.

2606.18296 2026-06-18 physics.med-ph 新提交 85%

AI-Driven Lumped-Element Modeling of Human Respiratory System for Studying Voice Mechanics

AI驱动的呼吸系统集总参数建模用于研究发声力学

Maruf Md Ikram, Maryam Naghibolhosseini, Mohsen Zayernouri

专题命中 其他科学智能 :AI驱动呼吸系统建模,发声力学

AI总结 提出基于物理的呼吸、发声和发音子系统模型,结合深度学习提取的声门面积波形,首次模拟发声时的呼吸动力学,预测无法直接测量的声门下压力分布。

Comments 40 pages, 18 figures

详情
AI中文摘要

开发了一个基于物理的预测模型,涵盖人类呼吸、发声和发音子系统,用于模拟语音产生。将肺、可压缩气道和声带表示为弹簧-阻尼-质量控制的活塞-气缸系统,我们的数学模型稳健地捕捉了发声期间气道的复杂动力学。研究了肺组织和可压缩气道的非线性粘弹性特性,产生了一个响应灵敏且富有表现力的基线呼吸模型,能够进一步扩展为针对呼吸和发声的患者特异性模型。随后,将所得框架与声道机械表示集成,该表示由声门面积波形(GAW)控制,GAW捕捉了持续发声期间声带的运动。GAW通过深度学习从一名正常发音参与者的喉部高速视频内窥镜数据中提取。我们的新范式超越了呼吸系统建模,实现了AI驱动的发声建模,包括声带动力学、与流动空气动力学的相互作用以及由声带振荡行为引起的流动阻力。我们的研究首次实现了发声的呼吸动力学模拟,直接推进了声门下压力分布(无法在人体中直接无创测量)、动态阻力以及发声期间能量传递机制的预测,在发声力学领域具有重要意义。

英文摘要

A predictive physics-based model of human respiratory, phonatory, and articulatory subsystems is developed to simulate voice production. Representing lungs, compressible airways, and vocal folds as spring-damper-mass controlled piston-cylinder systems, our mathematical model robustly captures the intricate dynamics of airways during phonation. The nonlinear viscoelastic properties of lung tissues and compressible airways were investigated, yielding a responsive and expressive baseline respiratory model with the capability to further extend into a patient-specific model for both respiration and phonation. The resulting framework was subsequently integrated with a mechanical representation of the vocal tract, governed by the glottal area waveform (GAW) capturing the motion of vocal folds during sustained phonation. The GAW is extracted from laryngeal high-speed videoendoscopy data of a normophonic participant using deep learning. Our novel paradigm transcends beyond modeling the respiratory system, enabling AI-driven modeling of vocalization, including vocal fold dynamics, interactions with flow aerodynamics, and flow resistances, induced by the oscillatory behavior of vocal folds. Our investigation leads to the first-ever simulation of respiratory dynamics for vocalization, directly advancing the prediction of subglottal pressure distributions, impossible to measure directly and noninvasively in humans, dynamic resistances, and energy transfer mechanisms during phonation in voice mechanics.

2605.21115 2026-06-18 cs.DC cs.LG 版本更新 85%

Automated Byzantine-Resilient Clustered Decentralized Federated Learning for Battery Intelligence in Connected EVs

自动化抗拜占庭攻击的集群化去中心化联邦学习用于连接电动车的电池智能

Mouhamed Amine Bouchiha, Abdelaziz Amara Korba, Yacine Ghamri-Doudane

发表机构 * SAMOVAR, Télécom SudParis(SAMOVAR,法国电信南巴黎学院) Department of Computer Science, German University of Technology in Oman (GUtech)(阿曼技术大学计算机科学系) L3i, La Rochelle University(拉罗什大学L3i)

专题命中 其他科学智能 :提出联邦学习框架用于电动车电池智能,属于科学智能应用。

AI总结 本文提出了一种自动化抗拜占庭攻击的集群化去中心化联邦学习框架ABC-DFL,用于连接电动车的电池智能,通过引入动态Quorum拜占庭容错协议和基于或acles的聚合层,提高信任、安全和自动化水平,FLECA协议通过适应性阈值过滤恶意更新,有效缓解拜占庭攻击。

Comments 16 pages, 8 figures

详情
AI中文摘要

联邦学习(FL)已作为一种有前景的范式,用于管理智能交通系统(ITS)中的电动汽车(EV)电池数据,使其能够执行隐私保护的任务,如异常检测和容量估计。然而,大多数现有框架依赖于集中式聚合方案,这在安全性和信任方面存在关键限制。为了应对这些挑战,我们提出了ABC-DFL,一种用于连接电动车的自动化抗拜占庭攻击的集群化去中心化联邦学习(C-DFL)框架。所提出的激励驱动的C-DFL系统用开放许可的区块链取代中央服务器,特征新的动态Quorum拜占庭容错(QBFT)协议和基于或acles的聚合层,以增强信任、安全和自动化。ABC-DFL的核心是FLECA(过滤分层增强聚合),一种稳健的分层聚合协议,通过让每个EV使用基于其参考模型更新偏差的适应性阈值过滤恶意更新来缓解拜占庭攻击。Oracle节点负责跨组聚合,利用稳健的聚类来隔离和聚合来自可信EV组的模型更新。全面的实验评估显示,FLECA在良好条件下与FedProx收敛,并在适应性对抗场景中显著优于现有防御措施,攻击影响评分低于0.10。此外,多个多任务模型学习实验验证了激励机制的有效性和公平性。最后,链上和链下基准验证了ABC-DFL的实用性。

英文摘要

Federated learning (FL) has emerged as a promising paradigm for managing electric vehicle (EV) battery data in intelligent transportation systems (ITS), enabling privacy-preserving tasks such as anomaly detection and capacity estimation. However, most existing frameworks rely on centralized aggregation schemes, which pose critical limitations in terms of security and trust. To address these challenges, we propose ABC-DFL, an automated Byzantine-resilient clustered decentralized federated learning (C-DFL) framework for connected EVs. The proposed incentive-driven C-DFL system replaces the central server with an open-permissioned blockchain, featuring a new dynamic Quorum Byzantine Fault Tolerance (QBFT) protocol and an oracle-based aggregation layer, to enhance trust, security, and automation. At the core of ABC-DFL lies FLECA (Filtered Layered Enhanced Clustering Aggregation), a robust hierarchical aggregation protocol that mitigates Byzantine attacks by having each EV filter malicious updates using an adaptive threshold based on deviations from its reference model update. Oracle nodes, responsible for inter-group aggregation, employ robust clustering to isolate and aggregate model updates from trustworthy EV groups. Comprehensive experimental evaluations demonstrate that FLECA matches FedProx convergence under benign conditions and significantly outperforms existing defenses with attack impact scores below 0.10 in adaptive adversarial scenarios. Furthermore, several learning experiments with multitask models confirm the effectiveness and fairness of the incentive mechanism. Finally, on-chain and off-chain benchmarks validate the practicality of ABC-DFL.

2603.17777 2026-06-18 cond-mat.supr-con cond-mat.mtrl-sci 85%

Reaching Quantum Critical Point by Adding Non-magnetic Disorder in Single Crystals of Superconductor $(\text{Ca}_x\text{Sr}_{1-x})_3\text{Rh}_4\text{Sn}_{13}$

通过添加非磁性杂质达到量子临界点:在超导体$(\text{Ca}_x\text{Sr}_{1-x})_3\text{Rh}_4\text{Sn}_{13}$单晶中

Elizabeth H. Krenkel, Makariy A. Tanatar, Romain Grasset, Marcin Kończykowski, Shuzhang Chen, Cedomir Petrovic, Alex Levchenko, Ruslan Prozorov

专题命中 其他科学智能 :研究超导体量子临界点,属于凝聚态物理

AI总结 研究通过非磁性杂质调控超导体$(\text{Ca}_x\text{Sr}_{1-x})_3\text{Rh}_4\text{Sn}_{13}$的电阻率,发现量子临界点位于x=0.75至0.85之间,支持杂质可驱动系统进入量子临界区的观点。

Journal ref Phys. Rev. Research 8, 023183 (2026)

详情
AI中文摘要

Remeika系列超导体$(\text{Ca}_x\text{Sr}_{1-x})_3\text{Rh}_4\text{Sn}_{13}$显示出罕见的非磁性量子临界点(QCP),与超导性‘穹顶’下的连续电荷密度波(CDW)和结构相变相关。本文通过2.5 MeV电子辐照引入非磁性点状杂质,抑制CDW并驱动系统达到甚至超越QCP。这一结论基于电阻率ρ(T)随杂质量增加从费米液体到非费米液体区域的演变。在CDW侧,低于建议的QCP浓度x_c=0.9时,添加的杂质导致ρ(T)中线性项增大而二次项减小。在长程CDW秩序被抑制至T=0的剂量下,观察到几乎完美的T-线性依赖性,符合预期。我们细化了该系统的QCP位置,将其置于x=0.75至0.85之间。结果支持杂质可调控系统进入量子临界区的观点,并遵循Imry和Ma的论证,任何有序相都易受淬火杂质扰动。通过可控引入,这种杂质成为一种新的非热调控参数,可能适用于多种不同系统。

英文摘要

The Remeika series superconductor, $(\text{Ca}_x\text{Sr}_{1-x})_3\text{Rh}_4\text{Sn}_{13}$, shows a rare nonmagnetic quantum critical point (QCP) associated with the continuous charge-density wave (CDW) and structural transition under the ``dome'' of superconductivity achieved by tuning composition and applying pressure. Here we use a nonmagnetic point-like disorder induced by 2.5 MeV electron irradiation to suppress the CDW and drive the system to and even beyond the QCP. This conclusion is based on a clear evolution of temperature-dependent resistivity, $ρ\left(T\right)$, from the Fermi liquid to the non-Fermi liquid regime with increasing amount of disorder. Starting on the CDW side, below the suggested QCP concentration of $x_c=0.9$, added disorder resulted in a progressively larger linear term and a reduced quadratic term in $ρ\left(T\right)$. Nearly perfect $T-$linear dependence is observed at the dose at which long-range CDW order is suppressed to $T=$0, consistent with the expectations. We refine the QCP location in this system and place it in the interval between $x=$0.75 and 0.85. Our results strongly support the concept that the disorder can tune the system to the quantum critical regime and even beyond. It follows from the argument by Imry and Ma that any ordered phase is unstable toward quenched disorder. Introduced in a controlled way, this disorder becomes a novel non-thermal tuning parameter likely applicable to a variety of different systems.

2603.10412 2026-06-18 cond-mat.str-el cond-mat.mtrl-sci 85%

Long-range magnetic order with disordered spin orientations in a high-entropy antiferromagnet

高熵反铁磁体中长程磁序与无序自旋取向

Yao Shen, Guangkai Zhang, Qinghua Zhang, Xuejuan Gui, Yu Zhang, Heemin Lee, Cheng-Tai Kuo, Jun-Sik Lee, Ronny Sutarto, Feng Ye, Zhao Pan, Xiaomei Qin, Jinchen Wang, Tianping Ying, Youwen Long

专题命中 其他科学智能 :高熵反铁磁体中的长程磁序,属于凝聚态物理

AI总结 研究发现高熵材料中存在长程反铁磁序,尽管原子无序,但四种过渡金属元素协同稳定了无序自旋取向的磁序,揭示了复杂磁系统的新机制。

Comments 10 pages, plus references, 1 table, 4 figures, and Supplementary information, accepted for publication in Nature Communications

Journal ref Nature Communications 17, 3558 (2026)

详情
AI中文摘要

磁性系统中的无序通常会抑制长程有序,促进短程状态如磁性玻璃和磁簇。这在高熵材料中尤为显著,其特征是局部磁性实体和交换相互作用的随机分布。然而,在罕见情况下,高熵系统中仍可保持长程磁序,而微观特性及机理仍不明确,尤其是单个元素的磁性行为。本文结合中子衍射和共振软X射线散射,对高熵蜂窝晶格范德瓦尔材料(Mn1/4Fe1/4Co1/4Ni1/4)PS3的磁序进行了元素特异性研究。尽管存在显著的原子无序,低于72 K时仍观察到长程锯齿状反铁磁序,所有四种过渡金属元素参与统一相变。然而,不同元素的自旋取向各异,归因于单离子各向异性和交换相互作用的竞争。本研究展示了一种新型长程磁序,具有无序自旋取向,由高熵磁体中不同磁性元素协同稳定,为理解复杂磁系统提供了新范式。

英文摘要

Disorder in magnetic systems typically suppresses long-range order, promoting short-range states such as spin glasses and magnetic clusters. This is particularly prominent in high-entropy materials, characterized by the random distributions of local magnetic entities and exchange interactions. However, in rare exceptions, long-range magnetic order can persist in high-entropy systems, while the microscopic characters and underlying mechanisms remain elusive, especially the magnetic behaviors of individual elements. Here, combining neutron diffraction and resonant soft x-ray scattering, we have conducted an element-specific investigation into the magnetic order of a high-entropy honeycomb-lattice van der Waals material (Mn1/4Fe1/4Co1/4Ni1/4)PS3. Despite significant atomic disorder, long-range zigzag antiferromagnetic order is observed below 72 K, with all four transition-metal elements participating in a unified phase transition. However, the spin orientations of various elements are distinct, attributed to the competition between single-ion anisotropies and exchange interactions. Our findings showcase a novel form of long-range magnetic order with disordered spin orientations, which is synergically stabilized by distinct magnetic elements in a high entropy magnet, offering a new paradigm for understanding complex magnetic systems.

2602.19591 2026-06-18 cs.LG cs.AI 版本更新 85%

Detecting High-Potential SMEs with Heterogeneous Graph Neural Networks

使用异构图神经网络检测高潜力中小企业

Yijiashun Qi, Hanzhe Guo, Yijiazhen Qi

发表机构 * University of Michigan(密歇根大学) The University of Hong Kong(香港大学)

专题命中 其他科学智能 :用图神经网络预测中小企业发展潜力,属于科学智能应用

AI总结 提出SME-HGT异构图Transformer框架,利用公开数据构建包含公司、研究主题和政府机构的异构图,预测SBIR第一阶段获奖者能否进入第二阶段,AUPRC达0.621,优于基线模型。

Comments accepted by (ICIIS 2026)

详情
AI中文摘要

中小企业占美国企业的99.9%,贡献44%的经济活动,但系统性地识别高潜力中小企业仍是一个开放挑战。我们提出了SME-HGT,一个异构图Transformer框架,仅使用公开数据预测哪些SBIR第一阶段获奖者将进入第二阶段资助。我们构建了一个异构图,包含32,268个公司节点、124个研究主题节点和13个政府机构节点,通过约99,000条边连接三种语义关系类型。SME-HGT在时间分割测试集上达到0.621±0.003的AUPRC,在五个随机种子上优于MLP基线(0.590±0.002)和R-GCN(0.608±0.013)。在筛选深度为100家公司时,SME-HGT达到89.6%的精确率,比随机选择提升2.14倍。我们的时间评估协议防止信息泄露,对公开数据的依赖确保了可重复性。这些结果表明,公司、研究主题和资助机构之间的关系结构为中小企业潜力评估提供了有意义的信号,对政策制定者和早期投资者具有启示意义。

英文摘要

Small and Medium Enterprises (SMEs) constitute 99.9% of U.S. businesses and generate 44% of economic activity, yet systematically identifying high-potential SMEs remains an open challenge. We introduce SME-HGT, a Heterogeneous Graph Transformer framework that predicts which SBIR Phase I awardees will advance to Phase II funding using exclusively public data. We construct a heterogeneous graph with 32,268 company nodes, 124 research topic nodes, and 13 government agency nodes connected by approximately 99,000 edges across three semantic relation types. SME-HGT achieves an AUPRC of 0.621 0.003 on a temporally-split test set, outperforming an MLP baseline (0.590 0.002) and R-GCN (0.608 0.013) across five random seeds. At a screening depth of 100 companies, SME-HGT attains 89.6% precision with a 2.14 lift over random selection. Our temporal evaluation protocol prevents information leakage, and our reliance on public data ensures reproducibility. These results demonstrate that relational structure among firms, research topics, and funding agencies provides meaningful signal for SME potential assessment, with implications for policymakers and early-stage investors.

2601.18637 2026-06-18 quant-ph cs.LG stat.ML 85%

Universality of Many-body Projected Ensemble for Learning Quantum Data Distribution

多重体投影集合在学习量子数据分布中的普遍性

Quoc Hoan Tran, Koki Chinzei, Yasuhiro Endo, Hirotaka Oshima

发表机构 * Quantum Laboratory, Fujitsu Research, Fujitsu Limited, Kawasaki, Kanagawa 211-8588, Japan(富士通量子实验室,富士通研究,富士通株式会社,神户,神奈川县211-8588,日本)

专题命中 其他科学智能 :量子机器学习中投影集合的普遍性,属于科学智能

AI总结 本文探讨了多重体投影集合框架在量子机器学习中的普遍性,证明了其能近似任意纯态分布,并提出改进训练的增量MPE方法,通过实验验证了其在复杂量子数据分布学习中的有效性。

Comments 21 pages, 6 figures (added Github repository)

Journal ref IJCNN 2026

详情
AI中文摘要

生成量子数据需学习其底层量子分布,这在理论和实践中都面临挑战,但对理解量子系统至关重要。本文通过证明多重体投影集合框架的普遍性定理,回答了量子机器学习中参数化模型能否近似任意量子分布的问题。该定理表明MPE能在1-Wasserstein距离误差内近似任意纯态分布,提供了严格的通用表达性保证,填补了QML的关键理论空白。为提高实用性,我们提出具有层间训练的增量MPE变体。在聚类量子态和量子化学数据集上的数值实验验证了MPE在学习复杂量子数据分布中的有效性。

英文摘要

Generating quantum data by learning the underlying quantum distribution poses challenges in both theoretical and practical scenarios, yet it is a critical task for understanding quantum systems. A fundamental question in quantum machine learning (QML) is the universality of approximation: whether a parameterized QML model can approximate any quantum distribution. We address this question by proving a universality theorem for the Many-body Projected Ensemble (MPE) framework, a method for quantum state design that uses a single many-body wave function to prepare random states. This demonstrates that MPE can approximate any distribution of pure states within a 1-Wasserstein distance error. This theorem provides a rigorous guarantee of universal expressivity, addressing key theoretical gaps in QML. For practicality, we propose an Incremental MPE variant with layer-wise training to improve the trainability. Numerical experiments on clustered quantum states and quantum chemistry datasets validate MPE's efficacy in learning complex quantum data distributions.

2306.16886 2026-06-18 math.NT 85%

Extreme central values of quadratic Dirichlet $L$-functions with prime conductors

二次狄利克雷L函数在素数导数上的极值

Mingyue Fan, Shenghao Hua, Sizhe Xie

专题命中 其他科学智能 :数论中L函数极值下界研究

AI总结 研究素数p≡1 mod 8时L(1/2,χ_p)的下界结果,采用分析方法证明极值下限。

Comments Comments are welcome!

Journal ref The Quarterly Journal of Mathematics, Volume 77, Issue 1, March 2026, Pages 175-199

详情
AI中文摘要

本文证明了当p≡1 mod 8时,二次狄利克雷L函数L(1/2,χ_p)在极值情况下的下界结果。通过分析方法,我们得到了关于这些L函数值的严格下限,为相关数论问题提供了新的理论支持。

英文摘要

In this paper we prove a lower bound result for extremely large values of $L(\frac{1}{2},χ_p)$ with prime numbers $p\equiv 1\pmod 8$.

2507.00771 2026-06-18 math.AG 版本更新 85%

Generic vanishing theory in positive characteristic

正特征中的一般消没理论

Jefferson Baudin

专题命中 其他科学智能 :代数几何中正特征消没理论

AI总结 简化并改进了正特征一般消没理论的基本定理,证明了最大 Albanese 维数的正规簇满足 H^0(X, ω_X) ≠ 0,且若 Alb(X) 是普通的,则 S^0(X, ω_X) ≠ 0。

Comments Final version. To appear in L'Enseignement Mathématique

详情
AI中文摘要

我们简化并改进了正特征一般消没理论的主要基本定理。作为该理论的一个快速推论,我们证明了最大 Albanese 维数的正规簇 $X$ 满足 $H^0(X, \omega_X) \neq 0$,并且如果 $\mathrm{Alb}(X)$ 是普通的,那么 $S^0(X, \omega_X) \neq 0$。

英文摘要

We simplify and improve the main fundamental theorems of positive characteristic generic vanishing theory. As a quick corollary of the theory, we prove that a normal variety $X$ of maximal Albanese dimension satisfies $H^0(X, ω_X) \neq 0$ and that if $\mathrm{Alb}(X)$ is ordinary, then $S^0(X, ω_X) \neq 0$.

2506.24028 2026-06-18 math.AC math.CO math.RA 85%

The Gröbner basis for powers of a general linear form in a monomial complete intersection

关于一般线性形式在单项完全交集中的幂的格罗布纳基一组

Filip Jonsson Kling, Samuel Lundqvist, Fatemeh Mohammadi, Matthias Orth

专题命中 其他科学智能 :数学中Gröbner基与Lefschetz性质

AI总结 本文研究多项式环中几乎完全交集理想,明确描述其在任意术语顺序下的格罗布纳基组结构,通过格子路径与反射操作提供新证明,揭示Artinian单项完全交集在特征零域的强Lefschetz性质,并关联格罗布纳基元素数量与Catalan、Motzkin等数列,拓展量子物理中纠缠检测研究。

Journal ref Trans. Amer. Math. Soc. Ser. B 13 (2026), 339-378

详情
AI中文摘要

我们研究多项式环中的几乎完全交集理想,由所有变量的幂和其和的幂生成。我们的主要结果是,在任何术语顺序下,这些理想缩减格罗布纳基一组的显式描述。我们的方法主要是组合性的,关注初始理想的结构。我们为Artinian单项完全交集的向量空间基中的单项关联一个格子路径,并引入这些路径上的反射操作,从而得到一个关键计数论证。作为结果,我们提供了一个新的证明,表明Artinian单项完全交集在特征零域上具有强Lefschetz性质。我们的结果还提供了关于此类交集在特征p下分类弱Lefschetz性质的长期问题的新见解。此外,我们表明每个次数的格罗布纳基元素数量与几个著名的序列,包括广义Catalan、Motzkin和Riordan数相关,并将这些数与量子物理中自旋系统纠缠检测的研究联系起来。

英文摘要

We study almost complete intersection ideals in a polynomial ring, generated by powers of all the variables together with a power of their sum. Our main result is an explicit description of the reduced Gröbner bases for these ideals under any term order. Our approach is primarily combinatorial, focusing on the structure of the initial ideal. We associate a lattice path to each monomial in the vector space basis of an Artinian monomial complete intersection and introduce a reflection operation on these paths, which enables a key counting argument. As a consequence, we provide a new proof that Artinian monomial complete intersections possess the strong Lefschetz property over fields of characteristic zero. Our results also offer new insights into the longstanding problem of classifying the weak Lefschetz property for such intersections in characteristic $p$. Furthermore, we show that the number of Gröbner basis elements in each degree is connected to several well-known sequences, including the (generalized) Catalan, Motzkin, and Riordan numbers, and connect these numbers to the study of entanglement detection in spin systems within quantum physics.

2411.07434 2026-06-18 math.AP 85%

Stable determination of the first order perturbation of the biharmonic operator from partial data

从部分数据稳定确定双调和算子的一阶扰动

Boya Liu, Salem Selim

专题命中 其他科学智能 :偏微分方程逆问题稳定性估计

AI总结 研究双调和算子在三维及以上领域的一阶扰动的逆边界值问题,通过部分狄利克雷到神经元映射建立对数型稳定性估计。

详情
AI中文摘要

我们考虑在三维及以上有界域中带有一阶扰动的双调和算子的逆边界值问题。假设在边界邻域内已知一阶和零阶扰动,从部分狄利克雷到神经元映射建立这些扰动的对数型稳定性估计。具体而言,测量仅在边界上的任意小开子集进行。

英文摘要

We consider an inverse boundary value problem for the biharmonic operator with the first order perturbation in a bounded domain of dimension three or higher. Assuming that the first and the zeroth order perturbations are known in a neighborhood of the boundary, we establish log-type stability estimates for these perturbations from a partial Dirichlet-to-Neumann map. Specifically, measurements are taken only on an arbitrarily small open subsets of the boundary.

2506.15491 2026-06-18 math.AG 版本更新 85%

On Gorenstein $\mathbb{Q}_p$-rational threefold and fourfold singularities

关于Gorenstein $\mathbb{Q}_p$-有理三维和四维奇点

Jefferson Baudin, Zsolt Patakfalvi, Linus Rösler, Maciej Zdanowicz

专题命中 其他科学智能 :代数几何中奇点分类研究

AI总结 本文证明对于$n\leq 4$且$p>5$,拟Gorenstein $F$-纯且$\mathbb{Q}_p$-有理$n$维奇点是典范的,基于对对数典范奇点dlt修正的对偶复形的分析。

Comments Final version. To appear in Épijournal de Géométrie Algébrique

详情
AI中文摘要

我们证明对于$n \leq 4$且$p > 5$,拟Gorenstein $F$-纯且$\mathbb{Q}_p$-有理$n$维奇点是典范的。这类似于通常的有理Gorenstein奇点是典范的这一事实。证明基于对对数典范奇点的一个dlt修正的对偶复形的仔细分析。$n=4$的结果依赖于对数解消的存在性。

英文摘要

We prove that for $n \leq 4$ and $p > 5$, quasi--Gorenstein $F$--pure and $\mathbb{Q}_p$--rational $n$--fold singularities are canonical. This is analogous to the usual fact that rational Gorenstein singularities are canonical. The proof is based on a careful analysis of the dual complex of a dlt modification of a log canonical singularity. The result for $n = 4$ is contingent upon the existence of log resolutions.

2506.12789 2026-06-18 math.CO 版本更新 85%

Powers of 2 in High-Dimensional Lattice Walks

高维格点游走中2的幂次

Nikolai Beluhov

专题命中 其他科学智能 :组合数学中格点游走的2-adic估值

AI总结 研究高维格点游走中返回原点步数的2-adic估值,揭示其与二进制表示中1的个数的精确关系,并给出各维数下的最佳估计及等号成立条件。

Comments 20 pages

Journal ref Enumerative Combinatorics and Applications, volume 6, issue 2, 2026

详情
AI中文摘要

设 $W_d(n)$ 为 $\mathbb{Z}^d$ 中从原点出发并返回原点的 $2n$ 步游走的数量。我们研究该数的质因数分解中 $2$ 的指数,即 $w_d(n) = \nu_2(W_d(n))$。我们证明,对于每个 $d$,$w_d(n)$ 与 $n$ 的二进制展开中 $1$ 的个数 $s_2(n)$ 之间存在关系。例如,当 $d$ 为奇数时 $w_d(n) = s_2(n)$,当 $\nu_2(d) = 1$ 时 $w_d(n) = 2s_2(n)$;而当 $\nu_2(d) = 2$ 时 $w_d(n) \ge 3s_2(n)$。当 $\nu_2(d) \ge 3$ 时,模式进一步变化。然而,对于每个 $d$,我们给出了 $w_d(n)$ 的最佳类似估计,并描述了所有达到等号的 $n$。我们开发的方法也适用于更广泛的问题,因此可能具有独立意义。

英文摘要

Let $W_d(n)$ be the number of $2n$-step walks in $\mathbb{Z}^d$ which begin and end at the origin. We study the exponent of $2$ in the prime factorisation of this number; i.e., $w_d(n) = ν_2(W_d(n))$. We show that, for each $d$, there is a relationship between $w_d(n)$ and the number $s_2(n)$ of $1$s in the binary expansion of $n$. For example, $w_d(n) = s_2(n)$ if $d$ is odd and $w_d(n) = 2s_2(n)$ if $ν_2(d) = 1$; while $w_d(n) \ge 3s_2(n)$ if $ν_2(d) = 2$. The pattern changes further when $ν_2(d) \ge 3$. However, for each $d$, we give the best analogous estimate of $w_d(n)$ together with a description of all $n$ where equality is attained. The methods we develop apply to a wider range of problems as well, and so might be of independent interest.

2506.03987 2026-06-18 math.DG math.AP math.CV 85%

An Aubin-Yau theorem for transversally Kähler foliations

横截凯勒叶状结构的Aubin-Yau定理

Vlad Marchidanu

专题命中 其他科学智能 :微分几何中Aubin-Yau定理推广

AI总结 本文在横截凯勒叶状结构中推导了Aubin-Yau定理,通过同调定向条件,简化了Vaisman Aubin-Yau定理的证明。

Journal ref Annals of Global Analysis and Geometry, 70, 3 (2026)

详情
AI中文摘要

横截凯勒叶状结构是凯勒流形的推广,出现在复非凯勒环境中。本文给出了经典Aubin-Yau定理证明方法在横截凯勒情况下的自包含证明,并应用该结果得到已知Vaisman Aubin-Yau定理的新简化证明。

英文摘要

Transversally Kähler foliations are a generalisation of Kähler manifolds, appearing naturally in the complex non-Kähler setting. We give a self-contained proof of how the classical methods used in the proof of the Aubin-Yau theorem adapt to the transversally Kähler case under the homological orientability condition. We apply this result to obtain a new, simpler proof of the already known Vaisman Aubin-Yau theorem.

2506.03806 2026-06-18 math.RT 版本更新 85%

Matrix representations of the twisted virtual braid group and its extensions

扭曲虚拟辫子群及其扩张的矩阵表示

Mohamad N. Nasser, Vaibhav Keshari, Madeti Prabhakar

专题命中 其他科学智能 :辫子群矩阵表示分类研究

AI总结 本文分类了扭曲虚拟辫子群TVB_2到GL_3(C)的复局部表示,分为八种不可信类型,并分析了可约化性;还研究了TVB_n (n≥3)的齐次局部表示,识别出七种类型;以及STVB_2到M_3(C)的十三种表示类型,并讨论了扩张的非Φ型性质。

详情
AI中文摘要

本文分类了扭曲虚拟辫子群 $TVB_2$ 到 $\mathrm{GL}_3(\mathbb{C})$ 的复局部表示。结果表明,此类表示分为八种类型,所有类型都是不可信且可约化为二维表示。进一步可约化为一维表示的情况针对特定类型进行了分析。本文还研究了 $n \geq 3$ 时 $TVB_n$ 到 $\mathrm{GL}_{n+1}(\mathbb{C})$ 的复齐次局部表示,识别出七种不可信类型。此外,将奇异扭曲虚拟辫子群 $STVB_2$ 到 $\mathrm{M}_3(\mathbb{C})$ 的复局部表示分类为十三种不可信类型。最后,本文证明了并非所有 $TVB_2$ 表示到 $STVB_2$ 的复局部扩张都符合 $\Phi$ 型扩张。

英文摘要

This paper classifies complex local representations of the twisted virtual braid group, $TVB_2$, into $\mathrm{GL}_3(\mathbb{C})$. It shows that such representations fall into eight types, all of which are unfaithful and reducible to a two-dimensional representation. Further reducibility to a one-dimensional representation is analyzed for specific types. The paper also examines complex homogeneous local representations of $TVB_n$ into $\mathrm{GL}_{n+1}(\mathbb{C})$ for $n \geq 3$, identifying seven unfaithful types. Additionally, complex local representations of the singular twisted virtual braid group, $STVB_2$, into $\mathrm{M}_3(\mathbb{C})$ are classified into thirteen unfaithful types. Finally, the paper demonstrates that not all complex local extensions of $TVB_2$ representations to $STVB_2$ conform to a $Φ$-type extension.

2502.07641 2026-06-18 stat.ME stat.ML 版本更新 85%

Distributional Instrumental Variable Method

分布工具变量方法

Anastasiia Holovchak, Sorawit Saengkyongam, Nicolai Meinshausen, Xinwei Shen

专题命中 其他科学智能 :提出分布工具变量方法,用于因果推断

AI总结 提出分布工具变量方法,利用生成建模在非线性工具变量设置中估计整个干预分布,并证明其可识别性优于传统方法。

详情
AI中文摘要

工具变量方法常用于存在未测量混杂因素时推断因果效应。现有方法通常旨在估计平均因果效应,而少数方法关注分位数处理效应。本文的目标是估计整个干预分布。我们提出了一种称为分布工具变量(DIV)的方法,该方法在非线性工具变量设置中使用生成建模。我们在一般假设下建立了干预分布的可识别性,并展示了一个“欠识别”案例,其中DIV可以识别因果效应,而两阶段最小二乘法无法识别。我们的实证结果表明,DIV方法在广泛的模拟数据上表现良好,在均值或分位数处理效应的可识别性和估计误差方面优于现有工具变量方法。此外,我们将DIV应用于一个经济数据集,以检验制度质量与经济发展之间的因果关系,结果与原研究吻合良好。我们还将DIV应用于一个单细胞数据集,研究在未见干预下预测基因表达的泛化性和稳定性。DIV的软件实现可在R和Python中获取。

英文摘要

The instrumental variable (IV) approach is commonly used to infer causal effects in the presence of unmeasured confounding. Existing methods typically aim to estimate the mean causal effects, whereas a few other methods focus on quantile treatment effects. The aim of this work is to estimate the entire interventional distribution. We propose a method called Distributional Instrumental Variable (DIV), which uses generative modelling in a nonlinear IV setting. We establish identifiability of the interventional distribution under general assumptions and demonstrate an 'under-identified' case, where DIV can identify the causal effects while two-step least squares fails to. Our empirical results show that the DIV method performs well for a broad range of simulated data, exhibiting advantages over existing IV approaches in terms of the identifiability and estimation error of the mean or quantile treatment effects. Furthermore, we apply DIV to an economic data set to examine the causal relation between institutional quality and economic development and our results align well with the original study. We also apply DIV to a single-cell data set, where we study the generalizability and stability in predicting gene expression under unseen interventions. The software implementations of DIV are available in R and Python.

2606.18420 2026-06-18 cs.LG q-bio.QM stat.ML 新提交 80%

Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

测量噪声限制了非线性模型在生物医学预测中相对于线性模型的优势

Marc-Andre Schulz, Kerstin Ritter

发表机构 * Hertie Institute for AI in Brain Health, University of Tübingen(赫蒂人工智能脑健康研究所,图宾根大学) Tübingen AI Center, University of Tübingen(图宾根人工智能中心,图宾根大学) Department of Psychiatry and Neurosciences, Charité – Universitätsmedizin Berlin(精神病学与神经科学系,柏林夏里特医学院) Bernstein Center for Computational Neuroscience, Berlin(伯恩斯坦计算神经科学中心,柏林) German Center for Mental Health (DZPG), partner site Tübingen(德国心理健康中心(DZPG),图宾根合作站点)

专题命中 其他科学智能 :分析测量噪声对生物医学预测模型的影响

AI总结 本文指出,在生物医学表格数据中,测量噪声会削弱非线性结构,导致非线性模型与线性模型性能相当,并提出了一个精确的超额风险恒等式,揭示了测量可靠性、样本量和特征表示三个条件必须同时满足才能体现非线性优势。

详情
AI中文摘要

在生物医学表格数据上,诸如深度网络、梯度提升树和核方法等灵活模型,在给定相同特征的情况下,反复被线性回归和逻辑回归匹配或击败。通常的反应是将其视为模型方面的不足,需要通过更多数据、更好的架构或调参来修复,假设非线性结构存在而模型未能捕捉到。我们认为,当限制因素是测量而非模型时(这在生物医学中经常发生),这些修复无法奏效。加性噪声模糊了群体最优预测器,并且由于模糊在去除函数的广泛形状之前先去除精细、快速变化的细节,它比线性结构更快地抹去非线性结构。一个k阶交互作用被特征可靠性的k次幂衰减,而线性部分只衰减一次。在生物医学测量典型的可靠性下,即使底层生物学是强非线性的,非线性优势也可能消失,并且噪声所移除的部分无法通过更大的队列或更灵活的模型恢复,只能通过更好的测量。非线性是隐藏的,而非缺失,线性模型与灵活模型之间的平局本身并不能对生物学做出定论。这些片段是经典的,来自测量误差统计、心理测量学和高斯分析,我们将它们组合成一个精确的超额风险恒等式。测量可靠性是与样本量和特征表示并列的三个条件之一,必须对齐才能使灵活模型发挥作用,而它们共同只留下一个狭窄的窗口,大多数生物医学任务落在此窗口之外。在140个英国生物银行任务中,灵活模型与线性模型之间的差距(如果存在)带有预测的噪声特征,并且这三个条件可以通过干预而非仅通过基准测试来分离。

英文摘要

On biomedical tabular data, flexible models such as deep networks, gradient-boosted trees, and kernel methods are repeatedly matched or beaten by linear and logistic regression given the same features. The usual reaction is to treat this as a model-side shortfall, to be fixed with more data, a better architecture, or tuning, on the assumption that the nonlinear structure is there and the model has failed to capture it. We argue that these fixes cannot help when the binding limit is the measurement rather than the model, as it frequently is in biomedicine. Additive noise blurs the population-optimal predictor, and because blurring removes a function's fine, rapidly varying detail before its broad shape, it erases nonlinear structure faster than linear structure. A degree-$k$ interaction is attenuated by the $k$-th power of feature reliability, while the linear part is attenuated only once. At the reliabilities typical of biomedical measurement, the nonlinear advantage can vanish even when the underlying biology is strongly nonlinear, and what the noise removes cannot be recovered by a larger cohort or a more flexible model, only by better measurement. The nonlinearity is hidden, not absent, and a tie between linear and flexible models is not by itself a verdict on the biology. These pieces are classical, drawn from measurement-error statistics, psychometrics, and Gaussian analysis, and we assemble them into an exact excess-risk identity. Measurement reliability is one of three conditions, alongside sample size and feature representation, that must align for a flexible model to help, and together they leave only a narrow window that most biomedical tasks fall outside. Across 140 UK Biobank tasks, the gap between flexible and linear models, where it exists, carries the predicted noise signature, and the three conditions can be separated by intervention but not by a benchmark alone.

2606.18784 2026-06-18 physics.optics 新提交 80%

Volumetric nanoscale localization using engineered point spread functions in light sheet microscopy

利用光片显微镜中的工程点扩散函数实现体积纳米级定位

R. E. Bautista Gonzalez, R. Mouthaan, A. Upadhya, D. J. X. Chow, K. R. Dunning, K. Dholakia

专题命中 其他科学智能 :光片显微镜纳米定位,工程点扩散函数

AI总结 结合双艾里工程点扩散函数与双光子光片荧光显微镜,实现大体积三维纳米级定位,横向精度<20 nm、轴向42 nm,并有望在毫米级样本中达到亚10 nm。

详情
AI中文摘要

在光学显微镜中,跨越大生物体积的纳米级三维定位仍然是一个突出的挑战,现有方法通常受限于成像速度、体积视场和定位精度,难以同时满足。在这里,我们通过将双艾里工程点扩散函数与双光子光片荧光显微镜相结合,克服了这些限制,实现了在整个大体积视场中的纳米级定位。我们的框架明确考虑了生物荧光团的宽带荧光发射特性,确保了在真实成像条件下的准确定位。我们在295 μm x 330 μm x 100 μm的体积上实现了横向<20 nm、轴向42 nm的定位精度,并有望在毫米级样本中实现亚10 nm定位。在荧光珠模型和活体哺乳动物卵母细胞中的实验证实了在受控和生物复杂环境中的稳健性能。这些结果为基于定位的超分辨率成像在生物相关体积上的可扩展策略奠定了基础,弥合了纳米级精度与大体积显微成像之间的差距。

英文摘要

Nanoscale three-dimensional localization across large biological volumes remains an outstanding challenge in optical microscopy, with existing approaches typically limited by imaging speed, volumetric field of view and localization precision when required simultaneously. Here, we overcome these limitations by combining a twin Airy engineered point spread function with two-photon light sheet fluorescence microscopy, enabling nanoscale localization throughout large volumetric fields of view. Our framework explicitly incorporates the broadband fluorescence emission characteristic of biological fluorophores, ensuring accurate localization under realistic imaging conditions. We achieve localization precisions of $<20$~nm laterally and 42~nm axially over volumes measuring 295~$μ$m x 330~$μ$m x 100~$μ$m , with a projected path to sub-10-nm localization in millimeter-scale specimens. Experiments in fluorescent bead phantoms and live mammalian oocytes confirm robust performance in both controlled and biologically complex environments. These results establish a scalable strategy for localization-based super-resolution imaging across biologically relevant volumes, bridging the gap between nanoscale precision and large-scale volumetric microscopy.

2606.18711 2026-06-18 physics.optics quant-ph 新提交 80%

Integration of diamond nanobeams with SnVs on Al2O3 waveguides for scalable quantum photonic chip application

用于可扩展量子光子芯片应用的SnV金刚石纳米梁与Al2O3波导的集成

Yeting Yang, Ryota Kitagawa, Tetsuya Miyatake, Masaharu Hida, Naoki Fushimi, Koki Kaminaka, Takuto Yamaguchi, Toshiki Iwai, Itsuki Takagi, Hidetsugu Matsukiyo, Satomi Ishida, Satoshi Iwamoto, Manabu Ohtomo, Toshiyuki Miyazawa, Kenichi Kawaguchi, Ryoichi Ishihara, Shintaro Sato

专题命中 其他科学智能 :金刚石SnV色心与波导集成,量子光子

AI总结 本文通过双层制造工艺,将含SnV色心的金刚石纳米梁与Al2O3波导集成,实现了光学耦合,并观察到导出的SnV-发射,展示了可扩展的集成方法。

Comments 12 pages, 3 figures, submissted to JJAP

详情
AI中文摘要

金刚石中的锡空位(SnV)中心是集成量子光子学中有前途的固态量子比特。在这里,我们制造并表征了一种包含SnV中心的Al2O3双锥波导结构上的金刚石,展示了金刚石纳米梁与底层Al2O3波导之间的光学耦合。这些器件采用与晶圆级光刻兼容的双层制造方法实现。在所有光学活性器件中观察到清晰的导出的SnV-发射,表明集成结构中有效的光学耦合。这些结果展示了一种可扩展的制造方法,用于将金刚石色心与光子波导集成。

英文摘要

Tin vacancy (SnV) centers in diamond are promising solid state qubits for integrated quantum photonics. Here, we fabricate and characterize a diamond on Al2O3 dual taper waveguide structure containing SnV centers, demonstrating optical coupling between the diamond nanobeam and the underlying Al2O3 waveguide. The devices are realized using a bilayer fabrication approach compatible with wafer scale lithography. Clear guided SnV- emission is observed in all optically active devices, indicating effective optical coupling in the integrated structure. These results demonstrate a scalable fabrication approach toward integrating diamond color centers with photonic waveguides.

2606.18604 2026-06-18 physics.bio-ph 新提交 80%

A symmetric relaxation method for entire two-dimensional cellular networks and its implications

整个二维细胞网络的对称松弛方法及其意义

Kai Xu, Lifan Weng, Zihan Wang, Yuyang Lian, Bin Huang

专题命中 其他科学智能 :二维细胞网络松弛,对称松弛方法

AI总结 提出一种适用于内部和边缘顶点的对称松弛方法,模拟二维细胞网络松弛,验证了von Neumann-Mullins定律等,并揭示了对称松弛通过减少短边抑制T1拓扑转变。

详情
AI中文摘要

为了模拟整个二维细胞网络的松弛,本研究提出了一种适用于内部和边缘顶点的对称松弛方法。这两类顶点的松弛由相关细胞的中心角对称性和每个顶点的角度对称性决定,但主要考虑因素不同。使用具有不同不规则性的修剪Voronoi网络作为松弛模拟的初始网络。特别地,我们提出了一种正六边形无序化方法来生成Voronoi网络,并发现不规则性值为1的网络内部细胞表现出保守的边数分布,正如在其他二维细胞网络中发现的那样。模拟结果对内部和边缘细胞均符合von Neumann-Mullins定律,并且包含几何修正项的修正方程显著提高了预测质量。还再现了Aboav-Weaire定律和Lewis定律,其中后者显示松弛细胞趋向于椭圆的极大内接多边形。对边长、内角和形状指数的分析表明,对称松弛通过减少短边同时增加相邻细胞间的面积差异来抑制T1(邻居交换)拓扑转变。研究结果表明,当力不平衡克服对称松弛的稳定效应时,可能触发T1事件,这为二维泡沫中的T1提供了可能的机制解释。

英文摘要

To simulate the relaxation of an entire 2D cellular network, this study proposes a symmetric relaxation method for both inner and marginal vertices. The relaxations of these two types of vertices are determined by the central angle symmetry of associated cells and the angle symmetry at each vertex, but with different major considerations. Trimmed Voronoi networks with varying irregularity are used as initial networks for the relaxation simulation. In particular, we propose a regular hexagon disordering method to generate Voronoi networks and find that the inner cells of networks with an irregularity value of one exhibit a conserved edge number distribution, as found in other 2D cellular networks. Simulation results agree with the von Neumann-Mullins law for both inner and marginal cells, and a modified equation including a geometric correction term significantly improves prediction quality. The Aboav-Weaire law and Lewis law are also reproduced, with the latter showing that relaxed cells tend to approach the ellipses' maximum inscribed polygons. Analysis of edge length, interior angle, and shape index reveals that symmetric relaxation inhibits T1 (neighbour exchange) topological transitions by reducing short edges while increasing area disparity among neighbouring cells. The findings suggest that T1 events may be triggered when force disequilibrium overcomes the stabilising effect of symmetric relaxation, providing a possible mechanistic explanation for T1 in 2D foams.

2606.18391 2026-06-18 physics.optics 新提交 80%

Constant sensitivity birefringence metrology using vector vortex beams

利用矢量涡旋光束实现恒定灵敏度的双折射测量

Gabriela Flores-Cova, Daniel Salamanca-Roldán, Carmelo Rosales-Guzmán, Daniel F. Urrego, Juan P. Torres

专题命中 其他科学智能 :矢量涡旋光束双折射测量,量子估计

AI总结 提出利用矢量涡旋光束进行双折射测量,通过量子估计理论证明相位估计灵敏度与未知相位无关,实验验证了该方法的优越性。

详情
AI中文摘要

差分干涉对比(DIC)显微镜和手性分析是两种测量双折射的成像技术,即样品在两个正交偏振上引入的相位差。传统方法使用高斯光束,并通过偏振变化推断双折射,导致相位估计灵敏度依赖于未知相位。我们展示了一种新型双折射探测器。它利用矢量涡旋光束,这是一种携带相反轨道角动量(OAM)光学模式的结构光。使用量子估计理论工具,我们证明了相位估计的灵敏度与未知相位值无关,并且在原则上可以比传统方法更好。我们通过实验验证了所提出的方案,展示了结构光在鲁棒且均匀的双折射传感中的潜力。

英文摘要

Differential Interference Contrast (DIC) microscopy and chiral analysis are two imaging techniques that measure the birefringence, i.e., the phase difference introduced by a sample on two orthogonal polarizations. Conventional approaches employ Gaussian beams and infer birefringence from polarization changes, resulting in phase-estimation sensitivities that depend on the unknown phase. We demonstrate here a new type of birefringence detector. It makes use of a vector vortex beam, a type of structured light endowed with optical modes that carry opposite orbital angular momentum (OAM). Using quantum estimation theory tools, we demonstrate that the sensitivity of phase estimation is independent of the value of the unknown phase, and can be even better, in principle, than the conventional approach. We experimentally validate the proposed scheme, demonstrating the potential of structured light for robust and uniform birefringence sensing.

2606.18368 2026-06-18 physics.atom-ph physics.ins-det 新提交 80%

Design and Performance of a Heated Gas Injector for Producing Cold Molecular Beams

用于产生冷分子束的加热气体注入器的设计与性能

Avneesh Verma, Jack Mango, Shungo Fukaya, Arian Jadbabaie, Sepehr Ebadi, Ronald F. Garcia Ruiz, John M. Doyle

专题命中 其他科学智能 :加热气体注入器,冷分子束产生

AI总结 本文设计并实现了一种加热气体注入器,将热气体直接送入低温环境,具有坚固、易安装、热隔离好等优点,并用于产生冷BaF自由基,未来将用于激光冷却含镭分子。

Comments 12 pages, 10 figures

详情
AI中文摘要

我们实现了一种将热气体直接送入低温环境的注入器装置。该注入器具有多个优点,包括坚固性、刚性、安装简便,以及在热(约300 K)铜填充线和冷(<3 K)低温缓冲气体池之间出色的热隔离。在分子精密测量实验的实际条件下,观察到池上的热负荷小于200 mW。聚酰胺-酰亚胺(PAI)管是基本的设计特征。填充线通过环氧树脂粘合到管的一端,而管的另一端通过滑配合连接到黄铜接头上,实现完全真空密封。冷却时,PAI在黄铜接头上收缩,形成低温密封。该注入器易于安装和拆卸,且刚性良好,在冷却至4 K时未观察到填充线相对于池的显著位移。我们通过将SF6通过热填充线流入池中,并将冷He缓冲气体通过单独的低温填充线流入池中,同时激光烧蚀含钡靶材,来表征注入器的性能。这产生了冷BaF自由基,并通过吸收光谱检测。该注入器设计将用于激光冷却含镭分子,如RaF和RaOH,在这些应用中,出于科学和安全原因,需要将SF6和H2O试剂密封地输送到低温缓冲气体池中。这些分子对于研究对称性破缺的核性质和寻找标准模型之外的物理特别有意义。

英文摘要

We realize an injector device that supplies warm gas directly into a cryogenic environment. This injector has several advantageous features, including robustness, rigidity, simple installation, and excellent thermal isolation between a hot ($\sim$300 K) copper fill line and a cold ($<$3 K) cryogenic buffer gas cell. Less than 200 mW heat load on the cell is observed in realistic conditions of a molecular precision measurement experiment. A polyamide-imide (PAI) tube is the essential design feature. The fill line is epoxied to one end of the tube while the other end of the tube is connected to the cell via a slip-fit onto a brass nipple, realizing a complete vacuum-tight seal. PAI contracts on the brass nipple when cooled, forming a cryogenic leak-tight seal. The injector is easily (de-)mountable and rigid, with no significant displacement of the fill line relative to the cell observed during cooldown to 4 K. We characterize injector performance by flowing into the cell $\text{SF}_6$ through the hot fill line and cold $\text{He}$ buffer gas through a separate cryogenic fill line while laser ablating a barium-containing target. This produces cold BaF free radicals, detected using absorption spectroscopy. This injector design will be employed to laser cool radium-containing molecules, such as $\text{RaF}$ and $\text{RaOH}$, where leak-tight delivery of $\text{SF}_6$ and $\text{H}_2\text{O}$ reagents into a cryogenic buffer gas cell is required for scientific and safety reasons. These molecules are of particular interest for the study of symmetry-violating nuclear properties and searches for physics beyond the Standard Model.

2606.18277 2026-06-18 physics.soc-ph q-bio.PE 新提交 80%

Multi-network comparison of between-farm contacts for infectious disease surveillance in swine production

猪生产中用于传染病监测的场间接触的多网络比较

Jason A. Galvis, Nicolas C. Cardenas, Gustavo Machado

专题命中 其他科学智能 :猪场接触网络比较,传染病监测

AI总结 通过比较11种网络类型(车辆移动、动物移动和基于距离的场间接触),发现车辆移动网络(尤其是饲料运输)连接最密集,育肥场在多个网络中充当超级传播者,不同网络识别的高风险农场集合不同,支持将多种传播途径纳入疾病监测。

详情
AI中文摘要

了解猪场如何直接和间接地相互连接对于描述传染病传播至关重要。本研究旨在描述11种网络类型中猪场的连通性,包括车辆移动(即卡车和拖车)、动物移动和基于距离的场间接触,以识别生产类型之间以及可能一致被表征为超级传播者的场之间的联系。卡车和拖车移动网络连接最为密集,尤其是饲料运输,其连接水平比猪移动和基于距离的网络高98.7%至99.7%。这些网络还表现出农场之间最高程度和频率的连接,而聚合卡车网络(包括所有卡车类型)显示出作为连接农场的桥梁的最大潜力。育肥场在所有网络中都与其他农场类型高度互联。母猪场经常被其他农场类型访问,尤其是通过饲料卡车移动,占这些连接的8.7%。我们证明,在车辆移动和邻近网络中,育肥场作为超级传播者发挥了主要作用。当比较每个网络中按超级传播者得分排名前50的农场时,基于车辆的网络显示出最高的相似性,车辆网络之间共享高达89%的排名靠前的农场。相比之下,猪移动和基于距离的网络识别出大部分不同的排名靠前的农场集合,与其他接触网络分别最多共享4%和8%。总体而言,每个网络都表现出独特的连接结构,导致不同的高风险农场集合,特别是在向种猪场潜在传播方面。这些发现支持将多种传播途径整合到疾病监测中。

英文摘要

Understanding how swine farms are interconnected, directly and indirectly, is essential to characterizing infectious disease transmission. This study aimed to describe the connectivity of swine farms across 11 network types, including vehicle movements (i.e., trucks and trailers), animal movements, and distance-based farm-to-farm contacts, to identify links among production types and farms likely to be consistently characterized as super-spreaders. Truck and trailer movement networks were the most densely connected, particularly for feed transport, showing connectivity levels between 98.7% and 99.7% higher than those of pig movement and distance-based networks. These networks also exhibited the highest degree and frequency of connections between farms, while the aggregated truck network, which included all truck types, showed the greatest potential to act as a bridge connecting farms. Finisher farms were highly interconnected with other farm types across all networks. Sow farms were frequently reached by other farm types, especially through feed truck movements, representing up to 8.7% of these links. We demonstrated that in vehicle movements and proximity networks, finisher farms played a major role as super-spreaders. When comparing the top 50 farms ranked by super-spreader score in each network, vehicle-based networks showed the highest similarity, with up to 89% of top-ranked farms shared between vehicle networks. In contrast, pig movement and distance-based networks identified largely distinct sets of top-ranked farms, sharing at most 4% and 8%, respectively, with other contact networks. Overall, each network exhibited a distinct connectivity structure, resulting in different sets of high-risk farms, particularly regarding potential transmission to breeding farms. These findings support the integration of multiple transmission pathways into disease surveillance.

2606.18821 2026-06-18 physics.ins-det nucl-ex 新提交 80%

Track and energy reconstruction algorithms for a time projection chamber with orthogonal fields

正交场时间投影室的径迹和能量重建算法

Martin Vít Vavřík, Babar Ali, Hugo Natal da Luz, Olivier Rousselle, Tomáš Sýkora

专题命中 其他科学智能 :正交场TPC径迹重建算法,粒子物理

AI总结 针对正交场时间投影室(OFTPC),开发了基于模拟电离电子漂移图的径迹重建和Runge-Kutta拟合的能量重建算法,在理想条件下实现了电子和正电子相对能量分辨率优于1%。

Comments 25 pages, 17 figures

详情
AI中文摘要

在这项工作中,我们描述了用于非典型时间投影室(TPC)的径迹和能量重建算法的开发,这些TPC将用于布拉格捷克技术大学实验与应用物理研究所,以搜索ATOMKI小组报告的反常内部对产生。这些TPC在非均匀环形磁场中运行,磁场方向与电场正交;因此我们称它们为正交场TPC(OFTPC)。尽管这种配置扭曲了电离电子的漂移并使电子和正电子的轨迹复杂化,但它也提供了几个实际优势。我们提出了几种测试方法中最有效的一种,该方法使用模拟的电离电子漂移图进行径迹重建,并使用基于Runge-Kutta的拟合进行能量重建。通过模拟,我们证明——在理想条件下,即理想的电荷读出(无放大和噪声)且已知初始径迹位置和方向——在应用依赖于径迹参数的系统效应校正后,可以实现电子和正电子的相对能量拟合高斯宽度(sigma)优于1%。

英文摘要

In this work, we describe the development of track- and energy-reconstruc-tion algorithms for atypical Time Projection Chambers (TPCs) that will be used at the Institute of Experimental and Applied Physics, Czech Technical University in Prague, to search for the anomalous internal pair creation reported by the ATOMKI group. These chambers operate with an inhomogeneous toroidal magnetic field oriented orthogonally to the electric field; we therefore refer to them as Orthogonal-Field TPCs (OFTPCs). Although this configuration distorts the drift of ionization electrons and complicates the resulting electron and positron trajectories, it also offers several practical advantages. We present the most effective of several tested approaches, which employs a simulated ionization-electron drift map for track reconstruction and a Runge--Kutta-based fit for energy reconstruction. Using simulations, we demonstrate that -- under idealized conditions, namely an ideal charge readout with no amplification and no noise and with known initial track positions and directions -- it is possible to achieve a fitted Gaussian width (sigma) better than 1\% in relative energy for both electrons and positrons, after applying corrections for systematic effects that depend on the track parameters.

2604.04089 2026-06-18 physics.comp-ph cond-mat.str-el cs.AI cs.HC 版本更新 80%

From Paper to Program: Externalizing and Diagnosing Knowledge Bottlenecks in AI-Assisted Quantum Many-Body Code Generation

从论文到程序:AI辅助量子多体代码生成中的知识外化

Yi Zhou

专题命中 其他科学智能 :AI辅助量子多体代码生成,属于科学智能

AI总结 针对AI直接翻译论文为代码时因隐含约定导致失败的问题,提出知识外化方法,通过多阶段人机协作流程将隐式假设显式化,在DMRG和Pfaffian-MPS任务上验证了有效性。

Comments Core thesis upgraded

详情
AI中文摘要

大型语言模型可以编写科学代码,但当正确性依赖于文献中的默认约定时,直接的论文到程序翻译仍然脆弱。我们将这一瓶颈识别为\textbf{知识外化}:在实现之前将隐式计算假设——索引约定、规范选择、费米子符号、收缩顺序和内存约束——转换为明确的技术规范。我们评估了一个多阶段、人在回路的工作流程,该流程在理论提取和代码生成之间插入这样的规范,并带有验证和停止门。该工作流程在两个算法上不同的量子多体任务上进行了测试:基于变分扫描的密度矩阵重整化群(DMRG)来自教学综述,以及将Hartree-Fock-Bogoliubov态构造性地转换为矩阵乘积态的Pfaffian方法,来自Jin等人五页的信件,Phys. Rev. B 105, L081101 (2022),该代码未公开。对于DMRG,在$4\ imes4$网格中,所有16个规范引导的模型配对都满足物理验证标准,而直接尝试为6/13。散文规范消融实验表明,外化的内容(而非LaTeX格式)是基本要素。对于Pfaffian-MPS,该工作流程在26次存档尝试中成功11次,而直接提示产生零次审计通过。跨规范转移是不对称的:由GPT~5.5实现的非GPT规范通过4/4,而由较弱模型实现的GPT~5.5规范失败4/4,表明存在残留的实现模型瓶颈。由此产生的\textit{论文到程序多体}技能为AI辅助实现多体算法以及诊断外化成功或失败提供了可审计的协议。

英文摘要

Large language models can write scientific code, but direct paper-to-program translation remains fragile when correctness depends on tacit conventions rather than explicit equations. We frame this as a \textbf{knowledge-externalization} problem: index choices, gauges, fermionic signs, contraction order, validation gates, and scaling constraints must be made explicit before code generation. We evaluate a multi-stage, human-in-the-loop workflow on two quantum many-body tasks. DMRG from Schollwock's pedagogical review serves as calibration: specification-guided implementations pass in all 16 model pairings, compared with 6/13 direct attempts, and a prose-specification ablation shows that externalized content, not \LaTeX{} form, is the active ingredient. Pfaffian conversion of HFB states to MPS from the five-page Letter by Jin et al. serves as the stress test: no public implementation is available, and success depends on tacit sign, gauge, ordering, and scalability conventions. Here the workflow yields 11/26 audited passes, while direct prompting yields none. Cross-specification transfer is asymmetric: non-GPT specifications implemented by GPT~5.5 pass 4/4, whereas GPT~5.5 specifications implemented by weaker models fail 4/4. The contrast supports a two-bottleneck picture. Externalization resolves the first bottleneck -- paper-to-code ambiguity -- well enough to make DMRG reproducible and Pfaffian-MPS auditable. The remaining failures expose a second bottleneck in implementation-model capability. Iterative meta-specification moves this boundary but does not eliminate it. The resulting \emph{Paper-to-Program Many-Body} skill is both a reusable implementation protocol and a diagnostic instrument for AI-assisted many-body programming.

2601.15430 2026-06-18 math.DG math.AG math.RT 版本更新 80%

The Hirzebruch quadratic form of a hyperplane arrangement and flat logarithmic connections

超平面配置的Hirzebruch二次型与平坦对数联络

Martin de Borbon, Dmitri Panov

专题命中 其他科学智能 :研究超平面配置的Hirzebruch二次型与对数联络

AI总结 本文证明复超平面配置的Hirzebruch二次型在稳定权集上非正,并识别出零集为特殊类型的平坦对数联络,证明使用了Kempf-Ness和框架势不等式。

Comments 12 pages

详情
AI中文摘要

我们证明复超平面配置的Hirzebruch二次型在稳定权集上是非正的,并将该集合中的零轨迹识别为一种特殊类型的平坦对数联络。证明使用了Kempf-Ness和框架势不等式。

英文摘要

We prove that the Hirzebruch quadratic form of a complex hyperplane arrangement is non-positive on the set of stable weights, and we identify the zero locus within this set with flat logarithmic connections of a distinguished type. The proof uses Kempf--Ness and the frame-potential inequality.