arXivDaily: arXiv daily academic digest, updated Monday through Friday
2605.00455 2026-05-04 stat.ME math.ST stat.ML stat.TH

Concentration and Calibration in Predictive Bayesian Inference

David T. Frazier, Hui Wang

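AI summary: This paper examines posterior concentration and calibration in predictive Bayesian inference. It shows that the accuracy of the predictive model directly determines the reliability of the resulting inference, and that the predictive model must accurately capture the data-generating process for the inference to be calibrated.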

Abstract

Predictive Bayesian inference (PBI) represents a model- and prior-agnostic approach to standard Bayesian inference which allows users to quantify uncertainty for a functional of interest by specifying only a forward predictive model for future unobserved data. The flexibility and generality of this framework have led to a host of novel algorithms for implementing this approach, and many empirical applications, yet the reliability of the resulting inferences for the underlying statistical functional of interest remains unclear. Herein, we demonstrate that when using PBI for a population functional of interest, the resulting posterior concentrates onto a well-defined quantity that explicitly depends on the forward predictive model used to implement the predictive recursion underlying the method. Furthermore, the forward predictive model entirely determines the uncertainty quantification produced in PBI. Consequently, our results show that if the predictive model does not capture all relevant features of the data, then, even in very simple examples, the coverage of predictive Bayes credible sets for the population value of the functional of interest can be arbitrarily close to zero. We carefully explain why this occurs, and show that this behavior is directly tied to the inaccuracy of the forward predictive model used to produce future observations within the PBI framework. As a consequence, our results imply that in order for PBI to deliver calibrated posterior inferences, the predictive engine used to generate posterior samples must contain, in a well-defined sense, the true data-generating process (DGP); otherwise, inferences generated under this framework will not be calibrated.
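
The predictive recursion described in this abstract can be illustrated with a minimal sketch. It assumes the simplest forward predictive, a Polya urn that draws each future observation uniformly from everything seen so far, and takes the mean as the functional of interest. The function name, the toy data, and the urn predictive are illustrative assumptions, not the authors' implementation.

```python
import random
import statistics

def predictive_resample_mean(data, n_future=500, n_draws=200, seed=0):
    """Approximate posterior draws of the mean via predictive resampling.

    Each draw extends the observed sample with n_future imputed points,
    sampling every new point uniformly from all points seen so far
    (an assumed Polya-urn forward predictive), and then evaluates the
    functional (here, the mean) on the completed sample.
    """
    rng = random.Random(seed)
    draws = []
    for _ in range(n_draws):
        completed = list(data)
        for _ in range(n_future):
            completed.append(rng.choice(completed))
        draws.append(statistics.fmean(completed))
    return draws

# Hypothetical observations, used only for illustration.
observed = [1.2, 0.7, 2.9, 1.8, 0.4, 2.2, 1.1, 1.6]
posterior = predictive_resample_mean(observed)
print(statistics.fmean(posterior), statistics.stdev(posterior))
```

In this sketch both the center and the spread of `posterior` come entirely from the chosen forward predictive. Swapping the urn for a different predictive changes both, which is the dependence the abstract highlights.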

2605.00454 2026-05-04 hep-ph nucl-th

Nuclear structure and saturation effects from diffractive vector meson production

Heikki Mäntysaari, Hendrik Roch, Björn Schenke, Chun Shen, Wenbin Zhao

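AI summary: This paper studies vector meson production in ultra-peripheral collisions of a wide range of nuclei and assesses how well measurements can constrain the small-x structure of oxygen and neon nuclei. Using a model based on the color glass condensate framework with JIMWLK evolution, it predicts coherent and incoherent J/ψ production in O+O and Ne+Ne collisions and quantifies the theoretical uncertainties.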

Comments 15 pages, 10 figures

Abstract

We study exclusive vector meson production in ultra-peripheral collisions (UPCs) of a wide range of nuclei, and assess the potential of measurements to constrain the small-$x$ structure of oxygen and neon nuclei. We employ an impact-parameter-dependent color glass condensate framework incorporating JIMWLK evolution, with parameters constrained by a recent global Bayesian analysis of $γ+p$ and $γ+\mathrm{Pb}$ data. We present predictions for coherent and incoherent $\mathrm{J}/ψ$ production in $\mathrm{O}+\mathrm{O}$ and $\mathrm{Ne}+\mathrm{Ne}$ UPCs at LHC energies, and quantify theoretical uncertainties using posterior samples from the calibration. We employ several nuclear structure models and find that $t$-differential observables are sensitive to the chosen model. We further study the mass-number dependence of saturation effects through nuclear suppression factors for coherent and incoherent vector meson production. Saturation-induced suppression increases systematically with both nuclear mass number and energy. Our results provide a unified framework for the systematic study of the onset of gluon saturation and nuclear structure at high energy, accessible in future UPC measurements at the LHC and at the Electron-Ion Collider.

2605.00453 2026-05-04 cs.HC cs.CY

Urban to Rural Migration in Eastern Europe: Unpacking digital ruralities through TikTok video analysis

Anca-Simona Horvath, Cristian Tosa, Simai, Huang

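AI summary: By analysing 901 TikTok videos, this paper examines urban-to-rural migration in Romania, an Eastern European country, and describes the multiple forms the digital rural takes and its role in promoting tourism and everyday life.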

Abstract

Urban to rural migration is a less-researched phenomenon than its counterpart, rural to urban migration. In parts of Europe, an increasing number of people, whether living in big urban centers within the country or moving from other countries, decide to relocate to rural areas. In this paper, we examine this phenomenon by analysing content posted on TikTok that documents this transition. We collected a corpus of 901 videos posted until late 2025, documenting urban to rural migration in Romania, under three hashtags, which had collectively been played 24 million times when we gathered the dataset. We analyse this corpus both quantitatively and qualitatively and discuss our findings through the lens of digital rurality, a theory based on Harvey's and Soja's spatial triad, applied to rural spaces, and on the role of digital technologies as (re-)mediators of everyday lived experience. Specifically, we analyse the corpus as: (a) digital rural localities, (b) formal representations of the digital rural, and (c) everyday lives of the digital rural. We find that (a) social media platforms enable new forms of paid labor in rural areas that sometimes involve the commodification of the self, although many of the creators we analyse do not explicitly acknowledge this to their audiences; (b) the digital rural gains new forms of representation, and rural areas in remote Romania are highly data-rich across TikTok; and (c) the everyday lives represented through the digital rural are sometimes idealized or romanticised. However, they serve as promoters of tourism and as sites to document and discuss a variety of topics: giving ample health advice, typically by non-specialists and sometimes critical of Western medicine; expressing and promoting religious and political views; and acting as forms of general self-expression.

2605.00452 2026-05-04 math.CO

From Graph Laplacians to String Partition Functions: A Rigorous Pathway from Discrete Spectra to Emergent Geometry

Tishkov Vladislav

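AI summary: By constructing a map from graphs to Riemann surfaces, this paper shows that, in the continuum limit, graph spectral information converges to the classical stable curves associated with Riemannian manifolds. It also relates the spectral memory field to multi-cut matrix models and connects spacelike singularities in general relativity to the automorphism group of the spectral curve.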

Comments The article has 18 pages and 1 figure

Abstract

This work establishes rigorous mathematical foundations connecting spectral graph theory, algebraic geometry, and string theory. We construct a canonical mapping whereby any finite graph \(G\) defines a compact Riemann surface \(X_{G}\) (the spectral curve) whose period matrix \(Ω_{G}\) encodes the graph's coarse-grained spectral information. We demonstrate that in the continuum limit of graph sequences converging to Riemannian manifolds, these spectral curves converge in the Deligne-Mumford compactification sense to the classical stable curves associated with the manifold. We establish connections to the topological recursion framework of Eynard-Orantin, showing that under appropriate conditions the spectral curve satisfies the loop equations of multi-cut matrix models. The spectral memory field \(Φ_{G}(u)\) is introduced and shown to provide a discrete regularization of minimal string partition functions. We construct quantum scattering operators on spectral curves and prove that their unitarity is equivalent to a positivity condition on the spectral memory field. Furthermore, we apply this framework to resolve spacelike singularities in general relativity, proving that the Belinski-Khalatnikov-Lifshitz (BKL) chaotic regime is isospectral to a critical random graph ensemble. The classical singularity is replaced by an infinite nodal chain of rational curves, and the Bekenstein-Hawking entropy emerges from the automorphism group of the spectral curve. This work provides rigorous mathematical underpinnings for discrete approaches to quantum gravity and establishes new connections between graph theory, algebraic geometry, and theoretical physics.

2605.00451 2026-05-04 physics.app-ph

Fundamental Efficiency Limits of Transition-Metal Dichalcogenide Solar Cells with Carrier Multiplication and Hot-Carrier Effects

Seungwoo Lee

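AI summary: This paper develops a generalized detailed-balance theory for the efficiency limits of transition-metal dichalcogenide solar cells with carrier multiplication and hot-carrier effects, and discusses the implications for performance optimization.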

Abstract

Detailed-balance limits for transition-metal dichalcogenide (TMD) solar cells have been reported, but existing TMD-specific limits do not simultaneously resolve thickness-dependent optics, carrier multiplication (CM), hot-carrier (HC) extraction, and finite cooling leakage. Here, we develop a generalized detailed-balance theory that provides an upper-bound framework. The model combines energy- and thickness-dependent absorptance a(E,d), exciton-resolved monolayer absorbance, an experimentally available CM quantum-yield limit (eta_CM <= 0.97), and an endoreversible HC engine with ideal energy-selective contacts and finite heat-leak coefficient kappa. The framework shows that CM and HC draw on the same above-gap photon-energy reservoir; therefore, CM does not raise the reversible HC thermodynamic limit. Instead, CM can protect finite-kappa performance only by shifting excess-energy utilization from a cooling-sensitive voltage channel into collected current. For optically thick TMDs under AM1.5G illumination, the SQ optimum lies near E_g = 1.3 eV, whereas the CM/HC-favored envelope shifts toward E_g = 1.0 eV with reversible efficiencies above 50%. For monolayer TMDs such as WSe2 (E_g = 1.63 eV), CM is essentially inactive because only about 3.7% of above-gap AM1.5G photons satisfy E > 2E_g, giving an idealized short-circuit-current gain of only about 0.6% before device nonidealities. Bulk-like TMDs can show large HC-related gains at d = 10-50 nm, but even kappa = 0.2 W m^-2 K^-1 implies about 100 W m^-2 heat leak for Delta T = 500 K. Thus, high-E_g monolayer TMDs are not promising one-sun CM candidates, whereas narrow-E_g, bulk-like TMD absorbers remain plausible beyond-SQ candidates only if energy-selective extraction and phonon-engineered cooling suppression are realized together.
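
The heat-leak figure quoted in this abstract can be checked with one line of arithmetic. The sketch below assumes a linear leak model, which is consistent with the quoted numbers but is not stated explicitly in the abstract. The comparison with the standard 1000 W m^-2 AM1.5G irradiance is added here only for scale.

```latex
% Assumed linear heat-leak model: areal heat flow proportional to the
% temperature difference \Delta T across the hot-carrier engine.
\begin{align}
\dot{Q}_{\mathrm{leak}} &= \kappa \, \Delta T
  = 0.2~\mathrm{W\,m^{-2}\,K^{-1}} \times 500~\mathrm{K}
  = 100~\mathrm{W\,m^{-2}}, \\
\frac{\dot{Q}_{\mathrm{leak}}}{P_{\mathrm{AM1.5G}}} &\approx
  \frac{100~\mathrm{W\,m^{-2}}}{1000~\mathrm{W\,m^{-2}}} = 10\%.
\end{align}
```

Under these assumptions, even a small leak coefficient costs roughly a tenth of the one-sun input power, which fits the abstract's point that hot-carrier gains need cooling suppression.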

2605.00450 2026-05-04 astro-ph.CO

Measuring $fσ_8$ and BAO scale in the Local Universe: a joint real and redshift space analysis from CosmicFlows-4++

C. Franco, H. M. Courtois, A. Bernui

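AI summary: Using 65,331 galaxy distances from the CosmicFlows-4++ database, this paper presents a joint analysis of BAO and $fσ_8$ in the Local Universe, measuring both from real-space and redshift-space data to constrain the BAO scale and the growth rate of structure.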

Comments 11 pages, 10 figures, submitted to A&A

Abstract

The large-scale clustering of galaxies encodes both geometric and dynamical information about the Universe. The Baryon Acoustic Oscillations (BAO) phenomenon provides a standard ruler that constrains the cosmic expansion history, while Redshift Space Distortions (RSD) probe the growth of structure through the peculiar velocity field. In this work, we present a joint analysis of BAO and the growth rate parameter, $fσ_{8}$, in the Local Universe out to $z = 0.1$, using the $65,331$ galaxy distances of the CosmicFlows-4++ database. A distinctive property of this catalogue is the availability of real-space galaxy positions in addition to the redshift-space coordinates. Fitting an empirical model to the measurements, we obtain $r_{\rm{BAO}}^{\rm{real}} = 132\pm 8\,h^{-1}\,{\rm Mpc}$ in real space and $r_{\rm{BAO}}^{z} = 139 \pm 7\,h^{-1}\,{\rm Mpc}$ in redshift space, at redshift $z = 0.07$. Modeling the enhancement of the correlation function within the Kaiser formalism, we derive a constraint on the growth rate parameter, $fσ_8 = 0.344 \pm 0.105$. This analysis demonstrates how the combination of real-space and redshift-space clustering measurements enables a simultaneous probe of important observables of the large-scale structure. Their joint detection in the same dataset therefore provides a self-consistent view of the structure and evolution of the Local Universe. This study may be used for consistency analyses of upcoming surveys, such as DESI and 4MOST, that will also provide data in both real and redshift space.
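
For readers unfamiliar with the Kaiser formalism mentioned above, the standard linear-theory relations are sketched below. The symbols $\beta$, $b$, $P_r$, $P_s$, $\xi_r$ and $\xi_s$ are notation introduced here. The abstract does not give the exact estimator used in the paper, so this shows only the textbook relation that links a measured enhancement to $fσ_8$.

```latex
% Linear-theory (Kaiser) relation between redshift-space and real-space
% clustering; \beta = f / b, with f the growth rate and b the linear bias.
\begin{align}
P_s(k,\mu) &= \left(1 + \beta\mu^2\right)^2 P_r(k), \\
\frac{\xi_s(r)}{\xi_r(r)} &= 1 + \frac{2}{3}\beta + \frac{1}{5}\beta^2
  \quad \text{(angle-averaged monopole)}, \\
f\sigma_8 &= \beta \,\left(b\,\sigma_8\right).
\end{align}
```

Having both real-space and redshift-space correlation functions from one catalogue allows the ratio in the second line to be formed directly, which is presumably why the availability of real-space positions matters here.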

2605.00447 2026-05-04 cs.SE

Think Harder and Don't Overlook Your Options: Revisiting Issue-Commit Linking with LLM-Assisted Retrieval

Cole Morgan, Muhammad Asaduzzaman, Shaiful Chowdhurry, Shaowei Wang

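AI summary: This paper re-evaluates several issue-commit link recovery techniques. Comparing retrieval methods and machine learning models, it finds that dense retrieval outperforms sparse retrieval and that traditional machine learning rerankers outperform large language models.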

Abstract

Linking issue reports to the commits that resolve them is essential for software traceability, maintenance, and evolution. Accurate issue-commit links help developers to understand system changes and the rationale behind them. While numerous automated techniques have been proposed, ranging from heuristic and feature-based approaches to modern deep learning and large language model approaches, our goal is to evaluate these techniques to determine which are most effective and efficient. In this study, we revisit several established issue-commit link recovery techniques, including BTLink, EasyLink, FRLink, RCLinker, and Hybrid-Linker, and assess their performance for reranking issue-commit links. We first evaluate different retrieval methods (BM25, BM25L, SBERT-Semantic Search, ANNOY, LSH, HNSW) for their ability to efficiently retrieve relevant commits, reducing the candidate set that must be considered by more computationally expensive models. Using the best retrieval methods, we then investigate the reranking effectiveness of different machine learning-based techniques, including traditional machine learning models, a cross-encoder, and large language models (ChatGPT, Qwen, Gemma, Llama), to refine the reranking of candidate commits and improve precision. Finally, we compare the effectiveness of these techniques. Our results show that dense retrieval methods outperform sparse retrieval approaches in identifying relevant commits and that combining dense and sparse retrieval can improve recall. Additionally, we find that traditional machine learning-based reranking techniques achieve higher performance than LLM-based approaches. Our results highlight that retrieval-based pipelines remain a practical and effective solution for large-scale issue-commit linking, and that simpler models should be carefully considered before adopting computationally expensive LLM-based approaches.
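
The first stage of the retrieve-then-rerank pipeline described above can be sketched as a sparse BM25 retriever that narrows the commit set before a more expensive reranker runs. This is a minimal sketch assuming whitespace tokenization, the common Okapi BM25 scoring with a smoothed IDF, and made-up commit messages. It is not the paper's implementation, and the reranking stage is not shown.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each tokenized document against the tokenized query (Okapi BM25)."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            df = doc_freq[term]
            idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

def retrieve_top_k(issue_text, commit_messages, k=2):
    """Return indices of the k commits with the highest BM25 score."""
    query = issue_text.lower().split()
    docs = [m.lower().split() for m in commit_messages]
    scores = bm25_scores(query, docs)
    ranked = sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)
    return ranked[:k]

# Hypothetical commit messages, used only for illustration.
commits = [
    "fix null pointer in login handler",
    "update readme badges",
    "refactor login session cache",
    "bump dependency versions",
]
candidates = retrieve_top_k("login handler crashes with null pointer", commits, k=2)
print(candidates)
```

Only the returned candidates would then be passed to a reranker, which is how the pipeline keeps the expensive model off most of the commit history.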

2605.00446 2026-05-04 quant-ph

From quantum storage to amplification: the effect of unwanted couplings and an additional level in cavity-based ensemble quantum memories

Jia-Wei Ji, Christoph Simon

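AI summary: This paper studies an extended model of a cavity-based Λ-type ensemble quantum memory that includes unwanted couplings from the control and signal fields. It derives expressions for the single-photon storage efficiency, retrieval efficiency, and fidelity, and identifies three dynamical regimes: stable, threshold, and unstable.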

Comments 12 pages and 8 figures

Abstract

Quantum-memory models often reduce complex level structures to an idealized $Λ$ system, potentially missing nearby levels and unwanted couplings that can qualitatively alter the predicted performance. Here, we study an extension of a cavity-based $Λ$-type ensemble memory, a four-level model with unwanted couplings from both the control field and signal, using a fully quantum treatment. We derive explicit expressions for the single-photon storage efficiency, retrieval efficiency, and fidelity, and on this basis identify three distinct dynamical regimes: stable, threshold, and unstable. Within the stable regime, we additionally discriminate between two qualitatively different sub-regimes. Applying the theory to warm-vapor-inspired parameters, we determine the conditions under which the system can still operate as a high-quality quantum memory. More generally, our results provide a practical framework for distinguishing genuine memory operation from amplification and for optimizing realistic quantum memories beyond idealized models.

2605.00442 2026-05-04 math.DS nlin.CD

Dynamical analysis of r-Chialvo neuron map with cosine memristive

Ajay Kumar, V. V. M. S. Chandramouli

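AI summary: This paper builds a new two-dimensional discrete neuron map by introducing a cosine-based memristor and studies its dynamics under electromagnetic modulation. The map shows diverse firing patterns, stability behaviors, and chaotic attractors, along with a variety of bifurcation structures and coexisting multistable attractors.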

Abstract

In this work, we construct a novel two-dimensional discrete neuron map by incorporating a cosine-based memristor into the reduced Chialvo neuron map in order to analyze the dynamical effects of electromagnetic modulation. The nonlinear current-voltage characteristics of the memristor enrich the neuron map's behavior, leading to diverse firing regimes, stability behaviors, and chaotic attractors. We first establish the equilibrium points using both analytical and numerical methods. We then determine the conditions on the parameters under which the proposed map exhibits a Neimark-Sacker bifurcation. Further, the numerical study reveals an antimonotonicity structure through the forward and backward bifurcation diagrams. The model exhibits a wide range of codimension-one and codimension-two bifurcation patterns, including Neimark-Sacker, period-doubling, saddle-node, generalized period-doubling, cusp-point, fold-flip, and various resonance structures (1:1, 1:2, 1:3, and 1:4). We also observe the coexistence of multistable attractors, including a stable limit cycle, a period-five attractor, and a chaotic attractor, along with their respective basins of attraction. Furthermore, we extend this analysis to a network of neurons under the ring-star configuration and discuss several spatiotemporal patterns. This network investigation reveals complex collective patterns, including imperfect synchronization, clustered patterns, and multi-chimera states, which have not been previously observed in existing Chialvo-based studies. These results highlight the potential of the discrete memristor-based neuron map for advancing theoretical neurodynamics and offer a robust framework for investigating low-dimensional yet dynamically rich neuron systems.

2605.00441 2026-05-04 cond-mat.mes-hall

Thermodynamic Charge Partition in Accumulation-Layer Heterostructures

Elmar Böckenhoff

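AI summary: This paper gives a thermodynamic description of accumulation-layer heterostructures, analyses how the partition of the charge density affects energetic path selection and geometric capacitance, and uses a self-consistent approach to relate the charge distribution to the magnetic field.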

Comments 26 pages, 14 figures

Abstract

We develop a thermodynamic description of accumulation-layer heterostructures in which the induced sheet density is partitioned between the near-interface accumulation-layer charge and a complementary screening charge in the surrounding structure. Treating this partition as the central state variable yields a complete Helmholtz free energy, a corrected locked-branch chemical potential, and a shifted release potential that separates energetic path selection from geometric capacitance. The physical path is selected spectrally: compressible segments remain fully screened, whereas incompressible segments evolve along a locked branch until release is triggered by the relevant gap. Differential capacitance, tunnel current and plateau width then emerge as different projections of the same coupled thermodynamic structure. A canonical two-stage self-consistent Poisson-Schrödinger reduction supplies universal master functions for the isolated accumulation layer and master surfaces for its finite-buffer extension, making the theory calculable across density and geometry. Comparison with magnetocapacitance and magnetotunneling data supports a picture in which nearby extended charge refills the accumulation layer and the effective screening depth grows with magnetic field.

2605.00437 2026-05-04 astro-ph.HE hep-ph nucl-th

Maximal mass of neutron stars constrained by neutron star observations

Gábor Kasza, György Wolf

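AI summary: Using a Bayesian weighting framework, this paper analyses the probability distributions of the endpoints of neutron-star mass-radius sequences to constrain the high-density equation of state, and determines the distributions of the maximum neutron-star mass and the corresponding radius.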

Comments Accepted for publication in Journal of Subatomic Particles and Cosmology

Abstract

We investigate constraints on the high-density equation of state (EOS) of neutron star matter by analyzing the probability distributions of the endpoints of mass-radius M(R) sequences within a Bayesian weighting framework. Starting from two representative hadronic baseline EOSs, SFHo and DD2, matched at higher densities to an extended linear sigma model description and constrained to approach perturbative QCD (pQCD) results, we construct families of causal hybrid EOSs spanning a broad range of stiffness at supranuclear densities. Observational constraints from the binary neutron-star merger GW170817, mass-radius measurements from the Neutron Star Interior Composition Explorer (NICER), and candidate low-mass and mass-gap compact objects are incorporated through Bayesian likelihood weighting. This approach allows us to determine probability distributions for the maximum neutron-star mass M$_{\rm TOV}$ and the corresponding radius R$_{\rm TOV}$, i.e., the endpoints of the M(R) sequences. We find that the maximum-mass distributions are largely determined by observational constraints and show only weak sensitivity to the choice of baseline EOS, favoring values around 2.2-2.3 M$_\odot$ when the most robust constraints are applied. In contrast, the corresponding radius distributions exhibit a stronger dependence on the underlying hadronic EOS, with typical preferred values near $12\pm 1$ km. Additional tidal-deformability constraints further restrict the allowed parameter space and disfavor very stiff EOS realizations when interpreted together with the possible mass-gap neutron-star candidate. Our results demonstrate that endpoint distributions of M(R) sequences provide a sensitive and complementary diagnostic for constraining the high-density behavior of the neutron-star EOS within a multimessenger Bayesian framework.

2605.00430 2026-05-04 hep-ph hep-ex hep-lat nucl-ex nucl-th

Twist-2 relations for the twist-3 tensor-polarized distribution function $f_{LT}$ of a spin-1 hadron by the operator-product-expansion method

S. Kumano, Kenshi Kuroki

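AI summary: Using the local operator product expansion, this paper derives a twist-2 relation and a BC-like sum rule for the tensor-polarized distribution functions relevant to electron-deuteron deep inelastic scattering.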

Comments 5 pages

Abstract

In a spin-1 hadron, tensor-polarized parton distribution functions (PDFs) exist. The twist-2 function is $f_{1LL}$ and a twist-3 function is $f_{LT}$. Because an experiment is under preparation at the Thomas Jefferson National Accelerator Facility (JLab) to measure the cross section of electron-deuteron deep inelastic scattering with a tensor-polarized deuteron target, these PDFs need to be understood theoretically. In particular, measurements will be done in a relatively low-$Q^2$ region at JLab, so twist-3 contributions could become sizable in the cross section. In a previous work, a twist-2 relation was derived for $f_{LT}$ in terms of $f_{1LL}$ by using a nonlocal operator, and it corresponds to the Wandzura-Wilczek (WW) relation between $g_1$ and $g_2$. In addition, another relation similar to the Burkhardt-Cottingham (BC) sum rule was obtained. It is known that a formal way to derive the WW relation and the BC sum rule is to use the operator product expansion (OPE) with local operators. In this work, the WW-like relation and the BC-like sum rule for $f_{LT}$ are derived by using the local OPE method as a reliable independent way to establish these relations.
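
For reference, the two spin-1/2 results that the abstract uses as templates are the standard Wandzura-Wilczek relation and Burkhardt-Cottingham sum rule for the structure functions $g_1$ and $g_2$, written below. The abstract does not state the corresponding $f_{LT}$ relations explicitly, so they are not reproduced here.

```latex
% Standard WW relation (twist-2 part of g_2) and BC sum rule.
\begin{align}
g_2^{\mathrm{WW}}(x) &= -g_1(x) + \int_x^1 \frac{dy}{y}\, g_1(y), \\
\int_0^1 dx\, g_2(x) &= 0 .
\end{align}
```

The WW-like relation in the paper plays the same role for the tensor-polarized case: it fixes the twist-2 part of $f_{LT}$ in terms of $f_{1LL}$.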

2605.00429 2026-05-04 cs.GR

P2M++: Enhanced Solver for Point-to-Mesh Distance Queries

Qinghao Guo, Pengfei Wang, Chen Zong, Maodong Pan, Shiqing Xin, Shuangmin Chen, Changhe Tu, Wenping Wang

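AI summary: This paper presents P2M++, a solver that speeds up both preprocessing and queries for point-to-mesh distance computation through adaptive augmentation of the vertex set, sphere-triangle collision tests, and a recursive dynamic-programming search, with the largest gains on rotationally symmetric structures.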

Abstract

Point-to-mesh distance queries are fundamental in computer graphics and geometric modeling. While the state-of-the-art P2M method achieves high-speed queries via Voronoi-based localization, it suffers from prohibitive precomputation costs. Its iterative Voronoi sweep for interference detection leads to redundant predicate evaluations and scales poorly on rotationally symmetric structures (e.g., spheres, cones or cylinders), where candidate counts grow quadratically. We propose P2M++ to address these limitations through three key contributions. First, we adaptively augment the set of mesh vertices with auxiliary sites in regions of high Voronoi vertex density to localize complex interference within minimal spatial regions. Second, we reformulate interference detection as a series of sphere-triangle collision tests centered at Voronoi cell corners, which are efficiently resolved using the base mesh's BVH. Finally, we enhance runtime performance by replacing the standard kd-tree search with a faster recursive dynamic programming implementation. Experimental results demonstrate that P2M++ is 3x-10x faster than the original P2M during preprocessing and 1.5x faster in queries, with even more pronounced gains on rotationally symmetric geometries.
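
The sphere-triangle collision test named in the second contribution can be sketched as a single primitive: find the point of the triangle closest to the sphere center and compare its distance with the radius. The code below uses the well-known region-based closest-point routine from the collision-detection literature. All function names are hypothetical, and the BVH traversal that P2M++ reportedly uses to prune triangles is not shown.

```python
def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _along(origin, direction, t):
    """Return origin + t * direction."""
    return (origin[0] + t * direction[0],
            origin[1] + t * direction[1],
            origin[2] + t * direction[2])

def closest_point_on_triangle(p, a, b, c):
    """Closest point to p on triangle abc, by classifying p's Voronoi region."""
    ab, ac, ap = _sub(b, a), _sub(c, a), _sub(p, a)
    d1, d2 = _dot(ab, ap), _dot(ac, ap)
    if d1 <= 0 and d2 <= 0:
        return a                                    # vertex region of a
    bp = _sub(p, b)
    d3, d4 = _dot(ab, bp), _dot(ac, bp)
    if d3 >= 0 and d4 <= d3:
        return b                                    # vertex region of b
    vc = d1 * d4 - d3 * d2
    if vc <= 0 and d1 >= 0 and d3 <= 0:
        return _along(a, ab, d1 / (d1 - d3))        # edge region of ab
    cp = _sub(p, c)
    d5, d6 = _dot(ab, cp), _dot(ac, cp)
    if d6 >= 0 and d5 <= d6:
        return c                                    # vertex region of c
    vb = d5 * d2 - d1 * d6
    if vb <= 0 and d2 >= 0 and d6 <= 0:
        return _along(a, ac, d2 / (d2 - d6))        # edge region of ac
    va = d3 * d6 - d5 * d4
    if va <= 0 and (d4 - d3) >= 0 and (d5 - d6) >= 0:
        w = (d4 - d3) / ((d4 - d3) + (d5 - d6))
        return _along(b, _sub(c, b), w)             # edge region of bc
    denom = va + vb + vc                            # interior: barycentric
    return _along(_along(a, ab, vb / denom), ac, vc / denom)

def sphere_hits_triangle(center, radius, a, b, c):
    """True if the closed ball (center, radius) touches triangle abc."""
    q = closest_point_on_triangle(center, a, b, c)
    d = _sub(center, q)
    return _dot(d, d) <= radius * radius

tri = ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
print(sphere_hits_triangle((0.25, 0.25, 1.0), 1.5, *tri))
```

Comparing squared distances avoids a square root per test, which matters when the primitive is evaluated for many triangle candidates.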

2605.00428 2026-05-04 stat.ME cs.PF cs.SY eess.SY

How to Do Statistical Evaluations in ECE/CS Papers: A Practical Playbook for Defensible Results

如何在ECE/CS论文中进行统计评估:可辩护结果的实用指南

Bhaskar Krishnamachari

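AI summary: This paper offers a practical workflow to help researchers in systems, networking, and applied machine learning run reliable experimental evaluations, covering the full path from hypothesis testing to reproducibility checks.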

Comments 30 pages, 8 figures; Tutorial paper; companion student workbook and claude skill available as ancillary material

Abstract

Strong experimental papers in electrical and computer engineering and computer science (ECE/CS), especially in systems, networking, and applied machine learning, rest on more than a single impressive number. They rest on a chain of design, measurement, analysis, and validation choices that, taken together, make a result believable. This tutorial is a compact, example-driven guide to that chain for beginning researchers. We organize it as an evaluation workflow: claim, hypothesis, unit of analysis, baseline, regime sweep, uncertainty estimate, validation check, and reporting. Within that workflow we cover the classical statistical foundations (descriptive statistics, the central limit theorem, normal- and $t$-based confidence intervals, Student's $t$-test, ANOVA, chi-squared and Pearson correlation, linear regression) alongside the modern, distribution-free techniques (the bootstrap, Wilcoxon and Mann-Whitney tests, Cliff's delta) that are usually preferred for ECE/CS data. We also discuss factorial design, randomization and blocking, multiple-comparison correction, latency-specific pitfalls, simulation verification and validation, equivalence-style claims, and reproducibility. A running example, a comparison of two job-scheduling algorithms on simulated workloads with truncated heavy-tailed job sizes, threads through the tutorial, with Python snippets the reader can paste and adapt. The paper closes with a pre-submission checklist; companion student-facing material (project-type translation tables, an evaluation-plan worksheet, exercises, and a worked "bad evaluation autopsy") is collected in a separate workbook released alongside this paper.
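
Two of the distribution-free techniques listed above, the bootstrap and Cliff's delta, can be sketched in a few lines of standard-library Python. The example mirrors the tutorial's scheduler comparison only loosely: the completion times are made up, the function names are hypothetical, and the percentile bootstrap is one of several bootstrap variants, chosen here for brevity.

```python
import random
import statistics

def cliffs_delta(xs, ys):
    """Cliff's delta: P(x > y) - P(x < y) over all cross pairs."""
    greater = sum(1 for x in xs for y in ys if x > y)
    less = sum(1 for x in xs for y in ys if x < y)
    return (greater - less) / (len(xs) * len(ys))

def bootstrap_ci_median_diff(xs, ys, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for median(xs) - median(ys)."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_boot):
        # Resample each group independently, with replacement.
        rx = rng.choices(xs, k=len(xs))
        ry = rng.choices(ys, k=len(ys))
        diffs.append(statistics.median(rx) - statistics.median(ry))
    diffs.sort()
    lo = diffs[int((alpha / 2) * n_boot)]
    hi = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return lo, hi

# Hypothetical job completion times (seconds) under two schedulers.
sched_a = [12.1, 13.4, 11.8, 30.2, 12.9, 14.0, 12.5, 45.7]
sched_b = [9.8, 10.4, 9.1, 22.3, 10.9, 11.2, 9.9, 10.1]

print("Cliff's delta:", cliffs_delta(sched_a, sched_b))
print("95% CI for median difference:", bootstrap_ci_median_diff(sched_a, sched_b))
```

Medians and rank-based effect sizes are used here because heavy-tailed job sizes, like the outliers in the toy data, can dominate a mean-based comparison.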

2605.00426 2026-05-04 cs.CE

A Study on the Resource Utilization and User Behavior on Titan Supercomputer

A study of resource utilization and user behavior on the Titan supercomputer

Sergio Iserte

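AI summary This paper studies resource utilization and user behavior on the Titan supercomputer. By analyzing logs, GPU traces, and scientific-area information, it reveals workload distributions and usage patterns and builds a predictive model.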

Journal ref Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation(1512), pp. 1-8, Mar. 2022

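Details
AI Chinese abstract

Understanding the behavior of HPC facility users and how they request and use computational resources is crucial for cluster productivity and for designing and building future exascale HPC systems. This paper addresses Challenge 4 of the 2021 Smoky Mountains Conference Data Challenge, 'Analyzing Resource Utilization and User Behavior on Titan Supercomputer', analyzing Titan's records in depth to discover patterns and extract relationships. It explores workload distribution and usage patterns through resource manager system logs, GPU traces, and scientific-area information collected from the Titan supercomputer. We also study how resource utilization and user behavior change over time. Using data science methods such as correlations, clustering, or neural networks, our findings let us investigate the relationships among projects, jobs, nodes, GPUs, and memory. We provide insights into the seasonal usage of resources and a predictive model for Titan's utilization. In addition, the described methodology can easily be applied to other HPC clusters.

English abstract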

Understanding how users of HPC facilities behave and how computational resources are requested and utilized is not only crucial for cluster productivity but also essential for designing and constructing future exascale HPC systems. This paper tackles Challenge 4, 'Analyzing Resource Utilization and User Behavior on Titan Supercomputer', of the 2021 Smoky Mountains Conference Data Challenge. Specifically, we dig deeper into the records of Titan to discover patterns and extract relationships. This paper explores the workload distribution and usage patterns from resource manager system logs, GPU traces, and scientific-area information collected from the Titan supercomputer. Furthermore, we examine how resource utilization and user behaviors change over time. Using data science methods, such as correlations, clustering, or neural networks, our findings allow us to investigate how projects, jobs, nodes, GPUs, and memory are related. We provide insights into the seasonal usage of resources and a predictive model for forecasting utilization of the Titan supercomputer. In addition, the described methodology can be easily adopted in other HPC clusters.

2605.00418 2026-05-04 math.AC math.AG

Trace ideals of exterior powers of the module of differentials

Trace ideals of exterior powers of the module of differentials

Ryo Ishizuka, Sora Miyashita

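AI summary Studies the trace ideals of exterior powers of the module of differentials, reveals their relation to the polynomial rank and the formal power series rank, and defines the top differential trace to characterize the singular locus of local or graded rings.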

Comments 23 pages, comments are welcome

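Details
AI Chinese abstract

For each i ≥ 0, we study the trace ideal of the i-th exterior power of the module of differentials. We prove that these ideals characterize the polynomial rank of graded rings and the formal power series rank of complete local rings, that is, the maximal number of variables of a polynomial or formal power series extension. For the top exterior power, we introduce the top differential trace and prove that it precisely defines the singular locus of local or graded rings. Motivated by this, we introduce and study nearly regular rings, namely rings whose top differential trace contains the maximal ideal.

English abstract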

For each $i \geq 0$, we study the trace ideal of the $i$-th exterior power of the module of differentials. We show that these ideals characterize the polynomial rank of graded rings and the formal power series rank of complete local rings, namely the maximal number of variables for a polynomial or formal power series extension over a subring. For the top exterior power, we introduce the top differential trace and prove that it precisely defines the singular locus of reduced equidimensional local or graded rings. Motivated by this, we introduce and investigate nearly regular rings, which are Noetherian rings whose top differential trace contains the maximal ideal.

2605.00417 2026-05-04 cs.DB cs.LO

Multiset semantics in SPARQL, Relational Algebra and Datalog

An algebraic and logical analysis of multiset semantics in SPARQL

Renzo Angles, Claudio Gutierrez, Daniel Hernández

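AI summary This paper analyzes the algebraic and logical structure of the multiset semantics of AND, UNION, FILTER, EXCEPT, and SELECT in SPARQL patterns and, by aligning SPARQL with Datalog and relational algebra, proves that the three formalisms are equivalent under multiset semantics.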

Comments 59 pages. Author's preprint; published in Semantic Web (SAGE), 2026, doi:10.1177/22104968261439426

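Details
AI Chinese abstract

This paper analyzes and characterizes the algebraic and logical structure of the multiset semantics of SPARQL patterns involving AND, UNION, FILTER, EXCEPT, and SELECT. To this end, we align SPARQL with two established query languages: Datalog and relational algebra. Specifically, we study (i) a version of non-recursive Datalog with safe negation, extended to support multisets, and (ii) a multiset relational algebra comprising projection, selection, natural join, arithmetic union, and except. We prove that these three formalisms are expressively equivalent under multiset semantics.

English abstract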

The paper analyzes and characterizes the algebraic and logical structure of the multiset semantics for SPARQL patterns involving AND, UNION, FILTER, EXCEPT, and SELECT. To do this, we align SPARQL with two well-established query languages: Datalog and Relational Algebra. Specifically, we study (i) a version of non-recursive Datalog with safe negation extended to support multisets, and (ii) a multiset relational algebra comprising projection, selection, natural join, arithmetic union, and except. We prove that these three formalisms are expressively equivalent under multiset semantics.
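
A few of the multiset operators listed above can be sketched with Python's `collections.Counter`, which maps each tuple to its multiplicity. This is an illustrative sketch only, not the paper's formalization: the function names and sample relations are hypothetical, and reading "except" as bag difference (multiplicities subtract, floored at zero) is an assumption that may differ from the paper's definition.

```python
from collections import Counter

# A multiset relation over the schema (x, y): tuple -> multiplicity.
r = Counter({("a", 1): 2, ("b", 2): 1})
s = Counter({("a", 1): 1, ("c", 3): 1})


def arithmetic_union(left, right):
    """Arithmetic union: multiplicities of equal tuples are added."""
    return left + right


def bag_except(left, right):
    """Assumed bag difference: max(0, m_left - m_right) per tuple."""
    return left - right


def project(rel, indices):
    """Projection keeps duplicates: multiplicities of merged tuples add up."""
    out = Counter()
    for row, mult in rel.items():
        out[tuple(row[i] for i in indices)] += mult
    return out


print(arithmetic_union(r, s))
print(bag_except(r, s))
print(project(Counter({("a", 1): 2, ("a", 2): 1}), [0]))
```

Under set semantics the projection in the last line would return a single tuple; under multiset semantics it carries multiplicity 3, which is the kind of difference the equivalence result has to account for.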

2605.00415 2026-05-04 cond-mat.soft physics.chem-ph

Machine learning evaluation of structural descriptors for supercooled water

Machine learning evaluation of structural descriptors for supercooled water

Kohei Yoshikawa, Kokoro Shikata, Kang Kim, Nobuyuki Matubayasi

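AI summary This paper evaluates 16 structural descriptors with a neural-network temperature classification framework, reveals their ability to distinguish temperature-dependent structural changes in supercooled water, and uses an explainable AI method to identify the structural features that drive the model predictions.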

Comments 13 pages, 5 figures, 1 table for main text, 10 figures for supplementary information

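Details
AI Chinese abstract

The anomalous behavior of liquid water is widely associated with a liquid-liquid phase transition between high-density and low-density states in the supercooled regime. At the microscopic level, the tetrahedral hydrogen-bond network governs these properties, which has motivated structural descriptors that characterize local molecular environments. These descriptors quantify features such as tetrahedral order, local density, and the distance between the first and second coordination shells; however, most were proposed independently and lack systematic comparison. Here we evaluate 16 previously proposed descriptors with a neural-network-based temperature classification framework, which allows an objective assessment of their ability to distinguish temperature-dependent structural changes in supercooled water. We further apply an explainable artificial intelligence method to identify the structural features responsible for the model predictions. The approach reveals how different descriptors encode local structural information and establishes a data-driven framework for evaluating structural descriptors of liquid water.

English abstract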

The anomalous behavior of liquid water is widely associated with a liquid-liquid phase transition between high- and low-density states in the supercooled regime. At the microscopic level, tetrahedral hydrogen-bond networks govern these properties, motivating structural descriptors that characterize local molecular environments. These structural descriptors quantify features such as tetrahedral order, local density, and the separation between the first and second coordination shells; however, they have largely been proposed independently, with limited systematic comparison. Here we evaluate 16 previously proposed descriptors using a neural-network-based temperature classification framework, enabling an objective assessment of their ability to distinguish temperature-dependent structural changes in supercooled water. We further apply an explainable artificial intelligence method that identifies the structural features responsible for the model predictions. This approach reveals how different descriptors encode local structural information and establishes a data-driven framework for benchmarking structural descriptors in liquid water.

2605.00413 2026-05-04 cs.SE

ClozeMaster: Fuzzing Rust Compiler by Harnessing LLMs for Infilling Masked Real Programs

ClozeMaster: Fuzzing the Rust compiler by using LLMs to infill masked real programs

Hongyan Gao, Yibiao Yang, Maolin Sun, Jiangchang Wu, Yuming Zhou, Baowen Xu

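AI summary This paper proposes ClozeMaster, which uses LLMs to infill masked real programs and generate valid test cases; it found 27 confirmed bugs in Rust compilers and outperforms existing fuzzing tools.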

Comments Accepted at ICSE 2025

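Details
AI Chinese abstract

Ensuring the reliability of the Rust compiler is critical given Rust's growing adoption in critical systems development, which stems from its emphasis on memory and thread safety. However, generating valid test programs for the Rust compiler is a major challenge because of Rust's complex syntax and strict requirements. With the popularity of large language models (LLMs), the software testing field has explored using LLMs to generate test cases. Yet using LLMs directly to generate Rust programs usually yields many invalid test cases. Existing studies indicate that test cases that triggered historical compiler bugs can assist software testing, and our study of Rust compiler bugs supports this observation. Inspired by existing work and our empirical study, we introduce a bracket-based masking and filling strategy called clozeMask. The strategy extracts test code from historical issue reports, identifies and masks code snippets with specific structures, and uses an LLM to fill the masked parts to synthesize new test programs. The approach exploits the generative capabilities of LLMs while retaining the ability to trigger Rust compiler bugs. It enables comprehensive testing of the compiler's behavior, especially the exploration of edge cases. We implemented the approach as a prototype, CLOZEMASTER. CLOZEMASTER has identified 27 confirmed bugs in rustc and mrustc, 10 of which have been fixed by developers. Our experimental results further show that CLOZEMASTER outperforms existing fuzzers in code coverage and effectiveness.

English abstract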

Ensuring the reliability of the Rust compiler is of paramount importance, given increasing adoption of Rust for critical systems development, due to its emphasis on memory and thread safety. However, generating valid test programs for the Rust compiler poses significant challenges, given Rust's complex syntax and strict requirements. With the growing popularity of large language models (LLMs), much research in software testing has explored using LLMs to generate test cases. Still, directly using LLMs to generate Rust programs often results in a large number of invalid test cases. Existing studies have indicated that test cases triggering historical compiler bugs can assist in software testing. Our investigation into Rust compiler bug issues supports this observation. Inspired by existing work and our empirical research, we introduce a bracket-based masking and filling strategy called clozeMask. The clozeMask strategy involves extracting test code from historical issue reports, identifying and masking code snippets with specific structures, and using an LLM to fill in the masked portions for synthesizing new test programs. This approach harnesses the generative capabilities of LLMs while retaining the ability to trigger Rust compiler bugs. It enables comprehensive testing of the compiler's behavior, particularly exploring edge cases. We implemented our approach as a prototype CLOZEMASTER. CLOZEMASTER has identified 27 confirmed bugs for rustc and mrustc, of which 10 have been fixed by developers. Furthermore, our experimental results indicate that CLOZEMASTER outperforms existing fuzzers in terms of code coverage and effectiveness.

2605.00411 2026-05-04 cs.GT

Budget-Feasible Mechanisms for Submodular Welfare Maximization in Procurement Auctions

Budget-feasible mechanisms for submodular welfare maximization in procurement auctions

Shuang Cui, He Huang, Yu-e Sun, Chen Xue

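AI summary This paper proposes BFM-SWM, the first budget-feasible mechanism for submodular welfare maximization in procurement auctions with provable approximation guarantees, while also improving efficiency and effectiveness.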

Comments Accepted at ICML 2026

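Details
AI Chinese abstract

Budget-feasible procurement auctions play a key role in various AI-driven marketplaces, such as data acquisition and crowdsourcing, where a buyer with a limited budget procures services from strategic sellers with private costs. Although many budget-feasible mechanisms have been proposed for the classic objective of maximizing the buyer's valuation, the more economically significant objective of social welfare maximization has only recently been studied, and existing approaches still sacrifice budget feasibility, which limits their practical use. This paper proposes BFM-SWM, the first budget-feasible mechanism with provable approximation guarantees for submodular welfare maximization in procurement auctions. Our mechanism satisfies standard economic properties, including truthfulness, individual rationality, and non-negative auctioneer surplus. As a by-product, we develop BFM-VM, a mechanism tailored to valuation maximization, which achieves a deterministic approximation ratio of $1/(12+4\sqrt{3})$ for general submodular functions, significantly better than the best deterministic ratio of $1/64$ established by Balkanski et al. at SODA 2022, while reducing the running time from $\mathcal{O}(n^2\log n)$ to $\mathcal{O}(n\log n)$. Extensive experiments demonstrate the efficiency and effectiveness of our mechanisms.

English abstract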

Budget-feasible procurement auctions play a pivotal role in various AI-driven marketplaces, such as data acquisition and crowdsourcing, where a buyer with a limited budget seeks to procure services from strategic sellers with private costs. While numerous budget-feasible mechanisms have been proposed for the classic objective of maximizing the buyer's valuation, the more challenging and economically significant objective of social welfare maximization has only recently been studied, and existing approaches still sacrifice budget feasibility, thereby limiting their practical applicability. In this paper, we bridge this gap by proposing BFM-SWM, the first budget-feasible mechanism with provable approximation guarantees for submodular welfare maximization in procurement auctions. Our mechanism satisfies standard economic properties, including truthfulness, individual rationality, and non-negative auctioneer surplus. As a by-product, we develop BFM-VM, a variant tailored for valuation maximization, which achieves a deterministic approximation ratio of $1/(12+4\sqrt{3})$ for general submodular functions, substantially improving upon the best-known deterministic ratio of $1/64$ established by [Balkanski et al., SODA 2022], while reducing the running time from $\mathcal{O}(n^2\log n)$ to $\mathcal{O}(n\log n)$. Extensive experiments demonstrate the efficiency and effectiveness of our mechanisms.

2605.00409 2026-05-04 math.NA cs.NA

A variational approach to estimating the state of a magma reservoir from observed displacement

A variational method for estimating the state of a magma reservoir from observed displacement

Shungo Kun Tonoyama, Atsushi Suzuki, Takemasa Miyoshi

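AI summary This paper proposes a numerical method that solves an inverse problem variationally to estimate the state of a magma reservoir from observed volcano surface displacement; its core is solving a linear system with a high condition number to obtain the stress distribution of the reservoir.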

Comments 7 pages, 3 figures, submitted to JSIAM Letters

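Details
AI Chinese abstract

We propose a numerical procedure for solving an inverse problem, namely estimating the state of a magma reservoir from observed displacement data at the surface of a volcano. Our variational approach seeks the minimum of a cost function built from norms of the data and of derivatives, which measures the mismatch between estimated and observed displacement. The extremum of the cost function leads to a linear system that determines the stress distribution on the reservoir surface; this system has a very high condition number, but a suitable solution can be obtained with high-precision arithmetic.

English abstract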

We propose a numerical procedure to solve an inverse problem that estimates the state of a magma reservoir from the observed surface displacement of a volcano. Our variational approach seeks the minimizer of a cost function consisting of a norm involving both data and derivative terms, which evaluates the misfit between the estimated and observed displacements. The extremal condition of the cost function leads to a linear system for the stress distribution on the reservoir surface. This system has a very high condition number, but an appropriate solution can be obtained by using high-precision arithmetic.
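
The effect of working precision on an ill-conditioned linear system can be seen in a small sketch. It does not reproduce the paper's system: the Hilbert matrix is a stand-in chosen only because it is a standard ill-conditioned example, and the helper names (`solve`, `hilbert`, `max_error`), the size 10, and the 50-digit precision are assumptions for illustration.

```python
from decimal import Decimal, getcontext


def solve(a, b):
    """Gaussian elimination with partial pivoting (float or Decimal entries)."""
    n = len(b)
    a = [row[:] for row in a]  # work on copies
    b = b[:]
    for k in range(n):
        # Pick the largest pivot in column k to limit error growth.
        p = max(range(k, n), key=lambda i: abs(a[i][k]))
        a[k], a[p] = a[p], a[k]
        b[k], b[p] = b[p], b[k]
        for i in range(k + 1, n):
            f = a[i][k] / a[k][k]
            for j in range(k, n):
                a[i][j] -= f * a[k][j]
            b[i] -= f * b[k]
    x = [0] * n
    for i in reversed(range(n)):  # back substitution
        s = b[i]
        for j in range(i + 1, n):
            s -= a[i][j] * x[j]
        x[i] = s / a[i][i]
    return x


def hilbert(n, num):
    """Hilbert matrix H[i][j] = 1 / (i + j + 1) in the number type `num`."""
    return [[num(1) / num(i + j + 1) for j in range(n)] for i in range(n)]


def max_error(n, num):
    """Solve H x = b with b chosen so that x should be all ones."""
    h = hilbert(n, num)
    b = [sum(row, num(0)) for row in h]
    x = solve(h, b)
    return max(abs(float(xi) - 1.0) for xi in x)


getcontext().prec = 50  # 50 significant digits for Decimal arithmetic
err_float = max_error(10, float)
err_decimal = max_error(10, Decimal)
print("max error, double precision:", err_float)
print("max error, 50-digit Decimal:", err_decimal)
```

The same elimination code runs in both arithmetics, so any difference in the reported error comes from the working precision alone.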

2605.00407 2026-05-04 math.AP

Gradient blowup of smooth vacuum solutions to 1D compressible Euler equations

Gradient blowup of smooth vacuum solutions to the 1D compressible Euler equations

Juhi Jang, Jiaqi Liu, Nader Masmoudi

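AI summary Studies solutions of the 1D compressible Euler equations on the half-line and finds that initially smooth, square-integrable solutions transition in finite time to C^{1-μ} regularity at the boundary, causing gradient blowup at the boundary.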

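Details
AI Chinese abstract

We consider the isentropic compressible Euler equations on the half-line, which govern a gas in contact with a stationary vacuum boundary. We construct a class of initially smooth, square-integrable solutions that transition in finite time to C^{1-μ} regularity (μ∈[1/2,1)) near the boundary, leading to gradient blowup at the boundary. The result is based on a stability analysis of the self-similar waiting-time solutions recently constructed by the authors.

English abstract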

We consider the isentropic compressible Euler equations on the half-line, which govern the motion of gaseous fluids in contact with a stationary vacuum boundary. We construct a large class of solutions that are initially smooth and square-integrable, and which, in finite time, transition to $C^{1-\mu}$ regularity for $\mu \in [1/2,1)$ near the boundary, leading to gradient blowup at the boundary. The construction is based on a stability analysis of the self-similar waiting-time solutions [JLN2025] recently constructed by the authors.

2605.00406 2026-05-04 quant-ph physics.hist-ph

Bell Correlations and Selection Bias

Bell correlations and selection bias

Huw Price

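AI summary This paper argues that Bell correlations in quantum theory may stem from selection bias rather than from a conflict between nonlocality and realism, offering a new perspective on quantum nonlocality.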

Comments 19 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:2602.16985

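Details
AI Chinese abstract

Selection bias is common in science. A method of selecting samples from a larger population can produce bias, in either direction. It may introduce correlations between variables that are independent in the full population, or mask correlations between variables that are dependent in the full population. This paper proposes an unexpected application of these familiar ideas. We argue that they are relevant to the puzzling correlations discovered in quantum theory by John Stewart Bell (Bell 1964). In light of Bell's work and subsequent experiments, it is widely believed that the quantum world is 'nonlocal', which seems to conflict with relativity. Many hold that the only alternative is to abandon 'realism', the view that there is an objective world independent of measurement. We propose another view: Bell correlations are selection bias, which conflicts with neither relativity nor realism.

English abstract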

Selection artefacts are common in science. A method of selecting samples from a larger population may produce bias, in either direction. It may induce correlations between variables independent in the full population, or mask correlations between variables dependent in the full population. Here we propose a surprising application of these familiar ideas. We argue that they are relevant to puzzling correlations uncovered in quantum theory by John Stewart Bell (Bell 1964). In the light of Bell's work and subsequent experiments it is widely believed that the quantum world is 'nonlocal', in apparent tension with relativity. Many hold that the only alternative is to abandon 'realism', the view that there is an objective world independent of measurement. We propose instead that Bell correlations are selection artefacts, in tension neither with relativity nor realism.

2605.00404 2026-05-04 eess.SY cs.SY

Electric Grid Topology and Admittance Estimation using Phasor Measurements

Electric grid topology and admittance estimation using phasor measurements

Norak Rin, Iman Shames, Ian Petersen, Elizabeth Ratnam

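AI summary This paper studies the necessary conditions for determining grid topology and admittance parameters from phasor measurements, and proposes estimating the admittance matrix with structured total least squares when measurement noise is present.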

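Details
AI Chinese abstract

Recent advances in precise phasor measurement units make real-time estimation of distribution and transmission grid parameters possible. This paper studies the voltage and current phasor measurements required to estimate the grid topology and admittance parameters. We give necessary and sufficient conditions on the number of independent operating points (measurements) needed to determine the topology and admittance of a completely unknown grid. With prior topology information, we also give the minimum number of measurements needed to determine the admittance matrix and the corresponding grid topology. When phasor measurements are noisy, we show that the admittance matrix can be estimated with a structured total least squares approach. Through numerical simulations on the IEEE 13-node distribution feeder, the IEEE 14-node transmission network, and the IEEE 123-node distribution feeder, we demonstrate that our approach suits radial and mesh grid topologies with measurement noise.

English abstract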

Recent advances in precise phasor measurement units are enabling new approaches to estimate distribution and transmission grid parameters in real-time. In this paper, we investigate voltage and current phasor measurement requirements to estimate the electric grid topology and admittance parameters. We show necessary and sufficient conditions for the number of independent operating points (measurements) required to determine the topology and admittance of a completely unknown electric grid. With prior topology information, we also show that there is a minimum number of measurements required to uniquely determine the admittance matrix and corresponding grid topology. In the presence of noisy phasor measurements, we show that the admittance matrix can be estimated using a structured total least squares approach. By means of numerical simulations on the IEEE 13-node distribution feeder, the IEEE 14-node transmission network, and the IEEE 123-node distribution feeder, we demonstrate our approach is suitable for applications in radial and mesh grid topologies in the presence of measurement noise.

2605.00399 2026-05-04 cs.CE

Transient Multiscale Workflow for Thermal Analysis of 3DHI Chip Stack

A transient multiscale workflow for thermal analysis of 3DHI chip stacks

Mohammad Elahi, Max O. Bloomfield, Theodorian Borca-Tasciuc, Jacob S. Merson

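AI summary This paper presents a transient thermal workflow that accounts for 3D heterogeneous structures in the BEOL, using a multiscale homogenization strategy to enable thermal analysis of problems with strong and weak temporal scale separation.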

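Details
AI Chinese abstract

Modern package designs use technologies such as backside power delivery (BSPD) and 3D stacked chiplets, and predicting hot spots requires accounting for the heterogeneity of back-end-of-line (BEOL) structures. Multiscale homogenization strategies have proven effective for steady-state simulations, but accurate 3D transient simulations that include BEOL structures remain an open challenge. This paper demonstrates a transient thermal workflow, based on the 2025 work of Bloomfield et al., that accounts for 3D heterogeneous structures in the BEOL under the assumption of temperature-independent constitutive properties. The workflow automatically extracts, meshes, and homogenizes thermal properties from GDSII and OASIS files to build thermal property maps. Property maps were generated for a 1 mm by 1 mm SoC-style model die built with LibreLane, on 100 by 100 grids with 5 micron by 5 micron representative volume elements (RVEs) and on 50 by 50 grids with 10 micron by 10 micron RVEs. Expressions for a transient effective conductivity are provided, and the impact of transient effects is demonstrated for a single RVE. Finally, transient conductivity maps are provided for a time-integration timestep of dt=0.001.

English abstract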

Modern package designs make use of technologies such as backside power delivery (BSPD) and 3D stacked chiplets that require accounting for the heterogeneity in back-end-of-line (BEOL) structures in hot-spot prediction. Multiscale homogenization strategies have been demonstrated to be effective for steady-state simulations; however, accurate 3D transient simulations that include BEOL structures remain an open challenge. In this work, we demonstrate a transient thermal workflow that accounts for the 3D heterogeneous structures in the BEOL for problems with strong and weak temporal scale separation under the assumption of temperature-independent constitutive properties. Our workflow, based on Bloomfield et al. (2025), automatically extracts, meshes, and homogenizes thermal properties from GDSII and OASIS files to construct thermal property maps. Property maps (heat capacity and conductivity) have been generated for a 1 mm by 1 mm SoC-style model die that was constructed with LibreLane for 100 by 100 grids with 5 micron by 5 micron representative volume elements (RVEs), and 50 by 50 grids with 10 micron by 10 micron RVEs. Expressions for a transient effective conductivity are provided, and the impact of the transient effects is demonstrated for a single RVE. Finally, transient conductivity maps are provided for a time-integration timestep of dt=0.001.

2605.00396 2026-05-04 math.OC

Riemannian Optimization over Symmetric Positive Definite Matrices with the Alpha-Procrustes Geometry

Riemannian optimization over symmetric positive definite matrices with the Alpha-Procrustes geometry

Derun Zhou, Keisuke Yano, Mahito Sugiyama

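AI summary This paper applies the Alpha-Procrustes geometry to conditioning problems in optimization over symmetric positive definite matrices, improves robustness through uniform bounds, and validates the theory in several applications.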

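Details
AI Chinese abstract

In Riemannian optimization, the condition number of the Riemannian Hessian at an optimum strongly affects the asymptotic convergence of optimization algorithms. On the manifold of symmetric positive definite (SPD) matrices, commonly used metrics such as the Affine-Invariant (AI) metric and the Bures-Wasserstein (BW) metric become ill-conditioned when the underlying SPD matrix becomes ill-conditioned. Hence, even if the Euclidean Hessian stays uniformly well-conditioned on the SPD manifold, optimization can still become difficult near an optimum associated with an ill-conditioned SPD matrix. This paper addresses the problem through the Alpha-Procrustes (AP) geometry on the SPD manifold. This geometry generalizes several known metrics, including the Log-Euclidean (LE) metric (α=0) and the BW metric (α=1/2). We first prove that, when α=1, all eigenvalues of the Riemannian metric operator induced by the AP geometry are uniformly bounded, independently of the underlying SPD matrix. Therefore, assuming the Euclidean Hessian satisfies uniform spectral bounds, the eigenvalues of the corresponding Riemannian Hessian are also uniformly bounded, independently of the underlying SPD matrix. The case α=1 thus provides a robust geometric framework for several Riemannian optimization problems involving ill-conditioned SPD matrices. Finally, we validate our theoretical findings with extensive numerical experiments across several applications.

English abstract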

In Riemannian optimization, it is well known that the condition number of the Riemannian Hessian at an optimum strongly influences the asymptotic convergence behavior of optimization algorithms. On the manifold of symmetric positive definite (SPD) matrices, several commonly used metrics for optimization, such as the Affine-Invariant (AI) and Bures--Wasserstein (BW) metrics, tend to become ill-conditioned as the underlying SPD matrix becomes ill-conditioned. As a result, even when the Euclidean Hessian remains uniformly well-conditioned on the SPD manifold, optimization may still become difficult near an optimum associated with an ill-conditioned SPD matrix. In this paper, we address this issue through the Alpha-Procrustes (AP) geometry on the SPD manifold. This geometry generalizes several well-known metrics, including the Log-Euclidean (LE) metric for \(α=0\) and the BW metric for \(α=1/2\). We first show that, when \(α=1\), all eigenvalues of the Riemannian metric operator induced by the AP geometry are uniformly bounded independently of the underlying SPD matrix. Therefore, under the assumption that the Euclidean Hessian satisfies the uniform spectral bounds, all the eigenvalues of the corresponding Riemannian Hessian are uniformly bounded independently of the underlying SPD matrix. Consequently, the case \(α=1\) provides a robust geometric framework for several Riemannian optimization problems involving ill-conditioned SPD matrices. Finally, we validate our theoretical findings through extensive numerical experiments across a range of applications.

2605.00395 2026-05-04 math.OC

Controlling the Swarm: Sparse Actuation and Collision Avoidance under Stochastic Delay

Controlling the swarm: Sparse actuation and collision avoidance under stochastic delay

Jiguang Yu

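AI summary This paper presents a unified finite-N framework for sparse actuation and collision avoidance in leader-follower particle systems under stochastic delay, resolves the mathematical well-posedness problem by constructing a Lyapunov functional, and shows that sparse actuation reduces control cost while revealing non-monotone sensitivity to leader density.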

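Details
AI Chinese abstract

Classical flocking models show how local interactions produce emergent order, but real multi-agent deployments are severely constrained: limited actuator availability, heterogeneous communication delays, and environmental noise. In this talk, we present a unified finite-N framework and study a delayed stochastic leader-follower particle system with topological communication, singular repulsion, and bounded sparse leader actuation. The central challenge for such systems is mathematical well-posedness, because discontinuous communication laws and singular repulsion conflict with the standard strong Itô framework. We resolve this by introducing an augmented Lyapunov functional that simultaneously enforces a strict collision barrier and closes a uniform Gronwall estimate. On this rigorous foundation, we formulate a free-terminal-time, chance-constrained optimal control problem. We show that temporally sparse, bang-off-bang leader actuation not only significantly reduces control cost compared with continuous baselines but also reveals non-monotone sensitivity to leader density. Finally, we show that in delayed stochastic swarms, adding more direct actuation is not strictly optimal, which highlights a highly non-trivial resource allocation paradox in cooperative control.

English abstract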

Classical flocking models demonstrate how local interactions generate emergent order, but real-world multi-agent deployments are bound by severe constraints: limited actuator availability, heterogeneous communication latencies, and environmental noise. In this talk, we present a unified finite-N framework that tackles the interplay of these exact mechanisms. We study a delayed stochastic leader-follower particle system featuring topological communication, singular repulsion, and bounded sparse leader actuation. A central challenge in such systems is mathematical well-posedness, as discontinuous communication laws and singular repulsions clash with standard strong Ito frameworks. We resolve this by introducing an augmented Lyapunov functional that simultaneously enforces a strict collision barrier and closes a uniform Gronwall estimate. Building on this rigorous foundation, we formulate a free-terminal-time, chance-constrained optimal control problem. We show that temporally sparse, bang-off-bang leader actuation not only drastically reduces control effort compared to continuous baselines, but also reveals non-monotone sensitivities to leader density. Ultimately, we demonstrate that in delayed stochastic swarms, adding more direct actuation is not strictly optimal -- highlighting a highly non-trivial resource allocation paradox in cooperative control.

2605.00391 2026-05-04 astro-ph.IM astro-ph.HE gr-qc

Training a neural network to rapidly identify candidate gravitational-wave events in the lower mass gap

Training a neural network to rapidly identify candidate gravitational-wave events in the lower mass gap

Nayyer Raza, Man Leong Chan, Daryl Haggard, Ashish Mahabal, Jess McIver, Audrey Durand, Alexandre Larouche, Hadi Moazen

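AI summary This paper presents the GWSkyNet-MassGap model, which predicts the mass-gap probability and neutron-star probability of candidate mergers, helping to quickly determine whether a gravitational-wave event involves the mass-gap region and thereby guiding follow-up observations.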

Comments 15 pages, 7 figures, 2 tables, submitted to ApJ

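Details
AI Chinese abstract

The physics at the boundary between the most massive neutron stars (NSs) and the least massive black holes (BHs) is currently uncertain but could be constrained by new observations. Although NSs have been observed with masses up to about 2 solar masses, electromagnetic observations of compact objects in the range of about 2 to 5 solar masses, known as the lower mass gap, are scarce. Gravitational-wave signals recently observed by the LIGO-Virgo-KAGRA collaboration suggest that this gap may not be empty. Rapidly determining whether a candidate gravitational-wave event has a component in this mass gap can indicate whether a detectable electromagnetic counterpart is likely, and so guide follow-up observations. This paper trains a neural network model, GWSkyNet-MassGap, that simultaneously predicts the probability that a candidate merger has a component in the lower mass gap (P_MassGap) and the probability that it involves a neutron star (P_NS). The model can infer information about the source chirp mass to predict P_MassGap and P_NS; it predicts correctly for high-mass mergers (M_c ≳ 15 solar masses) but less accurately for lower-mass systems, which need binary mass ratio information to break the mass degeneracy. For candidate events in the first part of LVK's fourth observing run (O4a), the model's mean prediction error is 9% for P_MassGap and 6% for P_NS. The model could be developed further to rapidly predict the source chirp mass of candidate events in future observing runs.

English abstract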

The physics governing the boundary between the most massive neutron stars (NSs) and the least massive black holes (BHs) is currently uncertain, but could potentially be constrained with new observations. While NSs have been observed with masses up to $\sim2~M_{\odot}$, there is a dearth of electromagnetic observations of compact objects in the $\sim2-5~M_{\odot}$ range, known as the lower mass gap. Recent observations of gravitational-wave (GW) signals from binary mergers detected by the LIGO-Virgo-KAGRA (LVK) collaboration indicate that this gap is likely not empty. Rapidly distinguishing whether a candidate GW event has components in this purported mass gap can indicate the likelihood of a detectable electromagnetic counterpart, and thus inform decisions for follow-up observations. In this work we train a neural network model, GWSkyNet-MassGap, that simultaneously predicts the probability that a candidate merger has a component in the lower mass gap ($P_{\mathrm{MassGap}}$) and the probability that it involves a NS ($P_{\mathrm{NS}}$). We find that the model is able to infer information about the source chirp mass to predict $P_{\mathrm{MassGap}}$ and $P_{\mathrm{NS}}$, leading to correct predictions for high-mass mergers with $\mathcal{M}_c\gtrsim15~M_{\odot}$, but less accurate predictions for lower-mass systems which require knowledge of the binary mass ratio to break the mass degeneracy. For candidate events in the first part of LVK's fourth observing run (O4a), the model has a mean prediction error of 9% for $P_{\mathrm{MassGap}}$ and 6% for $P_{\mathrm{NS}}$. The model could be further developed to rapidly predict the source chirp mass for candidate events in future observing runs.
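
The mass degeneracy mentioned above can be made concrete with the standard chirp-mass formula, $\mathcal{M}_c = (m_1 m_2)^{3/5}/(m_1+m_2)^{1/5}$. The sketch below is not part of GWSkyNet-MassGap: the function names, the example chirp mass of 2.5 solar masses, and the hard gap edges at 2 and 5 solar masses (the abstract gives these only approximately) are assumptions for illustration.

```python
def chirp_mass(m1, m2):
    """Chirp mass in the same units as the component masses."""
    return (m1 * m2) ** 0.6 / (m1 + m2) ** 0.2


def components(mc, q):
    """Component masses (m1, m2) for chirp mass mc and mass ratio q = m2/m1 <= 1."""
    # Substituting m2 = q * m1 gives mc = m1 * q**0.6 / (1 + q)**0.2.
    m1 = mc * (1 + q) ** 0.2 / q ** 0.6
    return m1, q * m1


def in_gap(m1, m2, low=2.0, high=5.0):
    """True if either component lies in the assumed 2 to 5 solar-mass gap."""
    return any(low <= m <= high for m in (m1, m2))


# Same chirp mass, different mass ratios, different mass-gap membership.
for q in (1.0, 0.2):
    m1, m2 = components(2.5, q)
    print(f"q={q}: m1={m1:.2f}, m2={m2:.2f}, in gap: {in_gap(m1, m2)}")
```

With equal masses both components fall inside the assumed gap, while at q = 0.2 the heavier one lies above it and the lighter one below, so the chirp mass alone cannot settle the question.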

2605.00388 2026-05-04 math.OC

First-Order Optimality Conditions for Mathematical Programming with Equilibrium Constraints

First-order optimality conditions for mathematical programming with equilibrium constraints

Louis Shuo Wang

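AI summary This paper discusses first-order optimality conditions for mathematical programs with equilibrium constraints, analyzes the limitations of classical nonlinear programming techniques, and proposes an analysis framework based on the geometric structure of the feasible region, giving theoretical guidance for such optimization problems.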

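Details
AI Chinese abstract

We give a systematic introduction to first-order optimality conditions for mathematical programs with equilibrium constraints (MPECs), emphasizing the limitations of classical nonlinear programming techniques. The goal is twofold. First, we explain why directly applying standard optimality conditions, by reformulating MPECs through KKT systems or differentiable exact penalty functions, is usually inadequate: such approaches typically require strong and restrictive assumptions, including nondegeneracy and smoothness conditions. Second, we develop a principled framework that analyzes MPECs by focusing on the geometric structure of the feasible region. In particular, we study stationarity concepts and give a detailed characterization of the tangent cone at feasible points, which yields constraint qualifications suited to MPECs. These results form the basis for rigorous first-order analysis and clarify the relationship between the original MPEC formulation and its KKT-based representation, offering practical guidance for these inherently challenging optimization problems.

English abstract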

We present a systematic introduction to first-order optimality conditions for mathematical programs with equilibrium constraints (MPECs), emphasizing the limitations of classical nonlinear programming techniques. The goal is twofold. First, we explain why a direct application of standard optimality conditions -- based on reformulating MPECs via KKT systems or differentiable exact penalty functions -- is often inadequate, as such approaches typically require strong and restrictive assumptions, including nondegeneracy and smoothness conditions. Second, we develop a first-principles framework for analyzing MPECs by focusing on the geometric structure of the feasible region. In particular, we study stationarity concepts and provide a detailed characterization of the tangent cone at feasible points, which leads to appropriate constraint qualifications tailored to MPECs. These results form the foundation for rigorous first-order analysis and clarify the relationship between the original MPEC formulation and its KKT-based representation, offering practical guidance for handling these inherently challenging optimization problems.

2605.00387 2026-05-04 math.OC

Introduction to Exact Penalization for Mathematical Programming with Equilibrium Constraints

Introduction to exact penalization for mathematical programming with equilibrium constraints

Louis Shuo Wang

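AI summary This paper introduces exact penalty methods for nonlinear programs and mathematical programs with equilibrium constraints, emphasizes their connection to modern error bound theory, and discusses the interpretation of classical optimality conditions and applications of the extended framework.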

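Details
AI Chinese abstract

This paper focuses on exact penalty methods for nonlinear programs and mathematical programs with equilibrium constraints (MPECs), emphasizing their connection to modern error bound theory. The goal is twofold. First, we explain how classical optimality conditions can be interpreted through exact penalization, and why such results usually rely on constraint regularity conditions that can be understood as error bounds on perturbations of the feasible set. We then highlight recent developments based on subanalytic geometry and Lojasiewicz-type inequalities, which extend this framework and enable exact penalization under broader analytic conditions. Second, we show how this theory can be applied to MPECs in practice by reformulating them through KKT systems and constructing exact penalty functions based on residual mappings. Particular attention is given to fractional-order penalties arising from Lojasiewicz error bounds, and to improved formulations for special problem classes in which sharper exponents can be obtained. These developments offer theoretical insight and practical guidance for analyzing and solving challenging constrained optimization problems.

English abstract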

We present a focused introduction to exact penalty methods for nonlinear programs and mathematical programs with equilibrium constraints (MPECs), emphasizing their connection to modern error bound theory. The goal is twofold. First, we explain how classical optimality conditions can be interpreted through exact penalization, and why such results typically rely on constraint regularity conditions that can be understood as error bounds on perturbations of feasible sets. We then highlight how recent developments based on subanalytic geometry and Lojasiewicz-type inequalities extend this framework beyond classical regularity assumptions, enabling exact penalization under broader analytic conditions. Second, we demonstrate how this theory can be applied in practice to MPECs by reformulating them via KKT systems and constructing exact penalty functions based on residual mappings. Particular attention is given to fractional-order penalties arising from Lojasiewicz error bounds, as well as to improved formulations for special problem classes where sharper exponents can be obtained. These developments provide both theoretical insight and practical guidance for analyzing and solving challenging constrained optimization problems.