arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.18194 2026-06-17 cs.GT math.DS math.OC 新提交

Ergodic Deviation-Robust Equilibrium under Mirror Descent Learning in Finite Games

有限博弈中镜像下降学习下的遍历偏差鲁棒均衡

Joshua Steier

AI总结 提出遍历偏差鲁棒均衡(EDRE),一种针对熵镜像下降学习的动态相关均衡概念,要求极限分布为ε-纳什均衡、全程偏差增益为√T量级且为EMD不动点,并证明其在势博弈中存在性及PPAD难度。

Comments Under Review

详情
AI中文摘要

我们引入了遍历偏差鲁棒均衡(EDRE),这是一种针对重复有限博弈的动态相关均衡概念,其中智能体通过熵镜像下降(EMD)进行学习。EDRE要求同一配置和学习运行同时满足三个性质:(E1)极限配置是乘积分布下的ε-纳什均衡;(E2)在整个学习轨迹上,每个固定联盟的累积(单边)偏差增益以高概率为~O(√T);(E3)极限配置是EMD映射的不动点,因此它是由动力学选择而非仅仅被认证为均衡。我们证明了√T的偏差遗憾率是阶紧的,建立了在精确势博弈中的存在性(通过纳什定理,并在凹性下给出构造性近端路径),同时证明了EMD的Lyapunov单调性(当不动点集为单点集时逐点收敛),并通过变分不等式将选择性质扩展到单调多矩阵博弈。尽管静态EDRE等同于ε-纳什均衡,但其内容是动态的:EMD下的鲁棒(正测度)选择排除了线性不稳定均衡,因此EDRE充当了带有动态证书而非静态精炼的纳什均衡。在复杂性方面,我们证明了一般多矩阵博弈中计算EDRE是PPAD难的,而在势博弈中属于promise-PPAD。一个2×2协调博弈的实例说明了该框架的所有组成部分。附录中包含了额外结果,包括赌博反馈扩展、大步长下双策略EMD映射通向Li-Yorke混沌的倍周期路径、最小成本转向的线性规划公式以及支持性模拟。

英文摘要

We introduce Ergodic Deviation-Robust Equilibrium (EDRE), a dynamics-relative equilibrium concept for repeated finite games in which agents learn via entropic mirror descent (EMD). EDRE requires three properties to hold simultaneously for the same profile and learning run: (E1) the limit profile is an $\varepsilon$-Nash equilibrium at a product distribution; (E2) along the entire learning trajectory, every fixed coalition's cumulative aggregate (summed-unilateral) deviation gain is $\tilde{\mathcal{O}}(\sqrt{T})$ with high probability; and (E3) the limit profile is a fixed point of the EMD map, so that it is selected by the dynamics rather than merely certified as an equilibrium. We prove that the $\sqrt{T}$ deviation-regret rate is order-tight, establish existence in exact-potential games (via Nash's theorem, with a constructive proximal route under concavity) together with Lyapunov monotonicity of EMD (and pointwise convergence when the fixed-point set is a singleton), and extend the selection property to monotone polymatrix games through variational inequalities. Although a static EDRE coincides with an $\varepsilon$-Nash equilibrium, its content is dynamic: robust (positive-measure) selection under EMD excludes linearly unstable equilibria, so EDRE acts as a Nash equilibrium equipped with a dynamic certificate rather than a static refinement. On the complexity side, we show that computing EDRE is PPAD-hard in general polymatrix games and belongs to promise-PPAD for potential games. A worked $2\times 2$ coordination-game example illustrates all components of the framework. Additional results, including a bandit-feedback extension, a period-doubling route to Li-Yorke chaos for the two-strategy EMD map at large step size, a linear-program formulation for minimum-cost steering, and supporting simulations, appear in the appendices.

2606.18183 2026-06-17 stat.ML cs.LG math.PR 新提交

A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise

马尔可夫噪声下线性特征时序差分学习的扩散近似

M. Forzo, E. Monzio Compagnoni, A. Russo, A. Pacchiano

发表机构 * Technical University of Munich (TUM), Munich, Germany(慕尼黑技术大学) University of Basel, Basel, Switzerland(巴塞尔大学) Boston University, Boston, USA(波士顿大学)

AI总结 针对线性TD(0)在马尔可夫噪声下的随机波动,提出随机微分方程近似模型,揭示投影Bellman算子收缩动力学与马尔可夫采样影响的区别,解释常数步长误差下限。

详情
AI中文摘要

带有线性函数逼近的时序差分(TD)学习是策略评估的核心方法。其经典连续时间描述为常微分方程(ODE),捕捉渐近均值动态但忽略了决定误差下限的随机波动。我们引入了马尔可夫噪声下线性TD(0)的随机微分方程(SDE)近似。所得模型将投影Bellman算子控制的收缩动力学与马尔可夫采样的影响区分开来。因此,该模型通过马尔可夫长期协方差与投影Bellman算子收缩几何之间的相互作用解释了常数步长误差下限。

英文摘要

Temporal difference (TD) learning with linear function approximation is a core method for policy evaluation. Its classical continuous-time description is an ordinary differential equation (ODE), which captures the asymptotic mean dynamics but neglects stochastic fluctuations determining the error floor. We introduce a stochastic differential equation (SDE) approximation for linear TD(0) under Markovian noise. The resulting model distinguishes the contraction dynamics governed by the projected Bellman operator from the influence of Markovian sampling. As a consequence, the model explains the constant-stepsize error floor through the interaction between Markovian long-run covariance and the contraction geometry of the projected Bellman operator.

2606.17854 2026-06-17 cs.CG cs.DM cs.DS math.CO 新提交

A Counterexample to Wegner's Conjecture for Axis-Parallel Rectangles

Wegner 关于轴平行矩形猜想的反例

Deepak Ajwani, Rishikesh Gajjala, Rajiv Raman, Saurabh Ray

AI总结 构造了一个三角形无关的矩形相交图,其独立数至多为 n/4,从而得到 τ(R) ≥ 2ν(R),推翻了 Wegner 猜想 τ(R) ≤ 2ν(R)-1。

Comments 30 pages

详情
AI中文摘要

1965年,Wegner 猜想每个有限的轴平行矩形族 \(\mathcal R\) 满足 \(\tau(\mathcal R) \le 2\nu(\mathcal R)-1\),其中 \(\tau(\mathcal R)\) 表示刺穿所有矩形所需的最小点数,\(\nu(\mathcal R)\) 表示两两不相交子族的最大大小。在过去的六十年里,该猜想激发了一系列工作:它已被验证适用于几类特殊的矩形族,已知的一般上界也在逐步改进,但猜想本身仍然开放。我们给出了一个显式反例。更精确地说,我们构造了一个在 \(n\) 个顶点上的三角形无关的矩形相交图,其独立数至多为 \(n/4\)。由于该图是三角形无关的,平面中没有点能位于三个矩形内;因此每个刺穿点最多击中两个矩形。于是,\(\tau(\mathcal R) \ge n/2 \ge 2\nu(\mathcal R)\),与 Wegner 的猜想界矛盾。我们还给出了一个稍一般的构造,使得 \(\tau(\mathcal R) \ge 2.21\nu(\mathcal R)\)。这表明矩形最大独立集问题的标准点松弛(等价于团松弛)的积分间隙至少为 2.21。

英文摘要

In 1965, Wegner conjectured that every finite family \(\mathcal R\) of axis-parallel rectangles in the plane satisfies \(\tau(\mathcal R) \le 2\nu(\mathcal R)-1\), where \(\tau(\mathcal R)\) denotes the minimum number of points needed to pierce all rectangles in \(\mathcal R\), and \(\nu(\mathcal R)\) denotes the maximum size of a pairwise disjoint subfamily. Over the last six decades, the conjecture has motivated a long line of work: it has been verified for several special classes of rectangle families, and the best known general upper bounds have been progressively improved, but the conjecture itself had remained open. We give an explicit counterexample. More precisely, we construct a triangle-free rectangle-intersection graph on \(n\) vertices whose independence number is at most \(n/4\). Since the graph is triangle-free, no point of the plane can lie in three rectangles; hence every piercing point hits at most two rectangles. Consequently, \(\tau(\mathcal R) \ge n/2 \ge 2\nu(\mathcal R)\), contradicting Wegner's conjectured bound. We also give a slightly more general construction for which \(\tau(\mathcal R) \ge 2.21\nu(\mathcal R)\). This shows that the standard point relaxation, equivalently the clique relaxation, for the Maximum Independent Set of Rectangles problem has integrality gap at least \(2.21\).

2606.17777 2026-06-17 stat.ME math.ST stat.ML 新提交

On Response-Adaptive Targeting Strategies for Multi-Treatment Experiments

多处理实验中的响应自适应目标策略

Redouane Yagouti, Rémy Degenne, Emilie Kaufmann

AI总结 提出统一框架αRTS,将两臂ERADE策略推广到多臂实验,证明渐近性质并引入强制探索变体解决稀疏目标问题。

详情
AI中文摘要

临床试验中的响应自适应随机化(RAR)旨在通过根据观察到的结果动态分配患者到治疗组来提高伦理和统计效率。虽然基于目标最优分配的RAR已在两臂设置中得到广泛研究,但其扩展到多处理实验($K \geq 2$)在理论上仍然零散,大多数现有方法集中于特定算法或受限的目标分配。在本文中,我们引入了一个响应自适应目标的统一框架,即$\alpha$再平衡目标策略($\alpha$RTS),它推广了Hu等人[2009]的ERADE两臂策略。我们证明了该族中的所有设计共享基本的渐近性质:强相合性、分配比例和处理效应估计量的渐近正态性以及渐近效率。为了解决稀疏目标情况(其中某些处理被渐近消除),我们进一步提出了带有强制探索的$\alpha$RTS,这是一种保证所有处理无限采样同时保持渐近保证的变体。广泛的模拟说明了$\alpha$RTS变体在三臂背景下的有限样本行为,特别强调了强制探索在稀疏设置中的关键作用。

英文摘要

Response-adaptive randomization (RAR) in clinical trials aims to improve ethical and statistical efficiency by dynamically allocating patients to treatments based on observed outcomes. While RAR based on a target optimal allocation have been extensively studied for two-arms settings, their extension to multi-treatment experiments ($K \geq 2$) remains theoretically fragmented, with most existing methods focusing on specific algorithms or restricted target allocations. In this paper, we introduce a unified framework for response-adaptive targeting, the $\alpha$-Rebalancing Targeting Strategies ($\alpha$RTS), which generalize the ERADE two-armed strategy of Hu et al. [2009]. We prove that all designs in this family share fundamental asymptotic properties: strong consistency, asymptotic normality of allocation proportions and treatment effect estimators, and asymptotic efficiency. To address sparse target regimes (where some treatments are asymptotically eliminated), we further propose $\alpha$RTS with Forced Exploration, a variant that guarantees infinite sampling for all treatments while preserving the asymptotic guarantees. Extensive simulations illustrate the finite-sample behavior of $\alpha$RTS variants in a 3-armed context, highlighting in particular the critical role of forced exploration in sparse settings.

2606.17604 2026-06-17 cs.DS math.PR math.ST 新提交

Spectral recovery of a planted triangle-dense subgraph

三角密集子图的谱恢复

Sam van der Poel, Cheng Mao, Benjamin McKenna

AI总结 针对最坏情况计算困难的三角密集k子图问题,提出基于局部带符号三角计数的谱算法和半定规划,在Erdős-Rényi随机图中恢复种植的三角密集子图,并揭示统计-计算间隙。

Comments 48 pages, zero figures. Accepted for presentation at the Conference on Learning Theory (COLT) 2026

详情
AI中文摘要

给定一个$n$个顶点的简单图和一个参数$k$,三角密集$k$子图问题在最坏情况下已知是计算困难的。为了规避计算困难,我们研究了一个平均情况模型,其中在一个$n$个顶点的Erdős-Rényi随机图中种植了一个$k$个顶点的三角密集子图。对于种植子图的恢复,我们提出了一种简单的谱算法和一个半定规划,两者都使用一个图矩阵,其条目是局部带符号三角计数。这些算法的理论保证通过图矩阵的谱分析建立。最后,我们提供了证据表明存在类似于种植团问题的统计-计算间隙。在低次多项式算法的框架下,关于子图大小$k$的计算阈值至少为$\sqrt{n}$,而信息论阈值至多为$\log n$。

英文摘要

Given a simple graph on $n$ vertices and a parameter $k$, the triangle-densest-$k$-subgraph problem is known to be computationally hard in the worst case. To circumvent the computational hardness, we study an average-case model where a triangle-dense subgraph on $k$ vertices is planted in an Erdős-Rényi random graph on $n$ vertices. For the recovery of the planted subgraph, we propose a simple spectral algorithm and a semidefinite program, both of which use a graph matrix whose entries are local signed triangle counts. Theoretical guarantees for these algorithms are established through spectral analysis of the graph matrix. Finally, we provide evidence showing a statistical-to-computational gap analogous to that for the planted clique problem. The computational threshold in terms of the subgraph size $k$ is at least $\sqrt{n}$ in the framework of low-degree polynomial algorithms, while the information-theoretic threshold is at most logarithmic in $n$.

2606.17600 2026-06-17 stat.ME math.ST stat.ML 新提交

Proximal Mediation Analysis with Hidden Recanting Witnesses

存在隐藏反悔证人的近端中介分析

Sihan Wu, Yang Bai, Yifan Cui

AI总结 针对中介分析中未知反悔证人(治疗诱导的中介-结局混杂因素)导致的路径效应识别难题,提出三种基于近端因果推断的识别策略,并开发了近端多重稳健估计量,在部分模型正确设定时仍一致,且渐近正态达到半参效率界。

详情
AI中文摘要

中介分析对于将治疗的因果效应分解为直接和间接路径至关重要。然而,许多实际场景依赖于一个严格的假设,即反悔证人(定义为治疗诱导的中介-结局混杂因素)要么不存在,要么事先完全已知。这一要求往往难以成立,尤其是当这些变量由于测量困难或隐私限制而无法观测时。在本文中,我们利用近端因果推断,提出了三种新的识别策略,以应对在存在未知反悔证人的情况下识别路径特定效应的挑战。在此基础上,我们开发了一个半参数推断框架,推导了有效影响函数,并提出了一种近端多重稳健估计量,该估计量在至少一组 nuisance 模型正确设定时保持一致。当所有 nuisance 模型正确设定并以适当速率收敛时,该估计量渐近正态并达到半参数效率界。我们提供了一种基于极小极大优化的去偏机器学习程序,用于点估计和构建有效置信区间。通过模拟研究和真实数据应用,展示了所提方法的性能。

英文摘要

Mediation analysis is essential for decomposing the causal effect of a treatment into direct and indirect pathways. However, many practical settings rely on the stringent assumption that recanting witnesses, defined as treatment-induced mediator-outcome confounders, are either absent or fully known a priori. Such a requirement is often untenable, especially when these variables remain unobservable due to measurement difficulties or privacy constraints. In this paper, we leverage proximal causal inference to develop three novel identification strategies to address the challenge of identifying path-specific effects in the presence of unknown recanting witnesses. Building on this, we develop a semiparametric inference framework that derives the efficient influence function and proposes a proximal multiply robust estimator, which remains consistent if at least one set of nuisance models is correctly specified. When all nuisance models are correctly specified and converge at appropriate rates, the estimator is asymptotically normal and achieves the semiparametric efficiency bound. We provide a minimax optimization-based debiased machine learning procedure for point estimation and constructing valid confidence intervals. The performance of the proposed methods is demonstrated by simulation studies and a real data application.

2606.17594 2026-06-17 eess.SY math.OC 新提交

Low-Thrust Orbital Differential Games with Speed Constraint Enforcement Using CostWeighting

使用成本加权的低推力轨道微分博弈与速度约束强制执行

Yahli Drucker, Vitaly Shaferman

AI总结 针对近圆轨道上两航天器近距离交会的终端相对速度约束问题,提出线性二次零和微分博弈模型,推导出解析闭环最优制导律,并通过成本函数权重选择实现任意终端速度。

Comments This work was submitted for journal publication. 22 pages and 9 figures

详情
AI中文摘要

本文考虑具有任意终端相对速度约束的低推力航天器追逃微分博弈问题。它处理了近圆轨道上两个相对接近的航天器的交会终端阶段。该问题被表述为一个线性二次零和微分博弈,对终端相对位置和速度有软约束,并对参与者的控制努力有运行成本。为每个参与者推导出解析的、闭环的、最小燃料消耗的最优制导律,形成鞍点解。证明通过适当选择成本函数的权重参数可以实现任何终端速度。为了验证解的最优性,当成本函数速度权重矩阵为正定或负定时,进行了共轭点分析。负定情况出现在高终端速度下,在文献中很少见。通过仿真评估了所推导制导律在不同目标机动下的性能,并与基于最优控制的最先进制导律进行了比较。仿真表明,所推导的制导律满足约束条件,并且在目标最优规避时比基于最优控制的制导律具有显著优势。

英文摘要

This paper considers the problem of a low-thrust spacecraft pursuit-evasion differential game with an arbitrary terminal relative speed constraint. It addresses the terminal phase of the engagement for two relatively close spacecraft near a circular orbit. The problem is formulated as a linear-quadratic zero-sum differential game, with soft constraints on the terminal relative position and velocity, and running costs on the players' control efforts. An analytical, closed-loop, minimum-fuel-consumption optimal guidance law is derived for each player, forming a saddle-point solution. It is proven that any terminal speed can be achieved by properly choosing the weighting parameters of the cost function. To verify the optimality of the solution, a conjugate point analysis is performed when the cost function velocity weighting matrix is either positive or negative definite. The negative-definite case arises at high terminal speeds and is seldom seen in the literature. The performance of the derived guidance law is evaluated in simulations for different target maneuvers and compared to a state-of-the-art optimal-control-based guidance law. The simulations show that the derived guidance law satisfies the constraints and offers a substantial advantage over the optimal-control-based guidance law when the target is optimally evading.

2606.17586 2026-06-17 q-bio.PE math.DS 新提交

Aggregation as a Double-Edged Sword: Fear, Allee Effects, and Finite-Time Collapse

聚合是一把双刃剑:恐惧、Allee效应与有限时间崩溃

Kwadwo Antwi-Fordjour, Eric M. Takyi

AI总结 研究通过疾病-捕食模型揭示,猎物聚合在恐惧和Allee效应下会加速生态系统有限时间崩溃,并首次量化了行为与人口参数对崩溃速度的影响。

详情
AI中文摘要

猎物聚合被广泛视为抵御捕食的防御机制,但我们表明,在受捕食者诱导的恐惧和人口Allee阈值影响的疾病结构化种群中,聚合可能矛盾地加速生态系统崩溃。我们开发并分析了一个易感-感染-捕食者模型,该模型包含双重恐惧反应——以及一个次线性基于聚合的捕食项和Allee效应。关键地,我们推导出灭绝时间的显式上界,该上界随着捕食压力增加或聚合增强而减小,首次量化了行为和人口参数如何共同决定生态崩溃的速度。这种有限时间灭绝随后引发感染猎物和捕食者种群的级联崩溃,导致整个生态群落灭绝。分岔分析揭示了随着恐惧强度、聚合强度和Allee阈值变化而出现的跨临界、鞍结和Hopf分岔。双参数延拓进一步确定了恐惧-Allee参数平面中稳定共存、振荡共存、捕食者排除和有限时间灭绝发生的精确区域,表明更强的聚合单调地扩大了有限时间灭绝区域,而较弱的聚合支持更丰富的共存动态景观。这些结果表明,当种群水平的行为防御与疾病动态和人口脆弱性相互作用时,可能产生突然的生态临界点。

英文摘要

Prey aggregation is widely regarded as a defense against predation, yet we show that in disease-structured populations subject to predator-induced fear and demographic Allee thresholds, aggregation can paradoxically accelerate ecosystem collapse. We develop and analyze a susceptible-infectious-predator model incorporating dual fear responses -- together with a sublinear aggregation-based predation term and an Allee effect. Critically, we derive an explicit upper bound on the extinction time that decreases as predator pressure increases or aggregation strengthens, quantifying for the first time how behavioral and demographic parameters jointly determine the speed of ecological collapse. This finite-time extinction subsequently triggers a cascade collapse of the infected prey and predator populations, driving the entire ecological community to extinction. Bifurcation analysis reveals transcritical, saddle-node, and Hopf bifurcations as fear intensity, aggregation strength, and Allee threshold vary. Two-parameter continuation further identifies the precise regions of the fear--Allee parameter plane in which stable coexistence, oscillatory coexistence, predator exclusion, and finite-time extinction occur, demonstrating that stronger aggregation monotonically enlarges the finite-time extinction region while weaker aggregation supports a richer landscape of coexistence dynamics. These results demonstrate that behavioral defenses operating at the population level can generate abrupt ecological tipping points when they interact with disease dynamics and demographic vulnerability.

2606.17531 2026-06-17 cs.LG cs.CG math.AT 新提交

Non-negative Matrix Factorisation with Topological Regularisation

带拓扑正则化的非负矩阵分解

Matias de Jong van Lier, Shizuo Kaji, Keunsu Kim

发表机构 * Recursive Inc.(Recursive公司) Graduate School of Science, Kyoto University(京都大学理学研究科) Institute of Mathematics for Industry, Kyushu University(九州大学数理学研究院)

AI总结 提出通过持久同调作为拓扑正则化项融入非负矩阵分解目标函数,以学习具有空间连贯性、周期结构或团状图信号的可解释基函数。

详情
AI中文摘要

我们研究了通过正则化学习到的基函数的拓扑结构,在非负矩阵分解(NMF)中学习可解释基函数。我们的方法源于观察到许多数据模态可以视为结构化域上的非负函数,其中基的质量与其拓扑结构内在相关。然而,纳入支撑拓扑的朴素方法通常受离散性和阈值依赖性困扰,使其不适合连续优化。我们通过采用持久同调作为稳定、无阈值的拓扑量化器,并设计将拓扑分数作为正则化项融入NMF目标函数来应对这些挑战。所得框架在一个统一的建模语言中涵盖了空间连贯的图像成分、周期性的时间序列结构和团状图信号。

英文摘要

We investigate the learning of interpretable bases in non-negative matrix factorisation (NMF) by regularising the topology of the learned basis functions. Our approach is motivated by the observation that many data modalities can be viewed as non-negative functions on a structured domain, where the quality of a basis is intrinsically linked to its topology. However, naive methods for incorporating the topology of the support are often hindered by discreteness and threshold dependence, rendering them unsuitable for continuous optimisation. We address these challenges by employing persistent homology as a stable, threshold-free topological quantifier and by designing topological scores that integrate into the NMF objective as regularisers. The resulting framework encompasses spatially coherent image components, periodic time-series structures, and clique-like graph signals within a unified modelling language.

2606.17426 2026-06-17 stat.ML cs.LG math.PR 新提交

Bounded Difference Concentration for Infinitely Exchangeable Sequences with Applications to AI Benchmark Uncertainty

无限可交换序列的有界差分集中不等式及其在AI基准不确定性中的应用

Fangyuan Lin, Spencer Frei, Victor H. de la Pena

发表机构 * Department of Statistics, Columbia University(哥伦比亚大学统计系) Google DeepMind(谷歌DeepMind)

AI总结 通过de Finetti测度分解有界差分函数的偏差,提出有效方差代理的集中不等式,并证明零和线性对比中潜在混合项完全抵消,应用于AI基准如MMLU的不确定性量化。

详情
AI中文摘要

我们考虑无限可交换随机变量函数的集中性质。通过对de Finetti导向测度取条件,我们证明任何具有有界差分常数$c_1, \dots, c_n$的函数的偏差分解为条件采样波动和潜在混合波动。当该潜在混合是$\sigma_{\mathrm{mix}}^2$-次高斯时,我们建立了一个有效方差代理为$\frac{1}{4}\sum_i c_i^2 + \sigma_{\mathrm{mix}}^2$的集中不等式。关键的是,我们证明对于零和线性对比,例如子样本均值与总体均值之差,潜在混合项完全抵消。这种抵消产生了一个紧的、无混合的Hoeffding型界,为近期有限可交换集中结果的无限可扩展极限提供了直接的de Finetti机制。我们将该框架应用于量化复合AI基准(如MMLU)中的不确定性,其中问题项在领域间自然表现出可交换依赖性。我们的结果既提供了一个领域分层层次模型来限制准确率分数的不确定性,也提供了一个无分布、节省成本的统计保证,用于从随机子集准确估计完整的基准分数。

英文摘要

We consider the concentration properties of functions of infinitely exchangeable random variables. By conditioning on the de Finetti directing measure, we show that the deviation of any function with bounded-difference constants $c_1, \dots, c_n$ decomposes into a conditional sampling fluctuation and a latent mixture fluctuation. When this latent mixture is $\sigma_{\mathrm{mix}}^2$-subgaussian, we establish a concentration inequality with an effective variance proxy of $\frac{1}{4}\sum_i c_i^2 + \sigma_{\mathrm{mix}}^2$. Crucially, we demonstrate that for zero-sum linear contrasts, such as the difference between a subsample mean and a full population mean, the latent mixture term cancels exactly. This cancellation yields a tight, mixture-free Hoeffding-type bound that provides a direct de Finetti mechanism for the infinite-extendibility limit of recent finite-exchangeable concentration results. We apply this framework to quantify uncertainty in composite AI benchmarks, such as MMLU, where question items naturally exhibit exchangeable dependence across domains. Our results provide both a domain-stratified hierarchical model for bounding the uncertainty of accuracy scores, and a distribution-free, cost-saving statistical guarantee for accurately estimating full benchmark scores from random subsets.

2606.17419 2026-06-17 cs.LG math.NA 新提交

Generalization Guarantees for Multi-Input Neural Operator Learning in Sobolev Spaces

多输入神经算子学习在Sobolev空间中的泛化保证

Yahong Yang, Zecheng Zhang, Wei Zhu, Wenjing Liao, Hao Liu

发表机构 * Georgia Institute of Technology(佐治亚理工学院) University of Notre Dame(圣母大学) Hong Kong Baptist University(香港浸会大学)

AI总结 针对多输入神经算子,在Sobolev范数下建立逼近和泛化误差估计,量化各输入空间对误差界的贡献,并揭示平衡状态下输入维度、正则性和Sobolev阶的相互作用。

详情
AI中文摘要

我们发展了多输入神经算子的逼近和泛化误差估计,输出误差在Sobolev范数下度量。与标准算子学习设置中只有一个输入函数不同,我们的框架允许多个输入函数定义在可能不同的域上,具有不同的维度和Sobolev正则性。导出的速率明确量化了每个输入空间对最终误差界的贡献。特别地,在平衡状态下,逼近和泛化速率由输入维度、正则性和Sobolev阶之间的相互作用控制,而对模型复杂度的依赖保持\(\log\log/\log\)型结构。我们的分析为多输入算子学习(包括Sobolev训练)提供了一个通用的理论框架,并适用于来自偏微分方程和科学计算的算子学习问题。

英文摘要

We develop approximation and generalization error estimates for multi-input neural operators, with the output error measured in Sobolev norms. In contrast to standard operator-learning settings with a single input function, our framework allows multiple input functions defined on possibly different domains, with different dimensions and Sobolev regularities. The derived rates explicitly quantify the contribution of each input space to the final error bound. In particular, in the balanced regime, the approximation and generalization rates are governed by the interaction between the input dimensions, regularities, and Sobolev orders, while the dependence on the model complexity retains a \(\log\log/\log\)-type structure. Our analysis provides a general theoretical framework for multi-input operator learning, including Sobolev training, and is applicable to operator learning problems arising from partial differential equations and scientific computing.

2606.17414 2026-06-17 cs.LG math.DS 新提交

Memory-Efficient Meta-Reinforcement Learning for Adaptive Safety-Critical Control in Adversarial Spacecraft Proximity Operations

用于对抗性航天器接近操作中自适应安全关键控制的内存高效元强化学习

Alejandro Posadas-Nava, Richard Linares, Minduli Wijayatunga

发表机构 * MIT(麻省理工学院) University of Illinois, Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校)

AI总结 本文研究利用元强化学习调整输入约束控制屏障函数的类K函数,比较三种循环网络架构和两种训练算法,发现Mamba与PPO组合在合作与非合作场景中均能提升任务完成率、安全性和燃料效率。

详情
AI中文摘要

自主航天器交会与接近操作(RPO)需要控制器在推力约束下保证安全,同时最小化燃料消耗。输入约束控制屏障函数(ICCBF)为具有执行约束的非线性系统提供了一种控制方法,构建前向不变安全集。先前工作表明,通过元强化学习(meta-RL)学习定义ICCBF递归的类$\mathcal{K}$函数,可为RPO中的安全关键控制提供鲁棒、非贪婪的方法。本文进一步扩展该框架,研究了三种循环网络架构(长短期记忆(LSTM)、门控循环单元(GRU)、选择性状态空间模型(Mamba))和两种训练算法(近端策略优化(PPO)和软演员-评论家(SAC))的性能,以确定通过元强化学习调整ICCBF类K函数的最佳设置。除了合作测试案例外,还在存在对抗行为的情况下评估性能,其中目标航天器以恶化追踪航天器安全的方式行动。结果表明,在所有测试的合作与非合作场景中,使用PPO的状态空间模型(如Mamba)相比其他架构在任务完成、安全和燃料节省方面表现更优。

英文摘要

Autonomous spacecraft rendezvous and proximity operations (RPO) require controllers that guarantee safety under thrust constraints while minimizing fuel expenditure. Input-constrained control barrier functions (ICCBFs) provide a control method for nonlinear systems with actuation constraints that construct a forward-invariant safe set. Previous work has shown that learning class-$\mathcal{K}$ functions defining the ICCBF recursion via meta reinforcement learning (meta-RL) yields a robust, non-greedy approach to safety-critical control in RPO. This paper extends that framework further by investigating the performance of three recurrent network architectures (Long Short Term Memory (LSTM), Gated Recurrent Unit (GRU), Selective State Space Model (Mamba)) and two training algorithms (Proximal Policy Optimization (PPO) and Soft Actor Critic (SAC)) to identify the best setup for tuning ICCBF class-K functions via meta-RL. In addition to cooperative test cases, performance is evaluated in the presence of adversarial behavior where the target spacecraft behaves in a way that worsens the safety of the chaser spacecraft. Results indicate that state space models such as Mamba when used with PPO achieve superior task completion, safety, and fuel-savings compared to other architectures, across all cooperative and uncooperative scenarios tested.

2606.17319 2026-06-17 stat.ML cs.LG math.CO math.ST 新提交

Tight $L_\infty$ Sample Complexity for Low-Degree and Sparse Boolean Polynomials

低次稀疏布尔多项式的紧 $L_\infty$ 样本复杂度

Jasper van Doornmalen, Mathieu Molina, Victor Verdugo, José Verschae

发表机构 * Institute for Mathematical and Computational Engineering(数学与计算工程研究所) Pontificia Universidad Católica de Chile(智利天主教大学) Blavatnik School of Computer Science and AI(Blavatnik计算机科学与人工智能学院) Tel Aviv University(特拉维夫大学) Department of Industrial and Systems Engineering(工业与系统工程系)

AI总结 针对有界二进制黑箱函数优化,研究布尔超立方体上多项式代理的学习问题,要求均匀 $L_\infty$ 误差保证,刻画了次高斯噪声下两类有界多项式的最小最大样本复杂度。

详情
AI中文摘要

受有界二进制黑箱函数优化的启发,我们研究了在布尔超立方体上学习多项式代理的问题。为了确保优化代理能为底层目标产生良好解,我们需要均匀的 $L_\infty$ 误差保证,而非通常的 $L_2$ 型保证。我们刻画了次高斯噪声下两类有界多项式的均匀估计的最小最大样本复杂度。首先,对于 $n$ 个变量上次数至多为 $d$ 的多项式,样本复杂度为 $n^{d+1}$。其次,对于 $s$-稀疏 Fourier-Walsh 多项式且 $s \leq n$,样本复杂度为 $ns^2$。这些速率在结构上不同于无噪声情形,其中均匀精确恢复的速率分别为 $n^d$ 和 $ns$。我们的下界甚至对任意自适应学习者也成立,表明额外的因子是噪声情形固有的。$L_2$ 范数的标准傅里叶分析工具不能自然地扩展到 $L_\infty$ 设置以产生均匀保证。我们的证明通过依赖适当选择的辅助范数作为控制 $L_\infty$ 误差的代理来克服这一困难。总之,我们的结果提供了学习优化安全多项式代理的样本复杂度的紧刻画。

英文摘要

Motivated by the optimization of bounded binary black-box functions, we study the problem of learning polynomial surrogates over the Boolean hypercube. To ensure that optimizing the surrogate yields good solutions for the underlying objective, we require uniform $L_\infty$-error guarantees rather than the usual $L_2$-type guarantees. We characterize the minimax sample complexity of uniform estimation under subgaussian noise for two classes of bounded polynomials. First, for polynomials of degree at most $d$ on $n$ variables, the sample complexity scales as $n^{d+1}$. Second, for $s$-sparse Fourier-Walsh polynomials with $s \leq n$, it scales as $ns^2$. These rates differ structurally from the noiseless setting, where uniform exact recovery scales as $n^d$ and $ns$, respectively. Our lower bounds hold even for arbitrary adaptive learners, showing that the additional factors are intrinsic to the noisy cases. Standard Fourier-analysis tools for the $L_2$-norm do not naturally extend to the $L_\infty$-setting in a way that yields uniform guarantees. Our proofs overcome this difficulty by relying on suitably chosen auxiliary norms that serve as proxies for controlling the $L_\infty$-error. Together, our results provide a tight characterization of the sample complexity of learning optimization-safe polynomial surrogates.

2606.17317 2026-06-17 cs.RO cs.AI math.OC 新提交

Transformer-Based Warm-Starting for Feasible and Optimal Terminal Approach to Tumbling Objects with Space Manipulators

基于Transformer的可行且最优末端接近翻滚目标的空间机械臂热启动方法

Yuji Takubo, Maximilian Adang, Mac Schwager, Simone D'Amico

发表机构 * Stanford University(斯坦福大学)

AI总结 针对空间机械臂末端接近翻滚目标的实时轨迹生成问题,提出基于因果Transformer的热启动方法,通过分解规划并热启动姿态-力矩分配阶段,在300个测试场景中减少28%迭代次数和23%运行时间,同时保持控制成本分布。

Comments 8 pages, 4 figures

详情
AI中文摘要

由于航天器总线运动、机械臂动力学、可见性锥和轨迹级安全约束之间的非线性耦合,在轨机器人服务的实时轨迹生成具有挑战性。本文研究了基于学习的热启动方法,用于空间机械臂末端接近翻滚目标的序列凸规划(SCP)。所提出的框架将问题分解为系统质心平移规划阶段和耦合姿态-机械臂力矩分配阶段,并对后者应用因果变压器热启动,后者构成了主要的计算瓶颈。比较了线性动作解码器和流匹配动作解码器在不同动作分块和训练数据集大小下的表现,并使用SCP在成本最优和可行性投影下评估了生成的热启动。在300个保留场景中,学习的热启动将第二阶段SCP迭代次数减少多达28%,运行时间减少23%,同时保持最终控制成本分布。当学习的热启动用于非凸可行性投影时,其运行时间相比成本最优SCP几乎减半,同时避免了启发式初始化时观察到的灾难性高成本尾部行为。这些结果表明,序列模型热启动可以提高基于优化的空间机械臂末端制导的计算效率和轨迹鲁棒性。

英文摘要

Real-time trajectory generation for on-orbit robotic servicing is challenging due to the nonlinear coupling between spacecraft bus motion, manipulator dynamics, visibility cone, and trajectory-level safety constraints. This paper studies learning-based warm-starting for sequential convex programming (SCP) in the terminal approach of a space manipulator toward a tumbling target. The proposed framework decomposes the problem into a system center-of-mass translational planning stage and a coupled attitude--manipulator torque-allocation stage, and applies a causal transformer warm-start to the latter, which constitutes the dominant computational bottleneck. Linear and flow matching action decoders are compared under different action-chunking and training dataset sizes, and the resulting warm-starts are evaluated under both cost-optimal and feasibility projection using SCP. Across 300 held-out scenarios, the learned warm-start reduces the second-stage SCP iteration count by up to 28% and the runtime by 23% while preserving the final control-cost distribution. When the learned warm-starts are used for nonconvex feasibility projection, they nearly halve the runtime relative to cost-optimal SCP, while avoiding the catastrophic high-cost tail behavior observed when initialized heuristically. These results indicate that sequence-model warm-starts can improve both the computational efficiency and trajectory robustness of optimization-based terminal guidance for space manipulation.

2606.17307 2026-06-17 cs.DM math.CO 新提交

A program to find families of graphs in Free$\{C_4,4K_1\}$ with bounded clique width

在Free$\{C_4,4K_1\}$中寻找有界团宽图族的程序

Cléophée Robin, Alexandre Talon

AI总结 研究不含4-圈和4-独立集作为导出子图的图类Free{C4,4K1}的子类的团宽,引入(k,l,m)-分解框架证明可分解图类有界团宽,并给出程序发现的例子,同时展示某些图具有无界团宽的超图族。

Comments The source of the programm will be uploaded later

详情
AI中文摘要

本文研究不含4-圈和大小为4的独立集作为导出子图的图类:$\mathop{Free}\{C_4, 4K_1\}$。这是当限制在由排除4阶导出子图定义的类时,染色问题复杂性的三个最小开放情形之一。我们研究了$\mathop{Free}\{C_4, 4K_1\}$的一些子类的团宽。我们引入了一个新框架:$(k,l,m)$-分解,并证明如果某个图类$\cal G$中的所有图都是$(k,l,m)$-可分解的,那么$\cal G$中的图具有有界团宽。我们给出了几个这样的类的例子,这些例子是通过我们设计的程序找到的。我们还证明,对于任何$\mathop{Free}\{C_4, 4K_1\}$中可被3个团覆盖的图$G$,存在$\mathop{Free}\{C_4, 4K_1\}$中$G$的超图的一个无限族,其团宽无界。

英文摘要

In this paper we study the class of graphs without cycles of size 4 and independent sets of size 4 as induced subgraphs: $\mathop{Free}\{C_4, 4K_1\}$. This is one of the three minimal minimal open cases for the complexity of the colouring problem when restricted to classes defined by excluding induced subgraphs of order 4. We investigate the clique width of some subclasses of $\mathop{Free}\{C_4, 4K_1\}$. We introduce a new framework: the $(k,l,m)$-decomposition and prove that if all the graphs of a class $\cal G$ are $(k,l,m)$-decomposable, then graphs in $\cal G$ have bounded clique width. We give a few examples of such class, found with the help of a program we designed. We also show, for any graph $G \in \mathop{Free}\{C_4, 4K_1\}$ that is 3 cliques coverable, an infinite family in $\mathop{Free}\{C_4, 4K_1\}$ of supergraphs of $G$ which have unbounded clique width.

2606.17293 2026-06-17 stat.ME math.ST 新提交

Dependent Censoring Based on Geometric Optimization

基于几何优化的相依删失

Anis Fradi, Salima Helali, Bilel Bousselmi

AI总结 针对生存分析中的相依删失问题,提出基于扩展广义Marshall-Olkin模型的框架,利用几何优化技术估计失效与删失时间的依赖关系,并证明渐近性质。

详情
AI中文摘要

在生存分析中,相依删失对准确估计模型参数和生存函数构成了重大挑战。本研究引入了一个利用扩展广义Marshall-Olkin(EGMO)模型的新框架,以处理相依删失机制。采用几何优化技术来开发高效的估计程序,捕捉失效时间和删失时间之间的依赖关系。我们建立了它们的渐近性质。模拟研究和实际数据应用说明了该方法的稳健性和有效性。

英文摘要

In survival analysis, dependent censoring poses significant challenges in accurately estimating model parameters and survival functions. This study introduces a novel framework leveraging Extended Generalized Marshall-Olkin (EGMO) models to address dependent censoring mechanisms. Geometric optimization techniques are employed to develop efficient estimation procedures that capture dependencies between failure and censoring times. We establish their asymptotic properties. Simulation studies and real data applications illustrate the method's robustness and effectiveness.

2606.17280 2026-06-17 eess.SY math.OC 新提交

Optimal Powered Descent Guidance with Pyramid-Shaped Approach-Angle Constraints

具有金字塔形接近角约束的最优动力下降制导

Revital Frenkel, Vitaly Shaferman

AI总结 提出一种解析最优软着陆制导律,通过不等式接近角路径约束将轨迹限制在着陆点倒金字塔内,避免地面碰撞并控制接近角,利用庞特里亚金最小原理推导开环/闭环解及最优终端时间。

Comments This work has been submitted for journal publication. 36 pages 10 figures

详情
AI中文摘要

本文解析推导了一种新颖的具有不等式接近角路径约束的最优软着陆制导律。所提出的制导律通过将最优轨迹限制在起源于着陆点的凸倒金字塔内,防止地面碰撞并实现接近角控制。采用恒定引力场中的三维点质量线性运动学模型,以及二次控制能量代价和位置速度终端约束。利用庞特里亚金最小原理以及无约束弧段与约束弧段之间过渡的最优性条件,推导了解析开环和闭环解以及最优终端时间。此外还表明,当路径约束被激活时,最优终端时间减小。所得到的制导律是连续的、时间分段线性,并且在闭环中状态非线性。当约束被激活时,控制器抵消垂直于约束的重力分量,使轨迹沿约束面演化。在不同初始条件下进行仿真评估,展示了精确的着陆性能和对路径约束的一致满足。

英文摘要

In this paper, a novel optimal soft-landing guidance law with inequality approach-angle path constraints is analytically derived. The proposed guidance law prevents ground collision and enables approach-angle control by constraining the optimal trajectory to remain within a convex inverted pyramid originating at the landing point. A 3D point-mass linear kinematic model in a constant gravitational field is employed, together with a quadratic control-effort cost and terminal constraints on position and velocity. Analytical open-loop and closed-loop solutions, together with the optimal final time, are derived using Pontryagin's Minimum Principle and the optimality conditions at the transitions between unconstrained and constrained arcs. It is additionally shown that the optimal final time decreases when the path constraints become active. The resulting guidance law is continuous, piecewise linear in time, and nonlinear in the states in closed-loop. When a constraint becomes active, the controller cancels the gravitational component normal to the constraint, causing the trajectory to evolve along the constraint surface. The proposed guidance law is evaluated in simulations under various initial conditions, demonstrating accurate landing performance and consistent satisfaction of the path constraints.

2606.17267 2026-06-17 stat.ME econ.EM math.NA stat.AP stat.ML 新提交

Bayesian Poisson-Randomized Gamma Tensor Factorization with Application to International Trade Flows

贝叶斯泊松-随机化伽马张量分解及其在国际贸易流中的应用

Jie Jian, Aaron Schein

AI总结 提出贝叶斯分层张量分解模型,结合低秩CP结构和条件伽马模型,处理稀疏半连续张量数据,并通过混合变分-蒙特卡洛算法实现大规模后验推断,应用于国际贸易流分析。

详情
AI中文摘要

我们研究具有过多零值、重右尾和切片特定离散度的稀疏半连续张量数据。这些特征自然出现在货币价值的多维数据中,例如国际贸易,其中大多数出口商-进口商-产品-年份单元格为零,而正值是连续且高度可变的。为了对这些数据进行建模,我们提出了一种贝叶斯分层张量分解模型,该模型在潜在泊松率张量上放置低秩CP结构,并将其与条件伽马模型耦合以处理正结果,其中率参数可以在一个模式内的不同切片之间变化。因此,该模型分离了正观测的发生和幅度,同时通过共享的低秩潜在结构在所有张量维度上借用强度。为了将后验推断扩展到大型数组,我们开发了一种混合变分-蒙特卡洛算法,该算法将高效的坐标上升更新与部分折叠的增广数据采样器相结合。应用于约6000万条贸易流,该方法揭示了出口商、进口商、产品和年份之间的多维依赖关系,这是从重力型或成对网络分析中难以恢复的,因为这些分析没有联合建模产品和时间维度。

英文摘要

We study sparse semi-continuous tensor data with excess zeros, heavy right tails, and slice-specific dispersion. Such features arise naturally in monetary-valued multi-way data, such as international trade, where most exporter--importer--product--year cells are zero while positive values are continuous and highly variable. To model these data, we propose a Bayesian hierarchical tensor factorization model that places a low-rank CP structure on a latent Poisson rate tensor and couples it with a conditional Gamma model for positive outcomes, with rate parameters that can vary across slices within a mode. The model therefore separates the occurrence and magnitude of positive observations while borrowing strength across all tensor dimensions through a shared low-rank latent structure. To scale posterior inference to large arrays, we develop a hybrid variational--Monte Carlo algorithm that combines efficient coordinate ascent updates with a partially collapsed augmented-data sampler. Applied to approximately 60 million trade flows, the method surfaces multiway dependence across exporters, importers, products, and years that is difficult to recover from gravity-type or pairwise network analyses, which do not jointly model the product and temporal dimensions.

2606.17219 2026-06-17 eess.SP math.OC 新提交

A Unified Analytical Nullspace-Based Least-Squares Design of the Farrow Structure

基于零空间的最小二乘Farrow结构统一解析设计

Deijany Rodriguez Linares, Håkan Johansson

AI总结 提出一种基于零空间参数化的统一最小二乘设计方法,解析处理Farrow结构中混合类型子滤波器(线性相位与通用FIR),满足群延迟约束并减少自由参数。

Comments 5 pages, 2 figures, submitted to IEEE Signal Process. Lett

详情
AI中文摘要

基于线性相位FIR子滤波器的Farrow结构能够高效实现可变分数延迟(VFD)滤波器,降低实现复杂度。虽然全线性相位配置允许解耦的最小二乘(LS)公式并具有解析解,但当需要混合类型分支(线性相位和通用FIR)时(例如施加群延迟约束时),这种解耦会失效。本文通过分支对称约束的零空间参数化,提出一种统一的Farrow结构LS设计,得到适用于任意分支类型的解析解。数值结果表明,所提框架能够满足全线性相位方法无法满足的群延迟约束,同时相对于无约束的通用FIR基线大幅减少自由参数数量。

英文摘要

Farrow structures based on linear--phase FIR subfilters provide an efficient realization of variable fractional--delay (VFD) filters with reduced implementation complexity. While the all--linear--phase configuration admits a decoupled least--squares (LS) formulation with an analytical solution, this decoupling fails when branches of mixed types, linear--phase and general FIR, are required, as occurs when a group--delay constraint is imposed. This letter presents a unified LS design for Farrow structures via a nullspace parameterization of the per--branch symmetry constraints, yielding an analytical solution that accommodates arbitrary per--branch types. Numerical results demonstrate that the proposed framework satisfies group--delay constraints that the all linear--phase approach cannot meet, while substantially reducing the number of free parameters relative to the unconstrained general FIR baseline.

2606.17185 2026-06-17 cs.LG eess.SP math.DG stat.ML 新提交

Finsler Geometry, Graph Neural Networks, and You

芬斯勒几何、图神经网络与你

T. Mitchell Roddenberry, Richard G. Baraniuk

发表机构 * Rice University(莱斯大学)

AI总结 针对图拉普拉斯只能近似各向同性算子的局限,提出基于芬斯勒拉普拉斯的图神经网络层,证明其收敛性并恢复非线性扩散方程的几何结构。

详情
AI中文摘要

基于图拉普拉斯的图神经网络架构近似拉普拉斯-贝尔特拉米算子,因此限制了它们在各向同性算子上的应用。作为拉普拉斯-贝尔特拉米算子的非线性替代,我们考虑从流形上采样的点云上芬斯勒拉普拉斯的估计。我们证明,随着点样本数量的增加,这些离散估计收敛到流形上的真实算子。此外,我们表明该算子可以表示为图神经网络层,我们用它来定义一组受约束以表达芬斯勒几何的芬斯勒图神经网络。我们表明,芬斯勒图神经网络在实践中恢复了非线性扩散方程背后的几何结构。

英文摘要

Graph neural network architectures based on the graph Laplacian approximate the Laplace-Beltrami operator, thus limiting their application to isotropic operators. As a nonlinear alternative to the Laplace-Beltrami operator, we consider estimates of the Finsler Laplacian on point clouds sampled from a manifold. We prove that these discrete estimates converge to the true operator on the manifold as the number of point samples grows. Moreover, we show that this operator can be expressed as a graph neural network layer, which we use to define a family of Finslerian graph neural networks constrained to express Finsler geometry. We show that Finslerian graph neural networks recover the geometry underlying nonlinear diffusion equations in practice.

2606.17165 2026-06-17 stat.ME cs.AI econ.EM math.ST 新提交

Statistical Foundations of LLM-based A/B Testing: A Surrogacy Framework for Human Causal Inference

基于LLM的A/B测试的统计基础:用于人类因果推断的替代指标框架

Joel Persson, Mårten Schultzberg, Sebastian Ankargren

发表机构 * Spotify USA, Inc.(Spotify美国公司)

AI总结 提出替代指标理论框架,证明在弱于分布等价条件下,校准LLM输出可识别平均处理效应,并分析随机性带来的偏差与方差。

详情
AI中文摘要

组织和研究者越来越有兴趣在A/B测试中使用大型语言模型(LLM)代替人类参与者,以期更快、更低成本地进行实验。我们研究当在LLM结果上估计的处理效应何时能够恢复在感兴趣的人类群体上测量的效应。LLM与人类结果之间的分布等价性会使任何标准估计量有效,但这不现实。因此,我们开发了一个统计框架,将替代终点理论适配到LLM。该框架表明,将LLM结果校准到人类结果,在替代性和可比性条件(联合弱于分布等价性)下,可以识别平均处理效应。当这些条件不成立时,感兴趣的效应仅部分可识别,我们提供了诊断方法,可以在历史实验上证伪替代性,并给出有限重叠下最坏情况偏差的界限。我们进一步证明,LLM固有的随机性会引入偏差和方差,但使用多次抽取的平均值作为替代指标可以同时缓解两者。我们在模拟和Upworthy标题的A/B测试应用中展示了方法和理论。我们工作的一个核心结论是,LLM结果作为替代指标的有效性只能对过去的处理被证伪,而无法对新处理被验证,因此对于新颖干预,人类实验仍然不可或缺。我们讨论了LLM选择、提示和温度作为设计变量的作用,以及如何确定人类实验的规模以进行验证。

英文摘要

Organizations and researchers show increasing interest in using large language models (LLMs) in place of human participants in A/B tests, in the hope of experimenting faster and at lower cost. We study when a treatment effect estimated on LLM outcomes recovers the effect that would have been measured on the human population of interest. Distributional equivalence between LLM and human outcomes would make any standard estimator valid but is unrealistic. We therefore develop a statistical framework that adapts surrogate endpoint theory to LLMs. The framework shows that calibrating LLM outcomes to human outcomes identifies the average treatment effect under surrogacy and comparability conditions that are jointly weaker than distributional equivalence. When these conditions fail, the effect of interest is only partially identified, and we provide diagnostics that can falsify surrogacy on historical experiments together with a bound on the worst-case bias from limited overlap. We further show that the stochasticity inherent to LLMs introduces both bias and variance, but using an average of multiple draws as the surrogate mitigates both. We illustrate the methods and theory in simulations and an application to A/B tests on Upworthy headlines. A central takeaway from our work is that the validity of LLM outcomes as surrogates can only be falsified for past treatments and never verified for new ones, so human experiments remain indispensable for novel interventions. We discuss the role of LLM choice, prompting, and temperature as design variables, and how to size human experiments for validation.

2606.17125 2026-06-17 q-bio.PE math.DS math.OC 新提交

Tipping the Balance: Allee Thresholds, Saddle-Node Bifurcations, and Optimal Sterile-Male Release Strategies for Anopheles Mosquitoes

打破平衡:按蚊的Allee阈值、鞍结分岔与最优不育雄蚊释放策略

Abba Gumel, C. Alex Safsten

AI总结 针对按蚊的性别和阶段结构模型,研究不育昆虫技术(SIT)下的Allee效应,证明通过释放不育雄蚊可将种群推过Allee阈值实现消除,并优化释放策略。

Comments 47 pages

详情
AI中文摘要

我们建立并分析了一个按蚊动态的性别和阶段结构模型,该模型考虑了不育昆虫技术(SIT),其动机是需要对杀虫剂抗性和户外传播具有鲁棒性的工具。模型追踪水生阶段、成年雄性、未交配雌性以及与野生或不育雄性交配的雌性;包括产卵能力和幼虫竞争;并使用一个不应期后跟密度依赖的配偶搜索。由此产生的Holling II型交配项产生了配偶寻找的Allee效应。在建立适定性后,我们证明该Allee效应使得无蚊平衡对所有允许参数局部稳定,并且当快速配偶搜索再生数$R_0^q$小于1时全局渐近稳定。当$R_0^q>1$、栖息地容量大且幼虫竞争弱时,通过鞍结分岔出现两个正平衡:一个稳定的自然平衡和一个不稳定的Allee平衡,将持续存在与灭绝分开。对于一个简化模型,Goh-Volterra Lyapunov泛函估计了持续存在的吸引域。然后我们展示了恒定和种群响应的不育雄蚊释放如何重塑这种双稳态。足够大的释放通过第二个鞍结分岔消灭了正平衡,而足够大的恒定释放从每个允许的初始状态驱动局部消除。因此,SIT只需将种群推过Allee分界线,之后配偶寻找失败即可完成灭绝。在一个具有Allee阈值停止规则的自由时域优化框架中,混合释放策略相对于最佳恒定策略将不育雄蚊需求减少约5%,相对于最佳种群响应策略减少约39%。这些结果将Allee效应重新定义为一种媒介抑制的控制杠杆。

英文摘要

We formulate and analyze a sex- and stage-structured model for Anopheles dynamics under the sterile insect technique (SIT), motivated by the need for tools robust to insecticide resistance and outdoor transmission. The model tracks aquatic stages, adult males, unmated females, and females mated with wild or sterile males; includes egg-laying capacity and larval competition; and uses a refractory period followed by density-dependent mate search. The resulting Holling type-II mating term generates a mate-finding Allee effect. After establishing well-posedness, we prove that this Allee effect makes the mosquito-free equilibrium locally stable for all admissible parameters and globally asymptotically stable when a quick-mate-search reproduction number $R_0^q$ is below one. When $R_0^q>1$, habitat capacity is large, and larval competition is weak, two positive equilibria arise through a saddle-node bifurcation: a stable natural equilibrium and an unstable Allee equilibrium separating persistence from extinction. For a reduced model, a Goh-Volterra Lyapunov functional estimates the persistence basin. We then show how constant and population-responsive sterile-male releases reshape this bistability. Sufficiently large releases annihilate the positive equilibria in a second saddle-node bifurcation, while a sufficiently large constant release drives local elimination from every admissible initial state. Thus SIT need only push the population across the Allee separatrix, after which mate-finding failure can complete extinction. In a free-horizon optimization framework with an Allee-threshold stopping rule, a hybrid release strategy reduces the sterile-male requirement by about $5\%$ relative to the best constant-only strategy and $39\%$ relative to the best population-responsive-only strategy. These results recast the Allee effect as a control lever for vector suppression.

2606.18245 2026-06-17 math.AC 新提交

Derived functors and Hilbert polynomials over Gorenstein rings

导出函子与Gorenstein环上的Hilbert多项式

Satyabrata Paul, Tony J. Puthenpurakal

AI总结 研究Gorenstein环上非自由极大Cohen-Macaulay模的Tor和Ext函子的多项式增长,证明次数上界并给出等式条件。

详情
AI中文摘要

设$(A,\mathfrak{m},k)$是维数$d\ge 1$的Gorenstein环,$N$是维数$t\ge 1$的完美模,$I$是$N$的定义理想。对于非自由极大Cohen-Macaulay(=MCM)$A$-模$M$和整数$i\ge 1$,众所周知函数$n \mapsto \ell(Tor_i^A(M,N/I^{n+1}N))$和$n \mapsto \ell(Ext^i_A(M,N/I^{n+1}N))$分别是次数为$r_i^{I,N}(M)$和$s_{I,N}^i(M)$的多项式类型。我们证明$r_i^{I,N}(M)\le t-1$和$s^i_{I,N}(M)\le t-1$,并且当$I$是极大理想$\mathfrak{m}$时,两个不等式都成为等式。我们还证明$r_i^{I,N}(M)\le r_1^{I,N}(\Omega^dk)$,$s^i_{I,N}(M)\le s^1_{I,N}(\Omega^dk)$以及$r_i^{I,N}(\Omega^dk)=r_1^{I,N}(\Omega^dk)=s^1_{I,N}(\Omega^dk)=s^i_{I,N}(\Omega^dk)$。

英文摘要

Let $(A,\mathfrak{m},k)$ be a Gorenstein ring of dimension $d\ge 1$, $N$ a perfect module of dimension $t\ge 1$ and $I$ an ideal of definition of $N$. For a non-free maximal Cohen-Macaulay (=MCM) $A$-module $M$ and an integer $i\ge 1$, it is well known that the functions $n \mapsto \ell(Tor_i^A(M,N/I^{n+1}N))$ and $n \mapsto \ell(Ext^i_A(M,N/I^{n+1}N))$ are of polynomial types of degrees $r_i^{I,N}(M)$ and $s_{I,N}^i(M)$, respectively. We prove that $r_i^{I,N}(M)\le t-1$ and $s^i_{I,N}(M)\le t-1$ and when $I$ is the maximal ideal $\mathfrak{m}$, both the inequalities become equalities. We also show that $r_i^{I,N}(M)\le r_1^{I,N}(\Omega^dk)$, $s^i_{I,N}(M)\le s^1_{I,N}(\Omega^dk)$ and $r_i^{I,N}(\Omega^dk)=r_1^{I,N}(\Omega^dk)=s^1_{I,N}(\Omega^dk)=s^i_{I,N}(\Omega^dk)$. \end

2606.18238 2026-06-17 math.AG 新提交

Exceptional collections for canonical stacks of log del Pezzo surfaces with $\frac13(1,1)$ singularities

具有 $\frac13(1,1)$ 奇点的对数 del Pezzo 曲面的典范栈的例外集合

Alex Junior Gomez Saltachin

AI总结 研究具有 $\frac13(1,1)$ 型奇点的对数 del Pezzo 曲面的导出范畴,证明其典范 Deligne-Mumford 栈的导出范畴具有完全例外集合,并应用于一般次数 10 的超曲面。

Comments 14 pages

详情
AI中文摘要

我们研究与具有 $\frac{1}{3}(1,1)$ 型奇点的对数 del Pezzo 曲面相关的导出范畴。对于这样的曲面 $X$,我们考虑其光滑典范 Deligne--Mumford 栈 $\pi:\mathcal X\to X$,并将其与奇异粗曲面 $X$ 进行比较。我们的主要结果证明,如果 $X$ 是一个所有奇点均为 $\frac{1}{3}(1,1)$ 型的复对数 del Pezzo 曲面,那么 $D^b(\operatorname{coh}\mathcal X)$ 具有一个完全例外集合。证明结合了对数 del Pezzo 曲面的理性、Orlov 的 blow-up 公式以及 Ishii--Ueda 的特殊 McKay 对应。然后我们专门研究一个一般次数 $10$ 的超曲面 $X_{10}\subset \mathbb P(1,2,3,5)$。Corti--Heuberger 级联将其极小消解识别为 $\widetilde{X}_{10}\cong \operatorname{Bl}8\mathbb F_3$,因此典范栈 $\mathcal X_{10}$ 具有长度为 $13$ 的完全例外集合。我们还通过 Karmazyn--Kuznetsov--Shinder 的方法讨论了奇异粗范畴。

英文摘要

We study derived categories associated with log del Pezzo surfaces whose singularities are of type $\frac{1}{3}(1,1)$. For such a surface $X$, we consider the canonical smooth Deligne--Mumford stack $\pi:\mathcal X\to X$ and compare it with the singular coarse surface $X$. Our main result proves that, if $X$ is a complex log del Pezzo surface whose singularities are all of type $\frac{1}{3}(1,1)$, then $D^b(\operatorname{coh}\mathcal X)$ admits a full exceptional collection. The proof combines rationality of log del Pezzo surfaces, Orlov's blow-up formula, and the special McKay correspondence of Ishii--Ueda. We then specialize to a general degree $10$ hypersurface $X_{10}\subset \mathbb P(1,2,3,5)$. The Corti--Heuberger cascade identifies its minimal resolution as $\widetilde{X}_{10}\cong \operatorname{Bl}8\mathbb F_3$, and therefore the canonical stack $\mathcal X_{10}$ has a full exceptional collection of length $13$. We also discuss the singular coarse category through the approach of Karmazyn--Kuznetsov--Shinder.

2606.18234 2026-06-17 math.NT math.CO 新提交

On zero-sum problems of two new types

关于两种新类型的零和问题

Zhi-Wei Sun

AI总结 研究模n整数环上两种新零和问题,给出s1(n)和t1(n)的上下界,并猜想其精确值为2n+1和2n-(-1)^n。

Comments 9 pages

详情
AI中文摘要

本文主要研究模$\mathbb Z/n\mathbb Z$(其中$n>1$)上两种新类型的零和问题。设$s_1(n)$(相应地$t_1(n)$)是最小正整数$k$,使得对于任意不被$n$整除(相应地,与$n$互素)的整数$a_1,\ldots,a_k$,存在子集$I\subseteq\{1,\ldots,k\}$满足$|I|=n$且和$\sum_{i\in I}a_i$被$n$整除但不被$n^2$整除。对于$n\geqslant 4$,我们证明$2n+1\leqslant s_1(n)\leqslant n^2-2n+2$和$2n-(-1)^n\leqslant t_1(n)\leqslant (n-1)\varphi(n)+1$。我们猜想对任意整数$n>2$,有$s_1(n)=2n+1$和$t_1(n)=2n-(-1)^n$。

英文摘要

In this paper, we mainly investigate zero-sum problems over $\mathbb Z/n\mathbb Z$ (with $n>1$) of two new types. Let $s_1(n)$ (resp. $t_1(n)$) be the least positive integer $k$ such that for any integers $a_1,\ldots,a_k$ not divisible by $n$ (resp., relatively prime to $n$), there is an $I\subseteq\{1,\ldots,k\}$ with $|I|=n$ for which the sum $\sum_{i\in I}a_i$ is divisible by $n$ but not divisible by $n^2$. For $n\geqslant 4$, we prove that $2n+1\leqslant s_1(n)\leqslant n^2-2n+2$ and $2n-(-1)^n\leqslant t_1(n)\leqslant (n-1)\varphi(n)+1$. We conjecture that $s_1(n)=2n+1$ and $t_1(n)=2n-(-1)^n$ for any integer $n>2$.

2606.18221 2026-06-17 math.NA 新提交

LGNO: A Local-Global Neural Operator for Hyperbolic Conservation Laws

LGNO:用于双曲守恒律的局部-全局神经算子

Hao Wang, Chi-Wang Shu, Qi Tang

AI总结 提出局部-全局神经算子(LGNO),结合全局FNO和局部多分辨率分支,通过一步损失和谱惩罚训练,在粗网格上实现比高精度WENO-Z格式更低的数值耗散。

详情
AI中文摘要

双曲守恒律的解既表现出大尺度上的光滑结构,又表现出尖锐的局部特征(如激波和接触间断),这使得现有神经算子难以精确逼近。傅里叶神经算子(FNO)能很好地捕捉长程相互作用,但往往因过度数值耗散而模糊局部结构。为此,我们提出一种局部-全局神经算子(LGNO),它通过结合用于表示大尺度光滑动力学的全局FNO分支和用于增强局部间断及非光滑特征的局部多分辨率分支,学习一步离散流映射。该模型使用一步损失进行训练,该损失结合了物理空间预测项和对高频的谱惩罚,以抑制陡峭前沿附近的伪振荡。在一维和二维的大量基准测试中,LGNO在参数数量匹配的情况下始终优于FNO基线,将一步误差降低了2-5倍,并在长时间自回归展开中保持显著更高的精度。最引人注目的是,尽管它仅使用高阶WENO-Z格式的短时数据进行训练,但在粗$256^2$网格上的长时间展开中,LGNO表现出的数值耗散低于在更细$512^2$网格上运行的相同WENO-Z格式,而计算成本却低几个数量级。这些结果表明,通过适当的架构和训练目标,学习算子可以有效地学习离散流映射。它们进一步表明,这类学习算子有潜力比生成训练数据的传统激波捕捉格式更好地控制长时间数值耗散。

英文摘要

Solutions of hyperbolic conservation laws exhibit both smooth structures across large scales and sharp localized features such as shocks and contact discontinuities, making them difficult to approximate accurately with existing neural operators. The Fourier Neural Operator (FNO) captures long-range interactions well but tends to smear localized structures through excessive numerical dissipation. To address this, we propose a Local-Global Neural Operator (LGNO) that learns a one-step discrete flow map by combining a global FNO branch for representing smooth dynamics at large scales with a local multiresolution branch for enhancing localized discontinuities and nonsmooth features. The model is trained with a one-step loss that combines a physical space prediction term and a spectral penalty on high frequencies to suppress spurious oscillations near steep fronts. On a large collection of benchmarks in one and two dimensions, LGNO consistently outperforms FNO baselines with matched parameter counts, reducing one-step errors by factors of 2-5 and remaining significantly more accurate over long autoregressive rollouts. Most strikingly, although it is trained only on short-time data from a high-order WENO-Z scheme, the long-time rollout of LGNO on a coarse $256^2$ grid exhibits lower numerical dissipation than the same WENO-Z scheme run on a finer $512^2$ grid, while being orders of magnitude cheaper to evaluate. These results suggest that, with an appropriate architecture and training objective, learned operators can effectively learn discrete flow maps. They further suggest that such learned operators have the potential to control long-time numerical dissipation better than the conventional shock-capturing schemes that generate the training data.

2606.18218 2026-06-17 math.PR cs.LG eess.SY math.OC stat.ML 新提交

Finite-Time Queue Peak Laws in Stochastic Networks: Logarithmic Scaling After Geometric Thresholds

随机网络中的有限时间队列峰值律:几何阈值后的对数缩放

Hao Liang, Cheng Tang, Yunzong Xu

AI总结 研究广义交换机中有限时间队列峰值,证明在均匀内部松弛条件下,漂移最小化调度策略的峰值包络从平方根律转变为对数律,并给出匹配下界和几何阈值。

详情
AI中文摘要

我们研究广义交换机中的有限时间队列峰值,广义交换机是一种标准随机网络模型,其中许多队列共享受限的服务资源。到达过程可以是依赖的、时变的,并且适应于过去;稳态负载条件是均匀内部松弛,即条件均值到达向量始终位于容量区域的一个固定收缩内。我们表明,这种松弛重塑了漂移最小化调度策略(如MaxWeight)的有限时间峰值律。没有松弛时尖锐的平方根包络仅持续到几何依赖的阈值;超过该阈值,运行最大值随水平期仅对数增长,无论是高概率还是期望意义下。其机制是自归一化:在当前队列方向上,投影波动尺度被稳定化漂移尺度归一化。这从对数系数中消除了容量几何,而几何仍保留在阈值中。匹配的下界表明,对数项和几何阈值都是不可避免的。当有限时间状态空间塌缩可用时,可以使用局部瓶颈几何来锐化阈值。对于广义输入排队交换机,我们获得了具有紧对数系数的有限时间峰值界。仿真说明了理论预测的两阶段包络、局部几何改进和方差敏感改进。

英文摘要

We study finite-horizon queue peaks in generalized switches, a standard stochastic-network model in which many queues share constrained service resources. Arrivals may be dependent, time-varying, and adapted to the past; the standing load condition is uniform interior slack, meaning the conditional mean arrival vector stays in a fixed contraction of the capacity region. We show that this slack reshapes the finite-time peak law for drift-minimizing scheduling policies such as MaxWeight. The square-root envelope that is sharp without slack persists only up to a geometry-dependent threshold; beyond that threshold, the running maximum grows only logarithmically with the horizon, both with high probability and in expectation. The mechanism is self-normalization: in the current queue direction, the projected fluctuation scale is normalized by the stabilizing drift scale. This removes capacity geometry from the logarithmic coefficient, while geometry remains in the threshold. Matching lower bounds show that both the logarithmic term and a geometric threshold are unavoidable. When finite-time state-space collapse is available, the threshold can be sharpened using local bottleneck geometry. For generalized input-queued switches, we obtain finite-time peak bounds with tight logarithmic coefficients. Simulations illustrate the two-phase envelope, local geometric refinements, and variance-sensitive improvements predicted by the theory.

2606.18217 2026-06-17 math.CT 新提交

Non-distributive lattices of thick tensor-ideals via trivial extensions

通过平凡扩张构造厚张量理想的非分配格

Charalampos Verasdanis

AI总结 通过平凡扩张构造非刚性张量三角范畴,其厚张量理想构成非分配格。

Comments 11 pages

详情
AI中文摘要

我们构造了非刚性张量三角范畴,其厚张量理想构成非分配格。

英文摘要

We construct non-rigid tensor-triangulated categories with non-distributive lattice of thick tensor-ideals.

2606.18214 2026-06-17 math.AP math.PR 新提交

Time and Killed Resolvents in Reflected Optimal Stopping with a Max Payoff

带最大收益的反射最优停时中的时间与杀死预解式

Louis Shuo Wang, Ye Liang

AI总结 研究正象限内带最大收益的反射扩散最优停时问题,证明非光滑收益在扭结集上产生奇异停时增益测度,并给出正确值表示需使用首次进入停时集时杀死的预解式。

详情
AI中文摘要

我们研究正象限内带最大收益 \(G(x_1,x_2)=x_1\vee\alpha x_2\) 的常返反射二维扩散的无限时域最优停时。非光滑收益在扭结集 \(\Delta=\{x_1=\alpha x_2\}\) 上产生奇异停时增益测度。我们证明 \(\displaystyle \Gamma^\Delta(dx) = -\frac{n^\top a(x)n}{2\sqrt{1+\alpha^2}}\,\sigma_\Delta(dx)\),其中 \(n=(1,-\alpha)\),因此在局部椭圆性条件下对角分量非正且严格负。这意味着每个内部扭结点位于连续区域。我们进一步证明正确的值表示使用首次进入停时集时杀死的预解式:\(\displaystyle V=G-R_r^{\mathcal C}\Gamma\),并给出一个闭式反射布朗运动反例说明无限制的反射预解式通常是错误的。反射布朗运动基准和数值实验说明了局部时、预解式间隙和对角回避机制。

英文摘要

We study infinite-horizon optimal stopping for normally reflected two-dimensional diffusions in the positive quadrant with max payoff \(G(x_1,x_2)=x_1\vee\alpha x_2\). The non-smooth payoff produces a singular stopping-gain measure on the kink set \(\Delta=\{x_1=\alpha x_2\}\). We prove $\displaystyle \Gamma^\Delta(dx) = -\frac{n^\top a(x)n}{2\sqrt{1+\alpha^2}}\,\sigma_\Delta(dx)$, with $n=(1,-\alpha)$, so the diagonal component is non-positive and strictly negative under local ellipticity. This implies that every interior kink point lies in the continuation region. We further show that the correct value representation uses the resolvent killed at first entry into the stopping set, $\displaystyle V=G-R_r^{\mathcal C}\Gamma$, and give a closed-form reflected Brownian counter-example showing that the unrestricted reflected resolvent is generally wrong. A reflected Brownian benchmark and numerical experiments illustrate the local-time, resolvent-gap, and diagonal-avoidance mechanisms.

2606.18212 2026-06-17 math.AP 新提交

Inverse problems for a nonlinear dynamical Schrödinger operator with magnetic potential

带磁势的非线性动力学薛定谔算子的反问题

Mandeep Kumar, Boya Liu, Manmohan Vashisth

AI总结 研究带磁势和电势的非线性动力学薛定谔算子的两个反问题,在解析性假设下证明Dirichlet-to-Neumann映射唯一确定时变磁势和电势,并建立全数据和部分数据的唯一性。

详情
AI中文摘要

我们研究带磁势和电势的非线性动力学薛定谔算子的两个反问题。在适当的解析性假设下,我们证明Dirichlet-to-Neumann映射唯一确定时变磁势和电势。我们从全数据和部分数据两方面建立了这些势的唯一性。特别地,对于部分数据问题,通过假设势在边界附近已知,且Neumann数据在边界的任意小开子集上测量,建立了所需的唯一性。此外,我们建立了正问题的适定性,得到了解的最优Sobolev正则性。

英文摘要

We study two inverse problems for a nonlinear dynamical Schrödinger operator with magnetic and electric potentials. Under suitable analyticity assumptions, we show that the Dirichlet-to-Neumann map uniquely determines time-dependent magnetic and electric potentials. We establish the uniqueness of these potentials from both full data and partial data. In particular, for the partial data problem, the desired uniqueness is established by assuming that the potentials are known near the boundary, and the Neumann data is measured on arbitrarily small open subsets of the boundary. In addition, we establish the well-posedness of the forward problem, where we obtain the optimal Sobolev regularity for solutions.