arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.20406 2026-06-19 stat.ME stat.CO 新提交

Flexible modeling of bimodal distributions via skewed-$t$ mixtures

双峰分布的灵活建模：基于偏斜-t分布的混合模型

Marco Bee, Flavio Santi

AI总结提出基于Fernández和Steel (1998)偏斜-t分布的混合模型，通过EM算法进行极大似然估计，并开发似然比检验，用于拟合双峰、偏斜和厚尾数据，在标准普尔500指数中验证了双峰性。

详情

AI中文摘要

基于LLM启发式的零样本主动特征获取

Binyamin Perets, Natalie Mendelson, Shiran Vainberg, Yehuda Chowers, Shai Shen-Orr, Shie Mannor

发表机构 * Faculty of EE, Technion（技术学院电子工程系）； Faculty of Medicine, Technion（技术学院医学院）； CytoReason ； NVIDIA

AI总结提出通过LLM启发式获取马尔可夫随机场充分统计量的零样本主动特征获取框架，解决数据标注不足问题，在IBD患者诊断中优于现有方法。

详情

AI中文摘要

主动特征获取（AFA）顺序选择要观察的特征以达成分类或排序决策。其主要局限性在于依赖大量标注数据来拟合指导获取的概率模型。大型语言模型（LLM）提供无监督的领域知识，但作为序列规划者表现不佳。要求其同时知晓和决策会混淆最好分开的能力。这里，我们通过严格的启发式方法开发了一个零样本AFA框架：仅要求LLM返回其可被信任返回的内容，即马尔可夫随机场（MRF）的充分统计量——一元偏差和成对协变。我们将该框架应用于两个场景：二分类和top-$k$识别。实践中，LLM可靠地仅返回判别性统计量，即区分类别而非孤立每个类别的统计量，这阻碍了经典AFA。我们应用最大熵闭包来解决这种规范模糊性。我们在炎症性肠病（IBD）患者队列上进行评估，这是一个活跃的临床环境，其中诊断模糊性和患者异质性阻碍了稳定的治疗策略。我们的框架在真实标签和其自身提取的信念上均优于LLM。在最关键的地方，即最困难的患者上，我们的top-$k$获取策略显著优于所有现有方法。

英文摘要

Active feature acquisition (AFA) sequentially selects which features to observe to reach a classification or ranking decision. Its central limitation is reliance on large amount of labeled data to fit probabilistic models guiding acquisition. Large language models (LLMs) supply unsupervised domain knowledge, but are poor sequential planners. Asking one to both know and decide conflates capabilities best kept separate. Here, we develop a framework for zero-shot AFA through disciplined elicitation: asking the LLM only for what it can be trusted to return, the unary deviations and pairwise co-variations that are the sufficient statistics of a Markov random field (MRF). We apply our framework to two settings: binary classification and top-$k$ identification. In practice, the LLM reliably returns only discriminative statistics, what distinguishes the classes rather than each class in isolation, which precludes classical AFA. We apply a maximum-entropy closure that resolves this gauge ambiguity. We evaluate on a cohort of Inflammatory Bowel Disease (IBD) patients, an active clinical setting where diagnostic ambiguity and patient heterogeneity obstruct stable treatment strategies. Our framework outperforms the LLM both on real labels and on its own extracted beliefs. Where it matters most, on the hardest patients, our top-$k$ acquisition policy markedly outperforms all existing methods.

URL PDF HTML ☆

赞 0 踩 0

2412.17470 2026-06-19 math.ST econ.EM stat.ME stat.TH 版本更新

A Necessary and Sufficient Condition for Size Controllability of Heteroskedasticity Robust Test Statistics

异方差稳健检验统计量尺寸可控性的一个充要条件

Benedikt M. Pötscher, David Preinerstorfer

AI总结针对回归模型中单个约束检验，给出了异方差稳健检验统计量尺寸可控性的充要条件，改进了现有仅充分条件的结果。

Comments Clarification in Footnote 15 added

2512.19187 2026-06-19 stat.ME 版本更新

Interpolated Quantile Estimation: A Unified Framework Bridging Quantiles and the Mean

插值分位数估计：桥接分位数与均值的统一框架

Saïd Maanan, Azzouz Dermoune, Ahmed El Ghini

AI总结提出三类在经典分位数与样本均值之间连续插值的估计量，基于平滑L1损失构建统一M估计框架，证明一致性和渐近正态性，并揭示轻尾和重尾分布下的不同效率特性。

详情

AI中文摘要

本文开发并分析了三类估计量，它们在经典分位数与样本均值之间连续插值。构造从$L_1$损失的平滑版本开始，由位置参数$z$和平滑参数$h \ge 0$索引，其最小化器$\hat q(z,h)$产生一个统一的$M$估计框架。根据$(z, h)$的指定方式，该框架生成三类不同的估计量：固定参数平滑分位数估计量、固定分位数的插入估计量，以及一个新的均值估计程序连续统。对于所有三个族，我们通过一致渐近等连续性论证建立了一致性和渐近正态性。极限方差具有封闭形式，允许跨族和平滑水平的效率透明比较。参数空间的几何分解表明，对于固定分位数水平$\tau$，可接受的$(z, h)$对位于直线上，沿该线估计量针对相同的总体分位数，而其渐近方差发生变化。理论分析揭示了两种效率机制。在轻尾分布（例如高斯分布）下，平滑产生单调方差减少。在重尾分布（例如拉普拉斯分布）下，有限平滑参数$h^{*}(\tau) > 0$严格提高了分位数估计的效率。基于模拟数据和真实金融收益的数值实验验证了这些结论，并表明，在渐近和有限样本中，均值估计族并未改进样本均值。

英文摘要

This paper develops and analyzes three families of estimators that continuously interpolate between classical quantiles and the sample mean. The construction begins with a smoothed version of the $L_1$ loss, indexed by a location parameter $z$ and a smoothing parameter $h \ge 0$, whose minimizer $\hat q(z,h)$ yields a unified $M$-estimation framework. Depending on how $(z, h)$ is specified, this framework generates three distinct classes of estimators: fixed-parameter smoothed quantile estimators, plug -- in estimators of fixed quantiles, and a new continuum of mean -- estimating procedures. For all three families we establish consistency and asymptotic normality via a uniform asymptotic equicontinuity argument. The limiting variances admit closed forms, allowing a transparent comparison of efficiency across families and smoothing levels. A geometric decomposition of the parameter space shows that, for fixed quantile level $τ$, admissible pairs $(z, h)$ lie on straight lines along which the estimator targets the same population quantile while its asymptotic variance evolves. The theoretical analysis reveals two efficiency regimes. Under light-tailed distributions (e.g., Gaussian), smoothing yields a monotone variance reduction. Under heavy-tailed distributions (e.g., Laplace), a finite smoothing parameter $h^{*}(τ) > 0$ strictly improves efficiency for quantile estimation. Numerical experiments -- based on simulated data and real financial returns -- validate these conclusions and show that, both asymptotically and in finite samples, the mean-estimating family does not improve upon the sample mean.

URL PDF HTML ☆

赞 0 踩 0

2309.15769 2026-06-19 math.ST cs.LG stat.ME stat.TH 版本更新

Benign overfitting beyond prediction: The ordinary least squares interpolator

超越预测的良性过拟合：普通最小二乘插值器

Dennis Shen, Dogyoon Song, Peng Ding, Jasjeet S. Sekhon

发表机构 * Department of Data Sciences & Operations, University of Southern California（数据科学与运营系，南加州大学）； Department of Statistics, University of California, Davis（统计学系，加州大学戴维斯分校）； Department of Statistics, University of California, Berkeley（统计学系，加州大学伯克利分校）； Google DeepMind（谷歌DeepMind）

AI总结本文研究过参数化线性模型中最小ℓ2范数OLS插值器的参数估计与推断性质，推导了留k法、遗漏变量偏误公式和Frisch-Waugh-Lovell定理的过参数化版本，并扩展了高斯-马尔可夫定理。

Comments This work is accepted for publication in Biometrika

详情

AI中文摘要

深度学习的最新进展突显了过参数化统计模型中良性过拟合的现象，引发了对其基础理解的浓厚兴趣。由于其简单性和实际相关性，普通最小二乘（OLS）插值器已成为从理论上理解这一现象的关键研究对象。虽然OLS在经典欠参数化设置下的性质已得到充分理解，但其在过参数化区域中的行为——与岭回归或lasso不同——仍相对较少被探索。我们通过为最小$\ell_2$范数OLS插值器推导新的代数和统计结果，为这一不断增长的文献做出贡献。与现有大部分关注预测风险的工作不同，我们的分析集中于参数估计和推断，这对于许多统计学和因果推断应用至关重要。具体地，我们建立了以下内容的过参数化类比：(i) 留$k$法公式，(ii) 遗漏变量偏误公式，以及(iii) Frisch-Waugh-Lovell定理。在高斯-马尔可夫模型下，我们进一步扩展了高斯-马尔可夫定理，并分析了过参数化设置下同方差性时的方差估计。这些结果共同为研究过参数化线性模型中的参数估计和推断提供了一个系统框架，为超越预测含义的良性过拟合提供了新视角。

英文摘要

Recent advances in deep learning have highlighted the phenomenon of benign overfitting in overparameterized statistical models, sparking significant interest in understanding its foundations. Owing to its simplicity and practical relevance, the ordinary least squares (OLS) interpolator has become a key object of study for gaining theoretical insight into this phenomenon. While the properties of OLS are well understood in classical underparameterized settings, its behavior in the overparameterized regime -- unlike that of ridge regression or the lasso -- remains comparatively less explored. We contribute to this growing literature by deriving new algebraic and statistical results for the minimum $\ell_2$-norm OLS interpolator. In contrast to much of the existing work, which focuses on prediction risk, we center our analysis on parameter estimation and inference, which are fundamental for many statistics and causal inference applications. Specifically, we establish overparameterized analogues of (i) the leave-$k$-out formulas, (ii) the omitted variable bias formula, and (iii) the Frisch-Waugh-Lovell theorem. Under the Gauss-Markov model, we further extend the Gauss-Markov theorem and analyze variance estimation under homoskedasticity in the overparameterized setting. Collectively, these results provide a systematic framework for studying parameter estimation and inference in overparameterized linear models, offering a novel perspective on benign overfitting beyond its implications for prediction.

URL PDF HTML ☆

赞 0 踩 0

2606.19540 2026-06-19 stat.ME stat.CO stat.ML 新提交

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

通过自适应谱收缩的过拟合高维矩阵分解

Lorenzo Mauri, David B. Dunson

AI总结提出EigenBayes方法，通过谱估计和自适应经验贝叶斯校准超参数，实现快速且具有不确定性量化的过拟合因子模型，在数值实验和基因组学应用中优于现有方法。

详情

AI中文摘要

因子模型是分析高维数据以提取低秩信号和估计协方差的常用方法。它们将协方差矩阵分解为低秩分量和对角分量之和。一个关键问题是如何选择潜在维度$k$，当因子模型仅近似成立且信噪比较低时，这尤其具有挑战性。贝叶斯过拟合因子模型指定$k$的上界，并依赖结构化收缩先验有效去除多余分量。这类方法流行且有效，但计算成本高。我们提出了一种更快的\texttt{EigenBayes}方法，基于潜在因子的谱估计和关键超参数的自适应经验贝叶斯校准，提供有效的不确定性量化。得到的后验分布可跨结果分解且解析可处理，绕过了马尔可夫链蒙特卡洛。我们证明\texttt{EigenBayes}能适应每个结果和潜在维度的信噪比，同时将多余的潜在分量收缩至零。我们建立了良好的渐近性质，并在数值实验和基因组学应用中展示了强大的实证性能，其中EigenBayes优于最先进的替代方法。

英文摘要

Factor models are popular approaches for analyzing high-dimensional data to extract low-rank signals and estimate covariances. They decompose the covariance matrix as the sum of low-rank and diagonal components. A key issue is how to choose the latent dimension $k$, which is particularly challenging when the factor model only holds approximately and in low signal-to-noise scenarios. Bayesian overfitted factor models specify an upper bound on $k$ and rely on structured shrinkage priors to effectively remove extra components. Such approaches are popular and effective, but computationally expensive. We propose a much faster \texttt{EigenBayes} approach that provides valid uncertainty quantification, based on spectral estimation of latent factors and adaptive empirical Bayes calibration of key hyperparameters. The resulting posterior distribution factorizes across outcomes and is analytically tractable, bypassing Markov chain Monte Carlo. We show that \texttt{EigenBayes} adapts to the signal-to-noise ratio of each outcome and latent dimension, while shrinking superfluous latent components to zero. We establish favorable asymptotic properties and demonstrate strong empirical performance in numerical experiments and a genomics application, where EigenBayes outperforms state-of-the-art alternatives.

URL PDF HTML ☆

赞 0 踩 0

2606.19643 2026-06-19 stat.ML cs.LG 新提交

Variational Consensus Monte Carlo for Bayesian Mixture

变分共识蒙特卡洛用于贝叶斯混合模型

Julie Fendler, Francesca L. Crowe, Tom Marshall, Sylvia Richardson, Paul D. W. Kirk

AI总结提出变分共识蒙特卡洛方法扩展至过拟合贝叶斯混合模型，通过新颖的聚类匹配算法和聚合策略，在联邦学习设置下推断聚类数和所有参数，并在模拟和真实电子健康记录数据上验证了有效性。

详情

AI中文摘要

受健康数据的隐私、敏感性和共享限制的驱动，我们提出了一个在联邦学习设置下（即数据无法在计算节点之间完全共享或汇集）对贝叶斯混合模型进行推断的全面流程。我们采用共识蒙特卡洛（CMC）方法，在每个数据孤岛内独立运行MCMC算法以估计局部后验分布，然后聚合这些分布以近似完整数据的后验。Rabinovich, Angelino 和 Jordan (2015) [1] 的变分CMC方法将聚合步骤视为变分推断问题，但他们应用于混合模型时假设聚类数和关键混合参数已知。我们的主要方法贡献是：(i) 将变分CMC扩展到过拟合贝叶斯混合模型，该模型推断聚类数和所有模型参数，无需共轭性；(ii) 适用于跨孤岛设置的新颖聚类匹配算法，其中并非每个聚类都出现在每个局部数据集中；(iii) 针对聚合步骤的多种推断策略，匹配不同的联邦学习约束；以及 (iv) 在实践中选择这些策略的指南。一项全面的模拟研究验证了该框架，并允许我们与最先进的联邦学习替代方法进行比较。值得注意的是，我们表明当局部数据集的组成反映了数据中的底层聚类结构时，我们的方法可以比应用于汇集数据的标准MCMC更准确地恢复小聚类。我们在大规模电子健康记录数据上展示了该框架，识别了英国老年人群中的多发病模式。

英文摘要

Motivated by the privacy, sensitivity and sharing limitations of health data, we present a comprehensive pipeline for inference of Bayesian mixture models within a federated learning setting, i.e. when data cannot be fully shared or pooled across compute nodes. We adopt a Consensus Monte Carlo (CMC) approach, in which an MCMC algorithm is run independently within each data silo to estimate local posterior distributions, which are then aggregated to approximate the posterior over the full data. The variational CMC approach of Rabinovich, Angelino and Jordan (2015) [1] frames the aggregation step as a variational inference problem, but their application to mixtures assumes the number of clusters and key mixture parameters to be known. Our main methodological contributions are: (i) an extension of variational CMC to over-fitted Bayesian mixture models that infer the number of clusters and all model parameters, without requiring conjugacy; (ii) novel cluster-matching algorithms suitable for cross-silo settings in which not every cluster appears in each local dataset; (iii) a number of inference strategies for the aggregation step, matched to different federated learning constraints; and (iv) guidelines for choosing among these in practice. A comprehensive simulation study validates the framework and allows us to compare to state-of-the-art federated learning alternatives. Notably, we show that when the composition of local datasets reflects the underlying clustering structure in the data, our approach can recover small clusters with greater accuracy than standard MCMC applied to the pooled data. We illustrate the framework on large-scale electronic health record data, identifying multi-morbidity patterns in a British geriatric population.

URL PDF HTML ☆

赞 0 踩 0

2606.20480 2026-06-19 math.ST stat.ML stat.TH 新提交

Leveraging tails for adaptation

利用尾部进行自适应

Sergios Agapiou, Ismaël Castillo, Paul Egels

AI总结研究非参数贝叶斯中基于p-指数尾先验的后验收缩率，发现p越小收缩越快，且p→0时可实现光滑性自适应，应用于白噪声回归和ReLU神经网络。

Comments 59 pages, 3 figures

2604.06464 2026-06-19 cs.LG physics.app-ph stat.ML 版本更新

Weighted Bayesian Conformal Prediction

加权贝叶斯共形预测

Xiayin Lou, Peng Luo

发表机构 * Technical University of Munich（慕尼黑技术大学）； Massachusetts Institute of Technology（麻省理工学院）

AI总结提出加权贝叶斯共形预测（WBCP），通过加权Dirichlet先验推广贝叶斯共形预测到重要性加权设置，理论证明有效样本量决定后验方差，并提供更丰富的条件覆盖不确定性。

详情

AI中文摘要

共形预测提供具有有限样本覆盖保证的分布自由预测区间，Snell & Griffiths 最近的工作将其重新解释为贝叶斯求积（BQ-CP），通过阈值上的 Dirichlet 后验产生强大的数据条件保证。然而，BQ-CP 根本上要求 i.i.d. 假设。同时，加权共形预测通过重要性权重处理分布偏移，但仍然是频率学派方法，仅产生点估计阈值。我们提出 \textbf{加权贝叶斯共形预测（WBCP）}，它将 BQ-CP 推广到任意重要性加权设置，用加权 Dirichlet $\Dir(\neff \cdot \tilde{w}_1, \ldots, \neff \cdot \tilde{w}_n)$ 替换均匀 Dirichlet $\Dir(1,\ldots,1)$，其中 $\neff$ 是 Kish 有效样本量。我们证明了四个理论结果：(1)~$\neff$ 是匹配频率学派和贝叶斯方差的唯一集中参数；(2)~后验标准差以 $O(1/\sqrt{\neff})$ 衰减；(3)~BQ-CP 的随机占优保证扩展到每个权重轮廓的数据条件保证；(4)~HPD 阈值在条件覆盖上提供 $O(1/\sqrt{\neff})$ 的改进。我们将 WBCP 实例化为 \emph{地理贝叶斯共形预测}，其中基于核的空间权重产生每个位置的后验，并具有可解释的诊断。在合成和真实空间数据集上的实验表明，WBCP 在保持覆盖保证的同时提供了更丰富的不确定性信息。

英文摘要

Conformal prediction provides distribution-free prediction intervals with finite-sample coverage guarantees, and recent work by Snell \& Griffiths reframes it as Bayesian Quadrature (BQ-CP), yielding powerful data-conditional guarantees via Dirichlet posteriors over thresholds. However, BQ-CP fundamentally requires the i.i.d. assumption. Meanwhile, weighted conformal prediction handles distribution shift via importance weights but remains frequentist, producing only point-estimate thresholds. We propose \textbf{Weighted Bayesian Conformal Prediction (WBCP)}, which generalizes BQ-CP to arbitrary importance-weighted settings by replacing the uniform Dirichlet $\Dir(1,\ldots,1)$ with a weighted Dirichlet $\Dir(\neff \cdot \tilde{w}_1, \ldots, \neff \cdot \tilde{w}_n)$, where $\neff$ is Kish's effective sample size. We prove four theoretical results: (1)~$\neff$ is the unique concentration parameter matching frequentist and Bayesian variances; (2)~posterior standard deviation decays as $O(1/\sqrt{\neff})$; (3)~BQ-CP's stochastic dominance guarantee extends to per-weight-profile data-conditional guarantees; (4)~the HPD threshold provides $O(1/\sqrt{\neff})$ improvement in conditional coverage. We instantiate WBCP for spatial prediction as \emph{Geographical BQ-CP}, where kernel-based spatial weights yield per-location posteriors with interpretable diagnostics. Experiments on synthetic and real-world spatial datasets demonstrate that WBCP maintains coverage guarantees while providing substantially richer uncertainty information.

URL PDF HTML ☆

赞 0 踩 0

2606.20148 2026-06-19 stat.ME 新提交

A case study of causal mediation using Bayesian nonparametrics and semiparametric corrections

使用贝叶斯非参数和半参数修正的因果中介分析案例研究

Yuhua Zhang, Michael J. Daniels

AI总结提出截断富集狄利克雷过程混合模型估计自然直接和间接效应，结合高效MCMC算法和基于有效影响函数的一步后验修正，解决贝叶斯非参数中因果估计量的可靠推断问题。

2606.20078 2026-06-19 stat.OT 新提交

气象学中基于回归的因果推断实用入门（I）：所有混杂因素可测

Caren Marzban, Yikun Zhang, Nicholas Bond, Michael Richman

AI总结介绍在非时间序列场景下，利用匹配方法进行因果推断，提供气象学应用实例和R代码。

详情

AI中文摘要

一个变量是否是另一个变量的原因，或者仅仅与之相关，通常是一个重要的科学问题。因果推断是在统计背景下解决该问题的技术体系。尽管在存在时间信息时评估因果关系相对直接，但在非时间序列场景（本文考虑的情况）下，评估因果效应更为困难。因果推断领域的发展涉及广泛的主题概念，从而限制了其在包括气象学在内的一些领域的应用。然而，其核心所需的因果推断知识仅涉及基本概率论和回归，这是大多数气象学家熟悉的主题。通过聚焦这些核心领域，本文及其姊妹篇为气象学界进入（非时间序列）因果推断领域提供了垫脚石。尽管介绍了一些理论基础，但主要目标是将一种称为匹配的特定方法应用于气象学问题。应用数据为公开数据，并提供了R代码，为气象学学生和研究人员进入该领域铺平了道路。

英文摘要

Whether a variable is the cause of another, or simply associated with it, is often an important scientific question. Causal Inference is the name associated with the body of techniques for addressing that question in a statistical setting. Although assessing causality is relatively straightforward in the presence of temporal information, outside of that setting - the situation considered here - it is more difficult to assess causal effects. The development of the field of causal inference has involved concepts from a wide range of topics, thereby limiting its adoption across some fields, including meteorology. However, at its core, the requisite knowledge for causal inference involves little more than basic probability theory and regression, topics familiar to most meteorologists. By focusing on these core areas, this and a companion article provide a steppingstone for the meteorology community into the field of (non-temporal) causal inference. Although some theoretical foundations are presented, the main goal is the application of a specific method, called matching, to a problem in meteorology. The data for the application are in public domain, and R code is provided as well, forming an easy path for meteorology students and researchers to enter the field.

URL PDF HTML ☆

赞 0 踩 0

2506.18652 2026-06-19 stat.AP 版本更新

A Practical Introduction to Regression-based Causal Inference in Meteorology (II): Unmeasured confounders

气象学中基于回归的因果推断实用入门（二）：未测量的混杂因素

Caren Marzban, Yikun Zhang, Nicholas Bond, Michael Richman

AI总结介绍在未测量混杂因素存在时，利用工具变量法通过回归估计因果效应，并以气象数据为例说明工具变量选择的重要性。

详情

AI中文摘要

将相关性“提升”为因果关系的障碍之一是混杂现象，即两个变量之间的相关性实际上是由第三个变量（称为混杂因素）引起的。在先前的一篇配套文章中，我们考察了混杂因素被测量的情况。本文表明，即使混杂变量未被测量，在某些条件下，仍然可以通过一种基于回归的方法（利用工具变量的概念）来估计因果效应。使用与姊妹篇类似的气象数据集，比较和对比了因果效应的几种不同估计。结果表明，工具变量估计的因果效应依赖于工具变量的选择，而气象学考虑对于解决这种不确定性至关重要。提供了用于生成所有结果的R代码，并概述了未来工作的许多方向。

英文摘要

One obstacle to ``elevating'' correlation to causation is the phenomenon of confounding, i.e., when a correlation between two variables exists because both variables are in fact caused by a third variable, called a confounder. The situation where the confounders are measured is examined in an earlier, accompanying article. Here, it is shown that even when the confounding variables are not measured, under certain conditions it is still possible to estimate the causal effect via a regression-based method that uses the notion of instrumental variables. Using a meteorological data set, similar to that in the sister article, a number of different estimates of the causal effect are compared and contrasted. It is shown that the instrumental-variable estimates of causal effect depend on the choice of the instrumental variable, and that meteorological considerations are important in resolving the ambiguity. R code is provided for generating all of the results, and numerous directions for future work are outlined.

URL PDF HTML ☆

赞 0 踩 0

2405.00118 2026-06-19 math.ST stat.ME stat.TH 版本更新

Causal Inference with High-dimensional Discrete Covariates

高维离散协变量下的因果推断

Zhenghao Zeng, Sivaraman Balakrishnan, Yanjun Han, Edward H. Kennedy

AI总结研究高维离散协变量下因果效应的估计问题，证明常用估计量的均方误差界为d²/n²+1/n，并给出极小化下界，提出利用效应同质性和先验知识的新估计量以加速收敛。

Comments 74 pages, 9 figures

详情

AI中文摘要

在从观察性研究估计因果效应时，研究人员通常需要调整许多协变量以消除暴露与结果之间的非因果关系，其中许多协变量是离散的。常用估计量在存在许多离散协变量时的行为尚不明确，因为它们的性质通常是在稀疏性和平滑性等结构假设下分析的，而这些假设不适用于离散设置。在这项工作中，我们研究了一个模型中因果效应的估计，其中用于混杂调整的协变量是离散但高维的，意味着类别数量$d$与样本量$n$相当甚至更大。具体来说，我们证明了常用回归、加权和双稳健估计量的均方误差以$\frac{d^2}{n^2}+\frac{1}{n}$为界。然后，我们证明了平均处理效应的极小化下界为$\frac{d^2}{n^2 \log^2 n}+\frac{1}{n}$量级，这刻画了高维离散设置下因果效应估计的基本难度，并表明上述估计量在忽略对数因子时是速率最优的。我们进一步考虑了可以利用的额外结构，即效应同质性和协变量分布的先验知识，并提出了新的估计量，这些估计量具有更快的收敛速率$\frac{d}{n^2} + \frac{1}{n}$，从而在更广泛的范围内实现一致性。通过模拟研究对结果进行了实证说明。

英文摘要

When estimating causal effects from observational studies, researchers often need to adjust for many covariates to deconfound the non-causal relationship between exposure and outcome, among which many covariates are discrete. The behavior of commonly used estimators in the presence of many discrete covariates is not well understood since their properties are often analyzed under structural assumptions including sparsity and smoothness, which do not apply in discrete settings. In this work, we study the estimation of causal effects in a model where the covariates required for confounding adjustment are discrete but high-dimensional, meaning the number of categories $d$ is comparable with or even larger than sample size $n$. Specifically, we show the mean squared error of commonly used regression, weighting and doubly robust estimators is bounded by $\frac{d^2}{n^2}+\frac{1}{n}$. We then prove the minimax lower bound for the average treatment effect is of order $\frac{d^2}{n^2 \log^2 n}+\frac{1}{n}$, which characterizes the fundamental difficulty of causal effect estimation in the high-dimensional discrete setting, and shows the estimators mentioned above are rate-optimal up to log-factors. We further consider additional structures that can be exploited, namely effect homogeneity and prior knowledge of the covariate distribution, and propose new estimators that enjoy faster convergence rates of order $\frac{d}{n^2} + \frac{1}{n}$, which achieve consistency in a broader regime. The results are illustrated empirically via simulation studies.

URL PDF HTML ☆

赞 0 踩 0

2606.20514 2026-06-19 stat.ME 新提交

Hypergraph Variable Selection with False Discovery Rate Control

具有错误发现率控制的超图变量选择

Sarah Organ, Toby Kenney, Hong Gu

AI总结针对预测变量复杂依赖结构导致变量选择方法功效降低的问题，提出基于超图的选择方法，在控制错误发现率的同时提高选择功效。

Comments 28 pages, 4 figures

2202.03332 2026-06-19 stat.ME econ.EM stat.AP 版本更新

Practical Forecasting of Environmental Maps: A Functional Data Approach

环境地图的实用预测：一种函数型数据方法

Alexander Gleim, Nazarii Salish

AI总结提出一种基于函数型数据分析的统计方法，用于预测随时间变化的地理区域环境数据，通过整合时空依赖关系生成预测表面，并以德国地面臭氧浓度预测为例验证其有效性。

详情

AI中文摘要

环境问题在社会经济和健康研究中日益受到关注，推动了相关现实过程记录和数据收集的进展。然而，传统数据处理工具往往过于局限，无法考虑此类数据集的丰富特性。本文提出了一种简单的统计视角，用于预测随时间在预定义地理区域上顺序收集的环境数据。我们将此类数据集视为具有可能复杂地理区域的表面（或函数型）时间序列。利用函数型数据分析技术，我们开发了一种预测方法，能够同时考虑地理和时间依赖性。该方法允许整合传统多元技术以提供预测表面。我们通过德国地面臭氧浓度的预测示例展示了我们方法的实用价值，证明了其有效性和广泛应用的潜力。

英文摘要

Environmental problems are receiving increasing attention in socio-economic and health studies, fostering advances in recording and data collection of related real-life processes. However, traditional tools for data processing are often found too restrictive as they do not account for the rich nature of such data sets. In this paper, we propose a simple statistical perspective on forecasting environmental data collected sequentially over time across some predefined geographic region. We treat such data set as a surface (or functional) time series with a possibly complicated geographical domain. Using techniques from functional data analysis, we develop a forecasting methodology that allows to account for both geographic and temporal dependencies. This methodology allows integration of traditional multivariate techniques to provide forecasts surfaces. We demonstrate the practical value of our approach with a forecasting example of ground-level ozone concentration across Germany, showcasing its effectiveness and potential for broad application.

URL PDF HTML ☆

赞 0 踩 0

2606.20191 2026-06-19 stat.ML stat.ME 新提交

AK-MCS-C2 : Active Kriging Monte Carlo Simulation method with conformal certification for failure probability estimation

AK-MCS-C2: 具有共形认证的主动克里金蒙特卡洛模拟方法用于失效概率估计

Edgar Jaber, Vincent Chabridon, Mathilde Mougeot

AI总结提出一种结合主动克里金蒙特卡洛模拟与共形预测的主动学习框架，通过自适应交叉共形策略和J+GP共形估计器，在少量样本下提供无分布假设的预测误差保证，提高极限状态面附近样本分类可靠性，从而提升失效概率估计的准确性和鲁棒性。

2606.20141 2026-06-19 stat.CO 新提交

DASH: A Dimensionality Reduction Method for Large-scale Convex MIQP with Applications in Subset Portfolio Selection

DASH: 一种用于大规模凸MIQP的降维方法及其在子集投资组合选择中的应用

Pinzhang Cheng

AI总结提出DASH降维方法，通过减少变量层次改善大规模凸MIQP求解器性能，在子集投资组合选择中显著提升Gurobi难以求解问题的初始解质量。

详情

折叠传输MCMC：对称贝叶斯模型的可认证商后验计算

Jun Hu

发表机构 * Wuhan University of Technology（武汉理工大学）

AI总结针对对称贝叶斯模型中的冗余多峰性导致MCMC收敛诊断退化的问题，提出Folded Transport MCMC方法，通过在对称群的基本域上构建独立采样器直接对商后验进行推断，并利用LCNF振荡认证框架在商度量下提供可证明的认证下界。

Comments 50 pages (including supplementary material), 5 figures, 6 tables. Submitted to Journal of Computational and Graphical Statistics

详情

AI中文摘要

具有有限对称性的贝叶斯模型——如可交换分量的混合模型、具有紧密间隔模态的结构识别——定义的后验在标签置换群下不变，产生冗余的多峰性，从而降低MCMC收敛诊断的质量。我们引入折叠传输MCMC（FolT-MCMC），该方法通过在对称群的基本域上构建独立采样器，直接对商后验进行推断。商提议分布通过对群轨道上学习的归一化流进行对称化得到。我们证明了基于LCNF振荡的认证框架可以迁移到商度量，并具有稳定子修正的球质量界和改进的覆盖半径，并且当未折叠流表现出跨模态提议缺陷时，分位数核心认证下界会得到改善。在高斯混合（d=2-20）、标签切换目标（最多24个等价模态）以及标准贝叶斯三分量混合后验上，分位数核心认证改进比从2倍到145倍不等，且折叠认证经验上几乎与维度无关。在台风山竹期间超高层建筑的真实加速度计数据上，FolT-MCMC产生了非平凡的分位数核心认证，而未折叠认证是平凡的。

英文摘要

In Bayesian mixture models and other exchangeable-component models, the posterior is invariant under permutation of component labels, creating m! equivalent modes-the label-switching problem. Standard MCMC methods either mix poorly across these modes or rely on post-hoc relabelling that cannot guarantee the sampler has converged. We propose Folded Transport MCMC (FolT-MCMC), which eliminates label switching before sampling by restricting the Markov chain to a fundamental domain-a sorted or reflected subspace containing exactly one representative from each symmetric mode. The proposal is a learned normalising flow whose density is symmetrised over the group orbits, ensuring correct targeting on the reduced space. We show that this construction preserves a computable convergence diagnostic based on the oscillation of the log-density ratio, and that the diagnostic becomes sharper on the fundamental domain whenever the original-space flow under-covers one or more symmetric modes. Experiments on Gaussian mixtures (d=2-20), label-switching targets (up to 24 equivalent modes), a standard Bayesian three-component mixture posterior, and real accelerometer data from a supertall building show improvement ratios of 2x to 145x, with the folded diagnostic stable across dimensions while the unfolded diagnostic collapses.

URL PDF HTML ☆

赞 0 踩 0

2603.20022 2026-06-19 stat.ME 版本更新

Q-approximation of operating characteristics of clinical trial designs

临床试验设计操作特性的Q-近似

Susanna Gentile, Daniel E. Schwartz, Riddhiman Saha, Lorenzo Trippa

AI总结提出Q-近似方法，通过二次近似似然函数替代完整数据模拟，快速评估临床试验的操作特性，计算效率比蒙特卡罗模拟高150-1900倍。

详情

AI中文摘要

设计临床试验需要评估多个操作特性（OCs），例如早期停止决策的可能性、检测治疗效应的概率以及I类错误率。在大多数情况下，这些评估基于计算密集型的蒙特卡罗模拟。随着临床试验复杂性和适应性设计使用的增加，计算负担可能迅速变得难以承受。我们引入了一种快速近似OCs的策略，称为Q-近似。我们的方法基于对数似然的二次近似和渐近论证。主要思想是用模拟决定试验中期和最终决策的近似似然函数来替代完整试验数据集的模拟。Q-近似方法可应用于任何使用与似然原理一致的数据分析方法的试验设计，包括具有早期停止的多阶段设计、自适应随机化设计以及利用外部数据的设计。我们通过几个例子说明了该方法，并表明它在减少计算时间的同时提供了重要OCs的准确近似。特别是，在我们的实验中，要达到相当的精度水平，标准蒙特卡罗近似OCs所需的计算预算比Q-近似高150到1900倍。通过实现快速的OC评估，Q-近似可以支持在应用试验规划和方法学开发中更广泛地使用创新试验设计。

英文摘要

Designing clinical trials requires evaluating multiple operating characteristics (OCs), such as the likelihood of an early stopping decision, the probability of detecting a treatment effect, and the Type I error rate. In most cases, these evaluations are based on computationally intensive Monte Carlo simulations. As the complexity of clinical trials and the use of adaptive designs increase, the computational burden can quickly become prohibitive. We introduce a strategy for rapidly approximating OCs, called the Q-approximation. Our approach is based on quadratic approximations of the log-likelihood and asymptotic arguments. The main idea is to replace simulation of full trial datasets with simulation of the approximate likelihood functions that determine the trial's interim and final decisions. The Q-approximation approach can be applied to any trial design that uses data analysis methods coherent with the likelihood principle, including multistage designs with early stopping, adaptively randomized designs, and designs that leverage external data. We illustrate the approach with several examples and show that it provides an accurate approximation of important OCs while reducing the computation time compared to Monte Carlo simulations. In particular, in our experiments, the standard Monte Carlo approximation of OCs requires 150 to 1,900 times greater computing budget than Q-approximations to achieve comparable levels of accuracy. By enabling fast OC evaluations, Q-approximations can support the broader use of innovative trial designs in both applied trial planning and methodological development.

URL PDF HTML ☆

赞 0 踩 0

2602.01929 2026-06-19 math.DS stat.CO stat.ML 版本更新

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of stochastic dynamical systems

概率函数对函数非线性自回归模型用于随机动力系统的仿真与可靠性分析

Zhouzhou Song, Marcos A. Valdebenito, Styfen Schär, Stefano Marelli, Bruno Sudret, Matthias G. R. Faes

AI总结提出F2NARX模型，从函数对函数回归角度改进NARX方法，结合PCA和高斯过程回归实现概率预测，并通过主动学习高效估计首次穿越失效概率。

详情

AI中文摘要

在许多工程领域，构建准确且计算高效的代理模型（或仿真器）用于预测动力系统响应至关重要，但由于外部激励和系统参数到系统响应的强非线性和高维映射，这仍然具有挑战性。本文引入了一种新颖的函数对函数非线性自回归外生输入模型（F2NARX），该模型从函数对函数回归的角度重新表述了最近提出的$\mathcal{F}$-NARX方法。所提出的框架在保持高精度的同时显著提高了预测效率。通过将主成分分析与高斯过程回归相结合，F2NARX进一步通过无迹变换以自回归方式实现动力响应的概率预测。这种概率预测能力进一步促进了首次穿越概率评估的主动学习。通过不同复杂度的案例研究证明了该方法的有效性。结果表明，F2NARX在效率上比最先进的NARX模型高出几个数量级，同时通常达到更高的精度。此外，主动学习方法能够仅使用少量训练时间历程准确估计动力系统的首次穿越失效概率。

英文摘要

Constructing accurate and computationally efficient surrogate models (or emulators) for predicting dynamical system responses is critical in many engineering domains, yet remains challenging due to the strongly nonlinear and high-dimensional mapping from external excitations and system parameters to system responses. This work introduces a novel Function-on-Function Nonlinear AutoRegressive model with eXogenous inputs (F2NARX), which reformulates the recently proposed $\mathcal{F}$-NARX method from a function-on-function regression perspective. The proposed framework substantially improves predictive efficiency while maintaining high accuracy. By combining principal component analysis with Gaussian process regression, F2NARX further enables probabilistic predictions of dynamical responses via the unscented transform in an autoregressive manner. Such probabilistic prediction capabilities further facilitate active learning for first-passage probability evaluation. The effectiveness of the method is demonstrated through case studies of varying complexity. Results show that F2NARX outperforms state-of-the-art NARX model by orders of magnitude in efficiency while achieving higher accuracy in general. Meanwhile, the active learning approach enables accurate estimation of first-passage failure probabilities for dynamical systems using only a small number of training time histories.

URL PDF HTML ☆

赞 0 踩 0

2601.23173 2026-06-19 stat.ME 版本更新

Robust, partially alive particle Metropolis-Hastings via the Frankenfilter

鲁棒的、部分存活的粒子Metropolis-Hastings算法：基于Frankenfilter

Chris Sherlock, Andrew Golightly, Anthony Lee

AI总结针对隐马尔可夫模型中条件似然为零导致粒子滤波失效的问题，提出Frankenfilter，通过固定模拟次数上下限并设定成功目标，实现鲁棒且高效的似然估计，在伪边际Metropolis-Hastings中比标准粒子滤波效率提高2-3倍。

详情

AI中文摘要

当隐马尔可夫模型允许给定隐藏过程的观测条件似然为零时，从一个观测时间到下一个观测时间的所有粒子模拟可能产生零值。如果是这样，滤波分布无法估计，且估计的参数似然为零。存活粒子滤波器通过为每个观测间隔模拟随机数量的粒子来解决这个问题，在达到目标数量的非零条件似然后停止。对于异常观测或较差的参数值，非零结果可能极不可能发生，计算成本过高。我们引入了Frankenfilter，一种有原则的、部分存活的粒子滤波器，它在固定模拟次数上下限的同时，针对用户定义的成功量。Frankenfilter产生似然的无偏估计，适用于伪边际Metropolis-Hastings（PMMH）。我们证明，与使用标准粒子滤波器的PMMH相比，使用Frankenfilter的PMMH对异常值和错误指定的初始参数值更加鲁棒，并且通常效率至少提高2-3倍。我们还提供了选择成功量的建议。在n个精确观测的情况下，这特别简单：目标为n次成功。

英文摘要

When a hidden Markov model permits the conditional likelihood of an observation given the hidden process to be zero, all particle simulations from one observation time to the next could produce zeros. If so, the filtering distribution cannot be estimated and the estimated parameter likelihood is zero. The alive particle filter addresses this by simulating a random number of particles for each inter-observation interval, stopping after a target number of non-zero conditional likelihoods. For outlying observations or poor parameter values, a non-zero result can be extremely unlikely, and computational costs prohibitive. We introduce the Frankenfilter, a principled, partially alive particle filter that targets a user-defined amount of success whilst fixing lower and upper bounds on the number of simulations. The Frankenfilter produces unbiased estimators of the likelihood, suitable for pseudo-marginal Metropolis--Hastings (PMMH). We demonstrate that PMMH with the Frankenfilter is more robust to outliers and mis-specified initial parameter values than PMMH using standard particle filters, and is typically at least 2-3 times more efficient. We also provide advice for choosing the amount of success. In the case of n exact observations, this is particularly simple: target n successes.

URL PDF HTML ☆

赞 0 踩 0

2512.17473 2026-06-19 eess.SP cs.LG math.OC stat.ML 版本更新

Alternating Direction Method of Multipliers for Nonlinear Matrix Decompositions

非线性矩阵分解的交替方向乘子法

Atharva Awari, Nicolas Gillis, Arnaud Vandaele

发表机构 * University of Mons（蒙斯大学）

AI总结提出基于交替方向乘子法（ADMM）的算法求解非线性矩阵分解（NMD），支持多种非线性函数和损失函数，在真实数据集上验证了适用性和效率。

Comments 16 pages, 7 figures. v3: Revised version: added new experiments and comparisons. Code available from https://gitlab.com/Atharva05/admm-for-nmd

详情

AI中文摘要

我们提出了一种基于交替方向乘子法（ADMM）的算法，用于求解非线性矩阵分解（NMD）。给定输入矩阵 $X \in \mathbb{R}^{m \times n}$ 和分解秩 $r \ll \min(m, n)$，NMD 寻求矩阵 $W \in \mathbb{R}^{m \times r}$ 和 $H \in \mathbb{R}^{r \times n}$，使得 $X \approx f(WH)$，其中 $f$ 是逐元素非线性函数。我们在几个代表性非线性模型上评估了我们的方法：适用于非负稀疏数据近似的修正线性单元激活 $f(x) = \max(0, x)$，适用于概率电路表示的逐分量平方 $f(x) = x^2$，以及适用于推荐系统的 MinMax 变换 $f(x) = \min(b, \max(a, x))$。所提出的框架灵活支持多种损失函数，包括最小二乘、$\ell_1$ 范数和 Kullback-Leibler 散度，并且可以轻松扩展到其他非线性和度量。我们在真实世界数据集上展示了该方法的适用性、效率和适应性，突出了其在广泛应用中的潜力。

英文摘要

一种无求解器的预测后优化训练方法

Beichen Wan, Mo Liu

AI总结提出一种基于测度变换的决策聚焦学习管道，通过无求解器代理损失实现预测后优化中预测模型的高效训练，理论保证Fisher一致性，训练时间降低数个数量级。

Comments Accepted by ICML 2026

详情

AI中文摘要

我们提出了一种可扩展的方法，用于在预测后优化范式中训练预测（机器学习）模型，其中模型输出作为后续线性优化任务的系数。直接最小化经验决策遗憾对于线性规划和组合优化是不可行的，因为决策映射是分段常数，且梯度几乎处处为零。虽然现有方法通过平滑微分过程来解决这一问题，但它们存在可扩展性问题，因为每次梯度评估都需要调用计算昂贵的求解器。为了解决这个问题，我们提出了一种基于测度变换原理的决策聚焦学习管道，该管道在训练期间产生一个完全无优化求解器的新代理损失。我们建立了理论保证，包括Fisher一致性和超额风险界。实验上，我们的方法在实现与最先进方法相当的决策质量的同时，将训练时间减少了数个数量级。

英文摘要

We propose a scalable method for training prediction (machine learning) models in the predict-then-optimize paradigm, where model outputs serve as coefficients for a subsequent linear optimization task. Directly minimizing the empirical decision regret is intractable for linear programming and combinatorial optimization since the decision mapping is piecewise constant, and the gradients are zero almost everywhere. While existing methods address this by smoothing the differentiation process, they suffer from scalability issues, since a computationally expensive solver call is required for every gradient evaluation. To address this, we propose a decision-focused learning pipeline based on a measure transformation principle, which yields a new surrogate loss that is completely optimization-solver-free during training. We establish theoretical guarantees, including Fisher consistency and excess risk bounds. Empirically, our method achieves decision quality competitive with state-of-the-art methods while reducing training time by orders of magnitude.

URL PDF HTML ☆

赞 0 踩 0

2606.19410 2026-06-19 stat.ML cs.LG 新提交

The Representational Limit of Scalar Interactions: An Interventional Decomposition

标量交互的表征限制：一种干预分解

Potito Aghilar, Sabino Roccotelli, Stanislao Fidanza, Vito Walter Anelli, Sebastiano Stramaglia, Tommaso Di Noia

AI总结本文证明标量交互指标混淆了唯一性、冗余性和协同性，并提出Stochastic Hi-Fi方法，通过干预掩码推理分解每个特征的U/R/S轮廓，在表格和图像任务中恢复被标量基线遗漏的结构。

详情

AI中文摘要

有符号的成对交互指标从根本上混淆了唯一性（U）、冗余性（R）和协同性（S）。我们在一个最小的3路XOR结构因果模型上证明了这一点：忠实的指标如Shapley-Taylor对每对返回零，而投影指标如Shapley Interaction将三阶效应扩散到混淆三种机制的成对标量中。我们引入了Stochastic Hi-Fi，一种事后、无需重新训练的可预测性分解方法，通过干预掩码推理估计每个特征的U/R/S轮廓。该估计器提供精确的干预语义、有限样本蒙特卡洛界限、耦合菱形采样带来的严格方差减少以及均匀的有限词汇收敛。在表格SCM上，Stochastic Hi-Fi恢复了被标量基线遗漏的结构（交互幅度恢复比高达411倍）。它还在GPT-2 IOI电路中分离了冗余和协同头。在NIH ChestX-ray14上，Stochastic Hi-Fi在Pointing Game中匹配GradCAM，并在Deletion AUC上显著改进。

英文摘要

Signed pairwise interaction scores fundamentally conflate uniqueness (U), redundancy (R), and synergy (S). We prove this on a minimal 3-way XOR structural causal model: faithful indices such as Shapley-Taylor return zero per pair, whereas projective indices such as Shapley Interaction spread the third-order effect into pair scalars that conflate the three mechanisms. We introduce Stochastic Hi-Fi, a post-hoc, retraining-free predictability decomposition that estimates per-feature U/R/S profiles by interventional masked inference. The estimator provides exact interventional semantics, finite-sample Monte Carlo bounds, strict variance reduction from coupled diamond sampling, and uniform finite-vocabulary convergence. Across tabular SCMs, Stochastic Hi-Fi recovers structure missed by scalar baselines (up to 411x larger interaction-magnitude recovery ratios). It also separates redundant and synergistic heads in the GPT-2 IOI circuit. On NIH ChestX-ray14, Stochastic Hi-Fi matches GradCAM on Pointing Game and improves substantially on Deletion AUC.

URL PDF HTML ☆

赞 0 踩 0

2606.19883 2026-06-19 cs.LG stat.ML 新提交

Matching Markets meet Cumulative Prospect Theory: Towards Optimal and Adversarially Robust Learning

匹配市场遇上累积前景理论：迈向最优和对抗鲁棒学习

Ananya Kunisetty, Avishek Ghosh

发表机构 * Indian Institute of Technology Bombay（印度理工学院孟买分校）

AI总结研究基于累积前景理论（CPT）的竞争性双边匹配市场多智能体多臂赌博机问题，提出最优遗憾界算法并扩展到对抗性市场。

Comments Accepted at ECML-PKDD 2026, Naples, Italy

详情

AI中文摘要

我们研究了一个在竞争性设置下具有双边匹配市场的多智能体多臂赌博机问题，该问题基于以人为中心的决策模型。为了捕捉人类偏好，我们使用累积前景理论（CPT），该理论通过一个（α-Hölder连续）权重函数以非线性方式加权智能体的行动。CPT已被广泛用于行为经济学和风险敏感机器学习中，以模拟人类偏好。我们分析了带有CPT权重扭曲奖励的最先进学习算法，并获得了玩家最优遗憾界为$\mathcal{O}(K\log T \left(\frac{1}{\Delta}\right)^{2/\alpha})$，其中$K$表示臂数，$T$是学习时间，$\Delta$表示（适当定义的）玩家的最小偏好差距。注意到对$\Delta$的依赖是次优的，我们通过明智地选择探索期间的活跃臂集进一步改进了这一遗憾，从而在主导项中消除了对$K$的依赖，并在臂数$K$显著大于玩家数$N$的设置中实现了改进的（最优）遗憾保证。此外，我们考虑了对抗性市场，其中智能体的观测奖励可能被破坏。我们提出并分析了在已知和未知总破坏预算两种设置下，以CPT作为风险敏感度量的鲁棒市场算法，并在两种情况下建立了对数级别的玩家最优遗憾保证。

英文摘要

We study a multi-agent multi-armed bandit problem in the competitive setup with two-sided matching markets under a human centric decision making model. To capture human preferences, we use cumulative prospect theory (CPT) that weighs the actions of the agent in a nonlinear fashion using a ($α$-Hölder continuous) weight function. CPT has been widely used in behavioral economics and risk sensitive machine learning to emulate human preferences. We analyze the state-of-the-art learning algorithm with CPT weight distorted rewards and obtain a player optimal regret of $\mathcal{O}(K\log T \left(\frac{1}Δ\right)^{2/α})$, where $K$ denotes the number of arms, $T$ is the learning horizon, and $Δ$ represents (suitably defined) players' minimum preference gap. Noticing the dependence on $Δ$ to be sub-optimal, we further improve this regret by judiciously selecting the active set of arms during exploration, which removes the dependence on $K$ in the dominant term and achieves an improved (optimal) regret guarantees in the setting where the number of arms $K$ is significantly larger than the number of players $N$. In addition, we consider adversarial markets where the observed rewards of the agents may be corrupted. We propose and analyze algorithms for robust markets with CPT as risk sensitive measure in both settings where the total corruption budget is known and where it is unknown, and establish logarithmic player-optimal regret guarantees in both cases.

URL PDF HTML ☆

赞 0 踩 0

2606.19607 2026-06-19 cs.AI stat.AP 新提交

Which Pairs to Compare for LLM Post-Training?

LLM后训练中应比较哪些对？

Jiangze Han, Vineet Goyal, Will Ma

发表机构 * Columbia University（哥伦比亚大学）

AI总结研究偏好后训练中如何选择最具信息量的比较对，提出基于采样设计的比较策展方法，通过DPO训练的理论分析给出优化准则，实验证明能提升样本效率。

详情

AI中文摘要

基于偏好的后训练已成为对齐语言模型的核心范式。常见的数据收集策略是为每个提示生成少量补全并标注生成的比较对。然而，人工偏好标签通常比生成额外补全昂贵得多，这提示了相同标注预算的不同使用方式：生成更大的补全集，但只标注最具信息量的比较对。本文研究在基于偏好的后训练中应比较哪些对。我们将比较策展形式化为一个采样设计问题，并通过基于偏好的后训练目标下的最终策略质量来评估设计。我们针对直接偏好优化（DPO）实例化该框架，分析标注对的选择如何通过DPO训练传播到下游策略性能。我们的主要结果为DPO训练策略的后训练最优性差距提供了匹配的上界和下界。这些界限表明，比较选择通过一个单一的设计相关信息矩阵影响下游性能，该矩阵将标签分配与参数估计误差和策略次优性联系起来。这为预算受限的比较策展提供了显式优化准则，并激发了从大型生成补全池中选择信息对的实际采样设计。在合成设置和语言模型后训练基准上的实验表明，所提出的设计在样本效率上持续优于常见的比较选择启发式方法。

英文摘要

Preference-based post-training has become a central paradigm for aligning language models. A common data-collection strategy is to generate a small set of completions for each prompt and label the resulting comparison pairs. However, human preference labels are often much more expensive than generating additional completions, suggesting a different use of the same labeling budget: generate a larger pool of completions, but label only the most informative comparison pairs. This paper studies which pairs should be compared in preference-based post-training. We formulate comparison curation as a sampling-design problem and evaluate designs by the quality of the final policy under the preference-based post-training objective. We instantiate this framework for Direct Preference Optimization (DPO), analyzing how the choice of labeled pairs propagates through DPO training to downstream policy performance. Our main results provide matching upper and lower bounds on the post-training optimality gap of the DPO-trained policy. The bounds show that comparison selection affects downstream performance through a single design-dependent information matrix, which links label allocation to parameter estimation error and policy suboptimality. This yields an explicit optimization criterion for budgeted comparison curation and motivates practical sampling designs for selecting informative pairs from large generated completion pools. Experiments on synthetic settings and language-model post-training benchmarks show that the proposed designs consistently improve sample efficiency over common comparison-selection heuristics.

URL PDF HTML ☆

赞 0 踩 0

2606.19491 2026-06-19 cs.LG stat.ML 新提交

Algebraic Dead Directions in LayerNorm Transformers: A Forward-Pass-Only Diagnostic at LLM Scale

LayerNorm Transformer 中的代数死方向：一种仅需前向传播的大语言模型规模诊断方法

Tejas Pradeep Shirodkar, P. J. Narayanan

发表机构 * IIIT, Hyderabad（海得拉巴国际信息技术学院）

AI总结本文发现 LayerNorm 的逆尺度方向是后最终归一化中心激活协方差矩阵的精确代数核，可仅从参数中读取死方向，无需前向或后向传播，并在 14 个预训练模型上验证了其有效性。

Comments 34 pages, 7 figures, 6 tables. Empirical companion to arXiv:2606.05957

详情

AI中文摘要

预训练 Transformer 位于损失函数的奇异极小值附近，此时 Fisher 信息度量沿死方向退化：参数空间中方向性 Fisher 为零的方向。通常定位这样的方向需要一次前向传播和激活矩阵的特征分解，或基于采样的复杂度估计；没有一种方法能仅从网络参数计算方向。我们针对 LayerNorm Transformer 给出了一个这样的方向。LayerNorm 仿射的逆尺度方向 $\gamma^{-1}/\|\gamma^{-1}\|$ 是后最终归一化中心激活协方差矩阵的精确代数核，适用于任何输入分布，并在参数空间中诱导出相应的死方向。它仅从 LN 尺度参数读取，无需前向或后向传播，无需特征分解：这是针对 LayerNorm 的最廉价死方向读取方法。我们在 14 个预训练 Transformer（9 个 LayerNorm，5 个 RMSNorm；160M-35B；语言和视觉目标）上进行了测试。在随机初始化时，预测方向与测量的底部奇异方向（一次前向传播，直接 SVD）在 9/9 的 LayerNorm 模型上匹配到小数点后四位，并在 5/5 的 RMSNorm 模型上正确缺失，后者缺乏产生该方向的均值减法投影器。在训练后的检查点上，沿该方向的协方差特征值加深约 ${\sim}10^3$ 倍，并打开更多死方向；随机初始化到训练后的差距是一次前向传播、每检查点沿预测坐标的奇异结构读出。由此得出两个闭式结论：残差流的最小奇异值在 13/14 个 Transformer 上逐块保持不变（在其自身输入分布上测量），唯一的例外（Gemma$4$-$31$B）是一个真正的死方向，同一读出可精确定位；核方向的存在从参数本身即可对 Transformer 的归一化进行分类。

英文摘要

Pretrained transformers sit near singular minima of the loss, where the Fisher information metric degenerates along dead directions: directions in parameter space along which the directional Fisher vanishes. Locating such a direction normally needs a forward pass and an eigendecomposition of activations, or a sampling-based complexity estimate; none returns a direction computable from the network's parameters alone. We give one, for LayerNorm transformers. The inverse-scale direction $γ^{-1}/\|γ^{-1}\|$ of the LayerNorm affine is an exact algebraic kernel of the post-final-norm centred activation covariance, for any input distribution, and induces a corresponding dead direction in parameter space. It is read from the LN scale parameter alone, with no forward or backward pass and no eigensolve: the cheapest dead-direction read, specific to LayerNorm. We test it on $14$ pretrained transformers ($9$ LayerNorm, $5$ RMSNorm; $160$M-$35$B; language and vision objectives). At random initialisation the predicted direction matches the measured bottom singular direction (one forward pass, direct SVD) to four decimal places on $9/9$ LayerNorm models, and is correctly absent on $5/5$ RMSNorm models, which lack the mean-subtraction projector that creates it. On the trained checkpoint the covariance eigenvalue along this direction deepens by ${\sim}10^3\times$ and further dead directions open; the random-init-to-trained gap is a one-forward-pass, per-checkpoint readout of singular structure along the predicted coordinate. Two consequences follow in closed form: the residual stream's smallest singular value is preserved block-to-block on $13/14$ transformers measured on their own input distribution, the one exception (Gemma$4$-$31$B) a genuine dead direction the same read pinpoints; and the kernel direction's presence classifies a transformer's normalisation from the parameters alone.

URL PDF HTML ☆

赞 0 踩 0

2606.20557 2026-06-19 cs.LG math.ST stat.ML stat.TH 新提交

Optimal Deterministic Multicalibration and Omniprediction

最优确定性多校准与全预测

Georgy Noarov, Aaron Roth

发表机构 * University of Pennsylvania（宾夕法尼亚大学）

AI总结本文提出一种确定性算法，实现多校准的极小化最优样本复杂度，并推广到结果不可区分性，解决确定性预测器是否必要的问题。

详情

AI中文摘要

一个模型在一组群体权重 $G$ 上是多校准的，如果它是校准的——即即使以其预测为条件也是无偏的——不仅整体上，而且在通过每个 $g \in G$ 对上下文重新加权后也是如此。这对于许多下游应用是一个有用的性质，也是可信机器学习的基本要求。在这项工作之前，所有已知达到 $\varepsilon$-多校准的极小化最优 $\widetilde O(\varepsilon^{-3})$ 样本复杂度的预测器都是随机化的，而确定性预测器仅以更差的样本复杂度已知。多校准中随机化对于最优样本复杂度是否必要的问题由 [CLNR26] 明确提出，并在之前的几项工作中隐含提出。我们通过给出一个输出确定性预测器的极小化最优多校准算法解决了这个开放问题。然后我们将该算法推广到产生满足关于有限或有限覆盖测试集合的结果不可区分性（OI）的最优确定性预测器。作为一个应用，这也给出了具有最优样本复杂度的确定性全预测器和泛预测器，解决了 [OKK25] 和 [BHHLZ25] 提出的开放问题。

英文摘要

A model is multicalibrated on a collection of group weights $G$ if it is calibrated -- i.e. unbiased even conditional on its prediction -- not just overall, but also after reweighting contexts by each $g \in G$. It is a useful property for many downstream applications and is a basic desideratum of trustworthy machine learning. Before this work, all predictors known to attain the minimax-optimal $\widetilde O(\varepsilon^{-3})$ sample complexity rate for $\varepsilon$-multicalibration were randomized, while deterministic predictors were known only with substantially worse sample complexity. Whether randomization is necessary for optimal sample complexity in multicalibration was explicitly asked by [CLNR26] and implicitly in several prior works. We resolve this open problem by giving a minimax-optimal multicalibration algorithm that outputs a deterministic predictor. We then generalize the algorithm to produce optimal deterministic predictors that satisfy outcome indistinguishability (OI) with respect to finite or finitely covered collections of tests. As an application, this also gives deterministic omnipredictors and panpredictors with optimal sample complexity, resolving open problems posed by [OKK25] and [BHHLZ25].

URL PDF HTML ☆

赞 0 踩 0

2606.20022 2026-06-19 stat.ML cs.LG math.OC 新提交

Stochastic Linear Contextual Bandits with Bounded Noise: A Set-Membership Approach

具有有界噪声的随机线性上下文赌博机：一种集合成员方法

Haonan Xu, Yingying Li

AI总结针对有界奖励噪声的随机线性上下文赌博机，提出基于集合成员估计和乐观原则的SME-OFU算法，实现O(log T)的遗憾界，优于次高斯噪声下的最优界。

Comments 23 pages, 1 figure

详情

AI中文摘要

本文考虑具有有界奖励噪声的随机线性上下文赌博机（SLCB）。现有工作通常假设次高斯奖励噪声和有界期望奖励，在此条件下最优遗憾界关于时间T为$\tilde{O}(\sqrt{T})$。然而，在许多应用中，实现/观测到的奖励也自然有界，这意味着奖励噪声有界。有界噪声比次高斯条件更具信息性，但在SLCB文献中尚未被明确利用。本文通过利用一种称为集合成员估计（SME）的不确定性量化方法，并应用面对不确定性的乐观原则（OFU），提出了一种新颖的算法SME-OFU。我们的算法享有改进的遗憾界$O(\log T)$。注意，这并不与次高斯噪声下现有的最优界$\tilde{O}(\sqrt{T})$矛盾，因为有界噪声是更强的条件。最后，仿真表明，当奖励噪声有界时，SME-OFU相对于为次高斯噪声设计的基准算法在经验上有所改进。

英文摘要

This paper considers stochastic linear contextual bandits (SLCB) with bounded reward noise. Existing works typically assume sub-Gaussian reward noise and bounded expected rewards, under which the optimal regret bound scales as $\tilde{O}(\sqrt{T})$ in terms of horizon $T$. However, in many applications, realized/observed rewards are also naturally bounded, implying bounded reward noise. Bounded noise is more informative than the sub-Gaussian condition but has not been leveraged explicitly in the SLCB literature. In this paper, we propose a novel algorithm SME-OFU by utilizing an uncertainty quantification method called set-membership estimation (SME) and applying the principle of optimism in the face of uncertainty (OFU). Our algorithm enjoys an improved regret bound $O(\log T)$. Notice that this does not contradict the existing optimal bound $\tilde{O}(\sqrt{T})$ for sub-Gaussian noise because bounded noise is a stronger condition. Finally, simulations show empirical improvements of SME-OFU over a benchmark algorithm designed for sub-Gaussian noise when the reward noise is bounded.

URL PDF HTML ☆

赞 0 踩 0

2606.19878 2026-06-19 cs.LG math.OC stat.ML 新提交

On the Oracle Complexity of Interpolation-Based Gradient Descent

基于插值的梯度下降的预言复杂度

Dongmin Lee, William Lu, Anuran Makur

发表机构 * Purdue University（普渡大学）

AI总结提出分段多项式插值梯度下降（PPI-GD）方法，通过数据域等距点查询一阶预言构造多项式插值近似全梯度，在强凸和非凸损失下分析预言复杂度，证明在数据维数受限且损失足够光滑时优于多种GD变体。

Comments 16 pages, 2 figures

详情

DOI: 10.1109/TAC.2026.3682210

AI中文摘要

最近关于经验风险最小化（ERM）的一阶优化器的工作表明，可以利用ERM损失函数在训练数据中的光滑性（而非优化参数中的光滑性）来改进梯度下降（GD）方法的预言复杂度。在本文中，我们提出了一种不精确梯度方法——分段多项式插值梯度下降（PPI-GD），该方法通过在数据域中的等距点处查询一阶预言来近似每次迭代中的全梯度，从而在数据域的适当大小的块上构造所得梯度样本的多项式插值。我们分析了PPI-GD在强凸和非凸损失函数下的预言复杂度，其中数据空间维数以训练样本数量的多对数函数为界，并发现当损失函数足够光滑时，PPI-GD在关键区域优于几种GD变体。此外，我们的分析将双三次样条插值误差分析中的几种技术扩展到$d$变量张量积多项式插值的设置中，这可能对插值分析具有独立意义。

英文摘要

Recent work on first-order optimizers for empirical risk minimization (ERM) has suggested that smoothness of ERM loss functions in the training data, rather than in the optimization parameters, can be leveraged to improve the oracle complexity of gradient descent (GD) methods. In this paper, we propose an inexact gradient method, piecewise polynomial interpolation-based gradient descent (PPI-GD), which approximates the full gradient in each iteration by querying the first-order oracle at equidistant points in the data domain to construct polynomial interpolants of the resulting gradient samples over appropriately sized patches of the data domain. We analyze the oracle complexity of PPI-GD for strongly convex and non-convex loss functions when the data space dimension is bounded by a polylogarithmic function of the number of training samples, and find it to outperform several GD variants in key regimes when the loss function is sufficiently smooth. Furthermore, our analysis extends several techniques from the error analysis of bicubic spline interpolants to the setting of $d$-variate tensor product polynomial interpolants which may be of independent interest in interpolation analysis.

URL PDF HTML ☆

赞 0 踩 0

2606.20356 2026-06-19 math.OC cs.AI cs.LG math.PR stat.ML 新提交

Robust $Q$-learning for mean-field control under Wasserstein uncertainty in common noise

公共噪声Wasserstein不确定性下的平均场控制鲁棒$Q$-学习

Mathieu Laurière, Ariel Neufeld, Kyunghyun Park

AI总结提出一种针对公共噪声分布Wasserstein不确定性的离散时间平均场控制鲁棒$Q$-学习算法，结合量化投影与Wasserstein对偶，证明同步和异步学习的收敛性及有限时间界，并在系统风险和流行病模型中验证鲁棒性-性能权衡。

2606.20299 2026-06-19 stat.ML cs.LG hep-ph physics.data-an 新提交

Statistical Properties of Training & Generalization

训练与泛化的统计特性

Itay Lavie, Noam Levi, Yonatan Kahn

AI总结从物理学角度研究深度学习的关键特征和意外现象，回顾神经缩放定律及其与物理问题中约束和归纳偏置的相互作用。

Comments 32 pages, 3 figures. Part of the VERaiPHY initiative

2605.02989 2026-06-19 cs.IT eess.SP math.IT stat.ML 版本更新

Information Theory and Statistical Learning

信息论与统计学习

Abbas El Gamal

AI总结本文是Cover & Thomas《信息论基础》第三版的章节预印本，系统介绍了散度度量在模型训练中的作用，涵盖线性回归、生成扩散模型等，并给出了扩散模型更系统的推导。

详情

AI中文摘要

本手稿包含即将出版的《Cover and Thomas信息论基础》第三版中一章的预印本，经Wiley许可发布。新版的目录EIT-3 ToC可在此https URL找到。反馈请联系abbas@ee. this http URL。学习与信息论在模型训练和基本性能极限的表征中均有交叉。本手稿对第一个交叉点进行了简洁易懂的处理，仅需高年级本科生或一年级研究生水平的信息论和统计学基础知识。章末习题使材料既适合课堂使用也适合自学。本章重点讨论散度度量在模型训练中的作用，示例涵盖从线性回归、逻辑回归到自回归模型、变分自编码器、扩散模型、生成对抗网络和基于分数的模型。介绍了证据下界（ELBO）、f-散度和Fisher散度。特别是，对生成扩散模型的处理提供了比文献中更系统、更明确的推导。

英文摘要

This manuscript contains preprint of a chapter under consideration for inclusion in the forthcoming third edition of {\em Cover and Thomas's Elements of Information Theory}, posted with permission from Wiley. The table of contents EIT-3 ToC of the new edition can be found at: https://docs.google.com/document/d/1L-m4oQEJw1PJhoxBeMwrrBD8S_HmvzMEkPbYvS24980/edit?usp=sharing . For feedback, please contact abbas@ee.stanford.edu Learning and information theory intersect in both model training and the characterization of fundamental performance limits. This manuscript provides a concise and accessible treatment of the first intersection, requiring only basic background in information theory and statistics at the senior undergraduate or first-year graduate level. End-of-chapter exercises make the material well suited for classroom use as well as self-study. The chapter focuses on the role of divergence measures in model training, with examples ranging from linear and logistic regression to autoregressive models, variational autoencoders, diffusion models, generative adversarial networks, and score-based models. It introduces the evidence lower bound (ELBO), f-divergences, and the Fisher divergence. In particular, the treatment of the generative diffusion model provides a more systematic and explicit derivation than is typical in the literature.

URL PDF HTML ☆

赞 0 踩 0

2605.18315 2026-06-19 math.OC stat.ML 版本更新

Attention-based PCA

基于注意力的PCA

Rodrigo Maulen-Soto, Claire Boyer

AI总结本文研究了注意力机制在无监督问题PCA中的表现，证明在高斯数据上训练时，softmax和线性注意力层学习的参数与协方差矩阵的主特征向量对齐，建立了与PCA的直接联系，并扩展到上下文设置中。

详情

AI中文摘要

我们通过一个经典无监督问题——主成分分析（PCA）的视角研究注意力机制。我们证明，当在高斯数据上训练时，softmax和线性注意力层学习的参数与协方差矩阵的主特征向量对齐，从而建立了与PCA的直接且明确的联系。我们的分析涵盖了有限和无限提示范围。在无限提示极限下，我们证明收敛到与主谱方向对齐的全局最优解；而在有限提示设置中，我们显示相同的行为在采样效应范围内出现。我们进一步将分析扩展到具有突出Wishart协方差的上下文设置中，其中注意力成功地恢复了底层信号方向。这些结果表明，在无监督目标下，注意力本质上执行类似于PCA的计算，为其实现表示学习能力提供了理论基础。

英文摘要

We study attention mechanisms through the lens of a canonical unsupervised problem: principal component analysis (PCA). We show that, when trained on Gaussian data, both softmax and linear attention layers learn parameters that align with the principal eigenvectors of the covariance matrix, thereby establishing a direct and explicit connection with PCA. Our analysis covers both finite and infinite prompt regimes. In the infinite-prompt limit, we prove convergence to globally optimal solutions aligned with the leading spectral direction, while in the finiteprompt setting we show that the same behavior emerges up to sampling effects. We further extend the analysis to an in-context setting with spiked Wishart covariances, where attention successfully recovers the underlying signal direction. These results demonstrate that attention inherently performs PCA-like computations under unsupervised objectives, providing a theoretical foundation for its representation-learning capabilities.

URL PDF HTML ☆

赞 0 踩 0

2604.21097 2026-06-19 stat.ML cs.LG 版本更新

Learning to Emulate Chaos: Adversarial Optimal Transport Regularization

学习模拟混沌：对抗最优传输正则化

Gabriel Melo, Leonardo Santiago, Peter Y. Lu

发表机构 * Department of Mechanical and Aerospace Engineering, North Carolina State University, Raleigh, NC（北卡罗来纳州立大学机械与航空航天工程系）； Department of Electrical and Computer Engineering, Tufts University, Medford, MA（塔夫茨大学电气与计算机工程系）； Work performed while at the University of Campinas（在坎皮纳斯大学工作期间）

AI总结针对混沌动力学模拟中长程统计保真度低的问题，提出基于对抗最优传输的目标函数，联合学习高质量汇总统计量和物理一致的模拟器，理论分析与实验验证了Sinkhorn散度和WGAN对偶形式的有效性。

详情

AI中文摘要

混沌出现在许多复杂动力系统中，从天气到电网，但使用机器学习模拟器等数据驱动方法难以准确建模。虽然模拟器是加速模拟和解决逆问题的有前途的工具，但它们仍然难以学习混沌动力学，其中对初始条件的敏感性使得精确的长期预测不可行，尤其是在给定噪声数据的情况下。最近的工作转而训练模拟器以匹配混沌吸引子的统计特性，但这些方法通常依赖于手工制作的汇总统计量或大型、多样的多环境数据集。在这项工作中，我们提出了一类对抗最优传输目标，可以从单个噪声轨迹中联合学习高质量的汇总统计量和物理一致的模拟器。我们从理论上分析并实验验证了我们的方法的Sinkhorn散度公式（2-Wasserstein）和WGAN风格的对偶公式（1-Wasserstein）。在各种混沌系统（包括具有高维时空混沌的系统）上的数值实验表明，使用我们提出的目标训练的模拟器具有显著改善的长期统计保真度。

英文摘要

Chaos arises in many complex dynamical systems, from weather to power grids, but is difficult to accurately model with data-driven methods such as machine learning emulators. While emulators are promising tools for accelerating simulations and solving inverse problems, they still struggle to learn chaotic dynamics, where sensitivity to initial conditions renders exact long-term forecasts infeasible, especially given noisy data. Recent work instead trains emulators to match the statistical properties of chaotic attractors, but these approaches often rely on handcrafted summary statistics or large, diverse multi-environment datasets. In this work, we propose a family of adversarial optimal transport objectives that can jointly learn high-quality summary statistics and a physically consistent emulator from a single noisy trajectory. We theoretically analyze and experimentally validate a Sinkhorn divergence formulation (2-Wasserstein) and a WGAN-style dual formulation (1-Wasserstein) of our approach. Numerical experiments across a variety of chaotic systems, including ones with high-dimensional spatiotemporal chaos, show that emulators trained using our proposed objectives have significantly improved long-term statistical fidelity.

URL PDF HTML ☆

赞 0 踩 0

2604.03146 2026-06-19 stat.ML cs.LG 版本更新

Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization

高维经验风险最小化中高斯普适性破坏的表征

Chiheb Yaakoubi, Cosme Louart, Malik Tiomoko, Zhenyu Liao

发表机构 * School of Data Science, The Chinese University of Hong Kong, Shenzhen, China ； Huawei Noah's Ark Lab, Huawei Technologies, Paris, France ； School of Electronic Information ； Communications, Huazhong University of Science \& Technology, China

AI总结通过将凸高斯极小极大定理推广到非高斯数据，刻画了高维经验风险最小化估计量的渐近分布，揭示了高斯普适性的适用范围与局限。

Comments 28 pages, 5 figures, 1 table

Journal ref ICML 2026

详情

AI中文摘要

我们研究了一般非高斯数据设计下的高维凸经验风险最小化（ERM）。通过启发式地将凸高斯极小极大定理（CGMT）扩展到非高斯设置，我们推导出关键统计量的渐近极小极大表征，从而能够近似ERM估计量 $\hat{\theta}$ 的均值 $\mu_{\hat{\theta}}$ 和协方差 $C_{\hat{\theta}}$。具体地，在数据矩阵的集中假设以及损失和正则化子的标准正则性条件下，我们证明：对于独立于训练数据的测试协变量 $x$，投影 $\hat{\theta}^\top x$ 近似遵循 $\mu_{\hat{\theta}}^\top x$ 的一般非高斯分布与一个独立中心高斯变量（方差为 $\mathrm{tr}(C_{\hat{\theta}} \mathbb{E}[xx^\top])$）的卷积。这一结果阐明了ERM高斯普适性的范围和局限。此外，我们证明任何 $\mathcal{C}^2$ 正则化子渐近等价于一个由其零点的Hessian矩阵和 $\mu_{\hat{\theta}}$ 处的梯度唯一确定的二次型。我们提供了跨不同损失和模型的数值模拟，以验证我们的理论预测和定性见解。

英文摘要

We study high-dimensional convex empirical risk minimization (ERM) under general non-Gaussian data designs. By heuristically extending the Convex Gaussian Min-Max Theorem (CGMT) to non-Gaussian settings, we derive an asymptotic min-max characterization of key statistics, enabling approximation of the mean $μ_{\hatθ}$ and covariance $C_{\hatθ}$ of the ERM estimator $\hatθ$. Specifically, under a concentration assumption on the data matrix and standard regularity conditions on the loss and regularizer, we show that for a test covariate $x$ independent of the training data, the projection $\hatθ^\top x$ approximately follows the convolution of the generally non-Gaussian distribution of $μ_{\hatθ}^\top x$ with an independent centered Gaussian variable of variance $\mathrm{tr}(C_{\hatθ} \mathbb{E}[xx^\top])$. This result clarifies the scope and limits of Gaussian universality for ERMs. Additionally, we prove that any $\mathcal{C}^2$ regularizer is asymptotically equivalent to a quadratic form determined solely by its Hessian at zero and gradient at $μ_{\hatθ}$. Numerical simulations across diverse losses and models are provided to validate our theoretical predictions and qualitative insights.

URL PDF HTML ☆

赞 0 踩 0

2603.10184 2026-06-19 stat.ML cs.LG 版本更新

Stabilizing Bandits using Regularization: Precise Regret and A Quantitative Central Limit Theorem

使用正则化稳定赌博机：精确遗憾与定量中心极限定理

Budhaditya Halder, Ishan Sengupta, Koustav Chowdhury, Samya Praharaj, Koulik Khamaru

发表机构 * Department of Statistics, Rutgers University（罗切斯特大学统计系）； Indian Statistical Institute, Kolkata（加尔各答印度统计研究所）

AI总结本文提出一种精细的稳定性条件，证明正则化随机镜像下降算法满足该条件，并推导出自适应采样下经验奖励估计的非渐近Berry-Esseen界、匹配的遗憾上下界，以及抗腐败下的渐近正态性，同时揭示正则化是有效推断的必要代价。

Comments Updated rate of convergence and precise regret in version 2

详情

AI中文摘要

由于自适应采样违反了经典渐近理论中的独立性假设，使用赌博机数据进行统计推断面临根本性挑战。近期工作将稳定性~\citep{laiwei82} 确定为自适应下有效推断的充分条件。本文首先提出一个精细的稳定性条件，以在线算法的迭代形式表述，并证明一大类正则化随机镜像下降算法满足该条件。这一精细条件使我们能够在多个方面加强~\citet{laiwei82} 的渐近结果。首先，我们推导出自适应采样下经验奖励估计的非渐近Berry-Esseen界。其次，我们推导出所提算法遗憾的匹配非渐近上下界，从而精确刻画其遗憾。第三，我们证明这些正则化算法在给定水平的对抗性腐败下保持渐近正态性和有效推断。最后，我们表明正则化是必要的而非偶然的：Lai-Wei稳定性与最优的$O(\sqrt{T})$遗憾率（如EXP3等非正则化算法所达到的）不相容，因此受控的多对数级遗憾膨胀是有效推断的代价。

英文摘要

Statistical inference with bandit data presents fundamental challenges owing to adaptive sampling, which violates the independence assumptions underlying classical asymptotic theory. Recent work has identified stability~\citep{laiwei82} as a sufficient condition for valid inference under adaptivity. This paper first provides a refined stability condition, stated in terms of the iterates of an online algorithm, and shows that a large class of regularized stochastic-mirror-descent-style algorithms satisfy it. This refined condition allows us to strengthen the asymptotic results of~\citet{laiwei82} in several ways. First, we derive a non-asymptotic Berry--Esseen bound for the empirical reward estimates under adaptive sampling. Second, we derive matching non-asymptotic upper and lower bounds on the regret of the proposed algorithm, yielding a precise characterization of its regret. Third, we show that these regularized algorithms preserve asymptotic normality and valid inference under a prescribed level of adversarial corruption. Finally, we show that regularization is necessary rather than incidental: Lai--Wei stability is incompatible with the optimal $O(\sqrt{T})$ regret rate -- the rate attained by unregularized algorithms such as EXP3 -- so that a controlled, polylogarithmic inflation in regret is the price of valid inference.

URL PDF HTML ☆

赞 0 踩 0

2601.14430 2026-06-19 stat.ML cs.LG 版本更新

Meta Flow Maps enable scalable reward alignment

元流映射实现可扩展的奖励对齐

Peter Potaptchik, Adhi Saravanan, Abbas Mammadov, Alvaro Prat, Michael S. Albergo, Yee Whye Teh

发表机构 * University of Oxford（牛津大学）； Harvard University（哈佛大学）； Kempner Institute（凯普纳研究所）

AI总结提出元流映射（MFMs）框架，通过可微分的单步后验采样实现高效价值函数估计，从而无需轨迹模拟即可进行推理时引导和离策略微调，显著降低计算成本。

详情

AI中文摘要

控制生成模型在计算上是昂贵的。这是因为与奖励函数的最优对齐——无论是通过推理时引导还是微调——都需要估计价值函数。这一任务需要访问条件后验 $p_{1|t}(x_1|x_t)$，即与中间状态 $x_t$ 一致的干净数据 $x_1$ 的分布，这一要求通常迫使方法诉诸昂贵的轨迹模拟。为了解决这一瓶颈，我们引入了元流映射（MFMs），这是一个将一致性模型和流映射扩展到随机机制的框架。MFMs 被训练为执行随机单步后验采样，从任意中间状态生成任意多个独立同分布的干净数据 $x_1$ 样本。关键在于，这些样本提供了一个可微分的重参数化，从而解锁了高效的价值函数估计。我们利用这一能力解决了两种范式中的瓶颈：实现无需内部展开的推理时引导，并促进对一般奖励的无偏、离策略微调。实验上，我们的单粒子引导 MFM 采样器在 ImageNet 上以极少的计算量在多个奖励上优于 Best-of-1000 基线。

英文摘要

Controlling generative models is computationally expensive. This is because optimal alignment with a reward function--whether via inference-time steering or fine-tuning--requires estimating the value function. This task demands access to the conditional posterior $p_{1|t}(x_1|x_t)$, the distribution of clean data $x_1$ consistent with an intermediate state $x_t$, a requirement that typically compels methods to resort to costly trajectory simulations. To address this bottleneck, we introduce Meta Flow Maps (MFMs), a framework extending consistency models and flow maps into the stochastic regime. MFMs are trained to perform stochastic one-step posterior sampling, generating arbitrarily many i.i.d. draws of clean data $x_1$ from any intermediate state. Crucially, these samples provide a differentiable reparametrization that unlocks efficient value function estimation. We leverage this capability to solve bottlenecks in both paradigms: enabling inference-time steering without inner rollouts, and facilitating unbiased, off-policy fine-tuning to general rewards. Empirically, our single-particle steered-MFM sampler outperforms a Best-of-1000 baseline on ImageNet across multiple rewards at a fraction of the compute.

URL PDF HTML ☆

赞 0 踩 0

2509.15822 2026-06-19 stat.ML cs.LG math.PR math.ST stat.TH 版本更新

Phase Transition for Stochastic Block Model with more than $\sqrt{n}$ Communities

具有多于 $\sqrt{n}$ 个社区的随机块模型的相变

Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen

发表机构 * Institut für Mathematik – Universität Potsdam, Potsdam, Germany（波恩大学数学研究所，德国波恩）； Laboratoire de Mathématiques d’Orsay, Université Paris-Saclay, CNRS, France（奥赛数学实验室，巴黎-萨克雷大学，法国 CNRS）； INRAE, Institut Agro, MISTEA, Univ. Montpellier, France（国家农业研究院，蒙彼利埃大学，法国）

AI总结本文证明在随机块模型中，当社区数 $K\geq \sqrt{n}$ 时，低度多项式在 Chin 等人提出的阈值以下无法恢复社区，而通过计数特定子图可在多项式时间内实现恢复，支持了新相变阈值的猜想。

详情

AI中文摘要

统计物理的预测表明，在随机块模型（SBM）中，当社区数 $K$ 固定时，社区恢复在 Kesten-Stigum (KS) 阈值以上（且仅在其以上）可以在多项式时间内实现。这一猜想催生了丰富的文献，证明在 KS 阈值以上的 SBM 中，非平凡社区恢复确实是可能的。只要 $K\ll \sqrt{n}$（其中 $n$ 是观测图中的节点数），KS 阈值以下低度多项式（LDP）的失败也被证明。当 $K\geq \sqrt{n}$ 时，Chin 等人（2025）最近证明，在稀疏机制中，通过计数非回溯路径，可以在 KS 阈值以下的多项式时间内实现社区恢复。这一突破使他们提出了多社区机制 $K\geq \sqrt{n}$ 的新阈值。在这项工作中，我们为他们的猜想提供了证据：\n1- 我们证明，对于任意图密度，LDP 无法在 Chin 等人（2025）提出的阈值以下恢复社区；\n2- 我们证明，在所提出的阈值以上，不仅是在 Chin 等人（2025）考虑的稀疏机制中，而且在适度稀疏机制中，通过计数受 LDP 分析启发的某些特定子图，可以在多项式时间内实现社区恢复。\n特别地，计数长度为 $\log(n)$ 的自避路径（这与基于非回溯算子的谱算法密切相关）仅在稀疏机制中是最优的。在更密集的机制中，必须考虑基于循环放大的更复杂子图。

英文摘要

Predictions from statistical physics postulate that recovery of the communities in the Stochastic Block Model (SBM) with a fixed number $K$ of communities is possible in polynomial time above, and only above, the Kesten-Stigum (KS) threshold. This conjecture has given rise to a rich literature, proving that non-trivial community recovery is indeed possible in SBM above the KS threshold. Failure of low-degree polynomials (LDP) below the KS threshold was also proven, as long as $K\ll \sqrt{n}$, where $n$ is the number of nodes in the observed graph. When $K\geq \sqrt{n}$, Chin et al.(2025) recently proved that, in a \emph{sparse regime}, community recovery in polynomial time is possible below the KS threshold by counting non-backtracking paths. This breakthrough led them to postulate a new threshold for the many-communities regime $K\geq \sqrt{n}$. In this work, we provide evidence supporting their conjecture:\\ 1- We prove that, for \emph{any graph density}, LDP fail to recover communities below the threshold postulated by Chin et al.(2025) ;\\ 2- We prove that community recovery is possible in polynomial time above the postulated threshold, not only in the \emph{sparse regime} considered in Chin et al.~(2025), but also in \emph{moderately sparse regimes}, by counting occurrences of some specific motifs inspired by the LDP analysis.\\ In particular, counting self-avoiding paths of length $\log(n)$, which is closely related to spectral algorithms based on the Non-Backtracking operator, is optimal only in the sparse regime. More complex motifs based on the blow-up of a cycle must be considered in denser regimes.

URL PDF HTML ☆

赞 0 踩 0

2104.08928 2026-06-19 stat.ML cs.CL cs.LG 版本更新

Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings

面向词嵌入迁移学习的组稀疏矩阵分解

Kan Xu, Xuanyi Zhao, Hamsa Bastani, Osbert Bastani

发表机构 * W. P. Carey School of Business, Arizona State University（亚利桑那州立大学韦伯商学院）； University of Pennsylvania（宾夕法尼亚大学）； Wharton School, University of Pennsylvania（宾夕法尼亚大学沃顿商学院）

AI总结提出一种基于组稀疏惩罚的两阶段估计器，通过结合大规模语料和少量领域数据高效迁移学习领域特定的词嵌入，并证明了其泛化误差界和非凸目标函数的局部最优与全局最优统计等价。

详情

AI中文摘要

非结构化文本为许多领域的决策者提供了丰富的数据源，从零售中的产品评论到医疗保健中的护理记录。为了利用这些信息，单词通常通过无监督学习算法（如矩阵分解）转化为词嵌入——编码单词之间语义关系的向量。然而，从训练数据有限的新领域学习词嵌入可能具有挑战性，因为在新领域中含义/用法可能不同，例如，单词“positive”通常具有积极情感，但在医疗记录中通常具有消极情感，因为它可能意味着患者检测出疾病阳性。在实践中，我们预计只有少数领域特定的单词可能具有新含义。我们提出了一种直观的两阶段估计器，通过组稀疏惩罚利用这种结构，通过结合大规模文本语料库（如维基百科）和有限的领域特定文本数据，高效地迁移学习领域特定的词嵌入。我们限定了迁移学习估计器的泛化误差，证明当只有少量嵌入在领域间改变时，它可以用显著更少的领域特定数据实现高精度。此外，我们证明了在标准正则化条件下，由非凸目标函数识别的所有局部最小值与全局最小值在统计上不可区分，这意味着我们的估计器可以高效计算。我们的结果首次给出了组稀疏矩阵分解的界限，这可能具有独立意义。我们通过与自然语言处理中最先进的微调启发式方法进行实证比较来评估我们的方法。

英文摘要

Unstructured text provides decision-makers with a rich data source in many domains, ranging from product reviews in retail to nursing notes in healthcare. To leverage this information, words are typically translated into word embeddings -- vectors that encode the semantic relationships between words -- through unsupervised learning algorithms such as matrix factorization. However, learning word embeddings from new domains with limited training data can be challenging, because the meaning/usage may be different in the new domain, e.g., the word ``positive'' typically has positive sentiment, but often has negative sentiment in medical notes since it may imply that a patient tested positive for a disease. In practice, we expect that only a small number of domain-specific words may have new meanings. We propose an intuitive two-stage estimator that exploits this structure via a group-sparse penalty to efficiently transfer learn domain-specific word embeddings by combining large-scale text corpora (such as Wikipedia) with limited domain-specific text data. We bound the generalization error of our transfer learning estimator, proving that it can achieve high accuracy with substantially less domain-specific data when only a small number of embeddings are altered between domains. Furthermore, we prove that all local minima identified by our nonconvex objective function are statistically indistinguishable from the global minimum under standard regularization conditions, implying that our estimator can be computed efficiently. Our results provide the first bounds on group-sparse matrix factorization, which may be of independent interest. We empirically evaluate our approach compared to state-of-the-art fine-tuning heuristics from natural language processing.

URL PDF HTML ☆

赞 0 踩 0

2606.20341 2026-06-19 stat.ME stat.AP 新提交

Anchors Away: Navigating Unanchored Indirect Comparisons with Multilevel Unanchored Meta-Regression (ML-UMR)

锚定之外：使用多层次非锚定元回归（ML-UMR）导航非锚定间接比较

Conor Chandler, Jack Ishak

AI总结针对随机证据缺失时的非锚定治疗比较，提出多层次非锚定元回归（ML-UMR），通过贝叶斯框架联合建模个体与汇总数据，估计多治疗、多研究及目标人群的边际和条件效应，并明确识别假设与可转移性假设。

Comments 20 pages (excluding supplementary material), 5 figures

详情

AI中文摘要

当随机证据不可用时，使用单臂研究或断开证据的非锚定间接治疗比较越来越多地用于卫生技术评估（HTA）。现有方法，包括匹配调整间接比较（MAIC）和模拟治疗比较（STC），通常局限于成对设置，并且通常估计比较研究人群中的边际效应，这可能与决策相关人群不同。我们提出多层次非锚定元回归（ML-UMR），一种用于综合来自完全断开证据的个体患者数据和汇总数据的贝叶斯回归框架。ML-UMR通过在一个统一似然中联合建模个体水平和汇总水平数据，将多层次网络元回归（ML-NMR）扩展到非锚定设置，从而能够估计跨多个治疗、研究和目标人群的治疗特异性结果以及边际和条件效应。ML-UMR区分了识别治疗效应所需的假设与将结果转移到目标人群所需的假设。与所有非锚定比较一样，有效推断依赖于强且通常不可验证的假设，包括条件可交换性、结果模型的正确设定以及跨治疗假设（例如，共享预后因素假设（SPFA））。ML-UMR并未减轻这些要求，而是在统一框架内使其明确，并促进敏感性分析。在模拟研究中，ML-UMR对比较人群效应产生了低偏差和名义覆盖。向其他人群的可转移性关键取决于识别假设：在强效应修饰下，违反SPFA导致偏差，而纳入亚组信息则恢复了近乎无偏的估计和名义覆盖。

英文摘要

Unanchored indirect treatment comparisons using single-arm studies or disconnected evidence are increasingly used in health technology assessment (HTA) when randomized evidence is unavailable. Existing methods, including matching-adjusted indirect comparison (MAIC) and simulated treatment comparison (STC), are generally limited to pairwise settings and typically estimate marginal effects in the comparator study population, which may differ from the decision-relevant population. We propose multilevel unanchored meta-regression (ML-UMR), a Bayesian regression framework for synthesizing individual patient data and aggregate data from fully disconnected evidence. ML-UMR extends multilevel network meta-regression (ML-NMR) to unanchored settings by jointly modeling individual- and aggregate-level data within a unified likelihood, enabling estimation of treatment-specific outcomes and both marginal and conditional effects across multiple treatments, studies, and target populations. ML-UMR distinguishes assumptions required to identify treatment effects from those required to transport results to target populations. As with all unanchored comparisons, valid inference relies on strong and often unverifiable assumptions, including conditional exchangeability, correct specification of the outcome model, and cross-treatment assumptions (e.g., shared prognostic factor assumption (SPFA)). ML-UMR does not lessen these requirements but makes them explicit within a unified framework and facilitates sensitivity analyses. In simulation studies, ML-UMR produced low bias and nominal coverage for comparator-population effects. Transportability to alternative populations depended critically on identifying assumptions: violations of SPFA led to bias under strong effect modification, whereas incorporating subgroup information restored near-unbiased estimation and nominal coverage.

URL PDF HTML ☆

赞 0 踩 0

2606.19982 2026-06-19 stat.ME 新提交

Built-in Selection Bias in Proportional Hazards Models with Omitted Covariates: Simulation Evidence and Alternative Approaches

省略协变量的比例风险模型中的内置选择偏倚：模拟证据与替代方法

Ayoub Bifenzi, Helene Jacqmin-Gadda

AI总结本文通过模拟和实际数据，证明在随机试验中，即使省略的协变量与处理独立，仍会导致Cox比例风险模型估计的处理风险比存在偏倚，并比较了脆弱模型、加速失效时间模型和Kaplan-Meier曲线等替代方法的稳健性。

详情

AI中文摘要

在时间-事件分析中，来自Cox比例风险（PH）模型的风险比（HR）是评估治疗效果最常用且广泛报告的指标。然而，由于风险比固有地依赖于每个时间点的生存条件，它们具有非可压缩性。因此，当存在因省略重要协变量导致的未测量异质性时，即使这些协变量在基线时与主要暴露独立（如随机对照试验中），风险比也会受到内置选择偏倚的影响。本文旨在概述文献中关于未观测异质性（由影响结局的省略协变量引起）如何在标准比例风险模型中偏倚治疗风险比估计的关键发现，即使在处理分配独立于这些协变量的随机试验中也是如此。通过模拟，我们评估了半参数Cox PH模型和参数PH模型在各种未测量异质性场景下的偏倚程度。然后，我们将这些标准模型与替代方法进行比较，这些方法要么解决了这一问题，要么被认为对此具有稳健性。这些替代方法包括来自脆弱模型的风险比、来自加速失效时间（AFT）模型的回归参数，以及使用Kaplan-Meier曲线非参数估计或基于具有时变暴露效应的Cox模型估计的治疗组间生存差异。我们通过一个来自放射治疗肿瘤学组（RTOG 9202）的随机对照试验的实际数据应用，说明了所探索替代方法的实际相关性。

英文摘要

In time-to-event analysis, the hazard ratio (HR) derived from the Cox proportional hazards (PH) model is the most commonly used and widely reported measure for assessing treatment effects. However, hazard ratios are non-collapsible due to their inherent conditioning on survival up to each time point. As a result, they are subject to built-in selection bias in the presence of unmeasured heterogeneity arising from omitted important covariates, even when these covariates are independent of the main exposure at baseline, as is the case in randomized controlled trials. This article aims to provide an overview of key findings from the literature on how unobserved heterogeneity, due to omitted covariates that affect the outcome, can bias the estimation of the treatment hazard ratio in standard proportional hazards models, even in randomized trials where treatment is assigned independently of such covariates. Through simulations, we evaluate the extent of bias in the semi-parametric Cox PH model and parametric PH model under various scenarios of unmeasured heterogeneity. We then compare these standard models to alternative approaches that either account for this issue or are considered robust to it. These alternatives include the hazard ratio estimated from frailty models, regression parameters from an Accelerated Failure Time (AFT) model, and survival differences between treatment groups estimated nonparametrically using Kaplan-Meier curves or based on a Cox model with time-dependent effect of the exposure. We illustrate the practical relevance of the explored alternatives through a real data application to a randomized controlled trial from the Radiation Therapy Oncology Group (RTOG 9202).

URL PDF HTML ☆

赞 0 踩 0

2606.19892 2026-06-19 stat.ME 新提交

The Ghosh-Lin and Fine-Gray models for a mix of administrative and random censoring

混合行政删失与随机删失下的Ghosh-Lin和Fine-Gray模型

Thomas H. Scheike, Christian Mirian, Isao Yokota, Giuliana Cortese

AI总结针对同时存在行政删失和随机删失的数据，提出结合风险集调整和逆概率删失加权的方法，使Ghosh-Lin和Fine-Gray模型得到一致估计。

详情

AI中文摘要

复发事件或竞争风险回归模型通常应用于生物医学领域，两者都可视为边际模型。在存在右删失的情况下，需要调整这些模型以获得一致估计量。当删失是行政性时，边际回归模型特别容易估计。然而，当删失是随机作用时，通常考虑逆概率删失加权（IPCW）调整来获得参数估计。该技术通过正确的删失模型进行删失权重调整，但对于行政删失，只需修改风险集即可正确调整。在实践中，对于大型中央登记处或某些临床试验，所有受试者的行政删失时间已知，但通常也会有一定比例的受试者被随机删失。在这项工作中，我们考虑两种常用的回归方法：用于带有终止事件的复发事件的Ghosh-Lin模型和用于竞争事件的Fine-Gray模型。对于这两种情况，当同时存在行政删失和随机删失时，我们展示了如何通过处理这两种不同类型删失的组合，在最小化建模假设的基础上获得正确估计。

英文摘要

Recurrent events or competing risks regression models are often applied in the bio-medical setting and both can be considered as marginal models. In presence of right-censoring, such models need to be adjusted to give consistent estimators. When censoring is administrative, marginal regression models are particularly easy to estimate. However, when censoring is instead acting randomly, inverse probability of censoring weighting (IPCW) adjustments are typically considered to obtain parameter estimates. This technique relies on a censoring-weights adjustment via a correct censoring model, but for administrative censoring the adjustment is done correctly simply by modifying the risk-set. In practice for large central registries or some clinical trials, the administrative censoring time will be known for all subjects, but there will typically also be a proportion of subjects that are censored at random. In this work, we consider two frequently used regression approaches, the Ghosh-Lin model for recurrent events with terminal events and the Fine-Gray model for competing events. For these two settings, when both administrative and random censoring are present, we demonstrate how to obtain correct estimation by dealing with the combination of the two different types of censoring relying on a minimum of modeling assumptions.

URL PDF HTML ☆

赞 0 踩 0

2606.19760 2026-06-19 stat.AP 新提交

Covariate-Adjusted Functional Principal Components Analysis for Modeling Hazard Rates of Physical Activity in the US Population

协变量调整的功能主成分分析用于建模美国人口体力活动的风险率

Md Rokibul Hasan, Pratim Guha Niyogi

AI总结提出基于风险函数的分布分析方法，利用功能主成分分析（FPCA）从腕部加速度计数据中刻画个体活动强度分布变异，优于均值摘要。

详情

AI中文摘要

体力活动在人类健康中起着至关重要的作用。其整体分布因人而异。常用的汇总指标无法描述这种分布模式。我们提出了一种基于分布的分析方法，通过从腕部加速度计数据中导出的风险函数来建模个体活动强度模式，从而描述体力活动。我们分析了2011-2012年国家健康与营养调查（NHANES）中4297名连续佩戴设备7天的成年人的分钟级独立于监测器的运动摘要（MIMS）数据。我们使用基于生存的方法为每个个体在共同强度网格上导出了非参数活动强度风险，将MIMS的风险曲线及其对数变换后的MIMS都视为功能对象。我们在MIMS的两个尺度上使用功能主成分分析（FPCA）来表征活动强度分布的主要变异模式。组均值风险函数在低强度水平上差异很小，而在高强度水平上我们观察到显著差异。我们的结果表明，基于风险的功能表示方法能够捕捉个体间体力活动强度分布的差异，提供了一种灵活且可解释的方式来表征异质性。该方法优于基于均值的摘要，并支持对人口亚组之间体力活动模式进行有原则的比较。

英文摘要

Physical activity plays a vital role in human health. Its entire distribution differs among people. Commonly used summary measures cannot describe this distributional pattern. We present a distribution-based analytical approach to describe physical activity by modeling individual-level activity-intensity patterns through hazard functions derived from wrist-worn accelerometer data. We analyzed minute-level Monitor-Independent Movement Summary (MIMS) data of 4297 adults with seven continuous days of device wear from the 2011- 2012 National Health and Nutrition Examination Survey (NHANES). We derived a nonparametric activity-intensity hazard using a survival-based approach for each individual on a common intensity grid, treating both the hazard curves from MIMS and their log-transformed MIMS as functional objects. We used functional principal component analysis (FPCA) on both scales of MIMS to characterize dominant modes of variation in activity-intensity distributions. Group-wise mean hazard functions showed little difference at lower intensity levels, while we observed a substantial difference at higher intensity levels. Our results demonstrate that hazard-based functional representations for capturing differences in physical activity intensity distributions across individuals offer a flexible and interpretable way to characterize heterogeneity. This approach works better than mean-based summaries and supports principled comparisons of physical activity patterns across population subgroups.

URL PDF HTML ☆

赞 0 踩 0

2606.19743 2026-06-19 stat.ME stat.AP 新提交

A Bayesian spatio-temporal nearest neighbor Gaussian process model for pooled genetic data

一种用于汇总遗传数据的贝叶斯时空最近邻高斯过程模型

Imke Botha, Tianxiao Hao, Lucinda E. Harrison, Nick Golding, Daniel J. Weiss, Jennifer A. Flegg

AI总结提出最近邻高斯过程模型，结合序贯蒙特卡洛平方算法，高效推断汇总遗传数据中的单倍型频率，并应用于非洲抗疟药物耐药性遗传数据分析。

详情

AI中文摘要

大规模遗传数据集通常汇总不同遗传标记的总等位基因计数。从这些汇总数据中推断单倍型频率（即多标记等位基因的频率）是一个挑战。由于计算成本，先前在此背景下的时空建模仅限于3个标记。在这项工作中，我们提出了一种最近邻高斯过程（NNGP）模型，以改善随标记和观测数量扩展的规模。为了推断模型参数，我们开发了一种新颖的序贯蒙特卡洛平方算法，该算法使用带有祖先抽样的粒子吉布斯来变异NNGP函数值。后者在观测数量和NNGP数量上具有线性成本，并可应用于广泛的NNGP模型。作为案例研究，我们分析了与非洲抗疟药物耐药性相关的遗传数据，并在3和6个遗传标记数据集上实证展示了我们的扩展结果。

英文摘要

Large scale genetic datasets often aggregate the total allele counts of distinct genetic markers. Inferring haplotype frequencies (i.e.\ the frequency of multimarker alleles) from these pooled data is a challenge. Previous spatio-temporal modelling in this context has been limited to 3 markers due to the computational cost. In this work, we propose a nearest neighbor Gaussian process (NNGP) model to improve scaling with the number of markers and observations. To infer the parameters of our model, we develop a novel sequential Monte Carlo squared algorithm, which uses particle Gibbs with ancestor sampling to mutate the NNGP function values. The latter has a linear cost in the number of observations and the number of NNGPs, and can be applied to a broad range of NNGP models. As a case study, we analyse genetic data relating to antimalarial drug resistance in Africa, and show our scaling results empirically on a 3 and 6 genetic marker dataset.

URL PDF HTML ☆

赞 0 踩 0

2606.20489 2026-06-19 q-bio.PE nlin.CG physics.bio-ph stat.AP 新提交

West Nile virus outbreak in Italy modelled with the quantum Game of Life

意大利西尼罗病毒疫情用量子生命游戏建模

Andrea Fontana, Simone Tambascia, Ciro Di Carluccio, Andrea Esposito, Bernardo Spagnolo, Andrea M. Chiariello

AI总结使用量子生命游戏细胞自动机模型模拟2025年夏季意大利西尼罗病毒传播，通过优化蚊子出生和移除率，准确拟合局部和区域平均累计感染曲线，并评估环境变化的影响。

详情

AI中文摘要

近年来，意大利观察到西尼罗病毒（WNV）异常高传播，特别是在拉齐奥南部、坎帕尼亚和威尼托地区感染高峰显著。WNV的主要病媒是库蚊，通过叮咬传播人类感染。本文通过基于量子版本的生命游戏（GOL）细胞自动机模型的计算方法，研究2025年夏季意大利西尼罗热疫情的扩散。具体而言，人类动力学根据GOL规则演化，而病媒（即蚊子）的随机动力学及其与人类的相互作用同时发生。我们表明，该模型在局部和平均区域水平上以高精度拟合累计感染个体曲线，仅需优化蚊子出生率和移除率参数。此外，利用模型的灵活性，我们表明模型参数值的变化阐明了系统对环境变化的响应。例如，我们量化了蚊子传播控制措施或由于气候和生态变化导致的蚊子突然增加的影响。总体而言，我们提供了意大利WNV感染传播的一般定量描述，可作为测试不同环境情景的支持工具，并有助于决策者制定监测病媒动力学和控制病毒传播的策略。

英文摘要

In the last years, an anomalously high spreading of West Nile virus (WNV) has been observed in Italy, with particularly high peaks of infections in southern Lazio, Campania and Veneto regions. The main disease vector for WNV is represented by Culex pipiens mosquitoes, which spread human infections through their bites. Here, we investigate WNV fever epidemic diffusion during summer season 2025 in Italy through a computational approach based on a quantum version of the Game of Life (GOL) cellular automaton model. Specifically, human dynamics evolves according to the GOL rules, while stochastic dynamics of disease vectors, i.e., mosquitoes, as well as their interaction with humans, simultaneously occur. We show that this model fits the curves of cumulative infected individuals with high accuracy, either at local and average-regional level, with only optimization of mosquito birth and removal rates parameters. Furthermore, leveraging model flexibility, we show that changes in model parameters values elucidate system response to environmental variations. For instance, we quantify, e.g., the impact of mosquito spreading containment measures or sudden mosquito increasing abundance due to climatic and ecological changes. Overall, we provide a general, quantitative description of WNV infection spreading in Italy which could represent a supportive tool to test different environmental scenarios and could be useful to devise strategies for decision makers to monitor disease vector dynamics and to control consequent virus diffusion.

URL PDF HTML ☆

赞 0 踩 0

2606.19041 2026-06-19 stat.ME 新提交

Efficient Cumulative Incidence Estimation in Biobank Studies Using All Prevalent and Incident Events

利用所有现患和发病事件在生物库研究中进行高效累积发病率估计

David M. Zucker, Malka Gorfine

AI总结针对生物库数据中同时包含招募前发病（现患）和随访期间发病的个体，提出一种新的累积发病率函数估计方法，整合所有病例，处理年轻发病且生存期长的疾病，理论证明渐近性质，模拟和UK生物库癌症数据验证其优势。

详情

AI中文摘要

基于人群的生物库已在许多国家建立，为大规模研究各种疾病的发病率提供了机会。生物库数据通常是在特定日历期内招募的研究队列中收集的，受试者在年龄介于$R_L$和$R_U$之间时进入研究。本研究关注包含两类个体的生物库数据：在招募前已发生目标疾病（称为现患病例）的个体，以及最初招募时无病但在随访期间发病的个体。我们提出一种新的累积发病率函数（CIF）估计量，它超越了现有方法，因为它整合了所有疾病病例，无论是现患还是发病，无论其后续生命历程如何。特别是，新方法可以处理涉及在年轻年龄发生且发病后生存期长的疾病的情况。建立了新方法的渐近性质，并进行了模拟研究以检验该方法的性能。我们通过将方法应用于英国生物库的癌症数据，说明了该方法的使用，并强调了其相对于现有方法的优势。

英文摘要

Population-based biobanks, now established in many countries, offer opportunities for large-scale studies investigating the incidence of various diseases. Biobank data is typically collected from a study cohort recruited over a defined calendar period, with subjects entering the study at various ages falling between $R_L$ and $R_U$. This work focuses on biobank data that includes individuals in whom onset of the disease of interest occurred before recruitment, termed prevalent cases, along with individuals initially recruited as disease-free in whom disease onset occurred during the follow-up period. We propose a novel cumulative incidence function (CIF) estimator that goes beyond existing methods in that it incorporates all disease cases, both prevalent and incident, irrespective of their subsequent life course. In particular, the new method can handle situations involving diseases that can occur at young ages with long survival after disease onset. Asymptotic properties of the new method are established and a simulation study is presented examining the performance of the method. We illustrate the use of the method and highlight its advantages over existing methods with an application to cancer data from the UK biobank.

URL PDF HTML ☆

赞 0 踩 0

2406.01557 2026-06-19 stat.ME stat.AP 版本更新

Flexible aggregation of compositional predictors with shared effects for microbiome association analysis

共享效应组合预测因子的灵活聚合用于微生物组关联分析

Satabdi Saha, Liangliang Zhang, Michele Guindani, Kim-Anh Do, Christine B. Peterson

AI总结提出BRACE方法，通过尖峰-聚类先验和投影约束高斯先验，实现微生物组数据的自适应聚类和变量选择，识别与结果共享效应的关键特征。

详情

AI中文摘要

微生物组分析的最新进展为微生物群落的分子动态提供了前所未有的见解，激发了揭示微生物组在人类健康中关键作用的兴趣。然而，由于微生物组数据的高维、稀疏和组成性，识别与临床结果相关的微生物特征仍然具有挑战性。此外，许多微生物分类群虽然被分类为不同的，但可能共享功能角色，使传统的变量选择方法复杂化。为了克服这些障碍，我们引入了具有聚合组成效应的贝叶斯回归（BRACE），这是一种新方法，使用结合伯努利活动指标的尖峰-聚类先验、有限活动集上的Ewens可交换分割先验以及聚类效应上的投影约束高斯先验，进行数据自适应聚类和变量选择。我们工作的方法论创新在于如何将Ewens分割先验与聚类原子上的投影约束高斯相结合，以强制执行总和为零的约束。BRACE将具有相似效应的微生物分类群分组，产生更可解释的模型，同时实现有效的降维。通过综合模拟和一项检查口腔微生物组组成对胰岛素抵抗影响的真实应用，我们证明了BRACE在识别具有共享效应的关键特征方面优于现有方法。

英文摘要

Ongoing advancements in microbiome profiling have provided unprecedented insights into the molecular dynamics of microbial communities, sparking a surge of interest in uncovering the microbiome's critical role in human health. Identifying microbial features linked to clinical outcomes, however, remains challenging due to the high-dimensional, sparse, and compositional nature of microbiome data. Additionally, many microbial taxa, although classified as distinct, may share functional roles, complicating traditional variable selection methods. To overcome these obstacles, we introduce Bayesian Regression with Agglomerated Compositional Effects (BRACE), a novel approach using a spike-and-cluster prior combining Bernoulli activity indicators, an Ewens exchangeable partition prior on the finite active set, and a projection-based constrained Gaussian prior on cluster effects to perform data-adaptive clustering and variable selection. The methodological innovation of our work lies in how we combine the Ewens partition prior with a projection-based constrained Gaussian on the cluster atoms to enforce the sum-to-zero constraint. BRACE groups microbial taxa with similar effects on the outcome, yielding more interpretable models while enabling effective dimension reduction. Through comprehensive simulations and a real-world application examining the influence of oral microbiome composition on insulin resistance, we demonstrate BRACE's superior performance over existing methods, particularly in identifying key features with shared effects on outcomes.

URL PDF HTML ☆

赞 0 踩 0

2606.20240 2026-06-19 econ.EM stat.AP 新提交

Two-Sample IV: Efficient Two-Step Estimation and Tests for Overidentification and Weak-Instruments

两样本IV：高效两步估计及过度识别与弱工具变量检验

Fatima Kasenally, Ruoxi Guan, Frank Windmeijer

AI总结针对两样本IV估计，提出异方差和样本异质性下稳健的两步高效估计方法及过度识别检验，仅需线性回归的汇总统计量，并扩展弱工具变量检验。

详情

AI中文摘要

两样本IV是一种流行的估计方法，当结果变量和处理变量在不同样本中可用，而工具变量在两个样本中都可用时。标准估计量是两样本两阶段最小二乘估计量，在同方差和样本同质性下是有效的。我们开发了一个稳健的两步程序，用于在一般异方差和样本异质性下进行有效估计，并提出了相关的两样本Hansen过度识别检验。我们方法的一个关键特征是只需要两个样本中简化形式和第一阶段的线性回归的汇总统计量。这些是估计系数向量的六个对象，以及同方差和异方差稳健的估计方差矩阵。我们进一步表明，在同方差和同质性下，处理样本中的第一阶段F统计量可以按标准方式用作弱工具变量检验，这里的相对偏差是比例偏差。我们提出了Montiel-Olea和Pflueger (2013)的有效F统计量的扩展，用于异方差情况，遵循Windmeijer (2025)的推广。我们在Marshall (2019)研究教育对投票行为影响的应用中说明了估计量和检验，并进行了聚类稳健推断。

英文摘要

Two-sample IV is a popular estimation method when the outcome and treatment variables are available in different samples, whereas instruments are available in both samples. The standard estimator is two-sample two-stage least squares estimator, which is efficient under homoskedasticity and homogeneity of the samples. We develop a robust two-step procedure for efficient estimation under general heteroskedasticity and heterogeneity of the samples, and propose a related two-sample Hansen overidentification test. A key feature of our approach is that only summary statistics from the linear regressions of the reduced form and first-stage in the two samples are needed. These are the six objects of the estimated coefficient vectors, and the homoskedastic and heteroskedasticity robust estimated variance matrices. We further show that the first-stage F-statistic in the treatment sample can be used as a test for weak instruments in the standard way under homoskedasticity and homogeneity, with the relative bias here a proportional bias. We propose an extension of the effective F-statistic of Montiel-Olea and Pflueger (2013) for the heteroskedastic case, following the generalization in Windmeijer (2025). We illustrate the estimators and tests in an application studying the effect of education on voting behavior from Marshall (2019), with cluster robust inference.

URL PDF HTML ☆

赞 0 踩 0

2606.20420 2026-06-19 q-fin.CP stat.AP 新提交

Advanced Calibration Analysis and Tools: Identifying Influential Observations in Stochastic Interest Rate Model Calibration

高级校准分析与工具：识别随机利率模型校准中的有影响观测值

Philipp Mahler, Peter Ruckdeschel

AI总结将校准问题嵌入非线性回归理论，证明最小化RMSRE等价于加权最小二乘，开发诊断框架（加权帽子矩阵、影响函数、泛函Delta方法），实证发现杠杆边界主导、有效维度损失及2022年后参数稳定性转变，指出低RMSRE不足以验证校准。

Comments 47 pages, 9 figures, 1 table

详情

AI中文摘要

利率模型的准确校准对于市场一致性估值和经济情景生成器（ESGs）至关重要。多因子模型（如G2++模型）的传统校准方法通常依赖于点估计，忽略了特定市场数据的影响和估计不确定性的量化。本文开发了一个诊断框架，将校准问题嵌入非线性回归理论。研究表明，行业常见的均方根相对误差（RMSRE）最小化等价于加权最小二乘（WLS）问题。这一等价关系导出了诊断工具的相应公式，包括用于杠杆分析的加权帽子矩阵、用于局部敏感性诊断的影响函数，以及用于局部、边界置信区间的泛函Delta方法。实现中采用了高效的雅可比矩阵分解，利用了平价（ATM）上限的解析可处理性。该框架应用于2016-2025年期间的欧元ATM上限数据集。我们的实证分析揭示了边界主导的杠杆分布、由于参数约束活跃导致的重复有效维度损失，以及2022年后市场转型中局部参数稳定性的诊断机制转变。对精算模型治理的启示是：低RMSRE不足以验证校准。最后，我们讨论了该框架对一般最小二乘问题的适用性，同时指出了对于缺乏闭式梯度的工具（如互换期权）的计算挑战。

英文摘要

The accurate calibration of interest rate models is central to market-consistent valuation and Economic Scenario Generators (ESGs). Traditional calibration methods for multi-factor models such as the G2++ model often rely on point estimates, neglecting the influence of specific market data and the quantification of estimation uncertainty. This paper develops a diagnostic framework embedding the calibration problem into non-linear regression theory. It shows that the common industry practice of minimizing the Root Mean Squared Relative Error (RMSRE) is equivalent to a Weighted Least Squares (WLS) problem. This equivalence yields the corresponding formulations for diagnostic tools, including the Weighted Hat Matrix for leverage analysis, Influence Functions for local sensitivity diagnostics, and the Functional Delta Method for local, boundary-respecting confidence intervals. The implementation uses an efficient Jacobian factorization that exploits the analytical tractability of At-The-Money (ATM) caps. The framework is applied to a dataset of Euro ATM caps covering the period 2016--2025. Our empirical analysis reveals a boundary-dominated leverage profile, repeated losses of effective dimensionality due to active parameter constraints, and a diagnostic regime shift in local parameter stability around the post-2022 market transition. The resulting message for actuarial model governance is that low RMSRE is not sufficient for calibration validation. We conclude by discussing the framework's applicability to general least-squares problems while highlighting the computational challenges for instruments lacking closed-form gradients, such as swaptions.

URL PDF HTML ☆

赞 0 踩 0

2606.19789 2026-06-19 math.OC stat.ME 新提交

电力系统中碳成本传导率：来自欧盟排放交易体系下意大利的证据

Pierdomenico Duttilo, Francesco Lisi

AI总结研究欧盟排放交易体系下碳成本在意大利电力市场的传导率，基于2016-2024年数据，采用自回归线性回归模型，发现全国平均传导率约32%，且各市场区域存在显著异质性。

详情

AI中文摘要

本文研究了欧盟排放交易体系（EU ETS）下碳定价对意大利电力市场的影响，重点关注第三和第四阶段（2016-2024年）各市场区域的碳成本传导率（CPTR）。利用日度数据，研究采用基于自回归动态线性回归模型的计量经济学框架，估计碳成本在批发电力价格中的反映程度。进一步通过稳健性检验和分位数回归，评估CPTR在不同燃料价差水平下的变化。结果表明，碳成本正向且显著地传导至电力价格，证实了碳定价作为关键市场驱动因素的相关性。然而，传导不完全，CPTR值始终低于100%。在国家层面，传导率估计约为32%，第三阶段和第四阶段之间无统计显著变化。各市场区域出现显著异质性：在北部、中北部和撒丁岛，第四阶段传导率上升，而在中南部和西西里岛则下降，反映了发电结构、碳强度和市场条件的差异。总体而言，研究结果强调了市场区域因素在塑造电力市场碳定价有效性中的重要性。

英文摘要

This paper investigates the impact of carbon pricing under the EU Emissions Trading System (EU ETS) on the Italian electricity market, focusing on the carbon cost pass-through rate (CPTR) across market zones during Phases 3 and 4 (2016-2024). Using daily data, the study applies an econometric framework based on a linear regression model with autoregressive dynamics to estimate the extent to which carbon costs are reflected in wholesale electricity prices. It further incorporates robustness checks and quantile regression to assess how the CPTR varies across different fuel spread levels. The results show that carbon costs are positively and significantly transmitted to electricity prices, confirming the relevance of carbon pricing as a key market driver. However, pass-through is incomplete, with CPTR values consistently below 100%. At the national level, the pass-through estimate is around 32%, with no statistically significant change between Phase 3 and Phase 4. Substantial heterogeneity emerges across market zones: pass-through increases in the North, Centre-North, and Sardinia during Phase 4, while it declines in the Centre-South and Sicily, reflecting differences in generation mix, carbon intensity, and market conditions. Overall, the findings highlight the importance of market zones factors in shaping the effectiveness of carbon pricing in electricity markets.

URL PDF HTML ☆

赞 0 踩 0

2603.06820 2026-06-19 econ.EM stat.OT 版本更新

Hippocratic Utility and Status Quo Bias

希波克拉底效用与现状偏见

Tomasz Strzalecki

AI总结本文通过简单例子揭示一种重视失去生命多于拯救生命的效用函数，其适用范围比最初看起来有限得多。

2410.19333 2026-06-19 econ.GN physics.soc-ph q-fin.EC stat.AP 版本更新

Swiss-system chess tournaments and unfairness

瑞士制国际象棋锦标赛与不公平性

László Csató, Alex Krumer

AI总结研究瑞士制国际象棋锦标赛中轮次奇偶性导致的不公平性，发现多执白一局的选手得分显著更高，建议采用偶数轮次和平衡颜色分配机制。

Comments 13 pages, 4 tables

2512.02203 2026-06-19 econ.EM stat.AP 版本更新

Statistical Inference in Large Multi-way Networks

大规模多路网络中的统计推断

Lucas Resende, Guillaume Lecué, Lionel Wilner, Philippe Choné

AI总结提出一种基于分类任务的多路网络结构参数估计方法，无需固定效应数量与结构假设，避免 incidental parameter 问题，在稀疏网络中比 PPML 更快且置信区间更可靠，应用于法国医疗政策因果效应分析。

Comments Working paper

详情

AI中文摘要

我们提出了一种新方法，用于在多路网络中估计结构参数，同时控制丰富的固定效应结构。该方法基于一系列分类任务，对固定效应的数量和结构均不敏感。与完全最大似然方法相比，我们的估计量不会受到 incidental parameter 问题的影响。对于稀疏连接的网络，它在计算上也比 PPML 更快。我们提供的经验证据表明，我们的估计量比 PPML 及其偏差修正策略产生更可靠的置信区间。即使在模型误设下，这些改进仍然成立，并且在稀疏设置中更为显著。虽然 PPML 在密集、低维数据中仍具有竞争力，但我们的方法为多路模型提供了一种稳健的替代方案，能够随稀疏性高效扩展。该方法被应用于研究政策改革对法国医疗空间可达性的因果效应。

英文摘要

We propose a new method to estimate structural parameters in multi-way networks while controlling for rich structures of fixed effects. The method is based on a series of classification tasks and is agnostic to both the number and structure of fixed effects. In contrast to full maximum likelihood approaches, our estimator does not suffer from the incidental parameter problem. For sparsely connected networks, it is also computationally faster than PPML. We provide empirical evidence that our estimator yields more reliable confidence intervals than PPML and its bias-correction strategies. These improvements hold even under model misspecification and are more pronounced in sparse settings. While PPML remains competitive in dense, low-dimensional data, our approach offers a robust alternative for multi-way models that scales efficiently with sparsity. The method is applied to study the causal effect of a policy reform on spatial accessibility to health care in France.

URL PDF HTML ☆

赞 0 踩 0

2606.20427 2026-06-19 math.ST stat.ME stat.TH 新提交

Private Rate-Double-Robust Inference

私有率双稳健推断

Máté Kormos, Aad van der Vaart

AI总结本文通过局部隐私机制注入噪声保护个体隐私，同时利用率双稳健性实现目标参数的无偏和半参数有效推断，并开发了私有化非参数和参数 nuisance 估计方法。

详情

AI中文摘要

我们协调了隐私保护和率双稳健推断。个体隐私通过局部隐私机制得到保护：向敏感数据注入噪声，仅揭示用于推断的噪声数据。因此，隐私保护阻碍了推断。相比之下，当目标参数的估计量的大样本偏差由另外两个 nuisance 参数的估计误差之间的权衡表征时，该参数的推断是率双稳健的。因此，率双稳健性促进了推断。我们协调的起点是一类由无限维线性索引和低维非线性回归索引的率双稳健目标参数。这包括因果参数等。为了私有地推断这些目标，我们展示了合适的隐私机制如何将敏感数据模型的半参数性质转移到私有设置中。率双稳健性被转移，从而实现了对目标参数的局部私有、无偏和半参数有效推断。最后，我们将一般的非参数 nuisance 估计量转化为私有估计量，这些估计量继承了其非私有对应物的收敛性质。对于参数 nuisance 模型，我们开发了一种私有矩估计方法及其大样本推断理论。

英文摘要

We reconcile privacy protection and rate-double-robust inference. The privacy of individuals is protected by a local privacy mechanism: injecting noise into their sensitive data, revealing only the noisy data for inference. Hence, privacy protection hinders inference. In contrast, the inference of a target parameter is rate-double-robust when the large-sample bias of an estimator of the parameter is characterised by a trade-off between the estimation errors of two other, nuisance, parameters. Hence, rate-double-robustness facilitates inference. Our starting point of reconciliation is a class of rate-double-robust target parameters indexed linearly by an infinite-dimensional and nonlinearly by a low-dimensional regression. Among others, this includes causal parameters. To infer these targets privately, we show how suitable privacy mechanisms transfer the semiparametric properties of the sensitive-data model to the private setting. Rate-double-robustness is transferred, enabling locally-private, unbiased and semiparametrically efficient inference of our target parameters. Finally, we transform general nonparametric nuisance estimators into private ones, which inherit convergence properties of their nonprivate counterparts. For parametric nuisance models, we develop a private method-of-moments estimator and its large-sample inference theory.

URL PDF HTML ☆

赞 0 踩 0

2601.02322 2026-06-19 stat.ME cs.LG 版本更新

Environment-Adaptive Covariate Selection: Learning When to Use Spurious Correlations for Out-of-Distribution Prediction

环境自适应协变量选择：学习何时利用虚假相关进行分布外预测

Shuozhi Zuo, Yixin Wang

发表机构 * Department of Statistics, University of Michigan, Ann Arbor（统计系，密歇根大学，安阿伯分校）

AI总结针对分布外预测中协变量选择问题，提出环境自适应算法，根据环境特征动态选择协变量集，在模拟和实际数据中优于静态方法。

详情

AI中文摘要

一种常见的分布外预测方法将模型限制为因果或不变协变量，以避免可能随环境变化的虚假关联。尽管具有理论吸引力，但当仅观察到结果的部分因果父节点时，该策略可能不如经验风险最小化。在这种情况下，非因果协变量可以作为未观察到的因果父节点的代理，当代理关系稳定时改善预测，但当变化破坏这种关系时则有害。因此，最优协变量集可能取决于所遇到的具体变化。由于不同的变化会在未标记的协变量分布中留下特征，我们提出了一种环境自适应协变量选择算法，该算法将环境级摘要映射到特定于环境的协变量集。这些摘要可以是手工制作的，也可以从多环境数据中学习，并且先验因果知识可以作为约束条件纳入。在模拟和应用数据集中，所提出的方法在各种变化下优于静态因果、不变和其他非自适应规则。

英文摘要

A common approach to out-of-distribution prediction restricts models to causal or invariant covariates to avoid spurious associations that may change across environments. Despite its theoretical appeal, this strategy can underperform empirical risk minimization when only a subset of the causal parents of the outcome is observed. In such settings, non-causal covariates can serve as proxies for unobserved causal parents and improve prediction when the proxy relationship is stable, but they can hurt when shifts disrupt that relationship. Thus, the optimal covariate set can depend on the specific shift encountered. Because different shifts leave signatures in the unlabeled covariate distribution, we propose an environment-adaptive covariate selection algorithm that maps environment-level summaries to environment-specific covariate sets. These summaries may be hand-crafted or learned from multi-environment data, and prior causal knowledge can be incorporated as constraints. Across simulations and applied datasets, the proposed method improves over static causal, invariant, and other non-adaptive rules under diverse shifts.

URL PDF HTML ☆

赞 0 踩 0

2606.20114 2026-06-19 stat.ME stat.AP 新提交

Community detection in small-sample ordinal regimes: A benchmarking framework for Delphi data

小样本有序情境下的社区检测：德尔菲数据的基准测试框架

Yuri Calleo, Simone Di Zio, Fabrizio Maturo

AI总结针对德尔菲数据高维小样本导致的秩亏问题，提出从变量中心协方差模型转向网络中心连接模型，利用社区检测算法识别潜在主题结构，实现结构稳定的降维。

详情

AI中文摘要

德尔菲数据共识的统计建模面临一个关键瓶颈：问卷项目的高维性与专家小组有限样本量之间的矛盾。这种秩亏导致传统潜变量模型（如主成分分析）结构不稳定且易过拟合。为弥补这一方法论空白，本研究提出从变量中心协方差模型转向网络中心连接模型。通过将项目相关性映射到加权图拓扑，我们提出了一个基于模拟的基准测试，利用社区检测算法识别潜在主题结构，有效解决了高维小样本情境下典型的谱不稳定性和秩亏问题。该研究系统评估了基于结构密度、信息流和谱划分的拓扑方法在合成数据集上的鲁棒性，这些数据集旨在复制共识数据的病理条件，包括有序量表和系统噪声。核心方法论贡献在于证明专家判断间的共线性——传统上被视为需要正则化的统计冗余——可以有效地重新解释为凝聚的拓扑信号。该框架为研究人员提供了一种结构化的自动降维程序，确保即使在标准因子分析失效的小样本情境下也能保持结构稳定性和心理测量一致性。

英文摘要

The statistical modeling of consensus in Delphi data faces a critical bottleneck: the high dimensionality of questionnaire items relative to the limited sample size of expert panels. This rank deficiency leads traditional latent variable models, such as Principal Component Analysis, to be structurally unstable and prone to overfitting. Addressing this methodological gap, this study proposes a transition from variable-centric covariance models to network-centric connectivity models. By mapping item correlations onto a weighted graph topology, we present a simulation-based benchmark that utilizes community detection algorithms to identify latent thematic structures, effectively addressing the spectral instability and rank deficiency typical of high-dimensional, low-sample-size regimes. The research systematically evaluates the robustness of topological approaches based on structural density, information flow, and spectral partitioning against synthetic datasets designed to replicate the pathological conditions of consensus data, including ordinal scales and systemic noise. The central methodological contribution lies in demonstrating that collinearity among expert judgments - traditionally treated as statistical redundancy to be regularized - can be effectively reinterpreted as a topological signal of cohesion. This framework provides researchers with a structured and automated procedure for dimensionality reduction, ensuring structural stability and psychometric consistency even in small-sample regimes where standard factor analysis breaks down.

URL PDF HTML ☆

赞 0 踩 0

2606.19775 2026-06-19 cs.SI stat.AP stat.OT 新提交

Rethinking Sampling Strategy in Link Prediction

重新思考链接预测中的采样策略

Yilin Bi, Zhenyu Deng, Xinshan Jiao, Tao Zhou

AI总结提出β-采样方案，研究两阶段采样对链接预测性能的影响，发现缺失链接的结构特征显著影响预测精度，且第二阶段采样策略至关重要。

Comments 19 pages, 5 figures, 3 tables

详情

AI中文摘要

许多现实世界的网络是不完整的，使得链接预测成为网络科学中的一个基本挑战。为了训练参数和评估算法，观察到的链接通常被划分为三个子集，即训练集、验证集和探测集。这种划分隐含地涉及两个采样过程：第一阶段采样产生探测集，第二阶段采样获得变化集。迄今为止，我们对这两个采样过程如何影响算法性能的理解仍然非常有限。为了解决这个问题，我们提出了一种称为β-采样的采样方案，其中链接的采样概率与其两个端点的度数乘积的β次幂成正比。在45个真实网络上的实验表明，通过改变探测集模拟的缺失链接的结构特征显著影响预测精度。当缺失链接倾向于连接高度数节点时，这类链接可以很容易地被准确预测。此外，即使探测集固定，第二阶段采样仍然对预测精度产生显著影响。值得注意的是，最优的第二阶段采样策略不同于随机采样（随机选择链接形成验证集）和一致采样（保证验证集和探测集中的链接具有相同的结构特征）。

英文摘要

Many real-world networks are incomplete, making link prediction a fundamental challenge in network science. To train parameters and evaluate algorithms, observed links are usually divided into three subsets, namely training, validation, and probe sets. This division implicitly involves two sampling processes: first-stage sampling yields the probe set and second-stage sampling obtains the variation set. To date, our understanding of how these two sampling processes affect algorithm performance remains quite limited. To address this issue, we propose a sampling scheme called $β$-sampling, where the sampling probability of a link is proportional to the product of the degrees of its two endpoints raised to the power of $β$. Experiments on 45 real-world networks reveal that the structural characteristics of missing links, as simulated via varying probe sets, substantially impact prediction accuracy. When missing links tend to connect high-degree nodes, such links can be predicted accurately with ease. Furthermore, even with a fixed probe set, second-stage sampling still exerts a significant influence on prediction accuracy. Notably, the optimal second-stage sampling strategy differs from \textit{random sampling} (which randomly selects links to form the validation set) and \textit{consistent sampling} (which guarantees that links in the validation and probe sets share identical structural characteristics).

URL PDF HTML ☆

赞 0 踩 0

2606.19642 2026-06-19 physics.ao-ph stat.AP stat.ML 新提交

Rigorous uncertainty quantification of probabilistic AI weather forecasts with conformal prediction

基于保形预测的概率AI天气预报的严格不确定性量化

Anna Asch, Raphael Rossellini, Pedram Hassanzadeh, Rebecca Willett

AI总结针对AI概率天气预报校准不足（尤其是极端事件），提出使用保形预测方法，无需分布假设即可数学保证覆盖，应用于三个全球模型（GenCast、NeuralGCM、AIFS-ENS）的温度和降水预报，实现校准不确定性而不牺牲其他概率指标。

详情

AI中文摘要

概率天气预报正随着人工智能（AI）经历快速变革。在传统数值天气预报中，计算能力可能限制集合预报对未知未来状态统计分布的近似程度。AI模型便于生成更大的集合，并经过概率考量训练，理论上能带来更好的不确定性量化。这些最先进模型的预报通常被认为是良好校准的。然而，我们在此表明，此类模型的统计覆盖（校准的最终度量）可能存在问题，尤其是在极端事件上。为解决这一缺陷，我们采用保形预测，这是一类统计方法，与以往的后处理技术不同，它在无分布假设下数学上保证覆盖。我们将在线保形预测应用于三个领先全球天气模型（GenCast、NeuralGCM和AIFS-ENS）的温度和降水预报（包括极端情况），确保校准不确定性而不牺牲其他概率指标。这种后处理方法可应用于任何预报模型。

英文摘要

Probabilistic weather forecasting is undergoing rapid transformation with artificial intelligence (AI). In traditional numerical weather prediction, computing power can limit how well ensemble forecasts approximate the unknown statistical distribution of future states. AI models facilitate larger ensembles and are trained with probabilistic considerations, ideally leading to better uncertainty quantification. Forecasts from these state-of-the-art models are often considered well-calibrated. However, here we show that the statistical coverage of such models, the ultimate measure of calibration, can struggle, especially on extreme events. To address this shortcoming, we employ conformal prediction, a class of statistical methods that mathematically guarantees coverage under no distributional assumptions, unlike previous post-processing techniques. We apply online conformal prediction to temperature and precipitation forecasts (including extremes) of three leading global weather models, GenCast, NeuralGCM, and AIFS-ENS, ensuring calibrated uncertainty at no expense to other probabilistic metrics. This post-processing method can be applied to any forecasting model.

URL PDF HTML ☆

赞 0 踩 0

2606.18544 2026-06-19 stat.AP 新提交

Chess Signatures of Play

对弈的棋谱签名

Christian Turk, Nicholas Polson

AI总结利用粗路径理论的签名变换提取棋局中事件顺序与交互的不变特征，构建签名核双样本检验和时序有效作弊检测方法，在控制错误率的同时显著提升检测能力。

详情

AI中文摘要

一局棋是一个流：一个按时间排序的走法序列，每个走法携带引擎评估、准确度度量、局面复杂度度量和时钟读数。我们将一局棋建模为多元路径，并应用粗路径理论的签名变换，获得一个重参数化不变、分级的特征集，记录棋局内事件的顺序和交互，无需参数化似然。我们证明，棋手的对弈法则可以从期望签名中识别，直至树状等价；构造路径空间上的签名核双样本检验；并将作弊检测重新表述为任意时序有效的序列检验：签名符合度得分成为一个e过程，其误差通过Ville不等式对每个样本量同时控制，波动在中等偏差尺度上校准。判别信息存在于签名的Levy面积中，该面积衡量准确度是否恰好当局面变难时上升——这是引擎辅助的特征，而聚合的匹配率统计忽略了这一点。在对照研究中，该检验保持精确的第一类错误控制，检测能力从对细微辅助的微不足道上升到对明显辅助的0.98，中位检测时间与增长率预测一致。校准至马格努斯·卡尔森记录在案的精英准确度后，该监测器不会标记世界冠军级别的对弈；我们展示了作弊策略，这些策略使所有聚合统计量（包括Regan系统的最佳走法频率z分数）保持不变，却被签名干净地捕获——精确说明了顺序感知、任意时序有效的检验如何加强现有的国际象棋反作弊方法。

英文摘要

A game of chess is a stream: a time-ordered sequence of moves, each carrying an engine evaluation, a measure of accuracy, a measure of position complexity, and a clock reading. We model a game as a multivariate path and apply the signature transform of rough-path theory to obtain a reparametrization-invariant, graded feature set that records the order and interaction of in-game events without a parametric likelihood. We show that a player's law of play is identifiable from the expected signature up to tree-like equivalence, construct a signature-kernel two-sample test on path space, and recast cheating detection as an anytime-valid sequential test: a signature conformance score becomes an e-process whose error is controlled for every sample size at once by Ville's inequality, with fluctuations calibrated on the moderate-deviation scale. The discriminating information lives in the signature's Levy areas, which measure whether accuracy rises precisely when positions become hard--the fingerprint of engine assistance that aggregate match-rate statistics discard. In a controlled study the test holds exact type-I control and detection power rises from negligible for subtle assistance to 0.98 for blatant assistance, with a median detection time matching the growth-rate prediction. Calibrated to Magnus Carlsen's documented elite accuracy, the monitor does not flag world-champion-level play; and we exhibit cheating strategies that leave every aggregate statistic, including the best-move-frequency z-score of the Regan system, unchanged yet are caught cleanly by the signature--making precise how an order-aware, anytime-valid test strengthens the prevailing approach to chess anti-cheating.

URL PDF HTML ☆

赞 0 踩 0

2606.18436 2026-06-19 stat.ML cs.LG 新提交

Pointwise is Pointless? A Multimodal Ablation Study for Precipitation Nowcasting with Graph Neural Networks

逐点是否无意义？基于图神经网络的降水临近预报的多模态消融研究

Ophélia Miralles, Máté Mile, Christoffer Artturi, Thomas Nipen, Ivar Seierstad

发表机构 * Norwegian Meteorological Institute（挪威气象研究所）

AI总结本研究通过多模态图神经网络系统，消融分析雷达、数值预报、地面观测、卫星数据及训练损失对降水临近预报的影响，发现各模态分别改善不同方面，点观测虽提升局部但需结合损失函数和不确定性表示才能优化雷达场。

详情

AI中文摘要

稀疏点观测在降水临近预报中日益可用，但尚不清楚它们能在多大程度上改善密集雷达场预报。我们通过北欧雷达区域的多模态图神经网络临近预报系统部分回答了这个问题。该模型预测未来两小时内每五分钟的降雨率，并采用雷达历史、MEPS数值天气预报、Netatmo地面观测、MSG卫星通道、随机噪声和基于CRPS的集合损失的不同组合进行训练。本研究设计为对操作相关信源和训练目标的消融。我们比较了仅雷达、NWP信息、站点信息、卫星信息、噪声增强和基于CRPS的配置，使用雷达网格、站点位置、降雨起始的互补诊断，以及oracle、位移和幅度评分。结果表明，每个信源改善了预报问题的不同方面。MEPS稳定了仅雷达外推，Netatmo观测改善了局部站点和起始诊断，卫星预测因子减少了某些站点级偏差，但在确定性使用时可能过早激活降雨。基于CRPS的配置提供了最一致的雷达网格增益，而卫星与CRPS的组合设置给出了最佳的整体oracle/DAS评分。这些结果不支持点观测对临近预报无用的结论，但表明局部观测技能和空间相干雷达场技能是不同的目标。实际意义是，稀疏观测可以提供有用的局部约束，但它们对雷达类场的益处取决于训练损失、不确定性表示以及观测支持在模型中的编码方式。

英文摘要

Sparse point observations are increasingly available for precipitation nowcasting, but it is unclear how much they improve dense radar-field forecasts. We partially address this question with a multimodal graph neural network nowcasting system over the Nordic radar domain. The model predicts rain rate every five minutes up to two hours ahead and is trained with different combinations of radar history, MEPS numerical weather prediction, Netatmo surface observations, MSG satellite channels, stochastic noise, and CRPS-based ensemble losses. The study is designed as an ablation of operationally relevant information sources and training objectives. We compare radar-only, NWP-informed, station-informed, satellite-informed, noise-augmented, and CRPS-based configurations using complementary diagnostics on the radar grid, at station locations, for rain onset, and through oracle, displacement, and amplitude scores. The results show that each source improves a different part of the forecast problem. MEPS stabilises radar-only extrapolation, Netatmo observations improve local station and onset diagnostics, and satellite predictors reduce some station-level biases but may activate rain too early when used deterministically. CRPS-based configurations provide the most consistent radar-grid gains, while the combined satellite and CRPS setup gives the best overall oracle/DAS score. These results do not support the conclusion that point observations are uninformative for nowcasting, but they show that local observational skill and spatially coherent radar-field skill are distinct targets. The practical implication is that sparse observations can provide useful local constraints, but their benefit for radar-like fields depends on the training loss, uncertainty representation, and how observation support is encoded in the model.

URL PDF HTML ☆

赞 0 踩 0

2606.18611 2026-06-19 cs.SD cs.AI cs.LG stat.ML 新提交

QC-GAN: A Parameter-Efficient Quaternion Conformer GAN for High-Fidelity Speech Enhancement

QC-GAN: 一种参数高效的四元数Conformer GAN用于高保真语音增强

Shogo Yamauchi, Hideaki Tamori, Makoto Sakai, Yosuke Yamano, Tohru Nitta

发表机构 * The Asahi Shimbun Company（朝日新闻社）； Tokyo Woman's Christian University（东京女子基督教大学）

AI总结提出参数高效的QC-GAN，结合四元数Conformer生成器和MetricGAN训练，通过汉密尔顿积共享权重减少参数量，在VoiceBank+DEMAND上以0.89M参数达到PESQ 3.48，性能媲美两倍大小模型。

Comments 10 pages, 6 figures and 5 tables. Accepted at Interspeech2026

2508.14009 2026-06-19 stat.OT 版本更新

Understanding Pedagogical Content Knowledge of Introductory Data Science Instructors: An Inaugural Framework

理解入门数据科学教师的教学内容知识：一个初步框架

Sinem Demirci, Mine Doğucu, Andrew Zieffler, Joshua M. Rosenberg

AI总结通过访谈14名入门数据科学教师并分析教学大纲，探索其教学内容知识（PCK）的关键组成部分，为教师发展提供见解，并建立IDS领域的PCK初步框架。

Comments 67 pages, 4 tables

2502.06866 2026-06-19 cs.LG cs.AI econ.EM stat.AP stat.ML 版本更新

Global Ease of Living Index: a machine learning framework for longitudinal analysis of major economies

全球生活便利指数：面向主要经济体纵向分析的机器学习框架

Arun Kumar Selvaraj, Tanay Panat, Rohitash Chandra

发表机构 * Transitional Artificial Intelligence Research Group, School of Mathematics and Statistics（过渡人工智能研究组，数学与统计学学院）； Centre for Artificial Intelligence and Innovation（人工智能与创新中心）； Pingla Institute（Pingla研究所）

AI总结提出全球生活便利指数，结合社会经济和基础设施因素，利用机器学习处理缺失数据，并通过主成分分析和因子分析降维，为政策制定者提供改善生活质量的可操作工具。

详情

AI中文摘要

全球经济、地缘政治条件以及COVID-19疫情等破坏性事件对生活成本和生活质量产生了巨大影响。理解主要经济体中生活成本和生活质量的长期影响至关重要。一个透明且全面的生活指数必须包含生活条件的多个维度。在本研究中，我们提出了一种通过全球生活便利指数量化生活质量的方法，该指数将各种社会经济和基础设施因素整合为一个单一综合得分。我们的指数利用定义生活水平的经济指标，这有助于针对特定领域进行干预改进。我们提出了一个机器学习框架来处理特定国家某些经济指标的数据缺失问题。然后，我们整理并更新数据，并使用降维方法（主成分分析和因子分析）创建自1970年以来主要经济体的生活便利指数。我们的工作通过为政策制定者提供识别需要改进领域（如医疗系统、就业机会和公共安全）的实用工具，显著丰富了相关文献。我们的方法使用开放数据和代码，易于复现并适用于各种情境，为生活质量评估的持续研究和政策制定提供了透明度和可访问性。

英文摘要

The drastic changes in the global economy, geopolitical conditions, and disruptions such as the COVID-19 pandemic have impacted the cost of living and quality of life. It is essential to comprehend the long-term implications of the cost of living and quality of life in major economies. A transparent and comprehensive living index must include multiple dimensions of living conditions. In this study, we present an approach to quantifying the quality of life through the Global Ease of Living Index that combines various socio-economic and infrastructural factors into a single composite score. Our index utilises economic indicators that define living standards, which could help in targeted interventions to improve specific areas. We present a machine learning framework to address missing data for certain economic indicators in specific countries. We then curate and update the data and use a dimensionality reduction approach (Principal Component Analysis and Factor Analysis) to create the Ease of Living Index for major economies since 1970. Our work significantly adds to the literature by offering a practical tool for policymakers to identify areas needing improvement, such as healthcare systems, employment opportunities, and public safety. Our approach with open data and code can be easily reproduced and applied to various contexts, providing transparency and accessibility for ongoing research and policy development in quality-of-life assessment.

URL PDF HTML ☆

赞 0 踩 0

2606.19859 2026-06-19 cs.IT cs.LG math.IT math.PR math.ST stat.TH 新提交

Doeblin Curves

Doeblin 曲线

Dongmin Lee, William Lu, Anuran Makur, Japneet Singh

AI总结提出 Doeblin 曲线概念，量化马尔可夫核在不同散度和功率水平下的收缩行为，并应用于噪声迭代优化、噪声电路可靠计算和差分隐私等领域的更细粒度收缩分析。

Comments 42 pages, 2 figures

Journal ref IEEE Transactions on Information Theory, vol. 72, no. 6, pp. 3556-3596, June 2026

详情

DOI: 10.1109/TIT.2026.3678229

AI中文摘要

近期关于 Doeblin 系数的研究揭示了它们作为 TV 距离的 Dobrushin 收缩系数的多路泛化的有用性，这与它们在马尔可夫链遍历性理论中的经典作用不同。然而，为了建立信息收缩的存在性，通常需要强条件，例如远离 0。基于最近提出的非线性信息收缩概念，我们旨在提出一种更细粒度的基于 Doeblin 的多路收缩行为刻画，即使对于 Doeblin 系数为 0 的信道，也能产生非平凡的收缩保证。为此，我们引入了 Doeblin 曲线的概念——一种非线性函数，它量化了马尔可夫核在特定散度和功率水平下对输入分布集合的收缩行为。在我们的分析过程中，我们发展了 Doeblin 系数的新变分刻画，提出了 Doeblin 曲线的若干性质，定义了功率约束 Doeblin 曲线的几个版本，并利用上述变分刻画推导了上下界。然后，我们将这些结果应用于不同领域，包括噪声迭代优化的泛化界、噪声电路可靠计算的误差界以及在线迭代算法的差分隐私保证。特别是，我们将这些领域的结果扩展到更广泛的领域或群体设置，利用 Doeblin 曲线揭示比 Doeblin 系数更细粒度的收缩现象。

英文摘要

Recent research on Doeblin coefficients has shed light on their usefulness as a multi-way generalization of the Dobrushin contraction coefficient for TV distance, in a separate vein from their classic role in the theory of Markov chain ergodicity. However, strong conditions, such as being bounded away from 0, are typically necessary for Doeblin coefficients to establish the existence of information contraction. Building on recently formulated concepts of nonlinear information contraction, we aim to propose a finer-grained Doeblin-based characterization of multi-way contraction behavior which yields non-vacuous contraction guarantees even for channels whose Doeblin coefficient is 0. To this end, we introduce the notion of a Doeblin curve -- a nonlinear function which quantifies the contraction behavior of a Markov kernel on collections of input distributions at specific levels of divergence and power. Through the course of our analysis, we develop a new variational characterization of Doeblin coefficients, present several properties of Doeblin curves, define several versions of power-constrained Doeblin curves, and derive upper and lower bounds using our aforementioned variational characterization. We then utilize these results in diverse areas, including generalization bounds for noisy iterative optimization, error bounds for reliable computation with noisy circuits, and differential privacy guarantees for online iterative algorithms. In particular, we extend results in these areas to broader domains or group settings, leveraging Doeblin curves to reveal finer-grained contraction phenomena than Doeblin coefficients.

URL PDF HTML ☆

赞 0 踩 0

2606.19726 2026-06-19 math.ST stat.TH 新提交

A Laplace equation approach to the Behrens--Fisher problem

Behrens-Fisher问题的拉普拉斯方程方法

Nagananda K G, Jong Sung Kim

AI总结针对两独立正态样本方差未知且不等的情况，提出偏微分方程公式，通过正交分解和球面楔概率将分布问题转化为拉普拉斯-狄利克雷边值问题，导出累积分布函数和概率密度的精确有限样本表示，并得到尾部分布展开。

Comments 31 pages, 4 figures

详情

AI中文摘要

我们针对两个独立正态样本（方差未知且不等）的Behrens-Fisher问题，发展了一种偏微分方程公式。通过正交分解分离均值分量和残差分量（对应于去除均值方向后中心化的样本内变异），并将样本均值的学生化差异重新表述为尺度不变的几何约束。这种简化将分布问题转化为球面楔概率的评估，这些概率被识别为调和测度以及拉普拉斯-狄利克雷边值问题在原点的值。在此框架下，我们导出了累积分布函数和概率密度函数的精确有限样本表示，形式为贝塔函数，仅依赖于样本量和方差比。这些表示将Behrens-Fisher分布置于标准特殊函数形式中，可直接在广泛可用的商业软件（包括Microsoft Excel）中使用，从而便于分布评估和分位数计算。我们还得到了相关调和延拓及其阈值导数的Gegenbauer分离变量展开，系数为封闭的贝塔-伽马形式，并导出了具有显式首项常数和高阶修正的尖锐尾部分布展开。

英文摘要

We develop a partial differential equation formulation of the Behrens-Fisher problem for two independent normal samples with unknown and unequal variances. An orthogonal decomposition separates mean and residual components (corresponding to the centered within-sample variation left after removal of the mean directions) and recasts the studentized difference of sample means as a scale-invariant geometric constraint. This reduction transforms the distributional problem into the evaluation of spherical wedge probabilities, which are identified with harmonic measure and with the value at the origin of a Laplace-Dirichlet boundary value problem. From this framework, we derive exact finite-sample representations for the cumulative distribution function and the probability density function in terms of beta functions, with dependence only on the sample sizes and the variance ratio. These representations place the Behrens-Fisher law in a standard special-function form that is directly accessible in widely available commercial software -- including Microsoft Excel -- thereby facilitating distributional evaluation and quantile computation. We also obtain a Gegenbauer separation-of-variables expansion for the associated harmonic extension and its threshold derivative, with coefficients in closed Beta-Gamma form, and derive sharp tail expansions with explicit leading constants and higher-order corrections.

URL PDF HTML ☆

赞 0 踩 0

2606.11171 2026-06-19 cs.LG cond-mat.stat-mech cs.IT math.IT math.OC math.ST stat.TH 新提交

Indexed Bellman Information Complexity

核赌博机中的算法与极小极大复杂度

Yunbei Xu

AI总结本文通过统一MAIR框架，将GP-UCB与MAMS算法置于共同语言下，提出结合两者优势的安全主算法，并证明在过参数化模型中算法复杂度比类宽极小极大或DEC证书更具信息性。

详情

AI中文摘要

高斯过程上置信界（GP-UCB）和决策估计系数（DEC）方法乍看之下可能属于不同的理论。本文将这两种观点置于一个共同的算法信息语言中，用于频率学派RKHS赌博机。GP-UCB固定了一个算法性的（而非真实的）高斯过程先验，并利用实现轨迹的复杂度以及计算可处理性，而MAMS优化了一个鲁棒的类宽MAIR/DEC包络。通过统一的MAIR框架和异质半正定算法先验，我们推广了GP-UCB分析和MAMS算法，提出了一种结合两者优势的安全主算法，并提供了一个核赌博机构造，表明在过参数化模型中算法复杂度可以比类宽极小极大或DEC证书更具信息性。由此得出的信息是：算法信息和类宽极小极大系数回答不同的问题，并可能导致不同的差距；核赌博机提供了一个干净的环境，使得这种区别在数学上变得可见。

英文摘要

We develop indexed Bellman information complexity, a representation-level theory of interactive decision making centered on information indices and reference histories. The representation strips away problem-specific syntax and retains only the ingredients needed for dynamic programming and information accounting, thereby unifying the earlier framework of indexed algorithmic information ratios (AIR). On the upper-bound side, regret is controlled by Bellman supersolutions or potential identities whose gradient bracket is paid for by indexed information. Upper-confidence-bound (UCB), estimation-to-decision/decision-estimation-coefficient (E2D/DEC), and adaptive-minimax-sampling or exploration-by-optimization (AMS/EBO) methods appear as three relaxations of this same identity. On the lower-bound side, the posterior-reference trajectory supplies both the information telescope and the ghost quantile of small-regret trajectories. The resulting critical radius in the lower bound is an effective-dimension-scale quantity, as in Fano and local-prior-mass lower bounds, rather than the constant radius of a two-point Le Cam argument. The examples show that DEC is best viewed as a one-step relaxation of indexed Bellman information complexity, not as a universally tight conversion mechanism. We illustrate the framework through several applications, with particular emphasis on kernel bandits. In this setting, the active action marginal provides a concrete basis for comparing UCB, E2D, and AMS/EBO.

URL PDF HTML ☆

赞 0 踩 0

2605.20541 2026-06-19 math.ST math.PR stat.TH 版本更新

Finite-Sample Bounds for Expected Signature Estimation under Weak Dependence

有限样本下弱依赖条件下期望签名估计的界限

Bryson Schenck

AI总结本文研究了在弱依赖条件下，从单一长依赖轨迹估计期望签名的有限样本界限，通过块平均估计器证明了非渐近的均方误差界，并探讨了在不同Hurst指数下的收敛性。

Comments 59 pages, 1 figure

详情

AI中文摘要

期望签名在满足矩增长条件时唯一确定随机粗糙路径的分布，但此前缺乏从单一长依赖轨迹估计其有限样本界限。本文研究了一个平稳随机过程，其样本路径可解释为几何粗糙路径，被划分为等间距观测的块，并证明了块平均估计器的非渐近均方误差界。当路径的Hölder正则性至多为1/2时，需要粗糙路径理论来定义估计量，因为Young积分和Riemann-Stieltjes积分无法定义签名的迭代积分。在矩、平稳性和块签名协方差衰减条件（严格弱于α-混合且适用于长程依赖驱动器）下，误差分为离散化项和波动项，其速率分别由路径正则性和依赖强度决定。通过逐层粗糙因子方差分析，保持有限截断常数显式，并在固定观测预算下获得最优分配规则。本文验证了分数奥本海姆-乌伦贝克过程在三个制度下的假设，即粗糙（Hurst H<1/2）、半鞅（H=1/2）和长程（H>1/2）。蒙特卡罗实验显示经验收敛速率快于理论上界。

英文摘要

The expected signature uniquely determines the law of a random rough path under a moment-growth condition, yet finite-sample bounds for estimating its truncations from a single long dependent trajectory remain unavailable. We study a strictly stationary stochastic process equipped with a geometric rough-path lift, observed in non-overlapping blocks of equally-spaced samples, and prove a non-asymptotic mean-squared error (MSE) bound for the block-averaging estimator of its truncated expected signature. Under moment and stationarity assumptions together with a direct covariance-decay condition on block signatures -- strictly weaker than $α$-mixing and applicable to long-range-dependent processes -- the error separates into a discretization term and a fluctuation term, with rates determined respectively by path regularity and dependence strength. A levelwise rough-factorial variance analysis keeps finite-truncation constants explicit and yields an optimal allocation rule under a fixed observation budget. We verify the assumptions for independent-coordinate fractional Ornstein--Uhlenbeck processes in three regimes: short-range (Hurst $1/4<H<1/2$), semimartingale ($H=1/2$), and long-range ($H>1/2$); in all three, the block-signature covariance is summable, so the fluctuation term decays at the same rate as in the independent-block case, even under long memory at $H>1/2$. Monte Carlo experiments show empirical slopes steeper than the guaranteed upper-bound rates.

URL PDF HTML ☆

赞 0 踩 0

2604.02336 2026-06-19 math.FA math.ST stat.TH 版本更新

The Shift Operator Calculus for Stationary Time Series Analysis

平稳时间序列分析的移位算子演算

Anand Ganesh, Babhrubahan Bose, Anand Rajagopalan

AI总结本文为平稳时间序列建模建立了严格的移位算子演算，证明了不同函数族下转移函数算子的存在性和等距性，并统一了平稳过程可逆性与转移函数算子可逆性的概念。

Comments 7 pages

2602.04550 2026-06-19 quant-ph math.ST stat.TH 版本更新

Locally Gentle State Certification for High Dimensional Quantum Systems

高维量子系统的局部温和态认证

Cristina Butucea, Jan Johannes, Henning Stein

AI总结研究局部温和量子态认证中非破坏性测量的信息代价，推导出样本复杂度为Θ(d³/(ε²α²))，揭示了α-温和性惩罚与希尔伯特空间维度d的线性关系。

详情

AI中文摘要

量子统计推断的标准方法依赖于引起波函数坍缩的测量，从而消耗量子态以提取信息。在本工作中，我们研究了\emph{局部温和}量子态认证的基本极限，其中学习算法被限制在迹范数下最多扰动态$\alpha$，从而允许样本重用。我们分析了区分未知态$\rho$等于参考态$\rho_0$还是与其$\epsilon$-远的问题。我们推导了该问题的极小极大样本复杂度，量化了非破坏性测量的信息代价。具体地，通过构造显式测量算子，我们证明了$\alpha$-温和性约束施加了$\frac{d}{\alpha^2}$的样本量惩罚，导致总样本复杂度为$n = \Theta(\frac{d^3}{\epsilon^2 \alpha^2})$。我们的结果阐明了信息提取与态扰动之间的权衡，并突出了量子学习中物理测量约束与隐私机制之间的深层联系。关键地，我们发现施加$\alpha$-温和性所导致的样本量惩罚与希尔伯特空间维度$d$呈线性关系，而非高维私有估计中典型的参数数量$d^2-1$。

英文摘要

Standard approaches to quantum statistical inference rely on measurements that induce a collapse of the wave function, effectively consuming the quantum state to extract information. In this work, we investigate the fundamental limits of \emph{locally-gentle} quantum state certification, where the learning algorithm is constrained to perturb the state by at most $α$ in trace norm, thereby allowing for the reuse of samples. We analyze the hypothesis testing problem of distinguishing whether an unknown state $ρ$ is equal to a reference $ρ_0$ or $ε$-far from it. We derive the minimax sample complexity for this problem, quantifying the information-theoretic price of non-destructive measurements. Specifically, by constructing explicit measurement operators, we show that the constraint of $α$-gentleness imposes a sample size penalty of $\frac{d}{α^2}$, yielding a total sample complexity of $n = Θ(\frac{d^3}{ε^2 α^2})$. Our results clarify the trade-off between information extraction and state disturbance, and highlight deep connections between physical measurement constraints and privacy mechanisms in quantum learning. Crucially, we find that the sample size penalty incurred by enforcing $α$-gentleness scales linearly with the Hilbert-space dimension $d$ rather than the number of parameters $d^2-1$ typical for high-dimensional private estimation.

URL PDF HTML ☆

赞 0 踩 0

2504.09564 2026-06-19 math.ST stat.TH 版本更新

The weak-feature-impact effect on the NPMLE in monotone binary regression

单调二元回归中弱特征影响对NPMLE的影响

Dario Kieffer, Angelika Rohde

AI总结研究单调二元回归中非参数最大似然估计在弱特征关系下的极限分布，发现一种新的分布连续插值于两个极端情况，并改进了小样本近似。

Comments Added Theorem 3.3 and several visualizations

详情

AI中文摘要

统计文献提供了单调二元回归中非参数最大似然估计（NPMLE）在两种极端情况下的逐点极限分布：如果特征-标签关系严格单调且足够光滑，则以立方根$n$速率收敛，具有缩放Chernoff型极限分布；如果底层关系平坦，则以参数$\sqrt{n}$速率收敛。本文提供了NPMLE分布演变的完整图景，揭示了一种新的极限分布，在弱特征-标签关系的情况下，为小样本提供了显著更好的分布近似。该分布被证明连续插值于两个极端情况之间。确定该分布的创新方法是将其作为新引入的弱特征影响三角阵列中NPMLE的极限，针对特定的参数-样本量配置。此外，在适当缩放的$L^{1}$误差中同样观察到弱特征影响场景下的相变。作为副产品，获得了平坦回归函数下的极限分布，这是先前未知的。证明开发了一种全新的策略，特别是不基于开关关系。伴随这些结果的新型局部极小极大下界。

英文摘要

Statistical literature provides pointwise limiting distributions of the nonparametric maximum likelihood estimator (NPMLE) in monotone binary regression for the two extremal cases: If the feature-label relation is strictly monotone and sufficiently smooth, it converges at a cube-root-$n$ rate with scaled Chernoff-type limiting distribution, and it converges at the parametric $\sqrt{n}$-rate if the underlying relation is flat. In this article, we provide the complete picture of the distributional metamorphosis of the NPMLE, revealing a new limiting distribution which provides a significantly better distributional approximation for small samples in case of a weak feature-label relationship. It is shown to continuously interpolate between the two extremal cases. The innovative way to determine this distribution is to generate it as a limit of the NPMLE in the newly introduced weak-feature-impact triangular array for a particular parameter-sample-size constellation. Moreover, the phase transition is likewise observed for the suitably rescaled $L^{1}$-error in this weak-feature-impact scenario. As a by-product, its limiting distribution for flat regression functions is obtained, which was unknown before. The proof develops a completely new strategy, notably not based on the switch relation. A novel type of local minimax lower bounds accompanies these results.

URL PDF HTML ☆

赞 0 踩 0

2507.15475 2026-06-19 eess.SP math.PR stat.AP

On the Distribution of a Two-Dimensional Random Walk with Restricted Angles

二维受限角度随机游走的分布

Karl-Ludwig Besser

AI总结研究受限角度二维随机游走的分布，推导两步联合与边缘分布，提供一般步数的数值解及大步数近似，明确支持集的精确描述。

Comments 14 pages, 14 figures

Journal ref IEEE Transactions on Signal Processing, vol. 74, pp. 2316-2330, 2026

2506.23396 2026-06-19 stat.ML cs.LG

AICO: Feature Significance Tests for Supervised Learning

Kay Giesecke, Enguerrand Horel, Chartsiri Jirachotkulthorn

发表机构 * Stanford University, Department of Management Science and Engineering and Institute for Computational and Mathematical Engineering（斯坦福大学管理科学与工程系和计算与数学工程研究所）； Upstart, Inc.（Upstart公司）； Stanford University, Institute for Computational and Mathematical Engineering（斯坦福大学计算与数学工程研究所）

2412.20298 2026-06-19 cs.LG cs.CY stat.ML

An Experimental Study on Fairness-aware Machine Learning for Credit Scoring Problems

Huyen Giang Thi Thu, Thang Viet Doan, Ha-Bang Ban, Tai Le Quy

发表机构 * Banking Academy of Vietnam（越南银行学院）； Vietnam Academy of Science and Technology（越南科学技术 academy）； Hanoi University of Science and Technology（河内科学技术大学）； University of Koblenz（科隆大学）

Comments The manuscript is submitted to Springer Nature's journal

2510.05013 2026-06-19 stat.ML cs.LG

Curiosity-Driven Development of Action and Language in Robots Through Self-Exploration

通过自我探索的机器人好奇心驱动行为与语言发展

Theodore Jerome Tinker, Kenji Doya, Jun Tani

发表机构 * Okinawa Institute of Science and Technology（冲绳科学技术大学院大学）

AI总结本研究通过好奇心驱动的机器人自我探索，结合Q学习实现主动推理，揭示了组合泛化、快速学习、先配对后组合以及异常处理导致的U型发展模式，为人类高效语言习得提供解释。

Comments 27 pages, 22 pages of supplementary material

详情

AI中文摘要

婴儿通过极少的经验就能泛化习得语言，而大型语言模型需要数十亿的训练标记。人类高效发展的基础是什么？我们通过实验研究了这一问题，其中机器人代理通过好奇心驱动的自我探索学习执行与祈使句（例如，推红色立方体）相关的动作。我们的方法使用Q学习摊销主动推理，实现内在动机的发展性学习。模拟揭示了与发展心理学观察相对应的关键发现。i) 随着组合元素规模的增加，泛化能力显著提高。ii) 好奇心驱动的探索能够加速学习。iii) 句子和动作的机械配对先于组合泛化。iv) 异常处理导致U型发展表现，这种模式类似于儿童语言学习中的表征重述。这些结果表明，好奇心驱动的主动推理解释了内在动机的感觉运动-语言学习如何支持人类和人工代理中的可扩展组合泛化和异常处理。

英文摘要

Infants acquire language with generalization from minimal experience, whereas large language models require billions of training tokens. What underlies efficient development in humans? We investigated this problem through experiments wherein robotic agents learn to perform actions associated with imperative sentences (e.g., push red cube) via curiosity-driven self-exploration. Our approach amortizes active inference using Q-learning, enabling intrinsically motivated developmental learning. The simulations reveal key findings corresponding to observations in developmental psychology. i) Generalization improves drastically as the scale of compositional elements increases. ii) Curiosity-driven exploration enables faster learning. iii) Rote pairing of sentences and actions precedes compositional generalization. iv) Exception-handling induces U-shaped developmental performance, a pattern like representational redescription in child language learning. These results suggest that curiosity-driven active inference accounts for how intrinsically motivated sensorimotor-linguistic learning supports scalable compositional generalization and exception handling in humans and artificial agents.

URL PDF HTML ☆

赞 0 踩 0

2505.01318 2026-06-19 stat.ME

Modeling Large Nonstationary Spatial Data with the Full-Scale Basis Graphical Lasso

用全尺度基图拉索方法建模大非平稳空间数据

Matthew LeDuc, William Kleiber, Tomoko Matsuo

AI总结本文提出了一种结合隐含低秩过程和稀疏协方差模型的新方法，用于建模大非平稳空间数据，通过灵活的图高斯马尔可夫随机场模型对低秩组件系数进行建模，并结合全尺度近似和基图拉索方法，提出全尺度基图拉索方法（FSBGL），采用图拉索惩罚似然进行估计，通过差异凸方案优化，通过合成场和热层高分辨率模拟数据集验证，与现有空间模型相比，在有限训练数据下更能捕捉热层温度场的显著特征。

详情

DOI: 10.1016/j.spasta.2026.101007

AI中文摘要

我们提出了一种新的方法，用于建模大非平稳空间过程的数据集，该方法结合了隐含的低秩过程和稀疏协方差模型。低秩组件的系数被赋予了灵活的图高斯马尔可夫随机场模型。利用低秩和紧支撑协方差结构结合了全尺度近似和基图拉索；我们称这种新方法为全尺度基图拉索（FSBGL）。估计采用图拉索惩罚似然，通过差异凸方案进行优化。我们在合成场以及具有挑战性的高分辨率热层模拟数据集上展示了所提出的方法。在与现有空间模型的比较中，即使在可用训练数据有限的情况下，FSBGL在捕捉热层温度场的显著特征方面表现更好。

英文摘要

We propose a new approach for the modeling large datasets of nonstationary spatial processes that combines a latent low rank process and a sparse covariance model. The low rank component coefficients are endowed with a flexible graphical Gaussian Markov random field model. The utilization of a low rank and compactly-supported covariance structure combines the full-scale approximation and the basis graphical lasso; we term this new approach the full-scale basis graphical lasso (FSBGL). Estimation employs a graphical lasso-penalized likelihood, which is optimized using a difference-of-convex scheme. We illustrate the proposed approach on synthetic fields as well as with a challenging high-resolution simulation dataset of the thermosphere. In a comparison against state-of-the-art spatial models, the FSBGL performs better at capturing salient features of the thermospheric temperature fields, even with limited available training data.

URL PDF HTML ☆

赞 0 踩 0

2408.15920 2026-06-19 math.ST math.PR stat.TH

Nonlinear Filtering and Spatial Asymptotic Consistency for SPDEs Observed via Spatio-Temporal Point Processes

Jan Szalankiewicz, Cristina Martinez-Torres, Wilhelm Stannat

Comments Fixed several typos throughout the manuscript, substantially revised Section 4 with improved theoretical bounds, and updated simulations with corresponding code base improvements

Journal ref Stoch PDE: Anal Comp (2026)

2307.06655 2026-06-19 stat.ME

Stochastic Reaction-Diffusion Systems in Biophysics: Towards a Toolbox for Quantitative Model Evaluation

Gregor Pasemann, Carsten Beta, Wilhelm Stannat

Journal ref In: Stich, M., Carballido-Landeira, J. (eds) Nonlinear Dynamics for Biological Systems. SEMA SIMAI Springer Series, vol 40, 2025, Springer, Cham

1812.05678 2026-06-19 stat.ME

Objective-Driven Ensembles: Bridging the Gap Between Interpretable Sparsity and Algorithmic Prediction

目标驱动集成：弥合可解释稀疏性与算法预测之间的差距

Anthony Christidis, Stefan Van Aelst, Ruben Zamar

AI总结本文提出目标驱动集成方法，通过将最优子集选择推广为联合数学优化问题，生成可解释的集成模型，并理论证明惩罚预测变量重叠可限制预测协方差、减轻有限样本虚假相关的影响，实现机器学习级精度与稀疏模型可解释性的兼顾。

详情

AI中文摘要

稀疏方法（如最优子集选择、弹性网）是获得可解释模型的标准方法，但可能遭受高方差和易受虚假相关影响的问题。另一方面，算法集成（如随机森林、梯度提升）实现了高预测精度，但产生了由随机化或顺序残差拟合驱动的不可解释黑箱。近年来，一种统一的范式出现了：目标驱动集成。通过将最优子集选择推广为联合数学优化问题，该方法通过将预测变量最优地分配到少量不同模型中来生成可解释的集成。在本文中，我们综合了这一日益增长的文献，并为其经验成功提供了理论见解。具体来说，我们表明惩罚预测变量重叠在数学上限制了预测协方差，并减轻了有限样本虚假相关的影响。我们使用精确的组合预言机证明了这些性质，并回顾了最近的计算近似如何成功地将这一框架扩展到各种领域，包括高维数据、分类任务以及存在逐案例或逐单元污染的场景，实现了机器学习级别的精度，同时保留了稀疏模型的可解释性。

英文摘要

Sparse methods (e.g., Best Subset Selection, Elastic Net) are the standard approach for obtaining interpretable models, but they can suffer from high variance and vulnerability to spurious correlations. Alternatively, algorithmic ensembles (e.g., Random Forests, Gradient Boosting) achieve high prediction accuracy but yield uninterpretable black boxes driven by randomization or sequential residual fitting. In recent years, a unifying paradigm has emerged: Objective-Driven Ensembles. By generalizing best subset selection into a joint mathematical optimization problem, this approach generates interpretable ensembles by optimally splitting predictors across a small number of diverse models. In this paper, we synthesize this growing body of literature and illustrate the statistical principles driving its empirical success. Specifically, we utilize finite-sample bounds to demonstrate how penalizing predictor overlap controls ensemble covariance and provides a mathematical hedge against spurious correlations. We evaluate these mechanics using an exact combinatorial oracle, and review how recent computational approximations have successfully scaled this framework to a variety of domains, including high-dimensional data, classification tasks, and settings with casewise or cellwise contamination, achieving machine-learning-level accuracy while retaining the interpretability of sparse models.

URL PDF HTML ☆

赞 0 踩 0

1909.03488 2026-06-19 math.AT cs.CG math.PR math.ST stat.TH

Probabilistic Convergence and Stability of Random Mapper Graphs

Adam Brown, Omer Bobrowski, Elizabeth Munch, Bei Wang

1406.0214 2026-06-19 eess.SY cs.SY math.AT stat.ML

Topological and Statistical Behavior Classifiers for Tracking Applications

拓扑与统计行为分类器用于跟踪应用

Paul Bendich, Sang Chin, Jesse Clarke, Jonathan deSena, John Harer, Elizabeth Munch, Andrew Newman, David Porter, David Rouse, Nate Strawn, Adam Watkins

AI总结本文提出基于多假设跟踪、拓扑数据分析和机器学习的统一理论，通过拓扑特征编码行为信息，利用统计模型拟合拓扑特征分布，并结合目标类型分类方法提升跟踪性能。

1. 统计理论与方法 10 篇

Flexible modeling of bimodal distributions via skewed-$t$ mixtures

Analysis of uncertain fixed-effects model for Latin square designs

A minimum-risk and cost-efficient two-sample sequential testing framework for the shifted exponential models with application to precipitation data

Calibration without labels in multiple testing

Machine Learning Integrated in Wavelet Shrinkage (MLShrink)

SCOPE Shrinkage: A Unified Framework for Wavelet Denoising

Zero-Shot Active Feature Acquisition via LLM-Elicitation

A Necessary and Sufficient Condition for Size Controllability of Heteroskedasticity Robust Test Statistics

Interpolated Quantile Estimation: A Unified Framework Bridging Quantiles and the Mean

Benign overfitting beyond prediction: The ordinary least squares interpolator

2. 贝叶斯统计与概率建模 4 篇

Overfitted high-dimensional matrix factorizations via adaptive spectral shrinkage

Variational Consensus Monte Carlo for Bayesian Mixture

Leveraging tails for adaptation

Weighted Bayesian Conformal Prediction

3. 因果推断与实验设计 10 篇

A case study of causal mediation using Bayesian nonparametrics and semiparametric corrections

A Law of Iterated Expectation Primer for Causal Inference

Off-Policy Evaluation for Missingness-Aware Policies in MDPs with Rewards Missing Not at Random

Kernel-Based Functional Balancing for Causal Inference with Compositional Treatments

Statistical Foundations of LLM-based A/B Testing: A Surrogacy Framework for Human Causal Inference

Invariant quantile regression for heterogeneous environments

A causal framework for evaluating the total effect of strategies aiming to expand screening and to improve outcomes

A Practical Introduction to Regression-based Causal Inference in Meteorology (I): All confounders measured

A Practical Introduction to Regression-based Causal Inference in Meteorology (II): Unmeasured confounders

Causal Inference with High-dimensional Discrete Covariates

4. 高维统计与正则化 1 篇

Hypergraph Variable Selection with False Discovery Rate Control

5. 时间序列与空间统计 1 篇

Practical Forecasting of Environmental Maps: A Functional Data Approach

6. 计算统计与MCMC 12 篇

AK-MCS-C2 : Active Kriging Monte Carlo Simulation method with conformal certification for failure probability estimation

DASH: A Dimensionality Reduction Method for Large-scale Convex MIQP with Applications in Subset Portfolio Selection

Establishing an $Ω(\sqrt{d})$ complexity lower bound for PDMP samplers and how to break it: a sub-$\sqrt{d}$ algorithm for Gaussian-tailed targets

A Flat Connection: The Pooling Factor and the Geometry of Centring in Hierarchical MCMC

Computational Identifiability

Folded Transport MCMC: Eliminating Label Switching by Sampling on a Fundamental Domain

Q-approximation of operating characteristics of clinical trial designs

Probabilistic function-on-function nonlinear autoregressive model for emulation and reliability analysis of stochastic dynamical systems

Robust, partially alive particle Metropolis-Hastings via the Frankenfilter

Alternating Direction Method of Multipliers for Nonlinear Matrix Decompositions

Flow Matching for Efficient and Scalable Data Assimilation

Towards practical PDMP sampling: Metropolis adjustments, locally adaptive step-sizes, and NUTS-based time lengths

7. 机器学习统计基础 20 篇

SSH-Net: A Deep Neural Network for Predicting Failure Time Distribution Functions under Competing Risks with Application to GPU Data

AURA: Adaptive Uncertainty-aware Refinement for LLM-as-a-Judge Auditing

A Solver-Free Training Method for Predict-then-Optimize

The Representational Limit of Scalar Interactions: An Interventional Decomposition

Matching Markets meet Cumulative Prospect Theory: Towards Optimal and Adversarially Robust Learning

Which Pairs to Compare for LLM Post-Training?

Algebraic Dead Directions in LayerNorm Transformers: A Forward-Pass-Only Diagnostic at LLM Scale

Optimal Deterministic Multicalibration and Omniprediction

Stochastic Linear Contextual Bandits with Bounded Noise: A Set-Membership Approach

On the Oracle Complexity of Interpolation-Based Gradient Descent

Robust $Q$-learning for mean-field control under Wasserstein uncertainty in common noise

Statistical Properties of Training & Generalization

Information Theory and Statistical Learning

Attention-based PCA

Learning to Emulate Chaos: Adversarial Optimal Transport Regularization

Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization

Stabilizing Bandits using Regularization: Precise Regret and A Quantitative Central Limit Theorem

Meta Flow Maps enable scalable reward alignment

Phase Transition for Stochastic Block Model with more than $\sqrt{n}$ Communities

Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings

8. 生物统计与医学统计 8 篇

Anchors Away: Navigating Unanchored Indirect Comparisons with Multilevel Unanchored Meta-Regression (ML-UMR)

Built-in Selection Bias in Proportional Hazards Models with Omitted Covariates: Simulation Evidence and Alternative Approaches

The Ghosh-Lin and Fine-Gray models for a mix of administrative and random censoring

Covariate-Adjusted Functional Principal Components Analysis for Modeling Hazard Rates of Physical Activity in the US Population

A Bayesian spatio-temporal nearest neighbor Gaussian process model for pooled genetic data

West Nile virus outbreak in Italy modelled with the quantum Game of Life

Efficient Cumulative Incidence Estimation in Biobank Studies Using All Prevalent and Incident Events

Flexible aggregation of compositional predictors with shared effects for microbiome association analysis

9. 经济金融与社会科学统计 9 篇

Two-Sample IV: Efficient Two-Step Estimation and Tests for Overidentification and Weak-Instruments

Advanced Calibration Analysis and Tools: Identifying Influential Observations in Stochastic Interest Rate Model Calibration

Dynamic Core Allocation for Malleable Jobs with Unknown Speed-up Parameters

A Model-Agnostic Bootstrap for Macro-Level Claims Reserving Under the Conditioning Principle

The Negative Binomial Chain-Ladder: A Full Likelihood Model for Claim Count Reserving