arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.17424 2026-06-17 stat.ME 新提交

The dangers of using three-number summaries to estimate unknown standard deviations: sensitivity analyses and some possible improvements incorporating shape

使用三数汇总估计未知标准差的风险：敏感性分析及结合形状信息的改进方法

Udara Kumaranathunga, Alysha De Livera, Luke A. Prendergast

AI总结本文揭示三数汇总（最小值、中位数、最大值）不足以可靠估计标准差，提出基于缩放Beta分布的新估计器，并开发敏感性分析工具以提高推断可靠性。

详情

AI中文摘要

参数自适应过渡多项式：Kunchenko随机多项式的带符号奇偶连续α扩展

Serhii Zabolotnii

AI总结本文提出了一种参数自适应过渡多项式（PATP），作为Kunchenko随机多项式的带符号奇偶连续α扩展，通过连续参数α在[0,1]范围内控制，解决了非高斯误差下的参数估计问题，并探讨了其在极端厚尾分布中的应用边界。

Comments 38 pages, 8 figures. Code and Lean 4 proofs: this https URL (https://github.com/SZabolotnii/Ku-PATP-code-supplement). v3: the full F_2^{-1}b estimator is now used throughout (its Monte Carlo g_2(alpha) converges to the closed form); added regression and real-data (EuStockMarkets) validations; scope restricted to symmetric error laws; corrected the Laplace g_2 illustration

详情

AI中文摘要

Kunchenko的多项式最大化方法提供了一种半参数工具，用于在非高斯误差下的参数估计，但其经典幂基依赖于有限的高阶整数矩。本文引入了参数自适应过渡多项式（PATP），一种由连续参数α在[0,1]范围内控制的带符号奇偶分数幂家族。二次指数映射p_i(α)连接了分数 regime p_i(0)=1/i，退化线性点p_i(1/2)=1和带符号奇偶整数幂 regime p_i(1)=i。对于S=2的情况，我们推导出一个闭式方差减少系数g_2(α)，以带符号和绝对分数矩表示，识别了α=1/2处的奇异行为，并陈述了在何种矩和正则性条件下该公式有意义。该构造应被视为Kunchenko广义装置内的Form-B PATP类比，而不是在α=1时的精确恢复经典偶幂PMM基。使用标准分布的数值示例来检验带符号奇偶估计量的有限样本行为，并标记极厚尾情况如Cauchy的适用边界。

英文摘要

Kunchenko's method of polynomial maximization provides a semiparametric apparatus for parameter estimation under non-Gaussian errors, but its classical power basis relies on finite higher-order integer moments. This paper introduces the Parametrically Adaptive Transition Polynomial (PATP), a signed-parity fractional-power family controlled by a continuous parameter alpha in [0,1]. The quadratic exponent map p_i(alpha) connects the fractal regime p_i(0)=1/i, the degenerate linear point p_i(1/2)=1, and the signed-parity integer-power regime p_i(1)=i. For the degree-S=2 case we derive a closed-form variance-reduction coefficient g_2(alpha) in terms of signed and absolute fractional moments, identify the singular behavior at alpha=1/2, and state the moment and regularity conditions under which the formula is meaningful. The construction should be read as a Form-B PATP analogue within Kunchenko's generalized apparatus, not as an exact recovery of the canonical even-power PMM basis at alpha=1. Numerical illustrations on canonical distributions are used to examine the finite-sample behavior of the signed-parity estimator and to mark the boundary of applicability for extremely heavy-tailed cases such as Cauchy.

URL PDF HTML ☆

赞 0 踩 0

2501.10729 2026-06-17 stat.ME cs.LG stat.ML 版本更新

基于深度学习的OCO-2光谱大气CO2摊销概率检索：结合拉普拉斯近似与归一化流

Alejandro Calle-Saldarriaga, Felix Jimenez, Jack Grosskreuz, Jiazheng Wang, Jonathan Hobbs, Matthias Katzfuss

发表机构 * University of Wisconsin–Madison（威斯康星大学麦迪逊分校）； Jet Propulsion Laboratory, California Institute of Technology（加州理工学院喷气推进实验室）

AI总结提出深度学习框架，利用拉普拉斯近似和归一化流从OCO-2光谱中快速、准确地检索大气CO2浓度，并量化不确定性，相比传统方法加速数个数量级且精度更高。

Comments 23 pages, 8 figures

详情

AI中文摘要

基于空间的大气二氧化碳（CO2）监测对于约束全球碳收支至关重要。NASA的轨道碳观测者-2号（OCO-2）利用高分辨率光谱估算柱平均干空气CO2摩尔分数（XCO2）。然而，当前的操作检索算法计算成本高且未能正确量化不确定性。我们提出了一种新颖的深度学习框架来解决这些挑战。由于真实卫星观测的地面真值数据难以获取，我们使用高保真模拟数据集开发并验证了我们的方法。该数据集旨在支持OCO-2不确定性量化（UQ），并包含了真实的前向模型误差。我们的架构使用多分支神经网络编码光谱波段，并通过两种可扩展的UQ方法——拉普拉斯近似和归一化流——来估计完整CO2柱或其所需汇总的后验分布。与操作性的“全物理”求解器相比，我们的方法具有五个关键优势：（1）摊销：推理速度提高数个数量级，能够实时处理海量数据流；（2）模型误差鲁棒性：通过在明确包含模型差异的模拟数据上训练，我们的方法考虑了标准反演中常被忽略的系统误差；（3）点估计精度：与基线方法相比，我们实现了更优的预测精度；（4）改进的UQ：概率输出提供了校准更好的不确定性估计；（5）非高斯后验：当使用归一化流时，我们的框架成功建模了复杂、非对称的后验分布，克服了高斯假设的局限性。这些结果表明，基于模拟的深度学习是迈向下一代操作处理系统的可行路径。

英文摘要

Space-based monitoring of atmospheric carbon dioxide (CO2) is essential for constraining the global carbon budget. NASA's Orbiting Carbon Observatory-2 (OCO-2) estimates column-averaged dry-air mole fractions of CO2 (XCO2) using high-resolution spectra. However, current operational retrieval algorithms are computationally expensive and do not properly quantify uncertainties. We present a novel deep learning framework that addresses these challenges. Due to the difficulties of ground-truth data for real satellite observations, we develop and validate our approach using a high-fidelity simulation dataset. This dataset, created to support OCO-2 uncertainty quantification (UQ), incorporates realistic forward model errors. Our architecture encodes spectral bands using a multi-branch neural network and estimates posteriors of the full CO2 column or desired summaries thereof using two scalable UQ methods: Laplace approximations and normalizing flows. Our approach has five key advantages relative to operational "full-physics" solvers: (1) Amortization: Inference is orders of magnitude faster, enabling real-time processing of massive data streams; (2) Model error robustness: By training on simulations that explicitly include model discrepancies, our method accounts for systematic errors often neglected by standard inversions; (3) Point estimate accuracy: We achieve superior predictive accuracy compared to baseline methods; (4) Improved UQ: The probabilistic outputs yield better-calibrated uncertainty estimates; and (5) Non-Gaussian posteriors: When utilizing normalizing flows, our framework successfully models complex, asymmetric posterior distributions, overcoming the limitations of the Gaussian assumption. These results suggest that simulation-based deep learning is a viable path toward next-generation operational processing systems.

URL PDF HTML ☆

赞 0 踩 0

2606.17343 2026-06-17 cs.CV stat.AP 新提交

Bayesian Magnetic Resonance Joint Image Reconstruction and Uncertainty Quantification using Sparsity Prior Models and Markov Chain Monte Carlo Sampling

贝叶斯磁共振联合图像重建与不确定性量化：基于稀疏先验模型和马尔可夫链蒙特卡洛采样

Ahmed Karam Eldaly, Matteo Figini, Daniel C. Alexander

发表机构 * Department of Computer Science, University of Exeter（埃克塞特大学计算机科学系）； UCL Hawkes Institute, Department of Computer Science, University College London（伦敦大学学院计算机科学系霍克斯研究所）

AI总结提出一种基于压缩感知磁共振图像重建的不确定性量化框架，采用贝叶斯线性逆问题建模，利用稀疏先验（总变分或小波变换）和分裂增广吉布斯采样器进行MCMC采样，在单线圈和多线圈数据集上验证了优于优化方法和深度学习方法的图像重建与不确定性量化性能。

详情

AI中文摘要

我们提出了一种新的框架，用于使用压缩感知磁共振图像重建进行不确定性量化。该问题在贝叶斯框架内被表述为线性逆问题，并为未知模型参数分配先验分布。具体而言，待重建的图像在给定基下被假设为稀疏的。我们开发了一个适用于任何基的通用框架，并作为示例，测试了图像在（1）空间梯度（使用总变分先验模型）和（2）小波变换中的稀疏性。然后，采用基于分裂增广吉布斯采样的马尔可夫链蒙特卡洛（MCMC）方法从未知参数的后验分布中采样。使用近端MCMC方法有效采样不可微的条件分布。所提出的算法在单线圈和多线圈数据集上使用各种k空间子采样模式和比率进行了验证。结果表明，与对应的基于优化的方法相比，每种提出的方法在图像重建方面具有优越性能。此外，与现有的基于深度学习的方法相比，我们的框架有效地量化了不确定性，显示估计的不确定性图与使用真实值和重建图像计算的误差图之间存在显著相关性。

英文摘要

We propose a novel framework for uncertainty quantification using compressed sensing magnetic resonance image reconstruction. The problem is formulated within a Bayesian framework as a linear inverse problem, with prior distributions assigned to the unknown model parameters. Specifically, the image to be reconstructed is assumed to be sparse in a given basis. We develop a general framework applicable to any basis and as examples, we test the sparsity of the image in its (1) spatial gradients using a total variation prior model, and in its (2) wavelet transform. A Markov chain Monte Carlo (MCMC) method, based on a split-and-augmented Gibbs sampler, is then employed to sample from the posterior distribution of the unknown parameters. The non-differentiable conditional distributions are efficiently sampled using a proximal MCMC method. The proposed algorithms are validated on both single-coil and multi-coil datasets using various k-space sub-sampling patterns and ratios. The results demonstrate the superior performance of each proposed approach in reconstructing images compared to its counterpart optimisation-based method. Moreover, our framework effectively quantifies uncertainty, showing a notable correlation between estimated uncertainty maps and error maps computed using ground truth and reconstructed images, compared with existing deep learning-based methods.

URL PDF HTML ☆

赞 0 踩 0

2606.17267 2026-06-17 stat.ME econ.EM math.NA stat.AP stat.ML 新提交

扩展特征分配模型

Mario Beraha, Federico Camerlenghi, Lorenzo Ghilotti

AI总结提出统一贝叶斯框架，联合建模特征标签和比例，克服标准模型忽略标签依赖性的局限，并引入Cox过程和行列式点过程先验，在基因组变异和森林调查中展示有效性。

详情

AI中文摘要

特征分配模型是贝叶斯非参数工具，适用于每个观测可同时展现多个特征的数据。标准公式的一个基本限制是假设特征标签独立同分布，因此在后验推断中不起作用。本文引入了一个统一的贝叶斯框架用于扩展特征分配模型，其中特征标签和比例被联合建模，从而能够同时发现特征并学习标签之间的依赖关系。基于点过程理论，我们开发了这些模型的完整贝叶斯分析。在这个一般设置中，我们还刻画了先前提出的先验会导致较差的预测分布，这些分布无法捕捉标签依赖性，并且对观测到的频率谱不敏感。我们的方法旨在通过利用特征标签携带的信息，超越这些标准公式。我们通过引入以下内容展示了我们方法的有用性：(i) 一个Cox过程先验，用于聚类基因组变异嵌入，同时预测新变异和新变异簇；(ii) 一个行列式点过程先验，用于重复森林调查，其中预测涉及未观测树木的数量和位置。

英文摘要

Feature allocation models are Bayesian nonparametric tools tailored to data in which each observation can simultaneously exhibit multiple characteristics, or features. A fundamental limitation of standard formulations is that feature labels are assumed to be independent and identically distributed, and therefore play no role in posterior inference. The present paper introduces a unified Bayesian framework for extended feature allocation models, in which feature labels and proportions are modeled jointly, thereby enabling the simultaneous discovery of features and learning of dependencies among their labels. Building on point process theory, we develop a full Bayesian analysis of these models. Within this general setting, we also characterize previously proposed priors as those leading to poor predictive distributions, which cannot capture label dependencies and are insensitive to the observed frequency spectrum. Our methodology is designed to move beyond such standard formulations by leveraging the information carried by feature labels. We demonstrate the usefulness of our approach by introducing: (i) a Cox process prior that clusters genomic variant embeddings while predicting new variants and new variant clusters; (ii) a determinantal point process prior for repeated forest surveys, where prediction concerns both the number and the locations of unobserved trees.

URL PDF HTML ☆

赞 0 踩 0

2412.08895 2026-06-17 eess.SP stat.AP stat.CO 版本更新

Fully Bayesian Wideband Direction-of-Arrival Estimation and Detection via RJMCMC

基于RJMCMC的全贝叶斯宽带波达方向估计与检测

Kyurae Kim, Philip T. Clemson, James P. Reilly, Jason F. Ralph, Simon Maskell

AI总结提出一种宽带信号模型，通过循环卷积和频域稀疏矩阵分解，将边际似然计算复杂度从O(N^3 k^3)降至O(N k^3)，结合非可逆RJMCMC实现全贝叶斯源数检测与DOA估计。

详情

AI中文摘要

考虑一个阵列接收来自未知数量$k$个源的未知宽带信号。宽带信号可占据任意宽的带宽，使得基于解调的方法不适用，这在涉及声学信号的场景中很常见。本文旨在根据$N$个含噪阵列测量值确定$k$，这一任务称为“检测问题”，贝叶斯模型比较是常用方法。为使贝叶斯推断可行，通常需要对源信号进行边际化。不幸的是，对于宽带信号，朴素边际化的时间复杂度为$\mathcal{O}(N^3 k^3)$，难以承受。因此，全贝叶斯信号检测尚未在宽带设置中得到验证。本文提出一种宽带信号模型，允许计算上可处理的源信号边际化。我们从线性时不变（LTI）信号传播的规范模型出发，将其增强为循环卷积，且不失一般性。这允许在频域中进行高效计算，所得线性系统可分解为一个稀疏矩阵，我们称之为\textit{条带矩阵分解}。利用这种稀疏模式，可将计算边际似然的时间复杂度降至$\mathcal{O}(N k^3)$。这些计算改进使得通过可逆跳跃马尔可夫链蒙特卡洛（RJMCMC）进行高效后验推断成为可能。本文使用RJMCMC的非可逆扩展（NRJMCMC），它通常比RJMCMC具有更低的自相关性和更快的收敛速度。然后，可以使用NRJMCMC抽取的样本以全贝叶斯方式检测潜在源信号。我们通过与广义似然比检验（GLRT）和信息准则进行比较来评估我们的方法。

英文摘要

Consider an array receiving unknown wideband signals from an unknown number of sources $k$. Wideband signals can occupy arbitrarily wide bandwidths, rendering demodulation-based approaches inapplicable, a common situation in settings involving acoustic signals. Here, we aim to determine $k$ given $N$ noisy array-valued measurements, a task known as the "detection problem," for which Bayesian model comparison is a common approach. To render Bayesian inference tractable, it is typically necessary to marginalize the source signals. Unfortunately, for wideband signals, naive marginalization has an unaffordable time complexity of $\mathcal{O}(N^3 k^3)$. As a result, fully Bayesian signal detection has yet to be demonstrated in wideband settings. In this work, we propose a wideband signal model that allows for computationally tractable marginalization of the source signals. We begin from the canonical model of linear time-invariant (LTI) signal propagation, which is then augmented into a circular convolution, all without loss of generality. This allows for efficient computation in the frequency domain, where the resulting linear system admits a decomposition into a sparse matrix we refer to as a \textit{stripe matrix decomposition}. Exploiting this sparsity pattern reduces the time complexity of computing the marginal likelihood to $\mathcal{O}(N k^3)$. These computational improvements enable efficient posterior inference via reversible-jump Markov chain Monte Carlo (RJMCMC). In this work, we use the non-reversible extension of RJMCMC (NRJMCMC), which often achieves lower autocorrelation and faster convergence than RJMCMC. Detection of the latent source signals can then be performed in a fully Bayesian manner using samples drawn by NRJMCMC. We evaluate our procedure by comparing it against generalized likelihood ratio testing (GLRT) and information criteria.

URL PDF HTML ☆

赞 0 踩 0

2606.18197 2026-06-17 stat.AP stat.ME 新提交

FoundCause: 从观测数据中发现含隐混淆因子的因果关系

Patrick Blöbaum, Krishnakumar Balasubramanian, Shiva Prasad Kasiviswanathan

发表机构 * Amazon Web Services（亚马逊云服务）； Department of Statistics, University of California, Davis（加州大学戴维斯分校统计系）

AI总结提出FoundCause，一种基于合成数据训练的摊销因果发现模型，通过单次前向传递直接映射数据集到因果图，显式建模隐混淆因子，在15个真实数据集上优于11种非摊销和4种摊销方法。

Comments Download the model at this https URL (https://github.com/amazon-science/foundcause)

详情

AI中文摘要

从观测数据中发现因果关系仍然具有挑战性，因为需要在没有干预的情况下恢复有向结构和隐混淆因子。我们提出了FoundCause，一种完全在合成数据上训练的摊销因果发现模型，它通过单次前向传递直接将数据集映射到因果图。通过从大量模拟结构因果模型中学习，FoundCause捕获了可迁移的统计模式，这些模式泛化到单个数据集之外。该架构融合了因果发现的几个关键归纳偏置。它使用一个置换不变的Transformer编码器，通过交替关注样本和变量来联合建模跨变量依赖性和每个变量的分布。通过统计条件注意力注入来自经典非对称度量的成对统计特征，引导模型朝向已知的因果信号。一个分解的解码器将边的存在性与方向分离，而一个三角细化模块使得能够推理高阶因果模式，如链和碰撞器。此外，一个基于可学习隐令牌的专用混淆因子模块显式建模隐藏的共同原因，并且模型通过其掩码输入表示显式处理缺失数据。据我们所知，FoundCause是第一个显式建模隐混淆因子的摊销因果发现方法。FoundCause在15个真实数据集上优于11种经典非摊销方法（如PC、GES、NOTEARS风格优化）和4种摊销因果发现方法，相对于最强的非摊销方法，在$F_1$上提高了9.6%，在AUROC上提高了1.2%，结构汉明距离减少了18.9%，同时仅需单次前向传递即可完成推理。

英文摘要

Causal discovery from observational data remains challenging due to the need to recover directed structure and latent confounding without interventions. We propose FoundCause, an amortized causal discovery model trained entirely on synthetic data that maps datasets directly to causal graphs in a single forward pass. By learning from large collections of simulated structural causal models, FoundCause captures transferable statistical patterns that generalize beyond individual datasets. The architecture incorporates several key inductive biases for causal discovery. It uses a permutation-invariant transformer encoder with alternating attention over samples and variables to jointly model cross-variable dependence and per-variable distributions. Pairwise statistical features derived from classical asymmetry measures are injected through statistics-conditioned attention, guiding the model toward known causal signals. A factorized decoder separates edge existence from direction, while a triangular refinement module enables reasoning over higher-order causal motifs such as chains and colliders. In addition, a dedicated confounder module based on learnable latent tokens explicitly models hidden common causes, and the model explicitly handles missing data via its masked input representation. To our knowledge, FoundCause is the first amortized causal discovery approach to explicitly model latent confounding. FoundCause outperforms 11 classical non-amortized methods (e.g., PC, GES, NOTEARS-style optimization) and 4 amortized causal discovery methods on 15 real-world datasets, achieving +9.6% improvement in $F_1$, +1.2% in AUROC, and an 18.9% reduction in structural Hamming distance relative to the strongest non-amortized methods, while performing inference in a single forward pass.

URL PDF HTML ☆

赞 0 踩 0

2606.17790 2026-06-17 stat.AP cs.IT 新提交

Distributed Experimental Design: Bayes-optimal Fusion of Local Designs

分布式实验设计：局部设计的贝叶斯最优融合

Nagananda K G, Lav R. Varshney, Pramod K. Varshney

AI总结提出分布式贝叶斯实验设计的决策理论框架，推导贝叶斯最优融合规则，实现局部设计决策的全局最优融合，并通过数值实验验证其接近集中式性能。

Comments 12 pages, 4 figures

详情

AI中文摘要

我们为分布式贝叶斯实验设计开发了一个决策理论框架，其中局部代理使用期望信息增益评估候选实验，并将其局部设计决策传输到融合中心。与集中式贝叶斯设计不同（其中所有似然分量和信息增益值都可供单个规划者使用），分布式设置中的融合中心从压缩的局部建议中选择全局实验。我们推导了贝叶斯最优融合规则，该规则选择在给定观察到的局部设计决策条件下条件期望集中信息增益最大的实验。该规则在精神上类似于分布式检测中的最优融合规则，但存在根本差异，因为底层效用是期望信息增益，而导致的损失是信息增益遗憾而非分类错误。我们还建立了信息损失界限，并确定了仅决策融合规则渐近等价于集中式设计的条件。数值实验表明，贝叶斯最优融合紧密逼近集中式理想情况，而当少数站点携带不成比例的信息时，多数投票可能高度次优。

英文摘要

We develop a decision-theoretic framework for distributed Bayesian experimental design in which local agents evaluate candidate experiments using expected information gain and transmit their local design decisions to a fusion center. Unlike centralized Bayesian design, where all likelihood components and information-gain values are available to a single planner, the fusion center in the distributed setting chooses a global experiment from compressed local recommendations. We derive the Bayes-optimal fusion rule, which selects the experiment with largest conditional expected centralized information gain given the observed local design decisions. This rule is analogous in spirit to optimal fusion rules in distributed detection, but differs fundamentally because the underlying utility is expected information gain and the resulting loss is information-gain regret rather than classification error. We also establish information-loss bounds and identify conditions under which the decision-only fusion rule is asymptotically equivalent to the centralized design. Numerical experiments show that Bayes-optimal fusion closely approximates the centralized oracle, whereas majority voting can be highly suboptimal when a minority of sites carry disproportionate information.

URL PDF HTML ☆

赞 0 踩 0

2606.17777 2026-06-17 stat.ME math.ST stat.ML 新提交

On Response-Adaptive Targeting Strategies for Multi-Treatment Experiments

多处理实验中的响应自适应目标策略

Redouane Yagouti, Rémy Degenne, Emilie Kaufmann

AI总结提出统一框架αRTS，将两臂ERADE策略推广到多臂实验，证明渐近性质并引入强制探索变体解决稀疏目标问题。

详情

AI中文摘要

临床试验中的响应自适应随机化（RAR）旨在通过根据观察到的结果动态分配患者到治疗组来提高伦理和统计效率。虽然基于目标最优分配的RAR已在两臂设置中得到广泛研究，但其扩展到多处理实验（$K \geq 2$）在理论上仍然零散，大多数现有方法集中于特定算法或受限的目标分配。在本文中，我们引入了一个响应自适应目标的统一框架，即$\alpha$再平衡目标策略（$\alpha$RTS），它推广了Hu等人[2009]的ERADE两臂策略。我们证明了该族中的所有设计共享基本的渐近性质：强相合性、分配比例和处理效应估计量的渐近正态性以及渐近效率。为了解决稀疏目标情况（其中某些处理被渐近消除），我们进一步提出了带有强制探索的$\alpha$RTS，这是一种保证所有处理无限采样同时保持渐近保证的变体。广泛的模拟说明了$\alpha$RTS变体在三臂背景下的有限样本行为，特别强调了强制探索在稀疏设置中的关键作用。

英文摘要

Response-adaptive randomization (RAR) in clinical trials aims to improve ethical and statistical efficiency by dynamically allocating patients to treatments based on observed outcomes. While RAR based on a target optimal allocation have been extensively studied for two-arms settings, their extension to multi-treatment experiments ($K \geq 2$) remains theoretically fragmented, with most existing methods focusing on specific algorithms or restricted target allocations. In this paper, we introduce a unified framework for response-adaptive targeting, the $\alpha$-Rebalancing Targeting Strategies ($\alpha$RTS), which generalize the ERADE two-armed strategy of Hu et al. [2009]. We prove that all designs in this family share fundamental asymptotic properties: strong consistency, asymptotic normality of allocation proportions and treatment effect estimators, and asymptotic efficiency. To address sparse target regimes (where some treatments are asymptotically eliminated), we further propose $\alpha$RTS with Forced Exploration, a variant that guarantees infinite sampling for all treatments while preserving the asymptotic guarantees. Extensive simulations illustrate the finite-sample behavior of $\alpha$RTS variants in a 3-armed context, highlighting in particular the critical role of forced exploration in sparse settings.

URL PDF HTML ☆

赞 0 踩 0

2606.17600 2026-06-17 stat.ME math.ST stat.ML 新提交

基于高斯过程的工具变量和近端因果推断

Yuqi Zhang, Krikamol Muandet, Dino Sejdinovic, Edwin Fong, Siu Lun Chau

AI总结提出去条件高斯过程框架，用于存在未观测混杂时的因果推断，同时提供可靠的后验不确定性量化，并通过边际似然优化实现模型选择。

详情

AI中文摘要

工具变量（IV）和近端因果学习（Proxy）方法是在存在未观测混杂情况下进行因果推断的核心框架。尽管方法论上取得了重大进展，现有方法很少提供可靠的认知不确定性（EU）量化。我们通过一个去条件高斯过程（DGP）框架来解决这一差距，用于不确定性感知的因果学习。我们的公式将流行的核估计量恢复为后验均值，确保了预测精度，而后验方差则提供了有原则且校准良好的EU。此外，概率结构通过边际对数似然优化实现了系统的模型选择。实证结果表明，通过经验覆盖频率和决策感知的准确率拒绝曲线评估，该方法在提供信息丰富的EU量化的同时，表现出强大的预测性能。总之，我们的方法为存在未观测混杂情况下的因果推断提供了一个统一、实用的解决方案，并具有可靠的不确定性。

英文摘要

Instrumental variable (IV) and proximal causal learning (Proxy) methods are central frameworks for causal inference in the presence of unobserved confounding. Despite substantial methodological advances, existing approaches rarely provide reliable epistemic uncertainty (EU) quantification. We address this gap through a Deconditional Gaussian Process (DGP) framework for uncertainty-aware causal learning. Our formulation recovers popular kernel estimators as the posterior mean, ensuring predictive precision, while the posterior variance yields principled and well-calibrated EU. Moreover, the probabilistic structure enables systematic model selection via marginal log-likelihood optimization. Empirical results demonstrate strong predictive performance alongside informative EU quantification, evaluated via empirical coverage frequencies and decision-aware accuracy rejection curves. Together, our approach provides a unified, practical solution for causal inference under unobserved confounding with reliable uncertainty.

URL PDF HTML ☆

赞 0 踩 0

2606.17121 2026-06-17 stat.AP cs.LG physics.flu-dyn 新提交

Regularized Machine Learning for System Identification of Ship Free-Running Manoeuvres from CFD-Based Synthetic Data: A Comparative Study

基于CFD合成数据的船舶自由航行操纵系统辨识的正则化机器学习：比较研究

R.F. Suárez, J.C. Berndt, M. Abdel-Maksoud

发表机构 * Hamburg University of Technology (TUHH)（汉堡技术大学）

AI总结本研究使用正则化回归方法从CFD生成的自由航行数据中辨识船舶水动力系数，重点评估了系数集大小、训练长度和操纵组合对模型性能的影响，发现Ridge回归在计算效率和预测精度间取得最佳平衡。

Comments 28 pages

详情

AI中文摘要

本研究探讨了从CFD生成的自由航行仿真数据中辨识船舶水动力系数的监督机器学习技术。具体而言，将普通最小二乘法和正则化回归方法应用于Abkowitz型操纵模型。训练和验证数据集来自Z形和回转操纵的URANS仿真，这些仿真已通过实验基准数据验证。分析评估了系数集大小、预测模型训练所需的最小训练长度以及操纵组合对模型性能的影响。结果表明，只要通过适当的系数选择、回归模型或输入数据变异性解决多重共线性问题，大角度Z形操纵适用于水动力系统辨识。较大的系数集为可变条件提供了更大的模型灵活性，但更容易出现多重共线性。正则化回归技术有效缓解了多重共线性，并显著提高了预测精度，而纳入更多样化的操纵数据同样如此。在测试的模型中，Ridge回归在计算效率和预测精度之间提供了最佳折衷。

英文摘要

This study investigates supervised machine learning techniques for identifying ship hydrodynamic coefficients from CFD-generated data from free-running simulations. Specifically, ordinary least squares and regularized regression methods are applied to Abkowitz-type manoeuvring models. Training and validation datasets are derived from URANS simulations of zig-zag and turning circle manoeuvres, which are validated against experimental benchmark data. The analysis evaluates the effects of coefficient set size, minimum training length required for predictive model training, and manoeuvre combinations on model performance. Results demonstrate the suitability of large-angle zig-zag manoeuvres for hydrodynamic system identification, provided that multicollinearity is addressed through appropriate coefficient selection, regression models, or input data variability. Larger coefficient sets offer greater model flexibility for variable conditions but are more prone to multicollinearity. Regularized regression techniques effectively mitigate multicollinearity and notably enhance prediction accuracy, as does incorporating more diverse manoeuvring data. Among tested models, Ridge regression provided the best compromise between computational efficiency and prediction accuracy.

URL PDF HTML ☆

赞 0 踩 0

2411.13763 2026-06-17 math.ST stat.ME stat.ML 版本更新

Active Subsampling for Measurement-Constrained M-Estimation of Individualized Thresholds with High-Dimensional Data

高维数据下个体化阈值测量受限M估计的主动子采样

Jingyi Duan, Lehao Fu, Yang Ning

AI总结针对测量受限问题，提出K步主动子采样算法，通过迭代采样最具信息量的观测并求解正则化M估计，实现高维线性阈值参数估计，并揭示条件密度光滑性导致的相变现象。

Comments Accepted to Annals of Statistics, 2026

详情

AI中文摘要

测量受限问题在现代应用如电子健康记录研究中频繁出现。在此类问题中，尽管有大量数据集可用，但收集标记数据可能非常昂贵或耗时，导致在给定预算内只能标记一小部分数据。这引发了一个关键问题：在预算约束下，哪些数据点最有益于标记？我们在测量受限M估计框架下研究估计最优个体化阈值的问题。具体地，我们的目标是估计连续变量$X$的线性阈值$\theta^TZ$中的高维参数$\theta$，使得$X$是否超过阈值$\theta^TZ$与二元结果$Y$之间的差异最小化。在测量受限设置中，我们提出了一种新颖的$K$步主动子采样算法来估计$\theta$，该算法迭代地采样数据集中最具信息量的观测，并求解正则化M估计量。我们的理论分析揭示了关于$\beta$（给定$Y$和$Z$时$X$的条件密度的光滑性）的尖锐相变现象。完整摘要请参见论文。

英文摘要

Measurement-constrained problems frequently arise in modern applications such as electronic health record studies. In such problems, despite the availability of large datasets, collecting labeled data can be highly costly or time-consuming, allowing only a small portion of the data to be labeled within a given budget. This raises a critical question: which data points are most beneficial to label given the budget constraint? We study this question in the context of estimating an optimal individualized threshold under a measurement-constrained M-estimation framework. In particular, our goal is to estimate a high-dimensional parameter $\theta$ in a linear threshold $\theta^TZ$ for a continuous variable $X$ such that the discrepancy between whether $X$ exceeds the threshold $\theta^TZ$ and a binary outcome $Y$ is minimized. In the measurement-constrained setting, we propose a novel $K$-step active subsampling algorithm to estimate $\theta$, which iteratively samples the most informative observations in the dataset and solves a regularized M-estimator. Our theoretical analysis reveals a sharp phase transition phenomenon with respect to $\beta$, the smoothness of the conditional density of $X$ given $Y$ and $Z$. Please see the paper for the full abstract.

URL PDF HTML ☆

赞 0 踩 0

2606.18078 2026-06-17 stat.ME 新提交

Spatial prediction of environmental processes using random forests: How best to account for spatial dependence?

使用随机森林对环境过程进行空间预测：如何最好地考虑空间依赖性？

Duncan Lee, Vinny Davies, Helen R. Savage, Hussein Twabi, Marriott Nliwasa, Peter MacPherson

AI总结本文比较了随机森林融合空间依赖性的多种方法，通过模拟和空气污染数据实验，发现空间基函数方法表现一致良好。

详情

AI中文摘要

环境过程的地统计空间预测通常通过克里金法使用高斯过程模型进行，而机器学习算法是非空间预测的最先进技术。最近这些思想的融合令人兴奋，使传统机器学习算法具备了处理空间自相关的能力，从而提高了预测性能。已经提出了多种方法，包括与高斯过程的融合、观测驱动的相关结构、空间基函数和局部地理拟合。然而，尚未对其相对预测性能进行数值比较，而这对于指导环境科学家选择最优方法至关重要。本文填补了这一知识空白，并专注于随机森林作为机器学习算法，因为它们在计算和概念上比深度学习算法更易于实现。本文展示了两项研究的结果，第一项是受控模拟实验，研究是否有任何单一方法在不同空间自相关类型中始终表现优越。第二项研究关注马拉维布兰太尔市一项结核病患病率研究中空气污染浓度的预测。结果表明，虽然没有单一方法普遍优越，但使用空间基函数在模拟和真实数据研究中均表现一致良好。

英文摘要

Geostatistical spatial prediction for environmental processes is typically undertaken using Gaussian process models via Kriging, while machine learning (ML) algorithms are state-of-the-art for non-spatial prediction. An exciting recent fusion of these ideas imbibes traditional ML algorithms with the capacity to deal with spatial autocorrelation, leading to improved predictive performance. A range of approaches have been proposed, including fusion with Gaussian processes, observation-driven correlation structures, spatial basis functions and local geographical fitting. However, there has been no numerical comparison of their relative predictive performances, which is needed to advise environmental scientists on the optimal approach to use. This paper fills this knowledge gap, and focuses on random forests as the ML algorithm because they are more computationally and conceptually straightforward to implement than deep learning algorithms. The results from two studies are presented, the first being a controlled simulation experiment investigating whether any single approach is consistently superior across different spatial autocorrelation types. The second study focuses on the prediction of air pollution concentrations within a tuberculosis prevalence study in Blantyre, Malawi. The results show that whilst no single approach is universally superior, utilising spatial basis functions appears to perform consistently well across both the simulation and real data studies.

URL PDF HTML ☆

赞 0 踩 0

2606.18044 2026-06-17 stat.AP 新提交

Model-based clustering of compositional trajectories for the analysis of mobility data

基于模型的成分轨迹聚类用于移动数据分析

Andrea Panarotto, Manuela Cattelan, Ruggero Bellio

AI总结提出一种基于状态空间模型的成分时间序列聚类方法，将电话数据中的移动轨迹表示为道路类型比例，以识别城市移动模式。

Comments 36 pages (26 for the main text, 10 in the supplementary), 13 figures (6 in the main text, 7 in the supplementary)

详情

AI中文摘要

理解城市移动模式对于设计高效且可持续的交通系统至关重要。受帕多瓦市及其周边地区应用的启发，我们提出了一种新颖的统计框架，用于分析和聚类源自电话数据的移动轨迹。我们引入了个体移动的成分表示，该表示将不确定的设备位置与周围道路网络的信息相结合，在每个时间点编码与观测位置兼容的不同道路类型的比例。这种表述自然地考虑了测量不确定性，并产生了在单纯形中演化的轨迹。为了对这些数据进行建模，我们开发了一个用于成分时间序列的状态空间框架，该框架同时捕捉电话测量误差和潜在移动过程的时间动态。基于这一表示，我们提出了一种基于模型的聚类方法，该方法基于状态空间模型的混合，以识别具有相似演化轨迹的组。这使我们能够将个体移动聚合成在人口层面上可解释的移动模式。案例研究的结果表明，该方法能够揭示有意义的移动行为，为政策制定者提供潜在相关的见解。

英文摘要

Understanding urban mobility patterns is crucial for designing efficient and sustainable transportation systems. Motivated by an application to the municipality of Padova and its surroundings, we propose a novel statistical framework for the analysis and clustering of mobility trajectories derived from telephonic data. We introduce a compositional representation of individual movements that integrates the uncertain device location with information on the surrounding road network, encoding at each time point the proportions of different road types compatible with the observed position. This formulation naturally accounts for measurement uncertainty and yields trajectories evolving in the simplex. To model these data, we develop a state-space framework for compositional time series that captures both the telephonic measurement error and the temporal dynamics of the latent mobility process. Building on this representation, we propose a model-based clustering approach based on mixtures of state-space models to identify groups of trajectories with similar evolution. This allows us to aggregate individual movements into interpretable mobility patterns at the population level. The results of the case study demonstrate the ability of the approach to uncover meaningful mobility behaviors, providing insights that are potentially relevant to policy makers.

URL PDF HTML ☆

赞 0 踩 0

2606.17939 2026-06-17 stat.AP stat.ML 新提交

Understanding Long-Term Dynamics of Individual Metro Usage: A Hidden Semi-Markov State Framework with Survival Analysis

理解个体地铁使用的长期动态：基于生存分析的隐半马尔可夫状态框架

Bingxun Wang, Valeria Maria Urbano, Shan He, Yang Chen, Wei Liu, Zhibin Jiang, Piercesare Secchi

AI总结提出融合隐半马尔可夫模型与离散时间生存分析的框架，利用上海地铁四年刷卡数据识别五种可解释的出行状态及其转移层次，揭示退出风险与状态相关但独立于时长，而重返风险随不活跃时长急剧衰减。

详情

AI中文摘要

理解个体地铁使用在多年时间尺度上的演化对于交通规划和乘客留存至关重要。然而，现有方法通常将移动模式表征为静态聚类或短期变化，忽略了交通参与的生命周期动态。本研究提出一个基于状态的生命周期建模框架，将隐半马尔可夫模型（HSMM）与离散时间生存分析相结合，以刻画个体地铁移动性的演化。HSMM推断具有显式持续时间分布的潜在移动状态以及控制状态变迁的转移矩阵，而生存组件通过依赖于移动状态轨迹和行为历史的状态相关风险函数，对退出和重新进入事件进行建模。将该框架应用于上海地铁系统四年（2021-2024）的智能卡数据，能够识别可解释的移动状态，刻画转移动态，并量化状态依赖的退出和重新进入过程。分析揭示了五种稳健的移动状态，具有以偶尔使用网关状态为中心的方向性转移层次，以及控制脱离和回归的根本不同的时间机制：退出风险与状态相关但与持续时间无关，而重新进入风险随不活跃时长急剧衰减。这些发现为面向生命周期的移动性分析提供了方法论基础，并为交通运营商识别风险用户和安排留存干预提供了实践指导。

英文摘要

Understanding how individual metro usage evolves over multi-year horizons is essential for transit planning and passenger retention. However, existing approaches typically characterize mobility patterns as static clusters or short-term variability, leaving the lifecycle dynamics of transit participation underexplored. This study proposes a state-based lifecycle modeling framework that integrates Hidden Semi-Markov Models (HSMM) with discrete-time survival analysis to characterize the evolution of individual metro mobility. The HSMM infers latent mobility states with explicit duration distributions and a transition matrix governing regime changes, while the survival component models exit and re-entry events via state-dependent hazard functions conditioned on mobility-state trajectories and behavioral history. Applied to four years of smart card data from the Shanghai metro system (2021-2024), the framework enables the identification of interpretable mobility states, the characterization of transition dynamics, and the quantification of state-dependent exit and re-entry processes. The analysis reveals five robust mobility states with a directional transition hierarchy centered on an occasional-usage gateway state, and fundamentally different temporal mechanisms governing disengagement and return: exit hazard is state-dependent but duration-independent, whereas re-entry hazard decays sharply with inactivity length. These findings provide a methodological foundation for lifecycle-oriented mobility analysis and practical guidance for transit operators to identify at-risk users and time retention interventions.

URL PDF HTML ☆

赞 0 踩 0

2606.17717 2026-06-17 stat.ME stat.AP 新提交

Double zero-inflated spatio-temporal modeling of daily precipitation under detection thresholds

检测阈值下日降水量的双零膨胀时空建模

Juan Marcen-Gutierrez, Jorge Castillo-Mateo, Alan E. Gelfand, Jesús Asín, Ana C. Cebrián

AI总结针对日降水量中两种零值（无降水事件和低于检测限的未测量降水）问题，提出结合Probit回归、Gamma回归和阈值截断观测机制的多层时空模型，并应用高斯过程捕捉空间依赖，在贝叶斯框架下实现精确推断。

Comments 38 pages (+33 pages supplement), 7 figures (+35 figures supplement), 5 tables

详情

AI中文摘要

解释日尺度降水行为对于精细理解降水驱动机制至关重要。然而，由于零值的频繁出现，这一工作具有挑战性。两种类型的零值——作为干旱事件的无降水和由于检测限导致的未测量降水——的公认存在加剧了这一挑战。在这项工作中，我们提出了一个多层时空模型，该模型允许我们区分和解释两种类型的零值，并对高于检测限的正降水进行建模。该方法结合了通过Probit回归建模概率的零处点质量、潜在正降水量的Gamma回归以及受阈值截断影响的观测机制。为了捕捉空间依赖性，在每个回归模型中采用了高斯过程。在贝叶斯框架下工作，我们可以获得具有精确不确定性的丰富推断范围。特别是，我们提供了基于模型的推断工具，以比较和量化真实降水过程与其观测对应物在相关特征上的差异。我们将模型应用于西班牙东北部埃布罗河流域70个站点15年间的春季日观测数据分析。我们的发现表明，阈值强烈影响观测降水的发生，特别是在湿润地区。虽然其对总累积量的影响较小，但它可能对上分位数产生显著影响。

英文摘要

Explaining precipitation behavior at daily scale is important for fine scale understanding of the mechanisms driving precipitation. However, this effort is challenging because of the frequent incidence of zeros. The challenge is amplified by the acknowledged incidence of two types of zeros -- absence of precipitation as a dry event and absence of measured precipitation due to detection limits. In this work, we propose a multilevel spatio-temporal model which allows us to distinguish and explain the two types of zeros, as well as to model positive precipitation above the detection limit. The methodology combines a point mass at zero with probability modeled through a probit regression, a Gamma regression for latent positive precipitation amounts, and an observation mechanism subject to threshold-induced censoring. To capture spatial dependencies, Gaussian processes are employed in each regression model. Working within a Bayesian framework, we can obtain a rich range of inference with exact uncertainty. In particular, we provide model-based inference tools to compare and quantify differences between the true precipitation process and its observed counterpart across relevant characteristics. We apply our model to the analysis of daily spring observations at 70 sites over 15 years from the Ebro River Basin in northeastern Spain. Our findings indicate that the threshold strongly affects the occurrence of observed precipitation, especially in humid regions. While its impact on total accumulated amounts is small, it can exert a relevant effect on upper quantiles.

URL PDF HTML ☆

赞 0 踩 0

2606.17369 2026-06-17 math.ST stat.ME 新提交

Inference Optimal Long Run Variance Estimation with Lugsail Kernels

使用Lugsail核的推断最优长程方差估计

Rebecca P. Kurtz-Garcia, James M. Flegal

AI总结针对具有平稳序列依赖的数据，提出基于非标准固定平滑极限分布的Lugsail估计器最优带宽规则，改善偏差校正并优化推断。

2606.17530 2026-06-17 physics.soc-ph cs.LG econ.GN stat.AP 新提交

Public transit gains and spatially uneven travel demand changes after NYC congestion pricing

纽约市拥堵收费后公共交通增益与空间不均的出行需求变化

Donghang Li, Dingyi Zhuang, Yunlin Li, Chenan Shen, Nina Cao, Yunhan Zheng, Shenhao Wang, Jinhua Zhao

发表机构 * Department of Civil and Environmental Engineering, Massachusetts Institute of Technology（麻省理工学院土木与环境工程系）； Department of Urban Studies and Planning, Massachusetts Institute of Technology（麻省理工学院城市研究与规划系）； Mathematical Institute, University of Oxford（牛津大学数学院）； Department of Mechanical Engineering, Massachusetts Institute of Technology（麻省理工学院机械工程系）； College of Urban and Environmental Sciences, Peking University（北京大学城市与环境科学学院）； Department of Urban and Regional Planning, University of Florida（佛罗里达大学城市与区域规划系）； Center for Computational Science and Engineering, Massachusetts Institute of Technology（麻省理工学院计算科学与工程中心）

AI总结利用时间序列基础模型生成概率反事实预测，评估纽约市2025年实施的拥堵收费政策，发现公交和地铁客流量显著增加，但总体出行需求略有下降，且影响存在空间异质性。

详情

AI中文摘要

纽约市于2025年1月实施了全国首个基于区域的拥堵收费计划，为评估全系统城市出行如何响应大规模定价干预提供了机会。由于此类政策会在不同交通方式和区域间产生溢出效应，因此难以构建可信的控制组。我们利用时间序列基础模型生成具有校准不确定性的概率反事实需求预测，以应对这一挑战。将该框架应用于公交、地铁和总出行量数据，我们发现，与预期无政策需求相比，政策实施后公交和地铁客流量显著增加，而总体出行需求略有下降。影响存在空间异质性：总体出行需求的减少集中在拥堵缓解区内，而公共交通的增益则延伸至曼哈顿核心区以外。社会人口分析进一步揭示了不同社区之间的适应差异，凸显了空间公平性问题。我们的框架为在缺乏干净控制组的情况下，对全系统城市干预进行不确定性感知评估提供了一种可扩展的方法。

英文摘要

New York City implemented the nation's first cordon-based congestion pricing program in January 2025, providing an opportunity to evaluate how system-wide urban mobility responds to large-scale pricing interventions. Because such policies generate spillovers across modes and locations, credible control groups are difficult to construct. We address this challenge using time series foundation models to generate probabilistic counterfactual demand forecasts with calibrated uncertainty. Applying this framework to bus, subway, and aggregate trip volume data, we find that post-policy bus and subway ridership increased significantly relative to expected no-policy demand, while overall travel demand decreased modestly. The effects are spatially heterogeneous: while reductions in overall travel demand are concentrated within the Congestion Relief Zone, transit gains extend beyond Manhattan's core. Socio-demographic analyses further reveal uneven adaptation across neighborhoods, highlighting spatial equity implications. Our framework provides a scalable approach for the uncertainty-aware evaluation of system-wide urban interventions when clean control groups are unavailable.

URL PDF HTML ☆

赞 0 踩 0

2606.12097 2026-06-17 stat.AP physics.data-an 新提交

Weibull-Stationary Stochastic Differential Equations for Conditional Long-Horizon Wind Power Forecasting

条件长期风电预测的威布尔平稳随机微分方程

Luca Di Persio, Mehrdad Ghadiri

AI总结提出一种基于威布尔平稳SDE的月度风电概率预测框架，通过异方差卡尔曼滤波和三种SDE模型实现高分辨率预测，CRPS约1.57 m/s，功率Wasserstein距离低于额定容量1.4%。

详情

AI中文摘要

我们提出了一个以十分钟分辨率进行一个月前风电预测的条件概率框架。从序列相关的SCADA风速数据中估计月度威布尔形状和尺度参数，通过Godambe协方差修正，并使用异方差卡尔曼滤波在双变量VAR(1)状态空间模型上进行预测。以MMSE预测的威布尔不变律为条件，我们构建并比较了三种正风速SDE模型：Ornstein-Uhlenbeck-Weibull变换、Fokker-Planck漂移优先扩散和Fokker-Planck扩散优先模型。模拟的风速集合通过校准的XGBoost功率曲线映射到功率。应用于Kelmarsh风电场Senvion MM92涡轮机2021年1月的数据，三种SDE公式在概率精度上统计上不可区分，平均CRPS值在1.569至1.575 m/s之间。因此，扩散优先模型在计算上更优，运行时间相对于OU-Weibull模型减少了约七倍。在功率域中，模拟与观测分布之间的Wasserstein距离为26.1-27.6 kW，低于额定容量的1.4%，而所检查月份的月能量产出偏差约为-7.3%。在0-1500 kW范围内，超越概率误差保持在1.6个百分点以下，在额定功率附近约为2.2个百分点。这些量为下游运行问题提供了决策相关的概率输入，而非完成的备用、储能、市场或疲劳优化决策。完全边缘化卡尔曼预测律下的威布尔参数是一个自然的扩展。

英文摘要

We present a one-month-ahead conditional probabilistic framework for wind-power forecasting at ten-minute resolution. Monthly Weibull shape and scale parameters are estimated from serially dependent SCADA wind-speed data, corrected through a Godambe covariance, and forecast by a heteroskedastic Kalman filter on a bivariate VAR(1) state-space model. Conditional on the MMSE forecasted Weibull invariant law, we construct and compare three positive wind-speed SDE models: an Ornstein-Uhlenbeck-Weibull transform, a Fokker-Planck drift-first diffusion, and a Fokker-Planck diffusion-first model. The simulated wind-speed ensembles are mapped to power through a calibrated XGBoost power curve. Applied to January 2021 data from a Senvion MM92 turbine at Kelmarsh Wind Farm, the three SDE formulations are statistically indistinguishable in probabilistic accuracy, with mean CRPS values between 1.569 and 1.575 m/s. The diffusion-first model is therefore preferred on computational grounds, reducing runtime by about a factor of seven relative to the OU-Weibull model. In the power domain, the Wasserstein distance between simulated and observed distributions is 26.1-27.6 kW, below $1.4\%$ of rated capacity, while the monthly energy-yield bias is about $-7.3\%$ for the examined month. Exceedance-probability errors remain below 1.6 percentage points over the 0-1500 kW range and about 2.2 percentage points near rated power. These quantities provide decision-relevant probabilistic inputs for downstream operational problems, rather than completed reserve, storage, market, or fatigue-optimization decisions. Full marginalisation over the Kalman predictive law of the Weibull parameters is left as a natural extension.

URL PDF HTML ☆

赞 0 踩 0

2412.00607 2026-06-17 stat.ME q-fin.RM 版本更新

On a risk model with tree-structured Poisson Markov random field frequency, with application to rainfall events

基于树结构泊松马尔可夫随机场频率的风险模型及其在降雨事件中的应用

Hélène Cossette, Benjamin Côté, Alexandre Dubeau, Etienne Marceau

AI总结提出一种树结构泊松马尔可夫随机场模型来刻画组合风险中的频率相依性，研究无限增长树上的渐近风险，并在极端降雨数据上验证了模型灵活性和可扩展性。

Comments 40 pages

2506.00561 2026-06-17 stat.AP stat.ME 版本更新

Mortality Forecasting under Climate Risk: A Stochastic Approach with Distributed Lag Non-Linear Models

气候风险下的死亡率预测：基于分布滞后非线性模型的随机方法

Jiacheng Min, Han Li, Thomas Nagler, Shuanming Li

AI总结提出将分布滞后非线性模型融入随机死亡率模型，通过新回拟合算法分离气候与非气候风险，在三个欧洲区域验证了短期预测优势，并利用未来气候数据预测至2045年的死亡率变化。

Comments 25 pages, 10 figures, and 2 tables

详情

AI中文摘要

评估气候驱动的死亡风险近几十年来已成为一个新兴研究领域。本文提出一种新方法，将气候驱动效应明确纳入单种群和多种群随机死亡率模型。新模型由两部分组成：随机死亡率模型和分布滞后非线性模型（DLNM）。随机部分捕捉死亡率中非气候的长期趋势、波动性和季节模式。DLNM部分捕捉气候变量对死亡率的非线性和滞后效应，以及热浪和寒潮对不同年龄组的影响。对于模型校准，我们提出一种新的回拟合算法，能够将气候驱动的死亡风险与非气候驱动的随机死亡风险分离开来。我们利用来自三个欧洲地区（雅典、里斯本和罗马）的数据，展示了我们的模型相对于四种替代模型的有效性和改进的短期（1-18个月）预测性能。此外，作为所提出建模框架的一个应用，我们利用气候模型生成的未来UTCI数据，在两种代表性浓度路径（RCP）情景下，同时考虑随机死亡率改善趋势和气候风险，提供了这些地区到2045年的总死亡率预测。预测显示，随着UTCI随时间普遍升高，冬季死亡率显著下降，而夏季死亡率上升。尽管我们预计在RCP8.5情景下短期总死亡率略低于RCP2.6，但在RCP8.5情景下，长期总死亡率预计将增加。

英文摘要

Assessing climate-driven mortality risk has become an emerging area of research in recent decades. In this paper, we propose a novel approach to explicitly incorporate climate-driven effects into both single- and multi-population stochastic mortality models. The new model consists of two components: a stochastic mortality model, and a distributed lag non-linear model (DLNM). The stochastic component captures the non-climate long-term trend, volatility, and seasonal patterns in mortality rates. The DLNM component captures non-linear and lagged effects of climate variables on mortality, as well as the impact of heat waves and cold waves across different age groups. For model calibration, we propose a novel backfitting algorithm that allows us to disentangle the climate-driven mortality risk from the non-climate-driven stochastic mortality risk. We illustrate the effectiveness and improved short-term (1--18 month) forecasting performance of our model against four alternative models, using data from three European regions: Athens, Lisbon, and Rome. Furthermore, as an application of the proposed modeling framework, we utilize future UTCI data generated from climate models to provide total mortality forecasts into 2045 across these regions under two Representative Concentration Pathway (RCP) scenarios, taking both stochastic mortality improvement trend and climate risk into account. The projections show a noticeable decrease in winter mortality alongside a rise in summer mortality, driven by a general increase in UTCI over time. Although we expect slightly lower overall mortality in the short term under RCP8.5 compared to RCP2.6, a long-term increase in total mortality is anticipated under the RCP8.5 scenario.

URL PDF HTML ☆

赞 0 踩 0

2606.17486 2026-06-17 stat.ME stat.CO 新提交

嵌套采样：一个批判性且全面的理论指南

Luca Martino, Fernando Llorente

AI总结本文全面详细地阐述了嵌套采样（NS）的推导过程，澄清其理论基础和实际挑战，旨在为新手提供教程，为经验丰富的从业者提供批判性回顾。

详情

AI中文摘要

嵌套采样（NS）技术因其能够高效探索高似然区域而受到广泛关注，尤其是在宇宙学和天文学领域——这一特性类似于隐式似然优化，是其成功的基础。虽然NS的完整理论推导复杂且涉及多个近似，但核心挑战在于从似然约束先验中进行采样，这对性能至关重要。本文提供了NS推导的全面详细阐述，澄清了其理论基础和实际挑战。我们详细描述了NS过程，强调了其优势及潜在局限性。通过这样做，本文旨在加深对该方法的理解，并促进未来在广泛科学应用中改进、新变体和更高效实现的发展。因此，本文的主要贡献是双重的：既作为该领域新手的教程，又作为经验丰富的从业者的批判性回顾。

英文摘要

The nested sampling (NS) technique has gained widespread attention, particularly in cosmology and astronomy, due to its ability to efficiently explore high-likelihood regions - a feature akin to an implicit likelihood optimization that underlies its success. While the full theoretical derivation of NS is complex and involves several approximations, the central challenge lies in sampling from the likelihood-constrained priors, which is crucial for its performance. This work provides a comprehensive and detailed exposition of NS derivation, clarifying both its theoretical foundations and practical challenges. We provide a thorough description of the NS procedure, emphasizing both its strengths and potential limitations. In doing so, this work seeks to deepen understanding of the method and to foster the development of future enhancements, novel variants, and more efficient implementations across a wide range of scientific applications. Thus, the main contribution of this work is twofold: it serves both as a tutorial for newcomers to the field and as a critical review for experienced practitioners.

URL PDF HTML ☆

赞 0 踩 0

2606.13827 2026-06-17 math.NA cs.LG stat.ML 新提交

Approximating Gaussian Whittle-Matern Fields over Well-Centered Triangulations of Riemannian Manifolds

离散流形上的Whittle-Matérn场逼近

Srinivas Nambirajan

AI总结提出一种基于离散外微分的GMRF逼近方法，统一处理Whittle-Matérn场族，支持推断参数，兼容点/分段平滑测量，计算独立于插值函数，并给出低秩近似用于压缩感知。

Comments More specific title, updated acknowledgement, minor typos fixed

详情

AI中文摘要

马尔可夫Whittle-Matérn场已通过稀疏精度矩阵的高斯马尔可夫随机场（GMRF）收敛逼近，使用两参数族SPDE的有限元近似：\$ (\kappa^2 - \Delta)^{\alpha/2} u = \mathcal{W}, \\;\\; \kappa \in \mathbb{R}, \\; \alpha \in \mathbb{N} \$。利用离散外微积分（DEC）分析的最新进展，我们提出了一种不同但密切相关的收敛GMRF逼近方法，适用于离散化为良好中心单纯复形的完备无边黎曼流形上的Matérn场。该收敛方法：(i) 对\$\alpha, \kappa\$不可知，从而允许对整个\$(\alpha, \kappa)\$族GMRF的精度和协方差矩阵进行通用逼近方案，因此它们可以被推断而非猜测。(ii) 固有地模拟随机场的逐点和分段平滑测量，并对两者同样好地逼近。(iii) 计算上与所用插值函数无关——如果将一种收敛插值替换为同一网格上的另一种合适插值，不会产生额外开销。此外，我们证明，在精确意义上良好连接且体积集中的离散化上，精度矩阵是图拉普拉斯的谱函数。我们为该族Matérn GMRF提供了一个低秩逼近器，并提及一个用例：通过压缩感知减少建模GMRF所需的测量数量。

英文摘要

Markovian Whittle-Matérn fields have been convergently approximated by discrete Gauss Markov Random Fields (GMRFs) with sparse precision matrices using a Finite Element approximation of the two-parameter family, \[ (\kappa^2 - \Delta)^{\alpha/2} u = \mathcal{W}, \;\; \kappa \in \mathbb{R}, \; \alpha \in \mathbb{N}. \] of SPDEs. Using recent developements in the analysis of Discrete Exterior Calculus (DEC), we present a different, yet closely related, convergent GMRF approximation to these Matérn fields over complete, boundaryless Riemannian manifolds discretized as well-centered simplicial complexes. This convergent method (i) is agnostic to $\alpha, \kappa$ and thus allows a universal approximation scheme for the precision and covariance matrices of the entire $(\alpha, \kappa)$-family of GMRFs, so they may be inferred rather than guessed. (ii) inherently models pointwise and piecewise-smoothed measurements of a random field and approximates both equally well (iii) is computationally independent of the interpolants used - it suffers no overhead if one convergent interpolant were replaced with another suitable interpolant over the same mesh. Furthermore, we show that, on discretizations that are well-connected in a precise sense, and volume-concentrated, the precision matrices are spectral functions of a graph-laplacian. We provide a low rank approximator to the family of such Matérn GMRFs and mention a use case: reducing the number of measurements needed to model the GMRF by compressed-sensing.

URL PDF HTML ☆

赞 0 踩 0

2606.09049 2026-06-17 stat.ME cs.LG math.ST stat.ML 新提交

Data augmented bootstrap: Unifying confidence interval construction by approximate invariance

数据增强自助法：通过近似不变性统一置信区间构建

Kevin Han Huang

AI总结提出数据增强自助法（DAB），利用数据的近似不变性构建置信区间，统一了经典自助法、共形预测等方法的理论，并引入数据增强启发式方法。

Comments Added comparison with arXiv:2604.15229 (https://arxiv.org/abs/2604.15229)

详情

AI中文摘要

我们提出了数据增强自助法（DAB），这是一个通过数据的近似不变变换来构建置信区间的框架。作为特例，DAB 恢复了依赖于精确群对称性的流行方法，例如共形预测、最大均值差异 U-统计量的 wild bootstrap 以及最近提出的 SymmPI。同时，DAB 也恢复了经典的自助法，该方法利用了随着数据集大小增长，数据索引均匀采样下数据集的近似不变性。对于所有 DAB 方法，我们建立了理论覆盖结果，这些结果根据不变性的强度在有限样本和渐近保证之间插值，且不假设群结构。近似不变性通过 Kolmogorov 距离度量，并且对于满足高斯普适性的统计量，简化为条件均值和方差匹配。这使我们能够将数据增强（DA）——一种基于近似不变性的广泛使用的机器学习启发式方法——纳入已知的统计方法中。我们通过实验测试了将 DA 纳入自助法、wild bootstrap 和共形预测在模拟设置以及图像、语言和科学数据上的性能。

英文摘要

We propose the data augmented bootstrap (DAB), a framework for constructing confidence intervals from approximately invariant transformations of the data. As special cases, DAB recovers popular methods that rely on exact group symmetries, such as conformal prediction, wild bootstrap for Maximum Mean Discrepancy U-statistics and the recently proposed SymmPI. Meanwhile, DAB also recovers the classical bootstrap method, which exploits the dataset's approximate invariance under uniform sampling of data indices as the dataset size grows. For all DAB methods, we establish theoretical coverage results that interpolate between finite-sample and asymptotic guarantees according to the strength of the invariance, and without assuming a group structure. The approximate invariance is measured in the Kolmogorov distance and, for statistics that satisfy Gaussian universality, reduces to conditional mean and variance matching. This allows us to incorporate data augmentation (DA), a widely used machine learning heuristic based on approximate invariances, into known statistical methods. We empirically test the performance of incorporating DA into bootstrap, wild bootstrap and conformal prediction for simulated settings as well as for image, language and scientific data.

URL PDF HTML ☆

赞 0 踩 0

2605.16900 2026-06-17 stat.ME math.ST 版本更新

Splitting schemes and estimators for stochastic differential equations with Hölder multiplicative noise

具有Hölder乘性噪声的随机微分方程的分裂方案和估计器

Bowen Fang, Dario Spanò, Massimiliano Tamborrino

AI总结本文研究了具有局部Lipschitz漂移和Hölder连续乘性扩散的单变量随机微分方程的参数估计问题，提出了一种基于数值分裂方案的首个显式伪似然估计器，该方案在强均方收敛性和状态空间保持性方面优于传统的欧拉-马尔蒂内斯离散化方法，并通过模拟验证了其在准确性和计算效率上的优越性。

Comments Additional simulation results. 56 pages, 14 figures, 2 tables

详情

AI中文摘要

基于张量的二阶因果发现

Nathan Ouyang, Kexin Wan, Anna Seigal

AI总结提出TSCD算法，利用观测和干预数据的协方差矩阵张量，在线性结构方程模型下识别有向无环图及其边函数，仅要求噪声不相关，并扩展到非线性模型，具有对数级干预可识别性。

Comments 27 pages, 7 figures. Code available at this https URL (https://github.com/QWE123665/Tensor-based-Second-order-Causal-Discovery)

详情

AI中文摘要

因果发现旨在揭示变量间的因果依赖关系。为此，我们提出了一种称为基于张量的二阶因果发现（TSCD）的算法。其输入是从观测数据和干预数据的协方差矩阵中得到的张量。假设因果依赖关系遵循有向无环图（DAG）上的线性结构方程模型，TSCD输出DAG及其边上的函数，仅要求噪声变量不相关。我们还实现了该方法在非线性模型中的版本。我们关注二阶统计量（通过协方差矩阵）的动机是：相对于高阶矩，它们在统计和计算上更高效；相对于一阶统计量，它们具有可识别性；并且无论变量是否为高斯分布，它们都适用。我们证明，TSCD从对数于变量数量的干预次数中可识别因果顺序和参数。实验表明，TSCD对噪声具有鲁棒性，与现有方法相比具有竞争力，并且可扩展到数百个变量。

英文摘要

Causal discovery seeks to uncover the causal dependencies among variables. For this purpose, we propose an algorithm called Tensor-based Second-order Causal Discovery (TSCD). Its input is a tensor obtained from the covariance matrices of observational and interventional data. Assuming the causal dependencies follow a linear structural equation model on a directed acyclic graph (DAG), TSCD outputs the DAG and the functions on its edges, requiring only that the noise variables are uncorrelated. We also implement a version of the approach for nonlinear models. Our focus on second-order statistics (via the covariance matrices) is motivated by their statistical and computational efficiency relative to higher-order moments, their identifiability relative to first-order statistics, and that they work regardless of whether the variables are Gaussian. We show that TSCD has identifiable causal order and parameters from a number of interventions that is logarithmic in the number of variables. Experiments show that TSCD is robust to noise, competitive with existing methods, and scales to hundreds of variables.

URL PDF HTML ☆

赞 0 踩 0

2606.18011 2026-06-17 stat.ML cs.LG stat.ME 新提交

Fast Nonparametric Conditional Independence Testing via Two-Stage Regression

通过两阶段回归的快速非参数条件独立性检验

Eric V. Strobl

发表机构 * Department of Biomedical Informatics, University of Pittsburgh（生物医学信息学系，匹兹堡大学）

AI总结提出BLITZ方法，通过两阶段回归（低阶多项式+浅层树）快速消除条件集影响，实现校准良好的非参数条件独立性检验，适用于因果发现。

Comments A fast R implementation with C++ back-end is available at this https URL (https://github.com/ericstrobl/BLITZ)

详情

AI中文摘要

基于约束的因果发现依赖于重复的条件独立性检验，但快速非参数检验往往牺牲校准性，尤其是当变量通过非线性关系依赖于条件集时。我们提出了BLITZ（Broad-to-Local Independence Testing via residualiZation），一种非参数条件独立性检验，旨在在一秒内运行良好，同时保持约束因果发现算法执行数千次查询所需的准确性。BLITZ首先使用低阶多项式回归消除对条件集的广泛平滑依赖，然后应用一个小型非线性特征映射，并通过浅层树回归对这些特征进行残差化。得到的统计量检验残差互协方差，并采用矩匹配卡方近似于零分布。我们从理论上证明，两阶段设计降低了树残差化器面临的有效复杂度，使得浅层树能够控制残差条件均值偏差，同时避免过度过拟合。在模拟中，BLITZ提供了比快速核、随机特征和基于回归的竞争者更好的零校准，同时保持所测试方法中最快的速度之一。在合成图和流式细胞术数据的因果发现实验中，BLITZ在保留的邻接中产生了更可靠的端点方向，并具有竞争力的结构恢复。这些结果表明，从宽到局部残差化是实现因果发现中校准、可扩展的非参数条件独立性检验的实用途径。

英文摘要

Constraint-based causal discovery relies on repeated conditional independence tests, but fast nonparametric tests often sacrifice calibration, especially when variables depend on the conditioning set through nonlinear relationships. We introduce BLITZ (Broad-to-Local Independence Testing via residualiZation), a nonparametric conditional independence test designed to run well under a second while maintaining the accuracy needed for the thousands of queries performed by constraint-based causal discovery algorithms. BLITZ first removes broad smooth dependence on the conditioning set using low-order polynomial regression, then applies a small nonlinear feature map and residualizes those features with shallow tree regressions. The resulting statistic tests residual cross-covariance, with a moment-matched chi-square approximation to the null distribution. We show theoretically that the two-stage design reduces the effective complexity faced by the tree residualizers, allowing shallow trees to control residual conditional-mean bias while avoiding excessive overfitting. In simulations, BLITZ provides better null calibration than fast kernel, random-feature, and regression-based competitors while remaining among the fastest methods tested. In causal discovery experiments on synthetic graphs and flow-cytometry data, BLITZ yields more reliable endpoint orientations among retained adjacencies and competitive structural recovery. These results suggest that broad-to-local residualization is a practical route to calibrated, scalable nonparametric conditional independence testing for causal discovery.

URL PDF HTML ☆

赞 0 踩 0

2606.17383 2026-06-17 q-fin.RM cs.AI cs.LG stat.ML 新提交

Model Validation of Agentic AI Systems: A POMDP-Based Framework for Belief-State, Forecast, and Policy Validation

智能体AI系统的模型验证：基于POMDP的信念状态、预测与策略验证框架

Matthew Francis Dixon

发表机构 * Quiota LLC（Quiota公司）

AI总结提出基于部分可观测马尔可夫决策过程（POMDP）的智能体AI模型验证框架，将自主决策分解为信息、信念、预测、动作和效用组件独立验证，并通过投资组合管理案例展示其有效性。

Comments 28 pages, 3 figures, 6 tables. Source code available from this https URL (https://github.com/mfrdixon/agentic-AI-as-POMDP)

详情

AI中文摘要

智能体人工智能系统引入了一类新的模型风险。与传统预测模型不同，自主智能体持续获取信息，形成关于环境潜在状态的信念，生成预测，选择行动，并随时间调整其行为。现有的验证方法主要关注预测准确性，因此对底层决策过程的质量提供的洞察有限。本文提出了一种基于部分可观测马尔可夫决策过程（POMDP）的智能体AI模型验证框架。该框架将自主决策分解为信息、信念、预测、行动和效用，允许每个组件独立验证。大型语言模型（LLM）被形式化为近似贝叶斯滤波算子，并开发了一个模型风险分类体系，涵盖状态空间、滤波、预测、策略、效用规范和参数风险。通过一个投资组合管理案例研究展示了模型风险验证方法，其中智能体从市场和宏观经济信息中推断潜在市场制度，生成基于信念的预测，并使用Black-Litterman框架构建投资组合。实证验证结合了性能分析、信念校准诊断、覆盖测试、消融研究和参数敏感性分析。结果表明，潜在状态推断对决策质量有独立贡献，且主要结论在广泛的参数值范围内保持稳健。本文的主要贡献是提供了一个实用框架，将已建立的模型风险管理概念扩展到自主AI系统，并为其验证、治理和监控提供了严格的基础。

英文摘要

Agentic artificial intelligence systems introduce a new class of model risk. Unlike traditional predictive models, autonomous agents continuously acquire information, form beliefs regarding latent states of the environment, generate forecasts, select actions, and adapt their behavior over time. Existing validation methodologies focus primarily on predictive accuracy and therefore provide limited insight into the quality of the underlying decision process. This paper proposes a model validation framework for agentic AI based on Partially Observable Markov Decision Processes (POMDPs). The framework decomposes autonomous decision making into information, beliefs, forecasts, actions, and utility, allowing each component to be validated independently. Large language models (LLMs) are formalized as approximate Bayesian filtering operators, and a model-risk taxonomy is developed encompassing state-space, filtering, forecast, policy, utility-specification, and parameter risks. The model risk validation methodology is demonstrated through a portfolio-management case study in which an agent infers latent market regimes from market and macroeconomic information, generates belief-conditioned forecasts, and constructs portfolios using a Black--Litterman framework. Empirical validation combines performance analysis, belief calibration diagnostics, coverage tests, ablation studies, and parameter-sensitivity analysis. The results indicate that latent-state inference contributes independently to decision quality and that the principal conclusions remain robust across a broad range of parameter values. The principal contribution of the paper is a practical framework for extending established model risk management concepts to autonomous AI systems and providing a rigorous foundation for their validation, governance, and monitoring.

URL PDF HTML ☆

赞 0 踩 0

2606.17196 2026-06-17 stat.ML cs.LG stat.ME 新提交

Another Look at Log-PCA for Probability Measures: A Dynamical Formulation and Statistical Convergence

再探概率测度的Log-PCA：一种动力学公式与统计收敛性

Peng Xu, Changbo Zhu, Young-Heon Kim, Xiaohui Chen

发表机构 * Department of Statistics University of Illinois Urbana-Champaign（统计学系伊利诺伊大学厄巴纳-香槟分校）； Department of ACMS University of Notre Dame（ACMS系诺丁汉大学）； Department of Mathematics University of British Columbia（数学系不列颠哥伦比亚大学）； Department of Mathematics Thomas Lord Department of Computer Science University of Southern California（数学系托马斯·劳德计算机科学系南加州大学）

AI总结本文在Wasserstein几何下提出一种动力学公式解释log-PCA，称为Wasserstein切向PCA（WT-PCA），并推导了经验WT-PCA相对于总体测度的统计收敛速率。

2606.17215 2026-06-17 cs.LG cs.DS stat.ML 新提交

Sum-of-Squares Degree Barriers for the Reweighted-Hinge Method in Robust Halfspace Learning: A Christoffel-Function Characterization

鲁棒半空间学习中重加权铰链方法的平方和度障碍：一个Christoffel函数刻画

Xiaoyu Li

发表机构 * Xiaoyu Li（李小宇）

AI总结本文通过Christoffel函数精确刻画了有界度证书无法去除的异常质量，揭示了重加权铰链方法在恶意噪声下学习γ-间隔半空间时，证书的SoS度与异常容忍度之间的基本权衡。

详情

AI中文摘要

芬斯勒几何、图神经网络与你

T. Mitchell Roddenberry, Richard G. Baraniuk

发表机构 * Rice University（莱斯大学）

AI总结针对图拉普拉斯只能近似各向同性算子的局限，提出基于芬斯勒拉普拉斯的图神经网络层，证明其收敛性并恢复非线性扩散方程的几何结构。

2606.17665 2026-06-17 math.ST math.PR stat.ML 新提交

Non-asymptotic Tail Bounds for the Kostlan--Shub--Smale Field: Tensor PCA and Spherical $k$-Spin Complexity

Kostlan–Shub–Smale 场的非渐近尾部界：张量 PCA 和球面 $k$-自旋复杂度

Jean-Marc Azaïs (IMT), Federico Dalmao (UDELAR), Yohann De Castro (ICJ, ECL, PSPM, IUF)

AI总结本文为球面上KSS随机场的上确界建立显式非渐近尾部界层次结构，并应用于尖峰张量PCA和球面k-自旋模型景观，通过管方法和秩约简将估计误差归结为KSS场，利用Kac-Rice公式和Mehta-Fyodorov表示等得到显式尾部界。

详情

AI中文摘要

本文为球面上Kostlan–Shub–Smale (KSS) 随机场的上确界建立了一个显式非渐近尾部界的层次结构，并将其应用于两个问题：尖峰张量PCA和球面$k$-自旋模型的景观。对于张量PCA，我们研究了在信噪比$\lambda$下，通过\textit{轮廓最大似然估计}（即限制在相干性至少为$\kappa$的归一化秩-$R$张量上的MLE），从单个高斯观测中估计秩为$R$、阶数为$k\ge 3$、维度为$d\ge 3$的对称信号张量的非渐近统计极限。我们的分析使用了一个单一的简化：一个确定性几何不等式（管方法）和秩约简步骤将估计误差界定为典型KSS场的上确界，而Kac-Rice公式将其转化为一个高斯积分，该积分涉及平移高斯正交系综的期望绝对特征多项式，进而由我们层次结构中的四个显式尾部界（三个来自Mehta–Fyodorov表示，一个来自Ben Arous–Dembo–Guionnet大偏差）控制。相同的简化产生了两个结果，每个都带有显式常数。对于估计，一个有限$(k,d)$误差界恢复了Perry、Wein和Bandeira的渐近最优速率$\sqrt{d\log k}$，并显式依赖于秩$R$和相干性$\kappa$。对于景观，球面$k$-自旋哈密顿量的退火复杂度的双侧非渐近括号界恢复了高维极限下的Auffinger–Ben Arous–Černý复杂度函数。

英文摘要

This paper builds a hierarchy of explicit, non-asymptotic tail bounds for the supremum of the Kostlan--Shub--Smale (KSS) random field on the sphere, and applies it to two problems: Spiked Tensor PCA and the landscape of the spherical $k$-spin model. For Tensor PCA, we study the non-asymptotic statistical limits of estimating a rank-$R$ symmetric signal tensor of order~$k\ge 3$ and dimension~$d\ge 3$ from a single Gaussian observation at signal-to-noise ratio~$\lambda$, through the \emph{profile maximum likelihood estimator}, the MLE restricted to normalized rank-$R$ tensors of coherence at least~$\kappa$. Our analysis uses a single reduction: a deterministic geometric inequality (the Tube Method) and a rank-reduction step bound the estimation error by the supremum of the canonical KSS field, which the Kac--Rice formula turns into a Gaussian integral against the expected absolute characteristic polynomial of a shifted Gaussian Orthogonal Ensemble, controlled in turn by the four explicit tail bounds of our hierarchy (three from a Mehta--Fyodorov representation, one from a Ben Arous--Dembo--Guionnet large deviation). The same reduction yields two results, each with explicit constants. For estimation, a finite-$(k,d)$ error bound recovers the asymptotically optimal rate~$\sqrt{d\log k}$ of Perry, Wein and Bandeira, with explicit dependence on the rank~$R$ and the coherence~$\kappa$. For the landscape, a two-sided non-asymptotic bracketing of the annealed complexity of the spherical $k$-spin Hamiltonian recovers the Auffinger--Ben Arous--Černý complexity function in the high-dimensional limit.

URL PDF HTML ☆

赞 0 踩 0

2606.17364 2026-06-17 math.ST math.OC stat.ML 新提交

A Polyak-Ruppert Central Limit Theorem for SA-Adam with Momentum and Non-Convergent Adaptive Preconditioning

带动量与非收敛自适应预条件化的SA-Adam的Polyak-Ruppert中心极限定理

Sunyoung An, Xiaoming Huo

AI总结证明在动量和非收敛预条件化下，Polyak-Ruppert平均的Adam迭代仍满足经典中心极限定理，渐近协方差与SGD相同。

Comments 44 pages, 3 figures

详情

AI中文摘要

结合预条件化、动量和权重衰减的自适应优化器（Adam和AdamW），在Polyak-Ruppert平均下，是单次推断的候选引擎。在动量和非收敛预条件化下，平均迭代是否保持经典的Polyak-Ruppert中心极限定理（CLT），具有三明治协方差$H^{-1}SH^{-1}$（Hessian $H$，梯度协方差$S$）？仅预条件化的分析不适用：带动量时，规范分解退化为同义反复。将增广状态（迭代、动量缓冲区）视为时变线性随机逼近（SA），我们证明了（在局部稳定化下）正漂移稳定性、非自治Polyak-Ruppert CLT和投影恒等式。结论：迭代边际协方差恰好是普通随机梯度下降（SGD）的三明治$H^{-1}SH^{-1}$，因此自适应在渐近意义下不可见。这适用于SA-Adam（次线性衰减动量增益，$\gamma\in(\alpha,1)$；次线性区间是关键），而非恒定$\beta$的Adam。耦合$L_2$权重衰减产生岭惩罚三明治，将单次推断扩展到正则化问题。

英文摘要

Adaptive optimizers combining preconditioning, momentum, and weight decay (Adam and AdamW) are, under Polyak-Ruppert averaging, candidate engines for one-pass inference. Does the averaged iterate keep the classical Polyak-Ruppert central limit theorem (CLT), with sandwich covariance $H^{-1}SH^{-1}$ (Hessian $H$, gradient covariance $S$), under momentum and non-convergent preconditioning? The preconditioner-only analysis does not carry over: with momentum the canonical decomposition collapses to a tautology. Treating the augmented state (iterate, momentum buffer) as a time-varying linear stochastic approximation (SA), we prove (under local stabilization) positive drift stability, a non-autonomous Polyak-Ruppert CLT, and a projection identity. The upshot: the iterate-marginal covariance is exactly the plain stochastic gradient descent (SGD) sandwich $H^{-1}SH^{-1}$, so the adaptivity is asymptotically invisible. This holds for SA-Adam (sub-linearly vanishing momentum gain, $\gamma\in(\alpha,1)$; the sub-linear regime is essential), not constant-$\beta$ deployed Adam. Coupled $L_2$ weight decay yields the ridge-penalized sandwich, extending one-pass inference to regularized problems.

URL PDF HTML ☆

赞 0 踩 0

2606.17260 2026-06-17 math.OC cs.LG stat.ML 新提交

Accelerated Convex Optimization via Hamiltonian Dynamics with Deterministic Integration Time

基于确定性积分时间的哈密顿动力学的加速凸优化

Xiuyuan Wang, Vishwak Srinivasan, Qiang Fu, Siddharth Mitra, Ashia Wilson, Andre Wibisono

发表机构 * Department of Computer Science, Yale University（耶鲁大学计算机科学系）； Department of EECS, Massachusetts Institute of Technology（麻省理工学院电子工程与计算机科学系）

AI总结提出基于哈密顿动力学的平滑凸优化算法，通过利用平均哈密顿流轨迹的收缩而非端点收缩，实现确定性加速收敛，并推导出具有最优一阶复杂度的离散实现。

Comments 51 pages, 7 figures. Accepted to the 39th Annual Conference on Learning Theory (COLT 2026)

2606.16379 2026-06-17 cs.LG stat.ML 新提交

Scalable and Interpretable Representation Alignment with Ordinal Similarity

可扩展且可解释的序数相似性表示对齐

Diogo Soares, Pankhil Gawade, Andrea Dittadi, Ewa Szczurek

发表机构 * University of Maryland（马里兰大学）； Google Research（谷歌研究院）

AI总结针对现有表示相似性度量缺乏可解释性、对异常值敏感且计算复杂的问题，提出基于序数相似性的三元组和四元组相似性指数，实现可解释、鲁棒且高效的对齐度量。

详情

AI中文摘要

评估表示相似性是表示学习的基础。然而，现有度量存在显著局限性：由于基线漂移而缺乏可解释性，对异常值缺乏鲁棒性，并且对于大型数据集计算上难以处理，迫使依赖启发式近似。为了解决这些问题，我们开发了一个序数相似性框架，通过三元组相似性指数（TSI）和四元组相似性指数（QSI）实例化，通过量化序数关系的一致性来衡量对齐。我们从理论上证明，这种公式本质上是可解释的、对异常值鲁棒的，并且计算高效。最后，我们建立了TSI与通过互近邻度量的局部邻域对齐之间的形式等价性。实验上，我们验证了这些性质，并表明序数相似性提供了一种可扩展的对齐度量方法，使从业者能够更好地理解和设计表示。

英文摘要

Evaluating representation similarity is fundamental to representation learning. However, existing metrics suffer from significant limitations: they lack interpretability due to shifting baselines, lack robustness to outliers, and are computationally intractable for large datasets, forcing reliance on heuristic approximations. To address this, we develop an ordinal-similarity framework, instantiated by the Triplet (TSI) and Quadruplet (QSI) Similarity Indices, which measure alignment by quantifying the consistency of ordinal relationships. We theoretically demonstrate this formulation is inherently interpretable, robust to outliers, and computationally efficient. Finally, we establish a formal equivalence between TSI and local neighborhood alignment, measured by Mutual Nearest Neighbors. Empirically, we validate these properties and show that ordinal similarity offers a scalable approach to measuring alignment, enabling practitioners to better understand and design representations.

URL PDF HTML ☆

赞 0 踩 0

2606.14954 2026-06-17 math.FA cs.LG math.OC stat.ML 新提交

通过学习支持函数摊销最大内积搜索

Theo X. Olausson, João Monteiro, Michal Klein, Marco Cuturi

AI总结提出基于回归的摊销MIPS方法，通过训练神经网络直接预测最优键，利用支持函数的凸性加速搜索，在BEIR基准上显著提升IVF匹配率。

详情

AI中文摘要

最大内积搜索（MIPS）是机器学习中的关键子程序，需要从数据库（键）中识别出与给定查询最匹配的向量。我们提出摊销MIPS：一种基于回归的方法，训练神经网络直接预测MIPS解，从而摊销在固定键数据库上从已知分布中重复求解查询的MIPS成本。我们的关键洞察是，MIPS值函数是键集合的\emph{支持}函数，这是一个经过充分研究的凸函数，其梯度给出最优键。这激发了两种互补的摊销模型：SupportNet，一个输入凸神经网络，用于回归支持函数；以及KeyNet，一个向量值网络，直接回归最优键。SupportNet可以作为聚类路由器，将查询引导到相关的数据库分区，而KeyNet可以作为原始查询的直接替代品，直接输入到现成的索引流水线中。我们在BEIR基准上的实验表明，对于文档嵌入，当考虑计算工作量（无论是FLOPs、探测次数还是挂钟时间）时，学习的SupportNet和KeyNet显著提高了IVF匹配率。我们的代码可在以下网址获取：this https URL。

英文摘要

Maximum inner product search (MIPS) is a crucial subroutine in machine learning, requiring the identification of a vector taken within a database (the keys) that best aligns with a given query. We propose amortized MIPS: a regression-based approach that trains neural networks to directly predict MIPS solutions, amortizing the cost of repeatedly solving MIPS for queries drawn from a known distribution over a fixed key database. Our key insight is that the MIPS value function is the \emph{support} function of the set of keys, a well-studied convex function whose gradient yields the optimal key. This motivates two complementary amortized models: SupportNet, an input-convex neural network trained to regress the support function, and KeyNet, a vector-valued network that directly regresses the optimal key. SupportNet can serve as a cluster router, steering queries toward relevant database partitions, while KeyNet can be used as a drop-in replacement for the original query, fed directly to off-the-shelf indexing pipelines. Our experiments on the BEIR benchmark show that, for document embeddings, learned \SupportNet{}s and \KeyNet{}s significantly improve IVF match rates when accounting for compute effort, whether measured in FLOPs, number of probes, or wall-clock time. Our code is available at: this https URL.

URL PDF HTML ☆

赞 0 踩 0

2602.23116 2026-06-17 cs.LG cs.GT stat.ML 版本更新

Provably Efficient Regularized Online RLHF with Generalized Bilinear Preferences

具有广义双线性偏好的可证明高效正则化在线RLHF

Junghyun Lee, Minju Hong, Kwang-Sung Jun, Chulhee Yun, Se-Young Yun

AI总结研究在线RLHF中正则化最佳响应最大遗憾最小化问题，通过广义双线性偏好模型证明强凸性可导出多对数遗憾，表明快速遗憾不限于KL散度。

Comments 48 pages, 3 figures (ver3: major revisions; ver2: more colorful boxes, fixed some typos)

详情

AI中文摘要

我们考虑在一般偏好和bandit反馈下在线RLHF中的正则化最佳响应最大遗憾最小化问题。虽然各种正则化器被用于增强对齐的鲁棒性，但已知的多对数遗憾保证仍然高度特定于KL。为了研究这种快速速率是否扩展到KL之外，我们采用广义双线性偏好模型（GBPM）——通过一个秩为$2r$的斜对称矩阵捕获$d$维逐项特征上的非传递偏好——以隔离一般正则化的影响。关键地，在GBPM下，我们证明任何贪婪策略的对偶间隙受限于平方估计误差，该误差仅利用强凸性和斜对称性导出。在特征覆盖假设下，我们通过贪婪采样建立了$\tilde{\mathcal{O}}(\eta d^4 C_{\min}^{-1} (\log T)^2 \wedge d^2 C_{\min}^{-1/2} \sqrt{T})$的通用多对数遗憾，并通过探索后提交（Explore-Then-Commit）建立了$\tilde{\mathcal{O}}(C_{\min}^{-2} \sqrt{\eta r T} \wedge r^{1/3} C_{\min}^{-4/3} T^{2/3})$的维度改进遗憾（对于条件良好的臂集），其中$\eta^{-1}$是正则化系数，$T$是时间范围，$C_{\min}$是依赖于臂集的量。这表明“快速”遗憾并非KL特有，而是通用强凸几何的基本结果。

英文摘要

We consider the problem of regularized best-response max-regret minimization in online RLHF under general preferences and bandit feedback. While various regularizers are utilized to robustify alignment, known polylogarithmic regret guarantees remain heavily specific to KL. To investigate whether such fast rates extend beyond KL, we adopt the Generalized Bilinear Preference Model (GBPM) -- capturing intransitive preferences over $d$-dimensional item-wise features via a rank-$2r$ skew-symmetric matrix -- to isolate the impact of generic regularization. Crucially, under GBPM, we prove that the dual gap of any greedy policy is bounded by the squared estimation error, derived using \emph{only} strong convexity and skew-symmetry. Under a feature coverage assumption, we establish a \emph{generic} polylogarithmic regret of $\tilde{\mathcal{O}}(\eta d^4 C_{\min}^{-1} (\log T)^2 \wedge d^2 C_{\min}^{-1/2} \sqrt{T})$ with Greedy Sampling, and a dimension-wise improved regret (for well-conditioned arm-sets) of $\tilde{\mathcal{O}}(C_{\min}^{-2} \sqrt{\eta r T} \wedge r^{1/3} C_{\min}^{-4/3} T^{2/3})$ with Explore-Then-Commit, where $\eta^{-1}$ is the regularization coefficient, $T$ is the time horizon, and $C_{\min}$ is an arm-set dependent quantity. This demonstrates that ``fast'' regrets are not KL-specific, but rather a fundamental consequence of generic strongly convex geometry.

URL PDF HTML ☆

赞 0 踩 0

2603.04198 2026-06-17 stat.ML cs.LG 版本更新

质疑共形预测中的覆盖-长度度量：当更短的区间并不更好时

Yizhou Min, Yizhou Lu, Lanqi Li, Zhen Zhang, Jiaye Teng

AI总结本文批判性检验共形预测中标准度量（覆盖率和区间长度）的充分性，揭示一种称为“偏见技巧”（PT）的反直觉方法可欺骗性地缩短区间长度而保持覆盖有效，并提出新度量“区间稳定性”以检测此类行为。

详情

AI中文摘要

共形预测（CP）已成为无分布不确定性量化的基石，通常通过其覆盖率和区间长度进行评估。本文批判性地检验了这些标准度量的充分性。我们证明，通过一种称为偏见技巧（PT）的反直觉方法，区间长度可能被欺骗性地改善，而覆盖率仍然有效。具体而言，对于任何给定的测试样本，PT 概率性地返回一个区间，该区间要么为空，要么使用调整后的置信水平构建，从而保持边际覆盖率。虽然 PT 可能产生欺骗性较低的区间长度，但它引入了实际漏洞：同一输入在算法的重复运行中可能产生完全不同的预测区间。我们正式推导了 PT 实现这些误导性改进的条件，并在各种回归和分类任务中提供了广泛的实证证据。此外，我们引入了一个新度量——区间稳定性，它有助于检测新的 CP 方法是否基于此类 PT 技术隐式地改善了长度。代码可在 https://this URL 获取。

英文摘要

Conformal prediction(CP) has become a cornerstone of distribution-free uncertainty quantification, conventionally evaluated by its coverage and interval length. This work critically examines the sufficiency of these standard metrics. We demonstrate that the interval length might be deceptively improved through a counter-intuitive approach termed Prejudicial Trick(PT), while the coverage remains valid. Specifically, for any given test sample, PT probabilistically returns an interval, which is either null or constructed using an adjusted confidence level, thereby preserving marginal coverage. While PT potentially yields a deceptively lower interval length, it introduces practical vulnerabilities: the same input can yield completely different prediction intervals across repeated runs of the algorithm. We formally derive the conditions under which PT achieves these misleading improvements and provide extensive empirical evidence across various regression and classification tasks. Furthermore, we introduce a new metric interval stability which helps detect whether a new CP method implicitly improves the length based on such PT-like techniques. Code is available at this https URL.

URL PDF HTML ☆

赞 0 踩 0

2512.21315 2026-06-17 cs.LG cs.CV stat.ML 版本更新

Does the Data Processing Inequality Reflect Practice? On the Utility of Low-Level Tasks

数据处理不等式是否反映实践？论低级任务的有用性

Roy Turgeman, Tom Tirer

AI总结本文研究低级处理（如去噪、编码）如何提升分类性能，证明在有限样本下存在预处理可提高准确率，并通过实验验证理论趋势。

Comments ICLR 2026 (camera-ready). Code is available at: this https URL (https://github.com/serveroy/process-before-you-classify)

详情

AI中文摘要

数据处理不等式是一个信息论原理，指出信号的信息内容不能通过处理观测数据而增加。特别地，它表明在解决分类问题之前，增强信号或对其进行编码没有益处。对于最优贝叶斯分类器，这一断言可以被证明是正确的。然而，在实践中，尽管现代深度神经网络具有强大的能力，但在高级下游任务之前执行“低级”任务仍然很常见。在本文中，我们旨在理解低级处理何时以及为何对分类有益。我们提出了一个二元分类设置的综合理论研究，其中我们考虑一个与最优贝叶斯分类器紧密相连的分类器，并随着训练样本数量的增加而收敛到它。我们证明，对于任何有限数量的训练样本，存在一种预分类处理可以提高分类准确率。我们还探讨了类分离、训练集大小和类平衡对该过程相对增益的影响。我们通过理论设置的经验研究来支持我们的理论。最后，我们进行了一项实证研究，调查去噪和编码对基准数据集上实际深度分类器性能的影响。具体来说，我们改变了训练集的大小和类别分布以及噪声水平，并展示了与我们的理论结果一致的趋势。

英文摘要

The data processing inequality is an information-theoretic principle stating that the information content of a signal cannot be increased by processing the observations. In particular, it suggests that there is no benefit in enhancing the signal or encoding it before addressing a classification problem. This assertion can be proven to be true for the case of the optimal Bayes classifier. However, in practice, it is common to perform "low-level" tasks before "high-level" downstream tasks despite the overwhelming capabilities of modern deep neural networks. In this paper, we aim to understand when and why low-level processing can be beneficial for classification. We present a comprehensive theoretical study of a binary classification setup, where we consider a classifier that is tightly connected to the optimal Bayes classifier and converges to it as the number of training samples increases. We prove that for any finite number of training samples, there exists a pre-classification processing that improves the classification accuracy. We also explore the effect of class separation, training set size, and class balance on the relative gain from this procedure. We support our theory with an empirical investigation of the theoretical setup. Finally, we conduct an empirical study where we investigate the effect of denoising and encoding on the performance of practical deep classifiers on benchmark datasets. Specifically, we vary the size and class distribution of the training set, and the noise level, and demonstrate trends that are consistent with our theoretical results.

URL PDF HTML ☆

赞 0 踩 0

2512.13853 2026-06-17 cs.LG cond-mat.stat-mech math.PR stat.ML 版本更新

Dropout Neural Network Training Viewed from a Percolation Perspective

从逾渗视角看待Dropout神经网络训练

Finley Devlin, Jaron Sanders

AI总结本文研究使用dropout训练深度神经网络时的逾渗现象，建立新逾渗模型刻画网络拓扑与路径问题的关系，揭示dropout中的逾渗效应及其可能导致训练崩溃的机制。

Comments 21 pages, 14 figures

详情

AI中文摘要

在这项工作中，我们研究了使用dropout训练深度神经网络（NNs）时逾渗的存在和影响。Dropout方法是训练NNs的正则化技术，由G. Hinton等人（2012）首次提出。这些方法在训练的每个阶段随机临时移除NN中的连接，并用随机梯度下降（SGD）更新剩余子网络。随机从网络中移除连接的过程类似于逾渗，这是统计物理的一个范式模型。如果dropout移除足够多的连接，使得NN的输入和输出之间没有路径，那么NN就无法根据数据做出预测。我们研究了模拟NN中dropout的新逾渗模型，并刻画了网络拓扑与该路径问题之间的关系。该理论证明了dropout中存在逾渗效应。我们还表明，在使用dropout训练无偏置NN时，这种逾渗效应可能导致训练崩溃；并且我们启发式地论证了这种崩溃也扩展到有偏置的NN。

英文摘要

In this work, we investigate the existence and effect of percolation in training deep Neural Networks (NNs) with dropout. Dropout methods are regularisation techniques for training NNs, first introduced by G. Hinton et al. (2012). These methods temporarily remove connections in the NN, randomly at each stage of training, and update the remaining subnetwork with Stochastic Gradient Descent (SGD). The process of removing connections from a network at random is similar to percolation, a paradigm model of statistical physics. If dropout were to remove enough connections such that there is no path between the input and output of the NN, then the NN could not make predictions informed by the data. We study new percolation models that mimic dropout in NNs and characterise the relationship between network topology and this path problem. The theory shows the existence of a percolative effect in dropout. We also show that this percolative effect can cause a breakdown when training NNs without biases with dropout; and we argue heuristically that this breakdown extends to NNs with biases.

URL PDF HTML ☆

赞 0 踩 0

2512.11784 2026-06-17 cs.LG stat.ML 版本更新

使用生成式AI的空间疾病映射与差异检测：一种摊销贝叶斯学习框架

Luca Aiello, Sudipto Banerjee

AI总结提出一种摊销贝叶斯框架，通过神经网络近似后验分布，实现跨不同区域图的空间边界检测，并在呼吸疾病和肺癌数据中验证其有效性。

详情

AI中文摘要

我们引入了一个用于空间边界检测的摊销贝叶斯框架，该框架能够推广到具有不同区域数量和多样邻接结构的区域图上的后验推断。底层模型将泊松计数似然与协变量驱动的规则相结合，以中断跨不相似相邻区域的平滑，并利用有向无环图自回归（DAGAR）先验来捕捉残差空间依赖性。为了逼近目标后验分布，我们在模拟地图上训练了一个神经引擎：一个置换不变摘要网络编码观测计数、偏移量、协变量和邻接矩阵的图感知表示，而一个条件归一化流生成近似的后验样本。模拟研究证明了准确的参数恢复、接近名义水平的区间覆盖、良好校准的后验预测行为以及信息丰富的后验边界概率。与马尔可夫链蒙特卡洛（MCMC）的基准测试证实了在主要边界证据上的紧密一致性，而消融研究验证了包含模型引导的图摘要的有效性。最后，应用于格拉斯哥呼吸系统疾病和加利福尼亚肺癌数据表明，一个训练好的神经引擎可以无缝部署到具有不同图结构的真实世界地图上，产生的边界结论与已建立的局部平滑分析一致。

英文摘要

We introduce an amortized Bayesian framework for spatial boundary detection that generalizes posterior inference across areal graphs with varying numbers of regions and diverse adjacency structures. The underlying model couples a Poisson count likelihood with a covariate-driven rule to interrupt smoothing across dissimilar neighboring areas, utilizing a directed acyclic graph autoregressive (DAGAR) prior to capture residual spatial dependence. To approximate the target posterior distribution, a neural engine is trained on simulated maps: a permutation-invariant summary network encodes graph-aware representations of the observed counts, offsets, covariates, and adjacency matrices, while a conditional normalizing flow generates the approximate posterior draws. Simulation studies demonstrate accurate parameter recovery, near-nominal interval coverage, well-calibrated posterior predictive behavior, and informative posterior boundary probabilities. Benchmarking against Markov chain Monte Carlo (MCMC) confirms close agreement regarding primary boundary evidence, and an ablation study validates the inclusion of model-guided graph summaries. Finally, applications to Glasgow respiratory disease and California lung cancer data demonstrate that a single trained neural engine can be seamlessly deployed across real-world maps with distinct graph structures, yielding boundary conclusions consistent with established localized smoothing analyses.

URL PDF HTML ☆

赞 0 踩 0

2606.18139 2026-06-17 stat.ME 新提交

Bayesian Threshold-Aligned Joint Disease Progression Modeling for Alzheimer's Disease

贝叶斯阈值对齐的阿尔茨海默病联合疾病进展建模

Rong Wu, Duygu Tosun, Isabella Hausle, Margo Heston, Aaron Wolfe Scheffler

AI总结提出贝叶斯阈值对齐联合疾病进展模型（B-TAJ DPM），通过半参数框架将生物标志物轨迹与认知障碍生存终点联合建模，并锚定于阳性阈值，以揭示异质性进展模式。

详情

AI中文摘要

阿尔茨海默病的特征是淀粉样蛋白-β和tau蛋白的逐渐积累，数年后出现认知障碍。尽管存在这一既定模式，但在病理进展年龄和认知症状发作方面存在显著的主体间变异性。为了理解这种变异的来源，需要通过联合建模疾病进展和认知障碍发生时间（参考标志性阳性阈值）的框架，将主体对齐到异质性疾病时间线上。现有的神经退行性疾病进展模型依赖于限制性参数形式，未能将疾病时间线锚定于阳性阈值，并且将生物标志物轨迹与认知生存终点分离。为了解决这些局限性，我们引入了贝叶斯阈值对齐联合疾病进展模型（B-TAJ DPM）。这个生成式半参数框架在潜在疾病时间线上建模多变量疾病进展轨迹，这些轨迹锚定于标志性阳性阈值。关键的是，该框架整合了一个生存模型，将病理进展与认知障碍联系起来。后验推断和对未见主体的后验预测在开源软件中实现。模拟研究显示出优异的估计精度和区间覆盖率。当应用于阿尔茨海默病神经影像学倡议数据时，B-TAJ DPM刻画了非线性进展模式，量化了主体间阳性年龄的变异性，并揭示了tau阳性年龄与认知障碍加速之间的联系。

英文摘要

Alzheimer's disease is characterized by the progressive accumulation of amyloid-$\beta$ and tau followed years later by cognitive impairment. Despite this established motif, substantial subject-level variability exists in the age of pathological progression and the onset of cognitive symptoms. To understand the source of this variation, subjects must be aligned across heterogeneous disease timelines via frameworks that jointly model disease progression and time to cognitive impairment with reference to landmark positivity thresholds. Existing neurodegenerative disease progression models rely on restrictive parametric forms, fail to anchor disease timelines to positivity thresholds, and decouple biomarker trajectories from cognitive survival endpoints. To address these limitations, we introduce the Bayesian Threshold-Aligned Joint Disease Progression Model (B-TAJ DPM). This generative, semi-parametric framework models multivariate disease progression trajectories over latent disease timelines anchored at landmark positivity thresholds. Crucially, the framework integrates a survival model to link pathological progression to cognitive impairment. Posterior inference and posterior predictions for unseen subjects are carried out in open-source software. Simulation studies demonstrate excellent estimation accuracy and interval coverage. When applied to Alzheimer's Disease Neuroimaging Initiative data, B-TAJ DPM characterizes non-linear progression patterns, quantifies subject-level variation in positivity age, and reveals links between age of tau positivity and acceleration of cognitive impairment.

URL PDF HTML ☆

赞 0 踩 0

2606.17923 2026-06-17 stat.ME 新提交

Spatial mixed models for assessing environmental exposure effects on the microbiome

评估环境暴露对微生物组影响的空间混合模型

Sooran Kim, Chan Wang, Soyoung Kwak, Fares Darawshy, Alexander Bain, Leopoldo N. Segal, Jiyoung Ahn, Huilin Li

AI总结提出一种空间混合模型框架，利用条件自回归先验同时处理区域空间依赖和分类群生态依赖，在特征选择中实现高检测功率和低假阳性率，应用于PM2.5暴露研究识别相关菌属。

详情

AI中文摘要

环境暴露（如空气污染）对人类健康的影响日益受到重视。越来越多的证据表明，微生物组可能介导这些效应，从而解释环境与宿主生物学之间的关系。然而，环境暴露对微生物组的影响尚未完全明确，且该背景下的统计建模面临复杂依赖结构的挑战。具体而言，微生物组数据在采样区域间表现出空间依赖性，以及微生物分类群间的生态相关性，若忽略这些依赖，会显著降低检测能力，导致遗漏真实信号。我们提出了一种新颖的微生物组数据空间混合建模框架，该框架利用条件自回归先验同时考虑区域级空间依赖和分类群级生态依赖。通过模拟，我们证明该框架优于忽略此类依赖的现有方法，在特征选择中实现高检测功率，同时保持低假阳性率并降低估计均方误差。应用于两项真实研究——食品与微生物组纵向调查研究数据和肺微生物组数据集，其中涉及细颗粒物（PM2.5）暴露，我们的模型识别出已知与污染相关健康结果有关的菌属，以及可能介导宿主对空气污染反应的新分类群。这一新颖方法为揭示复杂环境数据中具有生物学意义的关联提供了强大而灵活的工具。

英文摘要

The influence of environmental exposures, such as air pollution, on human health has become increasingly recognized. A growing body of evidence suggests that the microbiome may mediate these effects, explaining the relationship between the environment and host biology. However, the impact of environmental exposures on the microbiome is not yet fully understood, and statistical modeling in this context is challenged by complex dependency structures. In particular, microbiome data exhibit spatial dependencies across sampling regions as well as ecological correlations among microbial taxa, which, if ignored, can substantially reduce detection power, leading to missed true signals. We introduce a novel spatial mixed modeling framework for microbiome data that accounts for both region-level spatial dependency and taxon-level ecological dependency using conditional autoregressive priors. Through simulations, we demonstrate that this framework outperforms existing methods that ignore such dependencies, by achieving high detection power in feature selection while maintaining low false positive rates and reduced mean squared error in estimation. Applied to two real studies-data from Food and Microbiome Longitudinal Investigation study and lung microbiome dataset-with fine particulate matter (PM_2.5) exposures, our model identified genera, which are known to be involved in pollution-related health outcomes, as well as novel taxa that may mediate host responses to air pollution. This novel approach offers a powerful and flexible tool for uncovering biologically meaningful associations in complex environmental data.

URL PDF HTML ☆

赞 0 踩 0

2606.17841 2026-06-17 stat.ME 新提交

Subgroup analysis in randomized controlled trials with binary outcomes: dilution and logic-respecting properties

二元结局随机对照试验中的亚组分析：稀释与逻辑一致性性质

Long-Hao Xu, Yang Han, Tim Friede

AI总结研究二元结局随机对照试验中亚组分析的比值比和相对响应的性质，证明比值比不适合作为疗效指标而相对响应合适，并阐明两者在逻辑一致性和稀释性质上的差异。

详情

AI中文摘要

亚组分析在随机对照试验中常规用于检验治疗效果在患者亚组间是否同质或由于治疗效应异质性而不同。本文研究了二元结局亚组分析中比值比和相对响应的性质，通过新的理论见解和方法学发展扩展了先前的工作。我们建立了几个新定理，描述了当两个亚组合并时，总体人群的比值比在大小和方向上如何变化。这些结果进一步证实了比值比不适合作为该亚组设置中的疗效指标，而相对响应是合适的。我们还提出了比值比和相对响应之间的正式关系，并阐明了它们在逻辑一致性性质（即总体疗效是否介于亚组疗效之间）和稀释性质（即混合亚组是否使总体比值比向1移动）方面的差异。尽管比值比通常不具有逻辑一致性，但在某些条件下它可能近似表现为具有逻辑一致性的疗效指标。为了说明我们的发现，我们基于临床试验数据给出了一个说明性示例，并讨论了其对随机对照试验中亚组分析的意义。

英文摘要

Subgroup analysis is routinely used in randomized controlled trials to examine whether treatment effects are homogeneous across patient subgroups or differ because of treatment-effect heterogeneity. In this paper, we investigate the properties of the odds ratio and the relative response in subgroup analyses with binary outcomes, extending previous work with new theoretical insights and methodological developments. We establish several new theorems that characterize how the odds ratio for the overall population changes in both magnitude and direction when two subgroups are combined. These results further confirm that the odds ratio is inappropriate as an efficacy measure in this subgroup setting, whereas the relative response is appropriate. We also present the formal relationship between the odds ratio and the relative response, and clarify their differences in terms of the logic-respecting property, that is, whether the overall efficacy lies between the subgroup efficacies, and the dilution property, that is, whether mixing subgroups moves the overall odds ratio toward 1. Although the odds ratio is generally not logic-respecting, it may behave approximately like a logic-respecting efficacy measure under certain conditions. To illustrate our findings, we present an illustrative example based on clinical trial data and discuss its implications for subgroup analysis in randomized controlled trials.

URL PDF HTML ☆

赞 0 踩 0

2605.19772 2026-06-17 stat.ME 版本更新

Assessing covariate-adjusted risk differences in small-sample clinical trials

评估小样本临床试验中协变量调整的风险差异

Martin Schnuerch, Alex Ocampo, Klaus Kähler Holst, Christian Stock

AI总结针对小样本（N≤150）随机临床试验，通过模拟研究比较精确无条件检验、Mantel-Haenszel方法和g计算法在估计和检验风险差异时的表现，发现g计算法在极小样本中I类错误膨胀，而稳健或惩罚变体可改善错误控制但牺牲功效，经典方法稳健但效率较低，并基于结果提供方法选择建议。

Comments 22 pages, 3 figures

详情

AI中文摘要

二元终点在临床试验中很常见，传统上使用条件优势比来评估治疗效果。然而，优势比的解释困难，不可压缩，且依赖于强假设才能成为试验的相关总体汇总指标。作为替代，风险差异作为更可解释、临床意义更明确且假设更少的治疗效果度量，日益受到重视。这一转变也受到新监管指南的推动，该指南强调边际估计量的相关性并鼓励协变量调整。然而，风险差异的协变量调整推断，特别是在小样本中，存在方法学上的细微差别，且缺乏公认的最佳实践。我们进行了一项模拟研究，比较了在小样本（N≤150）随机临床试验中，存在预后分类基线协变量时，估计和检验风险差异的方法，重点关注精确无条件检验、Mantel-Haenszel方法和g计算（标准化）方法。我们发现，当应用标准Wald型推断时，几种g计算方法在极小样本中表现出I类错误膨胀，而稳健或惩罚变体以牺牲功效为代价改善了错误控制。经典方法如Mantel-Haenszel和Suissa-Shuster检验保持稳健，但可能放弃协变量调整带来的效率提升。总体而言，我们的结果表明，观察到的I类错误膨胀很大程度上反映了估计量与方差估计之间的错配，而非仅由小样本量导致。基于这些结果，我们提供了实用建议，以指导方法选择，使估计量、方差估计和推断目标保持一致。

英文摘要

Binary endpoints are common in clinical trials and conditional odds ratios have traditionally been used to assess treatment effects. However, the interpretation of odds ratios is difficult, they are non-collapsible and rely on strong assumptions in order to be a relevant overall summary measure for the trial. As an alternative, risk differences have gained increasing prominence as a more interpretable, clinically meaningful and assumption-lean measure of treatment effects. This shift has also been motivated by new regulatory guidance, which emphasizes the relevance of marginal estimands and encourages covariate adjustment. Yet, covariate-adjusted inference for risk differences, particularly in smaller samples, has methodological subtleties and lacks well-established best practices. We conduct a simulation study comparing methods for estimating and testing risk differences in small-sample ($N \leq 150$) randomized clinical trials with prognostic categorical baseline covariates, focusing on exact unconditional tests, Mantel-Haenszel methods, and $g$-computation (standardization) approaches. We find that several $g$-computation approaches exhibit inflated Type-I error in very small samples when standard Wald-type inference is applied, whereas robust or penalized variants improve error control at the expense of power. Classical methods such as the Mantel-Haenszel and Suissa-Shuster tests remain robust but may forgo efficiency gains from covariate adjustment. Overall, our results indicate that much of the observed Type-I error inflation reflects misalignment between estimand and variance estimation rather than small sample size alone. Based on these results, we provide practical recommendations to guide method selection that align the estimand, variance estimation, and inferential target.

URL PDF HTML ☆

赞 0 踩 0

2604.26272 2026-06-17 stat.ME 版本更新

TWICEBEE: A Two-stage Intra-patient Curve-free Bayesian Decision-Theoretic Dose Escalation Design

TWICEBEE: 一种两阶段患者内无曲线贝叶斯决策理论剂量递增设计

Dehua Bi, Katherine Ryan, Sabine Heitzeneder, Zina Good, John S. Tamaresis, Robert Lowsky, Michelle Monje, Crystal Mackall, Ying Lu

AI总结针对多周期免疫治疗中毒性随周期递减的特点，提出两阶段患者内剂量递增设计，结合加速滴定与改进的无曲线贝叶斯决策理论框架，实现安全高效的剂量探索。

详情

AI中文摘要

我们提出了一种新颖的I期患者内剂量递增设计，专门针对多周期免疫治疗环境，其中固定剂量水平的毒性在临床预期中会随着后续治疗周期而降低。该设计源于一项CAR T细胞疗法的I期试验，这是一种新兴的细胞免疫疗法，已在癌症中确立应用，并在自身免疫性疾病中日益受到研究。该设计适用于临床上认为周期特异性毒性非递增假设合理的情况。具体而言，我们基于针对两药试验的改进无曲线贝叶斯决策理论（c-CFBD）设计的外推性质（Xu等人，2025），将治疗周期视为第二维度。通过重新定义偏序，c-CFBD框架可以适应跨周期的毒性降低。所提出的设计采用两阶段结构：初始加速滴定阶段以快速探索剂量水平，随后是c-CFBD阶段以提高安全性并估计周期特异性最大耐受剂量序列。跨多种场景的模拟研究显示了良好的操作特性。

英文摘要

We propose a novel Phase I intra-patient dose-escalation design tailored for multi-cycle immunotherapy settings, in which toxicity at a fixed dose level is clinically expected to decrease over successive treatment cycles. This design was motivated by a phase I trial of CAR T cell therapy, an emerging cellular immunotherapy with established applications in cancer and growing investigation in autoimmune disease. The design is intended for settings in which nonincreasing cycle-specific toxicity assumption is clinically justified. Specifically, we build on the extrapolation property of the modified curve-free Bayesian decision-theoretic (c-CFBD) design for two-agent trials (Xu, et al. 2025), treating treatment cycle as a second dimension. By redefining the partial order, the c-CFBD framework can accommodate the reduction in toxicity across cycles. The proposed design adopts a two-stage structure: an initial accelerated titration stage to rapidly explore dose levels, followed by a c-CFBD stage to improve safety and estimate the cycle-specific maximum tolerated dose sequence. Simulation studies across a range of scenarios demonstrate favorable operating characteristics.

URL PDF HTML ☆

赞 0 踩 0

2301.07386 2026-06-17 q-bio.NC stat.AP 版本更新

Hierarchical Bayesian inference for community detection and connectivity of functional brain networks

功能脑网络社区检测与连接性的层次贝叶斯推断

Lingbin Bian, Nizhuan Wang, Leonardo Novelli, Jonathan Keith, Adeel Razi

AI总结提出基于贝叶斯潜在块模型的多层社区检测方法，在个体和群体层面稳健检测加权功能网络社区结构，保留个体变异性，并通过模拟和真实fMRI数据验证其准确性和可靠性。

详情

AI中文摘要

大多数功能性磁共振成像研究依赖于对层级组织的功能脑网络的估计，这些网络的分隔与整合反映了人类的认知和行为变化。然而，现有的从个体和群体层面分析方法中估计网络社区结构的大多数方法并未考虑受试者之间的变异性。在本文中，我们开发了一种基于贝叶斯潜在块模型（LBM）的新型多层社区检测方法。该方法能够在个体和群体层面稳健地检测具有未知社区数量的加权功能网络的社区结构，并保留个体网络的变异性。为了验证，我们提出了一种新的基于社区结构的多元高斯生成模型来模拟合成信号。我们的模拟研究表明，通过层次贝叶斯推断估计的社区成员身份与生成模型中预定义的节点标签一致。该方法还通过使用人类连接组项目中100名无关健康受试者的工作记忆任务fMRI数据的分半可重复性进行了测试。使用合成数据和真实数据的分析表明，与常用的（多层）模块性模型相比，我们提出的方法更准确、更可靠。

英文摘要

Most functional magnetic resonance imaging studies rely on estimates of hierarchically organized functional brain networks whose segregation and integration reflect the cognitive and behavioral changes in humans. However, most existing methods for estimating the community structure of networks from both individual and group-level analysis methods do not account for the variability between subjects. In this paper, we develop a new multilayer community detection method based on Bayesian latent block model (LBM). The method can robustly detect the community structure of weighted functional networks with an unknown number of communities at both individual and group levels and retain the variability of the individual networks. For validation, we propose a new community structure-based multivariate Gaussian generative model to simulate synthetic signal. Our simulation study shows that the community memberships estimated by hierarchical Bayesian inference are consistent with the predefined node labels in the generative model. The method is also tested via split-half reproducibility using working memory task fMRI data of 100 unrelated healthy subjects from the Human Connectome Project. Analyses using both synthetic and real data show that our proposed method is more accurate and reliable compared with the commonly used (multilayer) modularity models.

URL PDF HTML ☆

赞 0 踩 0

2601.11735 2026-06-17 stat.ME 版本更新

Identifying Conditions Favouring Multiplicative Heterogeneity Models in Network Meta-Analysis

识别网络荟萃分析中支持乘性异质性模型的条件

Xinlei Xu, Caitlin H Daly, Audrey Béliveau

AI总结通过比较加性随机效应与乘性效应模型在nmadb数据库中的表现，发现乘性模型在拟合优度上相当或更优，且对极端观测和发表偏倚更稳健。

详情

AI中文摘要

在网络荟萃分析（NMA）中，对研究间异质性进行显式建模对于确保有效推断和避免夸大精度至关重要。虽然加性随机效应（RE）模型是传统方法，但乘性效应（ME）模型仍未得到充分探索。ME模型通过加权最小二乘法估计的共同因子膨胀研究内方差，产生与固定效应模型相同的点估计，同时膨胀置信区间。我们基于nmadb数据库中具有显著异质性的两臂研究NMA，实证比较了RE和ME模型，并使用Akaike信息准则评估模型拟合。ME模型通常提供与RE模型相当或更好的拟合。案例研究进一步揭示，RE模型对极端和不精确的观测敏感，而ME模型对此类观测赋予较小权重，因此对发表偏倚表现出更大的稳健性。我们的结果表明，在NMA实践中，ME模型值得与常规RE模型一同考虑。

英文摘要

Explicit modelling of between-study heterogeneity is essential in network meta-analysis (NMA) to ensure valid inference and avoid overstating precision. While the additive random-effects (RE) model is the conventional approach, the multiplicative-effect (ME) model remains underexplored. The ME model inflates within-study variances by a common factor estimated via weighted least squares, yielding identical point estimates to a fixed-effect model while inflating confidence intervals. We empirically compared RE and ME models across NMAs of two-arm studies with significant heterogeneity from the nmadb database, assessing model fit using the Akaike Information Criterion. The ME model often provided comparable or better fit to the RE model. Case studies further revealed that RE models are sensitive to extreme and imprecise observations, whereas ME models assign less weight to such observations and hence exhibit greater robustness to publication bias. Our results suggest that the ME model warrant consideration alongside conventional RE model in NMA practice.

URL PDF HTML ☆

赞 0 踩 0

2510.04421 2026-06-17 stat.ML cs.LG math.ST 版本更新

Learning Survival Models with Right-Censored Reporting Delays

学习带有右删失报告延迟的生存模型

Yuta Shikuri, Hironori Fujisawa

AI总结针对报告延迟导致的生存数据右删失问题，联合建模事件和报告过程的参数风险，提出一致估计量和蒙特卡洛EM算法，并利用迁移学习提高行政删失下及时风险评估的准确性。

Comments 26 pages, 3 figures, 3 tables

2401.05343 2026-06-17 q-bio.NC stat.ME 版本更新

Spectral Topological Data Analysis of Brain Signals

脑信号的谱拓扑数据分析

Anass El-Yaagoubi, Shuhao Jiao, Moo K. Chung, Hernando Ombao

AI总结提出一种保留频率信息的拓扑摘要方法，通过基于相干性的过滤构建谱景观，用于脑电信号的两样本检验，并在ADHD数据中检测到拓扑差异。

Comments 32 pages, 13 figures

详情

AI中文摘要

脑功能连接的拓扑分析通常将每对通道简化为单个标量依赖（通常是皮尔逊相关），因此无法解析组织电生理学的频率特异性同步。我们提出一种保留频率信息的拓扑摘要。谱景观通过傅里叶频率索引Bubenik（2015）的持续景观，每个过滤基于相干性距离构建，因此它是过滤尺度和频率的函数。它在相干矩阵上是Lipschitz稳定的，并在选定频带上提供函数两样本检验，其极限零分布和一致性遵循标准函数数据论证。在模拟中，该检验在零假设下保持名义水平的同时，恢复了所在频带的拓扑差异。应用于53名对照和51名ADHD儿童的脑电图，全局检验在95%水平上拒绝了两组周期拓扑的相等性（p = 0.019）；逐频带后续分析将差异定位到伽马和θ频带，尽管在此样本量下没有频带通过族系校正。该模式与这些频带在ADHD中的既定作用一致。

英文摘要

Topological analyses of brain functional connectivity usually reduce each pair of channels to a single scalar dependence, typically the Pearson correlation, and so cannot resolve the frequency-specific synchronisation that organises electrophysiology. We propose a topological summary that keeps the frequency information. The spectral landscape indexes the persistence landscape of Bubenik (2015) by Fourier frequency, building each filtration from a coherence-based distance, so that it is a function of both the filtration scale and the frequency. It is Lipschitz-stable in the coherence matrix and feeds a functional two-sample test over a chosen frequency band, whose limiting null distribution and consistency follow from standard functional-data arguments. In simulations the test recovers a topological difference in the band where it lives while holding its nominal level under the null. Applied to electroencephalography from 53 control and 51 ADHD children, a global test rejects equality of the two groups' cycle topology at the 95% level (p = 0.019); a band-by-band follow-up localises the difference to the gamma and theta bands, although none survives family-wise correction at this sample size. The pattern is consistent with the established role of these bands in ADHD.

URL PDF HTML ☆

赞 0 踩 0

2606.17723 2026-06-17 stat.AP 新提交

Tail Dependence in EU Carbon Markets: Graphical Models of Extremes for EUA Futures

欧盟碳市场中的尾部依赖：EUA期货的极值图模型

Jan Maciejowski, Manuele Leonelli

AI总结应用Hüsler-Reiss极值图模型分析EU ETS第三、四阶段20个日度变量，发现尾部网络比平均依赖网络更密集、中心节点不同，且EUA期货在尾部网络中中心性最高，而股指和外汇对则相反。

详情

AI中文摘要

理解极端价格波动如何在金融和能源市场间传播，对于欧盟排放交易体系（EU ETS）的风险管理和监管设计至关重要。我们将Hüsler-Reiss极值图模型应用于一个包含20个日度变量的系统，这些变量围绕EU ETS第三和第四阶段（2013-2025年）的EUA期货，并以高斯图模型作为平均依赖基线。尾部网络在结构上与平均依赖网络截然不同：密度显著更高，围绕不同的中心节点组织，并受部门内同质性支配，这种同质性比平均依赖水平更紧密地约束了部门边界。EUA期货在标准图模型中处于边缘位置，但在尾部网络中达到最高中心性，而股指和主要外汇对则呈现相反趋势。指数随机图模型确认了所有样本期内尾部网络中股票和外汇的边缘性，并识别出市场下行期间的三角闭合是第三阶段的现象，在第四阶段消失。阶段转变重构了尾部网络而未使其稀疏化：平均依赖急剧收缩，而尾部依赖持续存在，崩溃传染从聚集传播转变为扩散传播。这些发现对合规实体的对冲构建、监管机构的压力测试校准以及EU ETS市场系统性风险监测工具的设计具有直接意义。

英文摘要

Understanding how extreme price movements propagate across financial and energy markets is critical for risk management and regulatory design in the EU Emissions Trading System (EU ETS). We apply Hüsler-Reiss graphical models of extremes to a system of 20 daily variables centred on EU allowances futures across Phases 3 and 4 of the EU ETS (2013--2025), with a Gaussian graphical model as the average-dependence baseline. The tail networks are structurally distinct from the average dependence network: substantially denser, organized around different central nodes, and governed by within-sector homophily that binds sector boundaries more tightly than at the average-dependence level. EU allowances futures are peripheral in the standard graphical model but achieve the highest centrality in the tail networks, while equity indices and major FX pairs follow the opposite trajectory. Exponential random graph models confirm equity and FX peripherality in tail networks across all sample periods and identify triadic closure during market downturns as a Phase~3 phenomenon that vanishes in Phase~4. The phase transition restructures the tail network without thinning it: average dependence contracts sharply while tail dependence persists, and crash contagion shifts from clustered to diffuse propagation. These findings have direct implications for hedge construction by compliance entities, stress-test calibration by regulators, and the design of systemic-risk monitoring tools for EU ETS markets.

URL PDF HTML ☆

赞 0 踩 0

2502.17518 2026-06-17 cs.LG cs.AI q-fin.CP stat.ML 版本更新

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

通过分类器模型进行集成强化学习：在交易策略中增强风险回报权衡

Zheli Xiong

AI总结本文研究了在金融交易策略中使用集成强化学习模型的全面研究，利用分类器模型来提升性能。通过将A2C、PPO和SAC等强化学习算法与传统分类器如支持向量机（SVM）、决策树和逻辑回归相结合，探讨不同分类器组如何整合以改善风险回报权衡。研究评估了各种集成方法的有效性，将其与单个强化学习模型在关键金融指标（包括累计回报率、夏普比率（SR）、卡勒姆比率和最大回撤（MDD））上进行比较。结果表明，集成方法在风险调整后的回报方面始终优于基础模型，提供了更好的回撤管理和整体稳定性。然而，我们发现集成性能对方差阈值τ的选择敏感，强调了动态调整τ以达到最佳性能的重要性。本研究强调了将强化学习与分类器结合在自适应决策中的价值，对金融交易、机器人和其他动态环境具有启示。

Comments 23 pages,10 figures, 9 table

详情

AI中文摘要

本文提出了一项全面研究，探讨在金融交易策略中使用集成强化学习（RL）模型的应用，利用分类器模型来提升性能。通过结合A2C、PPO和SAC等强化学习算法与传统分类器如支持向量机（SVM）、决策树和逻辑回归，我们研究了不同分类器组如何整合以改善风险回报权衡。研究评估了各种集成方法的有效性，将其与单个RL模型在关键金融指标（包括累计回报率、夏普比率（SR）、卡勒姆比率和最大回撤（MDD））上进行比较。我们的结果表明，集成方法在风险调整后的回报方面始终优于基础模型，提供了更好的回撤管理和整体稳定性。然而，我们发现集成性能对方差阈值τ的选择敏感，强调了动态调整τ以达到最佳性能的重要性。本研究强调了将强化学习与分类器结合在自适应决策中的价值，对金融交易、机器人和其他动态环境具有启示。

英文摘要

This paper presents a comprehensive study on the use of ensemble Reinforcement Learning (RL) models in financial trading strategies, leveraging classifier models to enhance performance. By combining RL algorithms such as A2C, PPO, and SAC with traditional classifiers like Support Vector Machines (SVM), Decision Trees, and Logistic Regression, we investigate how different classifier groups can be integrated to improve risk-return trade-offs. The study evaluates the effectiveness of various ensemble methods, comparing them with individual RL models across key financial metrics, including Cumulative Returns, Sharpe Ratios (SR), Calmar Ratios, and Maximum Drawdown (MDD). Our original experimental results demonstrate that ensemble methods often outperform base models in terms of risk-adjusted returns, providing better management of drawdowns and overall stability. However, both the original analysis and the additional reproduction reported in this version show that ensemble performance is sensitive to the choice of variance threshold $\tau$, classifier group, RL-agent pair, and market universe. The reproduction evidence strengthens the conclusion that classifier-assisted ensemble selection can improve robustness, while also clarifying that the advantage is conditional rather than automatic across all datasets. This study emphasizes the value of combining RL with classifiers for adaptive decision-making, with implications for financial trading, robotics, and other dynamic environments.

URL PDF HTML ☆

赞 0 踩 0

2602.19201 2026-06-17 econ.EM stat.ME 版本更新

Panel Quantile Regression with Common Shocks

面板分位数回归与共同冲击

Harold D. Chiang, Antonio F. Galvao, Chia-Min Wei

AI总结本文发展了一种对共同冲击稳健的面板分位数回归渐近与推断理论，提出了在共同冲击存在下仍一致的协方差估计量，放宽了截面独立和T≫N的假设。

详情

AI中文摘要

本文为固定效应面板分位数回归（FEQR）发展了一种渐近与推断理论，该理论对普遍存在的共同冲击具有稳健性。这种冲击引起截面依赖性，这在许多经济和金融面板中是核心问题，但在现有的FEQR理论中很大程度上被忽略，现有理论通常假设截面独立性并要求$T \gg N$。我们证明，在温和条件$(\log N)^2/T \to 0$下，标准FEQR估计量仍保持渐近正态，从而适应经验相关的情形，包括$T \ll N$的情况。我们进一步证明，共同冲击从根本上改变了渐近协方差结构，使得传统协方差估计量不一致，并提出了一个简单的协方差估计量，在存在和不存在共同冲击的情况下均保持一致。因此，所提出的程序提供了有效的稳健推断，无需事先了解依赖结构，从而大大扩展了FEQR方法在实际面板数据环境中的适用性。

英文摘要

This paper develops an asymptotic and inferential theory for fixed-effects panel quantile regression (FEQR) that delivers inference robust to pervasive common shocks. Such shocks induce cross-sectional dependence that is central in many economic and financial panels but largely ignored in existing FEQR theory, which typically assumes cross-sectional independence and requires $T \gg N$. We show that the standard FEQR estimator remains asymptotically normal under the mild condition $(\log N)^2/T \to 0$, thereby accommodating empirically relevant regimes, including those with $T \ll N$. We further show that common shocks fundamentally alter the asymptotic covariance structure, rendering conventional covariance estimators inconsistent, and we propose a simple covariance estimator that remains consistent both in the presence and absence of common shocks. The proposed procedure therefore provides valid robust inference without requiring prior knowledge of the dependence structure, substantially expanding the applicability of FEQR methods in realistic panel data settings.

URL PDF HTML ☆

赞 0 踩 0

2604.14257 2026-06-17 econ.GN stat.AP 版本更新

Mapping the causal structure of price formation in Texas's transitioning electricity market

德克萨斯州转型电力市场中价格形成的因果结构映射

Shiva Madadkhani, Nils Sturma, Mathias Drton, Svetlana Ikonnikova

AI总结采用因果发现方法研究德克萨斯州电力市场，发现风电已成为日前电价的主要因果驱动因素，其影响是天然气的三倍以上，但价格抑制效应在高峰时段减弱，且风电增长将阻塞成本重新分配给远距离负荷中心。

详情

AI中文摘要

可再生能源的部署以及电气化和大型数字负荷带来的需求增长正在改变电力市场。然而，这些发展如何重塑电价动态仍知之甚少，导致系统规划者、容量投资者和市场参与者依赖于热力主导时代的假设，而这些假设可能不再成立。我们使用因果发现来研究正在经历快速转型的德克萨斯州的批发电价演变。我们的发现推翻了德克萨斯州是一个天然气价格驱动市场的观点，证明了风电已成为日前价格的主要因果驱动因素，其影响是天然气的三倍以上。然而，风电的价格抑制效应在高峰时段正在减弱，并且风电增长将阻塞成本重新分配给远距离负荷中心。此外，德克萨斯州南部和西部负荷的上升改变了系统价格和区域差异。通过揭示因果驱动因素的时空演变性质，我们的分析表明，新发电和大负荷的节奏、地理选址和相对规模将对未来的电价风险、基础设施需求和投资具有决定性作用。

英文摘要

Renewable deployment and rising demand from electrification and large digital loads are transforming electricity markets. However, how these developments reshape electricity price dynamics remains poorly understood, leaving system planners, capacity investors, and market participants reliant on assumptions from a thermal-dominated era that may no longer hold. We use causal discovery to study the evolution of wholesale electricity prices in Texas, which is undergoing rapid transformation. Our findings overturn the view of Texas as a gas-price-driven market, demonstrating that wind generation has become the dominant causal driver of day-ahead prices, with effects more than three times greater than those of natural gas. Yet wind's price-suppressing effect is weakening during peak periods, and wind growth redistributes congestion costs to distant load centres. Furthermore, rising load in South and West Texas alters system prices and regional differentials. Uncovering the evolving spatiotemporal nature of causal drivers, our analysis reveals that the pace, geographic siting, and relative scale of new generation and large loads will be decisive for future electricity price risks, infrastructure needs, and investments.

URL PDF HTML ☆

赞 0 踩 0

2603.27049 2026-06-17 stat.ML cs.LG 版本更新

最大化相对改进：将公平学习视为讨价还价问题

Jiwoo Han, Moulinath Banerjee, Yuekai Sun

AI总结提出将群体公平解释为子群体间的讨价还价问题，通过相对改进指标恢复Kalai-Smorodinsky解，并给出公理化和有限样本收敛保证。

Comments Accepted at ICML 2026

2503.10945 2026-06-17 cs.LG cs.AI cs.CR stat.ML 版本更新

Gaussian DP for Reporting Differential Privacy Guarantees in Machine Learning

高斯差分隐私：机器学习中报告差分隐私保证的方法

Juan Felipe Gomez, Bogdan Kulynych, Georgios Kaissis, Flavio P. Calmon, Jamie Hayes, Borja Balle, Antti Honkela

AI总结针对当前机器学习中差分隐私报告不完整的问题，提出使用非渐近高斯差分隐私（GDP）作为主要报告方式，通过数值会计和决策理论度量，证明GDP能无误差地捕获DP-SGD等算法的完整隐私特征。

Comments IEEE SatML 2026 (position paper track)

详情

AI中文摘要

当前报告机器学习算法（如DP-SGD）的差分隐私（DP）保证的做法提供了不完整且可能误导的图景。例如，如果仅知道机制的一个$(\varepsilon, \delta)$，标准分析表明可能存在针对训练数据记录的高精度推理攻击，而更仔细的分析发现，对于大多数实际机制，这种精确攻击并不存在。在这篇立场论文中，我们主张使用_非渐近_高斯差分隐私（GDP）作为机器学习中传达DP保证的主要手段，以避免这些潜在缺点。利用DP文献中的两个最新进展：（i）能够以任意精度计算DP-SGD的隐私配置文件和$f$-DP曲线的开源数值会计，以及（ii）关于DP表示的决策理论度量，我们展示了如何使用数值会计提供GDP的非渐近界，并表明GDP能够以几乎无误差的方式捕获DP-SGD及相关算法的整个隐私配置文件（由该度量量化）。为了支持我们的主张，我们研究了最先进的DP大规模图像分类以及美国十年人口普查的TopDown算法的隐私配置文件，观察到GDP在所有情况下都与其配置文件拟合得非常好。最后，我们讨论了这种方法的优缺点，并探讨了哪些其他隐私机制可以从GDP中受益。

英文摘要

Current practices for reporting differential privacy (DP) guarantees for machine learning (ML) algorithms such as DP-SGD provide an incomplete and potentially misleading picture. For instance, if only a single $(\varepsilon, \delta)$ is known about a mechanism, standard analyses show that there could exist highly accurate inference attacks against training data records, when, upon a more careful analysis, such accurate attacks do not exist for most practical mechanisms. In this position paper, we argue that using _non-asymptotic_ Gaussian Differential Privacy (GDP) as the primary means of communicating DP guarantees in ML avoids these potential downsides. Using two recent developments in the DP literature: (i) open-source numerical accountants capable of computing the privacy profile and $f$-DP curves of DP-SGD to arbitrary accuracy, and (ii) a decision-theoretic metric over DP representations, we show how to provide non-asymptotic bounds on GDP using numerical accountants, and show that GDP can capture the entire privacy profile of DP-SGD and related algorithms with virtually no error, as quantified by the metric. To support our claims, we investigate the privacy profiles of state-of-the-art DP large-scale image classification, and the TopDown algorithm for the U.S. Decennial Census, observing that GDP fits their profiles remarkably well in all cases. We conclude with a discussion on the strengths and weaknesses of this approach, and discuss which other privacy mechanisms could benefit from GDP.

URL PDF HTML ☆

赞 0 踩 0

2606.18113 2026-06-17 stat.ME 新提交

Undocumented Behavior in the gsynth R package and its Consequences for Three Published Studies

gsynth R包中的未记录行为及其对三项已发表研究的影响

Beniamino Green, P. M. Aronow

AI总结研究发现gsynth包在特定选项组合下因实现错误严重低估标准误，导致假阳性率升高，并影响三篇APSR论文的结论。

详情

AI中文摘要

在2025年12月CRAN上的1.3.1版本更新之前，gsynth（一个用于估计交互固定效应（IFE）模型的流行R包）可能严重且系统地低估标准误。当两个估计选项（inference = "parametric" 和 EM = TRUE）同时使用时，会出现这种低估，此时该包会对Gobillon和Magnac（2016）的IFE-EM估计量应用参数自助法。该包在2025年12月停止支持这种组合，最新文档现在描述参数自助法因理论上的不兼容性而不适用于IFE-EM估计量。我们的重点是在gsynth的1.3.1之前版本中发现的实现错误：当EM = TRUE时使用的参数自助法与Xu（2017）提出的算法不匹配，使用了样本内残差而非样本外误差。我们证明，仅此实现错误就可能导致低估数个数量级。我们进行了一项实证蒙特卡洛研究，在一系列州级面板数据集上随机分配安慰剂处理，并表明gsynth在现实环境中可能产生高假阳性率。我们识别出三篇发表在《美国政治科学评论》上的论文受到此行为的影响。重新分析这些论文的相关部分，我们表明：（i）纠正实现错误后，大多数发现变得不显著；（ii）使用Xu（2017）的广义合成控制方法替代IFE-EM后，所有发现均变得不显著。

英文摘要

Prior to the version 1.3.1 update on CRAN in December 2025, gsynth, a popular R package for estimating Interactive Fixed Effects (IFE) models, could drastically and systematically underestimate standard errors. This underestimation would occur when two estimation options (inference = "parametric", and EM = TRUE) were used together, in which case the package would apply a parametric bootstrap procedure to Gobillon and Magnac (2016)'s IFE-EM estimator. The package ceased supporting this combination in December 2025, and the latest documentation now describes the parametric bootstrap as not suitable for use with the IFE-EM estimator due to a theoretical incompatibility. Our focus is an implementation error we identified in the pre-1.3.1 versions of gsynth: the parametric bootstrap used when EM = TRUE did not match the algorithm proposed in Xu (2017), using in-sample residuals instead of out-of-sample errors. We show that this implementation error alone can cause underestimation by orders of magnitude. We conduct an empirical Monte Carlo study using randomly assigned placebo treatments on a series of state-level panel datasets, and show that gsynth could yield high false positive rates in realistic settings. We identify three papers published in the American Political Science Review that are affected by this behavior. Reanalyzing the relevant sections of these papers, we show that (i) correcting the implementation error renders most findings insignificant, and (ii) using Xu (2017)'s Generalized Synthetic Control method in place of IFE-EM renders every finding insignificant.

URL PDF HTML ☆

赞 0 踩 0

2606.17261 2026-06-17 cs.PF cs.SE stat.AP 新提交

The Right Call for Software Benchmarking: Consistent Decisions in Stateful Environments

软件基准测试的正确调用：有状态环境下的一致决策

Gábor Melis

AI总结针对有状态环境下基准测试偏差问题，提出基于对比估计量的实验设计，消除程序特定偏差，实现渐近正确决策。

详情

AI中文摘要

在对性能的不懈追求中，现代计算系统越来越依赖有状态机制来适应工作负载和物理环境的动态变化，这提高了效率，但使基准测试以及软件优化变得困难。事实上，自适应机制本质上会在测量之间引入时间依赖性，并导致对单个程序性能的朴素估计产生偏差。注意到纠正此类偏差需要对系统动态进行推测性假设，我们呼吁优先考虑性能差异而非绝对度量，并将软件基准测试形式化为识别最快程序的决策问题，对此相对知识就足够了。为此，我们提出了简单的实验设计，允许对比的一致估计，从而使程序特定偏差在可接受的假设下抵消。这些设计渐近地产生正确的决策，并为有状态环境下的有限预算基准测试提供了一种稳健的方法，对性能敏感软件的开发具有广泛的影响。

英文摘要

In the perpetual pursuit of performance, modern computing systems rely ever more on stateful mechanisms to accommodate the dynamics of workloads and physical environments, bolstering efficiency but confounding benchmarking and thereby the optimization of software. Indeed, by their nature, adaptive mechanisms introduce temporal dependencies between measurements and render naive estimators of individual program performance biased. Observing that rectifying such biases necessitates speculative assumptions about system dynamics, we call for prioritizing performance differentials over absolute measures and formalize software benchmarking as the decision problem of identifying the fastest program, for which relative knowledge suffices. To this end, we propose simple experiment designs admitting consistent estimators of contrasts, whereby program-specific biases cancel under tenable assumptions. These designs asymptotically yield the correct decision and afford a robust methodology for finite-budget benchmarking in stateful environments, bearing broad implications for the development of performance-sensitive software.

URL PDF HTML ☆

赞 0 踩 0

2605.24003 2026-06-17 cs.CV cs.AI stat.AP 版本更新

Remote sensing data imputation using deep learning for multispectral imagery

基于深度学习的多光谱遥感数据插补

Shuang Liu, Fiona Johnson, Rohitash Chandra

发表机构 * Water Research Centre, University of New South Wales（新南威尔士大学水研究中心）； ARC ITTC Data Analytics for Resources and Environments, University of New South Wales（新南威尔士大学资源与环境数据分析师联盟）； Transitional Artificial Intelligence Research Group, School of Mathematics and Statistics, University of New South Wales（新南威尔士大学数学与统计学过渡人工智能研究组）

AI总结针对云覆盖导致的光学卫星数据缺失问题，本研究比较了线性插值与多种深度学习模型（CNN、Inception Resnet、Autoencoder及其与LSTM的组合）在四个有藻华历史记录的湖泊中重建缺失光谱波段的效果，发现深度学习模型显著优于基线方法，其中CNN表现最佳，且基于插补图像的藻华指数与观测数据吻合良好。

详情

AI中文摘要

近年来，遥感技术在水体应用中得到越来越多的利用。使用光学卫星数据的一个常见挑战是由于云覆盖导致的观测缺失。这些数据缺口可能导致错过对水资源管理部门高度关注的湖泊中关键事件（如藻华）的检测。因此，提高光学卫星数据集的完整性对于改善藻华的监测和预测至关重要。在本研究中，我们比较了传统数据插补方法（即线性插值）与深度学习模型在四个有藻华历史记录的湖泊中重建缺失光谱波段的效果。采用的深度学习模型包括基于CNN的架构（即CNN、Inception Resnet和Autoencoder）以及基于CNN-LSTM的架构（即CNN-LSTM、Resnet-LSTM和Autoencoder-LSTM）。我们的结果表明，在人工掩膜区域内插补光谱波段值时，深度学习模型显著优于基线线性插值方法。在这些模型中，CNN在大多数湖泊中表现最佳。此外，我们通过将插补图像与观测数据进行比较，评估了基于插补图像的藻华指数（即Green/Red和NDCI）的性能。我们的结果表明，深度学习模型对于插补PlanetScope SuperDove影像中的缺失数据是有效的，从而能够实现更可靠的水体监测应用。

英文摘要

Remote sensing techniques have been increasingly utilised in aquatic applications in recent years. A common challenge in using optical satellite data is the presence of missing observations due to cloud cover. These data gaps can lead to missed detection of critical events, such as algal blooms, in lakes of high interest to water authorities. As a result, enhancing the completeness of optical satellite datasets is crucial for improving the monitoring and prediction of algal blooms. In this study, we compared a traditional data imputation method (i.e., linear interpolation) with deep learning models for reconstructing missing spectral bands across four lakes with historical records of algal blooms. The deep learning models adopted include CNN-based architectures (i.e., CNN, Inception Resnet, and Autoencoder) and CNN-LSTM-based architectures (i.e., CNN-LSTM, Resnet-LSTM, and Autoencoder-LSTM). Our results demonstrated that deep learning models substantially outperformed the baseline linear interpolation method in imputing spectral band values within artificially masked regions. Among these models, CNN delivered the best performance across most lakes. Furthermore, we evaluated the performance of algal bloom indices (i.e., Green/Red and NDCI) derived from the imputed imagery by comparing them with the observed data. Our results demonstrate that deep learning models are effective for imputing missing data in PlanetScope SuperDove imagery, enabling more reliable applications in water monitoring.

URL PDF HTML ☆

赞 0 踩 0

2408.04327 2026-06-17 stat.ME stat.AP stat.CO 版本更新

BayesFBHborrow: An R Package for Bayesian borrowing for time-to-event data from a flexible baseline hazard

BayesFBHborrow: 基于灵活基线风险的贝叶斯借用方法用于时间-事件数据的R包

Darren Scott, Sophia Axillus, Alex Lewin, Grant Izmirlian

AI总结提出一种半参数贝叶斯借用模型，通过平滑先验灵活建模基线风险，利用“块-涂抹”先验增强对非交换历史数据的鲁棒性，并开发R包实现协变量调整借用与边际风险比估计。

详情

AI中文摘要

利用外部试验信息加速药物开发的统计方法越来越受欢迎。贝叶斯方法促进了动态借用，其中响应的相似性决定了使用多少信息。我们提出了一种用于时间-事件数据的半参数贝叶斯借用模型，采用平滑先验，通过集成平均允许基线风险取任何形式 \citep{Scott2024}。通过精确建模基线风险，而不是通过固定分段区间近似其形式，当参数可交换性的借用假设成立时，可以提高功效并减少估计治疗效应的偏倚。一种“块-涂抹”借用先验通过增加借用对先验-数据冲突存在的敏感性，使模型对不可交换的历史数据具有鲁棒性，从而减少I类错误膨胀的可能性。我们介绍了BayesFBHborrow，一个实现带有历史对照的半参数贝叶斯借用模型的R包。我们演示了如何选择最优借用超参数。该模型支持协变量调整借用，当结果差异可归因于协变量分布变化时，可以减少先验-数据冲突并提高功效。由于治疗效应估计量不可折叠，可以通过贝叶斯G计算估计边际风险比，同时仍允许进行校正分析以考虑对照组漂移。我们在模拟和真实数据集上展示了贝叶斯灵活基线风险模型，针对边际估计量，进行了未调整和调整分析。

英文摘要

Statistical methods that leverage external trial information to help accelerate drug development are becoming increasingly popular. Bayesian methods facilitate dynamic borrowing, where the similarity of the response guides how much information is used. We have proposed a semiparametric Bayesian borrowing model for time-to-event data, with smoothing priors that allows the baseline hazard to take any form via an ensemble average \citep{Scott2024}. By accurately modelling the baseline hazard, rather than approximating its form via fixed piecewise intervals, power is improved and bias of the estimated treatment effect reduced when the borrowing assumption of parameter exchangeability holds. A ``lump-and-smear'' borrowing prior makes the model robust to non-exchangeable historical data by increasing the sensitivity of borrowing to the presence of prior-data conflict, reducing the potential for type I error inflation. We present BayesFBHborrow, an R package that implements our semiparametric Bayesian borrowing model with a historical control. We demonstrate how to select the optimal borrowing hyperparameters. The model supports covariate-adjusted borrowing, which can reduce prior-data conflict and improve power when differences in outcomes are attributable to changes in the covariate distribution. As the treatment effect estimator is non-collapsible, the marginal hazard ratio can be estimated via Bayesian G-computation, while still permitting an adjusted analysis to account for control group drift. We illustrate the Bayesian flexible baseline hazard model on a simulated and real dataset with a marginal estimand, for both an unadjusted and adjusted analyses.

URL PDF HTML ☆

赞 0 踩 0

2604.07336 2026-06-17 astro-ph.CO astro-ph.IM physics.data-an stat.AP 版本更新

The Non-Gaussian Weak-Lensing Likelihood: A Multivariate Copula Construction and Impact on Cosmological Constraints

非高斯弱引力透镜似然：多元Copula构建及其对宇宙学约束的影响

Veronika Oehl, Tilman Tröster

AI总结提出用Copula方法构建两点相关函数的非高斯似然，在大尺度上比高斯似然更准确，但对Stage-IV巡天影响可忽略。

Comments 16 pages, 5 figures in the main text. Published in the Open Journal of Astrophysics

详情

AI中文摘要

我们提出了一个计算两点相关函数的非高斯似然的框架。非高斯性在Stage-IV弱引力透镜巡天将精确测量的大尺度上最为显著。我们展示了如何通过Copula方法构建并高效评估这种多元似然，该方法结合了精确的一维边缘分布和来自精确多元似然的依赖结构。发现Copula似然与相关函数的模拟抽样分布比高斯似然更一致，尤其是在大尺度上。此外，我们研究了非高斯Copula似然对后验推断的影响，包括对当代弱引力透镜分析的全参数空间采样。我们发现对于$1\\ 000 \\ \mathrm{deg}^2$巡天，$S_8$可能存在约一个标准差的参数偏移，但对于$10\\ 000 \\ \mathrm{deg}^2$区域偏移可忽略，表明高斯似然对于Stage-IV巡天是足够的，尽管结果依赖于详细的掩膜几何和数据向量结构。

英文摘要

We present a framework to compute non-Gaussian likelihoods for two-point correlation functions. The non-Gaussianity is most pronounced on large scales that will be well-measured by stage-IV weak-lensing surveys. We show how such a multivariate likelihood can be constructed and efficiently evaluated using a copula approach by incorporating exact one-dimensional marginals and a dependence structure derived from the exact multivariate likelihood. The copula likelihood is found to be in better agreement with simulated sampling distributions of correlation functions than Gaussian likelihoods, particularly on large scales. We furthermore investigate the effect of the non-Gaussian copula likelihood on posterior inference, including sampling the full parameter space of contemporary weak-lensing analyses. We find potential parameter shifts in $S_8$ on the order of one standard deviation for $1 \ 000 \ \mathrm{deg}^2$ surveys but negligible shifts for areas of $10 \ 000 \ \mathrm{deg}^2$, suggesting Gaussian likelihoods are sufficient for stage-IV surveys, though results depend on the detailed mask geometry and data-vector structure.

URL PDF HTML ☆

赞 0 踩 0

2601.18252 2026-06-17 cs.CV cs.AI cs.LG stat.ML 版本更新

Co-PLNet: A Collaborative Point-Line Network for Prompt-Guided Wireframe Parsing

Co-PLNet: 一种用于提示引导的线框解析的协作点线网络

Chao Wang, Xuanying Li, Cheng Dai, Jinglei Feng, Yuxiang Luo, Hao Qin, Yuqi Ouyang

AI总结提出点线协作框架Co-PLNet，通过点线提示编码器交换空间线索，并利用交叉引导线解码器增强点线一致性，在Wireframe和YorkUrban数据集上提升线框解析的准确性和鲁棒性。

详情

AI中文摘要

线框解析旨在恢复线段及其连接点，以形成结构化的几何表示，用于同时定位与地图构建（SLAM）等下游任务。现有方法分别预测线和点，并在事后进行调和，导致不匹配和鲁棒性降低。我们提出Co-PLNet，一个点线协作框架，在两个任务之间交换空间线索，其中早期检测通过点线提示编码器（PLP-Encoder）转换为空间提示，该编码器将几何属性编码为紧凑且空间对齐的图。交叉引导线解码器（CGL-Decoder）随后通过基于互补提示的稀疏注意力细化预测，强制点线一致性和效率。在Wireframe和YorkUrban上的实验显示，准确性和鲁棒性持续改进，同时具有有利的实时效率，证明了我们在结构化几何感知中的有效性。我们的代码可在该 https URL 获取。

英文摘要

Wireframe parsing aims to recover line segments and their junctions to form a structured geometric representation useful for downstream tasks such as Simultaneous Localization and Mapping (SLAM). Existing methods predict lines and junctions separately and reconcile them post-hoc, causing mismatches and reduced robustness. We present Co-PLNet, a point-line collaborative framework that exchanges spatial cues between the two tasks, where early detections are converted into spatial prompts via a Point-Line Prompt Encoder (PLP-Encoder), which encodes geometric attributes into compact and spatially aligned maps. A Cross-Guidance Line Decoder (CGL-Decoder) then refines predictions with sparse attention conditioned on complementary prompts, enforcing point-line consistency and efficiency. Experiments on Wireframe and YorkUrban show consistent improvements in accuracy and robustness, together with favorable real-time efficiency, demonstrating our effectiveness for structured geometry perception. Our code is available at this https URL.

URL PDF HTML ☆

赞 0 踩 0

1. 统计理论与方法 8 篇

The dangers of using three-number summaries to estimate unknown standard deviations: sensitivity analyses and some possible improvements incorporating shape

Dependent Censoring Based on Geometric Optimization

Finite-Time Queue Peak Laws in Stochastic Networks: Logarithmic Scaling After Geometric Thresholds

Conformal Prediction Intervals with Tail-Specific Guarantees

Parametrically Adaptive Transition Polynomial: a Signed-Parity Continuous-alpha Extension of Kunchenko Stochastic Polynomials

Robust Local Polynomial Regression with Similarity Kernels

Dependencies in Item-Adaptive CAT Data and Differential Item Functioning Detection: A Multilevel Framework

Tests for categorical data beyond Pearson: A distance covariance and energy distance approach

2. 贝叶斯统计与概率建模 8 篇

A Bayesian Boolean Matrix Factorization with Application to Copy Number Analysis in Cancer

Amortized Probabilistic Retrieval of Atmospheric CO2 from OCO-2 Spectra Using Deep Learning with Laplace Approximations and Normalizing Flows

Bayesian Magnetic Resonance Joint Image Reconstruction and Uncertainty Quantification using Sparsity Prior Models and Markov Chain Monte Carlo Sampling

Bayesian Poisson-Randomized Gamma Tensor Factorization with Application to International Trade Flows

Generative multi-scale modeling and downscaling via spatial autoregressive transport maps

Online activity prediction via generalized Indian buffet process models

Extended feature allocation models

Fully Bayesian Wideband Direction-of-Arrival Estimation and Detection via RJMCMC

3. 因果推断与实验设计 12 篇

A Sensitivity Framework for Identifying Contagion under Latent Homophily for Fixed-in-Time Network Analyses, with an Application to U.S. House Congressional Voting

Anytime-valid Optimal Policy Identification

Kernel-Based Functional Balancing for Causal Inference with Compositional Treatments

Semiparametric Mediation Analysis with Separately Observed Mediator and Outcome under Unmeasured Confounding

FoundCause: Causal Discovery with Latent Confounders from Observational Data

Distributed Experimental Design: Bayes-optimal Fusion of Local Designs

On Response-Adaptive Targeting Strategies for Multi-Treatment Experiments

Proximal Mediation Analysis with Hidden Recanting Witnesses

Statistical Foundations of LLM-based A/B Testing: A Surrogacy Framework for Human Causal Inference

Estimating Individualized Treatment Effects in Acute Ischemic Stroke with Causal Transformation Models (TRAM-DAG): A Multi-Centre Observational Study with External RCT Validation

Causal Inference with Missing Exposures and Missing Outcomes

Instrumental and Proximal Causal Inference with Gaussian Processes

4. 高维统计与正则化 2 篇

Regularized Machine Learning for System Identification of Ship Free-Running Manoeuvres from CFD-Based Synthetic Data: A Comparative Study

Active Subsampling for Measurement-Constrained M-Estimation of Individualized Thresholds with High-Dimensional Data

5. 时间序列与空间统计 9 篇

Spatial prediction of environmental processes using random forests: How best to account for spatial dependence?

Model-based clustering of compositional trajectories for the analysis of mobility data

Understanding Long-Term Dynamics of Individual Metro Usage: A Hidden Semi-Markov State Framework with Survival Analysis

Double zero-inflated spatio-temporal modeling of daily precipitation under detection thresholds

Inference Optimal Long Run Variance Estimation with Lugsail Kernels

Public transit gains and spatially uneven travel demand changes after NYC congestion pricing

Weibull-Stationary Stochastic Differential Equations for Conditional Long-Horizon Wind Power Forecasting

On a risk model with tree-structured Poisson Markov random field frequency, with application to rainfall events

Mortality Forecasting under Climate Risk: A Stochastic Approach with Distributed Lag Non-Linear Models

6. 计算统计与MCMC 11 篇

Improving Linear Regression on Small Datasets via Gaussian Process and Extreme Value Theory-Based Data Augmentation

Tropical Viterbi Tubes for Decoding Uncertainty in Hidden Markov Models

Uncertainty Quantification of Engineering Structures by Polynomial Chaos Expansion and Multivariate Active Learning

Nested Sampling: A Critical and Comprehensive Theoretical Guide

Approximating Gaussian Whittle-Matern Fields over Well-Centered Triangulations of Riemannian Manifolds

Data augmented bootstrap: Unifying confidence interval construction by approximate invariance

Splitting schemes and estimators for stochastic differential equations with Hölder multiplicative noise

A Generalized Sinkhorn Algorithm for Mean-Field Schrödinger Bridge

Variational autoencoders with latent high-dimensional steady geometric flows for dynamics

Uncertainty in AI-driven Monte Carlo simulations

Randomized Midpoint Method for Log-Concave Sampling under Constraints

7. 机器学习统计基础 33 篇

Martingale Doppelgänger-Eval: An Identification Framework for Auditing Candlestick Understanding in Vision-Language Models

Tensor-based second-order causal discovery

Fast Nonparametric Conditional Independence Testing via Two-Stage Regression

Model Validation of Agentic AI Systems: A POMDP-Based Framework for Belief-State, Forecast, and Policy Validation

Another Look at Log-PCA for Probability Measures: A Dynamical Formulation and Statistical Convergence

Sum-of-Squares Degree Barriers for the Reweighted-Hinge Method in Robust Halfspace Learning: A Christoffel-Function Characterization

A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise

Bounded Difference Concentration for Infinitely Exchangeable Sequences with Applications to AI Benchmark Uncertainty

Tight $L_\infty$ Sample Complexity for Low-Degree and Sparse Boolean Polynomials

Finsler Geometry, Graph Neural Networks, and You

Non-asymptotic Tail Bounds for the Kostlan--Shub--Smale Field: Tensor PCA and Spherical $k$-Spin Complexity

A Polyak-Ruppert Central Limit Theorem for SA-Adam with Momentum and Non-Convergent Adaptive Preconditioning

Accelerated Convex Optimization via Hamiltonian Dynamics with Deterministic Integration Time

Scalable and Interpretable Representation Alignment with Ordinal Similarity

Representation Costs in Data Science: Foundations and the Quasi-Banach Spaces of Deep Neural Networks

Eigen-Spike Emergence and Quadratic Equivalents for Conjugate Kernels on Nonlinearly Separable Data

Approximating full conformal prediction: distribution free guarantees via the tournament correction

Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training

Conditional Local Importance by Quantile Expectations

Amortizing Maximum Inner Product Search with Learned Support Functions

Provably Efficient Regularized Online RLHF with Generalized Bilinear Preferences

Stable and Steerable Sparse Autoencoders with Weight Regularization

Learning from Biased and Costly Data Sources: Minimax-optimal Data Collection under a Budget