arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1970
专题追踪
2606.12857 2026-06-12 stat.ME stat.CO 新提交

Discrepancy Modeling with Intermediate Variables: A New Framework for Robust Gaussian Process Calibration

带中间变量的差异建模:鲁棒高斯过程校准的新框架

Henry Shaowu Yuchi, Michael Grosskopf, Aman Sharma, Nicolas Schunck, Jared O'Neal, Matt Menickelly, Stefan M. Wild

AI总结 提出利用中间变量进行差异建模的鲁棒高斯过程校准框架,通过结构化变量选择、离散化缩放高斯过程约束和空间填充设计,联合建模仿真器与差异,提升预测性能并缓解可辨识性问题。

详情
AI中文摘要

高斯过程广泛用于计算机实验中的代理建模,这些实验通常产生大量中间变量,但在标准校准框架中未明确使用。如果不利用这些变量,校准不完美模型可能具有挑战性,而分别拟合仿真器和差异模型也会带来可辨识性问题。在这项工作中,我们提出了一种鲁棒的高斯过程校准框架,利用中间变量进行差异建模。该框架集成了结构化的中间变量选择过程、离散化缩放高斯随机过程(S-GaSP)来约束差异项,以及用于选择约束点的空间填充设计策略。这使得仿真器和差异的联合建模成为可能,提高了预测性能,提供了原则性的不确定性量化,并减轻了可辨识性风险。我们在涉及结合能的核物理应用中证明了其有效性,其性能优于基线方法。

英文摘要

Gaussian processes are widely used for surrogate modeling in computer experiments, which often produce numerous intermediate variables that are not explicitly used in standard calibration frameworks. Calibration of imperfect models can be challenging without leveraging these variables, while fitting the emulator and the discrepancy models separately also poses identifiability issues. In this work, we propose a robust Gaussian process calibration framework that leverages intermediate variables for discrepancy modeling. The framework integrates a structured intermediate variable selection process, a discretized scaled Gaussian stochastic process (S-GaSP) to constrain the discrepancy term, and a space-filling design strategy for selecting constraint points. This enables joint modeling of the emulator and discrepancy, improving predictive performance, providing principled uncertainty quantification, and alleviating identifiability risks. We demonstrate its efficacy on a nuclear physics application involving binding energies, where it outperforms baseline approaches.

2606.12701 2026-06-12 stat.ME 新提交

Bayesian machine learning approach for recurrent events studies using Soft Bayesian Additive Regression Trees (SBART)

基于贝叶斯机器学习方法的复发事件研究:软贝叶斯加性回归树(SBART)

MengXing Chen, Debajyoti Sinha, Antonio Linero

AI总结 提出软贝叶斯加性回归树(SBART)非参数方法,结合软决策树与贝叶斯集成学习,用于复发事件建模,通过两层数据增强实现高效计算,在模拟和实际数据中优于现有方法。

详情
AI中文摘要

复发事件数据在生物医学研究中经常出现,其中个体可能经历同一类型事件的多次复发,例如反复住院。本文介绍了一种在贝叶斯集成学习框架下用于复发事件的非参数方法,称为软贝叶斯加性回归树(SBART),该方法结合多个软决策树以实现高预测精度和复发事件潜在强度的平滑估计。所提出的模型将非齐次泊松过程的条件强度函数表示为时间常数基线、个体特定脆弱随机效应以及捕获潜在非线性协变量效应和协变量与时间之间未知交互作用的非参数分量的乘积。采用两层数据增强方案,以在我们的计算算法中有效整合SBART组件。模拟研究表明,即使我们的建模假设不成立,我们的方法(简称RecSBART)在估计累积强度方面也优于现有方法。通过对结直肠癌患者反复住院研究的贝叶斯分析,我们进一步证明了RecSBART方法在复发事件研究中揭示和解释协变量之间潜在复杂关系的能力。

英文摘要

Recurrent event data frequently arise in biomedical studies, where individuals may experience multiple recurrences of the same type of events, such as recurrent hospitalizations. This article introduces a nonparametric method for recurrent events under a Bayesian ensemble learning framework, called Soft Bayesian Additive Regression Trees (SBART), which combines multiple soft decision trees to achieve high predictive accuracy and a smooth estimator of the underlying intensity of the recurrent events. The proposed model represents the conditional intensity function of the non-homogeneous Poisson process as the product of a time-constant baseline, a subject-specific frailty random effect, and a nonparametric component capturing potentially nonlinear covariate effects and unknown interactions among covariates and time. A two-layer data augmentation scheme is employed to efficiently incorporate the SBART component within our computational algorithm. Simulation studies demonstrate that our method, called RecSBART in short, achieves superior accuracy in estimating cumulative intensity compared to existing approaches, even when our modeling assumptions are not true. With the Bayesian analysis of a study of recurrent hospitalizations of colorectal cancer patients, we further demonstrate our RecSBART method's ability to reveal and interpret the underlying complex relationships among covariates in a recurrent events study.

2606.12677 2026-06-12 stat.ME 新提交

Restricted Multivariate Spatial Modeling

受限多变量空间建模

Jihyeon Kwon, Harrison Quick

AI总结 针对多变量条件自回归模型信息量过强的问题,提出一种通过重参数化控制信息量的受限MCAR模型,并在心脏病死亡数据中展示其优势。

Comments 30 pages

详情
AI中文摘要

在对小区域健康事件建模时,Besag、York和Mollié(BYM)的条件自回归(CAR)框架被广泛使用。对于多结局,多变量CAR(MCAR)扩展除了空间依赖性外,还容纳了共享风险因素的疾病之间的依赖性,并且还可以联合建模单一疾病的人口统计子组,允许在相关人群之间借用信息。然而,最近的研究表明,BYM CAR模型可能信息量过强,导致估计过于精确。虽然MCAR模型由于跨子组共享额外信息而预期信息量更强,但其信息量水平此前尚未被量化。我们提出了一个框架来测量MCAR模型的信息量,作为先前工作的扩展,并引入了一种控制信息量的方法,确保模型对每个子组的贡献相当。我们通过在一个计算高效的框架内对MCAR模型进行重参数化来实现这一点。我们展示了MCAR模型在信息量和过度平滑方面与BYM CAR模型的比较,并利用按种族和性别分层的县级心脏病死亡数据突出了受限MCAR模型的优势。

英文摘要

When modeling health events in small areas, the conditional autoregressive (CAR) framework of Besag, York, and Mollié (BYM) is widely used. For multiple outcomes, the multivariate CAR (MCAR) extension accommodates dependence among diseases that share risk factors, in addition to spatial dependence, and can also jointly model demographic subgroups for a single disease, allowing information to be borrowed across related populations. However, recent studies have shown that the BYM CAR model can be overly informative, leading to excessively precise estimates. While the MCAR model is expected to be more informative due to additional information shared across subgroups, its level of informativeness has not been previously quantified. We propose a framework to measure MCAR model informativeness as an extension of prior work and introduce a method to control it, ensuring the model contributes comparably to each subgroup. We achieve this through a reparameterization of the MCAR model within a computationally efficient framework. We demonstrate how the MCAR model compares with the BYM CAR model in terms of informativeness and oversmoothing and highlight the advantages of the restricted MCAR model using county-level heart disease death data stratified by race and sex.

2606.12596 2026-06-12 stat.ME 新提交

Extending Prais-Winsten Regression to Panel Data with Higher-Order Autoregressive Errors: A Simulation Study

将Prais-Winsten回归扩展到具有高阶自回归误差的面板数据:一项模拟研究

Ariel Linden

AI总结 将Prais-Winsten AR(k) GLS变换扩展到面板数据,在Stata包xtpraisk中实现,并通过蒙特卡洛模拟验证其统计性质,发现xtpraisk在保持名义第一类错误率的同时比xtscc具有更高功效,且对自回归阶数误设稳健。

详情
AI中文摘要

我们将Prais-Winsten AR(k)广义最小二乘(GLS)变换扩展到Beck-Katz面板校正标准误(PCSE)框架内的面板数据,并在社区贡献的Stata包xtpraisk中实现了该方法。作为Prais-Winsten的面板扩展,xtpraisk是xtscc(Newey-West的面板扩展和Driscoll-Kraay估计量的实现)的自然比较对象。我们进行了蒙特卡洛模拟以验证xtpraisk的统计性质,并将其有限样本性能与xtscc进行比较。模拟涵盖了自回归阶数1-3、三种自相关情景、三种面板规模、六种序列长度和五种效应大小,每种条件进行2000次重复。在所有条件下,xtpraisk在保持接近名义第一类错误率、置信区间覆盖率和标准误校准的同时,实现了比xtscc更高的功效。相比之下,xtscc在短序列长度下表现出系统性的标准误低估和第一类错误膨胀,且这两种缺陷随着自回归阶数的增加而恶化。两种估计量基本上无偏。自回归阶数的误设不会降低xtpraisk的推断性能,而面板间相关性和面板规模对任一估计量的相对性能影响可忽略。结果表明,当统计效率和有效推断均为优先考虑时,尤其是在持久的高阶自相关和短到中等序列长度下,xtpraisk更优。

英文摘要

We extend the Prais-Winsten AR(k) generalized least squares (GLS) transformation to panel data within the Beck-Katz panel-corrected standard error (PCSE) framework and implement the method in the community-contributed Stata package xtpraisk. As the panel extension of Prais-Winsten, xtpraisk is the natural comparator to xtscc, the panel extension of Newey-West and implementation of the Driscoll-Kraay estimator. We conduct a Monte Carlo simulation to validate the statistical properties of xtpraisk and compare its finite-sample performance with xtscc. The simulation spans autoregressive orders 1-3, three autocorrelation scenarios, three panel sizes, six series lengths, and five effect sizes, with 2,000 replications per condition. Across all conditions, xtpraisk achieved higher power than xtscc while maintaining near-nominal Type I error rates, confidence interval coverage, and standard error calibration. In contrast, xtscc exhibited systematic standard error underestimation and inflated Type I error at short series lengths, with both deficiencies worsening as autoregressive order increased. Both estimators were essentially unbiased. Misspecification of the autoregressive order did not degrade xtpraisk's inferential performance, and cross-panel correlation and panel size had negligible effects on the relative performance of either estimator. The results indicate that xtpraisk is preferable when both statistical efficiency and valid inference are priorities, particularly under persistent higher-order autocorrelation and short to moderate series lengths.

2606.12566 2026-06-12 stat.ME 新提交

Inferring resource selection and utilization distributions from irregular and error-prone animal tracking data

从不规则且带有误差的动物追踪数据推断资源选择和利用分布

Fanny Dupont, Brett T. McClintock, Jan-Ole Fischer, Marianne Marcoux, Nigel E. Hussey, Marie Auger-Méthé

AI总结 提出基于拉普拉斯近似的单阶段框架,通过TMB实现,同时处理测量误差和不规则采样,在模拟和独角鲸数据中优于两步法。

Comments 26 pages

详情
AI中文摘要

栖息地选择和空间利用是理解动物分布的基础。从遥测数据量化栖息地偏好的传统方法假设规则采样和可忽略的测量误差。然而,这些假设在海洋系统中经常被违反。实践者通常在进行模型拟合之前对数据进行规则化和过滤,但这两步过程未能传播过滤阶段的不确定性,并可能导致有偏估计。栖息地驱动的Langevin扩散模型提供了一种优雅的替代方案,自然地适应不规则采样。然而,通过状态空间公式纳入测量误差具有挑战性,因为栖息地协变量依赖于潜在的真实位置。我们利用拉普拉斯近似同时积分真实位置并考虑潜在路径上的栖息地协变量,从而在模板模型构建器(TMB)中高效实现单阶段框架。通过这样做,我们提供了第一个能够处理依赖于潜在变量的协变量的TMB实现,允许通过快速高效的极大似然估计进行推断。模拟表明,我们的方法优于两步法,即使在显著的测量误差和缺失数据下也能恢复栖息地选择参数,并得到更准确的利用分布和轨迹重建。应用于独角鲸(Monodon monoceros)遥测数据时,两步法将栖息地选择系数大幅缩小至接近零,而我们的统一方法恢复了更强的信号。我们的框架为栖息地选择推断中长期存在的测量误差和时间不规则性挑战提供了计算高效的解决方案,适用于广泛的分类群和环境。

英文摘要

Habitat selection and space use are fundamental to understanding animal distribution. Traditional methods for quantifying habitat preferences from telemetry data assume regular sampling and negligible measurement error. However, these assumptions are routinely violated in marine systems. Practitioners typically regularize and filter the data before fitting models, but these two-step procedures do not propagate uncertainty from the filtering stage and can yield biased estimates. Habitat-driven Langevin diffusion models offer an elegant alternative, naturally accommodating irregular sampling. However, incorporating measurement error via a state-space formulation is challenging because habitat covariates depend on the latent true locations. We address this using the Laplace approximation to simultaneously integrate over true locations and account for habitat covariates along latent paths, yielding a single-stage framework efficiently implemented in Template Model Builder (TMB). By doing so, we provide the first TMB implementation capable of handling covariates that depend on latent variables, allowing inference via fast and efficient maximum likelihood estimation. Simulations show that our approach outperforms the two-step method, recovering habitat-selection parameters even under substantial measurement error and missing data, with more accurate utilization distributions and trajectory reconstructions. Applied to narwhal (Monodon monoceros) telemetry data, the two-step method substantially shrinks the habitat selection coefficient towards zero, while our unified approach recovers a much stronger signal. Our framework offers a computationally efficient solution to long-standing challenges of measurement error and temporal irregularity in habitat selection inference, applicable across a wide range of taxa and environments.

2606.13618 2026-06-12 q-fin.PM 新提交

A Declining CVaR Glidepath Framework for Target-Date Fund Design with an Application to the Chilean Pension System

一个递减CVaR下滑路径框架用于目标日期基金设计及其在智利养老金系统中的应用

Israel Muñoz, Fernando Suárez, Omar Larré, Arturo Cifuentes

AI总结 提出一个通过递减条件风险价值约束控制风险的目标日期基金设计框架,以智利2025年养老金改革为例,发现过渡年龄是关键设计参数,缴费密度是硬约束。

Comments 29 pages, 3 figures

详情
AI中文摘要

我们提出了一个框架,用于围绕明确的回报目标设计目标日期基金(TDF),同时通过递减的条件风险价值(CVaR)约束直接在投资组合层面控制风险。在这种方法中,监管机构或发起人指定一个CVaR下滑路径,使投资组合经理有足够的灵活性以相当高的概率达到目标回报。目标回报由养老金设计输入(如退休年龄、缴费率、工作年限、预期寿命和替代率目标)外生决定。这与传统的TDF设计不同,后者设定年龄依赖的资产类别限制,而没有与所需回报明确关联。该方法的一个关键特征是它不假设经理在每个时期选择最优投资组合。相反,经理每月从满足CVaR约束的投资组合集合中抽取一个配置。这产生了对每个下滑路径的保守评估:成功概率是允许配置的平均值,而非最佳情况结果。我们引入了两个性能指标:达到目标回报的概率和TDF生命周期内累积的风险。作为概念验证,我们使用九种智利和全球资产类别以及40年的积累期,将该框架应用于智利2025年养老金改革。结果表明,风险开始下降的过渡年龄是最重要的设计参数,并且缴费密度充当硬约束:低于临界阈值时,仅靠投资组合设计无法补偿结构性低缴费。该框架是通用的,可以应用于任何围绕明确回报目标设计的TDF。

英文摘要

We propose a framework for designing Target-Date Funds (TDFs) around an explicit return objective while controlling risk directly at the portfolio level through a declining Conditional Value-at-Risk (CVaR) constraint. In this approach, the regulator or sponsor specifies a CVaR glidepath that gives the portfolio manager enough flexibility to reach a target return with a reasonably high probability. The target return is determined exogenously from pension-design inputs such as retirement age, contribution rate, working years, life expectancy, and replacement-rate goals. This differs from conventional TDF design, where age-dependent asset-class limits are set without an explicit link to a required return. A key feature of the method is that it does not assume the manager selects an optimal portfolio each period. Instead, each month the manager draws an allocation from the set of portfolios satisfying the CVaR constraint. This yields a conservative evaluation of each glidepath: success probabilities are averages over admissible allocations, rather than best-case outcomes. We introduce two figures of merit: the probability of meeting the target return and the cumulative risk assumed over the life of the TDF. As a proof of concept, we apply the framework to Chile's 2025 pension reform using nine Chilean and global asset classes and a 40-year accumulation horizon. The results show that the transition age at which risk starts to decline is the most consequential design parameter, and that contribution density acts as a hard constraint: below a critical threshold, portfolio design alone cannot compensate for structurally low contributions. The framework is general and can be applied to any TDF designed around an explicit return objective.

2606.13506 2026-06-12 econ.GN q-fin.EC 新提交

Skill vs Education Types of Labour Mismatch and Their Association with Earnings

技能与教育类型的劳动错配及其与收入的关系

Vsevolod Iakovlev

AI总结 利用26国PIAAC数据,通过教育-技能指标和误差成分模型,揭示教育错配与技能错配对收入的不同影响,并控制国家异质性后证实过度教育与过度技能导致工资惩罚,不足教育与不足技能带来工资溢价。

详情
AI中文摘要

本文分析了教育类型和技能类型的劳动错配之间的区别及其与收入的关系。利用来自OECD(2012)成人技能调查(PIAAC)第一周期的26个国家横截面数据,我使用一套全面的基于教育和技能的指标考察了教育和技能错配,探索了工人特征之间的异质性,并通过误差成分模型调查了与国家层面收入相关性冲突的来源。结果表明,国家层面的未观测异质性导致内生性偏差,其方向和大小因错配指标而异。一旦控制了未观测异质性,过度教育和过度技能与工资惩罚相关,而教育不足和技能不足则与工资溢价相关。这些发现强调了教育错配与技能错配之间的概念和实证区别,并证明了指标选择在分析中的重要性。

英文摘要

This paper analyses the distinction between educational and skill types of labour mismatch and their association with earnings. Drawing on cross-sectional data for 26 countries from the 1st Cycle of the OECD (2012) Survey of Adult Skills (PIAAC), I examine educational and skill mismatch using a comprehensive set of education- and skill-based indicators, explore heterogeneity across worker characteristics, and investigate the sources of conflicting country-level correlations with earnings through an error components model. The results show that country-level unobserved heterogeneity induces endogeneity bias, with both its direction and magnitude varying across mismatch measures. Once unobserved heterogeneity is controlled for, over-education and over-skilling are associated with wage penalties, whereas under-education and under-skilling are linked to wage premiums. These findings highlight both conceptual and empirical distinctions between educational and skill mismatch and demonstrate the importance of indicator choice in the analysis.

2606.13419 2026-06-12 q-fin.TR 新提交

Realtime price impact detection

实时价格影响检测

Ilija I Zovko

AI总结 提出通过测量交易者行为与后续不利市场事件的时间同步性来检测每笔交易的价格影响,核心是统计意外性检验,假设快速不利事件是因果证据。

详情
AI中文摘要

对于执行订单的算法交易者来说,一个重要问题是理解自己的行为是否在推动市场朝着不利于自己的方向移动——即造成市场影响。传统的答案通常是两种之一:(i)实时监控价格滑点,随着滑点增加可能减少不利活动,或(ii)放弃动态交易调整,依赖基于大量事件样本的事后滑点估计的半静态规则。实时监控失败是因为可靠估计滑点在统计上成本高昂——需要数百次成交才能将其与背景波动区分开。然而更根本的是,它并未建立因果关系。观察到的不利价格变动可能由交易者自身行为引起,也可能由争夺相同流动性并捕获相同阿尔法的无关参与者引起。最优反应(例如,减速与加速)在两种情况下相反。我们提出一种方法,通过测量交易者行为与随后不利市场事件之间的时间同步性,在每笔交易基础上检测价格影响。该方法的核心是对交易者行为后不利事件发生时间的统计意外性检验。我们必须明确,这里我们做了一个假设,即意外快速的不利市场事件是因果关系的证据,且该行为触发了它们——这是影响和信息泄露的直接特征。验证它需要真实的执行数据;我们列出了将进行的实证检验。

英文摘要

An important question for an algo trader working an order is to understand if their actions are moving the market against them -- i.e., causing market impact. The conventional answer usually is one of two: (i) monitor price slippage in real-time, potentially reducing adverse activity with increased slippage, or (ii) do away with dynamic trading adjustments and rely on semi-static rules based on ex-post estimates of slippage over a large sample of events. Realtime monitoring fails because reliably estimating slippage is statistically expensive -- it requires hundreds of fills before it can be told apart from background volatility. More fundamentally however, it does not establish causality. Observed adverse price moves may be caused by the trader's own actions, or by an unrelated participant competing for the same liquidity and capturing the same alpha. The optimal response (say, slow down vs.\ speed up) is opposite in the two cases. We propose a method that detects price impact, on a per-action basis, by measuring the timing synchronicity between a trader's actions and subsequent adverse market events. The method at heart is a test for statistical \emph{surprise} in the timing of adverse events post trader action. We must be clear in that we do make a leap of faith here and assume that surprisingly fast adverse market events are evidence of causation and that the action triggered them -- a direct signature of impact and information leakage. Validating it requires real execution data; we set out the empirical tests that would do so.

2606.13314 2026-06-12 econ.GN q-fin.EC 新提交

The Privilege of Exposure: Caste and Generative AI in India's Graduate Labour Market

暴露的特权:种姓与生成式AI在印度毕业生劳动力市场

Kaibalyapati Mishra

AI总结 研究利用印度最新劳动力调查数据,发现种姓影响毕业生对生成式AI的暴露程度,低种姓毕业生暴露度显著低于高种姓,且该差距通过职业分布和工资溢价加剧种姓收入不平等。

详情
AI中文摘要

在发展中国家劳动力市场中,谁暴露于生成式AI?我们将三个职业AI暴露指数映射到印度重新设计的定期劳动力调查(2025年),并记录了83,000名就业毕业生中显著的种姓梯度:在同一地区内,来自在册种姓和在册部落的毕业生比高种姓毕业生的暴露度低0.24-0.37个标准差。两个渠道导致了这一差距:四分之一的在册种姓和三分之一的在册部落毕业生从事不受AI影响的农业或初级职业,而那些从事白领工作的人在管理、软件和金融职业中的代表性不足。由于暴露度带来高达20%的工资溢价,生成式AI可能会扩大而非缩小印度的种姓收入差距。

英文摘要

Who is exposed to generative AI in a developing-country labour market? We map three occupational AI-exposure indices to India's redesigned Periodic Labour Force Survey (2025) and document a steep caste gradient among 83,000 employed graduates: graduates from the Scheduled Castes and the Scheduled Tribes are 0.24--0.37 standard deviations less exposed than upper-caste graduates within the same district. Two channels drive the gap: one in four SC and one in three ST graduates work in farm or elementary occupations untouched by AI, and those in white-collar work are underrepresented in managerial, software, and finance occupations. Because exposure commands a wage premium of up to 20 per cent, generative AI stands to widen, not narrow, India's caste earnings gap.

2606.12893 2026-06-12 econ.GN q-fin.EC 新提交

Technology Shocks, Relative Performance Measures, and Outcomes: Evidence from Classical Chess

技术冲击、相对绩效度量与结果:来自经典国际象棋的证据

Dan Ben-Moshe, David Genesove

AI总结 利用390万局经典国际象棋比赛数据,发现2020年神经网络引擎普及后和棋率上升约4个百分点,而基于相对绩效的等级分变化不大,表明技术冲击被广泛吸收。

详情
AI中文摘要

2020年秋季,神经网络方法使得国际象棋引擎的性能大幅提升,并免费广泛可用。到2021年底,经典国际象棋的月度和棋率上升了约四个百分点,但通常被视为棋力指标的棋手等级分分布变化不大。然而,等级分是一种相对度量,基于与其他有等级分棋手对弈的结果构建,而非绝对棋力尺度,因此广泛共享的进步不必改变等级分。利用2015年3月至2023年11月的390万局有等级分的经典比赛,我们记录到和棋率上升在控制双方等级分后仍然存在,在重复同色对局中成立,并非先前趋势的延续,并持续到样本期末。一个线性变换将疫情后等级分映射到更高的疫情前等效值,且在低等级分处差距更大,解释了拟合的和棋、白胜和黑胜概率的疫情后减疫情前偏移的90%以上。相比之下,棋手的等级分和排名没有显示出额外的排名重新洗牌,也没有相对于疫情前基准的组内离散度普遍扩大。我们将这些发现解释为与各等级分水平的采用一致,且低等级分棋手获得了更大的等级分等效增益。

英文摘要

In the fall of 2020, neural-network methods produced a large improvement in chess engines that became freely and widely available. By the end of 2021, the monthly draw rate in classical chess had risen by about four percentage points, but the distribution of player ratings, which are commonly read as measures of playing strength, had changed little. Ratings, however, are a relative measure, built from results against other rated players rather than from an absolute scale of play quality, so an improvement shared broadly across players need not change their ratings. Using 3.9 million rated classical games from March 2015 to November 2023, we document that the increased draw rate remains after conditioning on both players' ratings, holds within repeated same-color matchups, is not a continuation of a pre-existing trend, and persists through the end of the sample. A linear transformation that maps post-Covid ratings to higher pre-Covid equivalents, with a larger gap at lower ratings, accounts for more than 90 percent of the post-minus-pre shift in the fitted draw, White-win, and Black-win probabilities. Players' ratings and ranks, by contrast, show no additional rank reshuffling and no general widening of within-group dispersion relative to the pre-Covid benchmark. We interpret these findings as consistent with adoption across rating levels, with larger rating-equivalent gains for lower-rated players.

2606.12612 2026-06-12 q-fin.PM 新提交

The Mathematics of Heuristic Portfolio Optimization (HPO)

启发式投资组合优化(HPO)的数学原理

Miquel Noguer i Alonso

AI总结 本文提出启发式投资组合优化(HPO)框架,将Markowitz解投影到稳定规则类,通过隐含收益原理推导启发式最优性集,并建立与强化学习投资组合优化(RLPO)的联系,提供可测试的统计条件。

详情
AI中文摘要

从业者使用等权重、逆波动率、风险平价、HRP和经收益调整的HRP(RA-HRP)等预测轻规则分配资本。本文发展了\emph{启发式投资组合优化}(HPO):将Markowitz/切线解信息受限地投影到稳定规则类。隐含收益原理,即$\w$是最大夏普比当且仅当$\bmu_e\propto\bSigma\w$,给出了主要启发式规则的闭式最优性集,并揭示了HRP背后的Schur补替代。对于RA-HRP,我们引入了固定树聚类-夏普比递归、无单位HRP-RA-HRP插值、切线条件、条件风险分割以及权重扭曲的路径/KL分解。一阶夏普比微积分将收益信息的边际价值表示为相对于HRP的节点alpha,并得出线性KL信任预算。我们形式化了通用HPO映射,定义了隐含收益缺陷,证明其等于平方夏普比无效性,通过节点质量比刻画树-HPO重合,并给出了估计规则的偏差-方差分解。最后,HPO被嵌入强化学习投资组合优化(RLPO):每个HPO映射诱导一个确定性平稳策略;静态HPO是Bellman问题的$\gamma=0$无摩擦面;RA-HRP提供了分层策略先验;当延续价值超过短视HPO缺陷加摩擦时,动态改进是合理的。一个性能差异恒等式定价了短视价值缺口,给出了$\varepsilon/(1-\gamma)$短视界,并识别节点alpha为分层演员的策略梯度坐标。因此,HPO是静态最优性层,RLPO是动态控制层。这些条件可进行GRS检验,在椭圆对称性下扩展到均值-CVaR和期望效用,并在扩散极限下成为Kelly增长条件。

英文摘要

Practitioners allocate capital with forecast-light rules such as equal weight, inverse volatility, risk parity, HRP, and return-adjusted HRP (RA-HRP). This paper develops \emph{Heuristic Portfolio Optimization} (HPO): an information-restricted projection of the Markowitz/tangency solution onto a stable rule class. The implied-return principle, $\mathbf{w}$ is maximum-Sharpe iff $\mathbfμ_e \propto \mathbfΣ\mathbf{w}$, gives closed-form optimality sets for leading heuristics and exposes the Schur-complement substitutions behind HRP. For RA-HRP, we introduce fixed-tree cluster-Sharpe recursion, unit-free HRP--RA-HRP interpolation, tangency conditions, conditional-risk splits, and pathwise/KL decompositions of weight distortion. First-order Sharpe calculus expresses the marginal value of return information as nodewise alphas against HRP and yields a linear KL trust budget. We formalize generic HPO maps, define the implied-return defect, prove that it equals squared Sharpe inefficiency, characterize tree-HPO coincidence by nodewise mass ratios, and give a bias--variance decomposition for estimated rules. Finally, HPO is embedded into Reinforcement Learning Portfolio Optimization (RLPO): every HPO map induces a deterministic stationary policy; static HPO is the $γ=0$ no-friction face of the Bellman problem; RA-HRP supplies a hierarchical policy prior; and dynamic improvement is warranted when continuation value exceeds myopic HPO defect plus frictions. A performance-difference identity prices the myopic value gap, gives an $\varepsilon/(1-γ)$ myopia bound, and identifies nodewise alphas as policy-gradient coordinates of the hierarchical actor. Thus HPO is the static optimality layer and RLPO the dynamic control layer. The conditions are GRS-testable, extend to mean--CVaR and expected utility under ellipticity, and become Kelly-growth conditions in diffusion limits.

2606.13620 2026-06-12 q-bio.QM 新提交

Balancing label resolution and computational cost in dynamical models of lipid metabolism

脂质代谢动力学模型中标签分辨率与计算成本的平衡

Paul Jonas Jost, Christoph Thiele, Jan Hasenauer

AI总结 研究多标签脂质代谢实验中模型标签数量对参数估计、轨迹恢复和计算成本的影响,发现使用三个标签可在实验可行性、推理能力和计算效率间取得平衡。

Comments 3 Supplementary Files

详情
AI中文摘要

脂质代谢是一个核心生物学过程,通常使用破坏性质谱实验进行研究。最近提出的一种策略利用多个标签从单次破坏性测量中提取脂质代谢的时间信息。然而,基于模型的数据分析的计算复杂度随着标签数量迅速增加,在测量信息内容和分析成本之间产生基本权衡。在这里,我们研究了建模标签数量如何影响参数估计准确性、轨迹恢复和计算成本,以及建模少于实验可用标签是否可以缓解这种权衡。使用五标签实验的合成数据,我们发现建模五个标签中的三个在实验可行性、推理能力和计算可处理性之间提供了实用的平衡。在肝细胞甘油三酯循环的应用中,我们进一步表明,最具成本效益的单标签模型可能对未观测物种产生生物学上不可信的预测,而解析更多标签的模型更好地约束了这些潜在动力学。这些结果为多标签实验中选择模型分辨率提供了实用指导,并为平衡推理能力与计算成本建立了定量基础。

英文摘要

Lipid metabolism is a central biological process that is commonly studied using destructive mass-spectrometry experiments. A recently proposed strategy, uses multiple labels to extract temporal information about lipid metabolism from a single destructive measurement. However, the computational complexity of the model-based data analysis increases rapidly with the number of labels, creating a fundamental trade-off between the information content of the measurements and the cost of analysis. Here, we examine how the number of modelled labels affects parameter estimation accuracy, trajectory recovery, and computational cost, and whether modelling fewer labels than are experimentally available can mitigate this trade-off. Using synthetic data from a five-label experiment, we find that modelling three of the five labels provides a practical balance between experimental feasibility, inferential power, and computational tractability. In an application to hepatocyte triglyceride cycling, we further show that the most cost-efficient, single-label model can yield biologically implausible predictions for unobserved species, whereas models that resolve more labels better constrain these latent dynamics. These results provide practical guidance for selecting model resolution in multi-label experiments and establish a quantitative basis for balancing inferential power against computational cost.

2606.13475 2026-06-12 q-bio.QM q-bio.PE 新提交

A likelihood-based framework for simultaneously learning both noise and growth dynamics using biologically-informed neural networks

基于似然的框架:利用生物信息神经网络同时学习噪声和生长动力学

Rebecca M. Crossley, Ruth E. Baker

AI总结 提出一种扩展的生物信息神经网络框架,通过可学习的噪声模型从数据中联合发现噪声结构和生长动力学,提高了预测准确性。

Comments 28 pages (including one page SI), 6 figures (one in SI)

详情
AI中文摘要

近年来,神经常微分方程框架如生物信息神经网络(BINNs)在从稀疏数据中学习机械定律方面显示出潜力。然而,大多数现有方法隐含地假设同方差高斯噪声,因此未考虑生物变异性中潜在有意义的结构。在此,我们提出了对现有BINNs框架的扩展,包括一个可学习的噪声模型,允许直接从数据中发现噪声模型。以种群增长为例,我们证明了该框架能够准确恢复底层噪声结构,并相比现有方法改进了对底层生长定律的预测。因此,这项工作建立了一个通用的基于似然的框架,用于在机械神经网络方法中联合学习动力学和异方差噪声。

英文摘要

In recent years, neural ordinary differential equation frameworks such as Biologically-Informed Neural Networks (BINNs) have shown promise for learning mechanistic laws from sparse data. However, most existing approaches implicitly assume homoscedastic Gaussian noise, and therefore do not account for potentially meaningful structure in biological variability. Here, we present an extension to the existing BINNs framework that includes a learnable noise model, allowing discovery of the noise model directly from data. Using population growth as an example, we demonstrate that the framework accurately recovers the underlying noise structure and improves predictions of the underlying growth laws compared to existing approaches. As such, this work establishes a general likelihood-based framework for jointly learning dynamics and heteroscedastic noise within mechanistic neural network approaches.

2606.13463 2026-06-12 q-bio.OT 新提交

Begging with a Purpose? Testing Behavioural Hallmarks of First-Order Intentionality in Free-ranging Hanuman Langurs

乞讨有目的?自由活动哈努曼叶猴一阶意向性行为标志的测试

Dishari Dasgupta, Shriparna Chattopadhyay, Sruti Banerjee, Pratyusha Adhikary, Akash Dutta, Manabi Paul, Anindita Bhadra

AI总结 通过实验测试自由活动哈努曼叶猴向人类乞食时的行为标志,发现其展示了一阶意向性的多个特征,扩展了意向性研究在非猿灵长类中的分布。

详情
AI中文摘要

意向性交流在灵长类中已被广泛研究,但来自自由活动的非猿类物种的证据仍然有限。最近描述了哈努曼叶猴(Semnopithecus entellus)向人类乞食的姿势,但这些行为是否表现出与一阶意向性相关的行为标志尚不清楚。在这里,我们通过实验调查了印度南西孟加拉邦六个地点自由活动哈努曼叶猴中这些标志的存在。我们进行了360次实验和对照试验,并量化了常用于操作化意向性交流的行为标记。实验试验引发了观众检查、接收者定向、快速接近反应、乞食姿势和姿势灵活性,而这些行为在对照试验中很少或没有出现。实验和对照条件之间的差异在所有六个研究地点均显著。信号也在获得食物后停止,符合与“明显满意结果”相关的停止规则。我们的研究结果表明,自由活动哈努曼叶猴向人类定向的姿势交流中存在与一阶意向性相关的多个行为标志。这些结果将意向性研究扩展到猿类之外,并为意向性相关特征在灵长类中的进化分布提供了新见解。

英文摘要

Intentional communication has been studied extensively in primates, yet evidence from free-ranging non-ape species remains limited. Human-directed food-solicitation gestures in Hanuman langurs (Semnopithecus entellus) have recently been described, but whether these behaviours exhibit behavioural hallmarks associated with first-order intentionality remains unknown. Here, we experimentally investigated the presence of these hallmarks in free-ranging Hanuman langurs across six anthropogenic sites in southern West Bengal, India. We conducted 360 experimental and control trials and quantified behavioural markers commonly used to operationalize intentional communication. Experimental trials elicited audience checking, recipient-directed orientation, rapid approach responses, food-solicitation gestures and gestural flexibility, whereas these behaviours were rare or absent in control trials. Differences between experimental and control conditions were significant across all six study sites. Signalling also ceased following food acquisition, consistent with the stopping rule associated with an Apparently Satisfactory Outcome. Our findings demonstrate the presence of multiple behavioural hallmarks linked to first-order intentionality in the human-directed gestural communication of free-ranging Hanuman langurs. These results extend the study of intentionality beyond apes and provide new insights into the evolutionary distribution of intentionality-related traits across primates.

2606.13132 2026-06-12 q-bio.NC 新提交

Including the Cost of Irreducible Uncertainty in the Policy Compression Framework

将不可约不确定性的成本纳入策略压缩框架

Álvaro Garrido-Pérez, Pieter Simoens, Amrapali Pednekar, Yara Khaluf

AI总结 本文扩展策略压缩框架,通过引入条件熵加权项来建模不可约不确定性的认知成本,使最优策略精度可独立于奖励敏感性变化,更准确解释人类决策偏差。

Comments Accepted at the 5th International Conference on Hybrid Human-Artificial Intelligence, 2026

详情
AI中文摘要

AI决策支持系统可以从预测人类决策中的偏差中受益。许多此类偏差可能源于人类认知限制。策略压缩框架将决策建模为奖励最大化与编码状态依赖行动策略的认知成本之间的权衡,该成本形式化为状态与行动之间的互信息(策略复杂度)。我们认为这一描述是不完整的,因为它将条件熵——给定状态下应选择哪个行动的不可约不确定性——视为无成本,尽管经验证据表明它调节反应时间。因此,我们通过将认知成本定义为策略复杂度与加权条件熵项(由新参数$\eta$控制)之和来扩展该框架。由此产生的最优策略保留标准指数形式,但随着$\eta$增加而变得更尖锐,使得策略精度可以更独立于奖励敏感性变化。这一修改意味着标准策略压缩框架可能低估行动选择的认知成本,并有可能更好地解释人类决策中的偏差。同时,它给将模型拟合到人类数据带来了额外的复杂性,这需要未来工作来解决。

英文摘要

AI decision-support systems can benefit from anticipating biases in human decision-making. Many such biases may arise from human cognitive limitations. The policy compression framework models decision-making as a trade-off between reward maximization and the cognitive cost of encoding state-dependent action policies, formalized as the mutual information between states and actions (policy complexity). We argue that this account is incomplete because it treats conditional entropy--the irreducible uncertainty about which action should be selected given a state--as costless, even though empirical evidence suggests that it modulates reaction times. We therefore extend the framework by defining cognitive cost as the sum of policy complexity and a weighted conditional-entropy term, governed by a new parameter, $η$. The resulting optimal policy retains the standard exponential form but becomes sharper as $η$ increases, allowing policy precision to vary more independently of reward sensitivity. This modification implies that the standard policy compression framework may underestimate the cognitive cost of action selection, and it has the potential to better account for biases in human decision-making. At the same time, it introduces additional complexity for fitting the model to human data, which future work will need to address.

2606.13047 2026-06-12 q-bio.BM q-bio.CB 新提交

Irregular curvature at focal adhesions modulates Piezo1 activity and low frequency ultrasound induced apoptosis in cancer cells

黏着斑处的不规则曲率调控Piezo1活性及低频超声诱导的癌细胞凋亡

Ivana Pajic-Lijakovic, Milan Milivojevic, Boris Martinac, Peter V. E. McClintock

AI总结 本文提出理论框架,解释癌细胞与健康细胞对低频超声的不同响应:癌细胞不规则黏着斑曲率导致Piezo1通道松散排列保持活性,而健康细胞规则曲率促使胆固醇重排降低Piezo1活性,从而揭示超声选择性杀伤癌细胞的物理机制。

Comments 38 pages, 4 figures

Journal ref Physics of Life Reviews, June 2026

详情
AI中文摘要

低频低强度超声(LIUS)已成为一种有前景的物理方式,能够诱导癌细胞选择性凋亡,同时保留健康上皮细胞和成纤维细胞。迄今为止,这种选择性的机制尚不清楚,但我们现提出并发展了一个理论框架,将癌细胞与健康细胞的不同力学行为与其对LIUS的差异性响应联系起来。我们指出,癌细胞表现出不均匀的腹侧应力纤维网络,这可能在低强度超声(LIUS)下产生不规则的黏着斑几何形状和黏着斑附近的内向膜曲率。这些曲率不规则性有利于Piezo1通道的松散堆积,从而保持其活性。相反,健康上皮细胞和成纤维细胞表现出更均匀的细胞骨架组织,这可能导致黏着斑附近更规则的曲率轮廓。这导致曲率驱动的胆固醇重新分布,从而改变Piezo1簇的空间组织并降低协调通道活性,使细胞在暴露于LIUS时保持其活跃增殖状态。基于理论建模和先前的实验发现,我们提出细胞骨架组织和膜曲率的差异可能导致健康细胞与癌细胞之间不同的Piezo1激活模式。我们的分析将曲率介导的Piezo1重新分布确定为LIUS选择性的潜在物理基础,并为设计基于超声的疗法以利用癌细胞固有的细胞骨架脆弱性提供了机制基础。

英文摘要

Low-frequency, low intensity ultrasound (LIUS) has emerged as a promising physical modality capable of inducing selective apoptosis of cancer cells, while sparing healthy epithelial cells and fibroblasts. Hitherto, the mechanism underlying this selectivity has been unclear, but we now propose and develop a theoretical framework linking the distinct mechanical behaviours of cancer versus healthy cells to their differential responses to LIUS. We point out that cancer cells exhibit inhomogeneous ventral stress-fiber networks, which can produce irregular focal adhesion geometry and inward membrane curvature near focal adhesions under low-intensity ultrasound (LIUS). These curvature irregularities can favor loose packing of Piezo1 channels, thereby preserving their activity. In contrast, healthy epithelial cells and fibroblasts display more homogeneous cytoskeletal organization, which can result in more regular curvature profiles adjacent to focal adhesions. This leads to curvature-driven cholesterol redistribution, resulting in altered spatial organization of Piezo1 clusters and reduced coordinated channel activity and allowing cells to remain in their active, proliferative state when exposed to LIUS. Based on theoretical modeling and previous experimental findings, we propose that differences in cytoskeletal organization and membrane curvature can contribute to distinct Piezo1 activation patterns between healthy and cancerous cells. Our analysis identifies curvature-mediated Piezo1 redistribution as a potential physical basis for LIUS selectivity and provides a mechanistic foundation for designing ultrasound-based therapies to exploit the intrinsic cytoskeletal vulnerabilities of cancer cells.

2606.12772 2026-06-12 q-bio.QM 新提交

EasyNano: rapid epitope-targeted nanobody CDR design via differentiable distogram optimization with ESMFold2

EasyNano: 通过可微距离图优化与ESMFold2实现快速表位靶向纳米抗体CDR设计

Yue Hu, Wanyu Cheng, Junqing Wang, Yingchao Liu

AI总结 提出EasyNano流程,利用ESMFold2可微距离图优化,在10-20分钟内快速设计靶向特定表位的纳米抗体CDR,显著提升ipTM指标。

详情
AI中文摘要

计算设计结合用户指定蛋白表位的纳米抗体可能变革治疗开发,但当前方法要么依赖随机采样需要数天GPU计算,要么采用无法直接靶向表位的逆折叠方法。这里我们提出EasyNano,一个实用的流程,用于快速、表位靶向的纳米抗体互补决定区(CDR)设计,在高端个人工作站上约10-20分钟完成。EasyNano通过ESMFold2成对距离距离图进行梯度下降优化CDR残基logits,使用轻量级ESMFold2-Fast模型(721M)作为可微预言机,由包含专用表位邻近项的复合损失引导。完整的ESMFold2(1.3B)CA坐标结构先验防止框架位姿漂移。野生型logit初始化偏差作为控制CDR突变性的关键实际参数出现。在涵盖自我恢复和从头设计场景的六个靶标-框架对中,EasyNano将ipTM提升高达+0.559——从0.143到0.702(Ty1/RBD)——并在手动对接的AQP4靶向框架上实现4.6倍改进(ipTM从0.117到0.538),同时保持已有强结合剂的ipTM。随机CDR基线(每个靶标n=30)确认统计显著性(Ty1高于随机均值5.7 sigma)。多种子分析揭示多样的局部最小值,强调了重复运行的重要性。针对晶体结构的Kabsch交叉验证确认设计的CDR保留了框架位姿盆地。EasyNano证明基于ESMFold2的可微优化为纳米抗体CDR设计提供了一种快速、实用且表位特异的方法。

英文摘要

Computational design of nanobodies that bind user-specified protein epitopes could transform therapeutic development, but current methods either rely on stochastic sampling requiring days of GPU computation or inverse folding approaches unable to target epitopes directly. Here we present EasyNano, a practical pipeline for rapid, epitope-targeted nanobody complementarity-determining region (CDR) design that operates in approximately 10-20 minutes on a high-end personal workstation. EasyNano optimizes CDR residue logits via gradient descent through the ESMFold2 pairwise distance distogram, using the lightweight ESMFold2-Fast model (721M) as a differentiable oracle guided by a composite loss including a dedicated epitope proximity term. A full ESMFold2 (1.3B) CA-coordinate structure prior prevents framework pose drift. The wild-type logit initialization bias emerges as a critical practical parameter controlling CDR mutability. Across six target-framework pairs spanning self-recovery and de novo design scenarios, EasyNano improves ipTM by up to +0.559 -- from 0.143 to 0.702 (Ty1/RBD) -- and achieves a 4.6-fold improvement (ipTM 0.117 to 0.538) on a manually docked AQP4-targeting framework, while preserving ipTM on already-strong binders. Random CDR baselines (n=30 per target) confirm statistical significance (5.7 sigma above random mean for Ty1). Multi-seed analysis reveals diverse local minima, underscoring the importance of replicate runs. Kabsch cross-validation against crystal structures confirms that designed CDRs preserve the framework pose basin. EasyNano demonstrates that ESMFold2-based differentiable optimization provides a fast, practical, and epitope-specific approach to nanobody CDR design.

2606.12712 2026-06-12 q-bio.MN 新提交

Predictions for and lack of maximal information transmission in the neuromuscular junction

神经肌肉接头中最大信息传输的预测与缺失

Eitan Goldfein, Sarah Marzen

AI总结 通过信息最大化分析,比较理论预测与果蝇神经肌肉接头的实验分布,发现果蝇NMJ并未通过调节突触囊泡释放概率分布来最大化神经到肌肉的信息传输。

Comments 12 pages, 7 figures

详情
AI中文摘要

理论生物学的一个关键问题是生物系统在物理和功能约束下如何有效地保留其输入信息。我们在神经肌肉接头(NMJ)中通过研究胆碱能和谷氨酸能NMJ中神经递质浓度如何转化为电流来探讨这个问题。基于对剂量-反应关系的生物学理解,使用信息最大化分析推导出神经递质浓度的理论分布。将这些理论分布与从果蝇NMJ获得的实验分布进行比较。理论和实验分布显示出非常低的一致性,表明果蝇NMJ并未通过塑造其突触囊泡释放概率分布来最大化从神经系统到肌肉的信息传输。提供了胆碱能系统的预测。

英文摘要

A key question in theoretical biology is how effectively biological systems preserve information about their inputs while operating under physical and functional constraints. We examine that question at the neuromuscular junction (NMJ) by studying how neurotransmitter concentration is transformed into current at both cholinergic and glutamatergic NMJs. An information maximization analysis was used to derive a theoretical distribution over neurotransmitter concentrations based on biological understandings of dose-response relationships. These theoretical distributions were compared to an experimentally derived distribution obtained from a Drosophila NMJ. The theoretical and experimental distributions showed very little agreement, indicating that the Drosophila NMJ does not shape its distribution of synaptic vesicle release probabilities in order to maximize information transmission from nervous system to muscle. Predictions for cholinergic systems are provided.

2606.12597 2026-06-12 q-bio.QM q-bio.PE 新提交

A structural causal framework for interventions on evolutionary accumulation models

进化累积模型干预的结构因果框架

Ramon Diaz-Uriarte, Íñigo Ríos-Arroyo, Iain G. Johnston

AI总结 提出一个基于Pearl do算子的结构因果框架,用于从进化累积模型中提取干预预测,并区分杀死和失活两种干预类型,以排序候选干预目标。

详情
AI中文摘要

进化累积模型(EvAMs),也称为癌症进展模型(CPMs),从横截面数据推断肿瘤进展过程中突变累积顺序的依赖关系。已有研究表明EvAMs可用于识别治疗靶点,但文献中缺乏如何从这些模型中提取干预下预测的流程。简单的条件化于突变缺失的方法会给出错误预测。我们通过使用Pearl的do算子和条件干预,形式化“干预”对所有当前可用的EvAM方法(OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, HyperTraPS)的含义,填补了这一空白。对于每个模型,我们展示了如何实施干预(在大多数情况下作为特定的参数修改),识别等效的实施程序,并分析模块化假设——干预需要定义良好——是否合理。借助将适应性作为显式变量的个体级因果DAG,我们区分了标准EvAM表示中混淆的两种干预类型(杀死和失活)。由于目标是优先考虑干预候选,我们将问题重新定义为排序问题:我们定义了三个干预目标,并提供了一个评估EvAMs对目标排序效果的协议。我们的框架不特定于癌症或EvAMs;它适用于任何可将拟合的计算模型解释为结构因果模型的情况。代码可从该网址获取。

英文摘要

Evolutionary accumulation models (EvAMs), also known as cancer progression models (CPMs), infer dependencies in the order of accumulation of mutations during tumor progression from cross-sectional data. It has been suggested that EvAMs could be used to identify therapeutic targets, but there is no procedure in the literature for how to extract predictions under intervention from these models. A simple approach of conditioning on the absence of a mutation gives incorrect predictions. We address this gap by formalizing what ``intervene'' means for all currently available EvAM methods (OT, OncoBN, CBN, H-ESBCN, MHN, HyperHMM, HyperTraPS), using Pearl's do operator and conditional interventions. For each model, we show how to implement the intervention (in most cases as specific parameter modifications), identify equivalent implementation procedures, and analyze whether the modularity assumption -- required for the intervention to be well-defined -- is justified. Drawing on individual-level causal DAGs that make fitness an explicit variable, we distinguish two types of intervention (killing and inactivating) that are conflated in standard EvAM representations. Since the goal is to prioritize intervention candidates, we recast the problem as one of ranking: we define three intervention objectives and provide a protocol for evaluating how well EvAMs rank targets. Our framework is not specific to cancer or EvAMs; it applies wherever fitted computational models can be interpreted as structural causal models. Code available from https://github.com/rdiaz02/scm-interv-evams.

2606.12449 2026-06-12 q-bio.NC 新提交

A quantum-like benchmark for context-sensitive associative memory with adaptive plasticity

具有自适应可塑性的上下文敏感联想记忆的类量子基准

Yashine H. Goolam Hossen, Lea Gassab, Travis J. A. Craddock

AI总结 提出一种顺序敏感的自适应可塑性基准,用于测试类量子联想记忆模型在弱支持条件下的真实回忆能力,发现自适应可塑性(尤其是稳态稳定化)是主要贡献因素,且类量子模型在顺序敏感性和阶段组织上更一致。

详情
AI中文摘要

学习与记忆需要在可塑性与稳定性之间取得平衡:突触连接必须编码新信息,同时不崩溃、饱和或擦除先前有用的结构。当固定背景连接已经承载了部分任务时,联想记忆模型可能看似成功学习,这使得难以区分真正的回忆动态与结构辅助。我们使用一个顺序敏感的自适应可塑性基准来测试这一问题,该基准用于阶段性联想回忆。基准将类量子联想记忆模型与匹配的实值无相位和马尔可夫率控制模型在相同任务计划、扰动轮廓、弱支持条件和可塑性设置下进行比较。这里,“类量子”指的是建模形式,而非关于量子计算的生物学主张。我们首先筛选弱结构支持,然后固定一个保守的操作点,用于跨模型家族和可塑性机制的因子比较。有用的弱支持区间狭窄且非单调。在无可塑性消融中,弱结构单独无法挽救回忆,而大多数有用的回忆增益来自自适应可塑性,尤其是稳态稳定化。马尔可夫率控制通常实现更强的原始回忆,但类量子模型更一致地保持顺序敏感性和阶段依赖的组织。这些结果不支持普遍的类量子优势。相反,它们表明模型类别通过结合回忆、时间组织和上下文敏感性的多目标轮廓比通过任何单一回忆得分更能被区分。因此,该基准为在弱支持、受调控可塑性和匹配经典比较下研究上下文敏感记忆动态提供了一个受控框架。

英文摘要

Learning and memory require a balance between plasticity and stability: synaptic connections must encode new information without collapsing, saturating, or erasing previously useful structure. Associative-memory models can appear to learn successfully when fixed background connectivity already carries part of the task, making it difficult to distinguish genuine recall dynamics from structural assistance. We test this issue using an order-sensitive adaptive-plasticity benchmark for staged associative recall. The benchmark compares a quantum-like associative-memory model with matched real-valued no-phase and Markov-rate controls under the same task schedule, perturbation profiles, weak-support conditions, and plasticity settings. Here, "quantum-like" refers to the modeling formalism, not to a biological claim about quantum computation. We first screen weak structural support and then fix a conservative operating point for factorial comparisons across model families and plasticity mechanisms. The useful weak-support regime is narrow and non-monotonic. Weak structure alone does not rescue recall in the no-plasticity ablation, whereas most useful recall gains arise from adaptive plasticity, especially homeostatic stabilization. The Markov-rate control often achieves stronger raw recall, but the quantum-like model more consistently preserves order sensitivity and stage-dependent organization. These results do not support a universal quantum-like advantage. Instead, they show that model classes are better distinguished by a multi-objective profile combining recall, temporal organization, and context sensitivity than by any single recall score. The benchmark therefore provides a controlled framework for studying context-sensitive memory dynamics under weak support, regulated plasticity, and matched classical comparison.

2606.12585 2026-06-12 econ.GN cs.HC q-fin.EC 新提交

Revisiting the ABCs of Working with AI: A Replication with Radiologists

重新审视与AI合作的ABC:一项针对放射科医生的复制研究

Daniel Martin

AI总结 本研究在放射科医生分析胸部X光片的场景中,复制了Caplin等人关于能力和信念校准影响AI辅助收益的发现,验证了其外部有效性。

详情
AI中文摘要

人工智能(AI)系统越来越多地协助人类专家,但AI辅助对生产力的影响可能具有异质性。Caplin、Deming、S. Li、Martin、Marx、Weidmann和Ye(2025b)提供的证据表明,两个特征——能力和信念校准——有助于确定AI辅助的回报。本文表明,他们的结果在专业放射科医生利用最先进的机器学习预测分析胸部X光片的场景中得到了复制。我利用了Moehring、Kutwal、Huang、Banerjee、Jacobi、Eber、Mendoza、Chung、Dayan、Gupta、Bui、Truong、Pareek、Langlotz、Lungren、Agarwal、Rajpurkar和Salz(2025)描述的公共Collab-CXR数据存储库,该数据首先由Agarwal、Moehring、Rajpurkar和Salz(2023)用于人机协作分析。为了忠实再现Caplin、Deming、S. Li、Martin、Marx、Weidmann和Ye(2025b)的分析,我使用了重复病例设计中的放射科医生评估,包括68名放射科医生和11,420个配对的放射科医生-患者-病理观察结果。本复制结果支持其核心发现的外部有效性:较低的基础能力和较高的校准预测了AI带来的更大增量价值。

英文摘要

Artificial intelligence (AI) systems increasingly assist human experts, but the consequences of AI assistance on productivity can be heterogeneous. Caplin, Deming, S. Li, Martin, Marx, Weidmann, and Ye (2025b) provide evidence that two characteristics, ability and belief calibration, help to determine the returns to AI assistance. This note shows that their results replicate to a setting where professional radiologists analyze chest X-rays with access to state-of-the-art machine learning predictions. I leverage the public Collab-CXR data repository described by Moehring, Kutwal, Huang, Banerjee, Jacobi, Eber, Mendoza, Chung, Dayan, Gupta, Bui, Truong, Pareek, Langlotz, Lungren, Agarwal, Rajpurkar, and Salz (2025) and first analyzed for human-AI collaboration by Agarwal, Moehring, Rajpurkar, and Salz (2023). To faithfully reproduce the analysis in Caplin, Deming, S. Li, Martin, Marx, Weidmann, and Ye (2025b), I use the radiologist assessments from the repeated-case designs, which include 68 radiologists and 11,420 paired radiologist-patient-pathology observations. The results of this replication support the external validity of their core findings: lower baseline ability and higher calibration predict larger incremental value from AI.

2606.13659 2026-06-12 cs.PL cs.AR 新提交

Specifying Hardware Communication as Programs

将硬件通信规范为程序

Ernest Ng, Nikil Shyamsunder, Francis Pham, Adrian Sampson, Kevin Laeufer

AI总结 提出一种DSL,将硬件通信协议规范为简洁的命令式程序,同一规范可用于驱动设计和监控事务,并自动从波形推断事务级跟踪。

详情
AI中文摘要

为了测试和调试硬件模块,通常需要编写两个程序:驱动程序,它将高级事务转换为模块输入和输出信号上的交互;以及监视器,它分析信号级执行轨迹并识别事务。这两个程序通常针对每个硬件协议单独实现,但这种分离需要手动工作并存在不一致的风险。我们提倡一种替代方法。我们提出一种DSL,用户在其中将硬件通信协议规范为简洁的命令式程序。关键在于,同一规范既可用于驱动设计,也可用于监控事务。我们介绍了一个工具的设计,该工具给定我们的DSL中的规范和波形,自动推断与波形一致的事务级跟踪。我们讨论了在真实世界互连(如Wishbone和AXI-Stream)上评估我们的DSL的计划。

英文摘要

To test and debug hardware modules, it is common to write two programs: a driver, which translates high-level transactions into interactions on the module's input and output signals, and a monitor, which analyzes a signal-level execution trace and recognizes a transaction. These two programs are commonly implemented separately for each hardware protocol, but this separation entails manual effort and risks inconsistencies. We advocate an alternative approach. We present a DSL in which users specify hardware communication protocols as succinct imperative programs. Crucially, the same specification can be used to both drive designs and monitor transactions. We present the design of a tool, which given a specification in our DSL and a waveform, automatically infers a transaction-level trace consistent with the waveform. We discuss plans to evaluate our DSL on real-world interconnects such as Wishbone and AXI-Stream.

2606.13639 2026-06-12 cs.MA 新提交

Tuning Agent-Based Predator-Prey Models Toward Lotka-Volterra Dynamics

将基于智能体的捕食者-猎物模型调谐至洛特卡-沃尔泰拉动力学

Corinna Mandl, Siddharth Chaturvedi, Marcel van Gerven

AI总结 研究通过调整环境与种群参数,使基于智能体的捕食者-猎物模型产生类似洛特卡-沃尔泰拉方程的周期性振荡,并利用基于特征的损失函数优化参数。

Comments 12 pages, 3 figures

详情
AI中文摘要

近年来计算能力的增长使得使用大规模基于智能体的模型模拟复杂自适应系统越来越可行。一个核心困难在于此类模型包含许多局部规则和参数,微小的变化可能导致失控行为、种群崩溃或达到人为边界饱和。我们在一个连续的捕食者-猎物系统中研究这一问题,其中羊和狼是具有局部感知、内部能量和基于递归神经网络控制器的主动智能体。我们询问是否可以调整环境和种群参数,使得产生的种群动态类似于经典的洛特卡-沃尔泰拉周期。我们使用基于特征的损失函数优化这些参数,该损失函数奖励持续振荡、相位滞后、有界种群和长期持续性,首先针对随机控制器,然后在更自然的环境中针对进化控制器。该模型在ABMax中实现,ABMax是一个基于JAX的智能体建模框架,能够在硬件加速器上进行高效的批量模拟。

英文摘要

Recent growth in compute power has made it increasingly feasible to use large-scale agent-based models to simulate complex adaptive systems. A central difficulty is that such models contain many local rules and parameters, where small changes can lead to runaway behaviour, population collapse, or saturation at artificial bounds. We study this problem in a continuous predator-prey system where sheep and wolves are active agents with local sensing, internal energy, and recurrent neural network-based controllers. We ask whether environmental and demographic parameters can be tuned so that the resulting population dynamics resemble classical Lotka-Volterra cycles. We optimise these parameters with a feature-based loss that rewards sustained oscillations, phase lag, bounded populations, and long-term persistence, first for random controllers and then for evolved controllers in a more naturalistic setting. The model is implemented in ABMax, a JAX-based agent-based modelling framework that enables efficient batched simulation on hardware accelerators.

2606.13631 2026-06-12 cs.PF cs.NI 新提交

Beyond Virtual Delay: Improving Packet Delay Bound in Network Calculus

超越虚拟延迟:改善网络演算中的数据包延迟界

Yuming Jiang

AI总结 本文指出经典虚拟延迟界过于保守,提出一种仅基于到达曲线和服务曲线的新数据包延迟界,在漏桶到达曲线和速率-延迟服务曲线下严格优于经典界,并在时间敏感网络中验证。

详情
AI中文摘要

在网络演算中,一个基本结果是经典的延迟界,由到达曲线和服务曲线之间的水平偏差给出。虽然被广泛使用,但经典界源于虚拟延迟的概念。在这项工作中,我们首先证明最大数据包延迟总是被最大虚拟延迟上界所限制,揭示了将基于虚拟延迟的界应用于数据包延迟时固有的保守性。受此启发,我们重新审视数据包延迟分析,并推导出一个新的数据包延迟界,该界除了到达曲线和服务曲线外不需要任何假设。将新界专门应用于具有漏桶到达曲线和速率-延迟服务曲线的系统,显示出相对于经典界的严格改进,并通过时间敏感网络中的案例研究进一步证明。

英文摘要

In network calculus, a fundamental result is the classical delay bound given by the horizontal deviation between the arrival and service curves. While widely used, the classical bound is derived from the notion of virtual delay. In this work, we first show that the maximum packet delay is always upper-bounded by the maximum virtual delay, revealing inherent conservatism when applying the virtual-delay-based bound to packet delay. Motivated by this insight, we revisit packet delay analysis and derive a new packet delay bound that requires no assumptions beyond the arrival and service curves. Specializing the new bound to a system with leaky-bucket arrival curve and rate-latency service curve shows strict improvement over the classical bound, which is further demonstrated through a case study in time-sensitive networking (TSN).

2606.13628 2026-06-12 cs.CC 新提交

A near-quadratic lower bound on the border determinantal complexity of $\sum_i x_i^n$ via conormal specialization

关于 $\sum_i x_i^n$ 的边界行列式复杂性的近二次下界:通过共法线特殊化

Karthik Sheshadri

AI总结 通过共法线多度量和退化论证,证明多项式 $\sum_{i=1}^n x_i^n$ 的边界行列式复杂性在普通和对称模型下分别达到 $(n-1)^2/(4e)$ 和 $(n-1)^2/(2e)$ 的下界,匹配已知的 $O(n^2)$ 上界。

详情
AI中文摘要

多项式 $f$ 的边界行列式复杂性 $\dcb(f)$ 是最小的 $m$,使得 $f$ 是 $m\times m$ 仿射线性形式矩阵的行列式的极限。我们证明,对于每个 $n\ge3$,在 $\CC$ 上,在普通和对称模型中分别有 \\[ \dcb\Big(\sum_{i=1}^n x_i^n\Big)\\ \ge\\ \frac{(n-1)^2}{4e}, \qquad \sdcb\Big(\sum_{i=1}^n x_i^n\Big)\\ \ge\\ \frac{(n-1)^2}{2e} \\] 两者均匹配已知的 $O(n^2)$ 上界(仅常数不同)。据我们所知,这是第一个显式族在变量数上超线性的边界行列式下界:已知的永久式的二次边界读作对偶簇的维数,且在其变量数上是线性的,而我们传递了对偶次数。证明有两个要素。第一个是对于任意仿射线性行列式(允许奇异、可约和非约化纤维)的 slot-$(n-2)$ 共法线多度量的无条件界,通过提升核关联的多齐次 Bézout 计数得到。第二个是特殊化论证:沿着任何退化 $\det A_c\to\sum_ix_i^n$,这些 Gauss-图循环的平坦极限包含 Fermat 锥的共法线簇且系数为正。一个锥移位恒等式将该共法线多度量转化为光滑 Fermat 超曲面的经典对偶次数 $n(n-1)^{n-2}$,取 $(n-1)$ 次根得到二次界。作者配套手稿中的精确下界作为推论得出。

英文摘要

The border determinantal complexity $\dcb(f)$ of a polynomial $f$ is the least $m$ such that $f$ is a limit of determinants of $m\times m$ matrices of affine-linear forms. We prove that for every $n\ge3$, over $\CC$, \[ \dcb\Big(\sum_{i=1}^n x_i^n\Big)\ \ge\ \frac{(n-1)^2}{4e}, \qquad \sdcb\Big(\sum_{i=1}^n x_i^n\Big)\ \ge\ \frac{(n-1)^2}{2e} \] in the ordinary and symmetric models respectively; both match the known $O(n^2)$ upper bounds up to the constant. To our knowledge these are the first border determinantal lower bounds for an explicit family that are superlinear in the number of variables: the known quadratic border bound for the permanent reads the \emph{dimension} of the dual variety and is linear in its number of variables, whereas we transfer the dual \emph{degree}. The proof has two ingredients. The first is an unconditional bound on the slot-$(n-2)$ conormal multidegree of the multiplicity-one Gauss-graph cycle of an arbitrary affine-linear determinant -- singular, reducible, and non-reduced fibers allowed -- by a multihomogeneous Bézout count of a lifted kernel incidence. The second is a specialization argument: along any degeneration $\det A_c\to\sum_ix_i^n$, the flat limit of these Gauss-graph cycles contains the conormal variety of the Fermat cone with positive coefficient. A cone-shift identity converts that conormal multidegree into the classical dual degree $n(n-1)^{n-2}$ of the smooth Fermat hypersurface, and an $(n-1)$-st root yields the quadratic bound. The exact lower bounds of the author's companion manuscripts follow as corollaries.

2606.13623 2026-06-12 cs.DC 新提交

Finding Conservation Laws of Large Dynamical Systems with Tasks and Futures: A Case Study in Utilizing Dynamic Data Dependencies

利用任务和未来发现大型动力系统的守恒律:动态数据依赖的案例研究

Rüdiger Nather

AI总结 针对未来模型值不可变限制内存复用的问题,提出await_delete扩展以安全重用值,并基于此实现块状稠密对称矩阵求逆算法,在大规模矩阵上实现近线性扩展。

Comments To be published in Lecture Notes in Computer Science, Volume 16592

详情
AI中文摘要

随着并行工作负载复杂性的增加,管理细粒度数据依赖成为一个关键挑战。未来模型为处理这些依赖提供了一种有前景的方案,特别是在不规则算法中,但同时也带来了值不可变的限制。这种不可变性限制了执行原地内存更新的能力,而内存回收对于高性能线性代数至关重要。在本文中,我们通过引入一个新的构造await_delete来克服这些限制,该构造扩展了传统的未来语义,允许在消费者完成后安全地重用值。基于这一扩展,我们提出了一种新的基于未来的算法,用于稠密对称矩阵的块状求逆,其动机来自最近一种用于发现动力系统守恒律的算法。我们在Taskflow的扩展版本中实现了我们的方法,并通过强扩展实验进行评估。结果表明,虽然未来模型在较小问题规模上会产生显著开销,但在大矩阵上实现了近线性扩展。我们分析了摊销阈值,并表明未来模型是大规模线性代数中可行的高性能工具。

英文摘要

As parallel workloads grow in complexity, managing fine-grained data dependencies becomes a critical challenge. Futures offer a promising model for handling these dependencies, particularly in irregular algorithms, but they also come with the restriction of value-immutability. This immutability limits the ability to perform in-place memory updates, a necessity for high-performance linear algebra where memory recycling is paramount. In this paper, we address these limitations by introducing a new construct, await_delete, which extends traditional future semantics to allow safe value reuse once consumers are finished. Building on this extension, we present a novel future-based algorithm for the block-wise inversion of dense, symmetric matrices, motivated by a recent algorithm for finding conservation laws of dynamical systems. We implement our approach in an extended version of Taskflow and evaluate it through strong-scaling experiments. Our results demonstrate that while futures incur significant overhead on smaller problem sizes, they achieve nearly linear scaling on large matrices. We analyze the amortization threshold and show that futures are a viable high-performance tool for large-scale linear algebra.

2606.13612 2026-06-12 cs.CR 新提交

Beyond the IT Checklist: Engineering a Reasonable Standard of Care for Cyber Safety

超越IT检查清单:为网络安全设计合理的注意标准

Matthew E. Jablonski, Linton Wells, Kathryn B. Laskey, F. Brett Berlin

AI总结 本文通过分析292份关键基础设施政策文件,指出当前以IT合规为中心的网络安全方法不足以应对网络物理系统的安全风险,并提出基于危险特定可追溯性、结构化保证案例和网络弹性工程的现代化注意标准。

Comments 6 pages, 2 figures, Accepted for publication and presentation the Cyber Safety Summit, Washington, D.C., 2026

详情
AI中文摘要

当前美国网络安全政策以安全为中心,通常将控制措施文档和事件报告视为建筑环境安全的替代指标。本文认为,这种方法对于网络物理系统是不充分的,因为数字故障可能产生物理伤害。我们构建并编码了一个关键基础设施政策文档语料库(N=292,2000-2025年),以研究“合理注意”如何在NIST SP 800-160 Vol.2弹性生命周期中实施。生成的映射显示,义务集中在“预期”阶段并强调行政合规,而“承受”和“恢复”阶段则严重依赖对IT控制目录的委托引用,这些目录与基于物理的危害对齐不良。我们识别出三个主要脱节:校准不当的委托标准、将恢复定义为通知而非工程化导航,以及跨部门的不均匀适应要求。然后,我们提出了一种现代化的注意标准,基于危险特定可追溯性、结构化保证案例和网络弹性工程。最后,我们建议联邦政策将这些工程义务与有针对性的激励措施配对,使关键基础设施的弹性架构成为可行的商业决策,而非无资金支持的期望。

英文摘要

Current U.S. cyber policy, centered on security, often treats documentation of controls and incident reports as a proxy for safety in the built environment. This paper argues that such an approach is inadequate for cyber-physical systems, where digital failures can produce kinetic harm. We construct and code a corpus of critical infrastructure policy documents (N=292, 2000-2025) to examine how "reasonable care" is operationalized across the NIST SP 800-160 Vol.~2 resilience lifecycle. The resulting maps show that obligations are concentrated in the Anticipate phase and emphasize administrative compliance, while Withstand and Recover phases rely heavily on delegated references to IT-focused control catalogs that are poorly aligned with physics-based hazards. We identify three major disconnects: miscalibrated delegated standards, recovery defined as notification rather than engineered navigation, and uneven adaptation requirements across sectors. We then propose a modernized standard of care anchored in hazard-specific traceability, structured assurance cases, and cyber resiliency engineering. Finally, we recommend that federal policy pair these engineering obligations with targeted incentives so that resilient architectures for critical infrastructure become a viable business decision rather than an unfunded expectation.

2606.13594 2026-06-12 cs.MA 新提交

See What I See, Know What I Think: Dense Latent Communication Across Heterogeneous Agents

见我所见,知我所想:异构智能体间的密集潜通信

Siyi Chen, Xiaoyan Zhang, Meng Wu, Jonathan Tremblay, Valts Blukis, Stan Birchfield, Rene Vidal, Alvaro Velasquez, Sijia Liu, Qing Qu

AI总结 针对异构智能体间文本通信高损耗问题,提出基于KV缓存变换的密集对齐方法,通过两阶段训练实现跨模型潜空间对齐,在上下文感知和未知场景下均优于基线,计算成本降低2-3倍。

详情
AI中文摘要

多智能体系统主要通过文本进行通信,这会产生有损且昂贵的解码和重新编码成本。KV缓存通信是一种有前景的替代方案,但大多数先前的工作是同构的,使用相同模型的重复副本,并回避了跨模型潜在对齐的核心挑战;现有的异构方法也具有限制性,通常假设共享输入,并且主要使用传输的缓存进行引导。我们研究了一个更基本的问题:异构智能体能否被充分对齐,以执行真正的“读心术”,并传递一个智能体看到的内容及其思考方式?我们的信息结构分析揭示了一种对偶性:上下文感知的传输由稀疏的推理信号驱动,而上下文无关的传输(接收者看不到任何输入)则需要密集的上下文知识保留。受此启发,我们提出通过轻量级跨模型缓存变换和两阶段训练(重建后生成)来实现异构KV缓存通信的密集对齐。在{Qwen3-4B, 8B, 14B}的所有六个方向以及六个域内和域外基准测试中,我们的方法优于先前的异构基线,在上下文感知设置中与文本通信相当或更优,计算成本降低约2到3倍,并且在先前方法失效的上下文无关传输中仍然有效。

英文摘要

Multi-agent systems communicate mostly through text, paying a lossy and expensive decode and re-encode cost. KV-cache communication is a promising alternative, yet most prior work is homogeneous, using duplicate copies of the same model, and avoids the central challenge of cross-model latent alignment; existing heterogeneous methods are also restrictive, typically assuming shared input and using transferred caches mainly for steering. We study a more fundamental question: can heterogeneous agents be aligned well enough to perform real "mind reading" and transfer both what one agent sees and how it thinks? Our information-structure analysis reveals a duality: context-aware transfer is driven by sparse reasoning signals, while context-unaware transfer, where the receiver sees no input, requires dense contextual knowledge preservation. Motivated by this, we propose dense alignment for heterogeneous KV-cache communication via a lightweight cross-model cache transformation and two-phase training: reconstruction followed by generation. Across all six directions of {Qwen3-4B, 8B, 14B} and six in-domain and out-of-domain benchmarks, our method outperforms prior heterogeneous baselines, matches or exceeds text communication in context-aware settings at roughly 2 to 3 times lower compute, and remains effective in context-unaware transfer where prior methods collapse.

2606.13583 2026-06-12 cs.DS 新提交

Testing Bipartiteness in Logarithmic Rounds

在对数轮次中测试二分性

Yumou Fei, Ronitt Rubinfeld

AI总结 本文改进Goldreich和Ron的二分性测试算法,将随机游走长度从O(log^6 n)降至O(log n),步数从O(√(n log n))降至O(√n),并得到最优轮复杂度的流式算法。

详情
AI中文摘要

Goldreich和Ron的开创性工作(《Combinatorica, 1999》)表明,有界度图的二分性可以通过$O(\sqrt{n\log n})$次长度为$O(\log^{6} n)$的随机游走来测试。在本文中,我们改进了他们的结果,证明$O(\sqrt{n})$次长度为$O(\log n)$的随机游走就足够了。作为推论,我们得到了一个$O(\log n)$轮、$O(\sqrt{n}\log n)$空间的流式算法用于测试二分性,其轮复杂度在Fei、Minzer和Wang(《arXiv, 2026》)最近的下界意义下是最优的。我们的证明采用了与Goldreich和Ron不同的方法,使用了Goemans和Williamson(《J. ACM, 1995》)引入的Max-Cut半定规划松弛。

英文摘要

The seminal work of Goldreich and Ron (\textit{Combinatorica, 1999}) showed that bipartiteness of bounded-degree graphs can be tested using $O(\sqrt{n\log n})$ random walks of length $O(\log^{6} n)$. In this work, we improve their result by showing that $O(\sqrt{n})$ random walks of length $O(\log n)$ suffice. As a corollary, we obtain an $O(\log n)$-pass, $O(\sqrt{n}\log n)$-space streaming algorithm for testing bipartiteness, whose pass complexity is optimal in light of a recent lower bound of Fei, Minzer, and Wang (\textit{arXiv, 2026}). Our proof takes a different approach from that of Goldreich and Ron, using the semidefinite programming relaxation for Max-Cut introduced by Goemans and Williamson (\textit{J. ACM, 1995}).

2606.13563 2026-06-12 cs.CR cs.DS 新提交

Differentially Private Hierarchical Heavy Hitters

差分隐私层次重击者

Ari Biswas, Graham Cormode, Yaron Kanza, Divesh Srivastava, Zhengyi Zhou

AI总结 研究差分隐私下层次重击者(HHH)的发布问题,在非流式设置中前缀残差计数的相对误差与层次高度和重击者数量无关;在流式设置中频率估计的绝对误差与可用空间无关。

Comments This is the updated version of the PODS 2025 conference version. Note that the conference version has a bug in the privacy proof fro the non-streaming version. We have addressed the bug in this full version

详情
AI中文摘要

寻找层次重击者(HHH)的任务由Cormode等人[VLDB 2003]引入,作为重击者问题的推广。虽然数据流中HHH的发现已被广泛研究,但底层数据私有时发布HHH的问题仍未探索。在本文中,我们研究了流式和非流式设置下的差分隐私HHH发布。在非流式设置中,我们展示了令人惊讶的结果:任何前缀的残差计数估计的相对误差与层次高度和流中重击者数量无关。同时,在流式设置中,尽管HHH的精确版本具有低全局敏感性(因为计数查询是1-敏感的),但由于流式处理导致的近似函数具有高全局敏感性,与可用空间成线性关系。尽管存在这一障碍,我们表明流式设置中频率估计的绝对误差与可用空间无关。

英文摘要

The task of finding _Hierarchical_ Heavy Hitters (HHH) was introduced by Cormode et al. [VLDB 2003] as a generalisation of the heavy hitter problem. While finding HHH in data streams has been studied extensively, the question of releasing HHH when the underlying data is private remains unexplored. In this paper, we study differentially private HHH release in both the streaming and non-streaming setting. In the non-streaming setting, we show the surprising result that the relative error in estimating the residual count for any prefix is independent of the height of the hierarchy and the number of heavy hitters in the stream. Meanwhile, in the streaming setting, although the exact version of HHH has low global sensitivity (as counting queries are 1-sensitive), the approximation functions due to streaming have high global sensitivity, linear in the available space. Despite this obstacle, we show that the absolute error for estimating frequencies in the steaming setting is independent of the available space.