arXivDaily arXiv每日学术速递 周一至周五更新
重置
2606.13519 2026-06-12 econ.EM 新提交

Semiparametric Local Projections

半参数局部投影

Silvia Goncalves, Ana Maria Herrera, Lutz Kilian, Elena Peavento, Iones Kelanemer Holban

AI总结 提出一种半参数局部投影估计量,用于非线性脉冲响应函数,基于双稳健矩条件结合交叉拟合,实现√T一致性和渐近正态性。

详情
AI中文摘要

我们提出了一种半参数局部投影估计量,用于估计一类广泛的结构动态模型的非线性脉冲响应函数,这些模型与应用宏观经济学相关,包括具有非线性变换回归变量、状态依赖系数以及冲击与状态变量之间非线性相互作用的模型。该估计量基于一个双稳健矩条件,该条件将平均响应函数识别为非参数条件均值的线性泛函,并辅以一个密度比来捕捉移动感兴趣冲击的效果。我们将此矩条件与处理序列依赖的交叉拟合相结合。得到的估计量是$\sqrt{T}$一致且渐近正态的。我们在一系列非线性数据生成过程中检验了该估计量的有限样本性能,并通过两个实证示例说明了其应用。

英文摘要

We propose a semiparametric local projection estimator of nonlinear impulse response functions for a broad class of structural dynamic models relevant for applied macroeconomics, including models with nonlinearly transformed regressors, state dependent coefficients, and nonlinear interactions between shocks and state variables. The estimator is based on a doubly robust moment condition that identifies the average response function as a linear functional of a nonparametric conditional mean, augmented by a density ratio that captures the effect of shifting the shock of interest. We combine this moment condition with cross-fitting that handles serial dependence. The resulting estimator is $\sqrt{T}$-consistent and asymptotically normal. We examine the finite-sample performance of the estimator across a range of nonlinear data generating processes and illustrate its use in two empirical examples.

2606.12739 2026-06-12 econ.EM 新提交

Estimating Semiparametric and Nonparametric Fixed Effects Panel Data Models with mgcv

使用 mgcv 估计半参数和非参数固定效应面板数据模型

Ivan Korolev

AI总结 本文介绍如何使用 R 包 mgcv 估计半参数和非参数固定效应面板数据模型,重点讨论实现方法、平滑项指定和聚类稳健推断,并通过蒙特卡洛实验验证惩罚样条估计的准确性。

详情
AI中文摘要

本文提供了使用 R 中的 mgcv 包估计半参数和非参数固定效应面板数据模型的实用指南。重点在于实现:使用单位指示变量、一阶差分或惩罚单位效应处理固定效应;指定平滑项;以及进行聚类稳健推断。蒙特卡洛实验比较了 mgcv::bam 估计量与线性固定序列样条估计量。模拟表明,惩罚样条适应未知平滑度,并在本文研究的设计中准确估计函数。惩罚调整的聚类稳健协方差估计量对有限维参数产生接近名义水平的检验,置信带对中心化的未知函数提供准确的覆盖。

英文摘要

This paper provides a practical guide to estimating semiparametric and nonparametric fixed-effects panel data models using the mgcv package in R. The focus is implementation: handling fixed effects with unit indicators, first differencing, or penalized unit effects; specifying smooth terms; and conducting cluster-robust inference. Monte Carlo experiments compare \code{mgcv::bam} estimators with linear and fixed-series spline estimators. Simulations suggest that penalized splines adapt to unknown smoothness and estimate functions accurately in the designs studied here. A penalty-adjusted cluster-robust covariance estimator yields tests with near-nominal size for finite-dimensional parameters, and confidence bands provide accurate coverage for centered unknown functions.

2606.12571 2026-06-12 econ.TH 新提交

Cross-Validation Equilibrium

交叉验证均衡

Ran Spiegler, Stephan Waizmann

AI总结 研究玩家将信念形成委托给预测性机器学习时的策略互动,提出交叉验证均衡概念,分析其性质并应用于陪审团投票、投机性赌博和线性二次支付博弈。

详情
AI中文摘要

我们研究当玩家将信念形成委托给预测性机器学习(ML)时的策略互动。在一个静态贝叶斯博弈中,每个玩家的ML代理预测一个与收益相关的结果变量,作为玩家类型的函数。ML代理的训练样本是内生的:它来自于由玩家ML引导行为产生的结果分布。在交叉验证均衡(CVE)中,每个玩家的ML代理根据其实现的训练样本,选择预测模型以最小化期望的样本外平方误差,并且每个玩家对其ML代理选择的模型所产生的信念做出最优反应。我们分析CVE并将其与其他均衡概念联系起来。我们将CVE应用于陪审团投票、投机性赌博以及具有线性二次支付的博弈。例如,在团队努力博弈中,内生模型选择可能导致多重均衡。

英文摘要

We study strategic interaction when players delegate belief formation to predictive machine learning (ML). In a static Bayesian game, each player's ML agent predicts a payoff-relevant outcome variable as a function of the player's type. The ML agent's training sample is endogenous: it is drawn from the outcome distribution generated by players' ML-guided behavior. In Cross-Validation Equilibrium (CVE), each player's ML agent selects a predictive model to minimize expected out-of-sample squared error, given its realized training sample, and each player best-replies to the belief generated by the model her ML agent selected. We analyze CVE and relate it to other equilibrium concepts. We apply CVE to jury voting, speculative betting, and games with linear-quadratic payoffs. E.g., in a team-effort game, endogenous model selection can give rise to multiple equilibria.

2606.12492 2026-06-12 econ.TH 新提交

Continuity of equilibria in spaces of Bochner and Gel'fand economies

Bochner与Gel'fand经济空间中均衡的连续性

Matías Fuentes

AI总结 本文在商品空间为Banach格的无穷维框架下,证明均衡对应在允许均衡的经济域稠密子集上关于Polish拓扑是连续的,统一处理了多种经济模型,无需可微性假设。

详情
AI中文摘要

我们研究了无穷维环境(商品空间为Banach格)中均衡对应的连续性。经济被建模为特征空间上的Borel概率测度,总禀赋通过Bochner或Gel'fand积分定义。在此框架下,我们证明了均衡对应在允许均衡的经济域稠密子集上(赋予适当的Polish拓扑)是连续的。这些结果通过提供适用于更广泛局部凸空间类的统一分析处理,扩展了经典和近期的连续性定理,并涵盖了无限规划期限、垄断竞争、新古典经济、金融均衡和非对称信息等模型。重要的是,本研究证明了无需施加正则经济中通常需要的可微性假设来研究均衡连续性。

英文摘要

We examine the continuity of equilibrium correspondences in infinite-dimensional settings where the commodity spaces are Banach lattices. Economies are modeled as Borel probability measures on a space of characteristics, with aggregate endowments defined via Bochner or Gel'fand integrals. Within this framework, we prove that the equilibrium correspondence is continuous on a dense subset of the domain of economies admitting equilibria, endowed with a suitable Polish topology. These results extend both classical and recent continuity theorems by providing a unified analytical treatment applicable to a substantially broader class of locally convex spaces and encompass models with infinite planning horizons, monopolistic competition, neoclassical economies, financial equilibria, and asymmetric information. Importantly, this study demonstrates that there is no necessity to impose differentiability assumptions that are typically required in regular economies to study equilibrium continuity.

2606.13506 2026-06-12 econ.GN q-fin.EC 新提交

Skill vs Education Types of Labour Mismatch and Their Association with Earnings

技能与教育类型的劳动错配及其与收入的关系

Vsevolod Iakovlev

AI总结 利用26国PIAAC数据,通过教育-技能指标和误差成分模型,揭示教育错配与技能错配对收入的不同影响,并控制国家异质性后证实过度教育与过度技能导致工资惩罚,不足教育与不足技能带来工资溢价。

详情
AI中文摘要

本文分析了教育类型和技能类型的劳动错配之间的区别及其与收入的关系。利用来自OECD(2012)成人技能调查(PIAAC)第一周期的26个国家横截面数据,我使用一套全面的基于教育和技能的指标考察了教育和技能错配,探索了工人特征之间的异质性,并通过误差成分模型调查了与国家层面收入相关性冲突的来源。结果表明,国家层面的未观测异质性导致内生性偏差,其方向和大小因错配指标而异。一旦控制了未观测异质性,过度教育和过度技能与工资惩罚相关,而教育不足和技能不足则与工资溢价相关。这些发现强调了教育错配与技能错配之间的概念和实证区别,并证明了指标选择在分析中的重要性。

英文摘要

This paper analyses the distinction between educational and skill types of labour mismatch and their association with earnings. Drawing on cross-sectional data for 26 countries from the 1st Cycle of the OECD (2012) Survey of Adult Skills (PIAAC), I examine educational and skill mismatch using a comprehensive set of education- and skill-based indicators, explore heterogeneity across worker characteristics, and investigate the sources of conflicting country-level correlations with earnings through an error components model. The results show that country-level unobserved heterogeneity induces endogeneity bias, with both its direction and magnitude varying across mismatch measures. Once unobserved heterogeneity is controlled for, over-education and over-skilling are associated with wage penalties, whereas under-education and under-skilling are linked to wage premiums. These findings highlight both conceptual and empirical distinctions between educational and skill mismatch and demonstrate the importance of indicator choice in the analysis.

2606.13314 2026-06-12 econ.GN q-fin.EC 新提交

The Privilege of Exposure: Caste and Generative AI in India's Graduate Labour Market

暴露的特权:种姓与生成式AI在印度毕业生劳动力市场

Kaibalyapati Mishra

AI总结 研究利用印度最新劳动力调查数据,发现种姓影响毕业生对生成式AI的暴露程度,低种姓毕业生暴露度显著低于高种姓,且该差距通过职业分布和工资溢价加剧种姓收入不平等。

详情
AI中文摘要

在发展中国家劳动力市场中,谁暴露于生成式AI?我们将三个职业AI暴露指数映射到印度重新设计的定期劳动力调查(2025年),并记录了83,000名就业毕业生中显著的种姓梯度:在同一地区内,来自在册种姓和在册部落的毕业生比高种姓毕业生的暴露度低0.24-0.37个标准差。两个渠道导致了这一差距:四分之一的在册种姓和三分之一的在册部落毕业生从事不受AI影响的农业或初级职业,而那些从事白领工作的人在管理、软件和金融职业中的代表性不足。由于暴露度带来高达20%的工资溢价,生成式AI可能会扩大而非缩小印度的种姓收入差距。

英文摘要

Who is exposed to generative AI in a developing-country labour market? We map three occupational AI-exposure indices to India's redesigned Periodic Labour Force Survey (2025) and document a steep caste gradient among 83,000 employed graduates: graduates from the Scheduled Castes and the Scheduled Tribes are 0.24--0.37 standard deviations less exposed than upper-caste graduates within the same district. Two channels drive the gap: one in four SC and one in three ST graduates work in farm or elementary occupations untouched by AI, and those in white-collar work are underrepresented in managerial, software, and finance occupations. Because exposure commands a wage premium of up to 20 per cent, generative AI stands to widen, not narrow, India's caste earnings gap.

2606.12893 2026-06-12 econ.GN q-fin.EC 新提交

Technology Shocks, Relative Performance Measures, and Outcomes: Evidence from Classical Chess

技术冲击、相对绩效度量与结果:来自经典国际象棋的证据

Dan Ben-Moshe, David Genesove

AI总结 利用390万局经典国际象棋比赛数据,发现2020年神经网络引擎普及后和棋率上升约4个百分点,而基于相对绩效的等级分变化不大,表明技术冲击被广泛吸收。

详情
AI中文摘要

2020年秋季,神经网络方法使得国际象棋引擎的性能大幅提升,并免费广泛可用。到2021年底,经典国际象棋的月度和棋率上升了约四个百分点,但通常被视为棋力指标的棋手等级分分布变化不大。然而,等级分是一种相对度量,基于与其他有等级分棋手对弈的结果构建,而非绝对棋力尺度,因此广泛共享的进步不必改变等级分。利用2015年3月至2023年11月的390万局有等级分的经典比赛,我们记录到和棋率上升在控制双方等级分后仍然存在,在重复同色对局中成立,并非先前趋势的延续,并持续到样本期末。一个线性变换将疫情后等级分映射到更高的疫情前等效值,且在低等级分处差距更大,解释了拟合的和棋、白胜和黑胜概率的疫情后减疫情前偏移的90%以上。相比之下,棋手的等级分和排名没有显示出额外的排名重新洗牌,也没有相对于疫情前基准的组内离散度普遍扩大。我们将这些发现解释为与各等级分水平的采用一致,且低等级分棋手获得了更大的等级分等效增益。

英文摘要

In the fall of 2020, neural-network methods produced a large improvement in chess engines that became freely and widely available. By the end of 2021, the monthly draw rate in classical chess had risen by about four percentage points, but the distribution of player ratings, which are commonly read as measures of playing strength, had changed little. Ratings, however, are a relative measure, built from results against other rated players rather than from an absolute scale of play quality, so an improvement shared broadly across players need not change their ratings. Using 3.9 million rated classical games from March 2015 to November 2023, we document that the increased draw rate remains after conditioning on both players' ratings, holds within repeated same-color matchups, is not a continuation of a pre-existing trend, and persists through the end of the sample. A linear transformation that maps post-Covid ratings to higher pre-Covid equivalents, with a larger gap at lower ratings, accounts for more than 90 percent of the post-minus-pre shift in the fitted draw, White-win, and Black-win probabilities. Players' ratings and ranks, by contrast, show no additional rank reshuffling and no general widening of within-group dispersion relative to the pre-Covid benchmark. We interpret these findings as consistent with adoption across rating levels, with larger rating-equivalent gains for lower-rated players.

2606.12585 2026-06-12 econ.GN cs.HC q-fin.EC 新提交

Revisiting the ABCs of Working with AI: A Replication with Radiologists

重新审视与AI合作的ABC:一项针对放射科医生的复制研究

Daniel Martin

AI总结 本研究在放射科医生分析胸部X光片的场景中,复制了Caplin等人关于能力和信念校准影响AI辅助收益的发现,验证了其外部有效性。

详情
AI中文摘要

人工智能(AI)系统越来越多地协助人类专家,但AI辅助对生产力的影响可能具有异质性。Caplin、Deming、S. Li、Martin、Marx、Weidmann和Ye(2025b)提供的证据表明,两个特征——能力和信念校准——有助于确定AI辅助的回报。本文表明,他们的结果在专业放射科医生利用最先进的机器学习预测分析胸部X光片的场景中得到了复制。我利用了Moehring、Kutwal、Huang、Banerjee、Jacobi、Eber、Mendoza、Chung、Dayan、Gupta、Bui、Truong、Pareek、Langlotz、Lungren、Agarwal、Rajpurkar和Salz(2025)描述的公共Collab-CXR数据存储库,该数据首先由Agarwal、Moehring、Rajpurkar和Salz(2023)用于人机协作分析。为了忠实再现Caplin、Deming、S. Li、Martin、Marx、Weidmann和Ye(2025b)的分析,我使用了重复病例设计中的放射科医生评估,包括68名放射科医生和11,420个配对的放射科医生-患者-病理观察结果。本复制结果支持其核心发现的外部有效性:较低的基础能力和较高的校准预测了AI带来的更大增量价值。

英文摘要

Artificial intelligence (AI) systems increasingly assist human experts, but the consequences of AI assistance on productivity can be heterogeneous. Caplin, Deming, S. Li, Martin, Marx, Weidmann, and Ye (2025b) provide evidence that two characteristics, ability and belief calibration, help to determine the returns to AI assistance. This note shows that their results replicate to a setting where professional radiologists analyze chest X-rays with access to state-of-the-art machine learning predictions. I leverage the public Collab-CXR data repository described by Moehring, Kutwal, Huang, Banerjee, Jacobi, Eber, Mendoza, Chung, Dayan, Gupta, Bui, Truong, Pareek, Langlotz, Lungren, Agarwal, Rajpurkar, and Salz (2025) and first analyzed for human-AI collaboration by Agarwal, Moehring, Rajpurkar, and Salz (2023). To faithfully reproduce the analysis in Caplin, Deming, S. Li, Martin, Marx, Weidmann, and Ye (2025b), I use the radiologist assessments from the repeated-case designs, which include 68 radiologists and 11,420 paired radiologist-patient-pathology observations. The results of this replication support the external validity of their core findings: lower baseline ability and higher calibration predict larger incremental value from AI.

2606.12848 2026-06-12 cs.AI econ.GN q-fin.EC 新提交

(Human) Attention Is (Still) All You Need: Human oversight makes AI-assisted social science reliable

(人类的)注意力(仍然)就是一切:人类监督使AI辅助的社会科学变得可靠

Chen Zhu, Xiaolu Wang, Weilong Zhang

发表机构 * China Agricultural University(中国农业大学) University of Cambridge(剑桥大学)

AI总结 提出人机协同决策架构HLER,通过预承诺、决策排序、问责和注意力分配,将AI辅助研究的失败率从72%降至16%。

详情
AI中文摘要

大型语言模型(LLMs)越来越多地被用于曾经只有训练有素的研究人员才能完成的任务,包括假设生成、规范选择和结论起草。我们认为,AI辅助研究的可靠性不仅取决于模型能力,还取决于认知劳动在人与机器之间的分配方式。我们通过人机协同经济研究(HLER)来研究这个问题,这是一种基于预承诺、决策排序、问责和注意力分配的决策架构。在一个预先指定的2*4因子实验中,涉及四个数据集的280个完整研究运行,无约束的多智能体基线在72%的运行中产生了关键失败。使用相同的底层模型、相同的智能体分解以及共享推理智能体的相同提示,HLER通过施加三个架构承诺将失败率降低到16%:LLMs进行推理但不执行数据工作,数据和估计以确定性方式处理,以及三个人类决策门约束工作流程。Fisher精确检验在p<0.001水平上拒绝失败率相等的假设。可靠性增益在公开代表性最低的数据集(一份清代人口登记册)上最大,这与基于任务的产出质量服从弗雷歇分布的生产模型一致。一项80次运行的消融研究表明,确定性计算和人类决策门独立贡献,并存在互补性的探索性证据。我们将HLER解释为一种研究框架而非自主的AI科学家:它大幅减少失败,使残留的弱点更加可见,并防止不可靠的主张作为可发表的成果被提出。

英文摘要

Large language models (LLMs) are increasingly used for tasks once reserved for trained researchers, including hypothesis generation, specification choice, and drafting conclusions. We argue that the reliability of AI-assisted research depends not only on model capability, but also on how cognitive labour is structured between humans and machines. We study this problem through Human-in-the-Loop Economic Research (HLER), a decision architecture based on pre-commitment, decision sequencing, accountability, and attention allocation. In a pre-specified 2*4 factorial experiment with 280 complete research runs across four datasets, an unconstrained multi-agent baseline produced critical failures in 72% of runs. Using the same underlying model, the same agent decomposition, and identical prompts for the shared reasoning agents, HLER reduced the failure rate to 16% by imposing three architectural commitments: LLMs reason but do not execute data work, data and estimation are handled deterministically, and three human decision gates bind the workflow. Fisher's exact test rejects equality of failure rates at p<0.001. Reliability gains were largest on the least publicly represented dataset, a Qing-dynasty population register, consistent with a task-based production model with Frechet-distributed output quality. An 80-run ablation suggests that deterministic computation and human gates contribute independently, with exploratory evidence of complementarity. We interpret HLER as a research harness rather than an autonomous AI scientist: it sharply reduces failures, makes residual weaknesses more visible, and prevents unreliable claims from being advanced as publication-ready outputs.

2606.12788 2026-06-12 cs.SI cs.CY cs.DC cs.SY econ.GN eess.SY q-fin.EC 新提交

To Share or Not to Share: Orchestrating Trustworthy Data in Global Value Chains

共享还是不共享:协调全球价值链中的可信数据

Han-Teng Liao, Chang-Yi Kao

AI总结 针对欧盟CBAM带来的监管透明与数据主权矛盾,提出基于IDSA框架的RegTech参考架构,通过主权数据交换实现数字产品护照,驱动全球商业服务能力需求,并集成Agentic AI与绿色金融,为全球产业集群提供可扩展蓝图。

详情
AI中文摘要

随着欧盟碳边境调节机制(CBAM)的临近,全球半导体价值链在监管透明度和数据主权之间面临日益增长的结构性紧张。本文提出了一种使用国际数据空间(IDSA)框架的RegTech参考架构,以在半导体-石化关联领域协调可信的环境遥测。该架构区分了强制性CBAM要求和自愿性科学碳目标倡议(SBTi)框架,同时解决了安全与可持续设计(SSbD)框架的附加复杂性。超越标准线性技术栈,我们引入了一种前瞻性路线图方法,将上游物理脆弱性转化为循环的负反馈循环。聚焦台北和槟城技术走廊,本文详细说明了主权数据交换如何使数字产品护照(DPP)能够驱动全球商业服务(GBS)能力需求。最后,我们讨论了集成Agentic AI以实现自主合规以及金融科技绿色融资,为全球产业集群实现主权、可持续和透明的价值链提供了可扩展蓝图。

英文摘要

As the EU Carbon Border Adjustment Mechanism (CBAM) approaches, the global semiconductor value chain faces growing structural tensions between regulatory transparency and data sovereignty. This article proposes a RegTech reference architecture using the International Data Spaces (IDSA) framework to orchestrate trustworthy environmental telemetry across the semiconductor-petrochemical nexus. The framework distinguishes the mandatory CBAM requirements from voluntary Science Based Targets initiative (SBTi) frameworks, while addressing the additive complexities of the Safe-and-Sustainable-by-Design (SSbD) framework. Moving beyond standard linear technology stacks, we introduce a prospective roadmapping methodology that transforms upstream physical vulnerabilities into circular, negative feedback loops. Focusing on the Taipei and Penang technology corridor, the article details how sovereign data exchange enables Digital Product Passports (DPPs) to drive Global Business Services (GBSs) capability demands. Finally, we discuss the integration of Agentic AI for autonomous compliance and FinTech green financing, providing a scalable blueprint for global industrial clusters to achieve sovereign, sustainable, and transparent value chains.

2606.12787 2026-06-12 cs.SI cs.CY cs.SY econ.GN eess.SY q-fin.EC q-fin.RM 新提交

Orchestrating the Twin Transition in Multinational Corporations: Technology Roadmapping for Green and Digital Global Business Services

跨国企业中的双重转型编排:面向绿色与数字全球商业服务的技术路线图

Han-Teng Liao, Karen Ang

AI总结 本文综合技术路线图与ITU创新生态系统工具,提出社会技术框架,分析跨国企业全球商业服务如何通过“可持续智能”演进,协调绿色与数字双重转型,并识别关键枢纽国家的作用。

Comments 9 pages, 6 figures

详情
AI中文摘要

全球商业服务(GBS)已成为绿色与数字双重转型的“活实验室”,因为跨国企业(MNCs)面临协调数字效率与环境管理的日益增长的压力。为推导出一个社会技术框架,本文将技术路线图(TRM)与国际电信联盟(ITU)以ICT为中心的创新生态系统工具包相结合。对研究集群的文献计量分析揭示了从基本流程自动化向“可持续智能”的演进转变,将GBS单元识别为中央“操作气闸”,在景观压力(如欧盟双重指令和碳边境调节机制)与AI原生工作流中的利基创新之间进行调解。研究进一步将这些集群映射到利益相关者参与画布上,突出显示波兰、葡萄牙和马来西亚的韧性“中等强国”枢纽如何绕过中等收入陷阱,在地缘政治分裂的云环境中为全球价值链提供“第三条道路”。结果为领导者及创业支持网络提供了数据驱动的设计方法,以编排人才和供应链流动,从而丰富对工业5.0的概念理解以及GBS作为在动荡、多极数字经济中导航的主要机制的作用。

英文摘要

Global Business Services (GBS) have emerged as a "living laboratory" for the Twin Transition of Green and Digital Transformation, as multinational corporations (MNCs) face increasing pressure to harmonize digital efficiency with environmental stewardship. Aiming to derive a socio-technical framework, this paper synthesizes Technology Roadmapping (TRM) with the International Telecommunication Union (ITU) ICT-centric innovation ecosystem toolkit. A bibliometric analysis of research clusters reveals an evolutionary shift from basic process automation toward "Sustainable Intelligence," identifying the GBS unit as a central "operational airlock" that mediates between landscape pressures -- such as the EU's dual mandate and Carbon Border Adjustment Mechanisms -- and niche innovations in AI-native workflows. The study further maps these clusters onto a stakeholder engagement canvas, highlighting how resilient "Middle Power" hubs in Poland, Portugal, and Malaysia are bypassing the middle-income trap to provide a "third way" for global value chains amidst a bifurcated geopolitical cloud. The results offer a data-driven design approach for leaders and entrepreneurial support networks to orchestrate talent and supply chain flows, thereby enriching the conceptual understanding of Industry 5.0 and the role of GBS as a primary mechanism for navigating a volatile, multipolar digital economy.

2606.12892 2026-06-12 stat.ML cs.LG econ.EM math.ST stat.ME stat.TH 新提交

Prediction-Powered Causal Inference by Automatic Debiased Machine Learning and Semi-Supervised Riesz Regression

预测驱动的因果推断:自动去偏机器学习与半监督Riesz回归

Masahiro Kato

发表机构 * University of Tokyo(东京大学)

AI总结 研究半监督设置下因果参数的半参数有效估计,通过结合去偏机器学习和半监督Riesz回归,提出DML-PPCI和TMLE-PPCI方法,实现比仅用标注数据更小的渐近方差。

详情
AI中文摘要

本研究探讨了在半监督设置下因果和结构参数的半参数有效估计。在我们的设置中,除了由结果和回归变量组成的标注观测数据外,还有未标记的辅助回归变量可用。我们的目标是构建因果和结构参数的估计量,其渐近方差小于仅使用标注数据构建的估计量。我们将此框架称为预测驱动的因果推断(PPCI)。我们首先推导了有效影响函数和效率界,这表明使用辅助回归变量可以获得比仅从标注观测数据可达到的效率界更小的渐近方差。然后,通过将有效影响函数与去偏机器学习(DML)框架相结合,我们提出了称为DML-PPCI的方法。如果我们构建一个估计方程估计量,我们称之为EE-DML-PPCI;如果我们构建一个目标学习估计量,我们称之为TMLE-DML-PPCI。两种估计量的渐近方差都与我们推导的效率界相匹配。在构建估计量时,有效影响函数的估计起着重要作用。在我们的研究中,有效影响函数也是一个Neyman正交分数,它依赖于Riesz表示子和回归函数。对于Riesz表示子估计,我们开发了具有收敛速度保证的半监督广义Riesz回归。

英文摘要

This study investigates semiparametric efficient estimation of causal and structural parameters in a semi-supervised setting. In our setting, unlabeled auxiliary regressors are available in addition to labeled observations consisting of outcomes and regressors. Our goal is to construct estimators of causal and structural parameters whose asymptotic variances are smaller than those of estimators constructed using only labeled data. We refer to this framework as prediction-powered causal inference (PPCI). We first derive the efficient influence function and the efficiency bound, which imply that the use of auxiliary regressors can attain a smaller asymptotic variance than the efficiency bound attainable from labeled observations alone. Then, by combining the efficient influence function with the debiased machine learning (DML) framework, we propose methods that we call DML-PPCI. If we construct an estimating-equation estimator, we refer to the method as EE-DML-PPCI; if we construct a targeted-learning estimator, we refer to the method as TMLE-DML-PPCI. The asymptotic variances of both estimators match our derived efficiency bound. In the construction of the estimators, estimation of the efficient influence function plays an important role. In our study, the efficient influence function is also a Neyman orthogonal score, which depends on the Riesz representer and the regression function. For Riesz representer estimation, we develop semi-supervised generalized Riesz regression with convergence rate guarantees.

2606.12460 2026-06-12 physics.soc-ph econ.EM 新提交

Sovereign Stress Avalanches and Network Amplification in Latin America

拉丁美洲主权压力雪崩与网络放大效应

Diego Vallarino

AI总结 利用J.P.摩根EMBI全球多元化利差数据,通过幂律诊断、网络分析和安慰剂检验,发现拉丁美洲主权压力事件具有重尾分布(指数1.77),且同步性显著高于随机水平,但放大效应源于共同因子而非区域传播。

详情
AI中文摘要

本文利用2007-2026年间11个主权国家的J.P.摩根EMBI全球多元化月度利差数据,研究拉丁美洲信贷市场中的主权压力雪崩与网络放大效应。国家压力事件定义为正对数利差创新超过国家特定波动率阈值,区域雪崩统计每月压力国家数量。实证设计结合有限样本幂律诊断、阈值稳健性检验、国家级重排安慰剂以及滚动相关、偏相关和最小生成树网络。雪崩规模呈重尾分布,估计指数为1.77,而利差变化和事件间隔时间处于重尾边界区域。安慰剂显示同步性远高于独立压力时间,p值低于0.001。大型雪崩与更密集且频谱放大更强的原始相关网络同时出现,但经偏相关过滤后不再显著,表明是共同因子联动而非条件区域传播。网络指标描述同期压力状态而非早期预警信号。结果为监测新兴市场主权脆弱性提供了有限规模临界性框架。

英文摘要

This paper studies sovereign stress avalanches and network amplification in Latin American credit markets using monthly J.P. Morgan EMBI Global Diversified spreads for eleven sovereigns over 2007-2026. Country stress events are defined as positive log-spread innovations exceeding country-specific volatility thresholds, and regional avalanches count the number of stressed countries in each month. The empirical design combines finite-sample power-law diagnostics, threshold robustness checks, a country-level reshuffling placebo, and rolling correlation, partial-correlation, and minimum-spanning-tree networks. Avalanche sizes are heavy-tailed, with an estimated exponent of 1.77, while spread changes and inter-event times lie in a heavy-tail boundary regime. The placebo shows synchronization far above independent stress timing, with p-values below 0.001. Large avalanches coincide with denser and more spectrally amplifying raw-correlation networks, but not after partial-correlation filtering, indicating common-factor co-movement rather than conditional regional propagation. Network metrics describe contemporaneous stress regimes rather than early-warning signals. The results provide a finite-size criticality framework for monitoring sovereign fragility in emerging markets.

2606.07489 2026-06-12 cs.AI econ.GN q-fin.EC 新提交

How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope

AI代理如何重塑知识工作:自主性、效率与范围

Jeremy Yang, Kate Zyskowski, Noah Yonack, Jerry Ma

发表机构 * Harvard Business School(哈佛商学院) Perplexity AI

AI总结 基于Perplexity产品数据,研究发现AI代理通过端到端任务执行,将自主工作时间从33秒提升至26分钟,完成时间缩短87%,成本降低94%,并扩展了工作范围与认知层次。

详情
AI中文摘要

前沿AI系统正从对话式助手转向端到端执行任务的自主代理,弥合智能与实用性之间的差距。利用Perplexity的Search和Computer产品的生产数据,我们通过研究AI代理如何加速和重塑知识工作来考察这一转变。三个关键实证发现出现。首先,使用具有几乎相同初始查询对的会话作为同一底层任务的自然实验,Computer每个用户会话执行26分钟的自主工作,而Search为33秒。Computer自动化了Search用户可能手动编排和实现的任务分解与执行。因此,Computer将后续查询分布转向更高层次的工作,如验证和扩展。自主性也提高了执行质量,Computer上每次查询的不满意率比Search低55%。其次,由于其自主性优势,Computer在匹配任务上将完成时间从269分钟减少到36分钟,与仅配备Search的人类相比,估计时间和成本分别降低87%和94%。第三,Computer改变了用户尝试的工作范围:Computer查询更常跨越职业边界,需要更高层次的认知,利用更广泛的专业知识,采取将相互依赖的子任务捆绑到单个查询中的复合任务形式,并解锁了同一用户在Search使用中基本不存在的工作活动。综合来看,证据表明AI代理加速工作流程、提高输出质量、降低成本,并扩展自动化工作的广度和深度。

英文摘要

Frontier AI systems are bridging the gap between intelligence and utility by shifting from conversational assistants to autonomous agents that execute tasks end to end. Using production data from Perplexity's Search and Computer products, we study this transition by examining how AI agents accelerate and reshape knowledge work. Three key empirical findings emerge. First, using sessions with near-identical initial query pairs as natural experiments for the same underlying task attempted with both products, Computer performs 26 minutes of autonomous work per user session, versus 33 seconds for Search. Computer automates task decomposition and execution that Search users might otherwise manually orchestrate and implement. As a result, Computer shifts follow-up query distribution toward higher-order work such as verification and extension. Autonomy also increases execution quality, with per-query dissatisfaction rates 55% lower on Computer than on Search. Second, due to its autonomy advantage, Computer reduces completion time from 269 to 36 minutes on matched tasks, lowering estimated time and cost by 87% and 94%, respectively, compared to humans equipped with Search alone. Third, Computer changes the scope of work that users attempt: Computer queries more often cross occupational boundaries, require higher-order cognition, draw on broader expertise, take the form of composite tasks that bundle interdependent subtasks into a single query, and unlock work activities that are essentially absent from Search usage among the same users. Together, the evidence indicates that AI agents accelerate workflows, enhance output quality, reduce costs, and expand the breadth and depth of automated work.

2605.16703 2026-06-12 econ.EM 版本更新

Designing Persuasive Experiments

设计说服性实验

Karun Adusumilli, Abhi Vemulapati

AI总结 本文提出了解决实验设计中激励不一致问题的框架,通过设定社会福利阈值约束实验者优化设计,减少样本量并提升社会福利。

详情
AI中文摘要

在实验设计中,激励通常不一致:实验者设计并资助实验以寻求监管批准,而监管者寻求最大化社会福利。我们提出一个框架来解决这一冲突,其中监管者设定一个最小预期福利阈值,而实验者在该约束下优化设计。该框架不需要了解实验者的私人偏好或成本,并减轻了战略贝叶斯说服。在正常先验下,按照奈曼分配抽样总是最优的,无论具体目标如何。此外,我们还刻画了最优停止规则。在一项校准到历史临床试验数据的数值研究中,我们的框架将预期样本量比达到相同社会福利的古典设计减少了超过48%。

英文摘要

Incentives in experimental design are often misaligned: experimenters design and finance experiments to seek regulatory approval, while regulators seek to maximize social-welfare. We propose a framework to resolve this conflict, wherein regulators set a minimum welfare threshold, and experimenters optimize designs subject to this constraint. It requires no knowledge of experimenters' private preferences or costs and mitigates strategic Bayesian persuasion. Under normal priors, Neyman-allocation is always the optimal-sampling strategy, regardless of specific objectives. We also characterize the optimal stopping-rule. A numerical study calibrated to clinical-trial data shows sample-size reductions of over 48% relative to classical designs attaining the same social-welfare.

2605.06721 2026-06-12 cs.GT econ.TH 版本更新

A Simple Method for School Choice Lotteries

学校选择抽签的一种简单方法

Yasunori Okumura

AI总结 提出一种多项式时间方法构建事前稳定且满足同等对待的学校选择抽签,并证明其最优性。

详情
AI中文摘要

本文提出一种简单的多项式时间方法,用于构建满足同等对待(ETE)的事前稳定学校选择抽签。我们证明,任何约束有效稳定匹配的ETE重新分配都是事前稳定的,满足ETE,并且不被任何其他事前稳定抽签序数支配。我们进一步证明,存在一个约束有效稳定匹配,其ETE重新分配不被任何事后稳定抽签序数支配。

英文摘要

This note proposes a simple polynomial-time method for constructing an ex ante stable school-choice lottery satisfying equal treatment of equals (ETE). We show that the ETE reassignment of any constrained efficient stable matching is ex ante stable, satisfies ETE, and is not ordinally dominated by any other ex ante stable lottery. We further show that there exists a constrained efficient stable matching whose ETE reassignment is not ordinally dominated by any ex post stable lottery.

2511.03142 2026-06-12 econ.TH math.OC 版本更新

A Theory of Saving under Risk Preference Dynamics

风险偏好动态下的储蓄理论

Qingyin Ma, Xinxi Song, Alexis Akira Toda

AI总结 本文通过引入风险偏好冲击,提出了一种最优储蓄理论,揭示了风险偏好随机变化导致高财富家庭储蓄率趋于100%且边际消费倾向趋于零的机制。

Comments 52 pages, 3 tables, 3 figures

详情
AI中文摘要

实证证据表明,富裕家庭比其他群体具有更高的储蓄率和显著更低的边际消费倾向(MPC)。现有理论无法在不联合施加关于回报、贴现和偏好的限制性假设的情况下解释这一模式。在本文中,我们发展了一个具有偏好冲击的最优储蓄的一般理论,并识别了一种新的机制,通过该机制,随机风险偏好重塑了渐近消费和储蓄行为。具体而言,仅仅是下一期变得不那么风险厌恶的可能性就提高了将财富向前转移的价值,因为未来的自我可能更愿意将财富转化为消费。与经典的预防性储蓄动机(通常源于资源风险并随财富增加而减弱)不同,这种力量即使在任意高的财富水平下仍然有效,产生了延迟消费的持续激励,并推动渐近MPC降至零(即100%的渐近储蓄率)。因此,消失的MPC成为风险偏好动态的一般含义,而非限制性假设的产物,为富裕家庭中观察到的持续高储蓄率和低MPC提供了一个理论上稳健且经验上一致的解释。

英文摘要

Empirical evidence shows that wealthy households have substantially higher saving rates and markedly lower marginal propensity to consume (MPC) than other groups. Existing theory cannot account for this pattern without jointly imposing restrictive assumptions on returns, discounting, and preferences. In this paper, we develop a general theory of optimal savings with preference shocks and identify a novel mechanism through which stochastic risk preferences reshape the asymptotic consumption and saving behavior. Specifically, the mere possibility of becoming less risk averse next period raises the value of carrying wealth forward, since future selves may be more willing to convert wealth into consumption. Unlike the classical precautionary saving motive, which typically arises from resource risks and weakens as wealth increases, this force remains operative even at arbitrarily high wealth levels, generating a persistent incentive to defer consumption and driving the asymptotic MPC to zero (i.e., a 100% asymptotic saving rate). As a result, vanishing MPCs emerge as a generic implication of risk preference dynamics, rather than an artifact of restrictive assumptions, offering a theoretically robust and empirically consistent account of the persistently high saving rates and low MPCs observed among wealthy households.

2601.06363 2026-06-12 econ.TH cs.MA 版本更新

The Replicator-Optimization Mechanism: A Scale-Relative Formalism for Persistence-Conditioned Dynamics with Application to Consent-Based Metaethics

复制者-优化机制:一种面向持久条件动力学的尺度相对形式化及其在基于同意的元伦理学中的应用

Murad Farzulla

AI总结 本文提出一种尺度相对的形式化框架,统一复制者-突变动力学与价格选择-传输模型,通过摩擦、合法性和信念传递三个核心概念,从社会契约论独立推导出与生物学相同的结构,并建立描述性动力学与工具性规范性的桥梁,避免实然-应然谬误。

Comments 67 pages, 1 table, Lean 4 verification appendix (machine-checked). v2: substantially expanded from v1; adds formal-verification and identifiability sections and corrects references

详情
AI中文摘要

本文形式化了一类广泛使用的动力学——复制者-突变动力学和价格式选择与传输——并明确了决定该类动力学跨领域实例化的建模选择(尺度、原子单位、交互拓扑、传输核)。其主干是已知的;我们不声称发现了选择。新颖贡献有三方面:(i)一种尺度相对的核参数化,其中原子单位本身是参数,使得在物理学、生物学、经济学、认知和社会组织中系统实例化成为可能;(ii)一种用于政治哲学的同意-摩擦实例化,其中摩擦是原始量,合法性作为生存概率,信念传递作为突变核;(iii)一条从社会契约论而非生物学或物理学出发的推导路径,通过独立途径到达相同的正式结构。我们提供了一个连接描述性动力学与工具性规范性的桥梁原则:如果主体偏好更低的预期摩擦,那么“应当”主张就是减少指定动力学下预期摩擦的策略的简写。这种条件结构避免了实然-应然谬误,同时将规范性话语建立在经验上可处理的动力学基础上。我们通过显式建模潜在摩擦与观察摩擦来处理病态情况(威权稳定性、被抑制的摩擦)。该框架通过摩擦、合法性和信念传递动力学的操作化产生可检验的预测,并且在测量装置层面而非正式结构层面是可证伪的。

英文摘要

This paper formalizes a widely used dynamical class--replicator-mutator dynamics and Price-style selection-and-transmission--and makes explicit the modeling choices (scale, atomic unit, interaction topology, transmission kernel) that determine how this class instantiates across domains. The backbone is known; we do not claim to have discovered selection. The novel contributions are threefold: (i) a scale-relative kernel parameterization where atomic units are themselves parameters, enabling systematic instantiation across physics, biology, economics, cognition, and social organization; (ii) a consent-friction instantiation for political philosophy, where friction is the primitive, legitimacy functions as survival probability, and belief-transfer functions as mutation kernel; and (iii) a derivation path from social contract theory rather than from biology or physics, arriving at the same formal structure via an independent route. We provide a bridge principle connecting descriptive dynamics to instrumental normativity: if agents prefer lower expected friction, then "ought" claims are shorthand for policies that reduce expected friction under the specified dynamics. This conditional structure avoids the is-ought fallacy while grounding normative discourse in empirically tractable dynamics. We address pathological cases (authoritarian stability, suppressed friction) through explicit modeling of latent versus observed friction. The framework generates testable predictions through operationalization of friction, legitimacy, and belief-transfer dynamics, and is falsifiable at the level of measurement apparatus rather than formal structure.

2509.23554 2026-06-12 econ.GN q-fin.EC 版本更新

When Clear Skies Cloud Trust: Environmental Cues and the Paradox of Confidence in Government

当晴空万里反而侵蚀信任:环境线索与政府信心的悖论

Xiangzhe Xu, Ran Wu

AI总结 利用世界价值观调查与NASA气象数据,发现晴朗天气通过增强环境意识和负面归因,反而降低政府信任,并识别出主观幸福感等中介路径。

Comments Realized that parts of the analysis substantially overlap with our ongoing follow-up project. We prefer to withdraw this version and will submit a substantially revised and extended version later

详情
AI中文摘要

政府信任作为政治经济学和公共政策研究的核心概念,是民主合法性和国家能力的基本基石。本文研究环境条件(尤其是日照效率)如何通过情感和认知机制影响报告的政府信任。利用世界价值观调查第7波数据与NASA POWER高频气象数据,我们提出并验证了一种新的“显著性与归因”机制:更晴朗的天空可能通过提高环境意识和引发负面归因,反而降低政府信任。我们进一步识别出潜在的中介路径,包括主观幸福感、政治兴趣、政治讨论和健康感知,并证明环境条件会在基于调查的信任指标中引入测量误差。我们的研究结果为环境心理学、行为政治经济学和调查方法学提供了理论贡献,并对治理、政策设计和调查实践具有实际意义。

英文摘要

Government trust, as a core concept in political economy and public policy research, serves as a fundamental cornerstone of democratic legitimacy and state capacity. This paper examines how environmental conditions, particularly sunlight efficiency, influence reported government trust through both affective and cognitive mechanisms. Leveraging World Values Survey Wave 7 data merged with NASA POWER high-frequency weather data, we propose and validate a novel ``salience and attribution'' mechanism: clearer skies may paradoxically reduce government trust by heightening environmental awareness and triggering negative attributions. We further identify potential mediating pathways, including subjective well-being, political interest, political discussion, and health perception, and demonstrate that environmental conditions introduce measurement error in survey-based trust indicators. Our findings provide theoretical contributions to environmental psychology, behavioral political economy, and survey methodology, and yield practical implications for governance, policy design, and survey

2503.20092 2026-06-12 econ.TH 版本更新

Entry and disclosure in group contests

群体竞赛中的进入与信息披露

Luke Boosey, Philip Brookins, Dmitry Ryvkin

AI总结 研究群体竞赛中的信息披露政策,发现群体内信息披露明确提高总投入,而完全披露效果不确定,与个体竞赛中信息披露降低总投入的结论不同。

详情
AI中文摘要

我们研究群体竞赛的信息披露政策。每个参与者内生地决定是否作为其群体成员参与竞争。群体内努力的聚合采用最佳射击方式,即每个群体的表现由其成员中的最高投资决定。我们考虑一个广义的全支付拍卖设置,其中表现最高的群体确定性地赢得竞赛。参与者在进入阶段对获胜的价值是私人信息,但在竞争阶段可能被披露。我们比较三种披露政策:(i)不披露,当进入者数量未知且其价值私有时;(ii)群体内披露,当此信息在每个群体内披露但不在群体间披露时;(iii)完全披露,当关于进入者的信息在群体间披露时。对于个体间竞赛的基准情况,信息披露总是降低预期的总投入。然而,这在群体竞赛中不再成立:群体内披露明确提高总投入,而完全披露的效果是不确定的。

英文摘要

We study information disclosure policies for contests among groups. Each player endogenously decides whether or not to participate in competition as a member of their group. Within-group aggregation of effort is best-shot, i.e., each group's performance is determined by the highest investment among its members. We consider a generalized all-pay auction setting, in which the group with the highest performance wins the contest with certainty. Players' values for winning are private information at the entry stage, but may be disclosed at the competition stage. We compare three disclosure policies: (i) no disclosure, when the number of entrants remains unknown and their values private; (ii) within-group disclosure, when this information is disclosed within each group but not across groups; and (iii) full disclosure, when the information about entrants is disclosed across groups. For the benchmark case of contests between individuals, information disclosure always reduces expected aggregate investment. However, this is no longer true in group contests: Within-group disclosure unambiguously raises aggregate investment, while the effect of full disclosure is ambiguous.

2111.08157 2026-06-12 econ.EM math.ST stat.ME stat.TH 版本更新

Fine Stratification of Survey Experiments

调查实验的精细分层

Max Cytrynbaum

AI总结 本文提出两阶段实验模型,通过匹配k元组随机化实现精细分层,开发快速匹配算法,证明可减少处理效应估计方差,并提供充分利用设计效率的推断方法。

详情
AI中文摘要

本文研究了一个两阶段实验模型,其中研究者首先从符合条件的池中抽样具有代表性的实验参与者,然后使用匹配的$k$元组随机化将每个抽样单元分配到处理组或对照组。为了实现这种设计,我们开发了一种快速的新算法,用于将单元匹配成$k$元组,适用于任意$k \ge 2$和任意维度的协变量。通过调查200篇近期实验工作论文,我们估计该算法新近实现了多变量精细分层,并为经济学中约44%的实验提供了可证明的匹配质量保证。我们表明,精细分层抽样和分配都非参数地降低了处理效应估计的方差,其中分层抽样的收益随着合格池的大小以及协变量预测处理效应异质性的程度而增加。我们开发了新的推断方法,充分利用两个设计阶段的效率提升,允许研究者报告更小的标准误,如果他们设计了代表性实验。对九个已发表实验的应用量化了效率提升。

英文摘要

This paper studies a two-stage model of experimentation, where the researcher first samples representative experimental participants from an eligible pool, then assigns each sampled unit to treatment or control, using matched $k$-tuples randomization at both stages. To implement such designs, we develop a fast new algorithm for matching units into $k$-tuples for any $k \ge 2$ and any dimension of covariates. By surveying 200 recent experimental working papers, we estimate that our algorithm newly enables multivariate fine stratification with provable match quality guarantees for about 44\% of experiments in economics. We show that finely stratified sampling and assignment both nonparametrically reduce the variance of treatment effect estimation, with the gains from stratified sampling increasing in the size of the eligible pool and how well covariates predict treatment effect heterogeneity. We develop new inference methods that fully exploit the efficiency gains from both design stages, allowing researchers to report smaller standard errors if they designed a representative experiment. An application to nine published experiments quantifies the efficiency gains.