2605.27991
2026-06-05
stat.ML
cs.LG
Gradient-Flow Optimization as Dynamic Random-Effects Inference: Testing and Early Stopping with Applications to Deep Learning
深度神经网络训练作为随机效应:优化-推断对偶性
Minhao Yao, Ruoyu Wang, Xihong Lin, Lin Liu, Zhonghua Liu
发表机构
*
Centre for Biomedical Data Science, Duke-NUS Medical School, National University of Singapore(生物医学数据科学中心,国立新加坡大学杜克-新加坡医学学校)
;
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA(生物统计学系,哈佛T.H. Chan公共卫生学院,马萨诸塞州波士顿,美国)
;
Institute of Natural Sciences, MOE-LSC, School of Mathematical Sciences, CMA-Shanghai, SJTU-Yale Joint Center of Biostatistics and Data Science, Shanghai Jiao Tong University(自然科学院,MOE-LSC,数学科学学院,CMA-上海,SJTU-耶鲁联合生物统计学与数据科学中心,上海交通大学)
;
Department of Biostatistics, Columbia University, New York, NY, USA(生物统计学系,哥伦比亚大学,纽约州纽约市,美国)
AI总结
本文提出深度神经网络训练与经典随机效应模型等价,揭示了优化-推断对偶性,并利用限制最大似然估计实现基于似然的早停规则。