Small Experiments, Cheaper Decisions: A Case Study in Staged Promotion for Micro-Pretraining
Institution Hewlett Packard Enterprise
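AI summary This study examines a staged promotion protocol for micro-pretraining that screens configurations under a fixed budget. Validated on a Windows A100 host and a Linux L40S host, it finds that early rankings are unstable, but the final protocol identifies the best configuration in 144 GPU-hours, at lower cost than exhaustive screening.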
Comments 14 pages, 5 figures; 12-hour dual-host micro-pretraining promotion study; source package includes curated ancillary artifacts