2605.26246
2026-05-27
cs.LG
The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works
LLM蒸馏中的桥园困境:为什么混合硬标签和软标签有效
Guanghui Wang, Kaiwen Lv Kacuila, Zhiyong Yang, Zitai Wang, Jin-Wen Wu, Longtao Huang, Qianqian Xu, Qingming Huang
发表机构
*
School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, China(中国科学院大学计算机科学与技术学院)
;
Alibaba Group, Hangzhou, China(阿里巴巴集团)
;
State Key Laboratory of AI Safety, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China(中国科学院人工智能安全国家重点实验室)
;
Beijing Academy of Artificial Intelligence, Beijing, China(北京人工智能研究院)
;
Key Laboratory of Big Data Mining and Knowledge Management (BDKM), University of Chinese Academy of Sciences, Beijing, China(中国科学院大数据挖掘与知识管理重点实验室)
AI总结
针对大语言模型知识蒸馏中硬标签与软标签的混合使用,提出桥园分解理论解释其降低暴露偏差的机制,并开发自适应混合监督方法,在多个模型上实现性能提升和9.7倍训练成本降低。