2506.13127
2026-06-17
cs.SD
eess.AS
版本更新
Leveraging Local and Global Knowledge Integration with Time-Frequency Calibrated Distillation for Speech Enhancement
利用局部和全局知识整合与时间频率校准蒸馏进行语音增强
Jiaming Cheng, Ruiyu Liang, Ye Ni, Chao Xu, Jing Li, Wei Zhou, Rui Liu, Björn W. Schuller, Xiaoshuai Hao
发表机构
*
School of Computer Science, Nanjing Audit University(南京审计大学计算机科学学院)
;
School of Communication Engineering, Nanjing Institute of Technology(南京工程技术学院通信工程学院)
;
School of Information Science and Engineering, Southeast University(东南大学信息科学与工程学院)
;
Cardiff University(卡迪夫大学)
;
Inner Mongolia University(内蒙古大学)
;
CHI – the Chair of Health Informatics, TUM University Hospital(健康信息学系,技术大学医院)
;
GLAM – the Group on Language, Audio, & Music, Imperial College London(语言、音频与音乐组,伦敦帝国理工学院)
;
Xiaomi EV(小米电动车)
AI总结
本文提出了一种融合框架,通过时间频率校准知识蒸馏提升语音增强性能,结合局部信息聚焦与全局知识流通,改进了低复杂度学生模型的表现。