Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning
分解大型语言模型的基本能力:在多任务指令微调中缓解跨任务干扰
发表机构 * College of Computer Science and Technology, Jilin University(吉林大学计算机科学与技术学院) ; Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Jilin University(吉林大学符号计算与知识工程重点实验室) ; RIKEN Center for Advanced Intelligence Project(RIKEN高级智能项目中心) ; Graduate School of Frontier Sciences, University of Tokyo(东京大学前沿科学研究生院)
AI总结 本文通过实验揭示现有方法仍存在跨任务干扰问题,提出BADIT方法将LLM参数分解为正交的高奇异值LoRA专家,通过球形聚类保持正交性,实验证明其在多任务指令微调中优于现有方法。
Comments Accepted by ICML 2026. 25 pages, 13 figures. Code: https://github.com/wangbing1416/BADIT