2606.15258
2026-06-16
cs.AI
新提交
Mask-Proof: An LLM-based Automated Data Curation Pipeline on Mathematical Proofs
Mask-Proof: 一种基于LLM的数学证明自动数据整理流水线
Jierui Zhang, Siyuan Tan, Xinhang Li, Longzhuangzhi Lin, Dailin Li, Chengfeng Gu, Xinping Li, Yaxian Hao, Shengjia Liang, Yuxiang Ren, Wenhao Liu
发表机构
*
School of Computer Science, Beijing University of Posts and Telecommunications(北京邮电大学计算机学院)
;
Graduate College for Engineers, Beijing University of Posts and Telecommunications(北京邮电大学研究生院工程师学院)
;
School of Mathematical Sciences, Fudan University(复旦大学数学科学学院)
;
School of Cyberspace Security, Beijing University of Posts and Telecommunications(北京邮电大学网络空间安全学院)
;
School of Computer Science and Technology, Dalian University of Technology(大连理工大学计算机科学与技术学院)
;
Chu Kochen Honors College, Zhejiang University(浙江大学竺可桢学院)
;
Department of Psychological and Cognitive Sciences, Tsinghua University(清华大学心理学与认知科学系)
;
State Key Laboratory of Virtual Reality Technology and Systems, Beihang University(北京航空航天大学虚拟现实技术与系统国家重点实验室)
;
School of Intelligence Science and Technology, Nanjing University(南京大学智能科学与技术学院)
AI总结
提出Mask-Proof流水线,将真实证明转化为可自动检查的掩码步骤任务,通过LLM等价性判断器评估模型推理,构建包含292个问题的基准,推理增强模型性能提升12%-27%。