arXiv AI recent: Mask-Proof: An LLM-based Automated Data Curation Pipeline on Mathematical Proofs
Researchers introduced Mask-Proof, a pipeline that turns real proofs into automatically checkable masked-step tasks.,The pipeline masks key formula steps, provides surrounding context, an...
Mask-Proof is a pipeline that evaluates step-level reasoning in mathematical proofs using large language models (LLMs).,The pipeline has been tested with 17 models, with reasoning-enhanced models outperforming standard models by 12% to 27%, and the evaluator achieving 96.8% agreement with expert...