AI-MO/NuminaMath-CoT
Source: Hugging Face. Updated: 2024-11-25. Indexed: 2024-07-22.
Download link:
https://hf-mirror.com/datasets/AI-MO/NuminaMath-CoT
Description:
The NuminaMath CoT dataset contains approximately 860k math problems, each with a solution written in Chain of Thought (CoT) format. Sources range from Chinese high school math exercises to US and international mathematics olympiad competition problems. The data were collected primarily from online exam paper PDFs and mathematics discussion forums. The processing steps are (a) OCR of the original PDFs, (b) segmentation into problem-solution pairs, (c) translation into English, (d) realignment to produce a CoT reasoning format, and (e) final answer formatting.
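As a rough illustration of what the final answer formatting step makes possible, the sketch below pulls the final answer out of a CoT solution string. It assumes the final answer is wrapped in a LaTeX `\boxed{...}` command, which the description above does not state. The helper name `extract_boxed` and the sample string are hypothetical and are not part of the dataset or its tooling.

```python
from typing import Optional


def extract_boxed(solution: str) -> Optional[str]:
    """Return the contents of the last boxed answer in a solution, or None.

    Assumption: the final answer is written as \\boxed{...} (not confirmed
    by the dataset description).
    """
    marker = r"\boxed{"
    start = solution.rfind(marker)  # the last occurrence is taken as the final answer
    if start == -1:
        return None

    depth = 1  # we are already inside the opening brace of \boxed{
    chars = []
    for ch in solution[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # Matching close brace found: everything collected is the answer.
                return "".join(chars)
        chars.append(ch)
    return None  # braces never balanced


# Hypothetical sample solution, written for this example only.
sample = r"We have $2x = 1$, so the answer is $\boxed{\frac{1}{2}}$."
answer = extract_boxed(sample)
```

Brace counting is used instead of a simple regular expression because answers such as `\frac{1}{2}` contain nested braces that a non-greedy pattern would cut short.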
Provided by:
AI-MO



