
AI-MO/NuminaMath-CoT

Source: Hugging Face. Updated: 2024-11-25. Indexed: 2024-07-22.
Download link:
https://hf-mirror.com/datasets/AI-MO/NuminaMath-CoT
Overview:
The NuminaMath-CoT dataset contains approximately 860k math problems, each with a solution written in Chain of Thought (CoT) format. Sources range from Chinese high school math exercises to US and international mathematics olympiad problems. The data were collected primarily from online exam paper PDFs and mathematics discussion forums. The processing steps were (a) OCR of the original PDFs, (b) segmentation into problem-solution pairs, (c) translation into English, (d) realignment to produce a CoT reasoning format, and (e) final answer formatting.
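As an illustration of what a final answer formatting step makes possible, the sketch below pulls the final answer out of a CoT solution string. It assumes, and this page does not state it, that each record has `problem` and `solution` text fields and that the final answer is wrapped in a LaTeX `\boxed{...}` command. The helper `extract_boxed_answer` and the sample record are hypothetical and are not part of the dataset or of any library.

```python
from typing import Optional


def extract_boxed_answer(solution: str) -> Optional[str]:
    """Return the contents of the last \\boxed{...} in a solution, or None.

    Assumption: final answers are marked with LaTeX \\boxed{...}.
    Braces are counted so nested groups such as \\frac{3}{4} survive intact.
    """
    marker = "\\boxed{"
    start = solution.rfind(marker)
    if start == -1:
        return None  # no boxed answer present

    depth = 1  # we are already inside the opening brace of \boxed{
    chars = []
    for ch in solution[start + len(marker):]:
        if ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                return "".join(chars)  # matching close brace found
        chars.append(ch)
    return None  # braces never balanced


# Hypothetical record; the field names are an assumption, not taken from this page.
sample = {
    "problem": "What is 1/2 + 1/4?",
    "solution": "Rewrite 1/2 as 2/4. Then 2/4 + 1/4 = 3/4. "
                "The answer is $\\boxed{\\frac{3}{4}}$.",
}
answer = extract_boxed_answer(sample["solution"])
print(answer)
```

Counting braces rather than using a simple regular expression keeps answers with nested LaTeX groups in one piece. If the real records use a different answer marker, only the `marker` string would need to change.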
Provider:
AI-MO