567-labs/gsm8k
收藏Hugging Face2024-08-26 更新2025-04-19 收录
下载链接:
https://hf-mirror.com/datasets/567-labs/gsm8k
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
dataset_info:
features:
- name: question
dtype: string
- name: answer
dtype: int64
- name: reasoning
dtype: string
splits:
- name: train
num_bytes: 3968293
num_examples: 7473
- name: test
num_bytes: 714662
num_examples: 1319
download_size: 2718651
dataset_size: 4682955
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
---
GSM8K (Grade School Math 8K) is a dataset of 8.5K high quality linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning. This version is a cleaned and refined version of the original OpenAI dataset, with the answer and chain of thought (reasoning) separated from the original combined answer field.
We have ensured that the reasoning (Chain of Thought) and the final answer are stored separately, providing greater clarity and ease of use for downstream applications.
Credits to OpenAI for the original dataset which [you can find here](https://huggingface.co/datasets/openai/gsm8k).
提供机构:
567-labs



