xzh12356/GSM8K_zh
收藏Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/xzh12356/GSM8K_zh
下载链接
链接失效反馈官方服务:
资源简介:
`GSM8K_zh`是一个用于中文数学推理的数据集,其中的问答对是通过`GPT-3.5-Turbo`从GSM8K数据集翻译而来。数据集包含7473个训练样本和1319个测试样本,训练样本用于监督微调,测试样本用于评估。训练样本包含`question_zh`和`answer_zh`两个键,分别表示问题和答案;测试样本仅包含翻译后的问题`question_zh`。
`GSM8K_zh` is a dataset for mathematical reasoning in Chinese, question-answer pairs are translated from GSM8K by `GPT-3.5-Turbo` with few-shot prompting. The dataset consists of 7473 training samples and 1319 testing samples. The former is for supervised fine-tuning, while the latter is for evaluation. For training samples, `question_zh` and `answer_zh` are question and answer keys, respectively; for testing samples, only the translated questions are provided (`question_zh`).
提供机构:
xzh12356



