callofthenight1/gaokao-sft-chinese-large
收藏Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/callofthenight1/gaokao-sft-chinese-large
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-generation
- question-answering
language:
- zh
pretty_name: Gaokao SFT Chinese Large
size_categories:
- 10K<n<100K
---
# Gaokao SFT Chinese Large
This is the larger Chinese-only SFT-style exam dataset version.
## Composition
- Total samples: 64164
- Train samples: 62920
- Validation samples: 1244
## Fields
- `id`
- `lang`
- `subject`
- `source`
- `instruction`
- `input`
- `output`
- `messages`
## Example: chemistry
```json
{
"instruction": "化学实验基本方法的第一节主要涉及哪些内容?",
"output": "解析:化学实验基本方法的第一节主要介绍了化学实验的基本操作和注意事项,包括实验仪器的使用、实验操作的规范、实验安全知识以及实验数据的记录和处理等内容。这些基础知识是进行化学实验的基础,对于确保实验的准确性和安全性至关重要。"
}
```
## Example: history
```json
{
"instruction": "请解答下面的题目,并给出简洁解析。\n题目:1.( 4分)西周分封制在中国历史上影响深远。下列省、自治区中,其简称源\n自西周封国国名的是( )\nA. 河南.河北\nB. 湖南.湖北\nC. 山东.山西\nD. 广东.广西",
"output": "解析:西周分封的诸侯国主要有鲁齐燕卫宋晋。A项河南的简称是豫,河北的\n简称是冀; B项湖南的简称是湘,湖北的简称是鄂; D项广东的简称是粤,\n广西的简称是桂。其简称都不是源自西周封国国名, 故排除 ABD三项。 \nC项山东的简称是鲁,山西的简称是晋,其简称都是源自西周封国国名。故C项\n正确。 \n故选: C。\n最终答案:C"
}
```
## Example: english exam
```json
{
"instruction": "请根据给定语境完成这道英语题,并给出简洁解析。\n题型:Cloze-With-Multiple-Choices\n语境:Conversation One:John: Hello, Jack. Let's play computer ____.Jack: That sounds fun. But I don't have a computer.John: Well, do you have a volleyball?Jack: Yes, I ____.John: Then let's play volleyball.Jack: Oh, volleyball ____ so difficult.John: Well, we can ____ football.Jack: All right. Let's go.\nA. games\nB. do\nC. is\nD. play\n考点:Contextual Vocabulary",
"output": "解析:games fit the sentence.\n最终答案:A"
}
```
提供机构:
callofthenight1



