AndyChiang/cloth
Hugging Face. Updated: 2022-10-14. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/AndyChiang/cloth
Description:
---
pretty_name: cloth
multilinguality:
- monolingual
language:
- en
license:
- mit
size_categories:
- 10K<n<100K
tags:
- cloze
- mid-school
- high-school
- exams
task_categories:
- fill-mask
---
# cloth
**CLOTH** is a collection of nearly 100,000 cloze questions drawn from middle school and high school English exams. The table below gives the number of questions per school level and data split.
| Number of questions | Train | Valid | Test |
| ------------------- | ----- | ----- | ----- |
| **Middle school** | 22056 | 3273 | 3198 |
| **High school** | 54794 | 7794 | 8318 |
| **Total** | 76850 | 11067 | 11516 |
Source: https://www.cs.cmu.edu/~glai1/data/cloth/
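As a quick sanity check on the table above, the following minimal Python sketch recomputes the per-split totals and the overall question count from the per-level rows. The names `COUNTS`, `SPLITS`, `split_totals`, and `load_cloth` are illustrative and not part of the dataset. The `load_cloth` helper assumes the Hugging Face `datasets` library is installed and that the repository loads without extra configuration; this card does not state the split or column names, so inspect the returned object before relying on them.

```python
# Question counts copied from the table above, as (train, valid, test).
COUNTS = {
    "middle": (22056, 3273, 3198),
    "high": (54794, 7794, 8318),
}
SPLITS = ("train", "valid", "test")


def split_totals(counts):
    """Sum the per-level counts column by column, one total per split."""
    return {
        split: sum(row[i] for row in counts.values())
        for i, split in enumerate(SPLITS)
    }


totals = split_totals(COUNTS)
grand_total = sum(totals.values())

# Fraction of all questions that falls into each split.
shares = {split: n / grand_total for split, n in totals.items()}


def load_cloth():
    """Load the dataset from the Hub (assumes `pip install datasets` and
    network access; split and column names are not given in this card)."""
    from datasets import load_dataset

    return load_dataset("AndyChiang/cloth")


print(totals)       # {'train': 76850, 'valid': 11067, 'test': 11516}
print(grand_total)  # 99433
```

The recomputed totals match the **Total** row, and the overall count of 99,433 is consistent with the "nearly 100,000" figure in the description.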
Provider:
AndyChiang
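Summary of original information
Dataset overview
Basic information
- Name: CLOTH
- Language: English (en)
- License: MIT
- Size: 10,000 < n < 100,000
Description
CLOTH is a dataset of nearly 100,000 cloze questions drawn from middle school and high school English exams.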
Data distribution
| School level | Train | Valid | Test |
|---|---|---|---|
| Middle school | 22,056 | 3,273 | 3,198 |
| High school | 54,794 | 7,794 | 8,318 |
| Total | 76,850 | 11,067 | 11,516 |
Tags and tasks
- Tags: cloze, mid-school, high-school, exams
- Task category: fill-mask



