LGCNS/KorQuAD_1.0

Name: LGCNS/KorQuAD_1.0
Creator: LGCNS
Published: 2025-08-07 06:52:08
License: not specified

Hugging Face: updated 2025-08-07, indexed 2025-11-01

Download link:

https://hf-mirror.com/datasets/LGCNS/KorQuAD_1.0

Description:

KorQuAD 1.0 is a dataset collected and constructed for Korean machine reading comprehension. Every answer in the dataset is a contiguous span of the source Wikipedia paragraph, and the data follows the same format as the Stanford Question Answering Dataset (SQuAD) v1.0. The dataset contains 1,560 Wikipedia documents, 10,645 paragraphs, and 66,181 question-answer pairs, split into a training set and a development set.
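Because every answer is a span of its paragraph, a SQuAD v1.0-style file can be sanity-checked by comparing each answer string with the slice of the context at its recorded offset. The sketch below assumes the nested SQuAD v1.0 JSON layout (`data` → `paragraphs` → `qas` → `answers`, with `text` and `answer_start` fields); the `sample` record and the helper name `find_misaligned_answers` are hypothetical and made up for illustration, and the copy hosted on Hugging Face may expose a flattened schema instead.

```python
# Hypothetical record in the SQuAD v1.0 layout; the text is invented for illustration
# and is not taken from KorQuAD.
sample = {
    "data": [
        {
            "title": "example",
            "paragraphs": [
                {
                    "context": "KorQuAD 1.0 is a Korean reading comprehension dataset.",
                    "qas": [
                        {
                            "id": "example-0",
                            "question": "What language does KorQuAD 1.0 cover?",
                            "answers": [{"text": "Korean", "answer_start": 17}],
                        }
                    ],
                }
            ],
        }
    ]
}


def find_misaligned_answers(dataset: dict) -> list:
    """Return the ids of questions whose answer text does not match
    the context slice starting at answer_start."""
    bad_ids = []
    for article in dataset["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                for answer in qa["answers"]:
                    start = answer["answer_start"]
                    end = start + len(answer["text"])
                    # In a span-based dataset the answer must equal this slice.
                    if context[start:end] != answer["text"]:
                        bad_ids.append(qa["id"])
    return bad_ids


print(find_misaligned_answers(sample))
```

An empty result means every recorded offset points at its answer text; any id that is returned marks a record whose offset or text needs inspection before training.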

Provider:

LGCNS
