Japanese Multiple-Choice Machine Reading Comprehension Dataset JLPT-MC

Name: Japanese Multiple-Choice Machine Reading Comprehension Dataset JLPT-MC
Creator: Science Data Bank
Published: 2026-03-06 09:19:24
License: 暂无描述

DataCite Commons2026-03-06 更新2026-05-05 收录

下载链接：

https://www.scidb.cn/detail?dataSetId=2125ee54e8464db98c2be120c7ba875d

下载链接

链接失效反馈

官方服务：

资源简介：

To verify the effectiveness of the model, the author constructed a Japanese MCRC dataset based on the Japanese Language Proficiency Test (JLPT). This dataset covers the questions in the "Solution" section of JLPT's five test levels from 1992 to 2023, covering categories such as history, daily life, music culture, etc. This dataset extracts a total of 1731 pairs of context question candidate option triplets, including 1146 contexts, 1731 questions, and 6924 candidate options. To ensure that the model can be trained on pure text data, this article excludes non pure text data such as image selection, route planning, and graphic judgment. The processed triplet data consists of a total of 1329 pairs, including 904 contexts, 1329 questions, and 5316 candidate options. The dataset is named JLPT-MC.

提供机构：

Science Data Bank

创建时间：

2026-03-06

5,000+

优质数据集

54 个

任务类型

进入经典数据集