ViMMRC 2.0
收藏arXiv2023-03-31 更新2024-08-06 收录
下载链接:
http://arxiv.org/abs/2303.18162v1
下载链接
链接失效反馈官方服务:
资源简介:
ViMMRC 2.0是一个专为越南语教育设计的多项选择阅读理解语料库,由越南信息技术大学的信息科学与工程学院创建。该数据集包含699篇阅读文章和5273个问题,涵盖从一年级到十二年级的学生阅读材料。数据来源于越南教育出版社根据越南教育培训部出版的文学教科书。ViMMRC 2.0旨在通过提供更复杂的问题和阅读材料,挑战和提升机器阅读理解模型,特别是在理解文本的隐含上下文和相关事实链接方面。
ViMMRC 2.0 is a multiple-choice reading comprehension corpus specifically designed for Vietnamese language education, developed by the School of Information Science and Engineering at Vietnam University of Information Technology. This corpus contains 699 reading passages and 5273 questions, covering reading materials for students from Grade 1 to Grade 12. The data is sourced from literature textbooks published by the Educational Publishing House of Vietnam, which comply with the curriculum standards formulated by the Vietnamese Ministry of Education and Training. ViMMRC 2.0 aims to challenge and enhance machine reading comprehension models by providing more complex questions and reading materials, especially in terms of understanding the implicit context of texts and their relevant factual connections.
提供机构:
信息科学与工程学院,信息技术大学,胡志明市,越南
创建时间:
2023-03-31



