COMPILING
Source: arXiv. Updated: 2022-09-29. Indexed: 2024-08-06.
Download link:
http://arxiv.org/abs/2209.14614v1
Resource description:
The COMPILING dataset was created by Beijing Language and Culture University to support complexity-controllable definition generation for learners of Chinese as a foreign language. It contains 127,757 entries, each consisting of a word, its definition, an example, and two complexity measures. The dataset was built by combining the *Contemporary Chinese Learning Dictionary* with the 7th edition of the *Modern Chinese Dictionary*, and it uses HSK vocabulary levels to quantify the complexity of each definition. It is suited to language learning and assisted teaching, and is particularly helpful for readers with limited literacy and for people with language impairments.
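As a rough illustration of the entry layout described above, the sketch below models one entry and filters entries by a complexity threshold. The field names (`complexity_a`, `complexity_b`), the helper `filter_by_complexity`, and all sample values are hypothetical placeholders; the description does not state the actual column names, file format, or how the two measures are defined.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Entry:
    """One entry: a word, its definition, an example, and two complexity measures.

    All field names are assumptions made for illustration only.
    """
    word: str
    definition: str
    example: str
    complexity_a: float  # hypothetical name for the first complexity measure
    complexity_b: float  # hypothetical name for the second complexity measure


def filter_by_complexity(entries: List[Entry], max_a: float) -> List[Entry]:
    """Keep entries whose first complexity measure does not exceed max_a."""
    return [e for e in entries if e.complexity_a <= max_a]


# Placeholder entries with made-up values, not taken from the dataset.
sample = [
    Entry("word_1", "definition_1", "example_1", 1.5, 0.1),
    Entry("word_2", "definition_2", "example_2", 4.0, 0.6),
]

easy = filter_by_complexity(sample, max_a=2.0)
print([e.word for e in easy])
```

A learner-facing tool might use a filter of this kind to select definitions that match a target proficiency level, once the real measures and their scales are known.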
Provided by:
Beijing Language and Culture University
Created:
2022-09-29



