Chinese_Classifier
收藏Opencsg2024-03-22 更新2024-06-22 收录
下载链接:
https://www.opencsg.com/datasets/OpenDataLab/Chinese_Classifier
下载链接
链接失效反馈官方服务:
资源简介:
量词是汉语中用来表达数量的虚词,对语言学习者来说尤其困难。这个中文分类器数据集可用于根据上下文预测中文分类器。 该数据集包含大量来自三种语言语料库(普通话兰开斯特语料库、UCLA 书面汉语语料库和莱顿微博语料库)的中文分类器使用示例句子。为基于上下文的分类器预测任务清理和处理数据。
Classifiers are function words in Chinese used to denote quantity, which pose particular challenges for language learners. This Chinese classifier dataset is developed for the task of predicting Chinese classifiers based on contextual information. It contains a large number of example sentences demonstrating the usage of Chinese classifiers from three linguistic corpora: the Mandarin Lancaster Corpus, the UCLA Written Chinese Corpus, and the Leiden Weibo Corpus. The data has been cleaned and preprocessed for the context-based classifier prediction task.
创建时间:
2024-03-22



