Chinese_Classifier
Source: OpenCSG. Updated: 2024-03-22. Indexed: 2026-01-19.
Download link:
https://opencsg.com/datasets/OpenDataLab/Chinese_Classifier?tab=summary
Description:
Chinese classifiers (measure words) are function words used to express quantity, and they are particularly difficult for language learners. This dataset supports the task of predicting the Chinese classifier from its context. It contains a large number of example sentences showing classifier usage, drawn from three corpora: the Lancaster Corpus of Mandarin Chinese, the UCLA Written Chinese Corpus, and the Leiden Weibo Corpus. The data has been cleaned and processed for the context-based classifier prediction task.
Provider:
OpenDataLab
Created:
2024-03-22



