
Chinese-Fill-in-the-Blank (CFITB)

Science Data Bank | Updated 2022-02-15 | Indexed 2026-04-23
Download link:
https://www.scidb.cn/en/detail?dataSetId=5189727dd1ce4a6caab5d9a38b2d8819
Resource description:
To enrich the data available for Chinese lexical choice, this work uses the People's Daily corpus as the initial corpus source and constructs a Chinese test dataset, Chinese-Fill-in-the-Blank (CFITB), whose target words cover three parts of speech: nouns, verbs and adjectives. The CFITB dataset contains 500 test samples. Each test sample has three parts: "number", "test sentence" and "candidate". Each test sentence contains one target word, which is deleted and replaced with "__". The corresponding candidate contains five Chinese words, and substituting each word into "__" yields five candidate sentences. The modeling goal for this dataset is to find the target word, that is, the word whose candidate sentence is the most standard of the five in both semantics and grammar.
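The sample layout and selection task described above can be sketched in Python. This is only an illustration under stated assumptions: the class name `CFITBSample`, the field names, the example sentence, and the toy scorer are all hypothetical and do not come from the dataset or its authors. A real system would replace `toy_score` with a language model or another fluency measure.

```python
from dataclasses import dataclass
from typing import Callable, List

BLANK = "__"  # placeholder that replaces the target word, per the description


@dataclass
class CFITBSample:
    # Hypothetical field names mirroring "number", "test sentence", "candidate".
    number: int
    sentence: str          # test sentence containing the "__" placeholder
    candidates: List[str]  # five candidate Chinese words


def candidate_sentences(sample: CFITBSample) -> List[str]:
    """Substitute each candidate word into the blank."""
    return [sample.sentence.replace(BLANK, word, 1) for word in sample.candidates]


def choose_word(sample: CFITBSample, score: Callable[[str], float]) -> str:
    """Return the candidate whose filled-in sentence scores highest."""
    filled = candidate_sentences(sample)
    best = max(range(len(filled)), key=lambda i: score(filled[i]))
    return sample.candidates[best]


# Invented example for illustration; NOT taken from the CFITB dataset.
sample = CFITBSample(1, "我们要__环境。", ["保护", "保卫", "保持", "保管", "保证"])

# Toy scorer (an assumption): counts character bigrams that also occur
# in a tiny reference text. It stands in for a real fluency model.
REFERENCE = "保护环境是每个人的责任"


def toy_score(sentence: str) -> float:
    bigrams = {sentence[i:i + 2] for i in range(len(sentence) - 1)}
    return float(sum(1 for b in bigrams if b in REFERENCE))


print(choose_word(sample, toy_score))
```

Passing the scorer in as a function keeps the substitution step separate from the scoring method, so different models can be compared on the same 500 samples without changing the data handling.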
Provided by:
Qinghai Normal University; Minzu University of China; National Language Resources Monitoring and Research Center for Minority Languages
Created:
2021-12-31