five

The Chinese and English Learner Language Corpus (CELL Corpus)

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/gs4ppd7sz3
下载链接
链接失效反馈
官方服务:
资源简介:
The Chinese and English Learner Language Corpus (referred to as ‘the CELL Corpus’ hereafter) is designed as a learner language corpus. The CELL Corpus, as a learner language corpus, is thus designed as a collection of text data chiefly composed of Chinese and English academic essay-type assignments written by university undergraduate students. The students submitted their academic essays as assignments for the assessment purpose of the courses they enrolled for, which suggests the authenticity of the text data collected. In addition to the text data, the CELL Corpus is also designed to include the meta data of the students whose academic essays were collected. The meta data collected represent five types of demographic information of the students, which are namely: age, gender, place of birth, first language and public examination results for Chinese Language and English Language. The two datasets (i.e. text data and meta data) of the CELL Corpus are delineated in the following sub-sections. The datasets uploaded on this webpage are solely utilized for the establishment of the CELL Corpus (https://cellcorpusouhk.com/). Approval from the authors must be sought if anyone would like to download the datasets for research purposes.
创建时间:
2022-02-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作