CEFR-SP
收藏数据集概述
数据集名称: CEFR-Based Sentence Difficulty Annotation and Assessment
数据集内容: 包含17,000个英语句子,这些句子由英语教育专业人士根据CEFR(Common European Framework of Reference for Languages)标准进行难度标注。
数据集结构:
- 数据集文件位于
/CEFR-SP目录。 - CEFR难度评估模型的代码位于
/src目录。
引用信息:
-
若在研究中使用此数据集,请引用以下文献:
Yuki Arase, Satoru Uchida, and Tomoyuki Kajiwara. 2022. CEFR-Based Sentence-Difficulty Annotation and Assessment. in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2022) (Dec. 2022).
@inproceedings{arase:emnlp2022, title = "{CEFR}-Based Sentence-Difficulty Annotation and Assessment", author = "Arase, Yuki and Uchida, Satoru, and Kajiwara, Tomoyuki", booktitle = "Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)", month = dec, year = "2022", }




