CoNLL-2014 Shared Task: Grammatical Error Correction
Source: DataCite Commons | Updated: 2020-10-10 | Indexed: 2024-07-13
Download link:
http://www.comp.nus.edu.sg/~nlp/conll14st.html
Resource description:
The NUS Corpus of Learner English (NUCLE) was collected in a collaborative project between the National University of Singapore (NUS) Natural Language Processing (NLP) Group, led by Prof. Hwee Tou Ng, and the NUS Centre for English Language Communication (CELC), led by Prof. Siew Mei Wu. The work was carried out as part of the PhD thesis research of Daniel Dahlmeier at the NUS NLP Group. The corpus consists of about 1,400 essays written by NUS students on a wide range of topics, such as environmental pollution and healthcare. It contains over one million words, fully annotated with error tags and corrections. All annotations were performed by professional English instructors at the NUS CELC. The corpus is distributed under the standard NUS licensing agreement, the terms and conditions for which are provided below. Please read the terms and conditions carefully. To download and reuse this dataset, please refer to the instructions at https://doi.org/10.25542/BC41-IY5R. Related publication: Hwee Tou Ng, Siew Mei Wu, Ted Briscoe, Christian Hadiwinoto, Raymond Hendy Susanto, and Christopher Bryant (2014). The CoNLL-2014 Shared Task on Grammatical Error Correction. In Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task (CoNLL-2014 Shared Task). Baltimore, Maryland.
Provider:
National University of Singapore
Created:
2017-11-06
Collected summary
Dataset introduction

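Background and challenges
Background overview
The CoNLL-2014 shared task dataset targets grammatical error correction in text written by non-native English speakers, supporting the development of automated grammar checking tools. It includes annotated training data and blind test data, and participating systems are evaluated with the F0.5 metric.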
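
As an illustration of the F0.5 metric mentioned above, the following is a minimal sketch of the general F-beta formula, F = (1 + beta^2) * P * R / (beta^2 * P + R), applied to counts of system edits. With beta = 0.5, precision is weighted twice as heavily as recall. The function name `f_beta`, the count-based inputs, and the choice to return 0.0 when precision or recall is undefined are assumptions made for this example; this is not the shared task's official scorer, which may handle edit matching and edge cases differently.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """Compute F-beta from edit counts (hypothetical helper, not the official scorer).

    tp: proposed edits that match a gold-standard edit
    fp: proposed edits that match no gold-standard edit
    fn: gold-standard edits the system did not propose
    """
    # Assumption: treat undefined precision/recall as 0.0 for simplicity.
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # Made-up counts: 30 correct edits, 10 spurious edits, 30 missed edits,
    # giving precision 0.75 and recall 0.5.
    print(f"F0.5 = {f_beta(30, 10, 30):.4f}")
    print(f"F1   = {f_beta(30, 10, 30, beta=1.0):.4f}")
```

In this made-up case the F0.5 score is higher than the F1 score because precision exceeds recall, which reflects the metric's preference for systems that propose fewer but more reliable corrections.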