FCGEC
收藏arXiv2022-10-22 更新2024-06-21 收录
下载链接:
https://github.com/xlxwalex/FCGEC
下载链接
链接失效反馈官方服务:
资源简介:
FCGEC是一个大规模的细粒度中文语法错误修正数据集,由浙江大学计算机科学与技术学院创建。该数据集包含41,340个句子,主要从公共学校中文考试的多项选择题中收集,并由人工标注。数据集旨在检测、识别和纠正语法错误,提供多个参考答案,以支持模型的评估和改进。FCGEC数据集的应用领域包括教育、媒体和出版,旨在解决中文语法错误修正的问题,提高文本的准确性和流畅性。
FCGEC is a large-scale fine-grained Chinese grammatical error correction (GEC) dataset constructed by the College of Computer Science and Technology, Zhejiang University. This dataset consists of 41,340 sentences, which are mainly collected from multiple-choice questions in public school Chinese proficiency exams and manually annotated. It is designed to detect, identify and correct grammatical errors, and provides multiple reference answers to support the evaluation and improvement of relevant models. The application fields of FCGEC cover education, media and publishing, with the objective of addressing Chinese grammatical error correction tasks and enhancing the accuracy and fluency of texts.
提供机构:
浙江大学计算机科学与技术学院
创建时间:
2022-10-22
搜集汇总
数据集介绍

背景与挑战
背景概述
FCGEC是一个由浙江大学创建的大规模细粒度中文语法错误修正数据集,包含41,340个从公共学校中文考试中收集并人工标注的句子。该数据集旨在通过检测、识别和纠正语法错误,提供多个参考答案,支持模型评估和改进,应用于教育、媒体和出版领域,以提升文本准确性和流畅性。
以上内容由遇见数据集搜集并总结生成



