twnlp/csc_data
收藏Hugging Face2025-02-10 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/twnlp/csc_data
下载链接
链接失效反馈官方服务:
资源简介:
CSC数据集包含以下几部分数据:W271K共279,816条数据,Medical共39,303条数据,Lemon共22,259条数据,ECSpell共6,688条数据,CSCD共35,001条数据。该数据集似乎是用于中文错误校正的任务。
The CSC dataset consists of several parts: W271K with 279,816 entries, Medical with 39,303 entries, Lemon with 22,259 entries, ECSpell with 6,688 entries, and CSCD with 35,001 entries. This dataset seems to be used for Chinese error correction tasks.
提供机构:
twnlp



