five

LocalDoc/azerbaijani_spell_corrector_dataset

收藏
Hugging Face2024-12-04 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/LocalDoc/azerbaijani_spell_corrector_dataset
下载链接
链接失效反馈
官方服务:
资源简介:
阿塞拜疆语拼写纠正数据集,主要用于文本到文本生成任务,特别是拼写纠正。数据集包含两个主要特征:incorrect_sentence和correct_sentence,分别表示错误的句子和纠正后的句子。数据集分为一个训练集,包含1,350,991个例子,总大小为308,737,084字节。数据集的语言为阿塞拜疆语(az),许可证为cc-by-nc-4.0。

Azerbaijani Spell Correction Dataset, primarily used for text-to-text generation tasks, specifically for spell correction. The dataset includes two main features: incorrect_sentence and correct_sentence, representing incorrect sentences and their corrected versions, respectively. The dataset is divided into one training set containing 1,350,991 examples, with a total size of 308,737,084 bytes. The language of the dataset is Azerbaijani (az), and it is licensed under cc-by-nc-4.0.
提供机构:
LocalDoc
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作