LocalDoc/azerbaijani_spell_corrector_dataset

Name: LocalDoc/azerbaijani_spell_corrector_dataset
Creator: LocalDoc
Published: 2024-12-04 06:02:12
License: cc-by-nc-4.0

Source: Hugging Face (updated 2024-12-04, indexed 2024-12-14)

Download link:

https://hf-mirror.com/datasets/LocalDoc/azerbaijani_spell_corrector_dataset

Description:

An Azerbaijani spelling-correction dataset intended for text-to-text generation tasks. Each record has two fields: incorrect_sentence, the sentence containing spelling errors, and correct_sentence, its corrected version. The dataset consists of a single train split with 1,350,991 examples and a total size of 308,737,084 bytes. The language is Azerbaijani (az), and the dataset is licensed under cc-by-nc-4.0.
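The record layout described above can be sketched in Python. This is a minimal illustration, not an official loader: it assumes the Hugging Face `datasets` package is installed and that the dataset is reachable under the ID shown on this page. The helper names `load_train_split` and `changed_words` are hypothetical, and the sample record is made up for illustration rather than taken from the dataset.

```python
import difflib

DATASET_ID = "LocalDoc/azerbaijani_spell_corrector_dataset"
NUM_EXAMPLES = 1_350_991   # train split size, from the description
TOTAL_BYTES = 308_737_084  # total size in bytes, from the description


def load_train_split(streaming=True):
    """Load the single train split (requires `datasets` and network access)."""
    # Imported lazily so the offline helpers below work without the package.
    from datasets import load_dataset
    return load_dataset(DATASET_ID, split="train", streaming=streaming)


def changed_words(record):
    """Return (wrong, right) word pairs that differ between the two fields."""
    wrong = record["incorrect_sentence"].split()
    right = record["correct_sentence"].split()
    matcher = difflib.SequenceMatcher(a=wrong, b=right, autojunk=False)
    pairs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            pairs.append((" ".join(wrong[i1:i2]), " ".join(right[j1:j2])))
    return pairs


# Hypothetical record for illustration only; NOT taken from the dataset.
sample = {
    "incorrect_sentence": "Mən mektebe gedirəm",
    "correct_sentence": "Mən məktəbə gedirəm",
}

# Rough average record size implied by the two figures in the description.
avg_bytes = TOTAL_BYTES / NUM_EXAMPLES
print(f"average size per example: {avg_bytes:.1f} bytes")
print(changed_words(sample))
```

Streaming is used as the default in the loader sketch so that inspecting a few records does not require downloading the full split first. Because the license is cc-by-nc-4.0, any use of the loaded data should stay non-commercial.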

Provider:

LocalDoc
