the-cramer-project/Misspelled-KG-dataset_wth_ID
Source: Hugging Face. Updated: 2025-04-22. Indexed: 2025-08-09.
Download link:
https://hf-mirror.com/datasets/the-cramer-project/Misspelled-KG-dataset_wth_ID
Description:
The dataset contains text data with four fields: the original text (clean), the text containing trash (junk) characters (trash), the trash-character text with punctuation (trash_punc), and a unique identifier for each example (ID). It provides a single training split and is suited to text cleaning and classification tasks.
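As a rough illustration of how these fields might be used for text cleaning, the sketch below pairs each noisy text with its clean counterpart. Only the field names (clean, trash, trash_punc, ID) come from the description; the helper `to_cleaning_pairs`, the sample rows, and the integer IDs are hypothetical and not taken from the dataset.

```python
from typing import Dict, Iterable, List, Tuple


def to_cleaning_pairs(
    rows: Iterable[Dict], noisy_field: str = "trash"
) -> List[Tuple[object, str, str]]:
    """Build (ID, noisy_text, clean_text) triples from dataset rows.

    `noisy_field` selects which noisy variant to use as model input:
    "trash" or "trash_punc".
    """
    if noisy_field not in ("trash", "trash_punc"):
        raise ValueError("noisy_field must be 'trash' or 'trash_punc'")
    return [(row["ID"], row[noisy_field], row["clean"]) for row in rows]


# Made-up rows that only mimic the described schema (assumption: real
# values and ID types in the dataset may look different).
sample_rows = [
    {"ID": 1, "clean": "hello world", "trash": "he#llo wor@ld", "trash_punc": "he#llo, wor@ld!"},
    {"ID": 2, "clean": "good day", "trash": "go%od d&ay", "trash_punc": "go%od d&ay."},
]

pairs = to_cleaning_pairs(sample_rows)
punc_pairs = to_cleaning_pairs(sample_rows, noisy_field="trash_punc")
```

Assuming the Hugging Face `datasets` library is installed and the repository is publicly accessible, real rows could likely be obtained with `load_dataset("the-cramer-project/Misspelled-KG-dataset_wth_ID", split="train")` and passed to the same helper.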
Provided by:
the-cramer-project



