Ro551/WikiCorrupted_spanish_to_GEC-GED
收藏Hugging Face2026-04-28 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Ro551/WikiCorrupted_spanish_to_GEC-GED
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含句子及其被篡改版本的数据集,旨在用于文本错误检测和纠正。数据集中的每个样本包括原始句子、被篡改的句子、分词、错误标签以及错误类型。数据集分为训练集、验证集和测试集,可用于机器学习模型的训练和评估。
This dataset includes sentences and their corrupted versions, designed for text error detection and correction. Each sample in the dataset contains the original sentence, the corrupted sentence, tokenization, error tags, and error types. The dataset is split into training, validation, and test sets for machine learning model training and evaluation.
提供机构:
Ro551



