minashirinchi/perspell-tokens-half-labeled
收藏Hugging Face2025-08-04 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/minashirinchi/perspell-tokens-half-labeled
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含错误和正确文本字符串对,以及相关的输入ID、注意力掩码、标签、令牌和单词ID等信息。数据集分为训练集、验证集和测试集,分别包含1957294、489324和490450个示例。数据集总大小为3.4GB,下载大小为1.1GB。
The dataset includes pairs of erroneous and corrected text strings, along with related information such as input IDs, attention masks, labels, tokens, and word IDs. The dataset is split into training, validation, and test sets, containing 1,957,294, 489,324, and 490,450 examples respectively. The total size of the dataset is 3.4GB, and the download size is 1.1GB.
提供机构:
minashirinchi



