Aletheia-ng/noisy_dataset
收藏Hugging Face2025-01-28 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/Aletheia-ng/noisy_dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和相关特征,如文本内容、数据集名称、脚本类型、语言和脚本组合以及带噪声的文本。数据集分为训练集、验证集和测试集,分别包含60000、4000和20000个示例。适合用于文本处理和特征分析任务。
The dataset includes text and related features such as text content, dataset name, script type, language-script combination, and noisy text. It is split into training, validation, and test sets, containing 60,000, 4,000, and 20,000 examples respectively. Suitable for text processing and feature analysis tasks.
提供机构:
Aletheia-ng



