Thermostatic/CCMatrix-English-Spanish
收藏Hugging Face2025-07-28 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/Thermostatic/CCMatrix-English-Spanish
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含英语和西班牙语文本的数据集,划分为训练集,共有超过四亿条样本,数据集大小约为100GB。
This dataset contains text in English and Spanish, split into a training set with over 400 million examples, totaling about 100GB in size.
提供机构:
Thermostatic



