kanwal-mehreen18/c4_romance_splits
收藏Hugging Face2025-10-11 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/kanwal-mehreen18/c4_romance_splits
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个多语言数据集,包含西班牙语、法语、意大利语和葡萄牙语四种语言的文本数据。每个语言都有训练集和验证集,适用于机器学习模型的训练和验证。数据集的特征包括文本内容、时间戳和URL。
This dataset is a multilingual dataset containing text data in Spanish, French, Italian, and Portuguese. Each language has its own training and validation sets, suitable for training and validating machine learning models. The features of the dataset include text content, timestamp, and URL.
提供机构:
kanwal-mehreen18



