francis47/en_mg_dataset
收藏Hugging Face2024-10-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/francis47/en_mg_dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个主要字段:en和mg,均为字符串类型。数据集分为训练集和测试集,训练集包含3,778,194个样本,测试集包含944,549个样本。数据集的下载大小为492,714,923字节,总大小为703,966,723字节。默认配置下,训练集和测试集的数据文件路径分别为data/train-*和data/test-*。
The dataset contains two main fields: en and mg, both of which are of string type. The dataset is divided into a training set and a test set, with the training set containing 3,778,194 samples and the test set containing 944,549 samples. The download size of the dataset is 492,714,923 bytes, and the total size is 703,966,723 bytes. Under the default configuration, the data file paths for the training set and test set are data/train-* and data/test-* respectively.
提供机构:
francis47



