parameterlab/scaling_mia_the_pile_00_PubMed_Abstracts
Source: Hugging Face. Updated: 2024-09-24. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/parameterlab/scaling_mia_the_pile_00_PubMed_Abstracts
Description:
The dataset contains text records with associated metadata. The text is stored as a string, and the metadata includes a field named pile_set_name. The dataset is divided into training, validation, and test splits containing 977,059, 29,871, and 29,895 examples respectively. The total dataset size is 1,407,206,775 bytes, and the download size is 787,216,517 bytes.
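As a quick sanity check on the figures in the description, the following minimal sketch derives the total example count, the share of each split, the download-to-dataset size ratio, and the average size per example. The numbers are copied from the description; the dictionary keys (`train`, `validation`, `test`) and the helper name `split_shares` are illustrative assumptions, not identifiers confirmed by the listing.

```python
# Figures copied from the dataset description; key names are assumed labels.
SPLIT_SIZES = {"train": 977_059, "validation": 29_871, "test": 29_895}
DATASET_BYTES = 1_407_206_775   # total dataset size in bytes
DOWNLOAD_BYTES = 787_216_517    # download size in bytes


def split_shares(sizes):
    """Return each split's fraction of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


total_examples = sum(SPLIT_SIZES.values())
shares = split_shares(SPLIT_SIZES)
download_ratio = DOWNLOAD_BYTES / DATASET_BYTES
avg_bytes_per_example = DATASET_BYTES / total_examples

print(f"total examples: {total_examples}")
for name, share in shares.items():
    print(f"{name}: {share:.2%}")
print(f"download / dataset size: {download_ratio:.3f}")
print(f"average bytes per example: {avg_bytes_per_example:.1f}")
```

The training split accounts for roughly 94% of the examples, with the validation and test splits at just under 3% each.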
Provider:
parameterlab



