minpeter/arxiv-abstracts-split
Source: Hugging Face. Updated: 2025-06-15. Indexed: 2025-10-25.
Download link:
https://hf-mirror.com/datasets/minpeter/arxiv-abstracts-split
Description:
This dataset contains text data. Each record includes an addition time, a creation time, an ID, metadata (authors, full-text license, license, provenance, submitter, URL), a data source, and the text content. The dataset is divided into five parts of approximately 500,000 samples each; the parts differ in byte size. The download size of the whole dataset is about 1.7 GB, and the total size is about 3.4 GB.
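The record layout described above can be sketched as a small schema check. This is only an illustration: the key names (`added`, `created`, `id`, `metadata`, `source`, `text`, and the nested metadata keys), the string types, and the helper `missing_fields` are assumptions inferred from the description, not confirmed against the dataset itself, and the sample values are placeholders.

```python
from typing import List, TypedDict


class Metadata(TypedDict):
    # Assumed key names for the metadata fields listed in the description.
    authors: str
    full_text_license: str
    license: str
    provenance: str
    submitter: str
    url: str


class Record(TypedDict):
    # Assumed key names for the top-level fields listed in the description.
    added: str
    created: str
    id: str
    metadata: Metadata
    source: str
    text: str


def missing_fields(record: dict) -> List[str]:
    """Return the expected field names that are absent from a record."""
    top = [key for key in Record.__annotations__ if key not in record]
    meta = record.get("metadata") or {}
    nested = [
        f"metadata.{key}" for key in Metadata.__annotations__ if key not in meta
    ]
    return top + nested


# Placeholder record; none of these values come from the real dataset.
example = {
    "added": "placeholder",
    "created": "placeholder",
    "id": "placeholder",
    "metadata": {
        "authors": "placeholder",
        "full_text_license": "placeholder",
        "license": "placeholder",
        "provenance": "placeholder",
        "submitter": "placeholder",
        "url": "placeholder",
    },
    "source": "placeholder",
    "text": "placeholder",
}

print(missing_fields(example))
```

Running the check on a few real records would confirm or correct the assumed key names. With five parts of roughly 500,000 samples each, the whole dataset would hold on the order of 2.5 million records.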
Provider:
minpeter



