arthrod/new3_1excluded_exhibits_part3_9565.86mb
收藏Hugging Face2024-12-17 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/arthrod/new3_1excluded_exhibits_part3_9565.86mb
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了多个字段,如唯一标识符、收集时间戳、文件URL(可能为空)、主文件名、文档类型、提交的文件名和文档文件名。数据集分为训练集,共有约4016万条示例,总大小约为4.5GB。提供了默认配置,指定了训练集的数据文件路径。
The dataset includes fields such as unique identifier, collection timestamp, file URL (possibly null), master file name, document type, submission filename, and document filename. The dataset is split into a training set, which contains approximately 40.16 million examples and has a total size of about 4.5GB. A default configuration is provided, specifying the data file path for the training set.
提供机构:
arthrod



