arthrod/new3_1excluded_exhibits_part5_9569.67mb
Source: Hugging Face. Updated: 2024-12-17. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/arthrod/new3_1excluded_exhibits_part5_9569.67mb
Description:
The dataset includes fields such as a unique identifier (_id), a collection timestamp (timestamp_collection), and a document type (document_type), among others. It has a single training split (train) of approximately 4.89 GB containing about 40.165 million samples, which accounts for the entire dataset size. These figures suggest a large-scale dataset intended for text processing or analysis.
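As a rough illustration of how the fields described here might be inspected, the sketch below tallies document_type over a small number of records. It assumes the repository is still publicly reachable on Hugging Face under the ID shown in the download link, that the Hugging Face `datasets` package is installed, and that each streamed record is a dict with a document_type key. The helper names `count_document_types` and `stream_train_split` are hypothetical and not part of the dataset or any library.

```python
from collections import Counter
from itertools import islice
from typing import Iterable, Mapping

# Repository ID taken from the download link in this listing.
REPO_ID = "arthrod/new3_1excluded_exhibits_part5_9569.67mb"


def count_document_types(records: Iterable[Mapping], limit: int = 1000) -> Counter:
    """Tally the document_type field over at most `limit` records."""
    counts = Counter()
    for record in islice(records, limit):
        # .get() tolerates records without the field (assumption: it may be absent).
        counts[record.get("document_type")] += 1
    return counts


def stream_train_split():
    """Return the train split as a lazy stream (needs `datasets` and network access)."""
    # Imported lazily so count_document_types works without the package installed.
    from datasets import load_dataset

    # streaming=True avoids downloading the full ~4.89 GB split up front.
    return load_dataset(REPO_ID, split="train", streaming=True)


if __name__ == "__main__":
    print(count_document_types(stream_train_split(), limit=1000))
```

Streaming is used here because the split is large; sampling the first thousand records gives only a quick, non-representative look at the document types.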
Provider:
arthrod



