arthrod/new3_5excluded_exhibits_part0_20204.72mb

Source: Hugging Face. Updated 2024-12-17; indexed 2024-12-21.
Download link:
https://hf-mirror.com/datasets/arthrod/new3_5excluded_exhibits_part0_20204.72mb
Description:
This dataset has eight features: a unique identifier (_id), a collection timestamp (timestamp_collection), a submission URL (submission_url), a master file (master_file), a document type (document_type), a submission filename (submission_filename), a document filename (document_filename), and an SEC header completeness flag (sec-header-complete). It stores metadata about document submissions and could support analysis of submission patterns, time distributions, or document types. The dataset has a single training split of 2,683,476 samples with a total size of 17,384,051,562 bytes (about 17.4 GB).
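As a rough illustration of the schema described above, the sketch below lists the eight feature names, derives the average sample size from the stated totals, and tallies document types over a few records. The feature names and totals come from the description; the helper names (`average_bytes_per_sample`, `missing_features`, `count_document_types`) and the sample records are hypothetical, and the value types of the columns are not stated in the source.

```python
from collections import Counter

# Feature names exactly as listed in the dataset description.
# Their value types are not documented there, so none are assumed here.
FEATURES = [
    "_id",
    "timestamp_collection",
    "submission_url",
    "master_file",
    "document_type",
    "submission_filename",
    "document_filename",
    "sec-header-complete",
]

# Totals stated in the description for the training split.
NUM_SAMPLES = 2_683_476
TOTAL_BYTES = 17_384_051_562


def average_bytes_per_sample(total_bytes=TOTAL_BYTES, num_samples=NUM_SAMPLES):
    """Return the mean size of one sample in bytes."""
    return total_bytes / num_samples


def missing_features(record):
    """Return the expected feature names that are absent from a record."""
    return [name for name in FEATURES if name not in record]


def count_document_types(records):
    """Count how often each document_type value occurs."""
    return Counter(record["document_type"] for record in records)


if __name__ == "__main__":
    # Hypothetical records for illustration only; not real rows of the dataset.
    sample = [
        {"_id": "a1", "document_type": "EX-10.1"},
        {"_id": "a2", "document_type": "EX-99.1"},
        {"_id": "a3", "document_type": "EX-10.1"},
    ]
    print(f"{average_bytes_per_sample():.1f} bytes per sample on average")
    print(count_document_types(sample))
    print(missing_features(sample[0]))
```

The same helpers would work on any iterable of dict-like rows, for example rows streamed from the training split, provided each row exposes the column names listed above.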
Provider:
arthrod