arthrod/new3_5excluded_exhibits_part0_20204.72mb
Source: Hugging Face. Updated: 2024-12-17. Indexed: 2024-12-21.
Download link:
https://hf-mirror.com/datasets/arthrod/new3_5excluded_exhibits_part0_20204.72mb
Description:
This dataset has eight features: unique identifier (_id), collection timestamp (timestamp_collection), submission URL (submission_url), master file (master_file), document type (document_type), submission filename (submission_filename), document filename (document_filename), and SEC header completeness (sec-header-complete). It stores metadata about document submissions and may be useful for analyzing submission patterns, time distributions, or the mix of document types. It has a single training split of 2,683,476 samples totaling 17,384,051,562 bytes (about 17.4 GB).
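The kind of document-type analysis mentioned above could be sketched as follows. This is a minimal illustration, not the dataset author's method: only the eight feature names come from the description, while the helper functions, the sample rows, and every value in them are hypothetical, and the feature types are not stated in the source.

```python
from collections import Counter
from typing import Any, Iterable, List, Mapping

# Feature names exactly as listed in the dataset description.
FEATURES = (
    "_id",
    "timestamp_collection",
    "submission_url",
    "master_file",
    "document_type",
    "submission_filename",
    "document_filename",
    "sec-header-complete",
)


def missing_features(row: Mapping[str, Any]) -> List[str]:
    """Return the listed feature names that are absent from a row."""
    return [name for name in FEATURES if name not in row]


def count_document_types(rows: Iterable[Mapping[str, Any]]) -> Counter:
    """Tally how many rows carry each document_type value."""
    return Counter(row["document_type"] for row in rows)


def make_row(row_id: str, document_type: str) -> dict:
    """Build a hypothetical row; all values are placeholders, not real data."""
    return {
        "_id": row_id,
        "timestamp_collection": "2024-01-01T00:00:00",  # assumed string format
        "submission_url": "https://example.com/submission.txt",
        "master_file": "master.idx",
        "document_type": document_type,
        "submission_filename": "submission.txt",
        "document_filename": "exhibit.htm",
        "sec-header-complete": "{}",  # assumed type; the source does not say
    }


# Hypothetical in-memory sample standing in for real dataset rows.
sample_rows = [
    make_row("a1", "EX-10.1"),
    make_row("a2", "EX-10.1"),
    make_row("a3", "EX-99.1"),
]

# A row with one feature dropped, to show the schema check.
incomplete_row = {k: v for k, v in sample_rows[0].items() if k != "sec-header-complete"}

type_counts = count_document_types(sample_rows)
print(type_counts.most_common())
print(missing_features(incomplete_row))
```

To run the same tally on the real data, one option would be to stream rows with the Hugging Face `datasets` library (`load_dataset(..., split="train", streaming=True)`) and pass them to `count_document_types`, which avoids downloading the full split up front.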
Provider:
arthrod



