
theaayushbajaj/10-X-raw-v1

Source: Hugging Face. Updated 2024-11-23; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/theaayushbajaj/10-X-raw-v1
Overview:
This dataset contains processed SEC 10-X filings (10-K and 10-Q) from 1993 to 2023. It provides structured access to two sections of corporate financial reports: Risk Factors (Item 1A) and Management's Discussion and Analysis (MD&A, Item 7). The data has been processed for use in natural language processing and financial analysis tasks.

The parsing pipeline covers Risk Factors extraction, MD&A extraction, data organization, and format conversion. The dataset is stored in Parquet format with four fields: Central Index Key (CIK), filing date, the extracted Risk Factors section, and the extracted MD&A section.
Provider:
theaayushbajaj