five

ichanchiu/Summarized_10K-MDA

收藏
Hugging Face2024-12-03 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ichanchiu/Summarized_10K-MDA
下载链接
链接失效反馈
官方服务:
资源简介:
Summarized 10-K MD&A数据集提供了公开交易公司10-K文件的简明机器生成摘要。这些文件来源于SEC EDGAR数据库,旨在促进金融文本分析,如摘要生成、情感分析和财务披露研究。数据集包含98,100行数据,主要列包括CIK(公司中央索引键)、Form Type(文件类型,如10-K)、Filing Date(文件日期)、Accession Number(文件唯一标识符)和Summary(10-K文件内容的AI生成摘要)。数据集的使用意图包括训练金融摘要模型、分析财务披露中的情感以及研究财务报告的趋势。数据集的局限性包括摘要可能遗漏重要细节,且主要关注美国公司。

The **Summarized 10-K MD&A** dataset provides concise, machine-generated summaries of 10-K filings for publicly traded companies. These filings are sourced from the SEC EDGAR database, and the dataset is designed to facilitate financial text analysis, such as summarization, sentiment analysis, and financial disclosure studies. The dataset includes key features such as the company identifier CIK, form type, filing date, unique identifier for the filing, and an AI-generated summary. The dataset is intended for training financial summarization models, analyzing sentiment within financial disclosures, and investigating trends in financial reporting over time. Limitations include the potential omission of important details in the summaries and the focus on U.S.-based companies.
提供机构:
ichanchiu
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作