Wells Fargo 8-K reports database 2015 - 2019
收藏doi.org2025-03-24 收录
下载链接:
http://doi.org/10.17632/z8k5cdsvc2.1
下载链接
链接失效反馈官方服务:
资源简介:
We create a dataset which focuses on 8-K reports for the years 2015 - 2019 for Wells Fargo. An 8-K is a report of unscheduled material events or corporate changes at a company that could be of importance to the shareholders or the Securities and Exchange Commission (SEC). Also known as a Form 8K, the report notifies the public of events, including acquisitions, bankruptcy, the resignation of directors, or changes in the fiscal year. We have compiled this dataset, thanks to SEC's EDGAR tool.
The texts were pre-processed by applying a classical pipeline :
- removal of non-alphanumeric characters;
- lemmatisation;
- removal of rare words and stopwords: we obtain a dictionary of 4377 distinct roots for the whole corpus.
This company published 672 reports for the years 2015 and 2019.
The file is a list of two items. The first item is composed of all information about the 8K and extracted texts. The second item is the document-term matrix with the pre-processed texts with 672 texts and 4377 words.
An example of 8-K can be found here https://www.sec.gov/files/form8-k.pdf.
本团队精心构建了一个专注于2015年至2019年Wells Fargo公司8-K报告的语料库。所谓8-K报告,系指公司针对可能对股东或美国证券交易委员会(SEC)产生重要影响的非计划性重大事件或公司变革所编制的公告。亦称表8-K,此类报告旨在向公众通报包括并购、破产、董事辞职或财年变更等事件。得益于SEC的EDGAR工具,本语料库得以编纂。文本预处理流程采用经典管道进行处理:包括移除非字母数字字符、词干提取以及移除罕见词汇和停用词:整个语料库共收集到4377个不同的词根。该公司在2015年和2019年共发布672份报告。文件包含两项内容:第一项为8-K报告及其提取文本的全部信息;第二项为包含672篇文本和4377个单词的文档-词矩阵。8-K报告的示例可参考以下链接:https://www.sec.gov/files/form8-k.pdf。
提供机构:
doi.org



