A Multimodal Dataset of Financial Disclosures, MD&A, and Audit Opinions with Next-Year Bankruptcy Labels

Source: NIAID Data Ecosystem (indexed 2026-05-10)
Download link:
https://figshare.com/articles/dataset/A_Multimodal_Dataset_of_Financial_Disclosures_MD_A_and_Audit_Opinions_with_Next-Year_Bankruptcy_Labels/30305341
Description:
We publicly release a multimodal dataset derived from 10-K annual reports to support research on next-year bankruptcy prediction. For each report, we collected all reported financial figures, the corresponding Management Discussion & Analysis (MD&A), and the Audit Opinion text, along with a bankruptcy label indicating whether the company filed for bankruptcy in the year following the report’s release.

This dataset is designed to encourage research in this challenging area by presenting several key difficulties, including:
- Extreme class imbalance
- Multimodality: integration of both tabular and textual data
- Multisource heterogeneity: signals from the three sources may align or even contradict one another
- NLP-related challenges: long documents with substantial portions of text that may be irrelevant to the bankruptcy outcome

This dataset accompanies our paper "A Multimodal Alignment-Based Anomaly Detection Method for Bankruptcy Prediction", published at the 6th ACM International Conference on AI in Finance.

Authors: Andreas Sideras, Konstantinos Bougiatiotis, Elias Zavitsanos, Georgios Paliouras, George Vouros
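The extreme class imbalance noted in the description is often handled by weighting each class inversely to its frequency. The following is a minimal sketch of that general idea, not the method from the accompanying paper. The function name `balanced_class_weights` and the toy label list are hypothetical and do not reflect the dataset's actual schema or class ratio.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Return {label: weight}, where weight = n_samples / (n_classes * count).

    Rare classes receive proportionally larger weights, so a weighted loss
    does not get dominated by the majority class.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {
        label: n_samples / (n_classes * count)
        for label, count in counts.items()
    }


# Hypothetical toy labels (an assumption for illustration only):
# 1 = company filed for bankruptcy in the following year, 0 = it did not.
toy_labels = [0] * 98 + [1] * 2
weights = balanced_class_weights(toy_labels)
print(weights)
```

Weights like these could be passed to a weighted loss or used for resampling; whether that is appropriate for this dataset depends on the evaluation protocol chosen.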
Created:
2025-10-08