five

Financial fraud dataset of Chinese listed companies (2015-2020)

收藏
科学数据银行2025-04-17 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=b3579289c15c4a7f840f2279d3cd4574
下载链接
链接失效反馈
官方服务:
资源简介:
The data is sourced from the CSMAR database, covering violation records of companies listed on the Shanghai and Shenzhen stock exchanges from 2015 to 2020, focusing on five types of financial fraud: fictitious profits, inflated assets, false records, material omissions, and inaccurate disclosures. After excluding financial firms, the fraud sample set includes 2,652 violation records from 1,226 companies. Additionally, 2,652 high-quality companies without fraud were selected from the CNRDS ESG rating database to form the non-fraud sample set. The dataset consists of two parts:  1) Structured data: The file "financial fraud dataset (structured data).xlsx" contains 5,304 records covering 43 fields, such as basic company information, financial indicators, structural indicators, and linguistic features of annual report texts. Field names are listed in Table 1.  2) Annual report text data: The folder named "Annual report text data" includes 2,652 fraud samples (file names formatted as Symbol-Year.txt) and 2,652 non-fraud samples (same format). The files contain the MD&A sections of listed companies' annual reports.
提供机构:
Nie Hui
创建时间:
2025-04-16
二维码
社区交流群
二维码
科研交流群
商业服务