Quality Excellence Text Dataset
Source: DataCite Commons | Updated: 2026-05-05 | Indexed: 2026-05-07
Download link:
https://zenodo.org/doi/10.5281/zenodo.20034959
Description:
The deposited materials include: (1) results across validation settings, random seeds, and cross-validation folds; (2) sentence-level attention weights and SHAP (SHapley Additive exPlanations) outputs used for repository construction; (3) the topic-assignment outputs underlying the BERTopic analyses; and (4) code for preprocessing, model training, explanation extraction, and topic modelling. The raw annual reports were obtained from publicly available corporate disclosures of Chinese A-share listed firms. Because these reports are third-party source documents, readers should obtain the original annual reports from the corresponding public disclosure platforms.
Provider:
Zenodo
Created:
2026-05-05



