CCF corpus
Source: arXiv. Indexed: 2025-09-30
Download link:
https://www.ccf.org.cn/Academic_Evaluation/By_category/
Resource description:
This dataset contains 122,446 artificial intelligence papers published between 2005 and 2019, drawn from the AI journals and conferences of the China Computer Federation (CCF). The papers were converted from PDF to XML with the GROBID tool, and each record includes the paper's title, countries, institutions, and references. The dataset can be used for tasks such as statistical analysis, dissemination analysis, extraction of AI-related markers, and section classification.
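As an illustration of how the listed fields might be read from a GROBID-converted file, here is a minimal Python sketch. It assumes the XML follows GROBID's usual TEI output (namespace `http://www.tei-c.org/ns/1.0`, affiliations in the header, references under `listBibl`); the corpus's actual file layout is not described here, and the function name `parse_tei` and the returned keys are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace used by TEI documents, which is the format GROBID emits.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}


def _text(elem):
    """Return the whitespace-normalized text of an element, or '' if it is missing."""
    if elem is None:
        return ""
    return " ".join("".join(elem.itertext()).split())


def _unique(values):
    """Drop empty strings and duplicates while keeping first-seen order."""
    return list(dict.fromkeys(v for v in values if v))


def parse_tei(xml_string):
    """Extract title, institutions, countries, and reference titles.

    Assumption: the input is a single TEI document shaped like GROBID output.
    """
    root = ET.fromstring(xml_string)
    header = root.find("tei:teiHeader", TEI_NS)

    title = ""
    institutions, countries = [], []
    if header is not None:
        title = _text(header.find(".//tei:titleStmt/tei:title", TEI_NS))
        # Author affiliations live in the header; each may name an
        # institution and a country.
        for aff in header.iterfind(".//tei:affiliation", TEI_NS):
            for org in aff.iterfind("tei:orgName[@type='institution']", TEI_NS):
                institutions.append(_text(org))
            countries.append(_text(aff.find(".//tei:country", TEI_NS)))

    # Each cited work is a biblStruct inside listBibl; keep its first title.
    references = [
        _text(bibl.find(".//tei:title", TEI_NS))
        for bibl in root.iterfind(".//tei:listBibl/tei:biblStruct", TEI_NS)
    ]

    return {
        "title": title,
        "institutions": _unique(institutions),
        "countries": _unique(countries),
        "references": [r for r in references if r],
    }
```

Calling `parse_tei` on the contents of one XML file returns a plain dictionary, which can then be aggregated across the corpus for the country-level or institution-level statistics mentioned above.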
Provider:
China Computer Federation (CCF)



