nick007x/arxiv-papers
收藏Hugging Face2025-10-14 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nick007x/arxiv-papers
下载链接
链接失效反馈官方服务:
资源简介:
Complete ArXiv Papers Dataset是一个包含完整ArXiv科学论文档案的数据集,按照主题类别和出版年份组织。它包含了物理学、计算机科学、数学、统计学等多个领域的论文,以ZIP压缩的PDF格式存储,提供了丰富的元数据。这个数据集适合用于科学文档理解、多模态检索、科学自然语言处理等多种研究和应用。
The Complete ArXiv Papers Dataset is a collection of the full ArXiv scientific papers archive, organized by subject categories and publication years. It includes papers from fields such as physics, computer science, mathematics, statistics, and more, stored in ZIP-compressed PDF format with extensive metadata. This dataset is suitable for scientific document understanding, multi-modal retrieval, scientific natural language processing, and various other research and applications.
提供机构:
nick007x



