openalex-metadata
收藏魔搭社区2025-11-27 更新2025-10-04 收录
下载链接:
https://modelscope.cn/datasets/laion/openalex-metadata
下载链接
链接失效反馈官方服务:
资源简介:
<div align="center">
<img src="openalex.jpg" alt="ChemrXiv Pdf" width="500"/>
<p><b>ChemrXiv Pdf</b></p>
</div>
**OpenAlex Metadata** is a dataset consisting of information and details on over **258 million** research papers, including both closed and open access publications. This dataset is one of the largest known collections of research papers published to date. Our initiative aims to democratize access to collective human knowledge and to support the development of domain-specific artificial intelligence models in the scientific community.
In the near future, we plan to enhance our efforts by implementing a matching mechanism that will allow users to search this metadata index. This feature will enable researchers to discover relevant research papers based on the metadata we have made available, facilitating greater accessibility and understanding of existing research.
### Dataset information
**Indexed:** October 2024
**Total research paper info:** 258,602,038
**Where this dataset was sourced?**
We extracted it from the OpenAlex website.
**What format dataset is stored?**
JSON (GZIPPED)
**What are the JSON keys?**
1. ID: OpenAlex link
2. DOI: Paper’s DOI
3. Title: Paper title
<div align="center">
<img src="openalex.jpg" alt="ChemrXiv Pdf" width="500"/>
<p><b>ChemrXiv 论文PDF</b></p>
</div>
**OpenAlex 元数据(OpenAlex Metadata)** 是收录超2.58亿篇学术论文信息与细节的数据集,涵盖闭源与开源两类出版的研究成果,是目前已知规模最大的学术论文馆藏数据集之一。本项目旨在推动人类集体知识的民主化获取,并为科学界开发领域专用人工智能模型提供支持。
近期,我们计划通过引入匹配机制优化服务,使用户可检索该元数据索引。该功能将帮助研究人员基于公开的元数据发现相关学术论文,提升现有研究成果的可及性与理解深度。
### 数据集信息
**索引时间:** 2024年10月
**总学术论文条目数:** 258,602,038
**数据集来源:** 本数据集从OpenAlex官方网站提取所得。
**数据集存储格式:** JSON (GZIPPED)
**JSON字段说明:**
1. ID:OpenAlex官方链接
2. DOI:论文的数字对象唯一标识符(DOI)
3. Title:论文标题
提供机构:
maas
创建时间:
2025-10-03



