laion/Pes2o-Abstract-X
Source: Hugging Face | Updated: 2024-09-03 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/laion/Pes2o-Abstract-X
Description:
Pes2o-X, also known as Pes2o-Abstract-X, is a dataset derived from the original Pes2o dataset released by Allen AI. Pes2o aims to provide a large corpus of open-access research papers, including both abstracts and full text. As part of its project X, LAION AI reorganized version 2 of Pes2o, extracted the abstracts, and compiled them into Pes2o-Abstract-X. The dataset contains 30.57M research paper abstracts and preserves all of the original metadata. Pes2o-X and project X as a whole aim to develop high-quality abstractions from these datasets, both to support the development of advanced AI models and to make existing large language model pipelines more modular.
Provider:
laion
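
With 30.57M abstracts, it may be more practical to inspect a few records before downloading anything in full. The following is a minimal sketch, not an official loading recipe. It assumes the third-party `datasets` library is installed and that the mirror host from the download link can be used through the `HF_ENDPOINT` environment variable. The helper names `dataset_page` and `peek` are hypothetical, and because the listing does not state split or column names, the code takes whichever split comes first.

```python
import os
from itertools import islice

DATASET_ID = "laion/Pes2o-Abstract-X"  # repository ID from the listing
MIRROR = "https://hf-mirror.com"       # mirror host from the download link


def dataset_page(endpoint: str = MIRROR) -> str:
    """Build the dataset page URL for a given Hub endpoint (hypothetical helper)."""
    return f"{endpoint.rstrip('/')}/datasets/{DATASET_ID}"


def peek(n: int = 3, endpoint: str = MIRROR) -> list:
    """Stream the first n records instead of downloading the whole corpus."""
    # HF_ENDPOINT has to be set before the Hugging Face libraries are imported.
    # setdefault keeps any endpoint the user has already configured.
    os.environ.setdefault("HF_ENDPOINT", endpoint)
    from datasets import load_dataset  # third-party: pip install datasets

    # Without a split argument, streaming mode returns a dict of iterable splits.
    splits = load_dataset(DATASET_ID, streaming=True)
    # Assumption: split names are unknown here, so use the first one available.
    first_split = next(iter(splits.values()))
    return list(islice(first_split, n))
```

Calling `peek()` requires network access. Printing `sorted(record.keys())` for each returned record is a quick way to see which metadata fields the dataset actually carries.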



