laion/biorXiv-pdf
收藏Hugging Face2024-10-18 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/laion/biorXiv-pdf
下载链接
链接失效反馈官方服务:
资源简介:
BiorXiv PDF数据集是一个从BiorXiv网站收集的PDF文档集合,旨在为研究人员提供易于获取的训练数据集,以促进人工智能研究的民主化。该数据集包含生物及相关学科领域的预印本文章,由冷泉港实验室(CSHL)和扎克伯格·陈计划运营。预计研究人员和爱好者将使用这个数据集来训练和开发开创性的科学领域特定模型,以及为特定体裁和应用微调现有模型。
The BiorXiv PDF dataset is a collection of PDF documents gathered from the BiorXiv website, aiming to provide researchers with easily accessible training datasets to democratize artificial intelligence research. This dataset contains preprint papers in the fields of biology and related disciplines, operated by Cold Spring Harbor Laboratory (CSHL) and the Chan Zuckerberg Initiative. It is anticipated that researchers and enthusiasts will use this dataset to train and develop groundbreaking domain-specific models in the scientific field, as well as to fine-tune existing models for specialized genres and applications.
提供机构:
laion



