omicseye/prokar_train
收藏Hugging Face2025-03-17 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/omicseye/prokar_train
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个预训练数据集,包含原核生物和古菌样本,具体包括2000种细菌的20M个样本和100种古菌的1M个样本,总计20,950,000个样本。
This dataset is a pretraining dataset containing prokaryotic and archaeal samples, specifically including 20M samples from 2000 bacterial species and 1M samples from 100 archaeal species, totaling 20,950,000 samples.
提供机构:
omicseye



