omicseye/eukar_train
Source: Hugging Face. Updated: 2025-03-17. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/omicseye/eukar_train
Description:
This pretraining dataset consists exclusively of eukaryotic samples: 600K samples from the human genome, 600K samples from the mouse genome (Mus musculus), and samples from fungal genomes, for a total of 7,280,000 training samples.
Provider:
omicseye
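
Because the download link points at a Hugging Face mirror, one way to inspect a few records without fetching all 7,280,000 samples is to stream them with the `datasets` library. The following is a minimal sketch, not an official loader: the helper name `peek`, the split name `"train"`, and routing through the mirror via the `HF_ENDPOINT` environment variable are assumptions, and the listing does not state column names, so records are returned as-is.

```python
import os

# Assumption: send Hub requests to the mirror named in the download link.
# HF_ENDPOINT has to be set before `datasets` / `huggingface_hub` is imported.
os.environ.setdefault("HF_ENDPOINT", "https://hf-mirror.com")

# Repository ID taken from the listing title.
REPO_ID = "omicseye/eukar_train"


def peek(n=3, split="train"):
    """Return the first n records of the dataset as a list of dicts.

    Uses streaming mode so the full dataset is not downloaded.
    The split name "train" is an assumption; the listing names no splits.
    """
    # Imported lazily so the module loads even without `datasets` installed.
    from datasets import load_dataset  # pip install datasets

    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return list(stream.take(n))
```

Calling `peek()` needs network access and the `datasets` package; printing the returned records shows which fields the dataset actually exposes.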



