dzjxzyd/UniRef50_len_0_50
Hugging Face | Updated: 2024-10-01 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/dzjxzyd/UniRef50_len_0_50
Description:
This dataset contains protein sequences with lengths from 0 to 50 residues, downloaded from the UniRef50 database. UniRef50 is a protein sequence clustering resource that further reduces redundancy by grouping sequences that share at least 50% sequence identity. The sequences were retrieved from UniProt via an API by selecting a specific length range and identity value, and were exported as compressed TSV files. The underlying sources are the UniRef and UniParc databases: UniRef clusters sequences at different identity thresholds, and UniParc contains all publicly available protein sequences.
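The length-based selection described above can be illustrated with a minimal sketch that reads a gzip-compressed TSV and keeps only rows whose sequence length falls within the range. This is not the dataset author's download script. The column names (`Cluster ID`, `Reference sequence`), the function `filter_by_length`, and the sample rows are all hypothetical and would need to be adapted to the actual file header.

```python
import csv
import gzip
import io


def filter_by_length(gz_bytes, min_len=0, max_len=50,
                     seq_col="Reference sequence"):
    """Return rows of a gzip-compressed TSV whose sequence length
    lies in the inclusive range [min_len, max_len].

    seq_col is an assumed column name, not confirmed by the dataset.
    """
    kept = []
    # gzip.open accepts a file object; "rt" decodes the stream as text.
    with gzip.open(io.BytesIO(gz_bytes), mode="rt",
                   encoding="utf-8", newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        for row in reader:
            if min_len <= len(row[seq_col]) <= max_len:
                kept.append(row)
    return kept


# Hypothetical sample data, made up purely for illustration.
sample_tsv = (
    "Cluster ID\tReference sequence\n"
    "example_1\tMKTAYIAKQR\n"            # length 10, kept
    "example_2\t" + "A" * 51 + "\n"      # length 51, dropped
    "example_3\t" + "G" * 50 + "\n"      # length 50, kept (inclusive bound)
)
sample_gz = gzip.compress(sample_tsv.encode("utf-8"))

short_rows = filter_by_length(sample_gz)
print([row["Cluster ID"] for row in short_rows])
```

For a file on disk, the same function could be called with the file's bytes, for example `filter_by_length(open(path, "rb").read())`, where `path` points to the downloaded `.tsv.gz` file.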
Provided by:
dzjxzyd

The content above was collected and summarized by 遇见数据集.



