
vir2_NCBI_21-03-2018

Indexed from NIAID Data Ecosystem on 2026-03-10
Download link:
https://figshare.com/articles/dataset/vir2_NCBI_21-03-2018/6106892
Description:
Two FASTA sequence datasets downloaded from GenBank nucleotides and GenBank proteins using the Galaxy tool "fetch_fasta_from_ncbi" (https://toolshed.g2.bx.psu.edu/) and the query strings given here. DNA sequences were retrieved on 21-03-2018 using two queries: "txid10239[Organism] NOT txid131567[Organism] NOT phage[All Fields] NOT patent[All Fields] NOT chimeric[Title] NOT vector[Title] NOT method[Title] NOT X174[All Fields] AND 301:10000[Sequence length]" and "txid10239[Organism] NOT txid131567[Organism] NOT phage[All Fields] NOT patent[All Fields] NOT chimeric[Title] NOT vector[Title] NOT method[Title] NOT X174[All Fields] AND 10001:1300000[Sequence length]". The 301-10000 nt sequences were then clustered using the Galaxy tool vsearch. The resulting centroids were finally merged with the 10001-1300000 nt sequences, yielding vir2_NCBI_21-03-2018. Protein sequences were retrieved using the query "txid10239[Organism] NOT txid131567[Organism] NOT phage[All Fields] NOT patent[All Fields] NOT chimeric[Title] NOT vector[Title] NOT method[Title] NOT X174[All Fields] AND 30:9000[Sequence length]".
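The three query strings in the description share the same taxonomic and keyword filter and differ only in the sequence-length range. As a minimal sketch of how they could be assembled, the Python below builds each query from that shared filter. The names BASE_FILTER and build_query are hypothetical helpers introduced for illustration; they are not part of the Galaxy tool or of the original workflow, and the sketch only constructs strings without contacting NCBI.

```python
# Shared taxonomic and keyword filter, copied from the dataset description.
BASE_FILTER = (
    "txid10239[Organism] NOT txid131567[Organism] NOT phage[All Fields] "
    "NOT patent[All Fields] NOT chimeric[Title] NOT vector[Title] "
    "NOT method[Title] NOT X174[All Fields]"
)


def build_query(min_len: int, max_len: int) -> str:
    """Append a sequence-length range to the shared filter (hypothetical helper)."""
    return f"{BASE_FILTER} AND {min_len}:{max_len}[Sequence length]"


# Nucleotide sequences that were clustered with vsearch before merging.
short_nt_query = build_query(301, 10000)
# Nucleotide sequences that were merged in without clustering.
long_nt_query = build_query(10001, 1300000)
# Protein sequences.
protein_query = build_query(30, 9000)

if __name__ == "__main__":
    for name, query in [
        ("short nucleotide", short_nt_query),
        ("long nucleotide", long_nt_query),
        ("protein", protein_query),
    ]:
        print(f"{name}: {query}")
```

Keeping the filter in one place makes it easier to see that the two nucleotide queries partition the 301-1300000 nt range at 10000 nt, with only the shorter partition passing through clustering.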
Created:
2018-04-06