five

Vantage-point search (vpsearch) dataset and index

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6509510
下载链接
链接失效反馈
官方服务:
资源简介:
This is a dataset for use with the vpsearch tool. This dataset contains the following files: A compressed sequence file, bac120_ssu_reps_r207-sliced-dedup.fa.gz, which was obtained from the GTDB dataset of bacterial sequences (v207) by extracting the v3-v4 hypervariable region and removing duplicate sequences. A vantage-point tree, built with version 0.1.2 of the vpsearch software. Note: the vantage-point tree needs to be decompressed (tar xzvf bac120_ssu_reps_r207-sliced-dedup.db.tar.gz) before it can be used for querying. A sample query file, query.fa, containing the sequence NR_126253.1 from RefSeq. The scripts used to prepare the dataset can be found in the vpsearch GitHub repository. Also included in the repository is a description of how the primary data was obtained.
创建时间:
2022-05-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作