five

Fast.genomics comparative genome browser: 2023 release

收藏
Figshare2023-08-23 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Fast_genomics_comparative_genome_browser_2023_release/24010353
下载链接
链接失效反馈
官方服务:
资源简介:
This is an archive of the 2023 release of http://fast.genomics.lbl.gov, a fast comparative genome browser for diverse bacteria and archaea.fast_code_Aug2023.tar.gz contains the source code, which is also available at github. See SETUP for installation instructions and lib/neighbor.sql for the database schema.Some of the code depends on Perl libraries from the PaperBLAST code base. These are archived in PaperBLAST_lib.tar.gz and should be exploded into ../PaperBLAST/ to create the ../PaperBLAST/lib directory. They are also available from github.fast_main_May2023.tar.gz contains the main database, with one representative genome for each of 6,377 genera. neighbor.db is the SQLite3 database and neighbor.faa.gz has the protein sequences. Put these files in the data/ directory. You can build the mmseqs database with:gunzip data/neighbor.faa.gzmmseqs createdb data/neighbor.faa data/mmseqsdb --dbtype 1mmseqs createindex data/mmseqsdb /tmp -k 6The sub-databases, with additional genomes for each taxonomic order, can be downloaded here or here. The tarball contains a directory for each sub-database; these subdirectories should go in the data/ directory. After installing the main database, you can build the clustered BLAST+ database for each sub-database's cluster with:for sub in `sqlite3 data/neighbor.db 'select prefix FROM SubDb;'`;do gunzip data/$sub/cluster.faa.gzmakeblastdb -in data/$sub/cluster.faa -dbtype prot -out data/$sub/cluster.faa.plusdbdoneOr, to download the the sub-database for a specific order, visit the fast.genomics web site, search for that order, switch to the sub-database, and see downloads section at the bottom of the main page.
创建时间:
2023-08-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作