
E. coli BIGSI index

NIAID Data Ecosystem, indexed 2026-03-12
Download link:
https://figshare.com/articles/dataset/E_coli_BIGSI_index/12666497
Description:
A BIGSI index of the E. coli collection, which can be used to easily and quickly query the genomes for any DNA sequence of 61 bp or longer. BIGSI uses a k-mer based approach to query any DNA sequence of 61 bp or greater against all the assemblies of the collection (Bradley et al. 2019). This can be achieved as follows:

    bigsi search -c config_10K_00.yaml -t 0.8 ATGAAAAACACAATACATATCAACTTCGCTATTTTTTTAATAATTGCAAATATTATCTACA

Here config_10K_00.yaml is the config file for the BIGSI index of the assemblies, 0.8 is the k-mer similarity threshold used to define a match, and "ATGAAAAACACAATACATATCAACTTCGCTATTTTTTTAATAATTGCAAATATTATCTACA" is the sequence being searched. BIGSI will return the identifiers of all genomes in the collection that contain this sequence with at least 80% k-mer similarity. The properties of these genomes can be investigated in File F1.

The user will need to ensure that the path to the index (the "filename:" entry) is correct in the config_10K_00.yaml file. Please refer to the BIGSI documentation (https://github.com/iqbal-lab-org/BIGSI) for full details.
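The query described above can be prepared programmatically. The following is a minimal sketch, not part of the dataset or of BIGSI itself: the helper names build_bigsi_command and kmer_fraction are hypothetical, and the only facts taken from the description are the 61 bp minimum, the config file name, and the -c and -t options. The second helper is a toy illustration of what a k-mer similarity threshold means (the fraction of the query's k-mers found in a target); the k value it takes is an assumption supplied by the caller, not the k used to build this index.

```python
MIN_QUERY_LEN = 61  # minimum query length stated in the dataset description


def build_bigsi_command(query, config="config_10K_00.yaml", threshold=0.8):
    """Return the argument list for the `bigsi search` call quoted above.

    Hypothetical helper: it only validates the input and assembles the
    arguments; it does not run BIGSI.
    """
    query = query.strip().upper()
    if len(query) < MIN_QUERY_LEN:
        raise ValueError(
            f"query is {len(query)} bp; need at least {MIN_QUERY_LEN} bp"
        )
    if set(query) - set("ACGT"):
        raise ValueError("query contains characters other than A, C, G, T")
    if not 0.0 < threshold <= 1.0:
        raise ValueError("threshold must be in (0, 1]")
    return ["bigsi", "search", "-c", config, "-t", str(threshold), query]


def kmer_fraction(query, target, k):
    """Toy illustration: fraction of the query's distinct k-mers seen in target."""
    query_kmers = {query[i:i + k] for i in range(len(query) - k + 1)}
    target_kmers = {target[i:i + k] for i in range(len(target) - k + 1)}
    if not query_kmers:
        raise ValueError("query is shorter than k")
    return len(query_kmers & target_kmers) / len(query_kmers)


# Synthetic 64 bp query, used here only to exercise the helper.
example_query = "ACGT" * 16
print(" ".join(build_bigsi_command(example_query)))
```

The returned list can be passed to subprocess.run on a machine where BIGSI is installed and the "filename:" path in the config file points at the downloaded index. Under this toy definition, a threshold of 0.8 would accept a genome for which kmer_fraction is at least 0.8.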
Created:
2020-09-11