
E. coli BIGSI index

Source: DataCite Commons. Updated 2025-06-01; indexed 2024-07-28.
Download link:
https://figshare.com/articles/dataset/E_coli_BIGSI_index/12666497/1
Description:
A BIGSI index of the E. coli collection, which can be used to quickly query the genomes for any DNA sequence of 61 bp or longer.

BIGSI uses a k-mer based approach to query any DNA sequence of 61 bp or longer against all the assemblies in the collection (Bradley et al. 2019). This can be done as follows:

bigsi search -c config_10K_00.yaml -t 0.8 ATGAAAAACACAATACATATCAACTTCGCTATTTTTTTAATAATTGCAAATATTATCTACA

Here config_10K_00.yaml is the config file for the BIGSI index of the assemblies, 0.8 is the k-mer similarity threshold used to define a match, and "ATGAAAAACACAATACATATCAACTTCGCTATTTTTTTAATAATTGCAAATATTATCTACA" is the sequence being searched. BIGSI returns all genome identifiers in the collection that contain this sequence with at least 80% k-mer similarity. The properties of these genomes can be looked up in File F1. Users must ensure that the path to the index (the "filename:" entry) in config_10K_00.yaml is correct. Please refer to the BIGSI documentation (https://github.com/iqbal-lab-org/BIGSI) for full details.
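To make the meaning of the 0.8 threshold concrete, the following is a minimal Python sketch of a k-mer similarity check. It is an illustration of the general idea only, not BIGSI's implementation: BIGSI queries a precomputed index rather than comparing raw sequences, and this toy version ignores reverse complements. The function names and the choice of k are hypothetical; the k-mer size used to build this index is not stated here.

```python
def kmers(seq, k):
    """Return the set of distinct k-mers (substrings of length k) in seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_similarity(query, target, k):
    """Fraction of the query's distinct k-mers that also occur in target."""
    query_kmers = kmers(query, k)
    if not query_kmers:
        raise ValueError("query is shorter than k")
    return len(query_kmers & kmers(target, k)) / len(query_kmers)


def is_match(query, target, k, threshold=0.8):
    """Report a match when at least `threshold` of the query k-mers are found.

    threshold=0.8 mirrors the -t 0.8 option in the command above;
    k is an assumed parameter for illustration.
    """
    return kmer_similarity(query, target, k) >= threshold
```

With a threshold of 0.8, a genome is reported when at least 80% of the query's k-mers are found in it, so a query that differs from a genome at a few positions can still be returned as a hit.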
Provider:
figshare
Created:
2020-09-11