Complete_Enterobacteriaceae_plasmids
Source: NIAID Data Ecosystem. Indexed 2026-03-10.
Download link:
https://figshare.com/articles/dataset/complete_Enterobacteriaceae_plasmids_nucleotideseq_fa/4609303
Resource description:
This dataset comprises the sequences of 2097 complete Enterobacteriaceae plasmids, curated after initial retrieval from the NCBI nucleotide database on 26th August 2016. The 2097 nucleotide sequences are provided as a FASTA file ('nucleotideseq.fa'). Corresponding protein sequences (n=12,582, i.e. six per plasmid), generated by translating each plasmid in all 6 reading frames, are also provided ('translatedproteinseq.fa'). In addition, two zipped GenBank files provide more information on the accessions: one contains the 2097 curated accessions; the other contains the 6952 accessions that were obtained initially, prior to curation.
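The six-frame translation mentioned above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `translate` and `six_frame` are hypothetical, the standard codon table is assumed, stop codons are written as '*', and any codon containing a non-ACGT character is written as 'X'. How the published 'translatedproteinseq.fa' handles stops, ambiguity codes, or circular sequences is not stated here and may differ.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(seq):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon.

    Codons not in the table (e.g. containing 'N') are written as 'X'.
    """
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame(seq):
    """Return six translations: forward frames 0-2, then reverse-complement frames 0-2."""
    seq = seq.upper()
    rev = seq.translate(COMPLEMENT)[::-1]
    return [translate(strand[offset:]) for strand in (seq, rev) for offset in range(3)]


if __name__ == "__main__":
    # Toy sequence for illustration only, not taken from the dataset.
    for frame, protein in enumerate(six_frame("ATGGCC"), start=1):
        print(frame, protein)
```

Each input sequence yields exactly six translated sequences, which is consistent with the counts in the description: 2097 plasmids × 6 frames = 12,582 protein sequences.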
The protein dataset ('translatedproteinseq.fa') is a useful resource for MOB typing of plasmids (a method of plasmid classification based on detection of relaxase proteins). To conduct MOB typing, download the protein dataset, as well as the scripts provided in a related Figshare code repository: https://figshare.com/s/3f8973dea1fe03c4f62f
Further instructions can be found on the GitHub page referenced in the Description section of the Figshare code repository.
For more details about the dataset provided here, see the associated journal article: "A curated dataset of complete Enterobacteriaceae plasmids compiled from the NCBI nucleotide database", Orlek et al. in press.
Created:
2017-02-23



