five

Complete_Enterobacteriaceae_plasmids

收藏
NIAID Data Ecosystem2026-03-10 收录
下载链接:
https://figshare.com/articles/dataset/complete_Enterobacteriaceae_plasmids_nucleotideseq_fa/4609303
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset comprises sequences of 2097 complete Enterobacteriaceae plasmids, curated following initial retrieval from the NCBI nucleotide database on 26th August 2016. The 2097 nucleotide sequences are provided as a FASTA file ('nucleotideseq.fa'). Corresponding protein sequences (n=12,582), generated by translating each plasmid in all 6 frames, are also provided ('translatedproteinseq.fa'). In addition, there are two zipped Genbank files providing more information on accessions. One contains the 2097 curated accessions; the other contains 6952 accessions that were obtained initially, prior to curation. The protein dataset ('translatedproteinseq.fa') is a useful resource for MOB typing plasmids (a method of plasmid classification based on detection of relaxase proteins). To conduct MOB typing, download the protein dataset, as well as scripts provided in a related Figshare code repository: https://figshare.com/s/3f8973dea1fe03c4f62fFurther instructions can be found on the Github page referenced in the Description section of the Figshare code repository. For more details about the dataset provided here, see the associated journal article: "A curated dataset of complete Enterobacteriaceae plasmids compiled from the NCBI nucleotide database", Orlek et al. in press.
创建时间:
2017-02-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作