Additional file 2 of PAM-repeat associations and spacer selection preferences in single and co-occurring CRISPR-Cas systems
Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://figshare.com/articles/dataset/Additional_file_2_of_PAM-repeat_associations_and_spacer_selection_preferences_in_single_and_co-occurring_CRISPR-Cas_systems/16713917
Resource description:
Additional file 2: CSV file containing, for each unique spacer in CRISPRCasDB, the following columns:
Spacers: spacer sequence
Repeats: repeat sequence in host(s) (can be multiple if several genomes contain the same spacer)
Accessionnrs: accession number of host(s)
Subtype: subtype of array
cas_genes: cas genes present in host(s)
hit: 1 if a match was found in the (meta)genomic databases, else 0
consensus_flanks: consensus sequence of the left and right flanks, derived from the flanks of all database hits to this spacer
repeat_cluster: ID of the repeat cluster generated with CD-HIT
strandbias: orientation of the hit relative to the ORF (1 coding strand, 0 template strand, -1 undetermined)
type: type of CRISPR array
orientation_CRISPRCasdb: orientation of the spacer as determined in CRISPRCasDB [6]
orientation_PAMbased: orientation of the spacer as determined in this study, based on the PAM
orientation_TOPbased: orientation of the spacer as determined with TOP [56]
PAM: PAM sequence of the repeat cluster (if predicted)
Genus, Family, Order ….: taxonomy of host
Type I, TypeII, Type III…: whether host genomes contain genes related to the specific type (1 yes, 0 no)
Subtypesingenomes: which subtypes are in the genomes
Subtypesinproximity: which subtypes are in proximity (<25000 bp from the spacer)
Proximity_subtypes: distance of the spacer to the gene cluster of the specific subtype
subtypesCas1: which subtypes are in genomes that contain a Cas1 protein
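As an illustration of how the columns described above might be used, the following minimal Python sketch computes, per array subtype, the fraction of spacers with a database match. It rests on several assumptions that the record does not confirm: that the CSV is comma-delimited, that the header names are spelled exactly as in the description (`Subtype`, `hit`), and that `hit` is stored as `1` or `0`. The function name `hit_fraction_by_subtype` is hypothetical, and the sample rows are synthetic placeholders, not data from the file.

```python
import csv
import io
from collections import defaultdict


def hit_fraction_by_subtype(handle, delimiter=","):
    """Return {subtype: fraction of spacers whose 'hit' column is 1}.

    Assumes header names 'Subtype' and 'hit' as listed in the description;
    the real file may spell or encode them differently.
    """
    totals = defaultdict(int)
    hits = defaultdict(int)
    for row in csv.DictReader(handle, delimiter=delimiter):
        subtype = row["Subtype"]
        totals[subtype] += 1
        # Accept both integer-style and float-style encodings of 1.
        if row["hit"].strip() in {"1", "1.0"}:
            hits[subtype] += 1
    return {subtype: hits[subtype] / totals[subtype] for subtype in totals}


# Synthetic rows for illustration only; not taken from the dataset.
SAMPLE = """Spacers,Subtype,hit,strandbias
ACGTACGTAC,I-E,1,1
TTGACCGATA,I-E,0,-1
GGCATTACGA,II-A,1,0
"""

fractions = hit_fraction_by_subtype(io.StringIO(SAMPLE))
print(fractions)
```

To run this against the downloaded file, the in-memory sample would be replaced with an opened file handle, for example `open(path, newline="")`, where `path` is wherever the CSV was saved locally.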
Created:
2021-09-30



