Evolution of the carbohydrate 6-O sulfotransferase (C6OST) family in vertebrates and report of CHST16, a previously unrecognized C6OST gene lost from amniotes
收藏DataCite Commons2020-08-26 更新2024-08-25 收录
下载链接:
https://figshare.com/articles/Evolution_of_the_carbohydrate_6-O_sulfotransferase_C6OST_family_in_vertebrates_and_report_of_CHST16_a_previously_unrecognized_C6OST_gene_lost_from_amniotes/9596285/3
下载链接
链接失效反馈官方服务:
资源简介:
<b>Master_table_C6OST.xlsx:</b> Complete table of all identified C6OST sequences. Includes sequence names, chromosomal locations, database identifiers and annotation notes of all C6OST sequences identified in this study. Also includes complete list of species, species abbreviations and genome assemblies used in the study.<br>Sequence names include species abbreviations followed by chromosome/linkage group designations (if available) and gene symbols. Asterisks indicate incomplete/partial sequences. Paralogs (within-species duplicates) with uncertain phylogenetic relationships are designated as (1of2), (2of2) et c.<br><br><b>Master_C6OST_all.rtf: </b>All identified C6OST sequences in FASTA format, in the same order as in Master_table_C6OST.xlsx. <b>Rich Test Format file </b>marking exon junctions in alternating colors.<br><br><b>Master_C6OST_all.fasta: </b>All identified C6OST sequences in FASTA format, in the same order as in Master_table_C6OST.xlsx. <b>FASTA format file </b>for alignment/sequence viewing applications.<br><br><b>Ident_C6OST_seq.txt:</b> List of identical C6OST sequences in this dataset.<br><br><b>190121_C6OST_full_align.fasta:</b> Alignment including the full repertoire of C6OST sequences (<i>CHST1, </i><i>CHST</i><i>2, </i><i>CHST</i><i>3, </i><i>CHST</i><i>4, </i><i>CHST</i><i>5, </i><i>CHST</i><i>6, </i><i>CHST</i><i>7, </i><i>CHST</i><i>16 </i>and related genes) in a smaller set of species. <br><br><b>190121_C6OST_full_IQ-TREE.tar.gz:</b> Phylogenetic analysis (IQ-TREE output files) for the full repertoire of C6OST sequences in a smaller set of species. This analysis corresponds to Figures 1-5 in the publication. The file 190305_CHST7_align.fasta.treefile includes the phylogenetic tree in Newick format.<br><br>The following files correspond to the alignments and phylogenetic analyses for each of the C6OST subfamilies with the full representation of species. These analyses correspond to Supplementary Figures S1-S8 in the publication. Within the IQ-TREE output files, the files ending on .treefile include the phylogenetic trees in Newick format.<br><b>190130_CHST1_align.fasta</b> <b>190130_CHST1_IQ-TREE.tar.gz</b><b>190130_CHST2_align.fasta</b><br><b>190130_CHST2</b><b>_IQ-TREE.tar.gz</b><b><br></b><b>190130_CHST3_align.fasta</b><br><b>190130_CHST3</b><b>_IQ-TREE.tar.gz</b><b>190130_CHST4-5-6_align.fasta</b><br><b>190130_CHST4-5-6</b><b>_IQ-TREE.tar.gz</b><b>190130_CHST16_align.fasta</b><br><b>190130_CHST16</b><b>_IQ-TREE.tar.gz</b><b>190305_CHST7_align.fasta</b><br><b>190305_CHST7</b><b>_IQ-TREE.tar.gz</b><b><br></b><b>Conserved_synteny_gene_lists_Ens83.xlsx: </b>Lists of genes from the C6OST gene-bearing chromosome regions in the human, Carolina anole lizard, spotted gar and zebrafish genomes. Lists are arranged by Ensembl protein family predictions (Ensembl version 83) and number of times each protein family is represented on C6OST-bearing chromosome regions (column named '#'). <b><br></b><b><br></b><b>Conserved_synteny_data.xlsx:</b> Chromosomal/conserved synteny data. Includes chromosomal locations and database identifiers of all C6OST-neighboring genes identified in the study. New gene symbols/names suggested by this study are highlighted in yellow. This file also includes all identified conserved synteny blocks in the human, chicken, Western clawed frog, spotted gar, zebrafish and medaka genomes.<br><b>Channel_catfish_CHST1a_region.xlsx:</b> Genes neighboring <i>CHST1a </i>in the channel catfish genome. Used to identify the orthologous region of the zebrafish genome, which lacks a <i>CHST1a </i>gene.<br><br><b>Anolis_Chr2_conserved_synteny.xlsx: </b>Genes neighboring the "<i>CHST4/5-like</i>" gene on Carolina anole lizard chromosome 2. Used to identify the orthologous regions of the human and spotted gar genomes.<br>
<b>Master_table_C6OST.xlsx:</b>本研究中鉴定的所有C6OST序列的完整汇总表。本表涵盖本研究鉴定的全部C6OST序列的序列名称、染色体定位、数据库标识符及注释说明,同时包含本研究使用的所有物种类别、物种缩写及基因组组装版本的完整列表。<br>序列名称由物种缩写、染色体/连锁群标识(若有)及基因符号组合而成。星号(*)代表序列不完整或为片段序列。系统发育关系尚不明确的物种内旁系同源基因(复制基因)将以(1of2)、(2of2)等形式标注。<br><br><b>Master_C6OST_all.rtf:</b>所有已鉴定C6OST序列的富文本格式(Rich Text Format, RTF)文件,序列顺序与Master_table_C6OST.xlsx完全一致,该文件以交替颜色标注外显子剪接位点。<br><br><b>Master_C6OST_all.fasta:</b>所有已鉴定C6OST序列的FASTA格式(FASTA)文件,序列顺序与Master_table_C6OST.xlsx完全一致,适用于序列比对或序列查看工具。<br><br><b>Ident_C6OST_seq.txt:</b>本数据集中所有完全一致的C6OST序列列表。<br><br><b>190121_C6OST_full_align.fasta:</b>包含少量物种的全部C6OST序列(<i>CHST1</i>、<i>CHST2</i>、<i>CHST3</i>、<i>CHST4</i>、<i>CHST5</i>、<i>CHST6</i>、<i>CHST7</i>、<i>CHST16</i>及相关基因)的比对文件。<br><br><b>190121_C6OST_full_IQ-TREE.tar.gz:</b>针对少量物种的全部C6OST序列进行系统发育分析的IQ-TREE输出文件压缩包,该分析对应论文中的图1至图5。其中190305_CHST7_align.fasta.treefile文件包含Newick格式的系统发育树。<br><br>以下文件对应各C6OST亚家族的序列比对及系统发育分析,涵盖所有物种类别,这些分析对应论文中的补充图S1至S8。在IQ-TREE输出文件中,后缀为.treefile的文件包含Newick格式的系统发育树。<br><b>190130_CHST1_align.fasta</b> <b>190130_CHST1_IQ-TREE.tar.gz</b><b>190130_CHST2_align.fasta</b><br><b>190130_CHST2_IQ-TREE.tar.gz</b><b><br></b><b>190130_CHST3_align.fasta</b><br><b>190130_CHST3_IQ-TREE.tar.gz</b><b>190130_CHST4-5-6_align.fasta</b><br><b>190130_CHST4-5-6_IQ-TREE.tar.gz</b><b>190130_CHST16_align.fasta</b><br><b>190130_CHST16_IQ-TREE.tar.gz</b><b>190305_CHST7_align.fasta</b><br><b>190305_CHST7_IQ-TREE.tar.gz</b><b><br></b><b>Conserved_synteny_gene_lists_Ens83.xlsx:</b>人、卡罗莱纳安乐蜥、斑点雀鳝及斑马鱼基因组中携带C6OST基因的染色体区域的基因列表。该列表基于Ensembl蛋白家族预测(Ensembl版本83)进行整理,列名为“#”的栏目统计了每个蛋白家族在C6OST携带染色体区域的出现次数。<b><br></b><b><br></b><b>Conserved_synteny_data.xlsx:</b>染色体保守同线性数据文件。本表涵盖本研究鉴定的所有C6OST邻近基因的染色体定位与数据库标识符,本研究提出的新基因符号/名称以黄色高亮标注。本文件同时包含人、鸡、西部爪蛙、斑点雀鳝、斑马鱼及青鳉基因组中所有已鉴定的保守同线性区块。<br><b>Channel_catfish_CHST1a_region.xlsx:</b>斑点叉尾鮰基因组中<i>CHST1a</i>基因的邻近基因列表,用于鉴定斑马鱼基因组中缺乏<i>CHST1a</i>基因的同源区域。<br><br><b>Anolis_Chr2_conserved_synteny.xlsx:</b>卡罗莱纳安乐蜥2号染色体上<i>CHST4/5-like</i>基因的邻近基因列表,用于鉴定人及斑点雀鳝基因组的同源区域。
提供机构:
figshare
创建时间:
2019-09-02



