five

Evolution of the carbohydrate 6-O sulfotransferase (C6OST) family in vertebrates and report of CHST16, a previously unrecognized C6OST gene lost from amniotes

收藏
DataCite Commons2020-08-26 更新2024-07-27 收录
下载链接:
https://figshare.com/articles/Evolution_of_the_carbohydrate_6-O_sulfotransferase_C6OST_family_in_vertebrates_and_report_of_CHST16_a_previously_unrecognized_C6OST_gene_lost_from_amniotes/9596285/2
下载链接
链接失效反馈
官方服务:
资源简介:
<b>Master_table_C6OST.xlsx:</b> Complete table of all identified C6OST sequences. Includes sequence names, chromosomal locations, database identifiers and annotation notes of all C6OST sequences identified in this study. Also includes complete list of species, species abbreviations and genome assemblies used in the study.<br>Sequence names include species abbreviations followed by chromosome/linkage group designations (if available) and gene symbols. Asterisks indicate incomplete/partial sequences. Paralogs (within-species duplicates) with uncertain phylogenetic relationships are designated as (1of2), (2of2) et c.<br><br><b>Master_C6OST_all.rtf: </b>All identified C6OST sequences in FASTA format, in the same order as in Master_table_C6OST.xlsx. <b>Rich Test Format file </b>marking exon junctions in alternating colors.<br><br><b>Master_C6OST_all.fasta: </b>All identified C6OST sequences in FASTA format, in the same order as in Master_table_C6OST.xlsx. <b>FASTA format file </b>for alignment/sequence viewing applications.<br><br><b>Ident_C6OST_seq.txt:</b> List of identical C6OST sequences in this dataset.<br><br><b>190121_C6OST_full_align.fasta:</b> Alignment including the full repertoire of C6OST sequences (<i>CHST1, </i><i>CHST</i><i>2, </i><i>CHST</i><i>3, </i><i>CHST</i><i>4, </i><i>CHST</i><i>5, </i><i>CHST</i><i>6, </i><i>CHST</i><i>7, </i><i>CHST</i><i>16 </i>and related genes) in a smaller set of species. <br><br><b>190121_C6OST_full_IQ-TREE.tar.gz:</b> Phylogenetic analysis (IQ-TREE output files) for the full repertoire of C6OST sequences in a smaller set of species. This analysis corresponds to Figures 1-5 in the publication. The file 190305_CHST7_align.fasta.treefile includes the phylogenetic tree in Newick format.<br><br>The following files correspond to the alignments and phylogenetic analyses for each of the C6OST subfamilies with the full representation of species. These analyses correspond to Supplementary Figures S1-S8 in the publication. Within the IQ-TREE output files, the files ending on .treefile include the phylogenetic trees in Newick format.<br><b>190130_CHST1_align.fasta</b> <b>190130_CHST1_IQ-TREE.tar.gz</b><b>190130_CHST2_align.fasta</b><br><b>190130_CHST2</b><b>_IQ-TREE.tar.gz</b><b><br></b><b>190130_CHST3_align.fasta</b><br><b>190130_CHST3</b><b>_IQ-TREE.tar.gz</b><b>190130_CHST4-5-6_align.fasta</b><br><b>190130_CHST4-5-6</b><b>_IQ-TREE.tar.gz</b><b>190130_CHST16_align.fasta</b><br><b>190130_CHST16</b><b>_IQ-TREE.tar.gz</b><b>190305_CHST7_align.fasta</b><br><b>190305_CHST7</b><b>_IQ-TREE.tar.gz</b><b><br></b><b>Conserved_synteny_data.xlsx:</b> Chromosomal/conserved synteny data. Includes chromosomal locations and database identifiers of all C6OST-neighboring genes identified in the study. New gene symbols/names suggested by this study are highlighted in yellow. This file also includes all identified conserved synteny blocks in the human, chicken, Western clawed frog, spotted gar, zebrafish and medaka genomes.<br>

<b>Master_table_C6OST.xlsx:</b>本研究中鉴定得到的所有C6OST序列的完整汇总表。该表格包含本研究鉴定的全部C6OST序列的序列名称、染色体定位、数据库标识符及注释说明,同时收录了本研究使用的所有物种、物种缩写及基因组组装版本的完整列表。<br>序列名称格式为:物种缩写 + 染色体/连锁群编号(若有)+ 基因符号。星号(*)代表序列不完整或为片段序列。系统发育关系尚不明确的物种内旁系同源基因(种内重复序列)将标注为(1of2)、(2of2)等形式。<br><br><b>Master_C6OST_all.rtf:</b>所有已鉴定C6OST序列的FASTA格式(FASTA)文件,序列排列顺序与Master_table_C6OST.xlsx完全一致。该文件为富文本格式(Rich Text Format, RTF),以交替颜色标记外显子剪接位点。<br><br><b>Master_C6OST_all.fasta:</b>所有已鉴定C6OST序列的FASTA格式(FASTA)文件,序列排列顺序与Master_table_C6OST.xlsx完全一致,适用于序列比对/序列查看工具。<br><br><b>Ident_C6OST_seq.txt:</b>本数据集内完全相同的C6OST序列列表。<br><br><b>190121_C6OST_full_align.fasta:</b>包含少量物种的全部C6OST序列(<i>CHST1</i>、<i>CHST2</i>、<i>CHST3</i>、<i>CHST4</i>、<i>CHST5</i>、<i>CHST6</i>、<i>CHST7</i>、<i>CHST16</i>及相关基因)的比对文件。<br><br><b>190121_C6OST_full_IQ-TREE.tar.gz:</b>针对少量物种的全部C6OST序列进行系统发育分析的IQ-TREE输出文件压缩包。该分析结果对应论文中的图1至图5。其中文件190305_CHST7_align.fasta.treefile包含Newick格式(Newick)的系统发育树文件。<br><br>以下文件对应各C6OST亚家族的序列比对及系统发育分析,覆盖所有研究物种,其分析结果对应论文中的补充图S1至S8。在IQ-TREE的输出文件中,后缀为.treefile的文件包含Newick格式(Newick)的系统发育树。<br><b>190130_CHST1_align.fasta</b> <b>190130_CHST1_IQ-TREE.tar.gz</b><b>190130_CHST2_align.fasta</b><br><b>190130_CHST2</b><b>_IQ-TREE.tar.gz</b><b><br></b><b>190130_CHST3_align.fasta</b><br><b>190130_CHST3</b><b>_IQ-TREE.tar.gz</b><b>190130_CHST4-5-6_align.fasta</b><br><b>190130_CHST4-5-6</b><b>_IQ-TREE.tar.gz</b><b>190130_CHST16_align.fasta</b><br><b>190130_CHST16</b><b>_IQ-TREE.tar.gz</b><b>190305_CHST7_align.fasta</b><br><b>190305_CHST7</b><b>_IQ-TREE.tar.gz</b><b><br></b><b>Conserved_synteny_data.xlsx:</b>染色体保守同线性数据文件。该文件包含本研究鉴定的所有C6OST邻近基因的染色体定位及数据库标识符,本研究提出的新基因符号/名称以黄色高亮标注。此外,该文件还收录了人类、鸡、西部爪蟾、斑点雀鳝、斑马鱼及青鳉基因组中所有鉴定得到的保守同线性区块。
提供机构:
figshare
创建时间:
2019-08-14
二维码
社区交流群
二维码
科研交流群
商业服务