Gene annotation, protein models, coding sequences list
收藏DataCite Commons2025-08-22 更新2025-01-06 收录
下载链接:
https://figshare.com/articles/dataset/Gene_annotation_protein_models_coding_sequences_list_for_stinging_nettle_female_i_Urtica_dioica_i_ssp_i_dioica_i_/28012703
下载链接
链接失效反馈官方服务:
资源简介:
Gene annotation of Nettle (<i>Urtica dioica</i> ssp. <i>dioica</i>) genomes using BRAKER3. The .tar.gz file contains the following 6 files each:<b>Figshare_uploads_Gene.tar</b><b>.gz: </b>Gene annotation of haplotype 1 in .gtf (braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)Gene annotation of haplotype 2 in .gtf (braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)List of protein sequences annotated in haplotype 1 in amino acid .fasta format (braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of protein sequences annotated in haplotype 2 in amino acid .fasta format (braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of coding sequences (CDS) annotated in haplotype 1 in nucleotide .fasta format (braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)List of protein sequences (CDS) annotated in haplotype 2 in nucleotide .fasta format (braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)<b>Figshare_uploads_Gene_male.tar: </b>Gene annotation of haplotype 1 in .gtf (braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)Gene annotation of haplotype 2 in .gtf (braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)List of protein sequences annotated in haplotype 1 in amino acid .fasta format (braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of protein sequences annotated in haplotype 2 in amino acid .fasta format (braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of coding sequences (CDS) annotated in haplotype 1 in nucleotide .fasta format (braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)List of protein sequences (CDS) annotated in haplotype 2 in nucleotide .fasta format (braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)Note that headers of the protein sequences match the "transcript" IDs on the .gtf file, meaning the protein sequences include all the splicing variants predicted.
本数据集为基于BRAKER3对荨麻(*Urtica dioica* ssp. *dioica*)基因组开展的基因注释结果。压缩包Figshare_uploads_Gene.tar.gz包含以下6类文件:
1. 单倍型1的基因注释GTF(Gene Transfer Format,基因转移格式)文件:braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf
2. 单倍型2的基因注释GTF文件:braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf
3. 单倍型1注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa
4. 单倍型2注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa
5. 单倍型1注释得到的编码序列(Coding Sequence,CDS)核苷酸FASTA格式列表:braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq
6. 单倍型2注释得到的编码序列(CDS,Coding Sequence)核苷酸FASTA格式列表:braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq
压缩包Figshare_uploads_Gene_male.tar包含以下6类文件:
1. 单倍型1的基因注释GTF文件:braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf
2. 单倍型2的基因注释GTF文件:braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf
3. 单倍型1注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa
4. 单倍型2注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa
5. 单倍型1注释得到的编码序列(CDS)核苷酸FASTA格式列表:braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq
6. 单倍型2注释得到的编码序列(CDS)核苷酸FASTA格式列表:braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq
请注意,蛋白序列的标题与GTF文件中的"transcript" ID一一对应,即该蛋白序列涵盖了所有预测得到的可变剪接变体。
提供机构:
figshare
创建时间:
2024-12-13



