five

Gene annotation, protein models, coding sequences list

收藏
DataCite Commons2025-08-22 更新2025-01-06 收录
下载链接:
https://figshare.com/articles/dataset/Gene_annotation_protein_models_coding_sequences_list_for_stinging_nettle_female_i_Urtica_dioica_i_ssp_i_dioica_i_/28012703
下载链接
链接失效反馈
官方服务:
资源简介:
Gene annotation of Nettle (<i>Urtica dioica</i> ssp. <i>dioica</i>) genomes using BRAKER3. The .tar.gz file contains the following 6 files each:<b>Figshare_uploads_Gene.tar</b><b>.gz: </b>Gene annotation of haplotype 1 in .gtf (braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)Gene annotation of haplotype 2 in .gtf (braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)List of protein sequences annotated in haplotype 1 in amino acid .fasta format (braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of protein sequences annotated in haplotype 2 in amino acid .fasta format (braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of coding sequences (CDS) annotated in haplotype 1 in nucleotide .fasta format (braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)List of protein sequences (CDS) annotated in haplotype 2 in nucleotide .fasta format (braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)<b>Figshare_uploads_Gene_male.tar: </b>Gene annotation of haplotype 1 in .gtf (braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)Gene annotation of haplotype 2 in .gtf (braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf)List of protein sequences annotated in haplotype 1 in amino acid .fasta format (braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of protein sequences annotated in haplotype 2 in amino acid .fasta format (braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa)List of coding sequences (CDS) annotated in haplotype 1 in nucleotide .fasta format (braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)List of protein sequences (CDS) annotated in haplotype 2 in nucleotide .fasta format (braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq)Note that headers of the protein sequences match the "transcript" IDs on the .gtf file, meaning the protein sequences include all the splicing variants predicted.

本数据集为基于BRAKER3对荨麻(*Urtica dioica* ssp. *dioica*)基因组开展的基因注释结果。压缩包Figshare_uploads_Gene.tar.gz包含以下6类文件: 1. 单倍型1的基因注释GTF(Gene Transfer Format,基因转移格式)文件:braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf 2. 单倍型2的基因注释GTF文件:braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf 3. 单倍型1注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa 4. 单倍型2注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa 5. 单倍型1注释得到的编码序列(Coding Sequence,CDS)核苷酸FASTA格式列表:braker3.Nettle_female_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq 6. 单倍型2注释得到的编码序列(CDS,Coding Sequence)核苷酸FASTA格式列表:braker3.Nettle_female_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq 压缩包Figshare_uploads_Gene_male.tar包含以下6类文件: 1. 单倍型1的基因注释GTF文件:braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.gtf 2. 单倍型2的基因注释GTF文件:braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.gtf 3. 单倍型1注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.faa 4. 单倍型2注释得到的氨基酸FASTA格式蛋白序列列表:braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.faa 5. 单倍型1注释得到的编码序列(CDS)核苷酸FASTA格式列表:braker3.Nettle_male_H1_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq 6. 单倍型2注释得到的编码序列(CDS)核苷酸FASTA格式列表:braker3.Nettle_male_H2_v2_RNA_ProtsViridiplantaeOrthoDB.codingseq 请注意,蛋白序列的标题与GTF文件中的"transcript" ID一一对应,即该蛋白序列涵盖了所有预测得到的可变剪接变体。
提供机构:
figshare
创建时间:
2024-12-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作