
Genome annotation and protein-coding gene set for the chromosome-level assembly of the predatory stink bug, Eocanthecona furcellata

Source: Figshare. Updated: 2025-09-18. Indexed: 2026-04-28.
Download link:
https://figshare.com/articles/dataset/Genome_annotation_and_protein-coding_gene_set_for_the_chromosome-level_assembly_of_the_predatory_stink_bug_i_Eocanthecona_furcellata_i_/30157360
Description:
This dataset provides the comprehensive genome annotation files for the chromosome-level genome assembly of the predatory stink bug, Eocanthecona furcellata. The structural annotation was performed using the GETA pipeline, which integrated homology-based, transcriptome-guided, and ab initio evidence to produce a high-quality, non-redundant gene set. This process identified a total of 16,880 protein-coding genes.

Subsequent functional annotation was conducted by querying all protein sequences against the NCBI non-redundant (NR) protein database using DIAMOND blastp and scanning for conserved domains against the Pfam, CDD, SMART, and SUPERFAMILY databases using InterProScan. This dual approach successfully assigned putative functions to 16,687 genes, representing 98.74% of the total gene set.

This repository contains four key files:
(1) The structural annotation in General Feature Format version 3 (.gff3), detailing the genomic coordinates of genes, exons, and CDSs.
(2) The predicted protein sequences in FASTA format (.pep.fasta).
(3) The coding DNA sequences (CDS) in FASTA format (.cds.fasta).
(4) A comprehensive functional annotation table in tab-separated values format (.annotation.tsv), which includes gene IDs, NR blast hits, InterProScan domain information, and assigned Gene Ontology (GO) terms.

These data represent a foundational resource for researchers studying the genetics, molecular biology, and evolution of insect predators and will facilitate comparative genomics and functional studies in Pentatomidae. The primary genome assembly corresponding to this annotation is available at GenBank under the accession GCA_052056905.1. Users of these data are encouraged to cite both this Figshare dataset and the associated primary publication.
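As a quick sanity check after downloading, the files described above can be inspected with a few lines of Python. The following is a minimal sketch using only the standard library, not part of the dataset itself. It assumes the GFF3 file uses the conventional nine tab-separated columns with the feature type in column 3, and that gene models carry the type name "gene"; the description gives only file extensions, so the function names and any paths passed to them are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List


def count_gff3_types(lines: Iterable[str]) -> Counter:
    """Count feature types (column 3) in GFF3 text."""
    counts: Counter = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("##FASTA"):
            break  # everything after this directive is sequence, not features
        if not line or line.startswith("#"):
            continue  # skip blank lines, comments, and other directives
        fields = line.split("\t")
        if len(fields) != 9:
            continue  # skip malformed rows rather than guess their meaning
        counts[fields[2]] += 1
    return counts


def fasta_ids(lines: Iterable[str]) -> List[str]:
    """Return the first whitespace-delimited token of each FASTA header."""
    ids = []
    for line in lines:
        if line.startswith(">"):
            tokens = line[1:].split()
            if tokens:
                ids.append(tokens[0])
    return ids


def summarize(gff3_path: str, pep_path: str, cds_path: str) -> Dict[str, object]:
    """Summarize one download; all three paths are supplied by the caller."""
    with open(gff3_path, encoding="utf-8") as fh:
        types = count_gff3_types(fh)
    with open(pep_path, encoding="utf-8") as fh:
        pep = set(fasta_ids(fh))
    with open(cds_path, encoding="utf-8") as fh:
        cds = set(fasta_ids(fh))
    return {
        "gene_features": types.get("gene", 0),  # assumes the type name "gene"
        "protein_records": len(pep),
        "cds_records": len(cds),
        "ids_match": pep == cds,  # every protein should have a matching CDS
    }
```

Calling `summarize(...)` with the local paths of the .gff3, .pep.fasta, and .cds.fasta files returns a small dictionary of counts. If the files match the description, one would expect the gene count to be 16,880, and the protein and CDS ID sets to agree provided both files use the same sequence identifiers.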
Created: 2025-09-18