Complete T2T genome annotation files for soybean cultivar QH34: Gene structure, repeats, and functional annotation
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/Complete_T2T_genome_annotation_files_for_soybean_cultivar_QH34_Gene_structure_repeats_and_functional_annotation/29816705
下载链接
链接失效反馈官方服务:
资源简介:
This dataset provides comprehensive annotations for the telomere-to-telomere (T2T) genome assembly of soybean (*Glycine max*) cultivar QH34. It includes:
1. **Gene structure annotation** (`QH34_T2T_Chr.gff3`):
- Gene models (CDS, exons, UTRs) predicted using [BRAKER2/Funannotate].
- Format: GFF3 (standard genomic feature format).
2. **Repeat annotation** (`QH34_T2T_Chr.fasta.mod.EDTA.TEanno.gff3`):
- Transposable elements (TEs) and repetitive sequences identified by [EDTA/RepeatModeler2+RepeatMasker].
- Format: GFF3 with TE classifications.
3. **Functional annotation** (`func_merge.xls`):
**Data generation**:
- Genome assembly: PacBio HiFi (101×) + ONT (102×) + Hi-C scaffolding.
- Analysis pipelines: [`EDTA` for repeats, `InterProScan` for domains].
**Usage notes**:
- Files are compatible with genome browsers (e.g., IGV) and analysis tools (e.g., BEDTools).
- For functional annotations, refer to the header row in `func_merge.xls` for field descriptions.
创建时间:
2025-08-04



