five

Chromosome-scale haploid genome sequence and annotation dataset of the durian cultivar 'Kan Yao'

收藏
DataCite Commons2025-04-27 更新2025-04-16 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=40849d4415ff4396b54a87d48906193e
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset includes the sequence files and gene annotation files for two haplotype genomes of the durian cultivar 'Kan Yao', assembled using HiFi, ONT, Hi-C, and second-generation sequencing data. The core software used for genome assembly includes Hifiasm (0.19.5), 3D-DNA (190716), and AssemblyMapper (1.0.3). Gene structure annotation employed three distinct strategies: de novo annotation, homology-based annotation, and transcriptome-based annotation. For de novo annotation, Braker software was used to construct models based on Arabidopsis protein sequences (arabidopsis_pep_20101214.fa) and all merged Illumina transcriptome data to predict gene structures. Homology-based annotation was conducted using GenomeThreader software, referencing the protein annotation file of the Durian genome (GCF_002303985.1_Duzib1.0_protein.faa). Transcriptome-based annotation was performed using PASA software, utilizing all merged Iso-Seq transcriptome sequencing data. Subsequently, the annotation files from these strategies were merged using EVM software and updated with PASA software to incorporate UTR and alternative splicing information, resulting in the final annotation file. Non-coding gene annotation was performed using Infernal software.
提供机构:
Science Data Bank
创建时间:
2024-06-16
二维码
社区交流群
二维码
科研交流群
商业服务