five

A high-quality genome assembly for Dillenia turbinata (Dilleniales)

收藏
DataCite Commons2025-06-01 更新2025-06-15 收录
下载链接:
https://datadryad.org/dataset/doi:10.5061/dryad.msbcc2g3j
下载链接
链接失效反馈
官方服务:
资源简介:
Objectives: Dillenia turbinata (Dilleniaceae) is a member of the order Dilleniales, an enigmatic clade of critical importance for understanding the diversification history of flowering plants but for which genome sequences are not available. We have produced and annotated a chromosome-scale whole genome assembly for D. turbinata through the resources of the 10KP (10,000 Plants) Genomes Project. The genome assembly and associated data provided here will serve as a useful resource for comparative and evolutionary genomics research across the flowering plants. Data description: The D. turbinata genome was assembled from Oxford Nanopore Technology (ONT) and whole-genome shotgun (WGS) sequences, and scaffolded into chromosome-scale pseudomolecules using Hi-C data. The genome assembly is 723,739,077 base pairs in length with a BUSCO completeness score of 97%.  Twenty-eight scaffolds contain more than 99% of the assembly. The repeat-masked genome sequence is annotated with 36,967 protein-coding gene models (93% BUSCO completeness) supported by transcriptome/protein evidence and/or Pfam domain content. These measures indicate high contiguity and completeness for the D. turbinata genome.

研究目标:陀螺五桠果(Dillenia turbinata)隶属于五桠果科(Dilleniaceae)五桠果目(Dilleniales),该类群是解析被子植物演化历史的关键疑难演化支,但目前尚无公开可用的基因组序列。本研究依托10KP(10,000 Plants Genomes Project,万种植物基因组计划)的项目资源,完成了陀螺五桠果的染色体级全基因组组装与注释。本次发布的基因组组装数据及配套关联数据,将为被子植物类群的比较基因组学与进化基因组学研究提供极具价值的研究资源。 数据概况:陀螺五桠果的基因组基于牛津纳米孔技术(Oxford Nanopore Technology, ONT)测序数据与全基因组鸟枪法(whole-genome shotgun, WGS)测序数据组装得到,并通过Hi-C技术将序列挂载为染色体级假染色体。该基因组组装的总长度为723,739,077碱基对,BUSCO(Benchmarking Universal Single-Copy Orthologs,通用单拷贝同源基因基准集)完整度评分达97%。其中28个序列支架(scaffold)涵盖了组装序列总量的99%以上。经重复序列屏蔽后的基因组序列,通过转录组/蛋白质组证据以及Pfam(Protein Family Database,蛋白质家族数据库)结构域注释,共注释得到36,967个蛋白质编码基因模型,其BUSCO完整度为93%。上述各项指标均表明,该陀螺五桠果基因组具有极高的连续性与完整度。
提供机构:
Dryad
创建时间:
2023-05-20
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集提供了Dillenia turbinata(Dilleniales目)的高质量染色体尺度基因组组装,填补了该关键植物分支基因组序列的空白。组装基于Oxford Nanopore和全基因组鸟枪法测序,结合Hi-C数据,长度约7.24亿碱基对,BUSCO完整性达97%,并注释了36,967个蛋白质编码基因,适用于开花植物的比较和进化基因组学研究。
以上内容由遇见数据集搜集并总结生成
二维码
社区交流群
二维码
科研交流群
商业服务