Euglena gracilis isolate:CCAP 1224/5Z Genome sequencing

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA1106208
Description:
Euglena gracilis (E. gracilis), central to the study of photosynthesis, endosymbiosis, and chloroplast development, is also an industrial microalga used for paramylon production. Despite its importance, exploration of the E. gracilis genome has been hindered by the genome's complexity. In this study, we produced a chromosome-level de novo assembly (2.37 Gb) using Illumina, PacBio, Bionano, and Hi-C data. The assembly has a contig N50 of 619 Kb and a scaffold N50 of 1.12 Mb, indicating high continuity. Approximately 99.83% of the genome was anchored to 46 chromosomes. Repetitive elements made up 58.84% of the sequences. Functional annotations were assigned to 39,362 proteins. BUSCO analysis estimated assembly completeness at 80.39%. This first high-quality E. gracilis genome provides a foundational resource for genetics and genomics studies in both academic and industrial research.
Created: 2024-04-30