five

Genome sequence of Beluga and Narwhal

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA925093
下载链接
链接失效反馈
官方服务:
资源简介:
Reference genomes provide a foundational framework for evolutionary investigations, ecological analysis, and conservation science, and yet the context to understand uncertainty and errors in reference genome construction is typically not provided for end-users. The reference genome for beluga (Delphinapterus leucas) was forwarded in 2017 based on linked reads and iterative scaffolding, and since improved upon with Hi-C data. Here, we forward an improved reference genome for beluga built using a combination of PacBio CLR, illumina short reads, and Hi-C data. We identified several large structural errors in the scaffolding of the original 2017 beluga assembly and unsupported scaffolding orientations in the Hi-C scaffolded version. We also found discrepancies in the order and orientation of contigs that remained in our PacBio assemblies, with inversions being notably abundant. Altogether, we forward a more accurate, if slightly less contiguous, representation of the beluga whale genome, and provide users with intermediate files, code, tables listing regions of uncertainty/discrepancy across assemblies, and gene annotations to critically evaluate, leverage, and potentially improve on our work.
创建时间:
2023-01-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作