
DUN.CDS.fasta

DataCite Commons · updated 2024-03-26 · indexed 2024-07-13
Download link:
https://melbourne.figshare.com/articles/dataset/DUN_CDS_fasta/25479169
Description:
Marsupials exhibit highly specialized patterns of reproduction and development, making them uniquely valuable for comparative genomics studies with their sister lineage, the eutherian (also known as placental) mammals. However, marsupial genomic resources still lag far behind those of eutherian mammals, limiting our insight into mammalian diversity. Here, we present a series of novel genomic resources for the fat-tailed dunnart (Sminthopsis crassicaudata), a mouse-like marsupial that, owing to its ease of husbandry and ex-utero development, is emerging as a laboratory model. To enable wider use, we have generated a multi-tissue de novo transcriptome assembly from dunnart RNA-seq reads spanning 15 tissues. This highly representative transcriptome comprises 2,093,982 assembled transcripts, with a mean transcript length of 830 bp. Its mammalian BUSCO completeness score of 93% is the highest among published marsupial transcriptomes. Additionally, we report an improved fat-tailed dunnart genome assembly that is 3.23 Gb long and organized into 1,848 scaffolds, with a scaffold N50 of 72.64 Mb. The genome annotation, supported by assembled transcripts and ab initio predictions, revealed 21,622 protein-coding genes. Altogether, these resources will contribute greatly towards characterizing marsupial biology and mammalian genome evolution.
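The description reports summary statistics such as mean transcript length and scaffold N50. As a rough illustration of how such figures are conventionally computed from a FASTA file, here is a minimal Python sketch. It is not the authors' pipeline: the function names are hypothetical, the parser assumes plain uncompressed FASTA, and N50 is taken in its usual sense (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total length).

```python
from typing import Dict, Iterable, List


def read_fasta_lengths(lines: Iterable[str]) -> Dict[str, int]:
    """Return {record_id: sequence_length} for FASTA-formatted lines.

    Assumes plain-text FASTA; the record ID is the first
    whitespace-delimited token after '>'.
    """
    lengths: Dict[str, int] = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            tokens = line[1:].split()
            # Fall back to a positional name if the header is empty.
            current = tokens[0] if tokens else f"record_{len(lengths) + 1}"
            lengths[current] = 0
        elif current is not None:
            lengths[current] += len(line)
    return lengths


def mean_length(lengths: List[int]) -> float:
    """Arithmetic mean of sequence lengths (0.0 for an empty list)."""
    return sum(lengths) / len(lengths) if lengths else 0.0


def n50(lengths: List[int]) -> int:
    """Length L such that sequences of length >= L cover at least
    half of the total assembled length (0 for an empty list)."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0


if __name__ == "__main__":
    # Tiny in-memory example; for a real file (assumed to be downloaded
    # locally), pass an open file handle instead, e.g.
    #   with open("DUN.CDS.fasta") as handle: read_fasta_lengths(handle)
    example = [">seq1 demo", "ACGTAC", ">seq2", "ACG", ">seq3", "A"]
    sizes = list(read_fasta_lengths(example).values())
    print("records:", len(sizes))
    print("mean length:", mean_length(sizes))
    print("N50:", n50(sizes))
```

Because the parser iterates line by line and stores only one integer per record, the same functions can be applied to a large file by passing an open file handle rather than a list.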

Provider:
The University of Melbourne
Created:
2024-03-26