Supporting data for "a draft genome assembly of the sea slug Elysia chlorotica"
Source: Figshare · Updated 2019-01-07 · Indexed 2026-04-08
Download link:
https://figshare.com/articles/Supporting_data_for_a_draft_genome_assembly_of_the_sea_slug_Elysia_chlorotica_/7057916/2
Description:
Elysia chlorotica, a sacoglossan sea slug found off the East Coast of the United States, is well known for its ability to sequester chloroplasts from its algal prey and survive by photosynthesis for up to 12 months in the absence of food supply. Here we present a draft genome assembly of E. chlorotica that was generated using a hybrid assembly strategy with Illumina short reads and PacBio long reads. The genome assembly comprised 9,989 scaffolds, with a total length of 557 Mb and a scaffold N50 of 442 kb. BUSCO assessment indicated that 93.3% of the expected metazoan genes were completely present in the genome assembly. Annotation of the E. chlorotica genome identified 176 Mb (32.6%) of repetitive sequences and a total of 24,980 protein-coding genes. We anticipate that the annotated draft genome assembly of the E. chlorotica sea slug will promote the investigation of sacoglossan genetics, evolution, and particularly, the genetic signatures accounting for the long-term functioning of algal chloroplasts in an animal.

Genome assembly and annotation files provided in this dataset:
1. Elysia_chlorotica.fa.gz: genome assembly of Elysia chlorotica in FASTA format.
2. Elysia_chlorotica.gene.gff.gz: protein-coding gene annotation in GFF3 format.
3. Elysia_chlorotica.gene.cds.gz: coding sequences of the protein-coding genes in FASTA format.
4. Elysia_chlorotica.gene.pep.gz: peptide sequences of the protein-coding genes in FASTA format.
5. Elysia_chlorotica.ProteinMask.gff.gz: homology-based repetitive elements identified by searching against a TE protein database with RepeatProteinMask, in GFF3 format.
6. Elysia_chlorotica.RepeatMasker.gff.gz: homology-based repetitive elements identified by searching against Repbase with RepeatMasker, in GFF3 format.
7. Elysia_chlorotica.RepeatModeler.gff.gz: de novo repetitive elements identified by RepeatModeler followed by RepeatMasker, in GFF3 format.
8. Elysia_chlorotica.TRF.gff.gz: tandem repeats identified by Tandem Repeats Finder, in GFF3 format.
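For readers who want to check the scaffold count, total length, and scaffold N50 quoted in the description against the downloaded assembly, the following is a minimal sketch using only the Python standard library. The helper names `scaffold_lengths` and `n50` are illustrative and not part of the dataset, and the sketch assumes the file is an ordinary gzipped FASTA in which each header line starts with `>`. It uses the common definition of N50: the length of the shortest scaffold in the smallest set of longest scaffolds that together cover at least half of the assembly.

```python
import gzip
from typing import Iterable, Iterator


def scaffold_lengths(fasta_path: str) -> Iterator[int]:
    """Yield the length of each sequence in a FASTA file (gzipped or plain)."""
    opener = gzip.open if fasta_path.endswith(".gz") else open
    length = None  # stays None until the first header line is seen
    with opener(fasta_path, "rt") as handle:
        for line in handle:
            if line.startswith(">"):
                # A new header closes the previous record, if there was one.
                if length is not None:
                    yield length
                length = 0
            elif length is not None:
                length += len(line.strip())
    if length is not None:
        yield length


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of sequence lengths (0 if empty)."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running total reaches half is the N50.
        if running >= half:
            return length
    return 0
```

Assuming Elysia_chlorotica.fa.gz has been downloaded to the working directory, `lengths = list(scaffold_lengths("Elysia_chlorotica.fa.gz"))` followed by `len(lengths)`, `sum(lengths)`, and `n50(lengths)` gives figures that can be compared with the values stated in the description.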
Provided by:
Mitsuyasu Hasebe; Huanming Yang; Huimin Cai; Mingji Feng; Xiaodong Fang; Qiye Li; Shuaicheng Li
Created:
2019-01-07



