five

Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes"

收藏
DataCite Commons2025-05-26 更新2024-07-13 收录
下载链接:
http://gigadb.org/dataset/102229
下载链接
链接失效反馈
官方服务:
资源简介:
Insect silk is an incredibly versatile biomaterial. Lepidoptera and their sister lineage, Trichoptera, display some of the most diverse uses of silk with varying strength, adhesive qualities and elastic properties. It is well known that silk fibroin genes are long (&gt; 20 kb) and have many repetitive motifs. These features make these genes challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (<i>Plodia interpunctella</i>) and genomic sequences for the caddisfly, <i>Eubasilissa regina</i>. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO Complete = 99.3%/95.5%), with complete and contiguous recovery of silk <i>heavy fibroin</i> gene sequences. This study demonstrates that HiFi long-read sequencing can significantly help our understanding of genes with highly contiguous, repetitive regions.
提供机构:
GigaScience Database
创建时间:
2022-06-09
二维码
社区交流群
二维码
科研交流群
商业服务