five

A highly contiguous reference genome for the Steller's jay (Cyanocitta stelleri)

收藏
DataONE2023-05-17 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:71a6ef7fc1b3ed93ca32967e3dc232814cc9c64317373428a9f54ddfcb61fe81
下载链接
链接失效反馈
官方服务:
资源简介:
The Steller's jay is a familiar bird of western forests from Alaska south to Nicaragua. Here, we report a draft reference assembly for the species generated from PacBio HiFi long read and Omni-C chromatin-proximity sequencing data as part of the California Conservation Genomics Project (CCGP). Sequenced reads were assembled into 352 scaffolds totaling 1.16 Gb in length. Assembly metrics indicate a highly contiguous and complete assembly with a contig N50 of 7.8 Mb, scaffold N50 of 25.8 Mb, and BUSCO completeness score of 97.2%. Repetitive elements span 16.6% of the genome including nearly 90% of the W chromosome. Compared with high quality assemblies from other members of the family Corvidae, the Steller's jay genome contains a larger proportion of repetitive elements than four crow species (Corvus), but a lower proportion of repetitive elements than the California scrub-jay (Aphelocoma californica). This reference genome will serve as an essential resource for future studies on speciat..., We performed de novo repeat annotation of the draft Steller's jay reference assembly using the program RepeatModeler2 with the ltrstruct option selected to improve identification of LTR elements (Flynn et al. 2020). We next prioritized LTR and unclassified elements for manual curation that were at least 1000 base pairs in length. For each LTR consensus sequence, we used blastn (Camacho et al. 2009) to identify other members of each TE family in the genome, added 2000 bp of flanking sequence to both ends of each blastn hit, aligned extended sequences with mafft (Katoh & Standley 2013), and visualized the alignment in Aliview (Larsson et al. 2014). We confirmed the completeness of LTR elements based on the presence of canonical 5' TG and 3' CA dinucleotides at the termini of LTRs. A consensus sequence of the trimmed multiple sequence alignment was then generated using the cons tool in EMBOSS (Rice et al. 2000). For sequences labeled as unclassified, we used blastn to check for signifi...,
创建时间:
2025-07-23
二维码
社区交流群
二维码
科研交流群
商业服务