five

Sterechinus neumayeri v1 genome annotation

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/byjrp4xns7
下载链接
链接失效反馈
官方服务:
资源简介:
Genome annotation was performed using the BRAKER3 pipeline (Gabriel et al., 2023). A repeat library was generated with RepeatModeler (Smit & Hubley, 2010). The repeat families were compared to known echinoderm protein coding gene models using BLAST and any repeat with a significant hit (e-value < 5e-5) were removed. The resulting repeat library was used to identify and mask repeats using RepeatMasker prior to annotation (Smit et al., 2010). BRAKER was run using protein models from S. purpuratus (available on echinobase https://download.xenbase.org/echinobase/Genomics/Spur5.0/sp5_0_GCF.gff3.gz ; last accessed 2/15/24), L.variegatus (available from echinobase https://download.xenbase.org/echinobase/Genomics/Lvar3.0/Lvar3_0_GCF_proteins.fa.gz ; last accessed 2/15/24), and L.pictus (available from echinobase https://download.xenbase.org/echinobase/Genomics/Lpic2.1/Lpic2_1_GCF_proteins.fa.gz ; last accessed 2/15/24) along with publicly available transcriptomic data on NCBI sequencing read archive and in house RNAseq dataset spanning embryogenesis. The gene models were annotated using the notation SNE_XXXXXX and analyzed for ‘completeness’ with BUSCO version 4 using the metazoan gene set (Simão et al., 2015). GRN gene curation was first carried out by curating a list of gene regulatory network genes (supplemental file X). Then, orthofinder2 (Emms & Kelly, 2019) was used to identify orthologs between S.neumayeri, L.variegatus, S.purupratus, and recently published genome annotation for P.lividus (Emms & Kelly, 2019; Marlétaz et al., 2023).
创建时间:
2024-10-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作