five

T. indotineae reference genome annotation

收藏
DataCite Commons2024-10-22 更新2024-11-06 收录
下载链接:
https://figshare.com/articles/dataset/T_indotineae_reference_genome_annotation/27275004
下载链接
链接失效反馈
官方服务:
资源简介:
Annotation of the T. indotineae reference genome (GenBank GCA_023065905.1; strain TIMM20114). Annotation completed as follows: The <i>T. indotineae</i> reference genome was annotated by identifying repeat content using Repeatmodeler v2.0.3 (25) (engine NCBI) with CD-HIT (26), ReCon v1.08 (27), RMBlast v2.11.0, Tandem Repeats Finder v4.09 (28), RepeatScout v1.06 (29) and RepeatMasker v4.1.2-p1 (30). The output of Repeatmodeller was then used as a library for RepeatMasker to identify all instances of repeats and soft mask the genome (parameter -xsmall). Gene annotation on the repeat masked assembly was achieved using the Braker2 (31) pipeline (parameters –fungus –softmasking), which uses Samtools v0.1.19-44428cd (23), Bamtools v2.4.0 (32), Diamond v2.0.4 (33), Genemark-ET v4.15 (34), and Augustus v3.2.1 (35). The pipeline identified 5,451 protein-coding genes, and BUSCO assessment resulted in 90.8% complete BUSCOs.
提供机构:
figshare
创建时间:
2024-10-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作