Repeat annotation of Linum trigynum
收藏figshare.scilifelab.se2024-08-29 更新2025-01-21 收录
下载链接:
https://figshare.scilifelab.se/articles/dataset/Repeat_annotation_of_i_Linum_trigynum_i_/24324310/2
下载链接
链接失效反馈官方服务:
资源简介:
The file contains the annotation fo the repetitive regions in *Linum trigynum*. The detection of Repeat has been done with [RepeatMasker/RepeatModeler](https://www.repeatmasker.org/) using a custom repeat library. The genome has been deposited to [ENA](https://www.ebi.ac.uk/ena/browser/home) with the accession id: GCA_964030455 (project PRJEB67577). It is part of this article: Gutiérrez-Valencia, J., Zervakis, P. I., Postel, Z., Fracassetti, M., Losvik, A., Mehrabi, S., ... & Slotte, T. (2024). Genetic causes and genomic consequences of breakdown of distyly in Linum trigynum. Molecular Biology and Evolution, msae087. https://doi.org/10.1093/molbev/msae087The file is in BED format converted with [BEDOPS](https://bedops.readthedocs.io/en/latest/content/reference/file-management/conversion/rmsk2bed.html). The fourth (Repeat class) and eleventh (Repeat name) columns have been reversed to highlight the Repeat name in the visualization. Query sequence 1 chromosomeQuery start 2 startQuery end 3 stopRepeat class 4 idSmith-Waterman score 5 scoreStrand 6 strandPercentage, substitutions 7 Percentage, deleted bases 8 Percentage, inserted bases 9 Bases in query, past match 10 Repeat name 11 Bases in complement of the repeat consensus sequence 12 Match start 13 Match end 14 Unique ID 15 Higher-scoring match (optional) 16
该文件包含了亚麻(Linum trigynum)重复区域的注释。重复区域的检测是通过使用定制重复库的[RepeatMasker/RepeatModeler](https://www.repeatmasker.org/)进行的。基因组信息已存入[ENA](https://www.ebi.ac.uk/ena/browser/home),登录号为GCA_964030455(项目PRJEB67577)。该数据集是本文的研究成果:Gutiérrez-Valencia, J.,Zervakis, P. I.,Postel, Z.,Fracassetti, M.,Losvik, A.,Mehrabi, S.,... & Slotte, T.(2024). 亚麻(Linum trigynum)杂种不育的遗传原因和基因组后果. 分子生物学与进化,msae087. https://doi.org/10.1093/molbev/msae087。文件格式为BED,由[BEDOPS](https://bedops.readthedocs.io/en/latest/content/reference/file-management/conversion/rmsk2bed.html)转换而来。第四列(重复类别)和第十一列(重复名称)已进行反转,以在可视化中突出显示重复名称。查询序列、染色体查询起始位置、染色体查询结束位置、重复类别、Smith-Waterman得分、链、百分比、替换次数、百分比、删除碱基、百分比、插入碱基、查询序列中匹配后的碱基数、重复名称、重复一致序列互补链中的碱基数、匹配起始位置、匹配结束位置、唯一标识符、更高得分的匹配(可选)。
提供机构:
SciLifeLab



