Data sets for Polyplax serrata article "Highly-resolved genomes of two closely related lineages of the louse Polyplax serrata with different host specificities"
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/8381479
下载链接
链接失效反馈官方服务:
资源简介:
Supplementary data for Polyplax serrata article 2023
Data included in this repository were generated and used in various genomic and phylogenetic analysis presented by the publication "Highly-resolved genomes of two closely related lineages of the louse Polyplax serrata with different host specificities"
Description of the data and file structure
Data provided for each analyzed taxa include:
- Annotation table.
- fasta format files for transcripts (CDS and mRNA).
- fasta format file for genome.
- protein fasta file.
- gbk format file that includes the genome with its corresponding annotations.
Additionally, repeat families in fasta format were included for Polyplax serrata S and N lineages.
Sharing/Access information
GenBank accession number of analyzed taxa:
· Aedes Aegypti (GenBank accession no. GCF_002204515.2).
· Brueelia nebulsa ( GenBank accession no. GCA_028293925.1).
· Columbicola columbae (GenBank accession no. GCA_016920875.1).
· Cimex lectularis (GenBank accession no. GCF_000648675.2).
· Glossina morsitans (GenBank accession no. GCA_001077435.1).
· Pediculus humanus (GenBank accession no. GCA_000006295.1).
· Rhodnius prolixus (GenBank accession no. GCA_000181055.3).
· Polyplax serrata S lineage (GenBank accession no. JAWJWF000000000).
· Polyplax serrata N lineage (GenBank accession no. JAWJWE000000000).
Note: All the latter genomes except for the two genomes of Polyplax serrata S and N lineages, were acquired from GenBank database and were subjected to the same gene prediction and annotation workflow as P. serrata genomes to maintain methodological consistence in downstream analysis of the annotation results.
Software
- Gene prediction and annotation was performed using Funannotate v1.18.14 (https://github.com/nextgenusfs/funannotate)).
- Repeat were identified in the genomes of P. serrata S and N lineages using RepeatModeler v2.0.3.
创建时间:
2024-01-17



