D7 - Host-encoded plastid-associated proteins - Identity of the contigs
Source: DataCite Commons. Updated: 2023-08-02. Indexed: 2024-08-18
Download link:
https://figshare.com/articles/dataset/D7_-_Host-encoded_plastid-associated_proteins_-_Identity_of_the_contigs/21829755
Description:
To check the identity of the contigs containing the candidates, the identities of the upstream and downstream proteins were investigated (so these files do not contain the candidates themselves).

For each protein, there are 5 files:
1) A FASTA file of the protein and its top hits from database searches (top 20 hits from blastp_nr mode, blastp_tsa_nr mode, and SequenceServer 2.0.0 with the EukProt V3 database)
2) The alignment of the hits FASTA file, produced with MAFFT
3) The trimmed alignment after trimAl was applied
4) The phylogenetic tree inferred by IQ-TREE for the alignment file (run with ModelFinder to determine the best-fit model, 1000 ultrafast bootstrap replicates, and the SH-aLRT test)
5) The log file from the IQ-TREE run

Each file name contains the name of the protein, the co-assembly the candidate comes from, the node identifier, and the protein number (multiple protein numbers are present when manual curation was required to rejoin exons). The files for all proteins on a contig are kept together in one subfolder.
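The align, trim, and tree steps described in this record could be sketched roughly as follows. This is an illustration under stated assumptions, not the depositor's actual commands: the function names, the output file suffixes, the `iqtree2` executable name, the `--auto` MAFFT mode, the `-automated1` trimAl mode, the 1000 SH-aLRT replicates, and the choice to pass the trimmed alignment to IQ-TREE are all assumptions. Only ModelFinder and the 1000 ultrafast bootstrap replicates are stated in the description.

```python
import subprocess
from pathlib import Path


def build_pipeline_commands(hits_fasta, alrt_replicates=1000):
    """Return (argv, stdout_path) pairs for the align, trim, and tree steps.

    Hypothetical helper: output names and tool modes are assumptions,
    not taken from the dataset description.
    """
    src = Path(hits_fasta)
    aligned = src.with_name(src.stem + ".aln.fasta")
    trimmed = src.with_name(src.stem + ".trimmed.fasta")
    return [
        # MAFFT prints the alignment to stdout, so it needs a capture file.
        (["mafft", "--auto", str(src)], aligned),
        # trimAl mode is an assumption; the record does not say which was used.
        (["trimal", "-in", str(aligned), "-out", str(trimmed), "-automated1"], None),
        # -m MFP runs ModelFinder then tree inference; -B sets ultrafast
        # bootstrap replicates; -alrt sets SH-aLRT replicates (assumed count).
        (["iqtree2", "-s", str(trimmed), "-m", "MFP",
          "-B", "1000", "-alrt", str(alrt_replicates)], None),
    ]


def run_pipeline(hits_fasta):
    """Run each step in order, stopping at the first failure."""
    for argv, stdout_path in build_pipeline_commands(hits_fasta):
        if stdout_path is None:
            subprocess.run(argv, check=True)
        else:
            with open(stdout_path, "w") as out:
                subprocess.run(argv, check=True, stdout=out)
```

Calling `build_pipeline_commands("protein_hits.fasta")` only builds the argument lists, which makes it possible to inspect the commands before running anything; `run_pipeline` requires the three tools to be installed and on the PATH.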
Provider:
figshare
Created:
2023-08-02



