
D7 - Host-encoded plastid-associated proteins - Identity of the contigs

Source: DataCite Commons. Updated 2023-08-02. Indexed 2024-08-18.
Download link:
https://figshare.com/articles/dataset/D7_-_Host-encoded_plastid-associated_proteins_-_Identity_of_the_contigs/21829755
Description:
To check the identity of the contigs containing the candidates, the identities of the upstream and downstream proteins were investigated (so these files do not contain the candidates themselves).

For each protein, there are 5 files:
1) A FASTA file of the protein and its top hits from database searches (top 20 hits from blastp_nr mode, blastp_tsa_nr mode, and SequenceServer 2.0.0 with the EukProt V3 database)
2) The alignment of the hits FASTA file, produced with MAFFT
3) The trimmed alignment after trimAl was applied
4) The phylogenetic tree predicted by IQ-TREE for the alignment file (run with ModelFinder to determine the best-fit model, 1000 ultrafast bootstrap replicates, and the SH-aLRT test)
5) The log file from the IQ-TREE run

Each file name contains the name of the protein, the co-assembly the candidate comes from, the node identifier, and the protein number (multiple protein numbers are present when manual curation was required to rejoin exons). The proteins from each contig are kept together in one subfolder.
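The align, trim, and tree steps described above could be chained roughly as in the following Python sketch. It is an illustration under stated assumptions, not the authors' actual commands: the description does not give the MAFFT or trimAl options, the IQ-TREE version, the number of SH-aLRT replicates, or whether the tree was built from the trimmed or untrimmed alignment. The `--auto` and `-automated1` modes, the `iqtree2` executable name, the 1000 SH-aLRT replicates, the use of the trimmed alignment as tree input, and the output file names are all assumed here.

```python
import subprocess
from pathlib import Path


def build_commands(hits_fasta: str, sh_alrt_replicates: int = 1000) -> list[list[str]]:
    """Build the MAFFT, trimAl and IQ-TREE command lines for one hits FASTA file."""
    stem = Path(hits_fasta).with_suffix("")
    aligned = f"{stem}.aln.fasta"      # hypothetical output name
    trimmed = f"{stem}.trimmed.fasta"  # hypothetical output name
    return [
        # MAFFT prints the alignment to stdout; run_pipeline redirects it to `aligned`.
        ["mafft", "--auto", hits_fasta],
        # Assumed trimming mode; the dataset description does not state one.
        ["trimal", "-in", aligned, "-out", trimmed, "-automated1"],
        # -m MFP: ModelFinder plus tree search; -B: ultrafast bootstrap; -alrt: SH-aLRT.
        # Assumption: the trimmed alignment is the tree input.
        ["iqtree2", "-s", trimmed, "-m", "MFP", "-B", "1000",
         "-alrt", str(sh_alrt_replicates)],
    ]


def run_pipeline(hits_fasta: str) -> None:
    """Run the three steps in order; requires the tools to be on PATH."""
    mafft_cmd, trimal_cmd, iqtree_cmd = build_commands(hits_fasta)
    aligned = trimal_cmd[2]  # the path given to trimAl's -in option
    with open(aligned, "w") as out:
        subprocess.run(mafft_cmd, stdout=out, check=True)
    subprocess.run(trimal_cmd, check=True)
    subprocess.run(iqtree_cmd, check=True)
```

Calling `build_commands("hits.fasta")` only constructs the command lists, so the options can be inspected or adjusted before anything is executed; `run_pipeline` needs MAFFT, trimAl, and IQ-TREE 2 installed.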
Provider:
figshare
Created:
2023-08-02