ProteinCartography data accompanying "Identification of capsid-like proteins in venomous and parasitic animals"

Indexed by NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/12796463
Description:
This is the data for the ProteinCartography analysis in the pub "Identification of capsid-like proteins in venomous and parasitic animals." Note that ProteinCartography (v0.4.2) was run in "Cluster" mode or "From-folder" mode using the parameters set in config_ff.yml and the Ornithodoros turicata proteins. Notebooks used to fetch data and prepare custom plots can be found in the capsids GitHub repository.

Files in this data repository include:

output.zip is a zipped folder containing all of the output files for the Ornithodoros ProteinCartography run, including the maps, aggregated features files, and the all-v-all similarity matrix.
structures.zip is a zipped folder containing all the structures used in the ProteinCartography analysis. Structures beginning with "VOG" are viral capsid proteins folded using ESMFold.
ornithodoros_aggregated_features.tsv is a file containing all the metadata gathered for each protein in the analysis from either UniProt or the VOG database.
ornithodoros_aggregated_features_pca_umap.html is the final map of the capsid proteins and the Ornithodoros proteins, with metadata overlays.
ornithodoros_aggregated_features_pca_umap.tsv is a file containing all of the metadata gathered for each protein in the analysis, as well as the coordinates for the map.
ornithodoros_leiden_similarity.html is a heatmap showing the average within-cluster and between-cluster similarity for every cluster in the analysis.
tick_or_virus_umap.html is a version of the final map that specifically highlights which proteins are from capsids and which proteins are from Ornithodoros.
uniprot_features1.tsv is a file containing all of the UniProt metadata fetched for the Ornithodoros proteins, as well as the viral capsid VOG identifiers. This file is used as an input for the ProteinCartography analysis.
ornithodoros.txt is a file containing all of the Ornithodoros proteins fetched from the AlphaFold database.
config_ff.yml is the configuration file for the ProteinCartography run.
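
The .tsv files listed here are tab-separated tables, so they can be inspected with standard tooling. The following is a minimal sketch, not part of the original record, that reads such a table with Python's standard library and counts the values in one column. The column names (protein_id, cluster, origin) and the sample rows are hypothetical placeholders; the real headers in the ornithodoros_*.tsv files should be checked before relying on them.

```python
import csv
import io
from collections import Counter


def load_tsv(handle):
    """Read a tab-separated table into a list of dicts keyed by the header row."""
    return list(csv.DictReader(handle, delimiter="\t"))


def summarize(rows, column):
    """Count how often each value occurs in `column`; empty or missing cells count as 'NA'."""
    return Counter((row.get(column) or "NA") for row in rows)


# Tiny stand-in table. The column names and values are hypothetical and
# are NOT taken from the actual data files.
SAMPLE = (
    "protein_id\tcluster\torigin\n"
    "P0001\tC00\ttick\n"
    "VOG0001\tC00\tvirus\n"
    "P0002\tC01\ttick\n"
)

rows = load_tsv(io.StringIO(SAMPLE))
counts = summarize(rows, "origin")
print(len(rows), dict(counts))
```

To try this on a downloaded file, open it with open(path, newline="") and pass the handle to load_tsv, then look at rows[0].keys() to see which columns are actually present.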
Created:
2024-07-23