five

Supporting data for "Graph2GO: a multi-modal attributed network embedding method for inferring protein functions"

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100761
下载链接
链接失效反馈
官方服务:
资源简介:
Identifying protein functions is important for many biological applications. Since experimental functional characterization of proteins is time-consuming and costly, accurate and efficient computational methods for predicting protein functions are in great demand for generating testable hypotheses guiding large-scale experiments. Here we proposed Graph2GO, a multi-modal graph-based representation learning model that can integrate heterogeneous information including multiple types of interaction networks (sequence similarity network and protein-protein interaction network) and protein features (amino acid sequence, subcellular location and protein domains) to predict protein functions on Gene Ontology. Comparing Graph2GO to BLAST, a baseline model and two popular protein function prediction methods: Mashup and deepNF, we demonstrated that our model can achieve state-of-the-art performance. We show the robustness of our model by testing on multiple species. We also provide a web server supporting function query and downstream analysis on-the-fly. Graph2GO is the first model that utilized attributed network representation learning methods to model both interaction networks and protein features for predicting protein functions, and achieved promising performance. Our model can be easily extended to include more protein features to further improve the performance. Besides, Graph2GO is also applicable to other application scenarios involving biological networks and the learned latent representations can be used as feature inputs for machine learning tasks in various downstream analysis.
提供机构:
GigaScience Database
创建时间:
2020-06-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作