PEER

Name: PEER
Creator: Mila - Québec AI Institute
Published: 2022-09-20 01:31:38
License: 暂无描述

arXiv2022-09-20 更新2024-06-21 收录

下载链接：

https://github.com/DeepGraphLearning/PEER_Benchmark

下载链接

链接失效反馈

官方服务：

资源简介：

PEER是由Mila - Québec AI Institute创建的综合性多任务蛋白质序列理解基准数据集。该数据集涵盖了蛋白质功能预测、定位预测、结构预测、蛋白质-蛋白质相互作用预测以及蛋白质-配体相互作用预测等多个任务。数据集旨在通过提供多样化的蛋白质理解任务，评估不同基于序列的方法的性能，包括传统的特征工程方法、不同的序列编码方法以及大规模预训练的蛋白质语言模型。PEER数据集的应用领域广泛，旨在解决蛋白质序列分析中的关键问题，如蛋白质功能的预测和蛋白质相互作用的建模，从而推动机器学习在蛋白质研究中的应用。

PEER is a comprehensive multi-task benchmark dataset for protein sequence understanding, developed by the Mila - Québec AI Institute. This dataset encompasses multiple tasks including protein function prediction, subcellular localization prediction, protein structure prediction, protein-protein interaction prediction, and protein-ligand interaction prediction. It is designed to evaluate the performance of diverse sequence-based approaches by offering a broad range of protein understanding tasks, which include traditional feature engineering methods, various sequence encoding techniques, and large-scale pre-trained protein language models. The PEER dataset has broad application prospects, and aims to address key challenges in protein sequence analysis such as protein function prediction and protein interaction modeling, thereby advancing the application of machine learning in protein research.

提供机构：

Mila - Québec AI Institute

创建时间：

2022-06-05

搜集汇总

背景与挑战

背景概述

PEER是由Mila - Québec AI Institute创建的综合性多任务蛋白质序列理解基准数据集，涵盖蛋白质功能预测、定位预测、结构预测、蛋白质-蛋白质相互作用预测以及蛋白质-配体相互作用预测等多个任务。该数据集旨在通过多样化的蛋白质理解任务，评估不同基于序列的方法的性能，包括传统特征工程、序列编码和大规模预训练蛋白质语言模型，从而解决蛋白质序列分析中的关键问题，推动机器学习在蛋白质研究中的应用。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集