Discrimination of GO term annotated proteins based on amino acid occurrence and composition

Name: Discrimination of GO term annotated proteins based on amino acid occurrence and composition
Creator: Monash University
Published: 2026-02-12 08:03:33
License: 暂无描述

DataCite Commons2026-02-12 更新2026-05-04 收录

下载链接：

https://bridges.monash.edu/articles/dataset/Discrimination_of_GO_term_annotated_proteins_based_on_amino_acid_occurrence_and_composition/5619466

下载链接

链接失效反馈

官方服务：

资源简介：

In this paper, we have applied linear discriminant analysis and support vector machine for predicting GO term annotated proteins using amino acid occurrence/composition in uniref50 data set, i.e., uniprot with less than 50 % sequence identity.We found that our method could discriminate between proteins with at least one known GO term and those without any annotation at an AUC of 0.82 using three-fold cross validation test. Discrimination of the 38 most frequent GO terms is achieved with the maximum AUC of 0.91. Our method is solely based on amino acid sequence and hence it will be useful to predict GO term associations of newly obtained amino acid sequence without any annotated known homolog. PRIB 2008 proceedings found at: http://dx.doi.org/10.1007/978-3-540-88436-1 Contributors: Monash University. Faculty of Information Technology. Gippsland School of Information Technology ; Chetty, Madhu ; Ahmad, Shandar ; Ngom, Alioune ; Teng, Shyh Wei ; Third IAPR International Conference on Pattern Recognition in Bioinformatics (PRIB) (3rd : 2008 : Melbourne, Australia) ; Coverage: Rights: Copyright by Third IAPR International Conference on Pattern Recognition in Bioinformatics. All rights reserved.

提供机构：

Monash University

创建时间：

2026-02-11

5,000+

优质数据集

54 个

任务类型

进入经典数据集