Discrimination of GO term annotated proteins based on amino acid occurrence and composition
Source: DataCite Commons | Updated: 2026-02-12 | Indexed: 2026-05-04
Download link:
https://bridges.monash.edu/articles/dataset/Discrimination_of_GO_term_annotated_proteins_based_on_amino_acid_occurrence_and_composition/5619466
Description:
In this paper, we apply linear discriminant analysis and support vector machines to predict GO term annotated proteins from amino acid occurrence/composition in the UniRef50 data set, i.e., UniProt sequences with less than 50% sequence identity. Our method discriminates between proteins with at least one known GO term and those without any annotation with an AUC of 0.82 in a three-fold cross-validation test. For the 38 most frequent GO terms, discrimination reaches a maximum AUC of 0.91. Because the method is based solely on amino acid sequence, it should be useful for predicting GO term associations of newly obtained amino acid sequences that have no annotated known homolog. PRIB 2008 proceedings found at: http://dx.doi.org/10.1007/978-3-540-88436-1
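The description names two sequence-derived feature types and reports results as AUC. As a rough illustration only, the sketch below computes both feature vectors for one sequence and an AUC from classifier scores. It is not the authors' code: it assumes "occurrence" means raw residue counts and "composition" means those counts normalised to fractions, and the names `occurrence`, `composition`, and `auc` are hypothetical. The classifiers themselves (linear discriminant analysis, support vector machine) are not shown; their output scores would be passed to `auc`.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so every vector is aligned.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def occurrence(seq):
    """Raw count of each standard amino acid in seq (assumed meaning of 'occurrence')."""
    counts = Counter(seq.upper())
    return [counts.get(aa, 0) for aa in AMINO_ACIDS]


def composition(seq):
    """Fraction of each standard amino acid among the standard residues in seq."""
    occ = occurrence(seq)
    total = sum(occ)
    if total == 0:
        # No standard residues at all: return an all-zero vector.
        return [0.0] * len(AMINO_ACIDS)
    return [c / total for c in occ]


def auc(scores, labels):
    """AUC as the probability that a positive example outscores a negative one.

    labels are 1 (e.g. protein has a GO term) or 0 (no annotation);
    tied scores count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))
```

For example, `composition("ACCA")` gives 0.5 for both A and C and 0.0 for every other residue, yielding a 20-dimensional vector that could serve as classifier input. In a three-fold cross-validation, `auc` would be evaluated on the held-out fold each time.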
Contributors: Monash University. Faculty of Information Technology. Gippsland School of Information Technology ;
Chetty, Madhu ;
Ahmad, Shandar ;
Ngom, Alioune ;
Teng, Shyh Wei ;
Third IAPR International Conference on Pattern Recognition in Bioinformatics (PRIB) (3rd : 2008 : Melbourne, Australia) ;
Coverage:
Rights: Copyright by Third IAPR International Conference on Pattern Recognition in Bioinformatics. All rights reserved.
Provider:
Monash University
Created:
2026-02-11



