An efficient method for measuring the similarity of protein sequences
收藏Figshare2016-04-22 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/An_efficient_method_for_measuring_the_similarity_of_protein_sequences/3188995
下载链接
链接失效反馈官方服务:
资源简介:
An accurate numerical descriptor for protein sequence is introduced. It is basically a set of each three successive amino acids in the sequence (triplet), starting from left to right, in addition to the distances between each two successive amino acids in the triplet such that the summation of these distances does not exceed 8. This numerical descriptor combines two features the amino acid composition and the position of each amino acid relative to the other nearby amino acids. This numerical descriptor is used to measure the similarity between protein sequences in three sets: NADH dehydrogenase subunit 5 (ND5) proteins of different species, 24 transferrin proteins from vertebrates and 12 proteins of baculoviruses. High correlation coefficient values between our results and the results of ClustalW program are obtained. These values are higher than the values obtained in many other related works.
创建时间:
2016-04-22



