Residue-centric rate of sequence evolution according to Rate4Site on orthogroups of 14 fungal species
收藏DataCite Commons2021-02-09 更新2024-07-28 收录
下载链接:
https://figshare.com/articles/dataset/Residue-centric_rate_of_sequence_evolution_according_to_Rate4Site_on_orthogroups_of_14_fungal_species/13736935
下载链接
链接失效反馈官方服务:
资源简介:
Overall, 42 descriptors (features) are mapped on 2129854 residues comprising 3797 unique proteins.The legend for each descriptor is given in the associated header file.<br><br>Columns 1-4 provide protein identifiers:<br>- SGD identifier,<br>- ORF,<br>- UniprotKB,<br>- PDB code and chain of closest structure<br><br>Columns 5-8 correspond to the length in number of residues for each of the previously defined identifiers:<br>Columns 9-12 contain the residue number of each aligned position within the full sequence.<br><br>Columns 13-14 contain the residue name of each aligned position in respectively the sequence and the structure.<br><br>Columns 15-16 indicates whether aligned position correspond to a gap (indel) position in the sequence and the structure.<br><br>Columns 17-20 indicates whether the protein has disorder predictions from IUPRED or D2P2, evolutionary rate from Rate4Site and a PDB structure matching the protein.<br><br>Columns 21-24 contain the raw and normalized evolutionary rates at each position as well as the min. and max. evolutionary rate in the proteins (from raw values).<br><br>Columns 25-31 correspond to the information relevant to the PDB structure:- Identity % <br>- Overlap %<br>- Resolution<br>- Number of PDB matched<br>- Number of subunits in closest structure<br>- QSBIO error probability on quaternary structure assignment<br><br>Columns 32-36 correspond to the geometric descriptor which indicate whether the residue is:<br>- surface exposed<br>- buried<br>- at the interface<br>- relative Accessible Surface Area in monomeric form<br>-relative Accessible Surface Area in Biological UnitColumn 37-42 are indicating the various biophysical features mapped to the residue:<br>- predicted disordered by IUPRED<br>- predicted NOT disordered by IUPRED<br>- predicted disordered by D2P2<br>- predicted NOT disordered by D2P2<br>- predicted to be in PFAM or SUPERFAMILY domain<br>- predicted NOT to be in PFAM or SUPERFAMILY domain<br>
提供机构:
figshare
创建时间:
2021-02-08



