five

Residue-centric rate of sequence evolution according to Rate4Site on orthogroups of 14 fungal species

收藏
Figshare2021-02-08 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Residue-centric_rate_of_sequence_evolution_according_to_Rate4Site_on_orthogroups_of_14_fungal_species/13736935
下载链接
链接失效反馈
官方服务:
资源简介:
Overall, 42 descriptors (features) are mapped on 2129854 residues comprising 3797 unique proteins.The legend for each descriptor is given in the associated header file.Columns 1-4 provide protein identifiers:- SGD identifier,- ORF,- UniprotKB,- PDB code and chain of closest structureColumns 5-8 correspond to the length in number of residues for each of the previously defined identifiers:Columns 9-12 contain the residue number of each aligned position within the full sequence.Columns 13-14 contain the residue name of each aligned position in respectively the sequence and the structure.Columns 15-16 indicates whether aligned position correspond to a gap (indel) position in the sequence and the structure.Columns 17-20 indicates whether the protein has disorder predictions from IUPRED or D2P2, evolutionary rate from Rate4Site and a PDB structure matching the protein.Columns 21-24 contain the raw and normalized evolutionary rates at each position as well as the min. and max. evolutionary rate in the proteins (from raw values).Columns 25-31 correspond to the information relevant to the PDB structure:- Identity % - Overlap %- Resolution- Number of PDB matched- Number of subunits in closest structure- QSBIO error probability on quaternary structure assignmentColumns 32-36 correspond to the geometric descriptor which indicate whether the residue is:- surface exposed- buried- at the interface- relative Accessible Surface Area in monomeric form-relative Accessible Surface Area in Biological UnitColumn 37-42 are indicating the various biophysical features mapped to the residue:- predicted disordered by IUPRED- predicted NOT disordered by IUPRED- predicted disordered by D2P2- predicted NOT disordered by D2P2- predicted to be in PFAM or SUPERFAMILY domain- predicted NOT to be in PFAM or SUPERFAMILY domain
创建时间:
2021-02-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作