Structural ontogeny of protein-protein interactions
收藏DataONE2025-10-29 更新2025-11-01 收录
下载链接:
https://search.dataone.org/view/sha256:e957758c5051e44f9481ec6ef999d25554dfd4f968aa13b07c1390ec8a0c9578
下载链接
链接失效反馈官方服务:
资源简介:
Natural protein binding sites are often the most âdruggableâ sites on proteins, while alternative protein surfaces can be difficult targets. To explore the structural basis of this phenomenon, we used synthetic coevolution to engineer new interactions between naïve surfaces, simulating the de novo formation of protein complexes. We isolated seven distinct structural families of protein Z-domain complexes and found that synthetic complexes explore multiple shallow energy wells through ratchet-like docking modes, while complexes co-evolved from a natural binding surface converged in a deep energy well with a relatively fixed docking geometry. Epistasis analysis using machine learning to estimate fitness landscapes extracted âseedâ contacts emerging from silent surfaces between binding partners that anchored the earliest stages of encounter complex formation. These data suggest why natural binding sites attract binders: alternative surfaces have a shallow energy landscape that disfavo..., , # README: Coevolution Library Sequencing Data
This dataset contains the sequencing data obtained from the coevolution libraries in the study titled *\"Structural Ontogeny of Protein-Protein Interactions.\"*
The dataset provides information on the Z-A and Z-B protein pairs along with their respective read counts across multiple rounds of yeast display selection.
## Dataset Contents
* NGS Data: Parsed sequences for each Z-A and Z-B pair, along with their corresponding read counts.
## Data Processing and Analysis
All data processing and analysis procedures performed on the dataset are described in detail in the accompanying research paper.\
Note that we used filtered data for downstream coevolution analysis and machine learning as described in the paper.
## Dataset Structure
The file name is assigned as follows:
* `selection round` (e.g., naive, R1, R2)
## File Format
The processed data files are in tabular format with the following columns:
1. Z-A Sequence
2. Z-B Sequence
3. Rea...,
创建时间:
2025-10-30



