five

Structural ontogeny of protein-protein interactions

收藏
DataONE2025-10-29 更新2025-11-01 收录
下载链接:
https://search.dataone.org/view/sha256:e957758c5051e44f9481ec6ef999d25554dfd4f968aa13b07c1390ec8a0c9578
下载链接
链接失效反馈
官方服务:
资源简介:
Natural protein binding sites are often the most “druggable” sites on proteins, while alternative protein surfaces can be difficult targets. To explore the structural basis of this phenomenon, we used synthetic coevolution to engineer new interactions between naïve surfaces, simulating the de novo formation of protein complexes. We isolated seven distinct structural families of protein Z-domain complexes and found that synthetic complexes explore multiple shallow energy wells through ratchet-like docking modes, while complexes co-evolved from a natural binding surface converged in a deep energy well with a relatively fixed docking geometry. Epistasis analysis using machine learning to estimate fitness landscapes extracted “seed” contacts emerging from silent surfaces between binding partners that anchored the earliest stages of encounter complex formation. These data suggest why natural binding sites attract binders: alternative surfaces have a shallow energy landscape that disfavo..., , # README: Coevolution Library Sequencing Data This dataset contains the sequencing data obtained from the coevolution libraries in the study titled *\"Structural Ontogeny of Protein-Protein Interactions.\"* The dataset provides information on the Z-A and Z-B protein pairs along with their respective read counts across multiple rounds of yeast display selection. ## Dataset Contents * NGS Data: Parsed sequences for each Z-A and Z-B pair, along with their corresponding read counts. ## Data Processing and Analysis All data processing and analysis procedures performed on the dataset are described in detail in the accompanying research paper.\ Note that we used filtered data for downstream coevolution analysis and machine learning as described in the paper. ## Dataset Structure The file name is assigned as follows: * `selection round` (e.g., naive, R1, R2) ## File Format The processed data files are in tabular format with the following columns: 1. Z-A Sequence 2. Z-B Sequence 3. Rea...,
创建时间:
2025-10-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作