five

Protein foundation models: a comprehensive survey

收藏
中国科学数据2026-04-21 更新2026-04-25 收录
下载链接:
https://www.sciengine.com/AA/doi/10.1007/s11427-025-3147-2
下载链接
链接失效反馈
官方服务:
资源简介:
Protein foundation models (pFMs) have emerged as pivotal tools in advancing protein science. By leveraging advanced deep learning architectures trained on large-scale protein datasets, pFMs learn generalizable patterns in proteins, enabling accurate prediction of key protein characteristics and generation of novel proteins with tailored properties. In this review, we provide a comprehensive exploration of the developments, applications, challenges, and prospects of pFMs. We systematically examine the multimodal dataset resources that underpin the development of pFMs, ranging from protein sequences to experimentally resolved and predicted three-dimensional (3D) structures, functional annotations, and interaction networks. We explore the advances in pFMs—spanning autoencoding, autoregressive, diffusion, and flow matching models—and highlight their representative applications across fundamental biological research, protein discovery and engineering, and biomedical applications, thereby illustrating their versatility and impact. We also discuss major challenges, encompassing data bottlenecks, evaluation complexities, and model interpretability. Looking forward, we outline promising research directions, including modelling protein dynamism and interactions, as well as developing integrated virtual cell systems, paving the way for next-generation bioengineering and therapeutic development. This survey offers both a roadmap for computational biologists and a strategic framework for experimentalists who are applying pFMs in their work.
创建时间:
2025-11-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作