five

Integrated machine learning-driven QSAR modelling and systems biology approach for the identification of potential SARS-CoV-2 3CLpro inhibitors

收藏
DataCite Commons2025-11-19 更新2026-04-25 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Integrated_machine_learning-driven_QSAR_modelling_and_systems_biology_approach_for_the_identification_of_potential_SARS-CoV-2_3CLpro_inhibitors/30655800/1
下载链接
链接失效反馈
官方服务:
资源简介:
SARS-CoV-2 3C-like protease (3CLpro) is an essential viral enzyme responsible for processing viral polyproteins and is a validated target for small-molecule therapeutic intervention. This study presents an integrative cheminformatics and systems biology framework to identify and evaluate potential 3CLpro inhibitors. A curated dataset of 919 compounds from the CHEMBL database was used to develop predictive QSAR models based on substructure fingerprints and 1D/2D molecular descriptors. The best-performing models achieved strong correlation coefficients (0.9736 for training and 0.7413 for testing), and key molecular features were identified using feature importance analysis. A web-based tool, 3CLpro-Pred was developed to enable rapid prediction of compound bioactivity. Molecular docking and dynamics simulations further validated QSAR findings by elucidating key atomic-level interactions at the protease active site. Top hit compounds were prioritized for systems-level analysis at the host-pathogen interface, where gene ontology and KEGG pathway enrichment revealed their potential to modulate key host signalling pathways. Critical regulatory genes, including <i>TBK1, PIK3CA, IKBKB, GSK3B</i>, and <i>CASP3</i>, were identified as major nodes linking compound action to antiviral and immune processes. This study delivers a combinatorial computational pipeline to accelerate the discovery of 3CLpro-targeted antivirals, providing mechanistic insights and a shortlist of candidates for future experimental validation.
提供机构:
Taylor & Francis
创建时间:
2025-11-19
二维码
社区交流群
二维码
科研交流群
商业服务