five

Data Sheet 1_NeoTImmuML: a machine learning-based prediction model for human tumor neoantigen immunogenicity.zip

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Data_Sheet_1_NeoTImmuML_a_machine_learning-based_prediction_model_for_human_tumor_neoantigen_immunogenicity_zip/30414553
下载链接
链接失效反馈
官方服务:
资源简介:
IntroductionTumor neoantigens possess high specificity and immunogenicity, making them crucial targets for personalized cancer immunotherapies such as mRNA vaccines and T-cell therapies. However, experimental identification and evaluation of their immunogenicity are time-consuming, which limits the efficiency of vaccine development. MethodsTo address these challenges, we implemented two key strategies. First, we upgraded the TumorAgDB database by integrating publicly available neoantigen data from the past two years, resulting in TumorAgDB2.0. Second, we developed NeoTImmuML, a weighted ensemble machine learning model for predicting neoantigen immunogenicity. Using data from TumorAgDB2.0, we calculated the physicochemical properties of peptides and systematically evaluated eight machine learning algorithms via five-fold cross-validation. The top-performing algorithms — LightGBM, XGBoost, and Random Forest — were integrated into a weighted ensemble model. ResultsTumorAgDB2.0 (https://tumoragdb.com.cn) now contains 187,223 entries. Moreover, NeoTImmuML demonstrated strong generalization performance on both internal and external test datasets. SHAP feature importance analysis revealed that peptide hydrophilicity and length are key determinants of immunogenicity. DiscussionTumorAgDB2.0 provides a comprehensive data resource for neoantigen research, while NeoTImmuML offers an efficient and interpretable tool for predicting neoantigen immunogenicity. Together, they offer valuable support for the design of personalized neoantigen vaccines and the development of cancer immunotherapy strategies.
创建时间:
2025-10-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作