five

Prototype Selection Method Based on the Rivality and Reliability Indexes for the Improvement of the Classification Models and External Predictions

收藏
Figshare2020-04-26 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Prototype_Selection_Method_Based_on_the_Rivality_and_Reliability_Indexes_for_the_Improvement_of_the_Classification_Models_and_External_Predictions/12198243
下载链接
链接失效反馈
官方服务:
资源简介:
Prototype or instance selection techniques is an important field of research in knowledge discovery, data mining, and machine learning. In QSAR, the use of prototype selection techniques in the preprocessing stage of the construction of the QSAR models favors the data set curation, improving the interpretability and accuracy of the models as well as the performance of the algorithms. In this paper, we propose an efficient method for prototype selection to be used in the preprocessing stage of the construction of QSAR classification models. The proposed method is able to generate very high reduction rates in the cardinality of the training set while maintaining or even increasing the accuracy of the classification models. The validation of the method has been carried out by means of the prediction of external molecules, demonstrating that the prediction of new molecules is also maintained or even improved. The method has been tested using 40 benchmark data sets of different sizes and balancing ratios; the results of the tests have demonstrated the wide applicability domain of the proposed method.
创建时间:
2020-04-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作