five

Performance of algorithms (area under ROC curve, AUC) in the rediscovery experiment using only NEM316 genome.

收藏
Figshare2015-12-02 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/_Performance_of_algorithms_area_under_ROC_curve_AUC_in_the____rediscovery_experiment_using_only_NEM316_genome_/456661
下载链接
链接失效反馈
官方服务:
资源简介:
This analysis evaluated the relative performance of each algorithm to rediscover virulence genes by applying stratified n-fold cross-validations with of the entire set of S. agalactiae NEM316 genes serving as test-set in each fold. Each fold of training set comprised positive and negative examples.n: number of virulence genes in the category. Singleton virulence gene categories were excluded from this analysis, as it is not possible to perform cross-validations on training sets with n = 1. All but one (labeled*) AUCs reached the statistical significance level at α = 0.05 (two-tailed Mann-Whitley U-test). At least 3 out of 4 algorithms were still significant after adjustment for multiple testing (across the family of 4 algorithms) by the Bonferroni method. Abbreviations: ADTree: alternating decision tree; IBk: nearest neighbor classifier; SVM: support vector machine; RBF: SVM with radial basis function; Poly: SVM with polynomial kernel. Refer to the methods section for the parameters used to train the machine learning algorithms. The numbers in bold face indicate the best performing algorithm for a given category.
创建时间:
2015-12-02
二维码
社区交流群
二维码
科研交流群
商业服务