five

10-fold Cross-Validation Area-Under-the-Curve scores (mean ± st.dev.) of domain vs non-domain representations in the protein inference tasks. Overall, there is consistency between region enrichment of the top Hist-8000 features selected and best-performing Hist-8000 region-specific representations, between domain and non-domain regions per task (tasks 1-8). Best-performing methods in bold. See section ‘Protein inference problems’ for tasks sources. Hist-8000: Histogram-8000, SoT: Sum-of-learnt-Trigrams, AUC: Area Under the Curve, 10foldCV: 10-fold Cross-Validation, VFs: Virulence Factors, Gram-pos: Gram-positive, Gram-neg: Gram-negative, st.dev: standard deviation, doms: domains, non-doms: non-domains, #Top feats: number of top features.

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://figshare.com/articles/dataset/10-fold_Cross-Validation_Area-Under-the-Curve_scores_mean_st_dev_of_domain_vs_non-domain_representations_in_the_protein_inference_tasks_Overall_there_is_consistency_between_region_enrichment_of_the_top_Hist-8000_features_selected_and_best-p/29845908
下载链接
链接失效反馈
官方服务:
资源简介:
10-fold Cross-Validation Area-Under-the-Curve scores (mean ± st.dev.) of domain vs non-domain representations in the protein inference tasks. Overall, there is consistency between region enrichment of the top Hist-8000 features selected and best-performing Hist-8000 region-specific representations, between domain and non-domain regions per task (tasks 1-8). Best-performing methods in bold. See section ‘Protein inference problems’ for tasks sources. Hist-8000: Histogram-8000, SoT: Sum-of-learnt-Trigrams, AUC: Area Under the Curve, 10foldCV: 10-fold Cross-Validation, VFs: Virulence Factors, Gram-pos: Gram-positive, Gram-neg: Gram-negative, st.dev: standard deviation, doms: domains, non-doms: non-domains, #Top feats: number of top features.
创建时间:
2025-08-06
二维码
社区交流群
二维码
科研交流群
商业服务