five

Experimental Data Set for the study "Exploratory Landscape Analysis is Strongly Sensitive to the Sampling Strategy"

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/3886815
下载链接
链接失效反馈
官方服务:
资源简介:
This are the feature values used in the study  "Exploratory Landscape Analysis is Strongly Sensitive to the Sampling Strategy". The dataset regroups feature values for every "cheap" features available in the R package flacco and are computed using 5 sampling strategies and in dimension \($d=5$\): Random: the classical Mersenne-Twister algorithm; Randu: a random number generator that is notoriously bad; LHS: a centered Latin Hypercube Design; iLHS: an improved Latin Hypercube Design; Sobol: points extracted from a Sobol' low-discrepancy sequence. The csv file features_summury_dim_5_ppsn.csv regroups 100 values for every features whereas features_summury_dim_5_ppsn_median.csv regroups for every feature the median of the 100 values. In the folder PPSN_feature_plots are the histograms of feature values on the 24 COCO functions for 3 sampling strategies: Random, LHS and Sobol. The Python file sampling_ppsn.py is the code used to generate the sample points from which the feature values are computed. The file stats50_knn_dt.csv provide the raw data of median and IQR (inter quartile interval) for the heatmaps and boxplots available in the paper. Finally, the files results_classif_knn100.csv (resp. dt) provide the accuracy of 100 classifications for every settings.
创建时间:
2021-01-28

社区讨论

ArXiv论文作者在Figshare上也放了数据集: https://figshare.com/collections/FUMPE/4107803/1

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作