Gaussian Blobs of Varying numbers of samples, centers and features
Source: DataCite Commons | Updated: 2020-11-12 | Indexed: 2025-04-16
Download link:
https://ieee-dataport.org/open-access/gaussian-blobs-varying-numbers-samples-centers-and-features
Description:
The dataset contains Gaussian blobs with varying numbers of samples, centers, and features. The number of samples ranges from 500 to 50,000, the number of centers from 2 to 100, and the number of features from 2 to 2048. These sets of Gaussian blobs can be used to test the scalability and effectiveness of clustering algorithms. Each compressed set contains two kinds of files: files ending in "_X.csv" hold the datapoints, and files ending in "_y.csv" hold the corresponding class data. The filename of each Gaussian blob summarizes its contents. For example, the file "s50000_c50_f2048_X.csv" contains 50,000 samples with 2048 dimensions (features) and 50 centers, and the file "s50000_c50_f2048_y.csv" holds the class data for "s50000_c50_f2048_X.csv". The blob files are grouped by number of samples. For example, the compressed file "10,000 datapoints set.zip" contains the Gaussian blobs that have 10,000 samples, with varying numbers of centers and features. The documentation section includes a PDF document that lists the files inside each compressed file.
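The naming scheme described above can be decoded programmatically. The following is a minimal Python sketch, not part of the dataset itself: the regular expression is inferred from the single example filename in the description, and the names `BlobInfo`, `parse_blob_filename`, and `labels_filename` are hypothetical helpers introduced here for illustration. It assumes every file follows the same `s<samples>_c<centers>_f<features>_<X|y>.csv` pattern.

```python
import re
from typing import NamedTuple

# Pattern inferred from the example name "s50000_c50_f2048_X.csv";
# it is an assumption that all files in the archives follow it.
_BLOB_NAME = re.compile(r"^s(\d+)_c(\d+)_f(\d+)_(X|y)\.csv$")


class BlobInfo(NamedTuple):
    """Hypothetical container for the values encoded in a filename."""
    samples: int
    centers: int
    features: int
    kind: str  # "X" for datapoints, "y" for class data


def parse_blob_filename(name: str) -> BlobInfo:
    """Extract sample, center, and feature counts from a blob filename."""
    match = _BLOB_NAME.match(name)
    if match is None:
        raise ValueError(f"unexpected blob filename: {name!r}")
    samples, centers, features, kind = match.groups()
    return BlobInfo(int(samples), int(centers), int(features), kind)


def labels_filename(data_name: str) -> str:
    """Return the "_y.csv" filename that pairs with a "_X.csv" filename."""
    info = parse_blob_filename(data_name)
    if info.kind != "X":
        raise ValueError(f"expected a datapoint file ending in _X.csv: {data_name!r}")
    return f"s{info.samples}_c{info.centers}_f{info.features}_y.csv"


if __name__ == "__main__":
    print(parse_blob_filename("s50000_c50_f2048_X.csv"))
    print(labels_filename("s50000_c50_f2048_X.csv"))
```

With the pair of filenames in hand, the two CSV files can be read with any CSV reader. Whether the files carry a header row is not stated in the description, so that should be checked against the actual files before loading.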
Provider:
IEEE DataPort
Created:
2020-11-12



