
Gaussian Blobs of Varying numbers of samples, centers and features

IEEE | Updated 2020-11-24 | Indexed 2026-04-17
Download link:
https://ieee-dataport.org/open-access/gaussian-blobs-varying-numbers-samples-centers-and-features
Description:
The dataset contains Gaussian blobs with varying numbers of samples, centers, and features. The number of samples ranges from 500 to 50,000, the number of centers from 2 to 100, and the number of features from 2 to 2048. These sets of Gaussian blobs can be used to test the scalability and effectiveness of clustering algorithms.

Each compressed set contains two kinds of files. Files ending with _X.csv contain the data points, while files ending with _y.csv contain the corresponding class data. The filename of each Gaussian blob describes its contents. For example, the file s50000_c50_f2048_X.csv contains 50,000 samples with 2048 dimensions (features) and 50 centers, and the file s50000_c50_f2048_y.csv holds the class data for s50000_c50_f2048_X.csv.

The blob files are organized by number of samples. For example, the compressed file 10,000 datapoints set.zip contains Gaussian blobs with 10,000 samples each and varying numbers of centers and features. The documentation section includes a PDF document that lists the files inside each compressed file.

The file naming convention uses the following letters to describe the content of each file:
s represents the number of samples
c represents the number of centers
f represents the number of features
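The naming convention described above can be decoded programmatically. The following is a minimal sketch that assumes every file follows the s<samples>_c<centers>_f<features>_X.csv / _y.csv pattern exactly as in the example; the helper names parse_blob_filename and label_file_for are hypothetical and are not part of the dataset or its documentation.

```python
import re
from typing import NamedTuple

# Matches names such as "s50000_c50_f2048_X.csv" (pattern assumed from the
# example in the dataset description).
_BLOB_NAME = re.compile(r"^s(\d+)_c(\d+)_f(\d+)_(X|y)\.csv$")


class BlobFile(NamedTuple):
    samples: int   # value after "s"
    centers: int   # value after "c"
    features: int  # value after "f"
    kind: str      # "X" for data points, "y" for class data


def parse_blob_filename(name: str) -> BlobFile:
    """Split a blob filename into its sample, center, and feature counts."""
    match = _BLOB_NAME.match(name)
    if match is None:
        raise ValueError(f"not a recognized blob filename: {name!r}")
    samples, centers, features, kind = match.groups()
    return BlobFile(int(samples), int(centers), int(features), kind)


def label_file_for(data_name: str) -> str:
    """Return the _y.csv name that pairs with a given _X.csv name."""
    info = parse_blob_filename(data_name)
    if info.kind != "X":
        raise ValueError(f"expected a file ending in _X.csv, got {data_name!r}")
    return data_name[: -len("_X.csv")] + "_y.csv"
```

For instance, parse_blob_filename("s50000_c50_f2048_X.csv") yields 50000 samples, 50 centers, and 2048 features, and label_file_for maps the same name to s50000_c50_f2048_y.csv. The sketch does not read the CSV contents, since the description does not state whether the files carry a header row.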
Provided by:
Sharma, Sadiksha
Created:
2020-11-24