five

Simulated Hierarchical Benchmark Dataset to assess dendro-classification methods (hierarchical classification)

收藏
DataONE2018-02-13 更新2024-06-25 收录
下载链接:
https://search.dataone.org/view/e3488d912288f92f46ccab084bfc6c88
下载链接
链接失效反馈
官方服务:
资源简介:
A hierarchically ordered distribution of 3D-points was created with matlab. It contains 120,000 datapoints in five hierarchical levels with one to four child nodes per parent. Data values for the three axes range betwwen 0 and 1. The structure can be seen in the attached figure. In each hierarchical level different distributions of datapoints are implemented. This allows to test classifiers under various conditions. The most common distribution in the dataset is a simple gaussian distributed point cloud. Other sampled distributions are a spherical distribution (sphere in 3D), or a circular (donut) distribution along different axes. XOR distributions are implemented in different patterns, e.g. four batches with crossed classes or eight batches with two or four classes. The most complex data distribution is the springroll, where the datapoints are intertwined into one another. To create indistinguishable cases, where the prediction of a classifier is supposed to perform bad, some datapoints are just randomly intermixed with another class. The .csv-file contains four columns: label | x-coordinate | y-coordinate | z-coordinate The label for each sample provides all hierarchical information needed. Each label is composed of five digits, one for each hierarchical level. As an example: Sample '11421': Hierarchical level 1: class 1 Hierarchical level 2: class 1 Hierarchical level 3: class 4 Hierarchical level 4: class 2 Hierarchical level 5: class 1
创建时间:
2018-02-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作