clustering benchmark dataset collections

Name: clustering benchmark dataset collections
Creator: Faculty of Mathematics and Information Science, Warsaw University of Technology
Published: 2023-10-26 06:32:18
License: not specified

Source: arXiv. Updated: 2023-10-26. Indexed: 2024-06-21.

Download link:

https://clustering-benchmarks.gagolewski.com


Description:

This is a collection of clustering benchmark datasets developed by Marek Gagolewski of the Faculty of Mathematics and Information Science, Warsaw University of Technology. It aims to provide a consistent way to test clustering algorithms. The datasets vary in dimensionality, size, and cluster type, so that algorithms can be assessed on different kinds of problems. During development, the datasets were aggregated, polished, and standardized to ensure their quality and suitability. The collection is used mainly in machine learning and data mining to evaluate and compare clustering algorithms.
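As a rough illustration of what evaluating a clustering algorithm against a benchmark can involve, the sketch below scores a predicted labelling against reference labels with the adjusted Rand index. The choice of this index is an assumption made for illustration; it is a common external validity measure, not necessarily the one the collection's author recommends. The function name `adjusted_rand_index` and the toy label vectors are hypothetical and do not come from the collection.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two labellings of the same points.

    Illustrative helper (an assumption, not part of the benchmark
    collection). Label values only need to be hashable; their names
    do not matter, only the induced partitions.
    """
    if len(labels_true) != len(labels_pred):
        raise ValueError("both labellings must cover the same points")
    n = len(labels_true)
    if n < 2:
        return 1.0  # no pairs of points to disagree on

    # Contingency counts: joint cells and the two marginals.
    joint = Counter(zip(labels_true, labels_pred))
    true_sizes = Counter(labels_true)
    pred_sizes = Counter(labels_pred)

    # Number of point pairs placed together in both / each partition.
    together_both = sum(comb(c, 2) for c in joint.values())
    together_true = sum(comb(c, 2) for c in true_sizes.values())
    together_pred = sum(comb(c, 2) for c in pred_sizes.values())

    expected = together_true * together_pred / comb(n, 2)
    maximum = (together_true + together_pred) / 2
    if maximum == expected:
        return 1.0  # degenerate case, e.g. a single cluster in both
    return (together_both - expected) / (maximum - expected)


# Toy reference labels and two hypothetical algorithm outputs.
reference = [0, 0, 1, 1]
same_partition = [1, 1, 0, 0]   # same grouping, different label names
crossed = [0, 1, 0, 1]          # splits every reference cluster

print(adjusted_rand_index(reference, same_partition))
print(adjusted_rand_index(reference, crossed))
```

A score of 1 means the two partitions agree exactly up to renaming of labels, and values near 0 indicate agreement no better than chance. Applying the same scoring function to every dataset is one way to keep comparisons between algorithms consistent.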

Provided by:

Faculty of Mathematics and Information Science, Warsaw University of Technology

Created:

2022-09-20
