Data envelopment analysis clustering: an optimization based machine learning

Name: Data envelopment analysis clustering: an optimization based machine learning
Creator: Zhu, Joe; Zhao, Jing; An, Qingxian
Published: 2025-07-31 00:00:00
License: 暂无描述

Taylor & Francis Group2025-07-31 更新2026-04-16 收录

下载链接：

https://tandf.figshare.com/articles/dataset/Data_envelopment_analysis_clustering_an_optimization_based_machine_learning/29750079/1

下载链接

链接失效反馈

官方服务：

资源简介：

Clustering algorithms are commonly used to group units based on their intrinsic characteristics, represented by various features, typically divided into cost features and benefit features. Most existing clustering methods analyze these features based on distance functions, and the cost-benefit relationship is not a focus. In this study, we approach clustering from the perspective of constructing a composite index reflecting the cost-benefit ratio. We utilize data envelopment analysis (DEA) to develop a “k-DEA-weights” clustering method to identify clusters and reveal the relative degree of importance of different features within each cluster to enhance the explainability of clustering outcomes. The proposed method can be viewed as an optimization-based explainable machine learning technique. We first employ a DEA cross-efficiency model to calculate cross-evaluated weights for units and a group DEA cross-efficiency model to obtain common weights for clusters. Through an iterative heuristic algorithm, we optimize the clustering process by minimizing the distance between the cross-evaluated weights and the common weights within a cluster. To validate its effectiveness, we conduct numerical experiments that demonstrate strong clustering performance. We then apply it to Chunyu Doctor platform, a Chinese online healthcare consultation platform. The results provide actionable insights for optimizing services and improving user experience.

提供机构：

Zhu, Joe; Zhao, Jing; An, Qingxian

创建时间：

2025-07-31

5,000+

优质数据集

54 个

任务类型

进入经典数据集