five

Data envelopment analysis clustering: an optimization based machine learning

收藏
Taylor & Francis Group2025-07-31 更新2026-04-16 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Data_envelopment_analysis_clustering_an_optimization_based_machine_learning/29750079/1
下载链接
链接失效反馈
官方服务:
资源简介:
Clustering algorithms are commonly used to group units based on their intrinsic characteristics, represented by various features, typically divided into cost features and benefit features. Most existing clustering methods analyze these features based on distance functions, and the cost-benefit relationship is not a focus. In this study, we approach clustering from the perspective of constructing a composite index reflecting the cost-benefit ratio. We utilize data envelopment analysis (DEA) to develop a “k-DEA-weights” clustering method to identify clusters and reveal the relative degree of importance of different features within each cluster to enhance the explainability of clustering outcomes. The proposed method can be viewed as an optimization-based explainable machine learning technique. We first employ a DEA cross-efficiency model to calculate cross-evaluated weights for units and a group DEA cross-efficiency model to obtain common weights for clusters. Through an iterative heuristic algorithm, we optimize the clustering process by minimizing the distance between the cross-evaluated weights and the common weights within a cluster. To validate its effectiveness, we conduct numerical experiments that demonstrate strong clustering performance. We then apply it to Chunyu Doctor platform, a Chinese online healthcare consultation platform. The results provide actionable insights for optimizing services and improving user experience.
提供机构:
Zhu, Joe; Zhao, Jing; An, Qingxian
创建时间:
2025-07-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作