five

Data and scripts for "An Exploratory Study on Machine Learning Model Management"

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10602340
下载链接
链接失效反馈
官方服务:
资源简介:
AbstractEffective model management is crucial for ensuring performance and reliability in Machine Learning (ML) systems, given the dynamic nature of data and operational environments. However, standard practices are lacking, often resulting in ad hoc approaches. To address this, our research provides a clear definition of ML model management activities, processes, and techniques. Analyzing 227 ML repositories, we propose a taxonomy of 16 model management activities and identify 12 unique challenges. We highlight documentation and bug fixing as two of the most critical model management activities. Additionally, our findings indicate a significant shift towards automation of the ML pipeline, emphasizing the adoptions of tools for data, model, and documentation versioning. To offer practical guidance, we conducted a survey with industry practitioners and academic researchers to understand how model management challenges can be addressed. Our contributions include a detailed taxonomy of model management activities, a mapping of challenges to these activities, practitioner-informed solutions for challenge mitigation, and a publicly available dataset of model management activities and challenges. This work aims to equip ML developers with knowledge and best practices essential for the robust management of ML models.
创建时间:
2024-01-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作