five

Dependency Update Strategies and Package Characteristics

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5643627
下载链接
链接失效反馈
官方服务:
资源简介:
This is the replication package for our paper on predicting dependency update strategies. Here is a short description of what is contained in this package: Pre-processing Code The data_preparation.py file filters the initial libraries.io dataset to only include relevant columns for the npm packages. The feature_extraction.py file includes the extraction of model features. The feature_process.py file includes the majority of preprocessing scripts to derive new features, clean-up missing values and prepare the data for the models. ML Models The models.py file contains the scripts for training, validating and evaluating the random forest model and the two baselines (stratified random and SemVer only models) used for the study. Datasets The raw dataset can be downloaded from libraries.io. The Processed_Project_Features[SP51][RT].csv dataset is the result of all preprocessing steps and is used in the model feed function. Visualization Scripts The visualizer.py file includes the visualization scripts used for the paper. Sampled Packages The complete set of visualizations for the 160 sampled packages in RQ3.
创建时间:
2022-05-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作