Dependency Update Strategies and Package Characteristics

NIAID Data Ecosystem2026-03-13 收录

下载链接：

https://zenodo.org/record/5643627

下载链接

链接失效反馈

官方服务：

资源简介：

This is the replication package for our paper on predicting dependency update strategies. Here is a short description of what is contained in this package: Pre-processing Code The data_preparation.py file filters the initial libraries.io dataset to only include relevant columns for the npm packages. The feature_extraction.py file includes the extraction of model features. The feature_process.py file includes the majority of preprocessing scripts to derive new features, clean-up missing values and prepare the data for the models. ML Models The models.py file contains the scripts for training, validating and evaluating the random forest model and the two baselines (stratified random and SemVer only models) used for the study. Datasets The raw dataset can be downloaded from libraries.io. The Processed_Project_Features[SP51][RT].csv dataset is the result of all preprocessing steps and is used in the model feed function. Visualization Scripts The visualizer.py file includes the visualization scripts used for the paper. Sampled Packages The complete set of visualizations for the 160 sampled packages in RQ3.

创建时间：

2022-05-18

5,000+

优质数据集

54 个

任务类型

进入经典数据集