Replication package for DRAGON: Robust Classification for Very Large Collections of Software Repositories
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/15020641
下载链接
链接失效反馈官方服务:
资源简介:
DRAGON: Multi-Label Classification Replication Package
This archive contains the replication package for the DRAGON multi-label classification models, which leverage BERT-based architectures. The package includes scripts for repository mining, dataset creation, data processing, model training, and evaluation.
Key Components:
Repository Mining: Scripts to extract repositories for dataset creation.
Dataset Preparation: Jupyter notebooks for cleaning and transforming data.
Data Processing: Conversion into a Hugging Face dataset format.
Model Training: Training scripts for DRAGON and LEGION, with configurable preprocessing options.
Evaluation: Threshold tuning and performance assessment.
创建时间:
2025-03-13



