DEFault++ benchmark v1

Name: DEFault++ benchmark v1
Creator: Zenodo
Published: 2026-05-04 04:38:32
License: 暂无描述

DataCite Commons2026-05-04 更新2026-05-07 收录

下载链接：

https://zenodo.org/doi/10.5281/zenodo.20018623

下载链接

链接失效反馈

官方服务：

资源简介：

DEFault++ benchmark v1: Hierarchical fault diagnosis of fine-tuned HuggingFace transformers Contents- encoder_merged.csv (6,042 rows × 2,121 cols): trainer-ready feature vectors for 4 encoder models × 6 GLUE tasks (BERT, DistilBERT, RoBERTa, DistilRoBERTa on MRPC, QNLI, QQP, RTE, SST-2, STS-B).- decoder_merged.csv (2,535 rows × 3,130 cols): same shape for 4 decoder models × 5 LM tasks (GPT-2, DistilGPT-2, GPT-Neo-125M, OPT-125M on Lambada, OpenWebText, PTB, WikiText).- 35 per-task source CSVs and feature dictionary for full reproducibility.- Pre-computed deduplication, NaN handling, log-transforms, and CV filtering applied; downstream layer aggregation runs at training time via FeatureProcessor. Companion code: https://github.com/SigmaJahan/DEFaultplusplus-Transformer-Debugging Built from fine-tuning runs of 11,251 encoder + 4,282 decoder configurations covering ~13 fault categories and ~44 root causes (see feature_dictionary.csv for the complete catalog). Citation: see the DOI on this page.

提供机构：

Zenodo

创建时间：

2026-05-04

5,000+

优质数据集

54 个

任务类型

进入经典数据集