five

DEFault++ benchmark v1

收藏
DataCite Commons2026-05-04 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.20018623
下载链接
链接失效反馈
官方服务:
资源简介:
DEFault++ benchmark v1: Hierarchical fault diagnosis of fine-tuned HuggingFace transformers Contents- encoder_merged.csv (6,042 rows × 2,121 cols): trainer-ready feature  vectors for 4 encoder models × 6 GLUE tasks (BERT, DistilBERT,  RoBERTa, DistilRoBERTa on MRPC, QNLI, QQP, RTE, SST-2, STS-B).- decoder_merged.csv (2,535 rows × 3,130 cols): same shape for 4  decoder models × 5 LM tasks (GPT-2, DistilGPT-2, GPT-Neo-125M,  OPT-125M on Lambada, OpenWebText, PTB, WikiText).- 35 per-task source CSVs and feature dictionary for full reproducibility.- Pre-computed deduplication, NaN handling, log-transforms, and CV  filtering applied; downstream layer aggregation runs at training  time via FeatureProcessor. Companion code: https://github.com/SigmaJahan/DEFaultplusplus-Transformer-Debugging Built from fine-tuning runs of 11,251 encoder + 4,282 decoder configurations covering ~13 fault categories and ~44 root causes (see feature_dictionary.csv for the complete catalog). Citation: see the DOI on this page.
提供机构:
Zenodo
创建时间:
2026-05-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作