five

Human-aligned Large Language Models Evaluation for Network Automation

收藏
IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/human-aligned-large-language-models-evaluation-network-automation
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset accompanies the study in which the HALLE (Human-aligned Auto LLM evaluation) framework is applied to a use case involving explainable AI (XAI) outputs for optical network quality-of-transmission (QoT) estimation. It includes the prompt sets used to query the candidate large language models (LLMs), the explanations generated by each model, and the evaluation scores assigned by human experts and by the LLM judges. Human evaluators annotated 40 instances, and these annotations were used to compare all candidate LLM judges. The selected judge then evaluated a set of 100 instances to produce the final scoring used in the paper. The dataset also contains the trained XGBoost QoT prediction model and the SHAP data used to produce the XAI feature attributions. All files are provided to enable exact reproduction of the experiments reported in the associated publication.
提供机构:
Omran Ayoub; Kiarash Rezaei; Paolo Monti; Carlos Natalino
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作