Human-aligned Large Language Models Evaluation for Network Automation

Name: Human-aligned Large Language Models Evaluation for Network Automation
Creator: Omran Ayoub; Kiarash Rezaei; Paolo Monti; Carlos Natalino
License: 暂无描述

IEEE2026-04-17 收录

下载链接：

https://ieee-dataport.org/documents/human-aligned-large-language-models-evaluation-network-automation

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset accompanies the study in which the HALLE (Human-aligned Auto LLM evaluation) framework is applied to a use case involving explainable AI (XAI) outputs for optical network quality-of-transmission (QoT) estimation. It includes the prompt sets used to query the candidate large language models (LLMs), the explanations generated by each model, and the evaluation scores assigned by human experts and by the LLM judges. Human evaluators annotated 40 instances, and these annotations were used to compare all candidate LLM judges. The selected judge then evaluated a set of 100 instances to produce the final scoring used in the paper. The dataset also contains the trained XGBoost QoT prediction model and the SHAP data used to produce the XAI feature attributions. All files are provided to enable exact reproduction of the experiments reported in the associated publication.

提供机构：

Omran Ayoub; Kiarash Rezaei; Paolo Monti; Carlos Natalino

5,000+

优质数据集

54 个

任务类型

进入经典数据集