Comprehensive model information and evaluation metrics.

NIAID Data Ecosystem, indexed 2026-05-02

Download link:

https://figshare.com/articles/dataset/Comprehensive_model_information_and_evaluation_metrics_/29122277


Description:

Summary of all analyzed LLMs, including their characteristics and performance metrics. The ‘open-source’ column indicates model accessibility (yes/no), ‘Size’ shows the number of parameters in billions where available, ‘Date’ indicates the model version’s release date, ‘Distance’ shows the Euclidean distance between the model’s and human AMCE values across nine moral preference categories, and ‘valid response rate’ represents the proportion of valid responses in the evaluation scenarios. (CSV)
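The two derived columns described above can be illustrated with a minimal sketch. It assumes each model's AMCE values and the human AMCE values are available as equal-length sequences of nine numbers. The function names `amce_distance` and `valid_response_rate` and all numbers in the usage lines are hypothetical and are not taken from the dataset.

```python
import math


def amce_distance(model_amce, human_amce):
    """Euclidean distance between two AMCE vectors of equal length."""
    if len(model_amce) != len(human_amce):
        raise ValueError("AMCE vectors must have the same number of categories")
    return math.sqrt(sum((m - h) ** 2 for m, h in zip(model_amce, human_amce)))


def valid_response_rate(n_valid, n_total):
    """Proportion of evaluation scenarios that produced a valid response."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    return n_valid / n_total


# Made-up values for nine categories, chosen only to make the arithmetic easy to check.
model = [3.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
human = [0.0, 4.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]

print(amce_distance(model, human))   # 5.0
print(valid_response_rate(3, 4))     # 0.75
```

A smaller distance means the model's preferences sit closer to the human values across the nine categories. How the dataset authors handled invalid responses before computing AMCE values is not stated here, so the sketch treats the two metrics independently.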


Created:

2025-05-21
