ErikYip/LLM-Uncertainty-Bench

Name: ErikYip/LLM-Uncertainty-Bench
Creator: ErikYip
Published: 2024-01-25 08:26:07
License: No description available

Hugging Face | Updated 2024-01-25 | Indexed 2024-03-04

Download link:

https://hf-mirror.com/datasets/ErikYip/LLM-Uncertainty-Bench

Resource description:

Five datasets, each comprising 10,000 instances, used for uncertainty quantification in LLMs: 1. mmlu_10k is used for question answering. 2. cosmosqa_10k is used for reading comprehension. 3. hellaswag_10k is used for commonsense inference. 4. halu_dialogue is used for dialogue response selection. 5. halu_summarization is used for document summarization. For more details on how these datasets are used, see the GitHub repo: https://github.com/smartyfh/LLM-Uncertainty-Bench/tree/main

Provider:

ErikYip

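Summary of original information

Dataset overview

This dataset comprises five sub-datasets, each containing 10,000 instances, for uncertainty quantification in large language models (LLMs).

Sub-dataset details

mmlu_10k: question answering.
cosmosqa_10k: reading comprehension.
hellaswag_10k: commonsense inference.
halu_dialogue: dialogue response selection.
halu_summarization: document summarization.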
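
To make "uncertainty quantification" concrete, here is a minimal Python sketch. It is not the benchmark's own method (see the GitHub repo for that). It assumes, hypothetically, that each instance is posed as a choice among a fixed set of answer options and that a model returns one raw score (logit) per option. The SUBSETS mapping mirrors the list above; softmax, predictive_entropy, and the example scores are illustrative names and values, not part of the dataset.

```python
import math

# Sub-dataset names and tasks, copied from the listing above.
SUBSETS = {
    "mmlu_10k": "question answering",
    "cosmosqa_10k": "reading comprehension",
    "hellaswag_10k": "commonsense inference",
    "halu_dialogue": "dialogue response selection",
    "halu_summarization": "document summarization",
}


def softmax(logits):
    """Turn raw per-option scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predictive_entropy(probs):
    """Shannon entropy (in nats) of the option distribution.

    Higher entropy means the model spreads its probability over more
    options, i.e. it is less certain about its answer.
    """
    return -sum(p * math.log(p) for p in probs if p > 0)


# Hypothetical scores for a four-option question (assumed, not real data).
confident = softmax([8.0, 0.0, 0.0, 0.0])  # one option dominates
unsure = softmax([1.0, 1.0, 1.0, 1.0])     # all options equally likely

if __name__ == "__main__":
    for name, task in SUBSETS.items():
        print(f"{name}: {task}")
    print("entropy (confident):", predictive_entropy(confident))
    print("entropy (unsure):", predictive_entropy(unsure))
```

Entropy is only one generic uncertainty score. It is used here because it needs nothing beyond the per-option probabilities, which keeps the sketch independent of any particular model or library.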
