zhongshupeng/dataset_A100
收藏Hugging Face2023-10-27 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/zhongshupeng/dataset_A100
下载链接
链接失效反馈官方服务:
资源简介:
# Disclaimer:
this dataset is curated for NeurIPS 2023 LLM efficiency challange, and currently work in progress. Please use at your own risk.
# Data composition:
All data were derived from the training set portion of the open source dataset.
**Data Sources**:
-dolly: https://huggingface.co/datasets/databricks/databricks-dolly-15k
-cnn_dailymail: https://huggingface.co/datasets/cnn_dailymail
-mmlu: https://huggingface.co/datasets/cais/mmlu
-bbq: https://huggingface.co/datasets/tasksource/bigbench
-ScienceQA: https://huggingface.co/datasets/tasksource/ScienceQA_text_only
提供机构:
zhongshupeng
原始信息汇总
数据集概述
数据来源
- dolly: 来自 https://huggingface.co/datasets/databricks/databricks-dolly-15k
- cnn_dailymail: 来自 https://huggingface.co/datasets/cnn_dailymail
- mmlu: 来自 https://huggingface.co/datasets/cais/mmlu
- bbq: 来自 https://huggingface.co/datasets/tasksource/bigbench
- ScienceQA: 来自 https://huggingface.co/datasets/tasksource/ScienceQA_text_only
数据构成
所有数据均源自开源数据集的训练集部分。



