zhongshupeng/dataset_4090_3
Source: Hugging Face | Updated: 2023-10-27 | Indexed: 2024-03-04
Download link:
https://hf-mirror.com/datasets/zhongshupeng/dataset_4090_3
Resource description:
# Disclaimer:
This dataset was curated for the NeurIPS 2023 LLM Efficiency Challenge and is currently a work in progress. Please use at your own risk.
# Data composition:
All data were derived from the training splits of the open-source datasets listed below.
**gsm2k_dolly12k_cnnadd4k_mmlulog1.7w_bbqabc8k.json**:
- gsm8k_2000: https://huggingface.co/datasets/gsm8k
- dolly_12000: https://huggingface.co/datasets/databricks/databricks-dolly-15k
- cnn_dailymail_4000: https://huggingface.co/datasets/cnn_dailymail
- mmlu_17000: https://huggingface.co/datasets/cais/mmlu
- bbq_8000: https://huggingface.co/datasets/tasksource/bigbench
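
As a quick sanity check on the mixture, the nominal counts can be tallied. This is a minimal sketch, assuming the numeric suffix on each component label (e.g. `gsm8k_2000`) is a sample count; the card does not state this explicitly, and the JSON file itself has not been inspected here. The dictionary keys and the `mixture_shares` helper are illustrative names, not part of the dataset.

```python
# Nominal sample counts, read off the component labels in the list above.
# Assumption: each numeric suffix is the number of samples drawn from that source.
composition = {
    "gsm8k": 2000,
    "dolly": 12000,
    "cnn_dailymail": 4000,
    "mmlu": 17000,
    "bbq": 8000,
}


def mixture_shares(counts):
    """Return each component's fraction of the total sample count."""
    n = sum(counts.values())
    return {name: c / n for name, c in counts.items()}


total = sum(composition.values())
shares = mixture_shares(composition)

# Print one row per component, then the overall total.
for name, share in shares.items():
    print(f"{name:>14}: {composition[name]:>6} samples ({share:.1%})")
print(f"{'total':>14}: {total:>6} samples")
```

Under that assumption the file would hold 43,000 samples in total, with the MMLU and Dolly portions together making up roughly two thirds of the mixture.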
Provided by:
zhongshupeng
Summary of original information
Dataset overview
Data sources
All data are derived from the training splits of open-source datasets.
Data file
gsm2k_dolly12k_cnnadd4k_mmlulog1.7w_bbqabc8k.json:
- gsm8k_2000: from gsm8k
- dolly_12000: from databricks/databricks-dolly-15k
- cnn_dailymail_4000: from cnn_dailymail
- mmlu_17000: from cais/mmlu
- bbq_8000: from tasksource/bigbench



