HiTZ/Magpie-Llama-3.1-8B-Instruct-Unfiltered
Source: Hugging Face | Updated: 2025-06-11 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/HiTZ/Magpie-Llama-3.1-8B-Instruct-Unfiltered
Description:
This dataset was generated with the meta-llama/Llama-3.1-8B-Instruct model using the MAGPIE codebase. It includes fields such as conversation_id, instruction, response, conversations, gen_mode, gen_input_configs, intent, knowledge, difficulty, and more. The train split contains 3,640,000 samples with a total size of 23,402,752,764 bytes (about 23.4 GB). The dataset also provides system prompts for different task types: general, code, math, arithmetic, and machine translation. The dataset configuration is named default, with data files at data/train-*. The dataset is licensed under apache-2.0, is in English, and is tagged as synthetic.
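The figures in the description can be turned into a quick size check, and the train split can be read without downloading all of it. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable. The helper names `average_bytes_per_sample`, `stream_train_split`, and `peek` are hypothetical and chosen only for illustration; the repository ID, split name, sample count, byte size, and the `instruction` and `response` field names come from the description.

```python
from itertools import islice

# Values taken from the dataset description.
REPO_ID = "HiTZ/Magpie-Llama-3.1-8B-Instruct-Unfiltered"
NUM_SAMPLES = 3_640_000
TOTAL_BYTES = 23_402_752_764


def average_bytes_per_sample(total_bytes=TOTAL_BYTES, num_samples=NUM_SAMPLES):
    """Return the mean size of one sample in bytes."""
    return total_bytes / num_samples


def stream_train_split(repo_id=REPO_ID):
    """Open the train split in streaming mode (hypothetical helper).

    The import is done lazily so the size helper above works even when
    the `datasets` library is not installed.
    """
    from datasets import load_dataset

    # streaming=True iterates over records without downloading the full split.
    return load_dataset(repo_id, split="train", streaming=True)


def peek(n=3):
    """Print the instruction and response of the first n records."""
    for record in islice(stream_train_split(), n):
        print(record["instruction"])
        print(record["response"])
        print("-" * 40)


if __name__ == "__main__":
    print(f"Average sample size: {average_bytes_per_sample():.0f} bytes")
```

Streaming is used here because the full split is roughly 23.4 GB; calling `peek()` needs network access, while the size helper runs offline.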
Provider:
HiTZ



