HiTZ/Magpie-Llama-3.1-8B-Instruct-Filtered
Source: Hugging Face. Updated: 2025-06-11. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/HiTZ/Magpie-Llama-3.1-8B-Instruct-Filtered
Description:
This dataset was generated with [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) using the [MAGPIE codebase](https://github.com/magpie-align/magpie). Its features include conversation_id, instruction, response, conversations, gen_mode, gen_input_configs, intent, knowledge, difficulty, input quality, task category, and more. It provides a training split of 2,353,894 samples. The data was filtered to avoid repetition and to ensure high-quality inputs. The system prompts used for generation cover general, code, math, arithmetic, and machine translation scenarios.
Provider:
HiTZ
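
The features listed in the description can be used to select a subset of the data. The following is a minimal sketch, not an official loading recipe. It assumes the dataset is reachable through the Hugging Face `datasets` library under the ID from the download link, and that the "task category" feature is stored in a column named `task_category`; that column name, the helper names, and the values in `SAMPLE_ROWS` are illustrative assumptions, not taken from the dataset itself.

```python
from typing import Iterable, Iterator, Optional, Set

DATASET_ID = "HiTZ/Magpie-Llama-3.1-8B-Instruct-Filtered"


def stream_train_split():
    """Stream the train split from the Hub (needs `pip install datasets` and network access)."""
    # Imported lazily so the offline helpers below work without the library.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split="train", streaming=True)


def select_rows(
    rows: Iterable[dict],
    task_category: Optional[str] = None,
    difficulties: Optional[Set[str]] = None,
) -> Iterator[dict]:
    """Yield rows matching the given task category and difficulty labels.

    A filter left as None is not applied. The column names `task_category`
    and `difficulty` are assumptions based on the feature list.
    """
    for row in rows:
        if task_category is not None and row.get("task_category") != task_category:
            continue
        if difficulties is not None and row.get("difficulty") not in difficulties:
            continue
        yield row


def to_pairs(rows: Iterable[dict]) -> list:
    """Reduce rows to (instruction, response) tuples."""
    return [(row["instruction"], row["response"]) for row in rows]


# Hypothetical rows that mimic the described schema; the values are made up.
SAMPLE_ROWS = [
    {"conversation_id": "a1", "instruction": "What is 2 + 3?", "response": "5",
     "difficulty": "easy", "task_category": "Math"},
    {"conversation_id": "b2", "instruction": "Reverse a string in Python.",
     "response": "Use s[::-1].", "difficulty": "medium", "task_category": "Coding"},
]

if __name__ == "__main__":
    math_rows = select_rows(SAMPLE_ROWS, task_category="Math")
    print(to_pairs(math_rows))
```

To run the same filter on the real data, pass `stream_train_split()` to `select_rows` in place of `SAMPLE_ROWS`, after checking the actual column names and label values on the dataset page.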



