five

HiTZ/Magpie-Llama-3.1-8B-Instruct-Filtered

收藏
Hugging Face2025-06-11 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/HiTZ/Magpie-Llama-3.1-8B-Instruct-Filtered
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是通过meta-llama/Llama-3.1-8B-Instruct模型和MAGPIE代码库生成的,包含了多种特征如对话ID、指令、响应、对话内容、生成模式、输入配置、意图、知识、难度、输入质量、任务类别等。数据集经过过滤,确保高质量和低重复性。系统提示用于不同场景如通用、代码、数学、算术和机器翻译。

Dataset generated using [meta-llama/Llama-3.1-8B-Instruc](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) with the [MAGPIE codebase](https://github.com/magpie-align/magpie). The dataset includes features such as conversation_id, instruction, response, conversations, gen_mode, gen_input_configs, intent, knowledge, difficulty, and more. The dataset is divided into a training set containing 2353894 samples. The filtering criteria include avoiding repetition and ensuring high-quality inputs. System prompts include general, code, math, arithmetic, and machine translation types.
提供机构:
HiTZ
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作