five

HiTZ/Magpie-Llama-3.1-8B-Instruct-Unfiltered

收藏
Hugging Face2025-06-11 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/HiTZ/Magpie-Llama-3.1-8B-Instruct-Unfiltered
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是使用meta-llama/Llama-3.1-8B-Instruct模型和MAGPIE代码库生成的,包含了多种特征如conversation_id、instruction、response等。数据集的训练分割包含3640000个样本,总大小为23402752764字节。此外,数据集还提供了不同任务类型的系统提示,如通用、代码、数学、算术和机器翻译。

This dataset is generated using the meta-llama/Llama-3.1-8B-Instruc model with the MAGPIE codebase. It includes multiple fields such as conversation_id, instruction, response, conversations, gen_mode, gen_input_configs, intent, knowledge, difficulty, and more. The dataset is primarily for training, containing 3,640,000 samples with a size of 23,402,752,764 bytes. The dataset configuration is named default, with data file paths at data/train-*. The dataset is licensed under apache-2.0, in English, and tagged as synthetic.
提供机构:
HiTZ
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作