Magpie-Align/Magpie-Gemma2-Pro-200K-Filtered

Name: Magpie-Align/Magpie-Gemma2-Pro-200K-Filtered
Creator: Magpie-Align
Published: 2024-07-22 01:24:04
License: No description available

Hugging Face | Updated 2024-07-22 | Indexed 2024-07-22

Download link:

https://hf-mirror.com/datasets/Magpie-Align/Magpie-Gemma2-Pro-200K-Filtered

Resource overview:

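This dataset was generated with the Magpie method using the Gemma-2-27b-it model and contains instruction-response conversation data. Its features include input length, output length, task category, input quality, input difficulty, minimum neighbor distance, safety, reward, and language. The dataset was further filtered to ensure concise instructions and to select by response length.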

The dataset was generated by the Gemma-2-27b-it model using the Magpie method and contains 200,000 training entries. Its features include UUID, model name, generation input configuration, instruction, response, conversations, and task category, among others. The data was filtered to keep concise, safe instructions, and the 200,000 entries with the longest responses were then selected. The dataset is used for training and evaluating models, especially for task alignment and preference optimization.
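The selection step described above (keep concise, safe instructions, then take the entries with the longest responses) can be sketched as follows. This is a minimal illustration, not the creators' actual pipeline: the function name `select_longest_responses`, the `max_instruction_length` threshold, and the assumption that a safe entry has `llama_guard_2` equal to "safe" are all hypothetical. Only the field names come from the feature list on this page.

```python
from typing import Any, Dict, Iterable, List

Record = Dict[str, Any]


def select_longest_responses(
    records: Iterable[Record],
    keep: int = 200_000,
    max_instruction_length: int = 500,  # hypothetical threshold, not from the source
    safe_label: str = "safe",           # assumed value of the llama_guard_2 field
) -> List[Record]:
    """Keep safe, concise instructions, then return the `keep` longest responses."""
    candidates = [
        r for r in records
        if r["llama_guard_2"] == safe_label
        and r["instruction_length"] <= max_instruction_length
    ]
    # Longest responses first; ties keep their original relative order.
    candidates.sort(key=lambda r: r["response_length"], reverse=True)
    return candidates[:keep]
```

With `keep=2`, for example, the function returns the two surviving records with the largest `response_length`, after dropping any record flagged unsafe or exceeding the instruction-length limit.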

Provider:

Magpie-Align

Summary of original information

Dataset overview

Dataset information

Features

uuid: string
model: string
gen_input_configs: struct
- temperature: float
- top_p: float
- input_generator: string
- seed: null
- pre_query_template: string
instruction: string
response: string
conversations: list
- from: string
- value: string
task_category: string
other_task_category: sequence
task_category_generator: string
difficulty: string
intent: string
knowledge: string
difficulty_generator: string
input_quality: string
quality_explanation: string
quality_generator: string
llama_guard_2: string
reward_model: string
instruct_reward: float
min_neighbor_distance: float
repeat_count: integer
min_similar_uuid: string
instruction_length: integer
response_length: integer
language: string
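The feature list above can be turned into a lightweight record check. The sketch below assumes each entry is loaded as a plain Python dict, with struct fields as dicts, list and sequence fields as lists, and the null-typed `seed` as `None`; that mapping, and the helper names `schema_errors` and `_check`, are illustrative assumptions rather than part of the dataset or any library.

```python
from typing import Any, Dict, List

NoneType = type(None)

# Nested struct: gen_input_configs
GEN_INPUT_CONFIGS = {
    "temperature": float, "top_p": float, "input_generator": str,
    "seed": NoneType, "pre_query_template": str,
}
# Each element of the conversations list
TURN = {"from": str, "value": str}
# Top-level fields, following the feature list
TOP_LEVEL = {
    "uuid": str, "model": str, "gen_input_configs": dict,
    "instruction": str, "response": str, "conversations": list,
    "task_category": str, "other_task_category": list,
    "task_category_generator": str, "difficulty": str, "intent": str,
    "knowledge": str, "difficulty_generator": str, "input_quality": str,
    "quality_explanation": str, "quality_generator": str,
    "llama_guard_2": str, "reward_model": str, "instruct_reward": float,
    "min_neighbor_distance": float, "repeat_count": int,
    "min_similar_uuid": str, "instruction_length": int,
    "response_length": int, "language": str,
}


def _check(obj: Dict[str, Any], spec: Dict[str, type], prefix: str = "") -> List[str]:
    """Compare one dict against a {field: type} spec and describe mismatches."""
    errors = []
    for name, expected in spec.items():
        if name not in obj:
            errors.append(f"{prefix}{name}: missing")
        elif not isinstance(obj[name], expected):
            errors.append(
                f"{prefix}{name}: expected {expected.__name__}, "
                f"got {type(obj[name]).__name__}"
            )
    return errors


def schema_errors(record: Dict[str, Any]) -> List[str]:
    """Return a list of schema problems; an empty list means the record matches."""
    errors = _check(record, TOP_LEVEL)
    configs = record.get("gen_input_configs")
    if isinstance(configs, dict):
        errors += _check(configs, GEN_INPUT_CONFIGS, "gen_input_configs.")
    turns = record.get("conversations")
    if isinstance(turns, list):
        for i, turn in enumerate(turns):
            if isinstance(turn, dict):
                errors += _check(turn, TURN, f"conversations[{i}].")
            else:
                errors.append(f"conversations[{i}]: expected dict")
    return errors
```

The check is deliberately strict: an integer stored in a float field would be reported, so loosen the type table if the loader in use represents values differently.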