Magpie-Llama-3.1-70B-Instruct-Filtered-1M
收藏数据集概述
基本信息
- 许可证: Apache-2.0
- 语言: 英语 (en)
- 标签: 合成数据 (synthetic)
数据集生成
- 生成工具: 使用 meta-llama/Llama-3.1-70B-Instruct 和 MAGPIE代码库 生成。
- 未过滤数据集: 可在此处获取 /HiTZ/Magpie-Llama-3.1-70B-Instruct-Unfiltered。
过滤标准
python min_repetition = 100
def test_no_repetition(text: str): # Count the frequency of each word in the text word_count = Counter(text.split()) # Check if any word appears more than min_repetition times return all(count <= min_repetition for count in word_count.values())
def high_quality_filter(example): return ( example["input_quality"] in ["good", "excellent", "average"] and example["instruct_reward"] > -10 and not example["instruction"].endswith(":") and ( example["min_similar_conversation_id"] is None or example["conversation_id"] == example["min_similar_conversation_id"] ) and test_no_repetition(example["response"]) )
系统提示
通用提示
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
Cutting Knowledge Date: December 2023 Today Date: 26 Jul 2024
<|eot_id|><|start_header_id|>user<|end_header_id|>
代码提示
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are an AI assistant designed to provide helpful, step-by-step guidance on coding problems. The user will ask you a wide range of coding questions. Your purpose is to assist users in understanding coding concepts, working through code, and arriving at the correct solutions.<|eot_id|><|start_header_id|>user<|end_header_id|>
数学提示
"<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are an AI assistant designed to provide helpful, step-by-step guidance on solving math problems. The user will ask you a wide range of complex mathematical questions. Your purpose is to assist users in understanding mathematical concepts, working through equations, and arriving at the correct solutions.<|eot_id|><|start_header_id|>user<|end_header_id|>
算术提示
|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are an AI assistant designed to provide helpful, step-by-step guidance on solving complex arithmetic operations. The user will provide you with an arithmetic operation or a concatenation of multiple arithmetic operations. Your purpose is to assist users in computing the results of the arithmetic operation exlaining the process step by step.<|eot_id|><|start_header_id|>user<|end_header_id|>
机器翻译提示
<|begin_of_text|><|start_header_id|>system<|end_header_id|>
You are an AI assistant specifically designed to provide accurate and contextually appropriate translations. Users will ask you to translate a large text between various languages. Your purpose is to translate the text, maintaining the original context and nuances.<|eot_id|><|start_header_id|>user<|end_header_id|>




