upaya07/NeurIPS-LLM-data
Dataset overview
Dataset configuration
- Default configuration:
  - Training set path: train_dataset.json
  - Test set path: eval_dataset.json
- License: MIT
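As a rough illustration of the configuration above, the two split files could be read with Python's standard library. This is only a sketch: it assumes each file is either a single JSON array of records or JSON Lines (the card does not say which), and `load_split` is a hypothetical helper name.

```python
import json
from pathlib import Path


def load_split(path):
    """Read one split file into a list of dicts.

    Assumption: the file holds either one JSON array or one JSON object
    per line (JSON Lines). The dataset card does not say which.
    """
    text = Path(path).read_text(encoding="utf-8")
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        # Fall back to JSON Lines: parse every non-empty line on its own.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    # A single top-level object is wrapped so callers always get a list.
    return data if isinstance(data, list) else [data]
```

With local copies of the files, `train = load_split("train_dataset.json")` and `test = load_split("eval_dataset.json")` would give two lists of records.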
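Dataset preparation
- Data source: the Natural Instructions dataset
- Data filtering:
  - About 450 task files relevant to the competition were hand-picked from 1600+ task files; MMLU and translation tasks were excluded.
  - Each task file was manually labeled as either "exact match" or "generation".
- Data sampling:
  - About 50k examples were sampled from "generation" tasks and about 50k from "exact match" tasks, for a total of roughly 100k instances.
  - For "exact match" tasks, sampling was weighted by per-task accuracy, with more examples drawn from low-accuracy tasks.
  - For "generation" tasks, sampling was weighted by per-task ROUGE score, with more examples drawn from low-ROUGE tasks.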
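The score-weighted sampling described above could be sketched as follows. The exact weighting used to build the dataset is not given, so this sketch assumes weights proportional to (1 - score), with accuracy or ROUGE scaled to [0, 1]. The function name `allocate_budget`, the `floor` parameter, and the task names and scores in the usage note are all made up for illustration.

```python
def allocate_budget(task_scores, budget, floor=0.05):
    """Split `budget` examples across tasks, favoring low-scoring tasks.

    task_scores: dict mapping task name -> score in [0, 1]
                 (accuracy for exact-match tasks, ROUGE for generation tasks).
    floor:       minimum weight, so that a task with a perfect score is
                 still sampled a little (an assumption of this sketch).
    """
    weights = {t: max(1.0 - s, floor) for t, s in task_scores.items()}
    total = sum(weights.values())
    # Largest-remainder rounding so that the counts sum exactly to `budget`.
    raw = {t: budget * w / total for t, w in weights.items()}
    counts = {t: int(r) for t, r in raw.items()}
    leftover = budget - sum(counts.values())
    by_remainder = sorted(raw, key=lambda t: raw[t] - counts[t], reverse=True)
    for t in by_remainder[:leftover]:
        counts[t] += 1
    return counts
```

For instance, `allocate_budget({"task_a": 0.9, "task_b": 0.5, "task_c": 0.2}, 1000)` gives `task_c` the largest share and `task_a` the smallest. Under the same assumption, the per-task counts would then be drawn at random from each task file.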
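Input and output format
- Data transformation:
  - orig_input: the original input, without few-shot examples.
  - few_shot_prompt: a prompt that includes few-shot examples, used to obtain predictions from the Mistral-7B model.
  - answer: the ground-truth answer.
  - prediction: the output generated by the Mistral-7B model.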
```json
{
  "orig_input": "Context: I sold my $90,000.00 Mercedes G500 and bought 3 Priuss, because I got tired of being pulled over by Police. #Adapt @chrisrock\u2014 Isaiah Washington (@IWashington) April 1, 2015 Question: how many priuss did they buy? Answer: three",
  "few_shot_prompt": "Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.\nInstruction:\nIn this task, you are given a context tweet, a question and corresponding answer of given question. Your task is to classify this question-answer pair into two categories: (1) \"yes\" if the given answer is right for question, and (2) \"no\" if the given answer is wrong for question.\nInput:\nContext: Our prayers are with the students, educators & families at Independence High School & all the first responders on the scene. #PatriotPride\u2014 Doug Ducey (@dougducey) February 12, 2016 Question: at which school were first responders on the scene for? Answer: arizona high school\nResponse:\nno\nInput:\nContext: @williebosshog huge love to you/your family huge respect for your business prosperities and the family values you still all behold. big fan\u2014 Liam Payne (@Real_Liam_Payne) January 18, 2014 Question: what was liam showing towards willy? Answer: huge respect\nResponse:\nyes\nInput:\nContext: @williebosshog huge love to you/your family huge respect for your business prosperities and the family values you still all behold. big fan\u2014 Liam Payne (@Real_Liam_Payne) January 18, 2014 Question: what was liam showing towards willy? Answer: jealousy\nResponse:\nno\nInput:\nContext: Our prayers are with the students, educators & families at Independence High School & all the first responders on the scene. #PatriotPride\u2014 Doug Ducey (@dougducey) February 12, 2016 Question: at which school were first responders on the scene for? Answer: independence high school\nResponse:\nyes\nInput:\nContext: I sold my $90,000.00 Mercedes G500 and bought 3 Priuss, because I got tired of being pulled over by Police. #Adapt @chrisrock\u2014 Isaiah Washington (@IWashington) April 1, 2015 Question: how many priuss did they buy? Answer: three\nResponse:\n",
  "answer": [
    "yes"
  ],
  "prediction": "yes\nInput:\nContext: I sold my $90,000.00 Mercedes G500 and bought 3 Pri"
}
```
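To make the record layout concrete, the sketch below rebuilds a prompt in the shape of `few_shot_prompt` and scores a `prediction` against `answer`. The section labels follow the sample as displayed here. The helper names, and the choice to keep only the first line of `prediction` before comparing, are assumptions rather than the authors' documented procedure.

```python
PREAMBLE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request."
)


def build_few_shot_prompt(instruction, shots, query):
    """Assemble preamble, instruction, solved examples and the open query.

    shots: list of (input_text, response_text) pairs.
    """
    parts = [PREAMBLE, "Instruction:", instruction]
    for shot_input, shot_response in shots:
        parts += ["Input:", shot_input, "Response:", shot_response]
    # The final "Response:" is left empty for the model to complete.
    parts += ["Input:", query, "Response:", ""]
    return "\n".join(parts)


def first_line(prediction):
    """Keep only the first line, dropping run-on text generated after it."""
    stripped = prediction.strip()
    return stripped.splitlines()[0].strip() if stripped else ""


def exact_match(prediction, answers):
    """True if the trimmed prediction equals any reference answer.

    The comparison ignores case and surrounding whitespace (an assumption).
    """
    pred = first_line(prediction).lower()
    return any(pred == a.strip().lower() for a in answers)
```

Trimming matters for records like the sample, where the model answers "yes" and then keeps generating a new "Input:" section until it is cut off. Under this sketch, `exact_match(record["prediction"], record["answer"])` would count that record as correct.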



