Reza8848/MUFFIN_68k
收藏数据集概述
数据格式
数据集以JSON格式提供,可以读取为Python列表。每个元素是一个包含输入文本的Python字典,该输入文本包含多个任务指令及其对应的输出。
示例格式如下: json [ { "input": "XXX", "instances": [ { "instruction": "III", "output": "YYY" }, { "instruction": "III", "output": "YYY" } ] }, { "input": "XXX", "instances": [ { "instruction": "III", "output": "YYY" } ] } ]
数据统计
数据集包含1,463个输入文本,每个输入文本平均包含约46.48个任务指令,总计68,014个训练实例。
引用
如果使用该数据集,请引用以下论文: bibtex @inproceedings{Lou2023MUFFIN, title={{MUFFIN}: Curating Multi-Faceted Instructions for Improving Instruction Following}, author={Renze Lou and Kai Zhang and Jian Xie and Yuxuan Sun and Janice Ahn and Hanzi Xu and Yu su and Wenpeng Yin}, booktitle={The Twelfth International Conference on Learning Representations}, year={2024}, url={https://openreview.net/forum?id=1vrS1zwekw} }



