WaltonFuture/InstructionGPT-4

Name: WaltonFuture/InstructionGPT-4
Creator: WaltonFuture
Published: 2023-10-09 08:14:18
License: 暂无描述

Hugging Face2023-10-09 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/WaltonFuture/InstructionGPT-4

下载链接

链接失效反馈

官方服务：

资源简介：

--- task_categories: - visual-question-answering size_categories: - n<1K --- # InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4 [Lai Wei](https://waltonfuture.github.io/), Zihao Jiang, [Weiran Huang](https://www.weiranhuang.com/), [Lichao Sun](https://lichao-sun.github.io/). **Shanghai Jiao Tong University, Lehigh University** [Paper](https://arxiv.org/abs/2308.12067), [Link](https://mp.weixin.qq.com/s/s4Acec71v5oMlFkyhlCL_g), [Code](https://github.com/waltonfuture/InstructionGPT-4) ## Introduction Multimodal large language models acquire their instruction-following capabilities through a two-stage training process: pre-training on image-text pairs and fine-tuning on supervised vision-language instruction data. Recent studies have shown that large language models can achieve satisfactory results even with a limited amount of high-quality instruction-following data. In this paper, we introduce InstructionGPT-4, which is fine-tuned on a small dataset comprising only 200 examples, amounting to approximately 6% of the instruction-following data used in the alignment dataset for MiniGPT-4. We first propose several metrics to access the quality of multimodal instruction data. Based on these metrics, we present a simple and effective data selector to automatically identify and filter low-quality vision-language data. By employing this method, InstructionGPT-4 outperforms the original MiniGPT-4 on various evaluations (e.g., visual question answering, GPT-4 preference). Overall, our findings demonstrate that less but high-quality instruction tuning data is efficient to enable multimodal large language models to generate better output. ## Usage You can download our vision-language dataset containing only 200 high-quality examples and replace the original cc_sbu_align dataset used in the fine-tuning stage of MiniGPT-4. The training settings are the same as [MiniGPT-4](https://github.com/Vision-CAIR/MiniGPT-4). If you're using InstructionGPT-4 in your research or applications, please cite using this BibTeX: ```bibtex @article{wei2023instructiongpt, title={InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4}, author={Wei, Lai and Jiang, Zihao and Huang, Weiran and Sun, Lichao}, journal={arXiv preprint arXiv:2308.12067}, year={2023} } ```

提供机构：

WaltonFuture

原始信息汇总

InstructionGPT-4 数据集概述

基本信息

任务类别: 视觉问答 (visual-question-answering)
数据集规模: 小于1000条 (n<1K)

数据集介绍

名称: InstructionGPT-4
来源: 上海交通大学, 理海大学
作者: Lai Wei, Zihao Jiang, Weiran Huang, Lichao Sun
相关链接: 论文, 代码

数据集详情

数据集大小: 仅包含200个高质量示例
用途: 用于替换MiniGPT-4微调阶段的原始cc_sbu_align数据集
训练设置: 与MiniGPT-4相同

引用信息

bibtex @article{wei2023instructiongpt, title={InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4}, author={Wei, Lai and Jiang, Zihao and Huang, Weiran and Sun, Lichao}, journal={arXiv preprint arXiv:2308.12067}, year={2023} }

5,000+

优质数据集

54 个

任务类型

进入经典数据集