five

llm-wizard/alpaca-gpt4-data

收藏
Hugging Face2023-04-07 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/llm-wizard/alpaca-gpt4-data
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: instruction dtype: string - name: input dtype: string - name: output dtype: string splits: - name: train num_bytes: 40178951 num_examples: 52002 download_size: 24027484 dataset_size: 40178951 license: cc-by-4.0 language: - en pretty_name: Instruction Tuning with GPT-4 size_categories: - 10K<n<100K task_categories: - text-generation tags: - gpt - alpaca - fine-tune - instruct-tune - instruction --- # Dataset Description - **Project Page:** https://instruction-tuning-with-gpt-4.github.io - **Repo:** https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM - **Paper:** https://arxiv.org/abs/2304.03277 # Dataset Card for "alpaca-gpt4-data" All of the work is done by [this team](https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM). # Usage and License Notices The data is intended and licensed for research use only. The dataset is CC BY NC 4.0 (allowing only non-commercial use) and models trained using the dataset should not be used outside of research purposes. # Chinese Dataset [Found here](https://huggingface.co/datasets/c-s-ale/alpaca-gpt4-data-zh) # Citation ``` @article{peng2023gpt4llm, title={Instruction Tuning with GPT-4}, author={Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley, Jianfeng Gao}, journal={arXiv preprint arXiv:2304.03277}, year={2023} } ```
提供机构:
llm-wizard
原始信息汇总

数据集概述

基本信息

  • 数据集名称: Instruction Tuning with GPT-4
  • 数据集大小: 40178951字节
  • 下载大小: 24027484字节
  • 语言: 英语 (en)
  • 许可证: CC-BY-4.0

数据集特征

  • 特征名称: instruction, input, output
  • 数据类型: 字符串

数据集划分

  • 训练集:
    • 示例数量: 52002
    • 字节数: 40178951

数据集类别

  • 大小类别: 10K<n<100K
  • 任务类别: 文本生成 (text-generation)

标签

  • gpt
  • alpaca
  • fine-tune
  • instruct-tune
  • instruction
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作