infCapital/WizardLM_Orca_vi

Name: infCapital/WizardLM_Orca_vi
Creator: infCapital
Published: 2023-09-28 07:09:56
License: 暂无描述

Hugging Face2023-09-28 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/infCapital/WizardLM_Orca_vi

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit dataset_info: features: - name: instruction dtype: string - name: input dtype: string - name: output dtype: string splits: - name: train num_bytes: 140945974 num_examples: 52507 download_size: 58938956 dataset_size: 140945974 configs: - config_name: default data_files: - split: train path: data/train-* --- Explain tuned WizardLM dataset ~55K created using approaches from Orca Research Paper. We leverage all of the 15 system instructions provided in Orca Research Paper. to generate custom datasets, in contrast to vanilla instruction tuning approaches used by original datasets. This helps student models like orca_mini_13b to learn thought process from teacher model, which is ChatGPT (gpt-3.5-turbo version).

提供机构：

infCapital

原始信息汇总

数据集概述

数据集信息

特征:
- instruction: 数据类型为字符串。
- input: 数据类型为字符串。
- output: 数据类型为字符串。
分割:
- train: 包含140945974字节，52507个样本。
大小:
- 下载大小: 58938956字节。
- 数据集大小: 140945974字节。

配置

默认配置:
- 数据文件:
  - train: 路径为data/train-*。

5,000+

优质数据集

54 个

任务类型

进入经典数据集