OEvortex/vortex-mini
Source: Hugging Face. Updated 2024-02-27; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/OEvortex/vortex-mini
Resource overview:
---
language:
- en
- pt
- hi
- te
- mr
license: other
license_name: hsul
license_link: https://huggingface.co/OEvortex/vortex-3b/raw/main/LICENSE.md
size_categories:
- 10K<n<100K
task_categories:
- text-generation
tags:
- alpaca
dataset_info:
  features:
  - name: output
    dtype: string
  - name: instruction
    dtype: string
  - name: input
    dtype: string
  splits:
  - name: train
    num_bytes: 815756970
    num_examples: 989990
  download_size: 498317527
  dataset_size: 815756970
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
This dataset covers five languages (English, Portuguese, Hindi, Telugu, and Marathi) and is intended primarily for text generation. Its single train split contains 989,990 examples totaling 815,756,970 bytes; note that the card's size_categories tag (10K<n<100K) understates this count. Each example has three string fields: output, instruction, and input.
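The `alpaca` tag and the three string fields suggest Alpaca-style records. The following is a minimal sketch of rendering one record as a prompt/response string. It assumes a commonly used Alpaca-like layout; the template wording, the `format_record` helper, and the sample record are illustrative and do not come from the dataset card.

```python
def format_record(record: dict) -> str:
    """Render one record as a single prompt/response string.

    Assumption: an Alpaca-like layout with section headers. The dataset
    card does not specify any template, so adjust this to your trainer.
    """
    instruction = record["instruction"].strip()
    context = (record.get("input") or "").strip()  # "input" may be empty
    output = record["output"].strip()

    parts = [f"### Instruction:\n{instruction}"]
    if context:
        # Only emit the Input section when there is extra context.
        parts.append(f"### Input:\n{context}")
    parts.append(f"### Response:\n{output}")
    return "\n\n".join(parts)


# Hypothetical record, made up for illustration (not taken from the dataset).
example = {
    "instruction": "Translate to Portuguese.",
    "input": "Good morning",
    "output": "Bom dia",
}
print(format_record(example))
```

Skipping the Input section for empty `input` values keeps prompts compact for instruction-only records.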
Provider:
OEvortex
Summary of original information
Dataset overview
Languages
- English (en)
- Portuguese (pt)
- Hindi (hi)
- Telugu (te)
- Marathi (mr)
License
- License type: other
- License name: hsul
- License link: https://huggingface.co/OEvortex/vortex-3b/raw/main/LICENSE.md
Dataset size
- Size range: 10K<n<100K
Task categories
- Text generation
Tags
- alpaca
Dataset information
Features
- output: string
- instruction: string
- input: string
Splits
- train
  - Bytes: 815756970
  - Examples: 989990
Download and dataset size
- Download size: 498317527
- Dataset size: 815756970
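As a rough sanity check on the byte counts listed here, the sketch below converts them to MiB and derives the average example size and the download-to-dataset ratio. The constants are copied from the card; the `to_mib` helper is a hypothetical name introduced only for this illustration.

```python
# Figures copied from the dataset card metadata.
DATASET_BYTES = 815_756_970   # dataset_size (same as train num_bytes)
DOWNLOAD_BYTES = 498_317_527  # download_size
NUM_EXAMPLES = 989_990        # train num_examples


def to_mib(n_bytes: int) -> float:
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return n_bytes / (1024 ** 2)


avg_bytes_per_example = DATASET_BYTES / NUM_EXAMPLES
download_ratio = DOWNLOAD_BYTES / DATASET_BYTES

print(f"dataset size:   {to_mib(DATASET_BYTES):.1f} MiB")
print(f"download size:  {to_mib(DOWNLOAD_BYTES):.1f} MiB")
print(f"avg example:    {avg_bytes_per_example:.0f} bytes")
print(f"download ratio: {download_ratio:.2f}")
```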
Configs
- Config name: default
- Data files:
  - Split: train
  - Path: data/train-*