nyunai/samvaad-hi-v1-chat-format
收藏Hugging Face2024-04-24 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/nyunai/samvaad-hi-v1-chat-format
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
- hi
dataset_info:
features:
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: text
dtype: string
splits:
- name: train
num_bytes: 932351683
num_examples: 101476
download_size: 403328278
dataset_size: 932351683
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
This dataset includes two languages: English and Hindi. The main features of the dataset are messages and text. messages is a list containing content and role fields, both of which are string types. text is also a string type. The dataset is divided into a training set with 101476 samples, totaling 932351683 bytes. The download size of the dataset is 403328278 bytes, and the total size of the dataset is 932351683 bytes. The dataset configuration is named default, and the training data files are located at data/train-* path.
提供机构:
nyunai
原始信息汇总
数据集概述
语言
- 英语 (en)
- 印地语 (hi)
数据集信息
特征
- messages
- content: 数据类型为字符串 (string)
- role: 数据类型为字符串 (string)
- text: 数据类型为字符串 (string)
分割
- train
- 字节数: 932351683
- 样本数: 101476
数据集大小
- 下载大小: 403328278 字节
- 数据集大小: 932351683 字节
配置
- default
- 数据文件:
- 分割: train
- 路径: data/train-*
- 数据文件:



