lamhieu/sharegpt_dialogue_base
收藏Hugging Face2024-05-17 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/lamhieu/sharegpt_dialogue_base
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
splits:
- name: train
num_bytes: 847547561
num_examples: 111912
download_size: 383263271
dataset_size: 847547561
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
license: mit
task_categories:
- text-generation
- text2text-generation
language:
- en
- vi
- zh
- es
- pt
- ja
- ko
size_categories:
- 100K<n<1M
---
## Description
The dataset is from unknown, formatted as dialogues for speed and ease of use. Many thanks to author for releasing it.
Importantly, this format is easy to use via the default chat template of `transformers`, meaning you can use [huggingface/alignment-handbook](https://github.com/huggingface/alignment-handbook) immediately, [unsloth](https://github.com/unslothai/unsloth).
## Structure
*View online through viewer.*
## Note
We advise you to reconsider before use, thank you. If you find it useful, please like and follow this account.
## Reference
The **Ghost X** was developed with the goal of researching and developing artificial intelligence useful to humans.
- HuggingFace: [ghost-x](https://huggingface.co/ghost-x)
- Github: [ghost-x-ai](https://github.com/ghost-x-ai)
- X / Twitter: [ghostx_ai](https://twitter.com/ghostx_ai)
- Website: [ghost-x.org](https://ghost-x.org/)
### ~
提供机构:
lamhieu
原始信息汇总
数据集概述
数据集信息
-
特征(Features):
- messages:
- content: 数据类型为字符串
- role: 数据类型为字符串
- messages:
-
分割(Splits):
- train:
- num_bytes: 847547561
- num_examples: 111912
- train:
-
下载大小(Download Size): 383263271
-
数据集大小(Dataset Size): 847547561
配置(Configs)
- config_name: default
- data_files:
- split: train
- path: data/train-*
许可证(License): MIT
任务类别(Task Categories):
- text-generation
- text2text-generation
语言(Language):
- en
- vi
- zh
- es
- pt
- ja
- ko
大小类别(Size Categories):
- 100K<n<1M



