Isotonic/Cria-MultiDialogues
收藏Hugging Face2024-02-07 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Isotonic/Cria-MultiDialogues
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: text
dtype: string
splits:
- name: train
num_bytes: 595416313
num_examples: 253859
download_size: 206563103
dataset_size: 595416313
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
## Cria-SFT-v1 is a collection of the following datasets
- Datasets:
- [VMware/open-instruct](https://huggingface.co/datasets/VMware/open-instruct)
- [LDJnr/Capybara](https://huggingface.co/datasets/LDJnr/Capybara)
- [cognitivecomputations/ultrachat-uncensored](https://huggingface.co/datasets/cognitivecomputations/ultrachat-uncensored)
- [starfishmedical/webGPT_x_dolly](https://huggingface.co/datasets/starfishmedical/webGPT_x_dolly)
- [THUDM/webglm-qa](https://huggingface.co/datasets/THUDM/webglm-qa)
## Prompt Format
```
<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{user_message}<|im_end|>
<|im_start|>assistant
{assistant message}<|im_end|>
```
提供机构:
Isotonic
原始信息汇总
数据集信息
特征
- 名称: text
- 数据类型: string
数据分割
- 名称: train
- 字节数: 595416313
- 样本数: 253859
下载和数据大小
- 下载大小: 206563103
- 数据集大小: 595416313
配置
- 配置名称: default
- 数据文件:
- 分割: train
- 路径: data/train-*



