jqi/alpaca_nemo
收藏Hugging Face2024-04-11 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/jqi/alpaca_nemo
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- text-generation
language:
- en
tags:
- SFT
size_categories:
- 10K<n<100K
---
Download alpaca in NeMo SFT Chat format
```
git lfs install
git clone https://huggingface.co/datasets/jqi/alpaca_nemo
```
Then you can find data file: `alpaca_nemo/alpaca_nemo.jsonl`
This file is about 32 MB in size.
To use it in NeMo, set the config:
```
data:
chat: True
chat_prompt_tokens:
system_turn_start: '<extra_id_0>'
turn_start: '<extra_id_1>'
label_start: '<extra_id_2>'
end_of_turn: "\x0A"
end_of_name: "\x0A"
train_ds:
file_names: [ 'alpaca_nemo/alpaca_nemo.jsonl' ]
```
提供机构:
jqi
原始信息汇总
数据集概述
基本信息
- 任务类别:text-generation
- 语言:en
- 标签:SFT
- 大小范围:10K<n<100K
数据文件
- 文件名:
alpaca_nemo/alpaca_nemo.jsonl - 大小:约32 MB
使用配置
- 配置设置:
- chat: True
- chat_prompt_tokens:
- system_turn_start: <extra_id_0>
- turn_start: <extra_id_1>
- label_start: <extra_id_2>
- end_of_turn: "x0A"
- end_of_name: "x0A"
- train_ds:
- file_names: [ alpaca_nemo/alpaca_nemo.jsonl ]



