hdeldar/Persian-Text-llama2-1k

Name: hdeldar/Persian-Text-llama2-1k
Creator: hdeldar
Published: 2023-09-17 14:53:05
License: 暂无描述

Hugging Face2023-09-17 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/hdeldar/Persian-Text-llama2-1k

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: text dtype: string splits: - name: train num_bytes: 1830325 num_examples: 1000 download_size: 1841325 dataset_size: 1830325 dataset_name: json configs: - config_name: default data_files: - split: train path: data/data-* --- # Persian-Text-QA: Lazy Llama 2 Formatting This is a subset (1k samples) of the [`SeyedAli/Persian-Text-QA`](https://huggingface.co/datasets/SeyedAli/Persian-Text-QA) dataset, processed to match Llama 2's prompt format as described [in this article](https://huggingface.co/blog/llama2#how-to-prompt-llama-2). It was created using the following [colab notebook](https://colab.research.google.com/drive/1Ad7a9zMmkxuXTOh1Z7-rNSICA4dybpM2?usp=sharing). Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for [this article](https://mlabonne.github.io/blog/posts/Fine_Tune_Your_Own_Llama_2_Model_in_a_Colab_Notebook.html) about fine-tuning a Llama 2 (chat) model in a Google Colab.

提供机构：

hdeldar

原始信息汇总

数据集概述

数据集信息

特征:
- 名称: text
- 数据类型: string
分割:
- 名称: train
- 字节数: 1830325
- 样本数: 1000
下载大小: 1841325
数据集大小: 1830325
数据集名称: json

配置

配置名称: default
数据文件:
- 分割: train
- 路径: data/data-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集