Rardilit/Panther-dataset_v1

Name: Rardilit/Panther-dataset_v1
Creator: Rardilit
Published: 2023-05-29 11:18:55
License: 暂无描述

Hugging Face2023-05-29 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/Rardilit/Panther-dataset_v1

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: other task_categories: - text-generation - conversational - question-answering - text2text-generation language: - en tags: - text generation - panther pretty_name: Panther size_categories: - 100K<n<1M --- # Dataset Details This dataset is a modified version of [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf) This dataset is used in fine tuning [Panther](https://huggingface.co/Rardilit/Panther_v1) - an state of the art LLM funtuned on llama-7b pretrained model. A very small portion i.e. 5.3% of prompts and responses were taken from this dataset to finetune and train [Panther](https://huggingface.co/Rardilit/Panther_v1) ## Dataset Details ### Dataset Structure ### Train Train rows : 377k ### Validation Validation rows : 20.3k ### Dataset Format ```python input : "prompt" output : "response" ``` ## How to Use ```python from datasets import load_dataset dataset = load_dataset("Rardilit/Panther-dataset_v1") ```

提供机构：

Rardilit

原始信息汇总

数据集概述

基本信息

许可证: other
任务类别:
- text-generation
- conversational
- question-answering
- text2text-generation
语言: en
标签:
- text generation
- panther
数据集名称: Panther
数据集大小: 100K<n<1M

数据集详情

数据来源

该数据集是Anthropic/hh-rlhf的修改版本。

数据用途

用于微调Panther，这是一个基于llama-7b预训练模型的先进LLM。
数据集中5.3%的提示和响应被用于微调和训练Panther。

数据集结构

训练集: 377k行
验证集: 20.3k行

数据集格式

python input : "prompt" output : "response"

如何使用

python from datasets import load_dataset

dataset = load_dataset("Rardilit/Panther-dataset_v1")

5,000+

优质数据集

54 个

任务类型

进入经典数据集