PKU-SafeRLHF-prompt
收藏魔搭社区2025-12-05 更新2025-02-08 收录
下载链接:
https://modelscope.cn/datasets/PKU-Alignment/PKU-SafeRLHF-prompt
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for PKU-SafeRLHF-prompt
This dataset contains 44.6K unique prompts from [PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF). 22.4% of the prompts in this dataset come from the sibling project [BeaverTails](https://huggingface.co/datasets/PKU-Alignment/BeaverTails). Additionally, we performed SFT on Llama3-70B using the Alpaca 52K dataset, resulting in Alpaca3-70B. 63.6% and 14.0% of our dataset is generated by Alpaca3-70B and WizardLM-30B-Uncensored, respectively, under the guidance of experts.
Here is the generation pipeline:

## Usage
To load our dataset, use the `load_dataset()` function as follows:
```python
from datasets import load_dataset
dataset = load_dataset("PKU-Alignment/PKU-SafeRLHF-prompt")
```
# PKU-SafeRLHF-prompt 数据集卡片
本数据集包含源自[PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF)的4.46万个独特提示词。其中22.4%的提示词来自姊妹项目[BeaverTails](https://huggingface.co/datasets/PKU-Alignment/BeaverTails)。此外,我们使用Alpaca 52K数据集对Llama3-70B执行监督微调(Supervised Fine-Tuning,SFT),得到Alpaca3-70B;本数据集63.6%与14.0%的提示词分别由Alpaca3-70B与WizardLM-30B-Uncensored在专家指导下生成。
以下为提示词生成流水线:

## 使用方法
如需加载本数据集,请使用`load_dataset()`函数,示例代码如下:
python
from datasets import load_dataset
dataset = load_dataset("PKU-Alignment/PKU-SafeRLHF-prompt")
提供机构:
maas
创建时间:
2025-02-07



