
PKU-SafeRLHF-prompt

ModelScope community · updated 2025-12-05 · indexed 2025-02-08
Download link:
https://modelscope.cn/datasets/PKU-Alignment/PKU-SafeRLHF-prompt
Description:
# Dataset Card for PKU-SafeRLHF-prompt

This dataset contains 44.6K unique prompts from [PKU-SafeRLHF](https://huggingface.co/datasets/PKU-Alignment/PKU-SafeRLHF). 22.4% of the prompts in this dataset come from the sibling project [BeaverTails](https://huggingface.co/datasets/PKU-Alignment/BeaverTails). Additionally, we performed SFT on Llama3-70B using the Alpaca 52K dataset, resulting in Alpaca3-70B. 63.6% and 14.0% of our dataset is generated by Alpaca3-70B and WizardLM-30B-Uncensored, respectively, under the guidance of experts.

Here is the generation pipeline:

![prompts_generation_pipeline](prompts_generation_pipeline.png)

## Usage

To load our dataset, use the `load_dataset()` function as follows:

```python
from datasets import load_dataset

dataset = load_dataset("PKU-Alignment/PKU-SafeRLHF-prompt")
```
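
As a rough illustration of the composition described in the card, the stated percentages can be turned into approximate per-source prompt counts. This is only a sketch: it assumes the rounded total of 44,600 prompts (the card says "44.6K"), so the counts are estimates rather than exact figures, and the names `TOTAL_PROMPTS`, `SOURCE_SHARES`, and `approximate_counts` are hypothetical helpers, not part of the dataset or the `datasets` library.

```python
# Assumption: "44.6K" is treated as exactly 44,600; the true count may differ slightly.
TOTAL_PROMPTS = 44_600

# Percentages as stated in the dataset card.
SOURCE_SHARES = {
    "BeaverTails": 22.4,
    "Alpaca3-70B": 63.6,
    "WizardLM-30B-Uncensored": 14.0,
}


def approximate_counts(total, shares):
    """Convert percentage shares into rounded, approximate prompt counts."""
    return {name: round(total * pct / 100) for name, pct in shares.items()}


counts = approximate_counts(TOTAL_PROMPTS, SOURCE_SHARES)
total_share = sum(SOURCE_SHARES.values())

for name, n in counts.items():
    print(f"{name}: ~{n:,} prompts")
print(f"Shares sum to {total_share:.1f}%")
```

The three shares account for the whole dataset, which is consistent with the card listing exactly these three prompt sources.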

Provider:
maas
Created:
2025-02-07