Data for: Evol-Preference : Evol-Preference: Automatic Evolution of Preference Data For Safety Alignment
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/ck364mhrvb
下载链接
链接失效反馈官方服务:
资源简介:
The PKU-Alignment team released the dataset "Beavertails", which focuses on AI safety. We have extended and optimized "Beavertails" to obtain this dataset. Readers can directly use our dataset to train large language models to enhance their usefulness and harmlessness.Training details: 70% for supervised fine-tuning(SFT), 30% direct preference optimization (DPO), training hyperparameters available in Appendix C of the paper.
创建时间:
2025-12-15



