Safety4M
Source: arXiv (indexed 2025-09-30)
Download link:
https://github.com/q-rz/saffron
Description:
This dataset is a token-level safety reward dataset for large language models (LLMs), intended to accelerate future research on LLM safety. It is released alongside the Saffron-1 model, with the primary objective of supporting safety assurance for LLMs.
Provided by:
Authors of the paper



