Dense Reward for Free in RLHF
收藏DataCite Commons2024-12-16 更新2025-04-16 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/e452cf03-7749-49f0-97a4-9c8d4a502117
下载链接
链接失效反馈官方服务:
资源简介:
The dataset used in the paper is not explicitly described, but it is mentioned that it is a preference dataset for language models.
提供机构:
TIB
创建时间:
2024-12-16



