Learning to summarize with human feedback
收藏DataCite Commons2026-01-07 更新2025-04-16 收录
下载链接:
https://service.tib.eu/ldmservice/dataset/7ce7e34c-b6b4-417a-88ae-e30050c5dceb
下载链接
链接失效反馈官方服务:
资源简介:
The paper presents a study on the impact of synthetic data on large language models (LLMs) and proposes a method to steer LLMs towards desirable non-differentiable attributes.
提供机构:
TIB
创建时间:
2024-12-16



