TaiGary/base_model_fine_tune_data_ultrachat_2k
Source: Hugging Face. Updated: 2024-07-02. Indexed: 2024-07-06.
Download link:
https://hf-mirror.com/datasets/TaiGary/base_model_fine_tune_data_ultrachat_2k
Description:
This dataset was used to fine-tune base models into reference models in the CleanGen paper. It contains 1,800 conversations from UltraChat and 200 samples from HH-RLHF. For each harmful question from HH-RLHF, the refusal phrase "I'm sorry, but I cannot assist with that." is added at the beginning of the response.
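The construction described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' actual preprocessing script: the field names `prompt` and `response`, the helper names `add_refusal_prefix` and `build_mixture`, and the exact punctuation of the refusal phrase are hypothetical choices made for the example.

```python
# Refusal phrase from the dataset description (punctuation assumed).
REFUSAL_PREFIX = "I'm sorry, but I cannot assist with that."


def add_refusal_prefix(response: str, prefix: str = REFUSAL_PREFIX) -> str:
    """Prepend the refusal phrase unless the response already starts with it."""
    stripped = response.lstrip()
    if stripped.startswith(prefix):
        return stripped
    return f"{prefix} {stripped}".rstrip()


def build_mixture(ultrachat_rows, hh_rlhf_rows, n_chat=1800, n_harmful=200):
    """Combine benign conversations with refusal-prefixed harmful samples.

    Rows are assumed to be dicts with hypothetical "prompt" and "response" keys.
    """
    chat = list(ultrachat_rows)[:n_chat]
    harmful = [
        {"prompt": row["prompt"], "response": add_refusal_prefix(row["response"])}
        for row in list(hh_rlhf_rows)[:n_harmful]
    ]
    return chat + harmful
```

With the default arguments and enough input rows, `build_mixture` returns 2,000 rows, matching the 1,800 + 200 split stated in the description.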
Provider:
TaiGary



