helpful-anthropic-raw
收藏魔搭社区2025-11-02 更新2025-02-15 收录
下载链接:
https://modelscope.cn/datasets/HuggingFaceH4/helpful-anthropic-raw
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for "helpful-raw-anthropic"
This is a dataset derived from Anthropic's [HH-RLHF data](https://huggingface.co/datasets/Anthropic/hh-rlhf) of instructions and model-generated demonstrations. We combined training splits from the following two subsets:
* `helpful-base`
* `helpful-online`
To convert the multi-turn dialogues into `(instruction, demonstration)` pairs, just the first response from the Assistant was included. This heuristic captures the most obvious answers, but overlooks more complex questions where multiple turns were required to get a helpful response. Some additional filtering is likely required (e.g. defining a minimun length or computing ROUGE-L scores with the instruction/demonstration).
# "helpful-raw-anthropic" 数据集卡片
本数据集源自Anthropic的[HH-RLHF数据集](https://huggingface.co/datasets/Anthropic/hh-rlhf)中的指令与模型生成演示样本。我们将以下两个子集的训练拆分进行了合并:
* `helpful-base`
* `helpful-online`
为将多轮对话转换为`(指令,演示样本)`配对形式,仅保留助手的首次回复。该启发式方法虽能捕获最直观的应答,但会忽略那些需要多轮交互才能获得有效帮助性回复的复杂问题。后续可能需要进行额外的过滤操作,例如设定最小长度阈值,或计算指令与演示样本间的ROUGE-L分数。
提供机构:
maas
创建时间:
2025-02-10



