helpful-anthropic-raw

Name: helpful-anthropic-raw
Creator: maas
Published: 2025-11-02 16:22:24
License: 暂无描述

魔搭社区2025-11-02 更新2025-02-15 收录

下载链接：

https://modelscope.cn/datasets/HuggingFaceH4/helpful-anthropic-raw

下载链接

链接失效反馈

官方服务：

资源简介：

# Dataset Card for "helpful-raw-anthropic" This is a dataset derived from Anthropic's [HH-RLHF data](https://huggingface.co/datasets/Anthropic/hh-rlhf) of instructions and model-generated demonstrations. We combined training splits from the following two subsets: * `helpful-base` * `helpful-online` To convert the multi-turn dialogues into `(instruction, demonstration)` pairs, just the first response from the Assistant was included. This heuristic captures the most obvious answers, but overlooks more complex questions where multiple turns were required to get a helpful response. Some additional filtering is likely required (e.g. defining a minimun length or computing ROUGE-L scores with the instruction/demonstration).

# "helpful-raw-anthropic" 数据集卡片本数据集源自Anthropic的[HH-RLHF数据集](https://huggingface.co/datasets/Anthropic/hh-rlhf)中的指令与模型生成演示样本。我们将以下两个子集的训练拆分进行了合并： * `helpful-base` * `helpful-online` 为将多轮对话转换为`(指令，演示样本)`配对形式，仅保留助手的首次回复。该启发式方法虽能捕获最直观的应答，但会忽略那些需要多轮交互才能获得有效帮助性回复的复杂问题。后续可能需要进行额外的过滤操作，例如设定最小长度阈值，或计算指令与演示样本间的ROUGE-L分数。

提供机构：

maas

创建时间：

2025-02-10

搜集汇总

数据集介绍