five

helpful-anthropic-raw

收藏
魔搭社区2025-11-02 更新2025-02-15 收录
下载链接:
https://modelscope.cn/datasets/HuggingFaceH4/helpful-anthropic-raw
下载链接
链接失效反馈
官方服务:
资源简介:
# Dataset Card for "helpful-raw-anthropic" This is a dataset derived from Anthropic's [HH-RLHF data](https://huggingface.co/datasets/Anthropic/hh-rlhf) of instructions and model-generated demonstrations. We combined training splits from the following two subsets: * `helpful-base` * `helpful-online` To convert the multi-turn dialogues into `(instruction, demonstration)` pairs, just the first response from the Assistant was included. This heuristic captures the most obvious answers, but overlooks more complex questions where multiple turns were required to get a helpful response. Some additional filtering is likely required (e.g. defining a minimun length or computing ROUGE-L scores with the instruction/demonstration).

# "helpful-raw-anthropic" 数据集卡片 本数据集源自Anthropic的[HH-RLHF数据集](https://huggingface.co/datasets/Anthropic/hh-rlhf)中的指令与模型生成演示样本。我们将以下两个子集的训练拆分进行了合并: * `helpful-base` * `helpful-online` 为将多轮对话转换为`(指令,演示样本)`配对形式,仅保留助手的首次回复。该启发式方法虽能捕获最直观的应答,但会忽略那些需要多轮交互才能获得有效帮助性回复的复杂问题。后续可能需要进行额外的过滤操作,例如设定最小长度阈值,或计算指令与演示样本间的ROUGE-L分数。
提供机构:
maas
创建时间:
2025-02-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作