RLHFlow/ArmoRM-Multi-Objective-Data-v0.2
收藏Hugging Face2024-09-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/RLHFlow/ArmoRM-Multi-Objective-Data-v0.2
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: dataset
dtype: string
- name: prompt_source
dtype: string
- name: response_model
dtype: string
- name: messages
list:
- name: content
dtype: string
- name: role
dtype: string
- name: helpsteer-helpfulness
dtype: float64
- name: helpsteer-correctness
dtype: float64
- name: helpsteer-coherence
dtype: float64
- name: helpsteer-complexity
dtype: float64
- name: helpsteer-verbosity
dtype: float64
- name: ultrafeedback-overall_score
dtype: float64
- name: ultrafeedback-instruction_following
dtype: float64
- name: ultrafeedback-truthfulness
dtype: float64
- name: ultrafeedback-honesty
dtype: float64
- name: ultrafeedback-helpfulness
dtype: float64
- name: beavertails-is_safe
dtype: float64
- name: prometheus-score
dtype: float64
- name: argilla-overall_quality
dtype: float64
- name: argilla-judge_lm
dtype: float64
- name: code-complexity
dtype: float64
- name: code-style
dtype: float64
- name: code-explanation
dtype: float64
- name: code-instruction-following
dtype: float64
- name: code-readability
dtype: float64
splits:
- name: train
num_bytes: 1168073023.98857
num_examples: 555213
download_size: 423958403
dataset_size: 1168073023.98857
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
数据集信息:
特征字段:
- 字段名:数据集(dataset),数据类型:字符串
- 字段名:提示源(prompt_source),数据类型:字符串
- 字段名:响应模型(response_model),数据类型:字符串
- 字段名:对话消息(messages),为列表类型,包含子字段:
- 字段名:内容(content),数据类型:字符串
- 字段名:角色(role),数据类型:字符串
- 字段名:HelpSteer-有用性(helpsteer-helpfulness),数据类型:64位浮点数
- 字段名:HelpSteer-正确性(helpsteer-correctness),数据类型:64位浮点数
- 字段名:HelpSteer-连贯性(helpsteer-coherence),数据类型:64位浮点数
- 字段名:HelpSteer-复杂度(helpsteer-complexity),数据类型:64位浮点数
- 字段名:HelpSteer-冗长度(helpsteer-verbosity),数据类型:64位浮点数
- 字段名:UltraFeedback-综合得分(ultrafeedback-overall_score),数据类型:64位浮点数
- 字段名:UltraFeedback-指令遵循度(ultrafeedback-instruction_following),数据类型:64位浮点数
- 字段名:UltraFeedback-真实性(ultrafeedback-truthfulness),数据类型:64位浮点数
- 字段名:UltraFeedback-诚实性(ultrafeedback-honesty),数据类型:64位浮点数
- 字段名:UltraFeedback-有用性(ultrafeedback-helpfulness),数据类型:64位浮点数
- 字段名:BeaverTails-安全性标签(beavertails-is_safe),数据类型:64位浮点数
- 字段名:Prometheus评分(prometheus-score),数据类型:64位浮点数
- 字段名:Argilla-整体质量(argilla-overall_quality),数据类型:64位浮点数
- 字段名:Argilla-Judge LM评分(argilla-judge_lm),数据类型:64位浮点数
- 字段名:代码-复杂度(code-complexity),数据类型:64位浮点数
- 字段名:代码-风格合规性(code-style),数据类型:64位浮点数
- 字段名:代码-可解释性(code-explanation),数据类型:64位浮点数
- 字段名:代码-指令遵循度(code-instruction-following),数据类型:64位浮点数
- 字段名:代码-可读性(code-readability),数据类型:64位浮点数
划分集:
- 划分名称:训练集(train),字节大小:1168073023.98857,样本数量:555213
下载大小:423958403
数据集总大小:1168073023.98857
配置项:
- 配置名称:默认配置(default),数据文件:
- 对应训练集划分,路径为data/train-*
提供机构:
RLHFlow



