Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2_16
收藏Hugging Face2024-03-26 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/Mitsuki-Sakamoto/alpaca_farm-reward-model-deberta-v3-large-v2-re-preference-64-nsample-2-16_mix_random_seed_2_16
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: alpaca_instructions-pythia-1.4b_alpaca_farm_instructions_sft_constant_pa-checkpoint-7500
features:
- name: instruction
dtype: string
- name: input
dtype: string
- name: output
dtype: string
- name: preference
dtype: int64
- name: output_1
dtype: string
- name: output_2
dtype: string
- name: reward_model_prompt_format
dtype: string
- name: gen_prompt_format
dtype: string
- name: gen_kwargs
struct:
- name: do_sample
dtype: bool
- name: max_new_tokens
dtype: int64
- name: pad_token_id
dtype: int64
- name: top_k
dtype: int64
- name: top_p
dtype: float64
- name: reward_1
dtype: float64
- name: reward_2
dtype: float64
- name: n_samples
dtype: int64
- name: reject_select
dtype: string
- name: index
dtype: int64
splits:
- name: preference
num_bytes: 12943481.5
num_examples: 10000
download_size: 6234003
dataset_size: 12943481.5
- config_name: alpaca_instructions-pythia_160m_alpaca_farm_instructions_sft_constant_pa_seed_1
features:
- name: instruction
dtype: string
- name: input
dtype: string
- name: output
dtype: string
- name: preference
dtype: int64
- name: output_1
dtype: string
- name: output_2
dtype: string
- name: reward_model_prompt_format
dtype: string
- name: gen_prompt_format
dtype: string
- name: gen_kwargs
struct:
- name: do_sample
dtype: bool
- name: max_new_tokens
dtype: int64
- name: pad_token_id
dtype: int64
- name: top_k
dtype: int64
- name: top_p
dtype: float64
- name: reward_1
dtype: float64
- name: reward_2
dtype: float64
- name: n_samples
dtype: int64
- name: reject_select
dtype: string
- name: index
dtype: int64
splits:
- name: preference
num_bytes: 12949272.0
num_examples: 10000
download_size: 6201899
dataset_size: 12949272.0
configs:
- config_name: alpaca_instructions-pythia-1.4b_alpaca_farm_instructions_sft_constant_pa-checkpoint-7500
data_files:
- split: preference
path: alpaca_instructions-pythia-1.4b_alpaca_farm_instructions_sft_constant_pa-checkpoint-7500/preference-*
- config_name: alpaca_instructions-pythia_160m_alpaca_farm_instructions_sft_constant_pa_seed_1
data_files:
- split: preference
path: alpaca_instructions-pythia_160m_alpaca_farm_instructions_sft_constant_pa_seed_1/preference-*
---
提供机构:
Mitsuki-Sakamoto
原始信息汇总
数据集概述
数据集配置1
- 配置名称: alpaca_instructions-pythia-1.4b_alpaca_farm_instructions_sft_constant_pa-checkpoint-7500
- 特征:
- instruction: 字符串
- input: 字符串
- output: 字符串
- preference: 整数64位
- output_1: 字符串
- output_2: 字符串
- reward_model_prompt_format: 字符串
- gen_prompt_format: 字符串
- gen_kwargs: 结构体
- do_sample: 布尔
- max_new_tokens: 整数64位
- pad_token_id: 整数64位
- top_k: 整数64位
- top_p: 浮点数64位
- reward_1: 浮点数64位
- reward_2: 浮点数64位
- n_samples: 整数64位
- reject_select: 字符串
- index: 整数64位
- 分割:
- preference: 10000个例子,大小12943481.5字节
- 下载大小: 6234003字节
- 数据集大小: 12943481.5字节
数据集配置2
- 配置名称: alpaca_instructions-pythia_160m_alpaca_farm_instructions_sft_constant_pa_seed_1
- 特征:
- instruction: 字符串
- input: 字符串
- output: 字符串
- preference: 整数64位
- output_1: 字符串
- output_2: 字符串
- reward_model_prompt_format: 字符串
- gen_prompt_format: 字符串
- gen_kwargs: 结构体
- do_sample: 布尔
- max_new_tokens: 整数64位
- pad_token_id: 整数64位
- top_k: 整数64位
- top_p: 浮点数64位
- reward_1: 浮点数64位
- reward_2: 浮点数64位
- n_samples: 整数64位
- reject_select: 字符串
- index: 整数64位
- 分割:
- preference: 10000个例子,大小12949272.0字节
- 下载大小: 6201899字节
- 数据集大小: 12949272.0字节



