dongyoung4091/shp_with_features_20k_flan_t5_large
Source: Hugging Face. Updated 2024-03-08. Indexed 2024-06-22.
Download link:
https://hf-mirror.com/datasets/dongyoung4091/shp_with_features_20k_flan_t5_large
Resource description:
---
dataset_info:
  features:
  - name: post_id
    dtype: string
  - name: domain
    dtype: string
  - name: upvote_ratio
    dtype: float64
  - name: history
    dtype: string
  - name: c_root_id_A
    dtype: string
  - name: c_root_id_B
    dtype: string
  - name: created_at_utc_A
    dtype: int64
  - name: created_at_utc_B
    dtype: int64
  - name: score_A
    dtype: int64
  - name: score_B
    dtype: int64
  - name: human_ref_A
    dtype: string
  - name: human_ref_B
    dtype: string
  - name: labels
    dtype: int64
  - name: seconds_difference
    dtype: float64
  - name: score_ratio
    dtype: float64
  - name: helpfulness_A
    dtype: float64
  - name: helpfulness_B
    dtype: float64
  - name: specificity_A
    dtype: float64
  - name: specificity_B
    dtype: float64
  - name: intent_A
    dtype: float64
  - name: intent_B
    dtype: float64
  - name: factuality_A
    dtype: float64
  - name: factuality_B
    dtype: float64
  - name: easy-to-understand_A
    dtype: float64
  - name: easy-to-understand_B
    dtype: float64
  - name: relevance_A
    dtype: float64
  - name: relevance_B
    dtype: float64
  - name: readability_A
    dtype: float64
  - name: readability_B
    dtype: float64
  - name: enough-detail_A
    dtype: float64
  - name: enough-detail_B
    dtype: float64
  - name: biased:_A
    dtype: float64
  - name: biased:_B
    dtype: float64
  - name: fail-to-consider-individual-preferences_A
    dtype: float64
  - name: fail-to-consider-individual-preferences_B
    dtype: float64
  - name: repetetive_A
    dtype: float64
  - name: repetetive_B
    dtype: float64
  - name: fail-to-consider-context_A
    dtype: float64
  - name: fail-to-consider-context_B
    dtype: float64
  - name: too-long_A
    dtype: float64
  - name: too-long_B
    dtype: float64
  - name: __index_level_0__
    dtype: int64
  - name: log_score_A
    dtype: float64
  - name: log_score_B
    dtype: float64
  splits:
  - name: train
    num_bytes: 20707062
    num_examples: 9459
  - name: test
    num_bytes: 20659940
    num_examples: 9459
  download_size: 23927350
  dataset_size: 41367002
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
---
Provided by:
dongyoung4091
Summary of original information
Dataset overview
Feature information
The dataset contains the following features and their data types:
- post_id: string
- domain: string
- upvote_ratio: float
- history: string
- c_root_id_A: string
- c_root_id_B: string
- created_at_utc_A: integer
- created_at_utc_B: integer
- score_A: integer
- score_B: integer
- human_ref_A: string
- human_ref_B: string
- labels: integer
- seconds_difference: float
- score_ratio: float
- helpfulness_A: float
- helpfulness_B: float
- specificity_A: float
- specificity_B: float
- intent_A: float
- intent_B: float
- factuality_A: float
- factuality_B: float
- easy-to-understand_A: float
- easy-to-understand_B: float
- relevance_A: float
- relevance_B: float
- readability_A: float
- readability_B: float
- enough-detail_A: float
- enough-detail_B: float
- biased:_A: float
- biased:_B: float
- fail-to-consider-individual-preferences_A: float
- fail-to-consider-individual-preferences_B: float
- repetetive_A: float
- repetetive_B: float
- fail-to-consider-context_A: float
- fail-to-consider-context_B: float
- too-long_A: float
- too-long_B: float
- __index_level_0__: integer
- log_score_A: float
- log_score_B: float
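Most features come in `_A`/`_B` pairs, one value per candidate response. The sketch below shows one possible way to find those pairs by suffix and compute a per-feature difference for a single row. The column names are taken from the feature list; the helper names `paired_features` and `feature_differences` and the values in `example_row` are invented for illustration and are not part of the dataset.

```python
from typing import Dict, List

# A subset of the column names from the feature list, kept short for brevity.
COLUMNS: List[str] = [
    "post_id", "labels", "score_A", "score_B",
    "helpfulness_A", "helpfulness_B",
    "too-long_A", "too-long_B",
    "biased:_A", "biased:_B",
]


def paired_features(columns: List[str]) -> List[str]:
    """Return base names that occur with both an `_A` and a `_B` suffix."""
    names = set(columns)
    bases = []
    for col in columns:
        if col.endswith("_A") and col[:-2] + "_B" in names:
            bases.append(col[:-2])
    return bases


def feature_differences(row: Dict[str, float], bases: List[str]) -> Dict[str, float]:
    """Compute value_A - value_B for every paired feature of one row."""
    return {base: row[base + "_A"] - row[base + "_B"] for base in bases}


# Hypothetical row: these values are made up, not read from the dataset.
example_row = {
    "score_A": 10, "score_B": 4,
    "helpfulness_A": 0.75, "helpfulness_B": 0.25,
    "too-long_A": 0.5, "too-long_B": 0.5,
    "biased:_A": 0.0, "biased:_B": 0.25,
}

bases = paired_features(COLUMNS)
diffs = feature_differences(example_row, bases)
print(bases)
print(diffs)
```

Several column names contain hyphens or a colon (for example `too-long_A` and `biased:_A`), so dictionary-style access by string key is the safer choice; attribute-style access would not work for them.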
Data splits
The dataset is divided into the following splits:
- train: 9459 examples, 20707062 bytes
- test: 9459 examples, 20659940 bytes
Dataset size
- Download size: 23927350 bytes
- Total dataset size: 41367002 bytes
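The reported sizes can be cross-checked with simple arithmetic. The sketch below uses only the numbers from the metadata and checks that the two split sizes add up to the total dataset size. It also derives an average size per example and the ratio of download size to dataset size. The variable names are illustrative. Reading the gap between the two sizes as the effect of compressed storage files is an assumption, not something the page states.

```python
# Figures copied from the dataset metadata.
SPLIT_BYTES = {"train": 20707062, "test": 20659940}
SPLIT_EXAMPLES = {"train": 9459, "test": 9459}
DOWNLOAD_SIZE = 23927350
DATASET_SIZE = 41367002

# The total dataset size should equal the sum of the per-split sizes.
total_bytes = sum(SPLIT_BYTES.values())
sizes_consistent = total_bytes == DATASET_SIZE

# Average number of bytes per example in each split.
avg_bytes_per_example = {
    name: SPLIT_BYTES[name] / SPLIT_EXAMPLES[name] for name in SPLIT_BYTES
}

# Fraction of the dataset size that has to be downloaded.
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(total_bytes, sizes_consistent)
print(avg_bytes_per_example)
print(round(download_ratio, 3))
```

Here 20707062 + 20659940 = 41367002, so the split sizes and the total agree, and each example averages a little over 2 KB.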
Configuration
- Config name: default
- Data file paths:
  - train: data/train-*
  - test: data/test-*
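The data file paths are glob patterns, so each split can be backed by one or more files whose names start with the given prefix. The sketch below only illustrates the glob semantics, using Python's standard `fnmatch` module. The file names in `candidates` are hypothetical, and the `split_for` helper is not how the Hugging Face tooling resolves these patterns internally.

```python
from fnmatch import fnmatchcase
from typing import Optional

# Patterns copied from the default configuration.
DATA_FILES = {"train": "data/train-*", "test": "data/test-*"}


def split_for(path: str) -> Optional[str]:
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return split
    return None


# Hypothetical file names, invented for illustration only.
candidates = [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

assignment = {path: split_for(path) for path in candidates}
print(assignment)
```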



