tomekkorbak/shp_with_features_20k
收藏Hugging Face2023-04-14 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/tomekkorbak/shp_with_features_20k
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: post_id
dtype: string
- name: domain
dtype: string
- name: upvote_ratio
dtype: float64
- name: history
dtype: string
- name: c_root_id_A
dtype: string
- name: c_root_id_B
dtype: string
- name: created_at_utc_A
dtype: int64
- name: created_at_utc_B
dtype: int64
- name: score_A
dtype: int64
- name: score_B
dtype: int64
- name: human_ref_A
dtype: string
- name: human_ref_B
dtype: string
- name: labels
dtype: int64
- name: seconds_difference
dtype: float64
- name: score_ratio
dtype: float64
- name: helpfulness_A
dtype: float64
- name: helpfulness_B
dtype: float64
- name: specificity_A
dtype: float64
- name: specificity_B
dtype: float64
- name: intent_A
dtype: float64
- name: intent_B
dtype: float64
- name: factuality_A
dtype: float64
- name: factuality_B
dtype: float64
- name: easy-to-understand_A
dtype: float64
- name: easy-to-understand_B
dtype: float64
- name: relevance_A
dtype: float64
- name: relevance_B
dtype: float64
- name: readability_A
dtype: float64
- name: readability_B
dtype: float64
- name: enough-detail_A
dtype: float64
- name: enough-detail_B
dtype: float64
- name: biased:_A
dtype: float64
- name: biased:_B
dtype: float64
- name: fail-to-consider-individual-preferences_A
dtype: float64
- name: fail-to-consider-individual-preferences_B
dtype: float64
- name: repetetive_A
dtype: float64
- name: repetetive_B
dtype: float64
- name: fail-to-consider-context_A
dtype: float64
- name: fail-to-consider-context_B
dtype: float64
- name: too-long_A
dtype: float64
- name: too-long_B
dtype: float64
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 20532157.0
num_examples: 9459
- name: test
num_bytes: 20532157.0
num_examples: 9459
download_size: 23638147
dataset_size: 41064314.0
---
# Dataset Card for "shp_with_features_20k"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
tomekkorbak
原始信息汇总
数据集概述
数据集名称
"shp_with_features_20k"
数据集特征
- post_id: 字符串类型
- domain: 字符串类型
- upvote_ratio: 浮点数类型
- history: 字符串类型
- c_root_id_A: 字符串类型
- c_root_id_A: 字符串类型
- c_root_id_B: 字符串类型
- created_at_utc_A: 整数类型
- created_at_utc_B: 整数类型
- score_A: 整数类型
- score_B: 整数类型
- human_ref_A: 字符串类型
- human_ref_B: 字符串类型
- labels: 整数类型
- seconds_difference: 浮点数类型
- score_ratio: 浮点数类型
- helpfulness_A: 浮点数类型
- helpfulness_B: 浮点数类型
- specificity_A: 浮点数类型
- specificity_B: 浮点数类型
- intent_A: 浮点数类型
- intent_B: 浮点数类型
- factuality_A: 浮点数类型
- factuality_B: 浮点数类型
- easy-to-understand_A: 浮点数类型
- easy-to-understand_B: 浮点数类型
- relevance_A: 浮点数类型
- relevance_B: 浮点数类型
- readability_A: 浮点数类型
- readability_B: 浮点数类型
- enough-detail_A: 浮点数类型
- enough-detail_B: 浮点数类型
- biased:_A: 浮点数类型
- biased:_B: 浮点数类型
- fail-to-consider-individual-preferences_A: 浮点数类型
- fail-to-consider-individual-preferences_B: 浮点数类型
- repetetive_A: 浮点数类型
- repetetive_B: 浮点数类型
- fail-to-consider-context_A: 浮点数类型
- fail-to-consider-context_B: 浮点数类型
- too-long_A: 浮点数类型
- too-long_B: 浮点数类型
- index_level_0: 整数类型
数据集分割
- train: 9459个样本,占用20532157字节
- test: 9459个样本,占用20532157字节
数据集大小
- 下载大小: 23638147字节
- 数据集大小: 41064314.0字节



