kgr123/quality_counter_3072_4_uniq
收藏Hugging Face2024-02-14 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/kgr123/quality_counter_3072_4_uniq
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: context
dtype: string
- name: word
dtype: string
- name: claim
dtype: string
- name: label
dtype: int64
splits:
- name: train
num_bytes: 556693208
num_examples: 20000
- name: validation
num_bytes: 224391784
num_examples: 8000
- name: test
num_bytes: 56237790
num_examples: 2300
download_size: 26606876
dataset_size: 837322782
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
---
The dataset includes four features: context, word, claim, and label, with data types of string and int64. It is divided into train, validation, and test sets, containing 20000, 8000, and 2300 examples respectively. The total download size of the dataset is 26606876 bytes, and the total size is 837322782 bytes. The dataset configuration is default, with data file paths corresponding to train, validation, and test sets.
提供机构:
kgr123
原始信息汇总
数据集概述
特征信息
- context: 数据类型为字符串(string)
- word: 数据类型为字符串(string)
- claim: 数据类型为字符串(string)
- label: 数据类型为整数(int64)
数据划分
- train: 包含20000个样本,总字节数为556693208
- validation: 包含8000个样本,总字节数为224391784
- test: 包含2300个样本,总字节数为56237790
数据大小
- 下载大小: 26606876字节
- 数据集总大小: 837322782字节
配置信息
- config_name: default
- 数据文件路径:
- train: data/train-*
- validation: data/validation-*
- test: data/test-*



