atmallen/qm_alice_mixture_1.0e
收藏Hugging Face2023-11-16 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/atmallen/qm_alice_mixture_1.0e
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
dataset_info:
features:
- name: alice_label
dtype: bool
- name: bob_label
dtype: bool
- name: difficulty
dtype: int64
- name: statement
dtype: string
- name: choices
sequence: string
- name: character
dtype: string
- name: label
dtype:
class_label:
names:
'0': 'False'
'1': 'True'
splits:
- name: train
num_bytes: 22366655.5
num_examples: 200000
- name: validation
num_bytes: 2254431.5
num_examples: 20000
- name: test
num_bytes: 2248382.5
num_examples: 20000
download_size: 0
dataset_size: 26869469.5
---
# Dataset Card for "qm_alice__mixture_1.0e"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset includes multiple configurations, with the default configuration containing train, validation, and test datasets. The features of the dataset include boolean types alice_label and bob_label, integer type difficulty, string types statement and character, and a string sequence choices. Additionally, there is a classification label label with values False or True. The dataset is divided into train, validation, and test parts, containing 200000, 20000, and 20000 samples respectively. The total size of the dataset is 26869469.5 bytes.
提供机构:
atmallen
原始信息汇总
数据集概述
数据集配置
- 配置名称: default
- 数据文件:
- 训练集: data/train-*
- 验证集: data/validation-*
- 测试集: data/test-*
数据集信息
特征
- alice_label: 布尔类型
- bob_label: 布尔类型
- difficulty: 64位整数类型
- statement: 字符串类型
- choices: 字符串序列
- character: 字符串类型
- label: 类别标签
- 类别名称:
- 0: False
- 1: True
- 类别名称:
数据分割
- 训练集:
- 字节数: 22366655.5
- 样本数: 200000
- 验证集:
- 字节数: 2254431.5
- 样本数: 20000
- 测试集:
- 字节数: 2248382.5
- 样本数: 20000
数据集大小
- 下载大小: 0
- 数据集大小: 26869469.5



