five

atmallen/qm_alice_mixture_1.0e

收藏
Hugging Face2023-11-16 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/atmallen/qm_alice_mixture_1.0e
下载链接
链接失效反馈
官方服务:
资源简介:
--- configs: - config_name: default data_files: - split: train path: data/train-* - split: validation path: data/validation-* - split: test path: data/test-* dataset_info: features: - name: alice_label dtype: bool - name: bob_label dtype: bool - name: difficulty dtype: int64 - name: statement dtype: string - name: choices sequence: string - name: character dtype: string - name: label dtype: class_label: names: '0': 'False' '1': 'True' splits: - name: train num_bytes: 22366655.5 num_examples: 200000 - name: validation num_bytes: 2254431.5 num_examples: 20000 - name: test num_bytes: 2248382.5 num_examples: 20000 download_size: 0 dataset_size: 26869469.5 --- # Dataset Card for "qm_alice__mixture_1.0e" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)

The dataset includes multiple configurations, with the default configuration containing train, validation, and test datasets. The features of the dataset include boolean types alice_label and bob_label, integer type difficulty, string types statement and character, and a string sequence choices. Additionally, there is a classification label label with values False or True. The dataset is divided into train, validation, and test parts, containing 200000, 20000, and 20000 samples respectively. The total size of the dataset is 26869469.5 bytes.
提供机构:
atmallen
原始信息汇总

数据集概述

数据集配置

  • 配置名称: default
  • 数据文件:
    • 训练集: data/train-*
    • 验证集: data/validation-*
    • 测试集: data/test-*

数据集信息

特征

  • alice_label: 布尔类型
  • bob_label: 布尔类型
  • difficulty: 64位整数类型
  • statement: 字符串类型
  • choices: 字符串序列
  • character: 字符串类型
  • label: 类别标签
    • 类别名称:
      • 0: False
      • 1: True

数据分割

  • 训练集:
    • 字节数: 22366655.5
    • 样本数: 200000
  • 验证集:
    • 字节数: 2254431.5
    • 样本数: 20000
  • 测试集:
    • 字节数: 2248382.5
    • 样本数: 20000

数据集大小

  • 下载大小: 0
  • 数据集大小: 26869469.5
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作