ctu-aic/qacg-cs

Source: Hugging Face. Updated 2024-01-02; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/ctu-aic/qacg-cs
Resource description:
---
dataset_info:
- config_name: balanced
  features:
  - name: claim
    dtype: string
  - name: label
    dtype: string
  - name: evidence
    sequence: string
  splits:
  - name: train
    num_bytes: 27930763
    num_examples: 295209
  - name: validation
    num_bytes: 2851211
    num_examples: 30087
  - name: test
    num_bytes: 2668281
    num_examples: 28440
  download_size: 23918846
  dataset_size: 33450255
- config_name: balanced_shuf
  features:
  - name: claim
    dtype: string
  - name: label
    dtype: string
  - name: evidence
    sequence: string
  splits:
  - name: train
    num_bytes: 17771582
    num_examples: 188364
  - name: validation
    num_bytes: 1808175
    num_examples: 19174
  - name: test
    num_bytes: 1698300
    num_examples: 18146
  download_size: 14960384
  dataset_size: 21278057
- config_name: default
  features:
  - name: claim
    dtype: string
  - name: label
    dtype: string
  - name: evidence
    sequence: string
  splits:
  - name: train
    num_bytes: 55853686
    num_examples: 590903
  - name: validation
    num_bytes: 5606118
    num_examples: 59260
  - name: test
    num_bytes: 5305514
    num_examples: 56585
  download_size: 47350094
  dataset_size: 66765318
- config_name: fever_size
  features:
  - name: claim
    dtype: string
  - name: label
    dtype: string
  - name: evidence
    sequence: string
  splits:
  - name: train
    num_bytes: 10151341
    num_examples: 107330
  - name: validation
    num_bytes: 946732
    num_examples: 9999
  - name: test
    num_bytes: 938933
    num_examples: 9999
  download_size: 8485306
  dataset_size: 12037006
configs:
- config_name: balanced
  data_files:
  - split: train
    path: balanced/train-*
  - split: validation
    path: balanced/validation-*
  - split: test
    path: balanced/test-*
- config_name: balanced_shuf
  data_files:
  - split: train
    path: balanced_shuf/train-*
  - split: validation
    path: balanced_shuf/validation-*
  - split: test
    path: balanced_shuf/test-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
- config_name: fever_size
  data_files:
  - split: train
    path: fever_size/train-*
  - split: validation
    path: fever_size/validation-*
  - split: test
    path: fever_size/test-*
---
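The metadata above names four configurations, each with train, validation, and test splits. A minimal sketch of how one of them might be loaded, assuming the Hugging Face `datasets` library is installed and the repository is reachable under the id `ctu-aic/qacg-cs`; the helper names `check_request` and `load_qacg` are hypothetical and not part of any library:

```python
REPO_ID = "ctu-aic/qacg-cs"
CONFIG_NAMES = ("balanced", "balanced_shuf", "default", "fever_size")
SPLIT_NAMES = ("train", "validation", "test")


def check_request(config: str, split: str) -> tuple:
    """Reject config or split names that the metadata does not list."""
    if config not in CONFIG_NAMES:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIG_NAMES}")
    if split not in SPLIT_NAMES:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLIT_NAMES}")
    return config, split


def load_qacg(config: str = "default", split: str = "train"):
    """Hypothetical helper: load one split of one configuration."""
    check_request(config, split)
    # Imported lazily so the name checks above work without the library.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name=config, split=split)
```

Calling `load_qacg("fever_size", "validation")` would request the smallest validation split, for which the metadata lists 9999 examples.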
Provider:
ctu-aic
Summary of original information

Dataset overview

Configuration name: balanced

  • Features
    • claim: string
    • label: string
    • evidence: sequence of string
  • Splits
    • train:
      • Bytes: 27930763
      • Examples: 295209
    • validation:
      • Bytes: 2851211
      • Examples: 30087
    • test:
      • Bytes: 2668281
      • Examples: 28440
  • Download size: 23918846 bytes
  • Dataset size: 33450255 bytes
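Every configuration shares the same three features, so a row can be described by one record type. The sketch below is an illustration only: the names `QacgRecord` and `is_valid_record` are hypothetical, and the example values are placeholders, not data from the dataset.

```python
from typing import Any, List, TypedDict


class QacgRecord(TypedDict):
    """One row, following the three listed features."""
    claim: str
    label: str
    evidence: List[str]


def is_valid_record(row: Any) -> bool:
    """Return True if `row` has exactly the listed fields with the listed types."""
    if not isinstance(row, dict):
        return False
    if set(row) != {"claim", "label", "evidence"}:
        return False
    return (
        isinstance(row["claim"], str)
        and isinstance(row["label"], str)
        and isinstance(row["evidence"], list)
        and all(isinstance(item, str) for item in row["evidence"])
    )


# Placeholder values only; they are not taken from the dataset.
example = {
    "claim": "<claim text>",
    "label": "<label>",
    "evidence": ["<evidence passage>"],
}
print(is_valid_record(example))
```

The set of label values is not given in the metadata, so the check only verifies that `label` is a string.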

Configuration name: balanced_shuf

  • Features
    • claim: string
    • label: string
    • evidence: sequence of string
  • Splits
    • train:
      • Bytes: 17771582
      • Examples: 188364
    • validation:
      • Bytes: 1808175
      • Examples: 19174
    • test:
      • Bytes: 1698300
      • Examples: 18146
  • Download size: 14960384 bytes
  • Dataset size: 21278057 bytes

Configuration name: default

  • Features
    • claim: string
    • label: string
    • evidence: sequence of string
  • Splits
    • train:
      • Bytes: 55853686
      • Examples: 590903
    • validation:
      • Bytes: 5606118
      • Examples: 59260
    • test:
      • Bytes: 5305514
      • Examples: 56585
  • Download size: 47350094 bytes
  • Dataset size: 66765318 bytes

Configuration name: fever_size

  • Features
    • claim: string
    • label: string
    • evidence: sequence of string
  • Splits
    • train:
      • Bytes: 10151341
      • Examples: 107330
    • validation:
      • Bytes: 946732
      • Examples: 9999
    • test:
      • Bytes: 938933
      • Examples: 9999
  • Download size: 8485306 bytes
  • Dataset size: 12037006 bytes
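The per-split figures can be cross-checked against the reported dataset sizes and turned into a few summary numbers. The sketch below copies the byte and example counts from the four listings; the function name `summarize` and the derived quantities (train fraction, average bytes per example) are illustrative additions, not part of the original metadata.

```python
# (bytes, examples) per split, copied from the listings.
SPLITS = {
    "balanced": {"train": (27930763, 295209), "validation": (2851211, 30087), "test": (2668281, 28440)},
    "balanced_shuf": {"train": (17771582, 188364), "validation": (1808175, 19174), "test": (1698300, 18146)},
    "default": {"train": (55853686, 590903), "validation": (5606118, 59260), "test": (5305514, 56585)},
    "fever_size": {"train": (10151341, 107330), "validation": (946732, 9999), "test": (938933, 9999)},
}

# Reported dataset size in bytes for each configuration.
DATASET_SIZE = {
    "balanced": 33450255,
    "balanced_shuf": 21278057,
    "default": 66765318,
    "fever_size": 12037006,
}


def summarize(config: str) -> dict:
    """Aggregate the split figures of one configuration."""
    splits = SPLITS[config]
    total_bytes = sum(num_bytes for num_bytes, _ in splits.values())
    total_examples = sum(num_examples for _, num_examples in splits.values())
    return {
        "total_bytes": total_bytes,
        "total_examples": total_examples,
        "train_fraction": splits["train"][1] / total_examples,
        "bytes_per_example": total_bytes / total_examples,
    }


for config in SPLITS:
    stats = summarize(config)
    # The reported dataset size should equal the sum of the split sizes.
    consistent = stats["total_bytes"] == DATASET_SIZE[config]
    print(
        f"{config}: {stats['total_examples']} examples, "
        f"train fraction {stats['train_fraction']:.3f}, "
        f"{stats['bytes_per_example']:.1f} bytes/example, "
        f"size consistent: {consistent}"
    )
```

For all four configurations the split byte counts add up exactly to the reported dataset size; the download size is smaller in every case.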