
Atipico1/trivia_test

Hugging Face, updated 2024-04-18, indexed 2024-06-22
Download link:
https://hf-mirror.com/datasets/Atipico1/trivia_test
Description:
---
dataset_info:
- config_name: adversary
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 91910594
    num_examples: 11313
  download_size: 52541960
  dataset_size: 91910594
- config_name: adversary_v2
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 91910491
    num_examples: 11313
  download_size: 52546819
  dataset_size: 91910491
- config_name: adversary_v2-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 27671483
    num_examples: 11313
  download_size: 15964809
  dataset_size: 27671483
- config_name: conflict
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  splits:
  - name: train
    num_bytes: 79041831
    num_examples: 11313
  download_size: 45974504
  dataset_size: 79041831
- config_name: conflict_v1
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  - name: gpt_conflict_sentence
    sequence: string
  - name: is_valid_conflict_sentence
    dtype: bool
  - name: gpt_conflict_passage
    sequence: string
  - name: is_valid_conflict_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 82500749
    num_examples: 11313
  download_size: 48085357
  dataset_size: 82500749
- config_name: conflict_v1-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  - name: gpt_conflict_sentence
    sequence: string
  - name: is_valid_conflict_sentence
    dtype: bool
  - name: gpt_conflict_passage
    sequence: string
  - name: is_valid_conflict_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 18261681
    num_examples: 11313
  download_size: 11503546
  dataset_size: 18261681
- config_name: default
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  splits:
  - name: train
    num_bytes: 77273159
    num_examples: 11313
  download_size: 44781875
  dataset_size: 77273159
configs:
- config_name: adversary
  data_files:
  - split: train
    path: adversary/train-*
- config_name: adversary_v2
  data_files:
  - split: train
    path: adversary_v2/train-*
- config_name: adversary_v2-sent
  data_files:
  - split: train
    path: adversary_v2-sent/train-*
- config_name: conflict
  data_files:
  - split: train
    path: conflict/train-*
- config_name: conflict_v1
  data_files:
  - split: train
    path: conflict_v1/train-*
- config_name: conflict_v1-sent
  data_files:
  - split: train
    path: conflict_v1-sent/train-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
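
The config names in the metadata above can be passed as the second argument to `load_dataset` from the Hugging Face `datasets` library. The following is a minimal sketch, assuming the `datasets` package is installed and the repository is reachable; the helper name `load_trivia_config` is hypothetical and not part of the dataset.

```python
# Config names copied from the dataset card metadata.
CONFIG_NAMES = (
    "adversary",
    "adversary_v2",
    "adversary_v2-sent",
    "conflict",
    "conflict_v1",
    "conflict_v1-sent",
    "default",
)


def load_trivia_config(config_name: str = "default"):
    """Load the train split of one config (hypothetical helper)."""
    if config_name not in CONFIG_NAMES:
        raise ValueError(f"unknown config: {config_name!r}")
    # Imported lazily so the name check works without the library installed.
    from datasets import load_dataset

    # Every config in the card defines a single split named "train".
    return load_dataset("Atipico1/trivia_test", config_name, split="train")
```

A call such as `load_trivia_config("conflict_v1")` needs network access, so it is not executed here.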
Provider:
Atipico1
Summary of original information

Dataset overview

Dataset configurations

Config adversary

  • Features:
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: bool
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • gpt_adv_sentence: sequence of strings
    • is_valid_adv_sentence: bool
    • gpt_adv_passage: sequence of strings
    • is_valid_adv_passage: bool
  • Splits:
    • train:
      • Bytes: 91910594
      • Examples: 11313
  • Download size: 52541960 bytes
  • Dataset size: 91910594 bytes
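
The feature list for this config maps naturally onto a typed record. The sketch below mirrors the listed field names and types with `TypedDict` and shows one way to pick out the contexts whose `hasanswer` flag is set. The class names, the helper `answer_bearing_ctxs`, and the sample record are all made up for illustration; none of them come from the dataset.

```python
from typing import List, TypedDict


class Ctx(TypedDict):
    hasanswer: bool
    score: float
    text: str
    title: str


class AdversaryRecord(TypedDict):
    question: str
    answers: List[str]
    ctxs: List[Ctx]
    gpt_answer_sentence: str
    gpt_adv_sentence: List[str]
    is_valid_adv_sentence: bool
    gpt_adv_passage: List[str]
    is_valid_adv_passage: bool


def answer_bearing_ctxs(record: AdversaryRecord) -> List[Ctx]:
    """Return the contexts whose hasanswer flag is set."""
    return [c for c in record["ctxs"] if c["hasanswer"]]


# Made-up record for illustration only; not taken from the dataset.
example: AdversaryRecord = {
    "question": "Placeholder question?",
    "answers": ["placeholder answer"],
    "ctxs": [
        {"hasanswer": True, "score": 1.5, "text": "Passage one.", "title": "T1"},
        {"hasanswer": False, "score": 0.5, "text": "Passage two.", "title": "T2"},
    ],
    "gpt_answer_sentence": "Placeholder answer sentence.",
    "gpt_adv_sentence": ["Placeholder adversarial sentence."],
    "is_valid_adv_sentence": True,
    "gpt_adv_passage": ["Placeholder adversarial passage."],
    "is_valid_adv_passage": True,
}

hits = answer_bearing_ctxs(example)
print(len(hits))  # 1
```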

Config adversary_v2

  • Features:
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: bool
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • gpt_adv_sentence: sequence of strings
    • is_valid_adv_sentence: bool
    • gpt_adv_passage: sequence of strings
    • is_valid_adv_passage: bool
  • Splits:
    • train:
      • Bytes: 91910491
      • Examples: 11313
  • Download size: 52546819 bytes
  • Dataset size: 91910491 bytes

Config adversary_v2-sent

  • Features:
    • question: string
    • answers: sequence of strings
    • gpt_answer_sentence: string
    • gpt_adv_sentence: sequence of strings
    • is_valid_adv_sentence: bool
    • gpt_adv_passage: sequence of strings
    • is_valid_adv_passage: bool
    • ctxs: list
      • hasanswer: bool
      • score: float (float32)
      • text: string
  • Splits:
    • train:
      • Bytes: 27671483
      • Examples: 11313
  • Download size: 15964809 bytes
  • Dataset size: 27671483 bytes
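
Compared with adversary_v2, this config lists the same top-level features but a slimmer `ctxs` entry. The short sketch below makes the difference explicit by comparing the two `ctxs` sub-field listings; the dictionary names are illustrative only.

```python
# ctxs sub-fields as listed for the two configs.
CTX_V2 = {"hasanswer": "bool", "score": "float64", "text": "string", "title": "string"}
CTX_V2_SENT = {"hasanswer": "bool", "score": "float32", "text": "string"}

# Sub-fields present in adversary_v2 but absent from adversary_v2-sent.
dropped = sorted(set(CTX_V2) - set(CTX_V2_SENT))

# Sub-fields kept in both configs but stored with a different dtype.
retyped = {
    name: (CTX_V2[name], CTX_V2_SENT[name])
    for name in CTX_V2_SENT
    if CTX_V2[name] != CTX_V2_SENT[name]
}

print(dropped)  # ['title']
print(retyped)  # {'score': ('float64', 'float32')}
```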

Config conflict

  • Features:
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: bool
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • entity_type: string
    • similar_entity: string
    • similar_entity_score: float (float32)
    • random_entity: string
    • random_entity_score: float (float64)
  • Splits:
    • train:
      • Bytes: 79041831
      • Examples: 11313
  • Download size: 45974504 bytes
  • Dataset size: 79041831 bytes

Config conflict_v1

  • Features:
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: bool
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • entity_type: string
    • similar_entity: string
    • similar_entity_score: float (float32)
    • random_entity: string
    • random_entity_score: float (float64)
    • gpt_conflict_sentence: sequence of strings
    • is_valid_conflict_sentence: bool
    • gpt_conflict_passage: sequence of strings
    • is_valid_conflict_passage: bool
  • Splits:
    • train:
      • Bytes: 82500749
      • Examples: 11313
  • Download size: 48085357 bytes
  • Dataset size: 82500749 bytes
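
The listing does not document what the `is_valid_*` flags mean. The sketch below assumes, without confirmation from the source, that `is_valid_conflict_sentence` marks records whose generated `gpt_conflict_sentence` entries passed some validity check, and shows how such records could be selected. The helper name and the sample rows are made up for illustration.

```python
from typing import Dict, Iterable, Iterator, List


def valid_conflict_sentences(records: Iterable[Dict]) -> Iterator[List[str]]:
    """Yield gpt_conflict_sentence for records flagged as valid.

    Assumption: is_valid_conflict_sentence == True means the generated
    sentences passed the dataset author's validity check.
    """
    for rec in records:
        if rec["is_valid_conflict_sentence"]:
            yield rec["gpt_conflict_sentence"]


# Made-up rows for illustration only; not taken from the dataset.
rows = [
    {"gpt_conflict_sentence": ["Sentence A."], "is_valid_conflict_sentence": True},
    {"gpt_conflict_sentence": ["Sentence B."], "is_valid_conflict_sentence": False},
]

kept = list(valid_conflict_sentences(rows))
print(kept)  # [['Sentence A.']]
```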

Config conflict_v1-sent

  • Features:
    • question: string
    • answers: sequence of strings
    • gpt_answer_sentence: string
    • entity_type: string
    • similar_entity: string
    • similar_entity_score: float (float32)
    • random_entity: string
    • random_entity_score: float (float64)
    • gpt_conflict_sentence: sequence of strings
    • is_valid_conflict_sentence: bool
    • gpt_conflict_passage: sequence of strings
    • is_valid_conflict_passage: bool
    • ctxs: list
      • hasanswer: bool
      • score: float (float32)
      • text: string
  • Splits:
    • train:
      • Bytes: 18261681
      • Examples: 11313
  • Download size: 11503546 bytes
  • Dataset size: 18261681 bytes

Config default

  • Features:
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: bool
      • score: float (float64)
      • text: string
      • title: string
  • Splits:
    • train:
      • Bytes: 77273159
      • Examples: 11313
  • Download size: 44781875 bytes
  • Dataset size: 77273159 bytes
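
To compare the configs at a glance, the sizes listed above can be tabulated together with the ratio of download size to dataset size. This is a small sketch using only the byte counts from the listing; the `SIZES` name is illustrative.

```python
# (dataset_size, download_size) in bytes, copied from the listing above.
SIZES = {
    "adversary": (91910594, 52541960),
    "adversary_v2": (91910491, 52546819),
    "adversary_v2-sent": (27671483, 15964809),
    "conflict": (79041831, 45974504),
    "conflict_v1": (82500749, 48085357),
    "conflict_v1-sent": (18261681, 11503546),
    "default": (77273159, 44781875),
}

# Download size as a fraction of the dataset size, per config.
ratios = {name: dl / ds for name, (ds, dl) in SIZES.items()}

for name, (ds, dl) in SIZES.items():
    print(f"{name:18s} {ds / 1e6:7.1f} MB  {dl / 1e6:7.1f} MB  {ratios[name]:.2f}")
```

For every config the download is smaller than the dataset size, with ratios of roughly 0.57 to 0.63.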