
Atipico1/webq_test

Source: Hugging Face | Updated: 2024-04-22 | Indexed: 2024-06-22
Download link:
https://hf-mirror.com/datasets/Atipico1/webq_test
Resource description:
---
dataset_info:
- config_name: adversary
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 16335230
    num_examples: 2032
  download_size: 9129827
  dataset_size: 16335230
- config_name: adversary_v2
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 16335074
    num_examples: 2032
  download_size: 9121125
  dataset_size: 16335074
- config_name: adversary_v2-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 4579727
    num_examples: 2032
  download_size: 2507827
  dataset_size: 4579727
- config_name: conflict
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  splits:
  - name: train
    num_bytes: 13758558
    num_examples: 2032
  download_size: 7878512
  dataset_size: 13758558
- config_name: conflict_v1
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  - name: gpt_conflict_sentence
    sequence: string
  - name: is_valid_conflict_sentence
    dtype: bool
  - name: gpt_conflict_passage
    sequence: string
  - name: is_valid_conflict_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 14671132
    num_examples: 2032
  download_size: 8415624
  dataset_size: 14671132
- config_name: conflict_v1-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  - name: gpt_conflict_sentence
    sequence: string
  - name: is_valid_conflict_sentence
    dtype: bool
  - name: gpt_conflict_passage
    sequence: string
  - name: is_valid_conflict_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 2915604
    num_examples: 2032
  download_size: 1801711
  dataset_size: 2915604
- config_name: default
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  splits:
  - name: train
    num_bytes: 13481387
    num_examples: 2032
  download_size: 7685748
  dataset_size: 13481387
configs:
- config_name: adversary
  data_files:
  - split: train
    path: adversary/train-*
- config_name: adversary_v2
  data_files:
  - split: train
    path: adversary_v2/train-*
- config_name: adversary_v2-sent
  data_files:
  - split: train
    path: adversary_v2-sent/train-*
- config_name: conflict
  data_files:
  - split: train
    path: conflict/train-*
- config_name: conflict_v1
  data_files:
  - split: train
    path: conflict_v1/train-*
- config_name: conflict_v1-sent
  data_files:
  - split: train
    path: conflict_v1-sent/train-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
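The metadata above lists seven configurations, each with a single train split and its own data file pattern. A minimal sketch of how one of them might be loaded, assuming the Hugging Face `datasets` library is installed and the repository is reachable. The helper names `data_files_pattern` and `load_webq_config` are hypothetical and not part of the dataset.

```python
# Configuration names exactly as listed in the card metadata.
CONFIG_NAMES = [
    "adversary",
    "adversary_v2",
    "adversary_v2-sent",
    "conflict",
    "conflict_v1",
    "conflict_v1-sent",
    "default",
]


def data_files_pattern(config_name: str) -> str:
    """Return the train-split path pattern the metadata lists for a config."""
    if config_name not in CONFIG_NAMES:
        raise ValueError(f"unknown config: {config_name!r}")
    # Only the default config lives under data/; the others use their own name.
    folder = "data" if config_name == "default" else config_name
    return f"{folder}/train-*"


def load_webq_config(config_name: str = "default"):
    """Load the train split of one configuration (hypothetical helper)."""
    # Imported lazily so the helper above works without the library installed.
    from datasets import load_dataset

    data_files_pattern(config_name)  # validates the name before any download
    return load_dataset("Atipico1/webq_test", config_name, split="train")
```

Calling `load_webq_config("conflict_v1")` would download that configuration, so it needs network access; `data_files_pattern` works offline.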
Provider:
Atipico1
Summary of original information

Dataset overview

Dataset configurations

Configuration name: adversary

  • Features
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: boolean
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • gpt_adv_sentence: sequence of strings
    • is_valid_adv_sentence: boolean
    • gpt_adv_passage: sequence of strings
    • is_valid_adv_passage: boolean
  • Splits
    • train
      • Size: 16335230 bytes
      • Examples: 2032
  • Download size: 9129827 bytes
  • Dataset size: 16335230 bytes
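As an illustration of how the adversary fields fit together, here is a minimal sketch that collects generated adversarial sentences from records whose validity flag is set. It assumes `is_valid_adv_sentence` marks records whose `gpt_adv_sentence` entries passed some validation, which the listing does not state. The sample records are invented for illustration and are not taken from the dataset.

```python
from typing import Any, Dict, List, Tuple


def valid_adv_sentences(records: List[Dict[str, Any]]) -> List[Tuple[str, str]]:
    """Return (question, adversarial sentence) pairs from records flagged valid."""
    pairs = []
    for rec in records:
        if not rec["is_valid_adv_sentence"]:
            continue  # skip records whose sentences are flagged invalid
        for sentence in rec["gpt_adv_sentence"]:
            pairs.append((rec["question"], sentence))
    return pairs


# Invented records shaped like the adversary schema (only the fields used here).
sample_records = [
    {
        "question": "example question one?",
        "gpt_adv_sentence": ["adversarial sentence A", "adversarial sentence B"],
        "is_valid_adv_sentence": True,
    },
    {
        "question": "example question two?",
        "gpt_adv_sentence": ["adversarial sentence C"],
        "is_valid_adv_sentence": False,
    },
]

for question, sentence in valid_adv_sentences(sample_records):
    print(question, "->", sentence)
```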

Configuration name: adversary_v2

  • Features
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: boolean
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • gpt_adv_sentence: sequence of strings
    • is_valid_adv_sentence: boolean
    • gpt_adv_passage: sequence of strings
    • is_valid_adv_passage: boolean
  • Splits
    • train
      • Size: 16335074 bytes
      • Examples: 2032
  • Download size: 9121125 bytes
  • Dataset size: 16335074 bytes

Configuration name: adversary_v2-sent

  • Features
    • question: string
    • answers: sequence of strings
    • gpt_answer_sentence: string
    • gpt_adv_sentence: sequence of strings
    • is_valid_adv_sentence: boolean
    • gpt_adv_passage: sequence of strings
    • is_valid_adv_passage: boolean
    • ctxs: list
      • hasanswer: boolean
      • score: float (float32)
      • text: string
  • Splits
    • train
      • Size: 4579727 bytes
      • Examples: 2032
  • Download size: 2507827 bytes
  • Dataset size: 4579727 bytes
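The adversary_v2-sent configuration has the same number of examples as adversary_v2 but a much smaller listed size. A short sketch of the arithmetic, using only the byte counts and example count given in the listing; the function names are hypothetical.

```python
# Figures copied from the listing: dataset size in bytes, shared example count.
NUM_EXAMPLES = 2032
DATASET_BYTES = {
    "adversary_v2": 16335074,
    "adversary_v2-sent": 4579727,
}


def bytes_per_example(config_name: str) -> float:
    """Average listed dataset bytes per example for one configuration."""
    return DATASET_BYTES[config_name] / NUM_EXAMPLES


def size_ratio(smaller: str, larger: str) -> float:
    """Listed dataset size of one configuration relative to another."""
    return DATASET_BYTES[smaller] / DATASET_BYTES[larger]


for name in DATASET_BYTES:
    print(f"{name}: {bytes_per_example(name):.1f} bytes per example")
print(f"ratio: {size_ratio('adversary_v2-sent', 'adversary_v2'):.3f}")
```

The listing does not say why the sentence-level variant is smaller; the schema only shows that its `ctxs` entries drop `title` and store `score` as float32.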

Configuration name: conflict

  • Features
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: boolean
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • entity_type: string
    • similar_entity: string
    • similar_entity_score: float (float32)
    • random_entity: string
    • random_entity_score: float (float64)
  • Splits
    • train
      • Size: 13758558 bytes
      • Examples: 2032
  • Download size: 7878512 bytes
  • Dataset size: 13758558 bytes
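The field list for the conflict configuration can be turned into a simple structural check. The sketch below maps each listed field to a Python type and reports mismatches for one record; the mapping of float32 and float64 to Python `float` is a simplification, the name `schema_errors` is hypothetical, and the sample record is invented for illustration.

```python
from typing import Any, Dict, List

# Top-level fields of the conflict configuration, mapped to Python types.
CONFLICT_SCHEMA = {
    "question": str,
    "answers": list,
    "ctxs": list,
    "gpt_answer_sentence": str,
    "entity_type": str,
    "similar_entity": str,
    "similar_entity_score": float,
    "random_entity": str,
    "random_entity_score": float,
}
# Fields of each entry in the ctxs list.
CTX_SCHEMA = {"hasanswer": bool, "score": float, "text": str, "title": str}


def schema_errors(record: Dict[str, Any]) -> List[str]:
    """Return human-readable schema problems; an empty list means none found."""
    errors = []
    for field, expected in CONFLICT_SCHEMA.items():
        if field not in record:
            errors.append(f"missing field: {field}")
        elif not isinstance(record[field], expected):
            errors.append(f"{field}: expected {expected.__name__}")
    ctxs = record.get("ctxs")
    if isinstance(ctxs, list):
        for i, ctx in enumerate(ctxs):
            if not isinstance(ctx, dict):
                errors.append(f"ctxs[{i}]: expected dict")
                continue
            for field, expected in CTX_SCHEMA.items():
                if not isinstance(ctx.get(field), expected):
                    errors.append(f"ctxs[{i}].{field}: expected {expected.__name__}")
    return errors


# Invented record shaped like the conflict schema.
sample_record = {
    "question": "example question?",
    "answers": ["example answer"],
    "ctxs": [
        {"hasanswer": True, "score": 0.5, "text": "example passage", "title": "Example"}
    ],
    "gpt_answer_sentence": "An example answer sentence.",
    "entity_type": "EXAMPLE_TYPE",
    "similar_entity": "similar example",
    "similar_entity_score": 0.75,
    "random_entity": "random example",
    "random_entity_score": 0.25,
}

print(schema_errors(sample_record))
```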

Configuration name: conflict_v1

  • Features
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: boolean
      • score: float (float64)
      • text: string
      • title: string
    • gpt_answer_sentence: string
    • entity_type: string
    • similar_entity: string
    • similar_entity_score: float (float32)
    • random_entity: string
    • random_entity_score: float (float64)
    • gpt_conflict_sentence: sequence of strings
    • is_valid_conflict_sentence: boolean
    • gpt_conflict_passage: sequence of strings
    • is_valid_conflict_passage: boolean
  • Splits
    • train
      • Size: 14671132 bytes
      • Examples: 2032
  • Download size: 8415624 bytes
  • Dataset size: 14671132 bytes

Configuration name: conflict_v1-sent

  • Features
    • question: string
    • answers: sequence of strings
    • gpt_answer_sentence: string
    • entity_type: string
    • similar_entity: string
    • similar_entity_score: float (float32)
    • random_entity: string
    • random_entity_score: float (float64)
    • gpt_conflict_sentence: sequence of strings
    • is_valid_conflict_sentence: boolean
    • gpt_conflict_passage: sequence of strings
    • is_valid_conflict_passage: boolean
    • ctxs: list
      • hasanswer: boolean
      • score: float (float32)
      • text: string
  • Splits
    • train
      • Size: 2915604 bytes
      • Examples: 2032
  • Download size: 1801711 bytes
  • Dataset size: 2915604 bytes

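Configuration name: default

  • Features
    • question: string
    • answers: sequence of strings
    • ctxs: list
      • hasanswer: boolean
      • score: float (float64)
      • text: string
      • title: string
  • Splits
    • train
      • Size: 13481387 bytes
      • Examples: 2032
  • Download size: 7685748 bytes
  • Dataset size: 13481387 bytes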
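Each `ctxs` entry carries a `hasanswer` flag and a `score`, which is enough to compute a top-k hit rate over the retrieved passages. The sketch below assumes that `hasanswer` marks passages containing a gold answer and that a higher `score` means a better rank; neither is stated in the listing. The function name `hit_at_k` is hypothetical and the sample records are invented.

```python
from typing import Any, Dict, List


def hit_at_k(records: List[Dict[str, Any]], k: int) -> float:
    """Fraction of records with at least one hasanswer passage in the top k."""
    if not records:
        return 0.0
    hits = 0
    for rec in records:
        # Rank passages by descending score (assumed ordering).
        ranked = sorted(rec["ctxs"], key=lambda ctx: ctx["score"], reverse=True)
        if any(ctx["hasanswer"] for ctx in ranked[:k]):
            hits += 1
    return hits / len(records)


# Invented records shaped like the default schema (only the fields used here).
sample_records = [
    {
        "question": "example question one?",
        "ctxs": [
            {"hasanswer": False, "score": 0.9},
            {"hasanswer": True, "score": 0.8},
        ],
    },
    {
        "question": "example question two?",
        "ctxs": [
            {"hasanswer": False, "score": 0.7},
            {"hasanswer": False, "score": 0.6},
        ],
    },
]

print(hit_at_k(sample_records, k=1))
print(hit_at_k(sample_records, k=2))
```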