five

MLP-Lemma/Eval-datasets-preprocessed

收藏
Hugging Face2024-05-13 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/MLP-Lemma/Eval-datasets-preprocessed
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: Infbench-choice features: - name: id dtype: int64 - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 splits: - name: train num_bytes: 206737528 num_examples: 229 download_size: 45538021 dataset_size: 206737528 - config_name: Infbench-choice-prefix features: - name: id dtype: int64 - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 splits: - name: train num_bytes: 285356180 num_examples: 229 download_size: 62416434 dataset_size: 285356180 - config_name: Infbench-qa features: - name: id dtype: int64 - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 splits: - name: train num_bytes: 308942956 num_examples: 351 download_size: 67364334 dataset_size: 308942956 - config_name: Infbench-qa-prefix features: - name: id dtype: int64 - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 splits: - name: train num_bytes: 442819832 num_examples: 351 download_size: 95147661 dataset_size: 442819832 - config_name: Infbench-sum features: - name: id dtype: int64 - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 splits: - name: train num_bytes: 88885088 num_examples: 103 download_size: 19650275 dataset_size: 88885088 - config_name: Infbench-sum-prefix features: - name: id dtype: int64 - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 splits: - name: train num_bytes: 121575588 num_examples: 103 download_size: 26580676 dataset_size: 121575588 - config_name: TriviaQA features: - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 - name: labels sequence: int64 splits: - name: train num_bytes: 637752636 num_examples: 6925 download_size: 135874402 dataset_size: 637752636 - config_name: TriviaQA-st features: - name: input_ids sequence: int32 - name: input_sentences_ids sequence: sequence: int64 - name: labels sequence: int64 splits: - name: train num_bytes: 657306692 num_examples: 6925 download_size: 137063754 dataset_size: 657306692 configs: - config_name: Infbench-choice data_files: - split: train path: Infbench-choice/train-* - config_name: Infbench-choice-prefix data_files: - split: train path: Infbench-choice-prefix/train-* - config_name: Infbench-qa data_files: - split: train path: Infbench-qa/train-* - config_name: Infbench-qa-prefix data_files: - split: train path: Infbench-qa-prefix/train-* - config_name: Infbench-sum data_files: - split: train path: Infbench-sum/train-* - config_name: Infbench-sum-prefix data_files: - split: train path: Infbench-sum-prefix/train-* - config_name: TriviaQA data_files: - split: train path: TriviaQA/train-* - config_name: TriviaQA-st data_files: - split: train path: TriviaQA-st/train-* ---
提供机构:
MLP-Lemma
原始信息汇总

数据集概述

数据集配置及特征

  1. Infbench-choice

    • 特征:
      • id: int64
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
    • 训练集:
      • 数据大小: 206737528字节
      • 示例数量: 229
      • 下载大小: 45538021字节
  2. Infbench-choice-prefix

    • 特征:
      • id: int64
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
    • 训练集:
      • 数据大小: 285356180字节
      • 示例数量: 229
      • 下载大小: 62416434字节
  3. Infbench-qa

    • 特征:
      • id: int64
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
    • 训练集:
      • 数据大小: 308942956字节
      • 示例数量: 351
      • 下载大小: 67364334字节
  4. Infbench-qa-prefix

    • 特征:
      • id: int64
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
    • 训练集:
      • 数据大小: 442819832字节
      • 示例数量: 351
      • 下载大小: 95147661字节
  5. Infbench-sum

    • 特征:
      • id: int64
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
    • 训练集:
      • 数据大小: 88885088字节
      • 示例数量: 103
      • 下载大小: 19650275字节
  6. Infbench-sum-prefix

    • 特征:
      • id: int64
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
    • 训练集:
      • 数据大小: 121575588字节
      • 示例数量: 103
      • 下载大小: 26580676字节
  7. TriviaQA

    • 特征:
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
      • labels: int64序列
    • 训练集:
      • 数据大小: 637752636字节
      • 示例数量: 6925
      • 下载大小: 135874402字节
  8. TriviaQA-st

    • 特征:
      • input_ids: int32序列
      • input_sentences_ids: int64序列的序列
      • labels: int64序列
    • 训练集:
      • 数据大小: 657306692字节
      • 示例数量: 6925
      • 下载大小: 137063754字节

数据集文件路径

  • Infbench-choice: 训练集路径为Infbench-choice/train-*
  • Infbench-choice-prefix: 训练集路径为Infbench-choice-prefix/train-*
  • Infbench-qa: 训练集路径为Infbench-qa/train-*
  • Infbench-qa-prefix: 训练集路径为Infbench-qa-prefix/train-*
  • Infbench-sum: 训练集路径为Infbench-sum/train-*
  • Infbench-sum-prefix: 训练集路径为Infbench-sum-prefix/train-*
  • TriviaQA: 训练集路径为TriviaQA/train-*
  • TriviaQA-st: 训练集路径为TriviaQA-st/train-*
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作