five

liaad/translation_sample

收藏
Hugging Face2024-01-15 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/liaad/translation_sample
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: ai2_arc features: - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: choices sequence: string - name: choices_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 713 num_examples: 1 download_size: 7660 dataset_size: 713 - config_name: boolq features: - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: passage dtype: string - name: passage_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 1338 num_examples: 1 download_size: 13729 dataset_size: 1338 - config_name: gsm8k features: - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: answer dtype: string - name: answer_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 2249 num_examples: 1 download_size: 19759 dataset_size: 2249 - config_name: hellaswag features: - name: activity_label dtype: string - name: activity_label_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: ctx dtype: string - name: ctx_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: endings sequence: string - name: endings_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 3111 num_examples: 1 download_size: 17613 dataset_size: 3111 - config_name: mbpp features: - name: text dtype: string - name: text_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 358 num_examples: 1 download_size: 4822 dataset_size: 358 - config_name: natural_questions_parsed features: - name: document dtype: string - name: document_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: candidates sequence: string - name: candidates_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: long_answer dtype: string - name: long_answer_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 5399 num_examples: 1 download_size: 38881 dataset_size: 5399 - config_name: openbookqa features: - name: question_stem dtype: string - name: question_stem_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: choices sequence: string - name: choices_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: fact1 dtype: string - name: fact1_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 776 num_examples: 1 download_size: 10475 dataset_size: 776 - config_name: quac features: - name: background dtype: string - name: background_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: context dtype: string - name: context_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: questions sequence: string - name: questions_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: orig_answers sequence: string - name: orig_answers_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 11166 num_examples: 1 download_size: 76251 dataset_size: 11166 - config_name: social_i_qa features: - name: context dtype: string - name: context_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: answerA dtype: string - name: answerA_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: answerB dtype: string - name: answerB_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: answerC dtype: string - name: answerC_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 677 num_examples: 1 download_size: 15127 dataset_size: 677 - config_name: squad_v1_pt features: - name: context dtype: string - name: context_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: answers sequence: string - name: answers_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 1587 num_examples: 1 download_size: 17739 dataset_size: 1587 - config_name: trivia_qa features: - name: question dtype: string - name: question_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: search_results_search_context sequence: string - name: search_results_search_context_translated list: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: answer_value dtype: string - name: answer_value_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 1154 num_examples: 1 download_size: 15177 dataset_size: 1154 - config_name: winogrande features: - name: sentence dtype: string - name: sentence_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: option1 dtype: string - name: option1_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string - name: option2 dtype: string - name: option2_translated struct: - name: Helsinki-NLP/opus-mt-tc-big-en-pt dtype: string - name: google_translation dtype: string - name: libre_translation dtype: string splits: - name: test num_bytes: 677 num_examples: 1 download_size: 11112 dataset_size: 677 configs: - config_name: ai2_arc data_files: - split: test path: ai2_arc/test-* - config_name: boolq data_files: - split: test path: boolq/test-* - config_name: gsm8k data_files: - split: test path: gsm8k/test-* - config_name: hellaswag data_files: - split: test path: hellaswag/test-* - config_name: mbpp data_files: - split: test path: mbpp/test-* - config_name: natural_questions_parsed data_files: - split: test path: natural_questions_parsed/test-* - config_name: openbookqa data_files: - split: test path: openbookqa/test-* - config_name: quac data_files: - split: test path: quac/test-* - config_name: social_i_qa data_files: - split: test path: social_i_qa/test-* - config_name: squad_v1_pt data_files: - split: test path: squad_v1_pt/test-* - config_name: trivia_qa data_files: - split: test path: trivia_qa/test-* - config_name: winogrande data_files: - split: test path: winogrande/test-* ---
提供机构:
liaad
原始信息汇总

数据集概述

ai2_arc

  • 特征:
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • choices: 字符串序列
    • choices_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 713字节,1个样本
  • 下载大小: 7660字节
  • 数据集大小: 713字节

boolq

  • 特征:
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • passage: 字符串
    • passage_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 1338字节,1个样本
  • 下载大小: 13729字节
  • 数据集大小: 1338字节

gsm8k

  • 特征:
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • answer: 字符串
    • answer_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 2249字节,1个样本
  • 下载大小: 19759字节
  • 数据集大小: 2249字节

hellaswag

  • 特征:
    • activity_label: 字符串
    • activity_label_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • ctx: 字符串
    • ctx_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • endings: 字符串序列
    • endings_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 3111字节,1个样本
  • 下载大小: 17613字节
  • 数据集大小: 3111字节

mbpp

  • 特征:
    • text: 字符串
    • text_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 358字节,1个样本
  • 下载大小: 4822字节
  • 数据集大小: 358字节

natural_questions_parsed

  • 特征:
    • document: 字符串
    • document_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • candidates: 字符串序列
    • candidates_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • long_answer: 字符串
    • long_answer_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 5399字节,1个样本
  • 下载大小: 38881字节
  • 数据集大小: 5399字节

openbookqa

  • 特征:
    • question_stem: 字符串
    • question_stem_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • choices: 字符串序列
    • choices_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • fact1: 字符串
    • fact1_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 776字节,1个样本
  • 下载大小: 10475字节
  • 数据集大小: 776字节

quac

  • 特征:
    • background: 字符串
    • background_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • context: 字符串
    • context_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • questions: 字符串序列
    • questions_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • orig_answers: 字符串序列
    • orig_answers_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 11166字节,1个样本
  • 下载大小: 76251字节
  • 数据集大小: 11166字节

social_i_qa

  • 特征:
    • context: 字符串
    • context_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • answerA: 字符串
    • answerA_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • answerB: 字符串
    • answerB_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • answerC: 字符串
    • answerC_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 677字节,1个样本
  • 下载大小: 15127字节
  • 数据集大小: 677字节

squad_v1_pt

  • 特征:
    • context: 字符串
    • context_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • answers: 字符串序列
    • answers_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 1587字节,1个样本
  • 下载大小: 17739字节
  • 数据集大小: 1587字节

trivia_qa

  • 特征:
    • question: 字符串
    • question_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • search_results_search_context: 字符串序列
    • search_results_search_context_translated: 列表,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • answer_value: 字符串
    • answer_value_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 1154字节,1个样本
  • 下载大小: 15177字节
  • 数据集大小: 1154字节

winogrande

  • 特征:
    • sentence: 字符串
    • sentence_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • option1: 字符串
    • option1_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
    • option2: 字符串
    • option2_translated: 结构体,包含以下字段:
      • Helsinki-NLP/opus-mt-tc-big-en-pt: 字符串
      • google_translation: 字符串
      • libre_translation: 字符串
  • 分割:
    • test: 677字节,1个样本
  • 下载大小: 11112字节
  • 数据集大小: 677字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作