liaad/translation_sample_lid
Source: Hugging Face. Updated: 2024-01-15. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/liaad/translation_sample_lid
Resource description:
---
dataset_info:
- config_name: ai2_arc
  features:
  - name: question
    dtype: string
  - name: question_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: choices
    sequence: string
  - name: choices_translated
    list:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 809
    num_examples: 1
  download_size: 11996
  dataset_size: 809
- config_name: boolq
  features:
  - name: question
    dtype: string
  - name: question_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: passage
    dtype: string
  - name: passage_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 1386
    num_examples: 1
  download_size: 17972
  dataset_size: 1386
- config_name: gsm8k
  features:
  - name: question
    dtype: string
  - name: question_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: answer
    dtype: string
  - name: answer_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 2297
    num_examples: 1
  download_size: 24008
  dataset_size: 2297
- config_name: mbpp
  features:
  - name: text
    dtype: string
  - name: text_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 382
    num_examples: 1
  download_size: 6927
  dataset_size: 382
- config_name: natural_questions_parsed
  features:
  - name: document
    dtype: string
  - name: document_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: question
    dtype: string
  - name: question_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: candidates
    sequence: string
  - name: candidates_translated
    list:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: long_answer
    dtype: string
  - name: long_answer_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 5543
    num_examples: 1
  download_size: 47553
  dataset_size: 5543
- config_name: openbookqa
  features:
  - name: question_stem
    dtype: string
  - name: question_stem_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: choices
    sequence: string
  - name: choices_translated
    list:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: fact1
    dtype: string
  - name: fact1_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 920
    num_examples: 1
  download_size: 16942
  dataset_size: 920
- config_name: quac
  features:
  - name: background
    dtype: string
  - name: background_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: context
    dtype: string
  - name: context_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: questions
    sequence: string
  - name: questions_translated
    list:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: orig_answers
    sequence: string
  - name: orig_answers_translated
    list:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 11406
    num_examples: 1
  download_size: 85011
  dataset_size: 11406
- config_name: social_i_qa
  features:
  - name: context
    dtype: string
  - name: context_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: question
    dtype: string
  - name: question_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: answerA
    dtype: string
  - name: answerA_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: answerB
    dtype: string
  - name: answerB_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: answerC
    dtype: string
  - name: answerC_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 797
    num_examples: 1
  download_size: 25730
  dataset_size: 797
- config_name: squad_v1_pt
  features:
  - name: context
    dtype: string
  - name: context_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: question
    dtype: string
  - name: question_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: answers
    sequence: string
  - name: answers_translated
    list:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 1659
    num_examples: 1
  download_size: 24226
  dataset_size: 1659
- config_name: winogrande
  features:
  - name: sentence
    dtype: string
  - name: sentence_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: option1
    dtype: string
  - name: option1_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  - name: option2
    dtype: string
  - name: option2_translated
    struct:
    - name: Helsinki-NLP/opus-mt-tc-big-en-pt
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: google_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
    - name: libre_translation
      struct:
      - name: prediction
        dtype: float64
      - name: text
        dtype: string
  splits:
  - name: train
    num_bytes: 749
    num_examples: 1
  download_size: 17465
  dataset_size: 749
configs:
- config_name: ai2_arc
  data_files:
  - split: train
    path: ai2_arc/train-*
- config_name: boolq
  data_files:
  - split: train
    path: boolq/train-*
- config_name: gsm8k
  data_files:
  - split: train
    path: gsm8k/train-*
- config_name: mbpp
  data_files:
  - split: train
    path: mbpp/train-*
- config_name: natural_questions_parsed
  data_files:
  - split: train
    path: natural_questions_parsed/train-*
- config_name: openbookqa
  data_files:
  - split: train
    path: openbookqa/train-*
- config_name: quac
  data_files:
  - split: train
    path: quac/train-*
- config_name: social_i_qa
  data_files:
  - split: train
    path: social_i_qa/train-*
- config_name: squad_v1_pt
  data_files:
  - split: train
    path: squad_v1_pt/train-*
- config_name: winogrande
  data_files:
  - split: train
    path: winogrande/train-*
---
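The card metadata declares ten configs, each with a single `train` split stored under `<config>/train-*`. As a rough sketch of how that mapping might be used, the snippet below lists the config names and wraps a loader around them. It assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the id shown in the title; `data_glob` and `load_config` are hypothetical helper names, not part of the dataset.

```python
# Config names copied from the card's `configs` section.
CONFIG_NAMES = [
    "ai2_arc", "boolq", "gsm8k", "mbpp", "natural_questions_parsed",
    "openbookqa", "quac", "social_i_qa", "squad_v1_pt", "winogrande",
]

# Repository id taken from the page title.
REPO_ID = "liaad/translation_sample_lid"


def data_glob(config_name: str) -> str:
    """Return the data-file pattern the card declares for a config's train split."""
    if config_name not in CONFIG_NAMES:
        raise ValueError(f"unknown config: {config_name!r}")
    return f"{config_name}/train-*"


def load_config(config_name: str):
    """Load the `train` split of one config (needs network access)."""
    # Imported lazily so the helpers above work without `datasets` installed.
    from datasets import load_dataset

    data_glob(config_name)  # raises ValueError for an unknown config name
    return load_dataset(REPO_ID, config_name, split="train")
```

With network access, `load_config("ai2_arc")` would be expected to return a one-row dataset whose `features` mirror the nested schema in `dataset_info`.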
Provided by:
liaad
Summary of original information
Dataset overview
Dataset configurations
ai2_arc
- Features:
  - question: string
  - question_translated: per-system translation results, each with prediction (float64) and text (string)
  - choices: sequence of strings
  - choices_translated: list of per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 809 bytes, 1 example
- Download size: 11996 bytes
- Dataset size: 809 bytes
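Each `*_translated` struct holds one `prediction`/`text` pair per translation system. A minimal sketch of choosing one candidate from such a struct follows. It assumes that a higher `prediction` indicates a better candidate, which the card does not state; the sample values are invented for illustration, and `best_translation` is a hypothetical helper.

```python
from typing import Dict, Tuple, Union

Candidate = Dict[str, Union[float, str]]  # {"prediction": float, "text": str}


def best_translation(translated: Dict[str, Candidate]) -> Tuple[str, str]:
    """Return (system_name, text) for the candidate with the highest `prediction`.

    ASSUMPTION: higher `prediction` means a better candidate; the card only
    says the field is a float64.
    """
    if not translated:
        raise ValueError("no candidates")
    system, candidate = max(translated.items(), key=lambda kv: kv[1]["prediction"])
    return system, candidate["text"]


# Invented example shaped like `question_translated`; not real dataset content.
sample = {
    "Helsinki-NLP/opus-mt-tc-big-en-pt": {"prediction": 0.91, "text": "texto A"},
    "google_translation": {"prediction": 0.97, "text": "texto B"},
    "libre_translation": {"prediction": 0.85, "text": "texto C"},
}

print(best_translation(sample))  # ('google_translation', 'texto B')
```

If `prediction` turns out to mean something else (for example a label rather than a score), the `max` call is the only line that would need to change.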
boolq
- Features:
  - question: string
  - question_translated: per-system translation results, each with prediction (float64) and text (string)
  - passage: string
  - passage_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 1386 bytes, 1 example
- Download size: 17972 bytes
- Dataset size: 1386 bytes
gsm8k
- Features:
  - question: string
  - question_translated: per-system translation results, each with prediction (float64) and text (string)
  - answer: string
  - answer_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 2297 bytes, 1 example
- Download size: 24008 bytes
- Dataset size: 2297 bytes
mbpp
- Features:
  - text: string
  - text_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 382 bytes, 1 example
- Download size: 6927 bytes
- Dataset size: 382 bytes
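All configs follow one naming rule: every source column `X` is paired with an `X_translated` column. The sketch below checks a record against that rule and against the three system names listed in the card metadata. `check_record` is a hypothetical helper and the record is invented; it also assumes struct columns arrive as Python dicts and list columns as lists of dicts.

```python
# System names as they appear in the card's `dataset_info`.
SYSTEMS = (
    "Helsinki-NLP/opus-mt-tc-big-en-pt",
    "google_translation",
    "libre_translation",
)
SUFFIX = "_translated"


def check_record(record: dict) -> list:
    """Return a list of problems found; an empty list means the record fits."""
    problems = []
    for column, value in record.items():
        if not column.endswith(SUFFIX):
            continue
        source = column[: -len(SUFFIX)]
        if source not in record:
            problems.append(f"{column}: missing source column {source!r}")
        # Struct columns are one dict; list columns are a list of such dicts.
        entries = value if isinstance(value, list) else [value]
        for i, entry in enumerate(entries):
            missing = [s for s in SYSTEMS if s not in entry]
            if missing:
                problems.append(f"{column}[{i}]: missing systems {missing}")
    return problems


# Invented record shaped like the `mbpp` config; not real dataset content.
record = {
    "text": "Write a function to add two numbers.",
    "text_translated": {
        name: {"prediction": 1.0, "text": "..."} for name in SYSTEMS
    },
}
print(check_record(record))  # []
```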
natural_questions_parsed
- Features:
  - document: string
  - document_translated: per-system translation results, each with prediction (float64) and text (string)
  - question: string
  - question_translated: per-system translation results, each with prediction (float64) and text (string)
  - candidates: sequence of strings
  - candidates_translated: list of per-system translation results, each with prediction (float64) and text (string)
  - long_answer: string
  - long_answer_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 5543 bytes, 1 example
- Download size: 47553 bytes
- Dataset size: 5543 bytes
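List-typed columns such as `candidates_translated` carry one per-system struct for each item of the source list. A small sketch of pulling one system's texts out of such a column, in source order, is shown below. `texts_for_system` is a hypothetical helper, the data is invented, and the list-of-dicts layout is an assumption about how the column is decoded.

```python
from typing import Dict, List


def texts_for_system(items: List[Dict[str, dict]], system: str) -> List[str]:
    """Collect the `text` produced by one system for every list item, in order."""
    return [item[system]["text"] for item in items]


# Invented two-item column shaped like `candidates_translated`.
candidates_translated = [
    {
        "Helsinki-NLP/opus-mt-tc-big-en-pt": {"prediction": 0.9, "text": "um"},
        "google_translation": {"prediction": 0.9, "text": "primeiro"},
        "libre_translation": {"prediction": 0.8, "text": "primeira"},
    },
    {
        "Helsinki-NLP/opus-mt-tc-big-en-pt": {"prediction": 0.7, "text": "dois"},
        "google_translation": {"prediction": 0.7, "text": "segundo"},
        "libre_translation": {"prediction": 0.6, "text": "segunda"},
    },
]

print(texts_for_system(candidates_translated, "libre_translation"))
# ['primeira', 'segunda']
```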
openbookqa
- Features:
  - question_stem: string
  - question_stem_translated: per-system translation results, each with prediction (float64) and text (string)
  - choices: sequence of strings
  - choices_translated: list of per-system translation results, each with prediction (float64) and text (string)
  - fact1: string
  - fact1_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 920 bytes, 1 example
- Download size: 16942 bytes
- Dataset size: 920 bytes
quac
- Features:
  - background: string
  - background_translated: per-system translation results, each with prediction (float64) and text (string)
  - context: string
  - context_translated: per-system translation results, each with prediction (float64) and text (string)
  - questions: sequence of strings
  - questions_translated: list of per-system translation results, each with prediction (float64) and text (string)
  - orig_answers: sequence of strings
  - orig_answers_translated: list of per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 11406 bytes, 1 example
- Download size: 85011 bytes
- Dataset size: 11406 bytes
social_i_qa
- Features:
  - context: string
  - context_translated: per-system translation results, each with prediction (float64) and text (string)
  - question: string
  - question_translated: per-system translation results, each with prediction (float64) and text (string)
  - answerA: string
  - answerA_translated: per-system translation results, each with prediction (float64) and text (string)
  - answerB: string
  - answerB_translated: per-system translation results, each with prediction (float64) and text (string)
  - answerC: string
  - answerC_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 797 bytes, 1 example
- Download size: 25730 bytes
- Dataset size: 797 bytes
squad_v1_pt
- Features:
  - context: string
  - context_translated: per-system translation results, each with prediction (float64) and text (string)
  - question: string
  - question_translated: per-system translation results, each with prediction (float64) and text (string)
  - answers: sequence of strings
  - answers_translated: list of per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 1659 bytes, 1 example
- Download size: 24226 bytes
- Dataset size: 1659 bytes
winogrande
- Features:
  - sentence: string
  - sentence_translated: per-system translation results, each with prediction (float64) and text (string)
  - option1: string
  - option1_translated: per-system translation results, each with prediction (float64) and text (string)
  - option2: string
  - option2_translated: per-system translation results, each with prediction (float64) and text (string)
- Splits:
  - train: 749 bytes, 1 example
- Download size: 17465 bytes
- Dataset size: 749 bytes
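The per-config sizes listed above can be totalled with a few lines of arithmetic. The sketch below copies the byte counts from the card and sums them; the variable names are illustrative only.

```python
# (dataset_size, download_size) in bytes, copied from the card.
SIZES = {
    "ai2_arc": (809, 11996),
    "boolq": (1386, 17972),
    "gsm8k": (2297, 24008),
    "mbpp": (382, 6927),
    "natural_questions_parsed": (5543, 47553),
    "openbookqa": (920, 16942),
    "quac": (11406, 85011),
    "social_i_qa": (797, 25730),
    "squad_v1_pt": (1659, 24226),
    "winogrande": (749, 17465),
}

total_dataset = sum(dataset for dataset, _ in SIZES.values())
total_download = sum(download for _, download in SIZES.values())
largest = max(SIZES, key=lambda name: SIZES[name][0])

print(total_dataset)   # 25948
print(total_download)  # 277830
print(largest)         # quac
```

In every config the download size is larger than the dataset size, so the download total is roughly ten times the in-memory total for this one-example-per-config sample.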



