EricPeter/msc-baseline-mt
收藏Hugging Face2024-06-09 更新2024-07-06 收录
下载链接:
https://hf-mirror.com/datasets/EricPeter/msc-baseline-mt
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: translation
struct:
- name: translation
struct:
- name: en
dtype: string
- name: lg
dtype: string
splits:
- name: train
num_bytes: 6326565
num_examples: 56734
- name: validation
num_bytes: 1355786
num_examples: 12157
- name: test
num_bytes: 1348588
num_examples: 12159
download_size: 5959022
dataset_size: 9030939
---
# Dataset Card for "msc-baseline-mt"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
This dataset is primarily used for machine translation tasks, containing translation data from English to another language (lg). The dataset is divided into train, validation, and test sets, with 56734, 12157, and 12159 examples respectively.
提供机构:
EricPeter



