intelsense/wmt_bengali_combined
收藏Hugging Face2024-08-11 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/intelsense/wmt_bengali_combined
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
features:
- name: lang_1
dtype: string
- name: lang_2
dtype: string
- name: bn
dtype: string
- name: split
dtype: string
splits:
- name: de_en_bn
num_bytes: 88128507
num_examples: 120000
- name: fi_en_bn
num_bytes: 149172111
num_examples: 200000
- name: gu_en_bn
num_bytes: 1208884
num_examples: 11670
- name: kk_en_bn
num_bytes: 19783383
num_examples: 126583
- name: lt_en_bn
num_bytes: 138132887
num_examples: 200000
- name: ru_en_bn
num_bytes: 98586792
num_examples: 200000
- name: cs_en_bn
num_bytes: 136444041
num_examples: 200000
- name: zh_en_bn
num_bytes: 2719108
num_examples: 3981
download_size: 289721646
dataset_size: 634175713
configs:
- config_name: default
data_files:
- split: de_en_bn
path: data/de_en_bn-*
- split: fi_en_bn
path: data/fi_en_bn-*
- split: gu_en_bn
path: data/gu_en_bn-*
- split: kk_en_bn
path: data/kk_en_bn-*
- split: lt_en_bn
path: data/lt_en_bn-*
- split: ru_en_bn
path: data/ru_en_bn-*
- split: cs_en_bn
path: data/cs_en_bn-*
- split: zh_en_bn
path: data/zh_en_bn-*
---
提供机构:
intelsense



