Cognitive-Lab/GoogleIndicGenBench_flores_xxen_in
收藏Hugging Face2024-06-04 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/Cognitive-Lab/GoogleIndicGenBench_flores_xxen_in
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: gu
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 485428
num_examples: 1012
- name: dev
num_bytes: 485428
num_examples: 1012
download_size: 501492
dataset_size: 970856
- config_name: hi
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 491449
num_examples: 1012
- name: dev
num_bytes: 491449
num_examples: 1012
download_size: 502260
dataset_size: 982898
- config_name: kn
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 529825
num_examples: 1012
- name: dev
num_bytes: 529825
num_examples: 1012
download_size: 524842
dataset_size: 1059650
- config_name: ml
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 566113
num_examples: 1012
- name: dev
num_bytes: 566113
num_examples: 1012
download_size: 551704
dataset_size: 1132226
- config_name: mr
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 510066
num_examples: 1012
- name: dev
num_bytes: 510066
num_examples: 1012
download_size: 519440
dataset_size: 1020132
- config_name: ta
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 576001
num_examples: 1012
- name: dev
num_bytes: 576001
num_examples: 1012
download_size: 537170
dataset_size: 1152002
- config_name: te
features:
- name: target
dtype: string
- name: source
dtype: string
- name: translation_direction
dtype: string
- name: lang
dtype: string
splits:
- name: test
num_bytes: 508060
num_examples: 1012
- name: dev
num_bytes: 508060
num_examples: 1012
download_size: 518266
dataset_size: 1016120
configs:
- config_name: gu
data_files:
- split: test
path: gu/test-*
- split: dev
path: gu/dev-*
- config_name: hi
data_files:
- split: test
path: hi/test-*
- split: dev
path: hi/dev-*
- config_name: kn
data_files:
- split: test
path: kn/test-*
- split: dev
path: kn/dev-*
- config_name: ml
data_files:
- split: test
path: ml/test-*
- split: dev
path: ml/dev-*
- config_name: mr
data_files:
- split: test
path: mr/test-*
- split: dev
path: mr/dev-*
- config_name: ta
data_files:
- split: test
path: ta/test-*
- split: dev
path: ta/dev-*
- config_name: te
data_files:
- split: test
path: te/test-*
- split: dev
path: te/dev-*
---
提供机构:
Cognitive-Lab
原始信息汇总
数据集概述
数据集配置
配置名称: gu
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 485428字节
- dev: 1012个样本, 485428字节
- 下载大小: 501492字节
- 数据集大小: 970856字节
配置名称: hi
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 491449字节
- dev: 1012个样本, 491449字节
- 下载大小: 502260字节
- 数据集大小: 982898字节
配置名称: kn
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 529825字节
- dev: 1012个样本, 529825字节
- 下载大小: 524842字节
- 数据集大小: 1059650字节
配置名称: ml
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 566113字节
- dev: 1012个样本, 566113字节
- 下载大小: 551704字节
- 数据集大小: 1132226字节
配置名称: mr
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 510066字节
- dev: 1012个样本, 510066字节
- 下载大小: 519440字节
- 数据集大小: 1020132字节
配置名称: ta
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 576001字节
- dev: 1012个样本, 576001字节
- 下载大小: 537170字节
- 数据集大小: 1152002字节
配置名称: te
- 特征:
- target: 字符串
- source: 字符串
- translation_direction: 字符串
- lang: 字符串
- 分割:
- test: 1012个样本, 508060字节
- dev: 1012个样本, 508060字节
- 下载大小: 518266字节
- 数据集大小: 1016120字节
数据文件路径
-
配置名称: gu
- test: gu/test-*
- dev: gu/dev-*
-
配置名称: hi
- test: hi/test-*
- dev: hi/dev-*
-
配置名称: kn
- test: kn/test-*
- dev: kn/dev-*
-
配置名称: ml
- test: ml/test-*
- dev: ml/dev-*
-
配置名称: mr
- test: mr/test-*
- dev: mr/dev-*
-
配置名称: ta
- test: ta/test-*
- dev: ta/dev-*
-
配置名称: te
- test: te/test-*
- dev: te/dev-*



