Cognitive-Lab/GoogleIndicGenBench_crosssum_in
收藏Hugging Face2024-06-03 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/Cognitive-Lab/GoogleIndicGenBench_crosssum_in
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: gu
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 500794
num_examples: 100
- name: test
num_bytes: 571119
num_examples: 104
- name: dev
num_bytes: 425362
num_examples: 90
download_size: 926882
dataset_size: 1497275
- config_name: hi
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 461223
num_examples: 100
- name: test
num_bytes: 2059328
num_examples: 481
- name: dev
num_bytes: 1844052
num_examples: 463
download_size: 2627436
dataset_size: 4364603
- config_name: kn
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 505354
num_examples: 100
- name: test
num_bytes: 2337780
num_examples: 500
- name: dev
num_bytes: 422265
num_examples: 100
download_size: 1995280
dataset_size: 3265399
- config_name: ml
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 510329
num_examples: 100
- name: test
num_bytes: 2352436
num_examples: 500
- name: dev
num_bytes: 424071
num_examples: 100
download_size: 2001476
dataset_size: 3286836
- config_name: mr
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 537636
num_examples: 100
- name: test
num_bytes: 638745
num_examples: 118
- name: dev
num_bytes: 542942
num_examples: 118
download_size: 1060889
dataset_size: 1719323
- config_name: ta
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 457100
num_examples: 100
- name: test
num_bytes: 1489893
num_examples: 315
- name: dev
num_bytes: 1322132
num_examples: 315
download_size: 1945463
dataset_size: 3269125
- config_name: te
features:
- name: text
dtype: string
- name: lang
dtype: string
- name: summary
dtype: string
- name: source_url
dtype: string
- name: target_url
dtype: string
splits:
- name: train
num_bytes: 502504
num_examples: 100
- name: test
num_bytes: 1132670
num_examples: 212
- name: dev
num_bytes: 942937
num_examples: 212
download_size: 1560674
dataset_size: 2578111
configs:
- config_name: gu
data_files:
- split: train
path: gu/train-*
- split: test
path: gu/test-*
- split: dev
path: gu/dev-*
- config_name: hi
data_files:
- split: train
path: hi/train-*
- split: test
path: hi/test-*
- split: dev
path: hi/dev-*
- config_name: kn
data_files:
- split: train
path: kn/train-*
- split: test
path: kn/test-*
- split: dev
path: kn/dev-*
- config_name: ml
data_files:
- split: train
path: ml/train-*
- split: test
path: ml/test-*
- split: dev
path: ml/dev-*
- config_name: mr
data_files:
- split: train
path: mr/train-*
- split: test
path: mr/test-*
- split: dev
path: mr/dev-*
- config_name: ta
data_files:
- split: train
path: ta/train-*
- split: test
path: ta/test-*
- split: dev
path: ta/dev-*
- config_name: te
data_files:
- split: train
path: te/train-*
- split: test
path: te/test-*
- split: dev
path: te/dev-*
---
提供机构:
Cognitive-Lab
原始信息汇总
数据集概述
配置名称:gu
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,500794字节
- test: 104个样本,571119字节
- dev: 90个样本,425362字节
- 下载大小: 926882字节
- 数据集大小: 1497275字节
配置名称:hi
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,461223字节
- test: 481个样本,2059328字节
- dev: 463个样本,1844052字节
- 下载大小: 2627436字节
- 数据集大小: 4364603字节
配置名称:kn
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,505354字节
- test: 500个样本,2337780字节
- dev: 100个样本,422265字节
- 下载大小: 1995280字节
- 数据集大小: 3265399字节
配置名称:ml
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,510329字节
- test: 500个样本,2352436字节
- dev: 100个样本,424071字节
- 下载大小: 2001476字节
- 数据集大小: 3286836字节
配置名称:mr
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,537636字节
- test: 118个样本,638745字节
- dev: 118个样本,542942字节
- 下载大小: 1060889字节
- 数据集大小: 1719323字节
配置名称:ta
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,457100字节
- test: 315个样本,1489893字节
- dev: 315个样本,1322132字节
- 下载大小: 1945463字节
- 数据集大小: 3269125字节
配置名称:te
- 特征:
- text: 字符串类型
- lang: 字符串类型
- summary: 字符串类型
- source_url: 字符串类型
- target_url: 字符串类型
- 分割:
- train: 100个样本,502504字节
- test: 212个样本,1132670字节
- dev: 212个样本,942937字节
- 下载大小: 1560674字节
- 数据集大小: 2578111字节



