komats/Mega-SSum
Source: Hugging Face. Updated 2024-06-13. Indexed 2024-07-06.
Download link:
https://hf-mirror.com/datasets/komats/Mega-SSum
Resource description:
---
license: apache-2.0
dataset_info:
  features:
  - name: id
    dtype: string
  - name: audio
    dtype: audio
  - name: transcription
    dtype: string
  - name: summary
    dtype: string
  - name: summary1
    dtype: string
  - name: summary2
    dtype: string
  - name: summary3
    dtype: string
  splits:
  - name: core
    num_bytes: 17683719490.0
    num_examples: 50000
  - name: validation
    num_bytes: 205421854.0
    num_examples: 1000
  - name: test
    num_bytes: 244384744.0
    num_examples: 624
  download_size: 18266744549
  dataset_size: 18133526088.0
configs:
- config_name: default
  data_files:
  - split: core
    path: data/core-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
---
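Each record in the dataset contains an audio recording, its transcription, and four summary fields (summary, summary1, summary2, summary3). The data is divided into a core split (50,000 examples), a validation split (1,000 examples), and a test split (624 examples), stored under data/core-*, data/validation-*, and data/test-* respectively. The download size is 18,266,744,549 bytes (about 18.3 GB) and the dataset size is 18,133,526,088 bytes (about 18.1 GB).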
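As an illustration of how the splits described above might be read, here is a minimal sketch using the Hugging Face `datasets` library. The repository id and split names come from the metadata; the helper name `load_split` and the choice of streaming mode are assumptions made for this example, not something the dataset card prescribes.

```python
# Split names and example counts, copied from the dataset card metadata.
SPLITS = {"core": 50_000, "validation": 1_000, "test": 624}


def load_split(split: str, streaming: bool = True):
    """Load one split of komats/Mega-SSum from the Hugging Face Hub.

    `load_split` is a hypothetical helper for this sketch. Streaming is an
    assumption chosen to avoid fetching the full ~18 GB download up front.
    """
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {sorted(SPLITS)}")
    # Imported lazily so the helper can be defined without `datasets` installed.
    from datasets import load_dataset

    return load_dataset("komats/Mega-SSum", split=split, streaming=streaming)


# Example usage (needs network access and the `datasets` package):
#   first = next(iter(load_split("validation")))
#   print(first["id"], first["summary"])
```

Note that the card names its largest split `core` rather than `train`, so a call such as `load_split("train")` is rejected by the helper before any network request is made.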
Provider:
komats
Original information summary
Dataset overview
License
- Apache 2.0
Dataset information
Features
- id: string
- audio: audio
- transcription: string
- summary: string
- summary1: string
- summary2: string
- summary3: string
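Because each record carries four summary fields, downstream code may want them as a single list, for instance as multiple references for one transcription. The sketch below is one possible way to do that. The card does not say how `summary` relates to `summary1` through `summary3`, so treating all four as interchangeable references is an assumption, and the sample record is invented for illustration.

```python
from typing import Mapping

# Field names taken from the feature list above.
SUMMARY_FIELDS = ("summary", "summary1", "summary2", "summary3")


def collect_summaries(example: Mapping[str, object]) -> list[str]:
    """Return the non-empty summary strings of one record, in field order."""
    summaries = []
    for field in SUMMARY_FIELDS:
        value = example.get(field)
        # Skip missing, None, or whitespace-only entries.
        if isinstance(value, str) and value.strip():
            summaries.append(value.strip())
    return summaries


# Hypothetical record for illustration only; not taken from the dataset.
example = {
    "id": "demo-0001",
    "transcription": "placeholder transcript",
    "summary": "A.",
    "summary1": "B.",
    "summary2": "",
    "summary3": None,
}
print(collect_summaries(example))  # prints ['A.', 'B.']
```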
Data splits
- core:
  - Bytes: 17683719490.0
  - Examples: 50000
- validation:
  - Bytes: 205421854.0
  - Examples: 1000
- test:
  - Bytes: 244384744.0
  - Examples: 624
Data size
- Download size: 18266744549
- Dataset size: 18133526088.0
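The byte counts above can be cross-checked with a few lines of arithmetic. The sketch below confirms that the three split sizes add up to the reported dataset size and converts the figures to decimal gigabytes. The variable and function names are chosen for this example only.

```python
# Byte counts copied from the split and size listings above.
SPLIT_BYTES = {
    "core": 17_683_719_490,
    "validation": 205_421_854,
    "test": 244_384_744,
}
DATASET_SIZE = 18_133_526_088
DOWNLOAD_SIZE = 18_266_744_549


def to_gb(num_bytes: int) -> float:
    """Convert bytes to decimal gigabytes (1 GB = 10**9 bytes)."""
    return num_bytes / 1e9


total = sum(SPLIT_BYTES.values())
# The per-split sizes should account for the whole dataset size.
assert total == DATASET_SIZE

for name, num_bytes in SPLIT_BYTES.items():
    print(f"{name:<10} {to_gb(num_bytes):7.2f} GB  {num_bytes / total:7.2%} of total")
print(f"download   {to_gb(DOWNLOAD_SIZE):7.2f} GB")
```

The core split accounts for roughly 97.5% of the bytes, so the validation and test splits are comparatively cheap to fetch on their own.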
Configuration
- config_name: default
- data_files:
  - core: data/core-*
  - validation: data/validation-*
  - test: data/test-*
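The `data_files` entries are glob patterns that assign files in the repository to splits. The sketch below mimics that mapping with Python's `fnmatch` module. This is an approximation for illustration: the Hub resolves patterns with its own logic, and the shard file names used here are hypothetical, since the listing does not show the actual file names.

```python
from fnmatch import fnmatchcase
from typing import Optional

# Patterns copied from the default configuration above.
DATA_FILES = {
    "core": "data/core-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for(path: str) -> Optional[str]:
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return split
    return None


# Hypothetical shard names for illustration; real file names may differ.
for path in ("data/core-00000.parquet", "data/test-00000.parquet", "README.md"):
    print(path, "->", split_for(path))
```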