formospeech/hakkatv_hanzawa

Name: formospeech/hakkatv_hanzawa
Creator: formospeech
Published: 2024-06-20 09:12:17
License: 暂无描述

Hugging Face2024-06-20 更新2024-06-29 收录

下载链接：

https://hf-mirror.com/datasets/formospeech/hakkatv_hanzawa

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集名为sixian，包含音频数据及其相关文本信息。每个样本包含以下字段：id（唯一标识符）、audio（音频文件）、duration（音频时长）、text（文本内容）、ipa（国际音标）和dialect（方言信息）。数据集包含一个训练集，共有7345个样本，总大小为317006262.285字节。

The dataset named sixian contains audio data and related text information. Each sample includes the following fields: id (unique identifier), audio (audio file), duration (audio duration), text (text content), ipa (International Phonetic Alphabet), and dialect (dialect information). The dataset includes a training set with 7345 samples, totaling 317006262.285 bytes.

提供机构：

formospeech

原始信息汇总

数据集概述

数据集信息

配置名称: sixian
特征:
- id: 字符串类型
- audio: 音频类型
- duration: 浮点数类型
- text: 字符串类型
- ipa: 字符串类型
- dialect: 字符串类型

数据分割

训练集:
- 名称: train
- 字节数: 317006262.285
- 样本数: 7345

数据集大小

下载大小: 312520005
数据集大小: 317006262.285

配置

配置名称: sixian
数据文件:
- 分割: train
- 路径: sixian/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集