lunarlist/edited_common_voice
Source: Hugging Face. Updated 2023-07-25; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/lunarlist/edited_common_voice
Resource description:
---
dataset_info:
  features:
  - name: audio_filepath
    dtype: audio
  - name: text
    dtype: string
  - name: duration
    dtype: float64
  splits:
  - name: train
    num_bytes: 6731304269.504
    num_examples: 36296
  - name: test
    num_bytes: 340059709.94
    num_examples: 1911
  download_size: 6985650459
  dataset_size: 7071363979.443999
license: mit
task_categories:
- text-to-speech
language:
- th
---
# Dataset Card for "edited_common_voice"
This is a Thai text-to-speech (TTS) dataset that takes recordings from the [Common Voice dataset](https://commonvoice.mozilla.org/) and modifies the voices so that they no longer sound like the original speakers.
Medium: [Text-To-Speech ภาษาไทยด้วย Tacotron2](https://medium.com/@taetiyateachamatavorn/text-to-speech-%E0%B8%A0%E0%B8%B2%E0%B8%A9%E0%B8%B2%E0%B9%84%E0%B8%97%E0%B8%A2%E0%B8%94%E0%B9%89%E0%B8%A7%E0%B8%A2-tacotron2-986417b44edc)
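
A minimal sketch of loading this dataset with the Hugging Face `datasets` library, assuming the repository id `lunarlist/edited_common_voice` taken from the download link and the column names listed in the metadata. The helper names `load_split` and `preview` are hypothetical and not part of the dataset card; streaming is chosen here only to avoid downloading the full archive (about 7 GB) up front.

```python
REPO_ID = "lunarlist/edited_common_voice"  # repository id from the download link


def load_split(split="test", streaming=True):
    """Return one split of the dataset (hypothetical helper)."""
    # Imported lazily so that defining the helper does not require `datasets`.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split, streaming=streaming)


def preview(n=3, split="test"):
    """Print the duration and transcript of the first n examples."""
    for i, example in enumerate(load_split(split)):
        if i >= n:
            break
        # Columns listed in the card: audio_filepath (audio), text, duration.
        print(example["duration"], example["text"])
```

Calling `preview()` needs network access and an installed `datasets` package. Decoding the `audio_filepath` column may also require an audio backend, depending on the installed `datasets` version.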
Provider:
lunarlist
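Summary of original information
Dataset overview
Dataset name
- Name: edited_common_voice
Dataset features
- Feature list:
  - audio_filepath: audio.
  - text: string.
  - duration: floating-point number (float64).
Dataset splits
- Training set:
  - Number of examples: 36296
  - Data size: 6731304269.504 bytes (about 6.73 GB)
- Test set:
  - Number of examples: 1911
  - Data size: 340059709.94 bytes (about 0.34 GB)
Dataset size
- Download size: 6985650459 bytes (about 6.99 GB)
- Total data size: 7071363979.443999 bytes (about 7.07 GB)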
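
The split figures above can be cross-checked with a few lines of arithmetic. The sketch below copies the numbers from the metadata into a plain dictionary (the names `SPLITS` and `summarize` are illustrative, not part of the dataset) and derives the total example count, the total size, and each split's share of the examples.

```python
# Figures copied from the dataset metadata.
SPLITS = {
    "train": {"num_examples": 36296, "num_bytes": 6731304269.504},
    "test": {"num_examples": 1911, "num_bytes": 340059709.94},
}


def summarize(splits):
    """Return total examples, total bytes, and each split's share of examples."""
    total_examples = sum(s["num_examples"] for s in splits.values())
    total_bytes = sum(s["num_bytes"] for s in splits.values())
    shares = {
        name: s["num_examples"] / total_examples for name, s in splits.items()
    }
    return total_examples, total_bytes, shares


total_examples, total_bytes, shares = summarize(SPLITS)
print(f"examples: {total_examples}")
print(f"size: {total_bytes / 1e9:.2f} GB")
print(f"test share: {shares['test']:.1%}")
```

The two split sizes add up to the reported total data size, and the example counts correspond to roughly a 95/5 train/test split.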
License
- License type: MIT
Task categories
- Task category: text-to-speech
Language
- Language: Thai



