Debatts-Data
ModelScope community · Updated 2026-05-11 · Indexed 2024-11-02
Download link:
https://modelscope.cn/datasets/amphion/Debatts-Data
Description:
# Debatts-Data: The First Mandarin Rebuttal Speech Dataset for Expressive Text-to-Speech Synthesis
Debatts-Data is the first Mandarin rebuttal speech dataset for expressive text-to-speech synthesis. It is built from a large collection of professional Mandarin speech sourced from video platforms and podcasts on the Internet. Collecting the data in the wild keeps the rebuttal speech real and natural. The dataset also provides transcription, duration, and style-embedding annotations.
The table below provides statistics for the dataset. For dataset samples and more information about the Debatts system, please visit the [Debatts project page](https://amphionspace.github.io/debatts/).
## Dataset Specifications
| Attribute | Value |
|----------------------|---------------|
| Language | ZH |
| Number of Speakers | 2,350 (est.) |
| Duration (hrs) | 111 |
| Type | Text + Speech |
| Sample Rate (kHz) | 16 |
| Recording Method | In the wild |
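
As a quick sanity check against the 16 kHz sample rate listed above, the sketch below reads a WAV header with Python's standard `wave` module. It assumes the clips are plain PCM WAV files, which the card does not state explicitly; the helper names `wav_info` and `matches_spec` are hypothetical and not part of the dataset tooling.

```python
import wave

# 16 kHz, taken from the specification table above.
EXPECTED_SAMPLE_RATE = 16_000


def wav_info(path):
    """Return (sample_rate_hz, duration_seconds) read from a PCM WAV header."""
    with wave.open(str(path), "rb") as wf:
        rate = wf.getframerate()
        seconds = wf.getnframes() / rate
    return rate, seconds


def matches_spec(path):
    """True if the file's sample rate equals the documented 16 kHz."""
    rate, _ = wav_info(path)
    return rate == EXPECTED_SAMPLE_RATE
```

The duration returned here can also be compared with the `duration` value stored in the JSON metadata for the same clip.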
The JSON files in the dataset contain the following keys:
| Key | Description |
|-------------------|----------------------------------------------------------|
| `key` | Unique identifier for each sample in the dataset |
| `text` | Text transcription of the audio |
| `duration` | Duration of the audio clip in seconds |
| `language` | Language of the audio content |
| `wav_path` | Path to the corresponding WAV file |
| `prompt0_wav_path`| Path to the WAV file used as a prompt |
| `style_feature` | Style features associated with the audio sample |
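
The keys above can be checked programmatically. The following is a minimal sketch that assumes each JSON file holds a list of sample objects; the card does not specify the file layout, so adjust `load_records` if the files are structured differently. The function names are hypothetical. Missing keys are reported rather than treated as errors, since some keys (for example `prompt0_wav_path`) may not appear in every split.

```python
import json

# Keys documented in the table above.
EXPECTED_KEYS = {
    "key", "text", "duration", "language",
    "wav_path", "prompt0_wav_path", "style_feature",
}


def load_records(json_path):
    """Load samples. Assumption: the file is a JSON list of dicts."""
    with open(json_path, encoding="utf-8") as f:
        data = json.load(f)
    if not isinstance(data, list):
        raise ValueError("expected a JSON list of sample objects")
    return data


def missing_keys(record):
    """Return the documented keys that are absent from one sample, sorted."""
    return sorted(EXPECTED_KEYS - set(record))


def total_hours(records):
    """Sum the `duration` field (seconds) and convert the total to hours."""
    return sum(float(r["duration"]) for r in records) / 3600.0
```

Running `total_hours` over all loaded records gives a figure that can be compared with the duration reported in the specification table.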
## README 🔥🔥🔥
## Dataset Usage
To use Debatts-Data, download the raw audio files from the "Files and versions" tab of the dataset page. `Debatts-Data.tar.gz` contains the training data, while `Debatts-Data_test.tar.gz` contains the test data together with additional speaker prompt speech.
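
Once an archive has been downloaded, it can be unpacked with Python's standard `tarfile` module. The sketch below is one possible approach, not an official loader: `extract_archive` is a hypothetical helper, the destination directory is up to you, and the internal layout of the archives is not documented in this card. The function skips links and rejects entries that would be written outside the destination directory.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Extract a .tar.gz archive into dest_dir; return paths of extracted files."""
    dest = Path(dest_dir).resolve()
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar.getmembers():
            target = (dest / member.name).resolve()
            # Refuse entries that would land outside dest (path traversal).
            if target != dest and dest not in target.parents:
                raise ValueError(f"unsafe path in archive: {member.name}")
            # Skip symbolic and hard links rather than following them.
            if member.issym() or member.islnk():
                continue
            tar.extract(member, dest)
            if member.isfile():
                extracted.append(target)
    return extracted
```

For example, `extract_archive("Debatts-Data.tar.gz", "data/train")` would unpack the training archive into a local `data/train` directory (a path chosen here only for illustration).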
*Please note that Debatts-Data does not own the copyright to the audio files; the copyright remains with the original owners of the videos or audio. Users are permitted to use this dataset only for non-commercial purposes under the CC BY-NC-4.0 license.*
Provider:
maas
Created:
2024-10-28



