AdoCleanCode/hifitts2_audio_edit_v5
收藏Hugging Face2025-10-17 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/AdoCleanCode/hifitts2_audio_edit_v5
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含语音文件的转录信息及相关特征,具体包括说话者ID、FLAC音频文件名、完整转录文本、移除的单词、不包含移除单词的转录文本、完整音素、移除音素、注释音素、DAC令牌和序列信息。数据集被分为8个批次,每个批次包含1000个示例,整个数据集的总大小为444,366,245字节,下载大小为195,442,379字节。
The dataset includes transcription information and related features of audio files, specifically including speaker ID, FLAC audio filename, full transcription text, removed words, transcription text without removed words, full phonemes, removed phonemes, annotated phonemes, DAC tokens, and sequence information. The dataset is divided into 8 batches, each containing 1000 examples, with the total size of the dataset being 444,366,245 bytes and the download size being 195,442,379 bytes.
提供机构:
AdoCleanCode



