syvai/combined-neucodec
收藏Hugging Face2026-01-25 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/syvai/combined-neucodec
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为Combined Neucodec Dataset,是一个将多个丹麦语文本到语音(TTS)的neucodec数据集合并而成的统一数据集。它整合了4个不同的子数据集,总计包含2,898,535条数据。数据集包含三个主要列:text(文本)、tokens(标记)和phonemes(音素)。其中phonemes列是使用espeak工具生成的,保留了标点符号并包含重音标记。
This dataset is named Combined Neucodec Dataset and combines multiple Danish text-to-speech (TTS) neucodec datasets into a single unified dataset. It integrates 4 different sub-datasets with a total of 2,898,535 entries. The dataset contains three main columns: text, tokens, and phonemes. The phonemes column was generated using the espeak tool with preserved punctuation and stress marks.
提供机构:
syvai



