projecte-aina/synthetic_dem
Source: Hugging Face. Updated: 2025-09-15. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/projecte-aina/synthetic_dem
Description:
The Synthetic DEM Corpus is the result of a collaboration between El Colegio de México and BSC. It provides a synthetic speech corpus for Mexican Spanish, produced with text-to-speech. The corpus covers words, definitions, examples, and additional examples generated by a large language model. It contains 371 hours of speech in total and can be used for tasks such as automatic speech recognition. The dataset is licensed under CC-BY-4.0.
Provider:
projecte-aina



