five

OpenVoiceOS/tts-vc-mcv-scripted-v24.0-fy-nl-miro

收藏
Hugging Face2026-03-01 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/OpenVoiceOS/tts-vc-mcv-scripted-v24.0-fy-nl-miro
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - fy license: cc0-1.0 library_name: datasets tags: - audio - text-to-speech - tts - frisian - mozilla-common-voice datasets: - mozilla-foundation/common_voice_17_0 metrics: - wer --- # OVOS - TTS Voice-Converted Mozilla Common Voice Scripted Speech v24.0 - Frisian Dataset This dataset is a refined, processed version of the **Mozilla Common Voice (MCV) 24.0** Frisian (`fy-NL`) corpus, specifically curated for **Text-to-Speech (TTS)** training. Unlike the raw ASR-focused Common Voice data, this version has been filtered for high-quality audio, normalized text, and formatted for modern TTS architectures like VITS, Glow-TTS, or YourTTS. ## Dataset Description - **Language:** Frisian (fy-NL) - **Source:** [Mozilla Common Voice Scripted Speech v24.0 - Frisian](https://datacollective.mozillafoundation.org/datasets/cmj8u3p4d008tnxxbxw3bkw70) - **Format:** 24,000Hz Mono WAV - **License:** [CC0 1.0 Universal](https://creativecommons.org/publicdomain/zero/1.0/) ## Processing Steps To make this data suitable for TTS, the following pipeline was applied: 1. **Format conversion:** Conversion from mp3 to wav. 2. **Denoising:** Suppression of background noise and correction of distortion. 3. **Trimming:** Stripped silence from all clips. 4. **Normalization:** All audio intensity normalized. 5. **Outlier filtering:** Outliers filtered out in terms of unusual words per minute. 6. **Voice conversion:** Conversion of speaker features in the audio to a donated voice (to OVOS). ## Dataset Structure The dataset follows a standard metadata format: * `file_name`: The audio file path or array. * `transcript`: The normalized transcript for TTS. An alternative metadata file is provided with the original Mozilla Common Voice metadata. ## Licensing Consistent with the [Mozilla Common Voice Legal Terms](https://commonvoice.mozilla.org/en/terms), this dataset is released under the **Creative Commons CC0 1.0 Universal (Public Domain)** license. You may use, modify, and distribute this data without restriction. ## Citation If you use this dataset, please cite the original Common Voice project: ```bibtex @inproceedings{commonvoice:2020, author = {Ardila, R. and others}, title = {Common Voice: A Massively-Multilingual Speech Corpus}, booktitle = {Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)}, pages = {4211--4215}, year = {2020} }
提供机构:
OpenVoiceOS
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作