
grushaaaaa/tts-indian

Source: Hugging Face. Updated 2026-04-08. Indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/grushaaaaa/tts-indian
Resource description:
---
license: cc-by-4.0
language:
- bn
- ks
- gu
- kn
- ta
- te
tags:
- tts
- speech
- indian-languages
- audio
pretty_name: TTS Indian Languages
---

# TTS Indian Languages Dataset

Speech dataset for Text-to-Speech covering 6 Indian languages, collected and processed from YouTube.

## Languages & Speakers

| Speaker | Language | Gender |
|---------|----------|--------|
| monihara_bengali | Bengali | Male |
| munir_kashmiri | Kashmiri | Male |
| nandini_gujarati | Gujarati | Female |
| sansri_kannada | Kannada | Female |
| tamil_pokkisham | Tamil | Male |
| teluguM | Telugu | Male |

## Pipeline

Audio was collected and processed through these stages:

1. **YouTube Download**: yt-dlp from curated speaker playlists
2. **Vocal Separation**: Demucs (htdemucs_ft) to remove background music/noise
3. **Audio Enhancement**: Resemble-Enhance + DeepFilterNet for clean speech
4. **Diarization**: Silero VAD + WavLM clustering to segment speaker turns
5. **Chunking**: 3–15 second clips
6. **Quality Filtering**: DNSMOS scoring to keep only clean audio
7. **Transcription**: ASR transcription saved alongside each clip

## Format

Each row contains:

- `audio`: 16 kHz mono WAV
- `transcription`: verbatim text
- `speaker`: speaker ID
- `language`: language name
- `gender`: male/female
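
## Example: chunking sketch

The dataset card does not say how the chunking stage produces its 3–15 second clips, so the following is only a minimal sketch of one plausible approach, not the author's implementation. It assumes the diarization stage yields speech segments as `(start_sec, end_sec)` pairs, merges neighbours greedily, and discards anything outside the stated bounds. The function name `chunk_segments` and the `max_gap` parameter are hypothetical.

```python
from typing import List, Optional, Tuple

Segment = Tuple[float, float]  # (start_sec, end_sec)

MIN_LEN = 3.0   # seconds, lower clip bound stated in the dataset card
MAX_LEN = 15.0  # seconds, upper clip bound stated in the dataset card


def chunk_segments(segments: List[Segment], max_gap: float = 0.5) -> List[Segment]:
    """Greedily merge adjacent speech segments into clips of MIN_LEN..MAX_LEN seconds.

    max_gap is a hypothetical parameter: the longest silence (in seconds)
    allowed inside a single clip. Clips that end up shorter than MIN_LEN or
    longer than MAX_LEN are dropped rather than padded or split.
    """
    clips: List[Segment] = []
    current: Optional[Segment] = None

    for start, end in sorted(segments):
        if current is None:
            current = (start, end)
            continue
        cur_start, cur_end = current
        # Merge only if the clip stays within MAX_LEN and the pause is short.
        fits = (end - cur_start) <= MAX_LEN and (start - cur_end) <= max_gap
        if fits:
            current = (cur_start, end)
        else:
            clips.append(current)
            current = (start, end)

    if current is not None:
        clips.append(current)

    # Keep only clips whose duration falls inside the stated bounds.
    return [(s, e) for s, e in clips if MIN_LEN <= e - s <= MAX_LEN]


if __name__ == "__main__":
    vad_segments = [(0.0, 2.0), (2.2, 4.0), (4.3, 9.0), (20.0, 21.0), (30.0, 50.0)]
    print(chunk_segments(vad_segments))
```

In this sketch the first three segments merge into one 9-second clip, while the isolated 1-second segment and the 20-second segment are discarded. A real pipeline might split long segments at internal pauses instead of dropping them.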
Provider:
grushaaaaa