notmax123/SententicDataTTS
Source: Hugging Face · Updated 2026-03-30 · Indexed 2026-04-12
Download link: https://hf-mirror.com/datasets/notmax123/SententicDataTTS
Description:
---
license: other
task_categories:
- text-to-speech
language:
- he
- en
tags:
- audio
- tts
- hebrew
- speech
size_categories:
- 100K<n<1M
---
# SententicDataTTS
A Hebrew and English TTS dataset with male and female speakers. The audio is resampled to 44.1 kHz and time-stretched (slowed).
## Audio Generation
- **slow_44K.7z** — generated using [Chatterbox](https://github.com/resemble-ai/chatterbox)
- **Mamre_generated.7z** — generated using MamreTTS
## Contents
### slow_44K.7z
Audio files resampled to 44.1kHz and time-stretched (slowed), containing:
- `data/` — audio files (WAV, 44.1kHz, slowed)
- CSV metadata files per speaker:
  - `female1_hebrew_slow_filtered.csv`
  - `female1_slow_filtered.csv`
  - `female2_slow_filtered.csv`
  - `female3_slow_filtered.csv`
  - `female4_slow_filtered.csv`
  - `female5_slow_filtered.csv`
  - `male1_hebrew_slow_filtered.csv`
  - `male1_slow_filtered.csv`
  - `male2_slow_filtered.csv`
  - `male3_slow_filtered.csv`
  - `male4_slow_filtered.csv`
  - `male5_slow_filtered.csv`
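
The CSV names listed above appear to encode a speaker ID and, for two files, a `hebrew` tag. A minimal sketch of grouping them by speaker follows. The regular expression and the function name `parse_metadata_name` are illustrative assumptions inferred from the listed names, not something the dataset card defines. The card does not say which language the untagged files contain, so the sketch labels them `untagged` rather than guessing.

```python
import re

# Pattern inferred from the file names in this card; it is an assumption,
# not an official naming specification.
_NAME_RE = re.compile(
    r"^(?P<speaker>(?:female|male)\d+)(?P<hebrew>_hebrew)?_slow_filtered\.csv$"
)


def parse_metadata_name(filename):
    """Return (speaker, tag) for a per-speaker CSV name, or None if it does not match."""
    match = _NAME_RE.match(filename)
    if match is None:
        return None
    # Only the "_hebrew" marker is visible in the names; anything else stays untagged.
    tag = "hebrew" if match.group("hebrew") else "untagged"
    return match.group("speaker"), tag
```

For example, `parse_metadata_name("male1_hebrew_slow_filtered.csv")` yields the speaker `male1` with the tag `hebrew`, while a name that does not fit the pattern returns `None`.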
## Speakers
- 5 female speakers
- 5 male speakers
- Hebrew and English utterances
### Mamre_generated.7z
Audio generated with MamreTTS.
### Phoneme Metadata CSVs
- `voice1_high_quality_phonemes.csv` — phoneme-level metadata for voice1 high quality recordings
- `voice2_improved_phonemes.csv` — phoneme-level metadata for voice2 improved recordings
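
The card does not document the columns of these CSV files, so it may be worth inspecting the header before writing a loader. The sketch below reads only the header and the first few rows. The helper name `peek_csv`, the comma delimiter, and the UTF-8 encoding are assumptions, not facts stated by the dataset.

```python
import csv


def peek_csv(path, n_rows=3, encoding="utf-8"):
    """Return (header, first n_rows data rows) without assuming any column names."""
    with open(path, newline="", encoding=encoding) as f:
        reader = csv.reader(f)  # assumes comma-separated values
        header = next(reader, [])  # empty list if the file has no rows
        # zip stops once range is exhausted, so at most n_rows rows are read
        rows = [row for _, row in zip(range(n_rows), reader)]
    return header, rows
```

Calling `peek_csv("voice1_high_quality_phonemes.csv")` after downloading the file would show which columns are actually present.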
## Usage
```python
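import py7zr  # third-party package: pip install py7zr

# Extract the Chatterbox-generated audio (44.1 kHz, slowed) and its CSV metadata.
with py7zr.SevenZipFile("slow_44K.7z", mode="r") as z:
    z.extractall(path="slow_44K/")

# Extract the MamreTTS-generated audio.
with py7zr.SevenZipFile("Mamre_generated.7z", mode="r") as z:
    z.extractall(path="Mamre_generated/")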
```
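
After extraction, a quick sanity check can confirm that a file really is at 44.1 kHz. The following is a minimal sketch using Python's standard `wave` module. It assumes the WAV files are PCM-encoded, since `wave` cannot read other encodings, and the helper name `wav_summary` is hypothetical.

```python
import wave


def wav_summary(path):
    """Return sample rate, channel count, and duration in seconds of a PCM WAV file."""
    with wave.open(str(path), "rb") as w:
        rate = w.getframerate()
        return {
            "sample_rate": rate,
            "channels": w.getnchannels(),
            "seconds": w.getnframes() / rate,
        }
```

For any WAV file under `slow_44K/data/`, the `sample_rate` field should be 44100 if the file matches the description above.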
Provider:
notmax123



