
hci-lab-dcug/bibletts-asante-twi-max29secs-total9hrs-sr22050

Source: Hugging Face. Updated 2025-07-04; indexed 2026-01-03.
Download link:
https://hf-mirror.com/datasets/hci-lab-dcug/bibletts-asante-twi-max29secs-total9hrs-sr22050
Description:
---
license: cc-by-sa-4.0
task_categories:
- text-to-speech
language:
- ak
- tw
size_categories:
- 1K<n<10K
---

# BibleTTS Asante Twi Dataset

## Dataset Information

This dataset is derived from the BibleTTS corpus and contains only its Asante Twi speech data. The original BibleTTS is a large, high-fidelity, multilingual, and uniquely African speech corpus.

- **Total Duration**: {total_hours:.2f} hours ({total_hours*60:.1f} minutes)
- **Number of Files**: {file_count:,}
- **Sample Rate**: 22,050 Hz
- **Max File Duration**: {max_duration:.1f} seconds
- **Format**: WAV files with corresponding transcriptions
- **Language**: Asante Twi (tw)

## Dataset Structure

```
bible-tts-dataset/
├── wavs/
│   ├── ABC/
│   │   └── batch0-99/
│   │       ├── ABC_file1.wav
│   │       ├── ABC_file2.wav
│   │       └── ...
│   ├── DEF/
│   │   └── batch0-99/
│   │       └── ...
│   └── ...
└── dataset.jsonl
```

## Usage

Each entry in `dataset.jsonl` contains:

- `filename`: Relative path to the audio file
- `duration`: Duration in seconds
- `transcription`: Text transcription of the audio
- `speaker`: Speaker ID (3-letter code)

## Loading the Dataset

```python
import json

import librosa

# Load dataset metadata: one JSON object per line
with open('dataset.jsonl', 'r', encoding='utf-8') as f:
    dataset = [json.loads(line) for line in f if line.strip()]

# Load the first audio file at the distributed sample rate (22.05 kHz)
audio_path = dataset[0]['filename']
audio, sr = librosa.load(audio_path, sr=22050)
transcription = dataset[0]['transcription']
speaker = dataset[0]['speaker']
```

## Statistics

- **Total Duration**: {total_hours:.2f} hours
- **Average File Duration**: {(total_hours * 3600 / file_count):.1f} seconds
- **Files**: {file_count:,} audio files
- **Multiple speakers**: Selected from train/dev/test splits

## Citation

If you use this dataset, please cite the original BibleTTS paper:

```bibtex
@inproceedings{meyer2022bibletts,
  title={BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus},
  author={Josh Meyer and David Adelani and Edresson Casanova and Alp {\"O}ktem and Daniel Whitenack and Julian Weber and Salomon Kabongo Kabenamualu and Elizabeth Salesky and Iroro Orife and Colin Leong and Perez Ogayo and Chris Chinenye Emezue and Jonathan Mukiibi and Salomey Osei and Apelete Agbolo and Victor Akinode and Bernard Opoku and Olanrewaju Samuel and Jesujoba Alabi and Shamsuddeen Hassan Muhammad},
  booktitle={Interspeech},
  publisher = {{ISCA}},
  year={2022},
  url={https://arxiv.org/pdf/2207.03546.pdf}
}
```

## Dataset Origin

This dataset is a processed subset of the original BibleTTS corpus, containing only Asante Twi speech data. The original audio files were recorded at 48 kHz and have been downsampled to 22.05 kHz for this distribution.

## License

Please refer to the original BibleTTS licensing terms for usage permissions.
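
## Example: Recomputing the Statistics

The per-entry `duration` and `speaker` fields described under Usage are enough to recompute the figures listed under Statistics. The following is a minimal sketch, not the authors' own tooling: the function name `summarize` and the three sample rows are hypothetical and made up for illustration, and the average is computed as total seconds divided by file count, matching the formula shown in the Statistics section.

```python
import json
from collections import defaultdict


def summarize(jsonl_lines):
    """Aggregate duration statistics from dataset.jsonl-style lines."""
    total_seconds = 0.0
    max_seconds = 0.0
    file_count = 0
    per_speaker = defaultdict(float)
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines
        entry = json.loads(line)
        duration = float(entry["duration"])
        total_seconds += duration
        max_seconds = max(max_seconds, duration)
        per_speaker[entry["speaker"]] += duration
        file_count += 1
    return {
        "file_count": file_count,
        "total_hours": total_seconds / 3600,
        "average_seconds": total_seconds / file_count if file_count else 0.0,
        "max_seconds": max_seconds,
        "seconds_per_speaker": dict(per_speaker),
    }


# Hypothetical rows for illustration only; these are not real dataset entries.
sample_rows = [
    '{"filename": "wavs/ABC/batch0-99/ABC_file1.wav", "duration": 10.0, '
    '"transcription": "example one", "speaker": "ABC"}',
    '{"filename": "wavs/ABC/batch0-99/ABC_file2.wav", "duration": 20.0, '
    '"transcription": "example two", "speaker": "ABC"}',
    '{"filename": "wavs/DEF/batch0-99/DEF_file1.wav", "duration": 6.0, '
    '"transcription": "example three", "speaker": "DEF"}',
]

stats = summarize(sample_rows)
print(stats)
```

To run it on the real metadata, pass the open file object instead of the sample list, for example `with open('dataset.jsonl', encoding='utf-8') as f: stats = summarize(f)`.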
Provider:
hci-lab-dcug