hci-lab-dcug/bibletts-asante-twi-max29secs-total9hrs-sr22050

Name: hci-lab-dcug/bibletts-asante-twi-max29secs-total9hrs-sr22050
Creator: hci-lab-dcug
Published: 2025-07-04 15:25:08
License: 暂无描述

Hugging Face2025-07-04 更新2026-01-03 收录

下载链接：

https://hf-mirror.com/datasets/hci-lab-dcug/bibletts-asante-twi-max29secs-total9hrs-sr22050

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: cc-by-sa-4.0 task_categories: - text-to-speech language: - ak - tw size_categories: - 1K<n<10K --- # BibleTTS Asante Twi Dataset ## Dataset Information This dataset is derived from the BibleTTS corpus, specifically focusing on Asante Twi speech data. The original BibleTTS is a large, high-fidelity, multilingual, and uniquely African speech corpus. - **Total Duration**: {total_hours:.2f} hours ({total_hours*60:.1f} minutes) - **Number of Files**: {file_count:,} - **Sample Rate**: {sample_rate:,} Hz - **Max File Duration**: {max_duration:.1f} seconds - **Format**: WAV files with corresponding transcriptions - **Language**: Asante Twi (tw) ## Dataset Structure ``` bible-tts-dataset/ ├── wavs/ │ ├── ABC/ │ │ └── batch0-99/ │ │ ├── ABC_file1.wav │ │ ├── ABC_file2.wav │ │ └── ... │ ├── DEF/ │ │ └── batch0-99/ │ │ └── ... │ └── ... └── dataset.jsonl ``` ## Usage Each entry in `dataset.jsonl` contains: - `filename`: Relative path to the audio file - `duration`: Duration in seconds - `transcription`: Text transcription of the audio - `speaker`: Speaker ID (3-letter code) ## Loading the Dataset ```python import json import librosa # Load dataset metadata with open('dataset.jsonl', 'r', encoding='utf-8') as f: dataset = [json.loads(line) for line in f] # Load an audio file audio_path = dataset[0]['filename'] audio, sr = librosa.load(audio_path, sr={sample_rate}) transcription = dataset[0]['transcription'] speaker = dataset[0]['speaker'] ``` ## Statistics - **Total Duration**: {total_hours:.2f} hours - **Average File Duration**: {(total_hours * 3600 / file_count):.1f} seconds - **Files**: {file_count:,} audio files - **Multiple speakers**: Selected from train/dev/test splits ## Citation If you use this dataset, please cite the original BibleTTS paper: ```bibtex @inproceedings{{meyer2022bibletts, title={{BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus}}, author={{Josh Meyer and David Adelani and Edresson Casanova and Alp {{\\"O}}ktem and Daniel Whitenack and Julian Weber and Salomon Kabongo Kabenamualu and Elizabeth Salesky and Iroro Orife and Colin Leong and Perez Ogayo and Chris Chinenye Emezue and Jonathan Mukiibi and Salomey Osei and Apelete Agbolo and Victor Akinode and Bernard Opoku and Olanrewaju Samuel and Jesujoba Alabi and Shamsuddeen Hassan Muhammad}}, booktitle={{Interspeech}}, publisher = {{{{ISCA}}}}, year={{2022}}, url={{https://arxiv.org/pdf/2207.03546.pdf}} }} ``` ## Dataset Origin This dataset is a processed subset of the original BibleTTS corpus, specifically containing Asante Twi speech data. The original audio files were recorded at 48kHz and have been downsampled to 22.05kHz for this distribution. ## License Please refer to the original BibleTTS licensing terms for usage permissions.

提供机构：

hci-lab-dcug

5,000+

优质数据集

54 个

任务类型

进入经典数据集