five

Afaan Oromo Text to Speech Synthesis dataset

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/mpy85ns82z
下载链接
链接失效反馈
官方服务:
资源简介:
Afaan Oromo Text to Speech Synthesis dataset is a public domain speech dataset consisting of 8,076 short audio clips of a single male speaker reading sentences collected from legitimate sources such as News Media sources, Non-fiction books, and Afaan Oromo Holy bible. A transcription and its normalized text are provided for each clip. After two weeks of the audio recording process, a total of 17 hours of recorded speech data that corresponded to a total of 8076 recorded .wav files was created. File Format Metadata is provided in metadata.csv. This file consists of one record per line, delimited by the pipe character. The fields are: ID: this is the name of the corresponding .wav file Transcription: words spoken by the reader (UTF-8) Normalized Transcription: transcription with numbers, ordinals, and monetary units expanded into full words (UTF-8). Each audio file is a single-channel 16-bit PCM WAV with a sample rate of 22050 Hz.
创建时间:
2023-10-13
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作