[SAMPLE] Nexdata | Multilingual Speech Synthesis Data | 400 Hours | TTS Data | Audio Data | AI ...
Source: Databricks. Indexed: 2024-05-31.
Download link:
https://marketplace.databricks.com/details/9d88b4d3-31f1-4cd2-a43a-bd0795d3b0e6/Nexdata_SAMPLE-Nexdata-Multilingual-Speech-Synthesis-Data-400-Hours-TTS-Data-Audio-Data-AI-
Resource description:
1. Specifications
Format : 44.1 kHz/48 kHz, 16-bit/24-bit, uncompressed WAV, mono channel.
Recording environment : professional recording studio.
Recording content : general narrative sentences, interrogative sentences, etc.
Speaker : native speakers
Annotation Feature : word transcription, part-of-speech, phoneme boundary, four-level accents, four-level prosodic boundary.
Device : Microphone
Language : American English, British English, Japanese, French, Dutch, Mandarin Chinese, Cantonese, Canadian French, Australian English, Italian, New Zealand English, Spanish, Mexican Spanish
Application scenarios : speech synthesis
Accuracy rate :
Word transcription: the sentence accuracy rate is not less than 99%.
Part-of-speech annotation: the sentence accuracy rate is not less than 98%.
Phoneme annotation: the sentence accuracy rate is not less than 98% (errors in voiced and swallowed phonemes are not counted, because their labelling is more subjective).
Accent annotation: the word accuracy rate is not less than 95%.
Prosodic boundary annotation: the sentence accuracy rate is not less than 97%.
Phoneme boundary annotation: the phoneme accuracy rate is not less than 95% (the boundary error is within 5%).
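The format line in the specifications (44.1 kHz or 48 kHz, 16-bit or 24-bit, uncompressed WAV, mono) can be checked against a delivered file's header. The following is a minimal sketch using Python's standard `wave` module. The function name `check_wav_format` and the constant names are hypothetical and not part of any Nexdata tooling, and the file path in the usage note is a placeholder.

```python
import wave

# Values taken from the "Format" line of the specifications above.
ALLOWED_SAMPLE_RATES = {44100, 48000}  # Hz
ALLOWED_SAMPLE_WIDTHS = {2, 3}         # bytes per sample: 16-bit and 24-bit
EXPECTED_CHANNELS = 1                  # mono


def check_wav_format(path):
    """Return a list of header mismatches; an empty list means the header matches.

    Hypothetical helper: it inspects only the WAV header, not the audio content.
    wave.open raises wave.Error for files it cannot parse, such as compressed WAV.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        channels = wav.getnchannels()
    if rate not in ALLOWED_SAMPLE_RATES:
        problems.append(f"unexpected sample rate: {rate} Hz")
    if width not in ALLOWED_SAMPLE_WIDTHS:
        problems.append(f"unexpected bit depth: {width * 8} bit")
    if channels != EXPECTED_CHANNELS:
        problems.append(f"unexpected channel count: {channels}")
    return problems
```

Calling `check_wav_format("sample.wav")` on a conforming file should return an empty list. A header check like this says nothing about recording quality or annotation accuracy; it only confirms the container parameters.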
2. About Nexdata
Nexdata owns 200,000 hours of off-the-shelf speech recognition data, 800 TB of image/video data, and about 2 billion pieces of NLP data. This ready-to-use AI and ML training data supports instant delivery and helps improve the accuracy of AI models quickly. For more details, visit https://www.nexdata.ai/datasets/tts?source=Datarade
Provider:
Nexdata



