[SAMPLE] Nexdata | Multilingual Children Speech Data| 10,000 Hours | AI Training Data | Speech ...
收藏Databricks2024-05-31 收录
下载链接:
https://marketplace.databricks.com/details/21ed619a-4a97-4dd8-98b3-f68a1a737562/Nexdata_SAMPLE-Nexdata-Multilingual-Children-Speech-Data-10,000-Hours-AI-Training-Data-Speech-
下载链接
链接失效反馈官方服务:
资源简介:
1. Specifications
Format : 16kHz/22.05kHz/44.1kHz, 16bit, uncompressed wav, mono channel
Recording environment : quiet indoor environment, without echo
Recording content (read speech) : children's books; human-machine interaction category; smart home command and control category; numbers; general category
Speaker : children of 5-12 years old,gender balance
Device: microphone,mobile phone
Language : English,Mandarin, Korean, Japanese,German, French, Italian, Russian, Portuguese, Turkish, Dutch, Swedish, Norwegian, Finnish, Hungarian, Thai, Hindi, Indonesian, Vietnamese, Malay, Burmese, Filipino(Tagalog)
Transcription content : text
Application scenarios : speech recognition; voiceprint recognition
Accuracy rate : sentence accuracy rate 95%
2. About Nexdata
Nexdata owns off-the-shelf 200,000 hours of speech recognition data, 800TB of Annotated Imagery Data, about 2 billion pieces of Natural Language Processing (NLP) Data. These ready-to-go Natural Language Processing (NLP) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade
提供机构:
Nexdata



