[SAMPLE] Nexdata | Multilingual Read Speech Data | 65,000 Hours | Generative AI Audio Data| ...
收藏Databricks2024-05-31 收录
下载链接:
https://marketplace.databricks.com/details/2cb1fde5-8725-45f4-a851-6d327341cf0c/Nexdata_SAMPLE-Nexdata-Multilingual-Read-Speech-Data-65,000-Hours-Generative-AI-Audio-Data-
下载链接
链接失效反馈官方服务:
资源简介:
1. Specifications
Format : 16kHz, 16bit, uncompressed wav, mono channel
Recording environment : quiet indoor environment, without echo
Recording content (read speech) : economy, entertainment, news, oral language, numbers, letters
Speaker : native speaker, gender balance
Device : Android mobile phone, iPhone
Language : 100+ languages
Transcription content : text, time point of speech data, 5 noise symbols, 5 special identifiers
Accuracy rate : 95% (the accuracy rate of noise symbols and other identifiers is not included)
Application scenarios : speech recognition, voiceprint recognition
2. About Nexdata
Nexdata owns off-the-shelf 200,000 hours of speech recognition data, 800TB of Annotated Imagery Data, about 2 billion pieces of Natural Language Processing (NLP) Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade
提供机构:
Nexdata



