[SAMPLE] Nexdata | Multilingual Read Speech Data | 65,000 Hours | Generative AI Audio Data| ...
收藏Databricks2024-12-25 收录
下载链接:
https://marketplace.databricks.com/details/73b52437-dbcd-49e1-99e5-279d420afd1c/Nexdata_SAMPLE-Nexdata-Multilingual-Read-Speech-Data-65,000-Hours-Generative-AI-Audio-Data-
下载链接
链接失效反馈官方服务:
资源简介:
1. Specifications
Format : 16kHz, 16bit, uncompressed wav, mono channel
Recording environment : quiet indoor environment, without echo
Recording content (read speech) : economy, entertainment, news, oral language, numbers, letters
Speaker : native speaker, gender balance
Device : Android mobile phone, iPhone
Language : 100+ languages
Transcription content : text, time point of speech data, 5 noise symbols, 5 special identifiers
Accuracy rate : 95% (the accuracy rate of noise symbols and other identifiers is not included)
Application scenarios : speech recognition, voiceprint recognition
2. About Nexdata
Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade
提供机构:
Nexdata



