[SAMPLE] Nexdata | Multilingual Conversational Speech Data | 8kHz Telephone| 15,000 Hours | ...
Databricks | Indexed 2024-05-09
Download link:
https://marketplace.databricks.com/details/08100346-25cd-4ce5-a085-3bbf4254b9ce/Nexdata_SAMPLE-Nexdata-Multilingual-Conversational-Speech-Data-8kHz-Telephone-15,000-Hours-
Resource description:
1. Specifications
Format : 8 kHz, 8-bit, u-law/a-law PCM, mono channel;
Environment : quiet indoor environment, without echo;
Recording content : no preset text; dozens of topics are specified, and the speakers converse on those topics while being recorded;
Demographics : speakers are evenly distributed across all age groups, covering children, teenagers, middle-aged and elderly speakers, etc.;
Annotation : transcription text, speaker identification, gender, and noise symbols;
Device : Telephony recording system;
Language : 100+ Languages;
Application scenarios : speech recognition; voiceprint recognition;
Accuracy rate : the word accuracy rate is at least 98%.
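To illustrate the format line above, here is a minimal sketch of expanding 8-bit u-law samples to linear PCM, using the standard G.711 expansion. It assumes the audio is available as headerless raw u-law bytes; the listing does not state the file container, and a-law files would need a different expansion. The function names (`ulaw_to_linear`, `decode_ulaw`, `duration_seconds`) are hypothetical and not part of any Nexdata tooling.

```python
"""Sketch: decode headerless 8-bit G.711 u-law audio to linear PCM samples."""

SAMPLE_RATE_HZ = 8000  # from the listing: 8 kHz, mono, one byte per sample
ULAW_BIAS = 0x84       # bias used by the G.711 u-law companding scheme


def ulaw_to_linear(code: int) -> int:
    """Expand one u-law byte (0-255) to a signed sample in 16-bit range."""
    code = ~code & 0xFF            # u-law bytes are stored bit-inverted
    exponent = (code >> 4) & 0x07  # 3-bit segment number
    mantissa = code & 0x0F         # 4-bit step within the segment
    magnitude = (((mantissa << 3) + ULAW_BIAS) << exponent) - ULAW_BIAS
    return -magnitude if code & 0x80 else magnitude


def decode_ulaw(data: bytes) -> list[int]:
    """Decode a buffer of raw u-law bytes into a list of linear samples."""
    return [ulaw_to_linear(b) for b in data]


def duration_seconds(num_bytes: int, sample_rate: int = SAMPLE_RATE_HZ) -> float:
    """Duration of a mono 8-bit recording: one byte per sample."""
    return num_bytes / sample_rate
```

With this layout, one hour of audio is 8,000 bytes per second times 3,600 seconds, or about 28.8 MB before any container overhead. The decoded samples could then be written out with Python's standard `wave` module if a WAV file is needed.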
2. About Nexdata
Nexdata owns 200,000 hours of off-the-shelf speech recognition data, 800 TB of annotated imagery data, and about 2 billion pieces of natural language processing (NLP) data. These ready-to-go machine learning (ML) datasets support instant delivery and can quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/speechRecognition?source=Datarade
Provider:
Nexdata



