SAMPLE 8kHz Conversational Speech Data | 15,000 Hours | Audio Data | Speech Recognition Data| ...

Name: SAMPLE 8kHz Conversational Speech Data | 15,000 Hours | Audio Data | Speech Recognition Data| ...
Creator: Nexdata
License: 暂无描述

Databricks2025-05-27 收录

下载链接：

https://marketplace.databricks.com/details/3f0b5384-24a8-4313-9725-80ee79e0bc64/Nexdata_SAMPLE-8kHz-Conversational-Speech-Data-15,000-Hours-Audio-Data-Speech-Recognition-Data-

下载链接

链接失效反馈

官方服务：

资源简介：

1. Specifications Format : 8kHz, 8bit, u-law/a-law pcm, mono channel; Environment : quiet indoor environment, without echo; Recording content : No preset linguistic data，dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed; Demographics : Speakers are evenly distributed across all age groups, covering children, teenagers, middle-aged, elderly, etc. Annotation : annotating for the transcription text, speaker identification, gender and noise symbols; Device : Telephony recording system; Language : 100+ Languages; Application scenarios : speech recognition; voiceprint recognition; Accuracy rate : the word accuracy rate is not less than 98% 2. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade

提供机构：

Nexdata

5,000+

优质数据集

54 个

任务类型

进入经典数据集