[SAMPLE] Nexdata | Multilingual Read Speech Data | 65,000 Hours | Generative AI Audio Data| ...

Name: [SAMPLE] Nexdata | Multilingual Read Speech Data | 65,000 Hours | Generative AI Audio Data| ...
Creator: Nexdata
License: 暂无描述

Databricks2024-12-25 收录

下载链接：

https://marketplace.databricks.com/details/73b52437-dbcd-49e1-99e5-279d420afd1c/Nexdata_SAMPLE-Nexdata-Multilingual-Read-Speech-Data-65,000-Hours-Generative-AI-Audio-Data-

下载链接

链接失效反馈

官方服务：

资源简介：

1. Specifications Format : 16kHz, 16bit, uncompressed wav, mono channel Recording environment : quiet indoor environment, without echo Recording content (read speech) : economy, entertainment, news, oral language, numbers, letters Speaker : native speaker, gender balance Device : Android mobile phone, iPhone Language : 100+ languages Transcription content : text, time point of speech data, 5 noise symbols, 5 special identifiers Accuracy rate : 95% (the accuracy rate of noise symbols and other identifiers is not included) Application scenarios : speech recognition, voiceprint recognition 2. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go Machine Learning (ML) Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/speechrecog?source=Datarade

提供机构：

Nexdata

5,000+

优质数据集

54 个

任务类型

进入经典数据集