Nexdata | Multilingual Unsupervised Speech Data | 1 Million Hours | Spontaneous Speech | LLM | Pre-training | Large Language Model (LLM) Data
Source: Datarade, indexed 2025-01-04
Download link:
https://datarade.ai/data-products/nexdata-multilingual-unsupervised-speech-data-1-million-ho-nexdata
Resource description:
1. Specifications
Format: 16 kHz, 16-bit, WAV, mono channel
Content category: Dialogue or monologue in several common domains, such as daily vlogs, travel, podcasts, technology, and beauty
Language: English (USA, UK, Canada, Australia, India, Philippines, etc.), French, German, Japanese, Arabic (MSA, Gulf, Levantine, Egyptian accents, etc.), Mandarin, etc.
Recording condition: Mixed (indoor, public place, entertainment, etc.)

2. About Nexdata
Nexdata owns off-the-shelf PB-level Large Language Model (LLM) data, 1 million hours of speech data, and 800 TB of annotated imagery data. This ready-to-go data supports instant delivery and can quickly improve the accuracy of AI models. For more details, please visit https://www.nexdata.ai/datasets/speechrecog?source=Datarade
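
The audio format given under Specifications (16 kHz sample rate, 16-bit samples, mono, WAV) can be checked on a delivered file before it enters a training pipeline. The following is a minimal sketch using only Python's standard `wave` module. The function name `matches_listed_format` and the constant names are hypothetical and do not come from Nexdata. The sketch assumes uncompressed PCM WAV files, which is the only kind the `wave` module reads.

```python
import wave

# Values taken from the listing's "Format" line; the names are illustrative.
EXPECTED_SAMPLE_RATE_HZ = 16000   # 16 kHz
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16 bit
EXPECTED_CHANNELS = 1             # mono


def matches_listed_format(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono."""
    with wave.open(path, "rb") as wav_file:
        return (
            wav_file.getframerate() == EXPECTED_SAMPLE_RATE_HZ
            and wav_file.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav_file.getnchannels() == EXPECTED_CHANNELS
        )
```

Calling `matches_listed_format("sample.wav")` on each file (the path here is a placeholder) gives a quick way to flag recordings that would need resampling or downmixing first.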
Provider:
Nexdata



