Lithuanian audio dataset for speech recognition 20 hours (1/5)
收藏Datarade2024-07-22 收录
下载链接:
https://datarade.ai/data-products/lithuanian-audio-dataset-for-speech-recognition-20-hours-1-5-stagezero
下载链接
链接失效反馈官方服务:
资源简介:
Specifications: - Each user has a unique ID across the entire dataset. - Maximum four hours of speech per person in the dataset. - Speech is recorded and transcribed on separate tracks. - High-quality transcriptions come with the data in JSON format. - No noise and high-quality recordings with both male and female speakers. - Metadata includes: gender, age, and location. - License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.
数据集规范:
- 全数据集内所有用户均拥有唯一标识符(ID)。
- 数据集中每位用户的语音总时长不超过四小时。
- 语音录音与转写文本分别存储于独立轨道。
- 高质量转写文本将以JSON格式随数据集一同提供。
- 录音无背景杂音,音质优异,且涵盖男性与女性发言者。
- 数据集元数据包含发言者性别、年龄与所在地区。
- 授权条款:一次性付费后即可将本数据集用于商业产品开发,但不得转售该数据集本身。
提供机构:
StageZero



