Lithuanian audio dataset for speech recognition 20 hours (4/5)
收藏Datarade2024-07-22 收录
下载链接:
https://datarade.ai/data-products/lithuanian-audio-dataset-for-speech-recognition-20-hours-4-5-stagezero
下载链接
链接失效反馈官方服务:
资源简介:
Specifications: - Each user has a unique ID across the entire dataset. - Maximum four hours of speech per person in the dataset. - Speech is recorded and transcribed on separate tracks. - High-quality transcriptions come with the data in JSON format. - No noise and high-quality recordings with both male and female speakers. - Metadata includes: gender, age, and location. - License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.
提供机构:
StageZero
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个20小时的立陶宛语语音识别音频资源,包含高质量、无噪音的男女说话者录音,并附带JSON格式的精确转录。每个说话者有唯一ID,录音时长上限为4小时,同时提供性别、年龄和地点等元数据,许可允许一次性付费后商业使用,但禁止转售数据。
以上内容由遇见数据集搜集并总结生成



