five

Lithuanian audio dataset for speech recognition 20 hours (4/5)

收藏
Datarade2024-07-22 收录
下载链接:
https://datarade.ai/data-products/lithuanian-audio-dataset-for-speech-recognition-20-hours-4-5-stagezero
下载链接
链接失效反馈
官方服务:
资源简介:
Specifications: - Each user has a unique ID across the entire dataset. - Maximum four hours of speech per person in the dataset. - Speech is recorded and transcribed on separate tracks. - High-quality transcriptions come with the data in JSON format. - No noise and high-quality recordings with both male and female speakers. - Metadata includes: gender, age, and location. - License terms: you pay once and you can use the data commercially in your products, but you cannot resell the data.
提供机构:
StageZero
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集是一个20小时的立陶宛语语音识别音频资源,包含高质量、无噪音的男女说话者录音,并附带JSON格式的精确转录。每个说话者有唯一ID,录音时长上限为4小时,同时提供性别、年龄和地点等元数据,许可允许一次性付费后商业使用,但禁止转售数据。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作