emre/Open_SLR108_Turkish_10_hours
Source: Hugging Face. Updated 2022-12-06; indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/emre/Open_SLR108_Turkish_10_hours
Resource description:
---
license: cc-by-4.0
tags:
- robust-speech-event
datasets:
- MediaSpeech
---
MediaSpeech
Identifier: SLR108
Summary: French, Arabic, Turkish and Spanish media speech datasets
Category: Speech
License: The dataset is distributed under the Creative Commons Attribution 4.0 International License.
About this resource:
MediaSpeech is a dataset of French, Arabic, Turkish and Spanish media speech built to test the performance of automatic speech recognition (ASR) systems. The dataset contains 10 hours of speech for each language.
The dataset consists of short speech segments automatically extracted from media videos available on YouTube and manually transcribed, with some pre- and post-processing.
Baseline models and a WAV version of the dataset can be found in the following git repository: https://github.com/NTRLab/MediaSpeech
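
Since the dataset is meant for testing ASR systems, a common way to score a system against the manual transcriptions is word error rate (WER). The card itself does not name a metric, so WER is an assumption here, and the function name `word_error_rate` and the sample sentences are purely illustrative. A minimal sketch using only the standard library, with whitespace tokenization and no text normalization:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative helper, not part of MediaSpeech or its repository.
    Tokenization is a plain whitespace split (an assumption); real
    evaluations usually normalize case and punctuation first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical transcript pair: one of four reference words is dropped.
print(word_error_rate("bu bir test cümlesi", "bu test cümlesi"))
```

Because the denominator is the reference length, the value can exceed 1.0 when the hypothesis contains many inserted words.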
@misc{mediaspeech2021,
title={MediaSpeech: Multilanguage ASR Benchmark and Dataset},
author={Rostislav Kolobov and Olga Okhapkina and Olga Omelchishina and Andrey Platunov and Roman Bedyakin and Vyacheslav Moshkin and Dmitry Menshikov and Nikolay Mikhaylovskiy},
year={2021},
eprint={2103.16193},
archivePrefix={arXiv},
primaryClass={eess.AS}
}
Provided by:
emre
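Summary of original information
Dataset overview
Dataset name
- MediaSpeech
Identifier
- SLR108
Languages
- Languages included: French, Arabic, Turkish, Spanish
Category
- Category: Speech
License
- License type: Creative Commons Attribution 4.0 International License
Dataset description
- Purpose: testing the performance of automatic speech recognition (ASR) systems.
- Content: 10 hours of speech per language, consisting of short speech segments automatically extracted from media videos on YouTube and manually transcribed.
- Processing: includes pre- and post-processing.
Related resources
- Baseline models and WAV version of the dataset: available on GitHub.
Citation
- Title: MediaSpeech: Multilanguage ASR Benchmark and Dataset
- Authors: Rostislav Kolobov, Olga Okhapkina, Olga Omelchishina, Andrey Platunov, Roman Bedyakin, Vyacheslav Moshkin, Dmitry Menshikov, Nikolay Mikhaylovskiy
- Year: 2021
- Preprint: arXiv:2103.16193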



