
emre/Open_SLR108_Turkish_10_hours

Source: Hugging Face. Updated: 2022-12-06. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/emre/Open_SLR108_Turkish_10_hours
Resource description:

---
license: cc-by-4.0
tags:
- robust-speech-event
datasets:
- MediaSpeech
---

MediaSpeech

Identifier: SLR108
Summary: French, Arabic, Turkish and Spanish media speech datasets
Category: Speech
License: The dataset is distributed under the Creative Commons Attribution 4.0 International License.

About this resource:

MediaSpeech is a dataset of French, Arabic, Turkish and Spanish media speech built to test the performance of automatic speech recognition (ASR) systems. The dataset contains 10 hours of speech for each language provided. It consists of short speech segments automatically extracted from media videos available on YouTube and manually transcribed, with some pre- and post-processing.

Baseline models and a WAV version of the dataset can be found in the following git repository: https://github.com/NTRLab/MediaSpeech

@misc{mediaspeech2021,
  title={MediaSpeech: Multilanguage ASR Benchmark and Dataset},
  author={Rostislav Kolobov and Olga Okhapkina and Olga Omelchishina and Andrey Platunov and Roman Bedyakin and Vyacheslav Moshkin and Dmitry Menshikov and Nikolay Mikhaylovskiy},
  year={2021},
  eprint={2103.16193},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}
Provided by:
emre
Summary of original information

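Dataset overview

Dataset name

  • MediaSpeech

Identifier

  • SLR108

Languages

  • Languages included: French, Arabic, Turkish, Spanish

Category

  • Category: Speech

License

  • License type: Creative Commons Attribution 4.0 International License (CC BY 4.0)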

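Dataset description

  • Purpose: testing the performance of automatic speech recognition (ASR) systems.
  • Content: 10 hours of speech per language, made up of short speech segments automatically extracted from media videos on YouTube and manually transcribed.
  • Processing: some pre-processing and post-processing was applied.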
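
Since the dataset is meant for testing ASR systems, a scoring step is usually needed to compare a system's output against the manual transcripts. The listing does not name a metric, so the sketch below assumes word error rate (WER), a common choice for ASR evaluation. The function name `word_error_rate` and the whitespace tokenization are illustrative assumptions, not part of the dataset or its baseline code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation stripping is applied here).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words to match an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical Turkish example: one word of four is missing from the hypothesis.
score = word_error_rate("bugün hava çok güzel", "bugün hava güzel")
print(score)
```

In practice, text normalization (case, punctuation, numerals) is typically applied to both the reference and the hypothesis before scoring; which normalization suits this dataset is not stated in the listing.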

Related resources

  • Baseline models and WAV version of the dataset: available on GitHub (https://github.com/NTRLab/MediaSpeech).

Citation information

  • Title: MediaSpeech: Multilanguage ASR Benchmark and Dataset
  • Authors: Rostislav Kolobov, Olga Okhapkina, Olga Omelchishina, Andrey Platunov, Roman Bedyakin, Vyacheslav Moshkin, Dmitry Menshikov, Nikolay Mikhaylovskiy
  • Year: 2021
  • Preprint: arXiv:2103.16193