five

Earnings-22

收藏
arXiv2022-03-29 更新2024-06-21 收录
下载链接:
https://github.com/revdotcom/speech-datasets/tree/master/earnings22
下载链接
链接失效反馈
官方服务:
资源简介:
Earnings-22数据集是由Rev.com开发的,旨在为自动语音识别(ASR)系统提供一个包含多种口音的实际音频基准。该数据集包含来自全球27个国家的125个英语财报电话录音,总时长119小时,涵盖7种地区口音。数据集的创建过程涉及从多个来源收集录音,并通过专业的人工转录平台进行高质量转录。Earnings-22数据集主要用于学术和工业研究,以解决ASR系统在处理真实世界口音音频时的性能问题,推动更公平和有效的语音技术发展。

The Earnings-22 dataset was developed by Rev.com, designed to serve as a real-world audio benchmark with diverse accents for automatic speech recognition (ASR) systems. This dataset comprises 125 English earnings call recordings from 27 countries worldwide, with a total duration of 119 hours and coverage of 7 regional accents. The creation of the Earnings-22 dataset involved collecting recordings from multiple sources and performing high-quality transcriptions via professional manual transcription platforms. The Earnings-22 dataset is primarily utilized for academic and industrial research, aiming to resolve the performance issues of ASR systems when processing real-world accented audio and promote the development of more equitable and effective speech technologies.
提供机构:
Rev.com
创建时间:
2022-03-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作