five

ClovaCall

收藏
arXiv2020-05-17 更新2024-06-21 收录
下载链接:
https://github.com/ClovaAI/ClovaCall
下载链接
链接失效反馈
官方服务:
资源简介:
ClovaCall是一个大规模的韩语目标导向对话语音语料库,专门用于接触中心的自动语音识别服务。该数据集包含约60,746对简短句子和相应的口语表达,主要集中在餐厅预订领域。数据集的创建过程涉及通过众包平台Crowdworks收集和筛选句子,并通过电话记录口语表达。ClovaCall数据集适用于多种基于接触中心的预订服务,因为其内容涵盖了预订服务中常用的词汇和表达。该数据集的应用旨在提高目标导向对话场景中自动语音识别的准确性。

ClovaCall is a large-scale Korean goal-oriented conversational speech corpus specifically designed for automatic speech recognition services in contact centers. This dataset contains approximately 60,746 pairs of short sentences and corresponding spoken utterances, mainly focusing on the restaurant reservation domain. The creation of this dataset involved collecting and filtering sentences via the crowdsourcing platform Crowdworks, as well as recording spoken utterances through telephone calls. The ClovaCall dataset is applicable to a variety of contact center-based reservation services, as its content covers the commonly used vocabulary and expressions in reservation scenarios. The application of this dataset aims to improve the accuracy of automatic speech recognition in goal-oriented conversational contexts.
提供机构:
Clova AI, NAVER Corp.
创建时间:
2020-04-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作