Murple/ksponspeech
Dataset Overview
Dataset Name
- Name: KsponSpeech
Dataset Attributes
- Language: Korean (ko)
- Language creators: crowdsourced
- Multilinguality: monolingual
- Annotation creators: expert-generated
- Size category: 10K<n<100K
- Source data: original
- Task category: automatic-speech-recognition
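Dataset Description
- Summary: Contains 969 hours of general open-domain dialogue speech, recorded by about 2,000 native Korean speakers in a clean environment. The data were built by recording two people conversing freely and manually transcribing the utterances. The transcription provides a dual transcription (orthography and pronunciation) plus disfluency tags for spontaneous speech, such as filler words, repeated words, and word fragments.
- Supported tasks: automatic speech recognition
- Language: Korean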
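Dataset Structure
- Data instances: Each instance contains audio information (path, array, sampling rate), a text transcription, and a unique ID.
- Data fields:
  - Audio: the audio file path, the decoded audio array, and the sampling rate.
  - Text: the transcription of the audio file.
  - ID: the unique identifier of the data sample.
- Data splits: a training set, a validation set, and two evaluation sets (eval.clean and eval.other).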
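The instance layout described above can be sketched in plain Python. This is an illustrative sketch only: the key names (`id`, `audio`, `text`, `path`, `array`, `sampling_rate`), the sample ID, the file name, and the 16000 Hz value are assumptions chosen for the example and are not confirmed by this card.

```python
from typing import Any

# Assumed key names; the card only states that the audio entry carries
# a file path, a decoded array, and a sampling rate.
AUDIO_KEYS = {"path", "array", "sampling_rate"}


def is_valid_instance(instance: dict[str, Any]) -> bool:
    """Return True if the instance has an ID, a transcription, and an audio entry."""
    audio = instance.get("audio")
    return (
        isinstance(instance.get("id"), str)
        and isinstance(instance.get("text"), str)
        and isinstance(audio, dict)
        and AUDIO_KEYS.issubset(audio)
    )


def duration_seconds(instance: dict[str, Any]) -> float:
    """Length of the decoded audio in seconds (samples / sampling rate)."""
    audio = instance["audio"]
    return len(audio["array"]) / audio["sampling_rate"]


# Hypothetical instance with placeholder values, for illustration only.
example = {
    "id": "sample-0001",
    "audio": {
        "path": "sample-0001.wav",
        "array": [0.0, 0.1, -0.1, 0.0],
        "sampling_rate": 16000,  # placeholder value, not stated in this card
    },
    "text": "placeholder transcription",
}

print(is_valid_instance(example))
print(duration_seconds(example))
```

A check like this is mainly useful as a guard before feeding instances to a feature extractor, since a missing sampling rate or an empty array tends to fail much later and less clearly.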
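Dataset Creation
- Source data: built by recording two people conversing freely and manually transcribing the utterances.
- Annotations: a dual transcription (orthography and pronunciation), plus disfluency tags for spontaneous speech.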
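Because each dually transcribed span carries both an orthographic and a pronunciation form, a training pipeline usually has to pick one of them. The sketch below shows one way to do that, assuming such spans are written as `(orthographic)/(phonetic)` in the raw transcripts. That notation is an assumption and is not stated in this card, so it should be checked against the corpus documentation; the sample sentence and the function name `select_transcription` are hypothetical. Disfluency tags are left untouched because the card does not specify their markers.

```python
import re

# Assumed notation for a dual-transcribed span: "(orthographic)/(phonetic)".
DUAL_SPAN = re.compile(r"\(([^()]*)\)/\(([^()]*)\)")


def select_transcription(text: str, use_phonetic: bool = False) -> str:
    """Keep one side of every dual-transcribed span and drop the other."""
    group = 2 if use_phonetic else 1
    return DUAL_SPAN.sub(lambda match: match.group(group), text)


# Hypothetical transcript line, for illustration only.
sample = "(7시)/(일곱 시)에 만나요"

print(select_transcription(sample))                     # orthographic form
print(select_transcription(sample, use_phonetic=True))  # pronunciation form
```

Which side to keep is a design choice: the orthographic form gives more readable output, while the pronunciation form stays closer to what was actually spoken.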
Citation Information
@Article{app10196936,
  AUTHOR = {Bang, Jeong-Uk and Yun, Seung and Kim, Seung-Hi and Choi, Mu-Yeol and Lee, Min-Kyu and Kim, Yeo-Jeong and Kim, Dong-Hyun and Park, Jun and Lee, Young-Jik and Kim, Sang-Hun},
  TITLE = {KsponSpeech: Korean Spontaneous Speech Corpus for Automatic Speech Recognition},
  JOURNAL = {Applied Sciences},
  VOLUME = {10},
  YEAR = {2020},
  NUMBER = {19},
  ARTICLE-NUMBER = {6936},
  URL = {https://www.mdpi.com/2076-3417/10/19/6936},
  ISSN = {2076-3417},
  ABSTRACT = {This paper introduces a large-scale spontaneous speech corpus of Korean, named KsponSpeech. This corpus contains 969 h of general open-domain dialog utterances, spoken by about 2000 native Korean speakers in a clean environment. All data were constructed by recording the dialogue of two people freely conversing on a variety of topics and manually transcribing the utterances. The transcription provides a dual transcription consisting of orthography and pronunciation, and disfluency tags for spontaneity of speech, such as filler words, repeated words, and word fragments. This paper also presents the baseline performance of an end-to-end speech recognition model trained with KsponSpeech. In addition, we investigated the performance of standard end-to-end architectures and the number of sub-word units suitable for Korean. We investigated issues that should be considered in spontaneous speech recognition in Korean. KsponSpeech is publicly available on an open data hub site of the Korea government.},
  DOI = {10.3390/app10196936}
}




