CSLU: S4X Release 1.2

Source: Mendeley Data. Updated 2024-01-31; indexed 2024-06-28.
Download link:
https://catalog.ldc.upenn.edu/LDC2009S03
Resource description:

Introduction

CSLU: S4X Release 1.2, Linguistic Data Consortium (LDC) catalog number LDC2009S03 and ISBN 1-58563-523-5, was created by the Center for Spoken Language Understanding, Oregon Health and Science University (CSLU).

The corpus consists of 36 speakers (22 male, 14 female) uttering 11 specified words. The speakers repeated the following words six times on each of four channels: startrek, supernova, tektronix, generation, nebula, processing, singularity, 71523, abracadabra, sungeeta and computer. The four channels used were office phone, home phone, carbon microphone telephone and speaker phone.

Each speech file has a corresponding time-aligned phoneme-level transcription (produced by automatic forced alignment) and an automatically generated word-level transcription. Humans reviewed each utterance in two passes and classified it as good, bad, noisy or different. The results of this verification process are included in the /docs directory.

Data

The data was recorded with the CSLU T1 digital data collection system. Each utterance is recorded as a separate file. These files were sampled at 8 kHz, 8-bit, and stored as ulaw files. All of the data use the RIFF standard file format. This file format is 16-bit linearly encoded.

Samples

For an example of the data in this corpus, please listen to this recording of a subject speaking the word 'computer': SD-1030-computer-t3-67.

Portions © 1996, 1998, 2000, 2002 Center for Spoken Language Understanding, Oregon Health and Science University, © 2009 Trustees of the University of Pennsylvania
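
The figures in the description can be made concrete with a short Python sketch. It does two things: it multiplies out the recording design (speakers, words, repetitions, channels) to get the nominal number of utterances, and it shows the standard G.711 mu-law expansion that turns an 8-bit ulaw code into a 16-bit linear sample. Everything here is an illustration under assumptions: the description does not state the actual file count, so the product is only what the design implies, and it does not say whether a given file's payload is ulaw or already linear, so check the RIFF header before applying the decoder. The function names are hypothetical and are not part of any CSLU or LDC tooling.

```python
# Corpus design figures taken from the description above.
SPEAKERS = 36
WORDS = 11
REPETITIONS = 6
CHANNELS = 4

ULAW_BIAS = 0x84  # bias constant used by G.711 mu-law companding


def nominal_utterance_count() -> int:
    """Utterances implied by the design; the released corpus may contain fewer."""
    return SPEAKERS * WORDS * REPETITIONS * CHANNELS


def ulaw_to_linear(code: int) -> int:
    """Expand one 8-bit G.711 mu-law code to a signed 16-bit linear sample."""
    code = ~code & 0xFF               # mu-law bytes are stored bit-inverted
    mantissa = code & 0x0F            # low 4 bits
    exponent = (code >> 4) & 0x07     # 3-bit segment number
    magnitude = (((mantissa << 3) + ULAW_BIAS) << exponent) - ULAW_BIAS
    return -magnitude if code & 0x80 else magnitude


def decode_ulaw(payload: bytes) -> list[int]:
    """Decode a raw mu-law payload (assumed: header already stripped)."""
    return [ulaw_to_linear(b) for b in payload]
```

For example, `nominal_utterance_count()` returns 9504, and `decode_ulaw(b"\xff\x80")` returns `[0, 32124]`: silence followed by the largest positive mu-law value.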

Created:
2024-01-31
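Background overview
CSLU: S4X Release 1.2 is an English speech recognition dataset containing recordings of 36 speakers repeating 11 specified words over four telephone channels. The data comes with phoneme-level and word-level transcriptions and was verified by human reviewers, which makes it suitable for speech recognition research.
The summary above was collected and generated by 遇见数据集.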