CSLU: S4X Release 1.2

Mendeley Data; updated 2024-01-31; indexed 2024-06-28

Download link:

https://catalog.ldc.upenn.edu/LDC2009S03


Resource description:

Introduction

CSLU: S4X Release 1.2, Linguistic Data Consortium (LDC) catalog number LDC2009S03 and ISBN 1-58563-523-5, was created by the Center for Spoken Language Understanding, Oregon Health and Science University (CSLU).

The corpus consists of 36 speakers (22 male, 14 female) uttering 11 specified words. The speakers repeated the following words six times on each of four channels: startrek, supernova, tektronix, generation, nebula, processing, singularity, 71523, abracadabra, sungeeta and computer. The four channels used were office phone, home phone, carbon microphone telephone and speaker phone.

Each speech file has a corresponding time-aligned phoneme-level transcription (produced by automatic forced alignment) and an automatically generated word-level transcription. Humans reviewed each utterance in two passes and classified it as good, bad, noisy or different. The results of this verification process are included in the /docs directory.

Data

The data was recorded with the CSLU T1 digital data collection system. Each utterance is recorded as a separate file. These files were sampled at 8 kHz, 8-bit, and stored as ulaw files. All of the data use the RIFF standard file format. This file format is 16-bit linearly encoded.

Samples

For an example of the data in this corpus, see the recording of a subject speaking the word 'computer': SD-1030-computer-t3-67.

Portions © 1996, 1998, 2000, 2002 Center for Spoken Language Understanding, Oregon Health and Science University, © 2009 Trustees of the University of Pennsylvania
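As a rough sanity check on the recording design described above, the following sketch multiplies out the stated factors (36 speakers, 11 words, six repetitions, four channels). The result is only a nominal figure: the description does not state the actual file count, and utterances judged bad or missing during verification could make the real number differ. The function name is hypothetical.

```python
# Factors taken from the corpus description; the function name is illustrative.
SPEAKERS = 36
WORDS = [
    "startrek", "supernova", "tektronix", "generation", "nebula",
    "processing", "singularity", "71523", "abracadabra", "sungeeta",
    "computer",
]
CHANNELS = [
    "office phone", "home phone",
    "carbon microphone telephone", "speaker phone",
]
REPETITIONS = 6


def nominal_utterance_count(speakers=SPEAKERS, words=WORDS,
                            channels=CHANNELS, repetitions=REPETITIONS):
    """Return speakers x words x channels x repetitions."""
    return speakers * len(words) * len(channels) * repetitions


print(nominal_utterance_count())  # 36 * 11 * 4 * 6 = 9504
```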

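The description mentions both 8-bit ulaw storage and a 16-bit linear RIFF format, so it may be worth checking the header of each file before decoding. The sketch below reads the `fmt ` chunk of a RIFF/WAVE byte string with Python's `struct` module and reports the encoding, sample rate and bit depth. It assumes the corpus files are RIFF/WAVE files as the description suggests; the function name and the synthetic header used in the demonstration are illustrative and are not taken from the corpus.

```python
import struct

# Format tags defined by the RIFF/WAVE specification.
FORMAT_NAMES = {1: "linear PCM", 6: "A-law", 7: "mu-law"}


def read_wave_format(data: bytes) -> dict:
    """Parse the 'fmt ' chunk of a RIFF/WAVE byte string."""
    if len(data) < 12 or data[:4] != b"RIFF" or data[8:12] != b"WAVE":
        raise ValueError("not a RIFF/WAVE file")
    pos = 12
    while pos + 8 <= len(data):
        chunk_id = data[pos:pos + 4]
        (size,) = struct.unpack_from("<I", data, pos + 4)
        if chunk_id == b"fmt ":
            if size < 16 or pos + 24 > len(data):
                raise ValueError("truncated 'fmt ' chunk")
            tag, channels, rate, _byte_rate, _align, bits = struct.unpack_from(
                "<HHIIHH", data, pos + 8
            )
            return {
                "encoding": FORMAT_NAMES.get(tag, f"unknown ({tag})"),
                "channels": channels,
                "sample_rate": rate,
                "bits_per_sample": bits,
            }
        # Chunks are padded to an even number of bytes.
        pos += 8 + size + (size & 1)
    raise ValueError("no 'fmt ' chunk found")


# Synthetic header for demonstration only (not corpus data):
# mono, 8000 Hz, 8-bit mu-law, with four payload bytes.
payload = b"\xff" * 4
fmt_chunk = b"fmt " + struct.pack("<IHHIIHH", 16, 7, 1, 8000, 8000, 1, 8)
data_chunk = b"data" + struct.pack("<I", len(payload)) + payload
body = b"WAVE" + fmt_chunk + data_chunk
synthetic = b"RIFF" + struct.pack("<I", len(body)) + body

print(read_wave_format(synthetic))
```

For a real file, the same function can be applied to the bytes returned by `open(path, "rb").read()`, where `path` is whatever location the corpus was unpacked to.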

Created:

2024-01-31

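Summary

Background overview

CSLU: S4X Release 1.2 is an English speech recognition dataset containing recordings of 36 speakers repeating 11 specific words over four telephone channels. The data comes with phoneme-level and word-level transcriptions and has been verified by human reviewers, making it suitable for speech recognition research.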

The summary above was collected and generated by 遇见数据集.
