
CSLU: Numbers Version 1.3

Source: Mendeley Data. Updated 2024-01-31; indexed 2024-06-30.
Download link:
https://catalog.ldc.upenn.edu/LDC2009S01
Description:
Introduction: CSLU: Numbers Version 1.3, Linguistic Data Consortium (LDC) catalog number LDC2009S01 and ISBN 1-58563-501-4, was created by the Center for Spoken Language Understanding (CSLU) at OGI School of Science and Engineering, Oregon Health and Science University, Beaverton, Oregon. It is a collection of naturally produced numbers taken from utterances in various CSLU telephone speech data collections. The corpus consists of approximately fifteen hours of speech and includes isolated digit strings, continuous digit strings, and ordinal/cardinal numbers.

The numbers come from several sources, including phone numbers, street addresses, and zip codes, and were uttered by 12618 speakers in a total of 23902 files. In most of CSLU's telephone data collections, callers were asked for their phone number, birthdate, or zip code. Callers would also occasionally say numbers in the midst of another utterance; in those cases the numbers were extracted from the host utterance and added to the corpus. Additional information about this publication is available from the corpus web page at CSLU.

Data: The speech data was collected over analog and digital telephone lines. The analog data was recorded using a Gradient Technologies analog-to-digital conversion box; those files were recorded at 16 bits and 8 kHz and stored in a linear format. The digital data was recorded with the CSLU T1 digital data collection system; those files were sampled at 8 kHz and 8 bits and stored as ulaw files. All of the data in this release has been linearly encoded in 16-bit RIFF standard file format. Each file includes an orthographic transcription following the CSLU Labeling guidelines, which are included in the documentation for this publication. Many of the utterances have also been phonetically labeled.

Statistics: CSLU: Numbers Version 1.3 consists of approximately fifteen hours of speech. The following table gives the number of files for each utterance type.

| Type | Number |
|---------|--------|
| phone | 2970 |
| street | 7079 |
| zipcode | 7076 |
| other | 6771 |

Samples: For an example of the data contained in this corpus, please examine the audio files and labels for the following spoken sequences:

- Street address: one sixteen (wav | label)
- Zipcode: one oh three one four (wav | label)

Portions © 1998, 2000, 2002 Center for Spoken Language Understanding, Oregon Health & Science University, © 2009 Trustees of the University of Pennsylvania
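The Data paragraph says the digitally recorded files were captured as 8-bit ulaw at 8 kHz and that the release stores everything as 16-bit linear RIFF files. The description does not say how CSLU performed that conversion, so the following is only a minimal Python sketch of what such a step could look like. It assumes the ulaw encoding is standard G.711 mu-law, that the audio is single-channel, and that the RIFF files are ordinary WAV files readable by Python's `wave` module. The function names `ulaw_to_linear` and `ulaw_bytes_to_wav` are hypothetical and are not part of any CSLU or LDC tooling.

```python
import struct
import wave

SAMPLE_RATE_HZ = 8000  # sampling rate stated in the corpus description


def ulaw_to_linear(byte: int) -> int:
    """Decode one mu-law byte to a signed 16-bit linear sample.

    Assumption: standard G.711 mu-law (bits stored inverted, bias 0x84).
    """
    u = ~byte & 0xFF              # mu-law bytes are stored bit-inverted
    sign = u & 0x80               # top bit is the sign
    exponent = (u >> 4) & 0x07    # 3-bit segment number
    mantissa = u & 0x0F           # 4-bit step within the segment
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if sign else magnitude


def ulaw_bytes_to_wav(ulaw_data: bytes, wav_path: str) -> int:
    """Write raw mu-law bytes as a 16-bit mono PCM WAV file.

    Returns the number of samples written. Mono is an assumption.
    """
    samples = [ulaw_to_linear(b) for b in ulaw_data]
    pcm = struct.pack("<%dh" % len(samples), *samples)  # little-endian int16
    with wave.open(wav_path, "wb") as out:
        out.setnchannels(1)
        out.setsampwidth(2)       # 2 bytes = 16 bits per sample
        out.setframerate(SAMPLE_RATE_HZ)
        out.writeframes(pcm)
    return len(samples)
```

A caller would read a raw ulaw file as bytes and pass it to `ulaw_bytes_to_wav` together with an output path. Because each 8-bit input sample becomes one 16-bit output sample, the audio payload doubles in size while the duration stays the same.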

Created: 2024-01-31