
TI 46-Word

Source: Mendeley Data. Updated 2024-01-31; indexed 2024-06-29.
Download link:
https://catalog.ldc.upenn.edu/LDC93S9
Resource description:
Introduction
This release contains a speech corpus that was designed and collected at Texas Instruments, Inc. (TI) in 1980 and used initially in performance assessment tests of isolated-word, speaker-dependent technology. (See "Speech Recognition: Turning Theory to Practice" by G. R. Doddington and T. B. Schalk, in IEEE Spectrum, Vol. 18, No. 9, September 1981.) The 46-word vocabulary consists of two sub-vocabularies: (1) the TI 20-word vocabulary (the digits zero through nine plus the words "enter," "erase," "go," "help," "no," "rubout," "repeat," "stop," "start," and "yes") and (2) the TI 26-word "alphabet set" (the letters "a" through "z").

Data
The corpus contains read utterances from 16 speakers (eight males and eight females), each speaking 26 utterances of the 46-word vocabulary: 16 tokens designated as training and ten as test. Note that these numbers reflect the aim of the collection; for various reasons, the full number of utterances was not reached for some speakers. See the included documentation for more information. The corpus was collected at Texas Instruments in a quiet acoustic enclosure using an Electro-Voice RE-16 Dynamic Cardioid microphone at a 12.5 kHz sample rate with 12-bit quantization. The files are in NIST SPHERE format and have a ".wav" filename extension.

Updates
As of October 5, 2016, the documentation was updated to more closely reflect the file inventory.
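The vocabulary and the collection targets described above can be written out as a short Python sketch. It takes the description's figures at face value and assumes that "26 utterances of the 46-word vocabulary" means 26 tokens of each word per speaker; the constant and function names are illustrative, not part of the corpus. Because some speakers did not reach the full number of utterances, these are nominal targets rather than actual file counts.

```python
import string

# TI 20-word sub-vocabulary: ten digits plus ten command words (from the description).
DIGITS = ["zero", "one", "two", "three", "four",
          "five", "six", "seven", "eight", "nine"]
COMMANDS = ["enter", "erase", "go", "help", "no",
            "rubout", "repeat", "stop", "start", "yes"]
TI20 = DIGITS + COMMANDS

# TI 26-word "alphabet set": the letters "a" through "z".
ALPHABET = list(string.ascii_lowercase)

VOCABULARY = TI20 + ALPHABET

# Collection targets as stated in the description.
N_SPEAKERS = 16
TRAIN_TOKENS = 16  # tokens per word designated as training
TEST_TOKENS = 10   # tokens per word designated as test


def nominal_counts():
    """Return the nominal (target) utterance counts implied by the description.

    Assumption: every speaker records every word TRAIN_TOKENS + TEST_TOKENS times.
    """
    n_words = len(VOCABULARY)
    return {
        "vocabulary_size": n_words,
        "per_speaker": n_words * (TRAIN_TOKENS + TEST_TOKENS),
        "train": N_SPEAKERS * n_words * TRAIN_TOKENS,
        "test": N_SPEAKERS * n_words * TEST_TOKENS,
        "total": N_SPEAKERS * n_words * (TRAIN_TOKENS + TEST_TOKENS),
    }


if __name__ == "__main__":
    for key, value in nominal_counts().items():
        print(f"{key}: {value}")
```

Under these assumptions the target is 1,196 utterances per speaker and 19,136 overall; the included documentation remains the authority on what was actually recorded.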

Created:
2024-01-31
Background overview
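TI 46-Word is an English speech dataset for speech recognition research. It contains recordings of a 46-word vocabulary from 16 speakers, sampled at 12.5 kHz with 12-bit quantization. The data was originally collected by Texas Instruments, and the files are in NIST SPHERE format.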
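Because the files are NIST SPHERE rather than RIFF/WAVE despite the ".wav" extension, generic WAV readers may reject them. The sketch below shows how the plain-text SPHERE header could be read in Python. It relies on the general SPHERE layout (a "NIST_1A" magic line, a header-size line, then "name -type value" fields terminated by "end_head"). The demo header is synthetic: apart from the 12.5 kHz sample rate, its field values are assumptions, including the guess that 12-bit samples are stored in 2 bytes, and the function names are illustrative.

```python
def parse_sphere_header(data: bytes) -> dict:
    """Parse the ASCII header at the start of a NIST SPHERE file."""
    # The first 16 bytes hold the magic string and the total header size.
    magic, size_line = data[:16].decode("ascii").split("\n")[:2]
    if magic != "NIST_1A":
        raise ValueError("not a NIST SPHERE header")
    header_size = int(size_line.strip())

    fields = {"header_size": header_size}
    text = data[16:header_size].decode("ascii", errors="replace")
    for line in text.split("\n"):
        line = line.strip()
        if line == "end_head":
            break
        if not line or line.startswith(";"):  # skip blanks and comments
            continue
        parts = line.split(None, 2)
        if len(parts) != 3:
            continue
        name, ftype, value = parts
        if ftype == "-i":      # integer field
            fields[name] = int(value)
        elif ftype == "-r":    # real-valued field
            fields[name] = float(value)
        else:                  # "-sN": string field of length N
            fields[name] = value
    return fields


def make_demo_header() -> bytes:
    """Build a synthetic 1024-byte header for illustration (values are assumed)."""
    body = (
        "NIST_1A\n   1024\n"
        "channel_count -i 1\n"
        "sample_rate -i 12500\n"   # 12.5 kHz, as stated in the description
        "sample_n_bytes -i 2\n"    # assumption: 12-bit samples stored in 2 bytes
        "sample_count -i 6250\n"   # hypothetical length
        "end_head\n"
    )
    return body.encode("ascii").ljust(1024, b" ")


if __name__ == "__main__":
    header = parse_sphere_header(make_demo_header())
    duration = header["sample_count"] / header["sample_rate"]
    print(header)
    print(f"duration: {duration} s")
```

For a real corpus file, the first `header_size` bytes would be passed to `parse_sphere_header`, and the audio samples would start at that offset; the fields actually present in the TI 46-Word files should be checked against the included documentation.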
The summary above was compiled and generated by 遇见数据集.