
REMIX Telephone Collection

Source: DataCite Commons. Updated: 2025-05-06. Indexed: 2024-07-13.
Download link:
https://catalog.ldc.upenn.edu/LDC2023S09
Description:
<h3>Introduction</h3>
<p>REMIX Telephone Collection was developed by the Linguistic Data Consortium (LDC) and contains 320 hours of English conversational telephone speech from 358 speakers who had completed all tasks in one of the previous LDC Mixer collections, specifically, Mixers 4-7. The data was collected in 2012; recordings in this corpus were used to support the NIST 2012 Speaker Recognition Evaluation.</p>
<h3>Data</h3>
<p>The audio recordings were generated using LDC's computer telephony system capable of collecting speech from the telephone network. Recruited speakers were connected through a robot operator to carry on casual conversations on suggested topics lasting up to 10 minutes. Subjects were asked to complete 12 calls, half of those in a "noisy" environment. Examples of proposed noisy environments included using a speakerphone, calling from a busy street, noisy store or office, or calling from a room with loud background noise.</p>
<p>The documentation for this release includes call topics, the number of calls per subject, the number of noisy calls and certain speaker demographic information (e.g., year of birth, education level, occupation).</p>
<p>The REMIX collection contains 1917 telephone recordings. The files are formatted as 2-channel, 8-bit, mu-law encoded sample data recorded at 8000 samples/second, with a NIST SPHERE-format header on each file.</p>
<h3>Samples</h3>
<p><a href="desc/addenda/LDC2023S09.sph">SPH file</a></p>
<h3>Updates</h3>
<p>None at this time.</p>
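The file format described above (2-channel, 8-bit mu-law at 8000 samples/second behind a NIST SPHERE header) can be read with a few lines of standard-library Python. The following is a minimal sketch, not LDC tooling: the function names are hypothetical, the header field names (`channel_count`, `sample_rate`) follow the general SPHERE convention and are not confirmed for this release, and the sketch assumes the sample data is stored uncompressed with the two channels interleaved sample by sample. The mu-law expansion follows the standard G.711 formula.

```python
def parse_sphere_header(data: bytes) -> dict:
    """Parse the ASCII header of a NIST SPHERE file into a dict."""
    first, second, _ = data.split(b"\n", 2)
    if first.strip() != b"NIST_1A":
        raise ValueError("not a NIST SPHERE file")
    # The second line gives the total header size in bytes (commonly 1024).
    header_size = int(second.decode("ascii").strip())
    fields = {"header_size": header_size}
    text = data[:header_size].decode("ascii", errors="replace")
    for raw in text.split("\n")[2:]:
        line = raw.strip()
        if line == "end_head":
            break
        parts = line.split(None, 2)
        if len(parts) < 3 or line.startswith(";"):
            continue  # skip blank, comment, or malformed lines
        name, ftype, value = parts
        if ftype == "-i":
            fields[name] = int(value)
        elif ftype == "-r":
            fields[name] = float(value)
        else:  # "-sN": string of N characters
            fields[name] = value
    return fields


def mulaw_to_pcm16(byte: int) -> int:
    """Expand one G.711 mu-law byte to a signed linear sample."""
    u = ~byte & 0xFF                  # mu-law bytes are stored bit-inverted
    exponent = (u >> 4) & 0x07
    mantissa = u & 0x0F
    magnitude = (((mantissa << 3) + 0x84) << exponent) - 0x84
    return -magnitude if (u & 0x80) else magnitude


def read_sphere_ulaw(data: bytes):
    """Return (header fields, list of per-channel sample lists).

    Assumes uncompressed mu-law data, interleaved by channel.
    """
    fields = parse_sphere_header(data)
    n_channels = fields.get("channel_count", 1)
    samples = [mulaw_to_pcm16(b) for b in data[fields["header_size"]:]]
    channels = [samples[c::n_channels] for c in range(n_channels)]
    return fields, channels
```

With a real file, `read_sphere_ulaw(open(path, "rb").read())` would return one list of samples per side of the call. If the header's `sample_coding` field indicates embedded compression, the payload would need to be decompressed first, which this sketch does not attempt.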
Provider:
Linguistic Data Consortium
Created:
2023-11-16