five

Magic Data Chinese Mandarin Conversational Speech

收藏
DataCite Commons2021-07-01 更新2025-04-16 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2019S23
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>Magic Data Chinese Mandarin Conversational Speech was developed by <a href="https://www.magicdatatech.com/">Beijing Magic Data Technology Co., Ltd.</a> and consists of approximately 10 hours of Mandarin conversational speech from 60 speakers. Each conversation was recorded on multiple devices and is presented in multiple forms, resulting in a total of approximately 60 hours of audio with corresponding transcripts.</p><br> <h3>Data</h3><br> <p>All participants were native speakers of Mandarin in Mainland China from accent regions across the country. Speakers were paired for conversations on a range of topics, including travel, fitness, games, sports and pets.</p><br> <p>Speech data was recorded on mobile devices and is presented as 16kHz, 16-bit flac compressed pcm wav. Most files are single channel; however, a stereo version of each conversation is also included.</p><br> <p>Transcript data is contained in UTF-8 encoded plain text <a href="http://www.fon.hum.uva.nl/praat/manual/TextGrid.html">TextGrids</a>. Metadata such as topic, collection date, mobile device and speaker demographic information is found in the documentation accompanying this release.</p><br> <h3>Samples</h3><br> <p>Please view this <a href="desc/addenda/LDC2019S23.flac">stereo speech sample</a> and <a href="desc/addenda/LDC2019S23.txt">transcript sample</a>.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2019 Beijing Magic Data Technology Co., Ltd., © 2019 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2020-11-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作