five

Xi'an Guanzhong Object Naming

收藏
DataCite Commons2022-09-12 更新2024-07-13 收录
下载链接:
https://catalog.ldc.upenn.edu/LDC2022S09
下载链接
链接失效反馈
官方服务:
资源简介:
<h3>Introduction</h3><br> <p>Xi'an Guanzhong Object Naming is comprised of approximately 15 hours of audio recordings from speakers of the Guanzhong dialect of Mandarin Chinese living in or near Xi'an in Shaangxi Province (China) naming objects that appeared in colored line drawings. The corpus was developed to support traditional and computer aided language documentation.</p><br> <h3>Data</h3><br> <p>This collection was conducted from February-May 2021 using <a href="https://languagearc.com/">LanguageArc</a>, a citizen science portal developed by the Linguistic Data Consortium, from a closed volunteer community. Speakers were presented with images selected from the <a href="https://www.bcbl.eu/databases/multipic">MultiPic dataset</a> and were asked to record themselves naming the objects in the images.</p><br> <p>The task yielded 34,729 audio recordings. The data is organized into 622 directories according to the image presented. Each directory contains on average 42 recordings sampled at 16kHz, 16bit, in single channel, FLAC encoded files.</p><br> <h3>Samples</h3><br> <p>Please view the following <a href="desc/addenda/LDC2022S09.flac">sample</a>. Note that due to the very short length of the audio files in this corpus, some browsers and applications may have difficulty playing the files.</p><br> <h3>Updates</h3><br> <p>None at this time.</p></br> Portions © 2022 Trustees of the University of Pennsylvania
提供机构:
Linguistic Data Consortium
创建时间:
2022-09-12
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作