Nonverbal Vocalization Dataset (Deeply Inc.)
Indexed by Payititi (帕依提提) on 2024-03-04
Download link:
https://www.payititi.com/opendatasets/show-26438.html
Resource description:
About this resource: The Nonverbal Vocalization Dataset is a dataset of human nonverbal vocal sounds, crowdsourced from the general public in South Korea. It also includes metadata such as age, sex, noise level, and quality of utterance. The 16 classes of human nonverbal sound included are 'teeth-chattering', 'teeth-grinding', 'tongue-clicking', 'nose-blowing', 'coughing', 'yawning', 'throat clearing', 'sighing', 'lip-popping', 'lip-smacking', 'panting', 'crying', 'laughing', 'sneezing', 'moaning', and 'screaming'. The dataset is a subset (approximately 1%) of a much larger dataset that was recorded under the same environment as this public dataset. Please visit the Deeply Inc. website or GitHub, or contact us, for more details and for access to the full dataset under a commercial license. Deeply uses audio AI technology to build products that anyone can use, with the aim of making people's lives happier. For more products and services, please visit Deeply Inc.

Structure

You can cite the data as follows:

Contact
Tel: (+82) 70-7459-0704
Web: http://deeplyinc.com/
Email: contact@deeplyinc.com
Provided by:
Payititi (帕依提提)
Collected summary
Dataset introduction

Background and challenges
Background overview
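This is a crowdsourced nonverbal vocalization dataset collected from the general public in South Korea. It contains 16 classes of human nonverbal sounds (such as teeth-chattering, coughing, and laughing), has a total duration of about 0.6 hours (the full set reaches 57 hours), and involves about 500 speakers. The audio is 16 kHz, 16-bit, mono, and comes with metadata such as age, sex, noise level, and utterance quality. It is suitable for speech recognition and audio analysis research.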
The above content was collected and summarized by 遇见数据集.
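The background overview states an audio format of 16 kHz, 16-bit, mono. The following is a minimal sketch of how a downloaded clip could be checked against that format using Python's standard `wave` module. It assumes the clips are stored as PCM WAV files, which the listing does not confirm; the function names `describe_wav` and `matches_stated_format` are hypothetical helpers, not part of any Deeply Inc. tooling.

```python
import wave

# Format stated in the dataset overview: 16 kHz, 16-bit, mono.
EXPECTED_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits = 2 bytes per sample
EXPECTED_CHANNELS = 1


def describe_wav(path):
    """Return the basic format and duration of a PCM WAV file.

    Assumption: the clip is a PCM WAV file; the listing does not
    say which container format the dataset actually uses.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        return {
            "rate_hz": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / rate,
        }


def matches_stated_format(info):
    """Check a describe_wav() result against the stated format."""
    return (
        info["rate_hz"] == EXPECTED_RATE_HZ
        and info["sample_width_bytes"] == EXPECTED_SAMPLE_WIDTH_BYTES
        and info["channels"] == EXPECTED_CHANNELS
    )
```

Summing `duration_s` over all clips and dividing by 3600 would give a total in hours that can be compared with the roughly 0.6 hours quoted for the public subset.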



