five

Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea

收藏
Datarade2024-04-19 收录
下载链接:
https://datarade.ai/data-products/vocal-characterizer-dataset-deeply
下载链接
链接失效反馈
官方服务:
资源简介:
The Vocal Characterizer Dataset is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers, crowdsourced by the general public in South Korea and validated by the AI data platform. Also, the dataset includes metadata such as age, sex, noise level, and quality of utterance. 16 classes of Included human nonverbal sound contain ‘teeth-chattering’, ‘teeth-grinding’, ‘tongue-clicking’, ‘nose-blowing’, ‘coughing’, ‘yawning’, ‘throat-clearing’, ‘sighing’, ‘lip-popping’, ‘lip-smacking’, ‘panting’, ’crying’, ‘laughing’, ‘sneezing’, ‘moaning’, and ‘screaming’. The dataset is the first dataset to the world due to its large volume, various types of nonverbal vocal cues, and various participants. We expect that the utilization of this dataset would bring precise detection of the nonverbal vocal cues, and a better understanding of the human conversation. We're ready to deliver further information, statistics, or samples upon request. Don't hesitate to reach out! *The dataset can be delivered as either original wav files(44,100Hz, 16-bit PCM, 1-channel) or a single compressed h5 file(resampled to 16,000Hz).
提供机构:
Deeply
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作