Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea

Name: Deeply Vocal Characterizer Dataset - AI & ML Training Data, South Korea
Creator: Deeply
License: 暂无描述

Datarade2024-04-19 收录

下载链接：

https://datarade.ai/data-products/vocal-characterizer-dataset-deeply

下载链接

链接失效反馈

官方服务：

资源简介：

The Vocal Characterizer Dataset is a human nonverbal vocal sound dataset consisting of 56.7 hours of short clips from 1419 speakers, crowdsourced by the general public in South Korea and validated by the AI data platform. Also, the dataset includes metadata such as age, sex, noise level, and quality of utterance. 16 classes of Included human nonverbal sound contain ‘teeth-chattering’, ‘teeth-grinding’, ‘tongue-clicking’, ‘nose-blowing’, ‘coughing’, ‘yawning’, ‘throat-clearing’, ‘sighing’, ‘lip-popping’, ‘lip-smacking’, ‘panting’, ’crying’, ‘laughing’, ‘sneezing’, ‘moaning’, and ‘screaming’. The dataset is the first dataset to the world due to its large volume, various types of nonverbal vocal cues, and various participants. We expect that the utilization of this dataset would bring precise detection of the nonverbal vocal cues, and a better understanding of the human conversation. We're ready to deliver further information, statistics, or samples upon request. Don't hesitate to reach out! *The dataset can be delivered as either original wav files(44,100Hz, 16-bit PCM, 1-channel) or a single compressed h5 file(resampled to 16,000Hz).

提供机构：

Deeply

5,000+

优质数据集

54 个

任务类型

进入经典数据集