negfir/speech_commands_pitch_100hz
收藏Hugging Face2025-11-11 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/negfir/speech_commands_pitch_100hz
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含音频文件、标签、是否未知、说话者ID、话语ID、性别、音高七个字段。标签分为34个类别,包括肯定、否定、上下左右等方向指示、数字、物体名称、情感表达等。数据集分为训练集、测试集和验证集,共计约31GB大小。训练集包含84156个样本,测试集包含4505个样本,验证集包含9931个样本。
The dataset includes seven fields: audio file, label, unknown flag, speaker ID, utterance ID, gender, and pitch. The labels are categorized into 34 classes, which include affirmations, negations, directional indicators, numbers, object names, emotional expressions, etc. The dataset is split into training, test, and validation sets, totaling approximately 31GB in size. The training set contains 84156 samples, the test set contains 4505 samples, and the validation set contains 9931 samples.
提供机构:
negfir



