ExHuBERT Emotion Datasets
收藏数据集概述
ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets
- 数据集名称: ExHuBERT
- 数据来源: 37个情感数据集
- 样本数量: 150,907个样本
- 总时长: 119.5小时
- 支持语言: 英语、德语、中文、法语、荷兰语、希腊语、意大利语、西班牙语、缅甸语、希伯来语、瑞典语、波斯语、土耳其语、乌尔都语
数据集应用示例
-
模型加载: python import torch from transformers import AutoModelForAudioClassification, Wav2Vec2FeatureExtractor
model_name = amiriparian/ExHuBERT feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/hubert-base-ls960") model = AutoModelForAudioClassification.from_pretrained(model_name, trust_remote_code=True, revision="b158d45ed8578432468f3ab8d46cbe5974380812")
-
模型配置: python model.freeze_og_encoder() sampling_rate = 16000 device = torch.device("cuda" if torch.cuda.is_available() else "cpu") model = model.to(device)
-
音频处理与预测: python import numpy as np import librosa import torch.nn.functional as F
waveform, sr_wav = librosa.load("audio_002.wav") waveform = feature_extractor(waveform, sampling_rate=sampling_rate, padding=max_length, max_length=48000) waveform = waveform[input_values][0] waveform = waveform.reshape(1, -1) waveform = torch.from_numpy(waveform).to(device)
with torch.no_grad(): output = model(waveform) output = F.softmax(output.logits, dim=1) output = output.detach().cpu().numpy().round(2) print(output)
-
输出示例: python
[[0. 0. 0. 1. 0. 0.]]
Low | High Arousal
Neg. Neut. Pos. | Neg. Neut. Pos Valence
Disgust, Neutral, Kind| Anger, Surprise, Joy Example emotions




