vietnamese-audio-dataset

Name: vietnamese-audio-dataset
Creator: maas
Published: 2025-12-05 16:48:50
License: not specified in the listing metadata (the dataset card below states CC BY 4.0)

Source: ModelScope community. Updated: 2025-12-05. Indexed: 2025-12-06.

Download link:

https://modelscope.cn/datasets/Kratos-AI/vietnamese-audio-dataset

Description:

# Vietnamese Audio Dataset

This dataset contains high-quality (“A-grade”) data. It has been carefully curated, cleaned, and verified to ensure accuracy, completeness, and consistency, making it suitable for high-stakes or production-grade model training.

## Contact

For queries or collaborations related to this dataset, contact:

- anoushka@kgen.io
- abhishek.vadapalli@kgen.io

## Supported Tasks

- **Task Categories**: Speech Emotion Recognition (SER)
- **Supported Tasks**:
  - Emotion classification from speech
  - Audio signal processing for affective computing
  - Speaker demographic analysis
  - Cross-cultural emotion recognition research
  - Voice synthesis with emotional expression (secondary use)

## Languages

- **Primary Language**: Vietnamese

## Dataset Creation

### Curation Rationale

This dataset was created to advance Vietnamese speech emotion recognition research by providing labeled emotional speech samples across different demographic groups.

### Source Data

- **Contributors**: Vietnamese native speakers across different age groups and genders

### Other Known Limitations

- **Size**: Relatively small dataset may limit model generalization
- **Audio Quality**: Variations in recording conditions may affect model performance
- **Regional Dialects**: May not represent all Vietnamese regional speech patterns

## Intended Uses

### ✅ Direct Use

- Training and benchmarking Speech Emotion Recognition models for Vietnamese
- Research in cross-cultural emotion recognition
- Development of Vietnamese-language affective computing applications
- Academic research in computational linguistics and psychology

### ❌ Out-of-Scope Use

- Real-time production systems without additional validation
- Clinical or diagnostic applications for mental health
- Commercial use without proper attribution
- Surveillance or privacy-invasive applications

## License

CC BY 4.0
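
## Example: Speaker-Disjoint Split (Illustrative)

The card lists benchmarking SER models as a direct use and flags the small dataset size as a limit on generalization. One common precaution in that setting is to keep each speaker entirely in either the training or the test set, so a model is not scored on voices it has already heard. The sketch below shows that idea in plain Python. The card does not document the dataset's schema, so the record fields `path`, `speaker_id`, and `emotion`, the function name `speaker_disjoint_split`, and the sample records are all hypothetical and would need to be mapped to the real metadata.

```python
import random
from collections import defaultdict


def speaker_disjoint_split(records, test_fraction=0.2, seed=0):
    """Split records so that no speaker appears in both train and test.

    `records` is a list of dicts with a "speaker_id" key (a hypothetical
    field name; the dataset card does not specify the real schema).
    """
    # Group clips by speaker.
    by_speaker = defaultdict(list)
    for rec in records:
        by_speaker[rec["speaker_id"]].append(rec)

    if len(by_speaker) < 2:
        raise ValueError("need at least two speakers for a disjoint split")

    # Shuffle speakers (not clips) deterministically, then hold some out.
    speakers = sorted(by_speaker)
    random.Random(seed).shuffle(speakers)
    n_test = max(1, round(len(speakers) * test_fraction))
    test_speakers = set(speakers[:n_test])

    train = [r for s in speakers if s not in test_speakers for r in by_speaker[s]]
    test = [r for s in speakers if s in test_speakers for r in by_speaker[s]]
    return train, test


# Hypothetical metadata: 20 clips from 5 speakers with made-up labels.
records = [
    {"path": f"clip_{i}.wav", "speaker_id": f"spk{i % 5}", "emotion": ["happy", "sad"][i % 2]}
    for i in range(20)
]

train, test = speaker_disjoint_split(records, test_fraction=0.2, seed=0)
print(len(train), len(test))
```

Splitting by speaker rather than by clip tends to give a more conservative estimate of how a model behaves on unseen voices, which matters more when the pool of contributors is small.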


Provider: maas
Created: 2025-08-30
