medical-opinion-english-audio
收藏魔搭社区2025-12-05 更新2025-08-16 收录
下载链接:
https://modelscope.cn/datasets/Kratos-AI/medical-opinion-english-audio
下载链接
链接失效反馈官方服务:
资源简介:
# Medical Opinion English Audio Dataset
*This dataset contains intentionally low-quality (“B-grade”) data. It has been curated to include noisy, imperfect, or otherwise suboptimal samples for the purpose of testing model robustness and performance under degraded input conditions
**Text spoken by all participants:**
""Doctor, another physician suggested my chest pain is stress-related, but I'm anxious. It feels like a heavy weight on my heart, and I struggle to breathe deeply. I'm scared. What's your opinion? I need reassurance.""
The dataset supports training and evaluation of models in:
- Automatic Speech Recognition (ASR)
- Emotional tone classification
- Voice synthesis and generation
- Emotion-aware conversational agents
---
## Intended Uses
### ✅ Direct Use
- Training and benchmarking ASR models with Indian-accented English
- Emotion detection and classification from voice
- Research in affective computing and empathetic AI
### ❌ Out-of-Scope Use
- Real-time or production-grade systems
- Commercial use without proper CC BY 4.0 attribution
- Clinical or diagnostic use cases
---
## Considerations and Limitations
- ❗ The dataset is small (<1,000 samples) and not fully representative of India's linguistic and emotional diversity
- 💡 Emotions are subjective — classification results may vary by listener or model
- 🔄 Future versions will aim to expand multilingual support and speaker diversity
---
## License
**CC BY 4.0** — You can use, modify, and share the dataset with appropriate credit.
---
## Contact
- For queries or collaborations related to datasets, contact at :
- anoushka@kgen.io
- abhishek.vadapalli@kgen.io
---
医学意见英语音频数据集(Medical Opinion English Audio Dataset)
本数据集采用刻意制作的低质量(“B级”)数据,经筛选纳入含噪、不完美或其他次优样本,用于测试模型在劣化输入条件下的鲁棒性与性能表现。
### 所有参与者的口述文本:
"医生,另一位医师认为我的胸痛是压力诱发的,但我十分焦虑。我感觉胸口像压了一块重物,深呼吸时倍感困难。我很害怕。您有什么看法?我需要得到安心的答复。"
本数据集可支持以下领域的模型训练与评估:
- 自动语音识别(Automatic Speech Recognition,ASR)
- 情感语调分类
- 语音合成与生成
- 情感感知对话AI智能体(AI Agent)
---
## 预期用途
### ✅ 直接用途
- 针对带印度英语口音的自动语音识别模型开展训练与基准测试
- 从语音信号中进行情感检测与分类
- 情感计算与共情人工智能领域的研究
### ❌ 不适用范围
- 实时或生产级系统
- 未按CC BY 4.0协议正确署名的商业使用
- 临床或诊断类应用场景
---
## 注意事项与局限性
- ❗ 本数据集规模较小(样本量不足1000),无法完全覆盖印度的语言与情感多样性
- 💡 情感具有主观性——分类结果可能因评估听众或模型的不同而存在差异
- 🔄 未来版本将致力于拓展多语言支持与说话人多样性
---
## 许可协议
**CC BY 4.0** — 您可在标注适当来源的前提下使用、修改与分享本数据集。
---
## 联系方式
- 若有数据集相关的咨询或合作需求,请联系:
- anoushka@kgen.io
- abhishek.vadapalli@kgen.io
提供机构:
maas
创建时间:
2025-08-01
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个包含故意低质量样本的医疗意见英语音频集合,专门用于测试模型在噪声或降级输入条件下的鲁棒性。它支持自动语音识别、情感语调分类及语音合成等任务的模型训练与评估,但数据规模较小且情感标注具有主观性。
以上内容由遇见数据集搜集并总结生成



