odia-language-audiodataset
收藏魔搭社区2025-12-05 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/Kratos-AI/odia-language-audiodataset
下载链接
链接失效反馈官方服务:
资源简介:
# Odia Language Audio Dataset
**Text spoken by all participants:**
"କୃତ୍ରିମ ବୁଦ୍ଧିମତ୍ତା (AI) ଦ୍ରୁତ ଗତିରେ ବିକଶିତ ହେଉଛି, ଯାହା ଦୈନଦିନ ଜୀବନକୁ ବଦଳାଉଛି। ଏହାର ନୂଆପଣ ଶିକ୍ଷା, ସ୍ୱାସ୍ଥ୍ୟସେବା ଓ କାମକୁ ଉନ୍ନତ କରୁଛି, ନୂଆ ସୁଯୋଗ ସୃଷ୍ଟି କରୁଛି।"
The dataset supports training and evaluation of models in:
- Automatic Speech Recognition (ASR)
- Emotional tone classification
- Voice synthesis and generation
- Emotion-aware conversational agents
---
## Intended Uses
### ✅ Direct Use
- Training and benchmarking ASR models with Indian-accented Marathi.
- Emotion detection and classification from voice
- Research in affective computing and empathetic AI
### ❌ Out-of-Scope Use
- Real-time or production-grade systems
- Commercial use without proper CC BY 4.0 attribution
- Clinical or diagnostic use cases
---
## Considerations and Limitations
- ❗ The dataset is small (<1,000 samples) and not fully representative of India's linguistic and emotional diversity
- 💡 Emotions are subjective — classification results may vary by listener or model
- 🔄 Future versions will aim to expand multilingual support and speaker diversity
---
## License
**CC BY 4.0** — You can use, modify, and share the dataset with appropriate credit.
---
## Contact
- For queries or collaborations related to datasets, contact at :
- anoushka@kgen.io
- abhishek.vadapalli@kgen.io
---
# 奥里亚语(Odia)音频数据集
**所有参与者的朗读文本:**
“人工智能(AI)正以极快的速度发展,不断改变着我们的日常生活。它正推动教育、医疗服务与工作领域的革新,并创造全新的发展机遇。”
本数据集可支持以下方向的模型训练与评估:
- 自动语音识别(Automatic Speech Recognition,ASR)
- 情感语调分类
- 语音合成与生成
- 情感感知对话AI智能体(AI Agent)
---
## 预期用途
### ✅ 直接使用场景
- 针对带有印度口音的马拉地语自动语音识别模型开展训练与基准测试
- 从语音中进行情感检测与分类
- 情感计算与共情AI领域的相关研究
### ❌ 超出适用范围的使用场景
- 实时系统或工业级生产系统
- 未按CC BY 4.0协议进行恰当署名的商业使用
- 临床或诊断类应用场景
---
## 注意事项与局限性
- ❗ 本数据集规模较小(样本量不足1000),未能完全覆盖印度的语言与情感多样性
- 💡 情感具有主观性——分类结果可能因听众或模型的不同而存在差异
- 🔄 未来版本将致力于拓展多语言支持与说话人多样性
---
## 授权协议
**CC BY 4.0** — 您可在恰当署名的前提下使用、修改及分享本数据集。
---
## 联系方式
- 若有关于本数据集的咨询或合作需求,请联系:
- anoushka@kgen.io
- abhishek.vadapalli@kgen.io
提供机构:
maas
创建时间:
2025-08-29



