five

BHAAV (भाव)

收藏
arXiv2019-10-09 更新2024-06-21 收录
下载链接:
https://doi.org/10.5281/zenodo.3457467
下载链接
链接失效反馈
官方服务:
资源简介:
BHAAV (भाव) 是一个专为情感分析设计的印地语文本语料库,由Adobe系统公司创建。该数据集包含20,304个句子,来源于230篇不同类型的短篇故事,涵盖18种流派。每个句子都由至少三位具有十年以上印地语教育背景的母语者标注,分为五种情感类别:愤怒、喜悦、悬念、悲伤和中性。BHAAV旨在解决印地语文本中情感表达的分析问题,特别适用于开发针对低资源语言的情感分析工具,并可用于改进自动讲故事体验,如在文本到语音系统中引入情感线索。

BHAAV (भाव) is a Hindi text corpus specifically designed for sentiment analysis, created by Adobe Systems Incorporated. This dataset contains 20,304 sentences sourced from 230 short stories across 18 distinct genres. Each sentence was annotated by at least three native Hindi speakers with over ten years of formal Hindi education, and categorized into five sentiment classes: anger, joy, suspense, sadness, and neutral. BHAAV aims to address the analysis of emotional expressions in Hindi text, and is particularly applicable for developing sentiment analysis tools for low-resource languages. It can also be used to enhance automatic storytelling experiences, such as incorporating emotional cues into text-to-speech systems.
提供机构:
Adobe系统公司
创建时间:
2019-10-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作