DGurgurov/urdu_sa
收藏数据集概述
数据集名称
Sentiment Analysis Data for the Urdu Language
数据集描述
本数据集包含由Khan et al. (2020)提供的情感分析数据,专为乌尔都语设计。
数据结构
该数据用于项目“通过图知识改进低资源语言的词嵌入”(improving word embeddings with graph knowledge for Low Resource Languages)。
语言
乌尔都语
任务类别
- 文本分类
许可证
MIT
引用信息
bibtex @inproceedings{khan2017harnessing, title={Harnessing English Sentiment Lexicons for Polarity Detection in Urdu Tweets: A Baseline Approach}, author={Khan, Muhammad Yaseen and Emaduddin, Shah Muhammad and Junejo, Khurum Nazir}, booktitle={2017 IEEE 11th International Conference on Semantic Computing (ICSC)}, pages={242--249}, year={2017}, organization={IEEE} }
@inproceedings{khan2020usc, title={Urdu Sentiment Corpus (v1.0): Linguistic Exploration and Visualization of Labeled Datasetfor Urdu Sentiment Analysis.}, author={Khan, Muhammad Yaseen and Nizami, Muhammad Suffian}, booktitle={2020 IEEE 2nd International Conference On Information Science & Communication Technology (ICISCT)}, pages={}, year={2020}, organization={IEEE} }



