Abdou/dz-sentiment-yt-comments
收藏Hugging Face2023-11-06 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Abdou/dz-sentiment-yt-comments
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-classification
language:
- ar
size_categories:
- 10K<n<100K
---
# A Sentiment Analysis Dataset for the Algerian Dialect of Arabic
This dataset consists of 50,016 samples of comments extracted from Algerian YouTube channels. It is manually annotated with 3 classes (the `label` column) and is not balanced. Here are the number of rows of each class:
- 0 (Negative): **17,033 (34.06%)**
- 1 (Neutral): **11,136 (22.26%)**
- 2 (Positive): **21,847 (43.68%)**
Please note that there are some swear words in the dataset, so please use it with caution.
# Citation
If you find our work useful, please cite it as follows:
```bibtex
@article{2023,
title={Sentiment Analysis on Algerian Dialect with Transformers},
author={Zakaria Benmounah and Abdennour Boulesnane and Abdeladim Fadheli and Mustapha Khial},
journal={Applied Sciences},
volume={13},
number={20},
pages={11157},
year={2023},
month={Oct},
publisher={MDPI AG},
DOI={10.3390/app132011157},
ISSN={2076-3417},
url={http://dx.doi.org/10.3390/app132011157}
}
```
提供机构:
Abdou
原始信息汇总
数据集概述
数据集名称
A Sentiment Analysis Dataset for the Algerian Dialect of Arabic
数据集描述
该数据集包含从阿尔及利亚YouTube频道提取的50,016条评论样本。这些样本被手动标注为3个类别(label列),并且数据集不平衡。各类别的样本数量如下:
- 0 (Negative): 17,033 (34.06%)
- 1 (Neutral): 11,136 (22.26%)
- 2 (Positive): 21,847 (43.68%)
注意事项
数据集中包含一些粗俗词汇,请谨慎使用。
引用信息
如果该数据集对您的研究有用,请按以下方式引用: bibtex @article{2023, title={Sentiment Analysis on Algerian Dialect with Transformers}, author={Zakaria Benmounah and Abdennour Boulesnane and Abdeladim Fadheli and Mustapha Khial}, journal={Applied Sciences}, volume={13}, number={20}, pages={11157}, year={2023}, month={Oct}, publisher={MDPI AG}, DOI={10.3390/app132011157}, ISSN={2076-3417}, url={http://dx.doi.org/10.3390/app132011157} }



