five

Abdou/dz-sentiment-yt-comments

收藏
Hugging Face2023-11-06 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Abdou/dz-sentiment-yt-comments
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-classification language: - ar size_categories: - 10K<n<100K --- # A Sentiment Analysis Dataset for the Algerian Dialect of Arabic This dataset consists of 50,016 samples of comments extracted from Algerian YouTube channels. It is manually annotated with 3 classes (the `label` column) and is not balanced. Here are the number of rows of each class: - 0 (Negative): **17,033 (34.06%)** - 1 (Neutral): **11,136 (22.26%)** - 2 (Positive): **21,847 (43.68%)** Please note that there are some swear words in the dataset, so please use it with caution. # Citation If you find our work useful, please cite it as follows: ```bibtex @article{2023, title={Sentiment Analysis on Algerian Dialect with Transformers}, author={Zakaria Benmounah and Abdennour Boulesnane and Abdeladim Fadheli and Mustapha Khial}, journal={Applied Sciences}, volume={13}, number={20}, pages={11157}, year={2023}, month={Oct}, publisher={MDPI AG}, DOI={10.3390/app132011157}, ISSN={2076-3417}, url={http://dx.doi.org/10.3390/app132011157} } ```
提供机构:
Abdou
原始信息汇总

数据集概述

数据集名称

A Sentiment Analysis Dataset for the Algerian Dialect of Arabic

数据集描述

该数据集包含从阿尔及利亚YouTube频道提取的50,016条评论样本。这些样本被手动标注为3个类别(label列),并且数据集不平衡。各类别的样本数量如下:

  • 0 (Negative): 17,033 (34.06%)
  • 1 (Neutral): 11,136 (22.26%)
  • 2 (Positive): 21,847 (43.68%)

注意事项

数据集中包含一些粗俗词汇,请谨慎使用。

引用信息

如果该数据集对您的研究有用,请按以下方式引用: bibtex @article{2023, title={Sentiment Analysis on Algerian Dialect with Transformers}, author={Zakaria Benmounah and Abdennour Boulesnane and Abdeladim Fadheli and Mustapha Khial}, journal={Applied Sciences}, volume={13}, number={20}, pages={11157}, year={2023}, month={Oct}, publisher={MDPI AG}, DOI={10.3390/app132011157}, ISSN={2076-3417}, url={http://dx.doi.org/10.3390/app132011157} }

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作