five

abdessamad-bourkibate/Scam-Religious-Detection

收藏
Hugging Face2026-03-01 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/abdessamad-bourkibate/Scam-Religious-Detection
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-classification language: - ar tags: - religious-scam - detection - twitter - arabic-nlp pretty_name: Scam Religious Detection Dataset size_categories: - 1B<n<10B --- # Scam Religious Detection Dataset This dataset is designed for detecting religious-based scams, particularly on social media platforms like Twitter. It contains a collection of tweets categorized into various classes to facilitate the training of machine learning models for scam detection. ## Dataset Details - **Dataset Name:** Scam-Religious-Detection - **Primary Language:** Arabic (ar) - **License:** MIT - **Total Size:** ~4.89 GB ## Dataset Structure The dataset consists of several CSV files: | File Name | Size | Description | |-----------|------|-------------| | `TweetsStreamingTotal.csv` | 4.51 GB | Large-scale collection of raw tweets related to religious topics. | | `scam_religious_corpus_v1.csv` | 225 MB | Curated corpus specifically focused on religious scams. | | `real_legit_tweets.csv` | 96.9 MB | Collection of legitimate (non-scam) religious tweets. | | `merged_final_corpus.csv` | 41.8 MB | Merged and cleaned dataset ready for training. | | `real_scam_tweets.csv` | 11.3 MB | Verified scam religious tweets. | | `real_suspicious_tweets.csv` | 4.98 MB | Tweets identified as suspicious but requiring further analysis. | ## Usage You can load this dataset using the Hugging Face `datasets` library: ```python from datasets import load_dataset dataset = load_dataset("abdessamad-bourkibate/Scam-Religious-Detection") ## Maintenance This dataset is maintained by abdessamad-bourkibate.
提供机构:
abdessamad-bourkibate
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作