MagedSaeed/wasm
收藏Hugging Face2025-01-05 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/MagedSaeed/wasm
下载链接
链接失效反馈官方服务:
资源简介:
WASM数据集是一个为阿拉伯语推文设计的标签推荐基准数据集,包含101099条经过精心过滤的推文和87个独特的标签。该数据集可用于推文和标签相关的多种任务,如推文分类和标签生成。数据集在构建过程中使用了Twitter官方API和Tweepy工具,经过两阶段的收集和严格的过滤处理,确保了数据的相关性和质量。
The WASM dataset is a benchmark dataset designed for hashtag recommendation in Arabic tweets, containing 101,099 meticulously filtered tweets and 87 unique hashtags. It can be used for a variety of tasks related to tweets and hashtags, such as tweet classification and hashtag generation. The dataset was collected using Twitters official APIs and Tweepy in a two-phase process, undergoing rigorous filtering to ensure relevance and quality.
提供机构:
MagedSaeed



