appvoid/raw-corpus
收藏Hugging Face2025-02-23 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/appvoid/raw-corpus
下载链接
链接失效反馈官方服务:
资源简介:
raw-corpus数据集是从tweets数据集衍生出的简化版本,包含了满足特定条件(至少8个单词、至少100个点赞且至少70%字符为拉丁字母)的推文。该数据集未经过滤,包含了原始的推文内容。
raw-corpus dataset is a reduced version derived from the tweets dataset, including tweets that meet specific criteria (at least 8 words, at least 100 likes, and at least 70% Latin alphabet characters). This dataset is unfiltered and contains raw tweet content.
提供机构:
appvoid



