sawalni-ai/fw-darija-websites
收藏Hugging Face2024-12-08 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/sawalni-ai/fw-darija-websites
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,涉及域名、计数、分数、首次出现时间、最后出现时间、生命周期、单词计数、每文档平均单词数、每天平均单词数、顶级域名、IP地址、国家、总令牌数和每天令牌数等信息。数据集分为训练集,包含4003个样本,总大小为495711字节。
This dataset includes multiple fields such as domain, count, score, first seen, last seen, lifetime, word count, average words per document, average words per day, top-level domain, IP address, country, total tokens, and tokens per day. The dataset is divided into a training set containing 4003 samples with a total size of 495711 bytes.
提供机构:
sawalni-ai



