cls-e/labeler_07_final.top-3

Source: Hugging Face. Updated 2025-02-10. Indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/cls-e/labeler_07_final.top-3
Description:
The dataset contains web page URLs and, for each URL, an associated top-3 sequence string. It is divided into three splits: a training set with 100,000 examples (9,132,177 bytes), an unbalanced training set with 91,799 examples (8,369,636 bytes), and a test set with 8,201 examples (762,541 bytes). The total download size is 4,556,910 bytes, and the full dataset size is 18,264,354 bytes.
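The figures in the description can be cross-checked with a little arithmetic. The following is a minimal sketch that only restates the numbers quoted above; the dictionary keys (`train`, `unbalanced_train`, `test`) and the helper names are hypothetical labels chosen for illustration, not the split names used in the actual repository.

```python
# Split statistics copied from the description above.
# The keys are illustrative labels, not confirmed split names.
SPLITS = {
    "train": {"examples": 100_000, "bytes": 9_132_177},
    "unbalanced_train": {"examples": 91_799, "bytes": 8_369_636},
    "test": {"examples": 8_201, "bytes": 762_541},
}

REPORTED_DATASET_BYTES = 18_264_354   # "entire dataset size" in the description
REPORTED_DOWNLOAD_BYTES = 4_556_910   # "total download size" in the description


def total_bytes(splits):
    """Sum the per-split sizes in bytes."""
    return sum(split["bytes"] for split in splits.values())


def download_ratio(download_bytes, dataset_bytes):
    """Fraction of the full dataset size that the download occupies."""
    return download_bytes / dataset_bytes


if __name__ == "__main__":
    computed = total_bytes(SPLITS)
    print("sum of split sizes:", computed)
    print("matches reported total:", computed == REPORTED_DATASET_BYTES)
    ratio = download_ratio(REPORTED_DOWNLOAD_BYTES, REPORTED_DATASET_BYTES)
    print(f"download / dataset size: {ratio:.3f}")
```

The per-split byte counts add up exactly to the reported dataset size, and the download is roughly a quarter of that size, which would be consistent with a compressed storage format (an assumption; the description does not name the format).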
Provider:
cls-e