five

hugosousa/AllProducts

收藏
Hugging Face2024-12-02 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/hugosousa/AllProducts
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含两个配置项:cleaned_items和cleaned_products。cleaned_items配置项包含两个特征:product_id(产品ID)和product_title(产品标题),数据类型均为字符串。cleaned_products配置项包含三个特征:product_id(产品ID)、product_title(产品标题)和source(来源),数据类型均为字符串。两个配置项的训练集分别包含35,079,201个示例,cleaned_items的训练集大小为3,725,568,272.9847035字节,cleaned_products的训练集大小为4,470,244,722.926626字节。数据文件的路径分别为cleaned_items/train-*和cleaned_products/train-*。

The dataset contains two configurations: cleaned_items and cleaned_products. The cleaned_items configuration includes two features: product_id and product_title, both of which are of string type. The cleaned_products configuration includes three features: product_id, product_title, and source, all of which are of string type. The training sets for both configurations contain 35,079,201 examples, with the cleaned_items training set size being 3,725,568,272.9847035 bytes and the cleaned_products training set size being 4,470,244,722.926626 bytes. The data file paths are cleaned_items/train-* and cleaned_products/train-* respectively.
提供机构:
hugosousa
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作