零售产品分类数据集
收藏arXiv2021-04-03 更新2024-06-21 收录
下载链接:
https://www.kaggle.com/c/retail-products-classification/data
下载链接
链接失效反馈官方服务:
资源简介:
零售产品分类数据集是由德国奥芬堡大学机器学习和分析研究所创建,包含约48000个产品,涵盖21个类别,每个产品配有100x100像素的彩色图像及描述文本。数据集分为42000个训练样本和6000多个测试样本,确保每个类别样本分布均匀。创建过程中,数据集从亚马逊2018年评论数据中提取,经过整理和分类。该数据集主要用于机器学习模型的评估,特别是产品图像和描述的分类预测,广泛应用于推荐系统、产品搜索引擎和内部供应链物流优化等领域。
The Retail Product Classification Dataset was developed by the Institute of Machine Learning and Analytics, Offenburg University, Germany. It contains approximately 48,000 products across 21 categories, where each product is paired with a 100×100 pixel color image and a descriptive text. The dataset is split into 42,000 training samples and over 6,000 test samples, ensuring a uniform sample distribution across all categories. During its construction, the dataset was extracted, curated and classified from Amazon's 2018 review data. This dataset is primarily used for evaluating machine learning models, especially classification prediction based on product images and descriptions, and is widely applied in recommendation systems, product search engines, internal supply chain logistics optimization and other related fields.
提供机构:
德国奥芬堡大学机器学习和分析研究所
创建时间:
2021-03-25
搜集汇总
背景与挑战
背景概述
零售产品分类数据集是一个由德国奥芬堡大学创建的大规模数据集,包含约48000个产品,涵盖21个类别,每个产品配有100x100像素的彩色图像和描述文本,并分为42000个训练样本和6000多个测试样本,确保类别分布均匀。该数据集从亚马逊2018年评论数据中提取,主要用于机器学习模型的评估,特别是产品图像和描述的分类预测,广泛应用于推荐系统、产品搜索引擎和供应链物流优化等领域。
以上内容由遇见数据集搜集并总结生成



