MightyOctopus/amazon-pricer-dataset-v2-0
Source: Hugging Face | Updated: 2025-12-17 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/MightyOctopus/amazon-pricer-dataset-v2-0
Description:
This dataset contains Amazon product text paired with ground-truth prices, derived from the McAuley-Lab/Amazon-Reviews-2023 dataset. Version 2.0 extends v1 (MightyOctopus/amazon-pricer-dataset) by adding structured product category information to each sample. The dataset is designed for training and evaluating language models on text-to-number regression tasks such as product price prediction. Each record has two fields: text (a structured natural-language prompt containing the product description and category) and price (the ground-truth product price in USD). Intended uses are LLM-based price prediction, benchmarking against classical ML models, and research and education. Out-of-scope uses include real-time pricing systems and financial or commercial decision-making.
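To make the text-to-number regression setup concrete, the following is a minimal Python sketch of how predictions against the price field might be scored. It is illustrative only: the sample records, the model outputs, the helper names (parse_price, mean_absolute_error, rmsle), and the choice of metrics are assumptions and are not taken from the dataset card. The text values are invented placeholders, not the dataset's actual prompt format.

```python
import math
import re
from typing import Optional


def parse_price(generated: str) -> Optional[float]:
    """Return the first number found in a model's text output, or None."""
    # Drop thousands separators so "1,299.50" is read as one number.
    match = re.search(r"\d+(?:\.\d+)?", generated.replace(",", ""))
    return float(match.group()) if match else None


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference, in the same unit as the prices (USD)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmsle(y_true, y_pred):
    """Root mean squared log error; less dominated by expensive items than MAE."""
    squared = [(math.log1p(p) - math.log1p(t)) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical records with the two documented fields (text, price).
samples = [
    {"text": "placeholder prompt for a USB-C cable", "price": 9.99},
    {"text": "placeholder prompt for a desk lamp", "price": 24.50},
    {"text": "placeholder prompt for a backpack", "price": 39.00},
]

# Hypothetical raw model outputs, one per sample.
outputs = ["$10.49", "Price is 22.00", "no idea"]

y_true = [s["price"] for s in samples]
# Assumption: an unparseable output is scored as a prediction of 0.0.
parsed = [parse_price(o) for o in outputs]
y_pred = [p if p is not None else 0.0 for p in parsed]

print("MAE:", mean_absolute_error(y_true, y_pred))
print("RMSLE:", rmsle(y_true, y_pred))
```

The same two metric functions could be applied to the predictions of a classical baseline, which is one way to run the benchmarking comparison the description mentions.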
Provider:
MightyOctopus



