Open e-commerce 1.0: Five years of crowdsourced U.S. Amazon purchase histories with user demographics

DataCite Commons | Updated 2025-05-12 | Indexed 2025-05-17
Download link:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/YGLYDY
Description:
This dataset contains longitudinal purchase data from 5027 Amazon.com users in the US, spanning 2018 through 2022, in amazon-purchases.csv. It also includes demographic data and other consumer-level variables for each user with data in the dataset. These consumer-level variables were collected through an online survey and are included in survey.csv. fields.csv describes the columns in the survey.csv file, where fields/survey columns correspond to survey questions.

The dataset also contains the survey instrument used to collect the data. More details about the survey questions, the possible responses, and the format in which they were presented can be found in the survey instrument.

A 'Survey ResponseID' column is present in both the amazon-purchases.csv and survey.csv files. It links a user's survey responses to their Amazon.com purchases. The 'Survey ResponseID' was randomly generated at the time of data collection.

amazon-purchases.csv
Each row in this file corresponds to an Amazon order. Each such row has the following columns:
- Survey ResponseID
- Order date
- Shipping address state
- Purchase price per unit
- Quantity
- ASIN/ISBN (Product Code)
- Title
- Category

The data were exported by the Amazon users from Amazon.com and shared by users with their informed consent. PII and other information not listed above were stripped from the data. This processing occurred on users' machines before sharing with researchers.
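
The linkage through 'Survey ResponseID' described above can be sketched with pandas. This is a minimal illustration, not the dataset authors' code. The purchase column names come from the list above, but the sample rows, the `age_group` survey column, and the helper `load_and_link` are hypothetical stand-ins, since the actual survey columns are documented in fields.csv.

```python
import io

import pandas as pd

# Hypothetical sample rows standing in for amazon-purchases.csv.
# Column names follow the list in the dataset description.
PURCHASES_CSV = """Survey ResponseID,Order date,Shipping address state,Purchase price per unit,Quantity,ASIN/ISBN (Product Code),Title,Category
R_001,2018-03-14,MA,12.50,2,B000000001,Example notebook,NOTEBOOK
R_001,2019-07-02,MA,8.00,1,B000000002,Example pen,PEN
R_002,2022-11-25,CA,30.00,1,B000000003,Example lamp,LAMP
"""

# Hypothetical stand-in for survey.csv; "age_group" is an assumed column name.
SURVEY_CSV = """Survey ResponseID,age_group
R_001,25-34
R_002,35-44
"""


def load_and_link(purchases_src, survey_src):
    """Attach each user's survey answers to their purchase rows."""
    purchases = pd.read_csv(purchases_src, parse_dates=["Order date"])
    survey = pd.read_csv(survey_src)

    # Row-level spend: unit price times quantity.
    purchases["Line total"] = (
        purchases["Purchase price per unit"] * purchases["Quantity"]
    )
    purchases["Year"] = purchases["Order date"].dt.year

    # Many purchase rows map to one survey row per respondent.
    return purchases.merge(
        survey,
        on="Survey ResponseID",
        how="left",
        validate="many_to_one",
    )


linked = load_and_link(io.StringIO(PURCHASES_CSV), io.StringIO(SURVEY_CSV))

# Total spend per respondent per calendar year.
spend_by_user_year = linked.groupby(["Survey ResponseID", "Year"])["Line total"].sum()
print(spend_by_user_year)
```

With the real files, the `io.StringIO` objects would be replaced by the paths to amazon-purchases.csv and survey.csv. The `validate="many_to_one"` argument makes the merge fail loudly if a 'Survey ResponseID' appears more than once in the survey table.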

Provider:
Harvard Dataverse
Created:
2023-12-02
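Background overview
This dataset contains five years of purchase records and demographic information from 5027 U.S. Amazon users. It was collected through crowdsourcing and de-identified, and is suited to e-commerce and consumer behavior research.
The summary above was compiled and generated by 遇见数据集.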