Amazon Product Data|电商产品数据集|市场分析数据集
收藏🛒Amazon Product Data Analysis
📊Dataset Overview
- 🆔Product Details:
- Product ID
- Name
- Category
- Discounted Price
- Actual Price
- Discount Percentage
- ⭐Customer Ratings and Reviews:
- Rating
- Rating Count
- Review ID
- Review Title
- Review Content
- User ID
- User Name
- 📷Additional Information:
- Product Image Link
- Product Link
- Product Description
🎯Key Objectives and Queries
- 🥇Identifying the highest-rated products and those with the largest discounts.
- 💸Analyzing pricing trends across categories, including average, minimum, and maximum prices.
- 📈Exploring customer review patterns, such as the number of reviews per product and the average rating by category.
- 🌟Detecting the most popular products based on rating counts and reviews.
- 📉Calculating average discounts and evaluating how discount percentages correlate with product ratings and review counts.
- 📝Investigating product descriptions and user feedback to find common keywords or phrases related to high ratings.
💡Skills Demonstrated
- 🗃️SQL Querying: Advanced filtering, grouping, sorting, and aggregation techniques.
- 🔍Data Investigation: Extracting and interpreting trends in pricing, discounts, and user ratings.
- 📊Data Visualization & Reporting: Translating SQL results into meaningful visualizations and summaries for business insights.
- 🧩Analytical Problem-Solving: Leveraging SQL for complex, real-world data analysis challenges.
🔎Insights and Outcomes
- The analysis provides valuable insights into Amazon product trends and customer feedback, offering data-driven recommendations to optimize product listings, pricing strategies, and promotional discounts.

中国空气质量数据集(2014-2020年)
数据集中的空气质量数据类型包括PM2.5, PM10, SO2, NO2, O3, CO, AQI,包含了2014-2020年全国360个城市的逐日空气质量监测数据。监测数据来自中国环境监测总站的全国城市空气质量实时发布平台,每日更新。数据集的原始文件为CSV的文本记录,通过空间化处理生产出Shape格式的空间数据。数据集包括CSV格式和Shape格式两数数据格式。
国家地球系统科学数据中心 收录
Materials Project
材料项目是一组标有不同属性的化合物。数据集链接: MP 2018.6.1(69,239 个材料) MP 2019.4.1(133,420 个材料)
OpenDataLab 收录
典型分布式光伏出力预测数据集
光伏电站出力数据每5分钟从电站机房监控系统获取;气象实测数据从气象站获取,气象站建于电站30号箱变附近,每5分钟将采集的数据通过光纤传输到机房;数值天气预报数据利用中国电科院新能源气象应用机房的WRF业务系统(包括30TF计算刀片机、250TB并行存储)进行中尺度模式计算后输出预报产品,每日8点前通过反向隔离装置推送到电站内网预测系统。
国家基础学科公共科学数据中心 收录
PCLT20K
PCLT20K数据集是由湖南大学等机构创建的一个大规模PET-CT肺癌肿瘤分割数据集,包含来自605名患者的21,930对PET-CT图像,所有图像都带有高质量的像素级肿瘤区域标注。该数据集旨在促进医学图像分割研究,特别是在PET-CT图像中肺癌肿瘤的分割任务。
arXiv 收录
GME Data
关于2021年GameStop股票活动的数据,包括每日合并的GME短期成交量数据、每日失败交付数据、可借股数、期权链数据以及不同时间框架的开盘/最高/最低/收盘/成交量条形图。
github 收录