thulthula/Amazon-Reviews-2023-Extended-min500char
收藏Hugging Face2026-04-10 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/thulthula/Amazon-Reviews-2023-Extended-min500char
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: All_Beauty
data_files:
- split: train
path: data/All_Beauty/train-*.parquet
- config_name: Arts_Crafts_and_Sewing
data_files:
- split: train
path: data/Arts_Crafts_and_Sewing/train-*.parquet
- config_name: Automotive
data_files:
- split: train
path: data/Automotive/train-*.parquet
- config_name: Baby_Products
data_files:
- split: train
path: data/Baby_Products/train-*.parquet
- config_name: Beauty_and_Personal_Care
data_files:
- split: train
path: data/Beauty_and_Personal_Care/train-*.parquet
- config_name: Books
data_files:
- split: train
path: data/Books/train-*.parquet
- config_name: CDs_and_Vinyl
data_files:
- split: train
path: data/CDs_and_Vinyl/train-*.parquet
- config_name: Cell_Phones_and_Accessories
data_files:
- split: train
path: data/Cell_Phones_and_Accessories/train-*.parquet
- config_name: Clothing_Shoes_and_Jewelry
data_files:
- split: train
path: data/Clothing_Shoes_and_Jewelry/train-*.parquet
- config_name: Electronics
data_files:
- split: train
path: data/Electronics/train-*.parquet
- config_name: Gift_Cards
data_files:
- split: train
path: data/Gift_Cards/train-*.parquet
- config_name: Grocery_and_Gourmet_Food
data_files:
- split: train
path: data/Grocery_and_Gourmet_Food/train-*.parquet
- config_name: Health_and_Household
data_files:
- split: train
path: data/Health_and_Household/train-*.parquet
- config_name: Home_and_Kitchen
data_files:
- split: train
path: data/Home_and_Kitchen/train-*.parquet
- config_name: Industrial_and_Scientific
data_files:
- split: train
path: data/Industrial_and_Scientific/train-*.parquet
- config_name: Kindle_Store
data_files:
- split: train
path: data/Kindle_Store/train-*.parquet
- config_name: Magazine_Subscriptions
data_files:
- split: train
path: data/Magazine_Subscriptions/train-*.parquet
- config_name: Movies_and_TV
data_files:
- split: train
path: data/Movies_and_TV/train-*.parquet
- config_name: Musical_Instruments
data_files:
- split: train
path: data/Musical_Instruments/train-*.parquet
- config_name: Office_Products
data_files:
- split: train
path: data/Office_Products/train-*.parquet
- config_name: Patio_Lawn_and_Garden
data_files:
- split: train
path: data/Patio_Lawn_and_Garden/train-*.parquet
- config_name: Pet_Supplies
data_files:
- split: train
path: data/Pet_Supplies/train-*.parquet
- config_name: Software
data_files:
- split: train
path: data/Software/train-*.parquet
- config_name: Sports_and_Outdoors
data_files:
- split: train
path: data/Sports_and_Outdoors/train-*.parquet
- config_name: Tools_and_Home_Improvement
data_files:
- split: train
path: data/Tools_and_Home_Improvement/train-*.parquet
- config_name: Toys_and_Games
data_files:
- split: train
path: data/Toys_and_Games/train-*.parquet
- config_name: Unknown
data_files:
- split: train
path: data/Unknown/train-*.parquet
- config_name: Video_Games
data_files:
- split: train
path: data/Video_Games/train-*.parquet
---
# Amazon Reviews 2023 — Extended 5-core Dataset
Extended version of [McAuley-Lab/Amazon-Reviews-2023](https://huggingface.co/datasets/McAuley-Lab/Amazon-Reviews-2023) with review text and product metadata joined. Only products with combined text (title + features + description) ≥ 500 characters are included.
## Columns
- `split` — train / valid / test (last-out 5-core benchmark)
- `user_id` — reviewer ID
- `parent_asin` — product group ID
- `asin` — exact variant purchased
- `product_images` — list of hi-res image URLs
- `product_title` — product name
- `features` — bullet-point product features
- `description` — product description
- `review_text` — full review text
- `review_rating` — 1–5 star rating
提供机构:
thulthula



