five

thulthula/Amazon-Reviews-2023-Extended-min500char

收藏
Hugging Face2026-04-10 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/thulthula/Amazon-Reviews-2023-Extended-min500char
下载链接
链接失效反馈
官方服务:
资源简介:
--- configs: - config_name: All_Beauty data_files: - split: train path: data/All_Beauty/train-*.parquet - config_name: Arts_Crafts_and_Sewing data_files: - split: train path: data/Arts_Crafts_and_Sewing/train-*.parquet - config_name: Automotive data_files: - split: train path: data/Automotive/train-*.parquet - config_name: Baby_Products data_files: - split: train path: data/Baby_Products/train-*.parquet - config_name: Beauty_and_Personal_Care data_files: - split: train path: data/Beauty_and_Personal_Care/train-*.parquet - config_name: Books data_files: - split: train path: data/Books/train-*.parquet - config_name: CDs_and_Vinyl data_files: - split: train path: data/CDs_and_Vinyl/train-*.parquet - config_name: Cell_Phones_and_Accessories data_files: - split: train path: data/Cell_Phones_and_Accessories/train-*.parquet - config_name: Clothing_Shoes_and_Jewelry data_files: - split: train path: data/Clothing_Shoes_and_Jewelry/train-*.parquet - config_name: Electronics data_files: - split: train path: data/Electronics/train-*.parquet - config_name: Gift_Cards data_files: - split: train path: data/Gift_Cards/train-*.parquet - config_name: Grocery_and_Gourmet_Food data_files: - split: train path: data/Grocery_and_Gourmet_Food/train-*.parquet - config_name: Health_and_Household data_files: - split: train path: data/Health_and_Household/train-*.parquet - config_name: Home_and_Kitchen data_files: - split: train path: data/Home_and_Kitchen/train-*.parquet - config_name: Industrial_and_Scientific data_files: - split: train path: data/Industrial_and_Scientific/train-*.parquet - config_name: Kindle_Store data_files: - split: train path: data/Kindle_Store/train-*.parquet - config_name: Magazine_Subscriptions data_files: - split: train path: data/Magazine_Subscriptions/train-*.parquet - config_name: Movies_and_TV data_files: - split: train path: data/Movies_and_TV/train-*.parquet - config_name: Musical_Instruments data_files: - split: train path: data/Musical_Instruments/train-*.parquet - config_name: Office_Products data_files: - split: train path: data/Office_Products/train-*.parquet - config_name: Patio_Lawn_and_Garden data_files: - split: train path: data/Patio_Lawn_and_Garden/train-*.parquet - config_name: Pet_Supplies data_files: - split: train path: data/Pet_Supplies/train-*.parquet - config_name: Software data_files: - split: train path: data/Software/train-*.parquet - config_name: Sports_and_Outdoors data_files: - split: train path: data/Sports_and_Outdoors/train-*.parquet - config_name: Tools_and_Home_Improvement data_files: - split: train path: data/Tools_and_Home_Improvement/train-*.parquet - config_name: Toys_and_Games data_files: - split: train path: data/Toys_and_Games/train-*.parquet - config_name: Unknown data_files: - split: train path: data/Unknown/train-*.parquet - config_name: Video_Games data_files: - split: train path: data/Video_Games/train-*.parquet --- # Amazon Reviews 2023 — Extended 5-core Dataset Extended version of [McAuley-Lab/Amazon-Reviews-2023](https://huggingface.co/datasets/McAuley-Lab/Amazon-Reviews-2023) with review text and product metadata joined. Only products with combined text (title + features + description) ≥ 500 characters are included. ## Columns - `split` — train / valid / test (last-out 5-core benchmark) - `user_id` — reviewer ID - `parent_asin` — product group ID - `asin` — exact variant purchased - `product_images` — list of hi-res image URLs - `product_title` — product name - `features` — bullet-point product features - `description` — product description - `review_text` — full review text - `review_rating` — 1–5 star rating
提供机构:
thulthula
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作