ksabeh/openbrand-zs
收藏Hugging Face2023-08-28 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ksabeh/openbrand-zs
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: category
dtype: string
- name: title
dtype: string
- name: brand
dtype: string
- name: asin
dtype: string
- name: imageURL
dtype: string
- name: position_index
dtype: int64
- name: num_tokens
dtype: int64
- name: title_length
dtype: int64
- name: title_category
dtype: string
splits:
- name: train
num_bytes: 24211621
num_examples: 61075
- name: val
num_bytes: 2685833
num_examples: 6788
- name: test
num_bytes: 9453851
num_examples: 25221
- name: electronics
num_bytes: 2423259
num_examples: 4786
- name: sports
num_bytes: 1904597
num_examples: 5420
- name: toys
num_bytes: 2078207
num_examples: 6329
- name: automotive
num_bytes: 2271017
num_examples: 6446
- name: grocery
num_bytes: 776771
num_examples: 2240
download_size: 13092616
dataset_size: 45805156
---
# Dataset Card for "openbrand-zs"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset includes multiple features such as category, title, brand, ASIN, imageURL, position_index, num_tokens, title_length, and title_category. The dataset is divided into several subsets including train, validation, test, and specific category subsets (such as electronics, sports, toys, automotive, and grocery). Each subset has its corresponding number of bytes and examples. The total download size and dataset size are also provided.
提供机构:
ksabeh
原始信息汇总
数据集概述
特征信息
- category: 字符串类型
- title: 字符串类型
- brand: 字符串类型
- asin: 字符串类型
- imageURL: 字符串类型
- position_index: 64位整数类型
- num_tokens: 64位整数类型
- title_length: 64位整数类型
- title_category: 字符串类型
数据分割
- train: 24211621字节, 61075个样本
- val: 2685833字节, 6788个样本
- test: 9453851字节, 25221个样本
- electronics: 2423259字节, 4786个样本
- sports: 1904597字节, 5420个样本
- toys: 2078207字节, 6329个样本
- automotive: 2271017字节, 6446个样本
- grocery: 776771字节, 2240个样本
数据集大小
- 下载大小: 13092616字节
- 数据集大小: 45805156字节



