gevam/wine-reviews
收藏Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/gevam/wine-reviews
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: country
dtype: string
- name: description
dtype: string
- name: designation
dtype: string
- name: points
dtype: int64
- name: price
dtype: float64
- name: province
dtype: string
- name: region_1
dtype: string
- name: region_2
dtype: string
- name: variety
dtype: string
- name: winery
dtype: string
- name: taster_name
dtype: string
- name: taster_twitter_handle
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 78963991.68931402
num_examples: 196630
- name: validation
num_bytes: 11280570.241330575
num_examples: 28090
- name: test
num_bytes: 22561542.069355395
num_examples: 56181
download_size: 56884103
dataset_size: 112806104
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
task_categories:
- text-classification
---
# Original Dataset Details
- **License:** [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/)
- **Attribution:** Zackthoutt
- **Source:** [Wine Reviews Dataset on Kaggle](https://www.kaggle.com/datasets/zynicide/wine-reviews)
数据集信息(dataset_info):
特征列表:
- 字段名:国家(country),数据类型:字符串(string)
- 字段名:品鉴描述(description),数据类型:字符串(string)
- 字段名:酒款标识(designation),数据类型:字符串(string)
- 字段名:评分(points),数据类型:64位整型(int64)
- 字段名:价格(price),数据类型:64位浮点型(float64)
- 字段名:省份(province),数据类型:字符串(string)
- 字段名:一级产区(region_1),数据类型:字符串(string)
- 字段名:二级产区(region_2),数据类型:字符串(string)
- 字段名:葡萄品种(variety),数据类型:字符串(string)
- 字段名:酒庄(winery),数据类型:字符串(string)
- 字段名:品鉴师姓名(taster_name),数据类型:字符串(string)
- 字段名:品鉴师推特账号(taster_twitter_handle),数据类型:字符串(string)
- 字段名:酒款标题(title),数据类型:字符串(string)
数据集拆分:
- 训练集(train):字节占用量78963991.68931402,样本数量196630
- 验证集(validation):字节占用量11280570.241330575,样本数量28090
- 测试集(test):字节占用量22561542.069355395,样本数量56181
下载总大小:56884103
数据集总存储大小:112806104
配置项:
- 配置名称:默认(default),数据文件路径:
- 训练集(train):data/train-*
- 验证集(validation):data/validation-*
- 测试集(test):data/test-*
任务类别:文本分类(text-classification)
# 原始数据集详情
- 许可证:[知识共享署名-非商业性使用-相同方式共享4.0国际许可协议(CC BY-NC-SA 4.0)](https://creativecommons.org/licenses/by-nc-sa/4.0/)
- 署名方:Zackthoutt
- 数据来源:[Kaggle平台葡萄酒品鉴数据集(Wine Reviews Dataset)](https://www.kaggle.com/datasets/zynicide/wine-reviews)
提供机构:
gevam



