contemmcm/drug-reviews
收藏Hugging Face2024-05-02 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/contemmcm/drug-reviews
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
task_categories:
- text-classification
dataset_info:
- config_name: default
features:
- name: id
dtype: int64
- name: drugName
dtype: string
- name: condition
dtype: string
- name: review
dtype: string
- name: rating
dtype:
class_label:
names:
'0': 1/10
'1': 2/10
'2': 3/10
'3': 4/10
'4': 5/10
'5': 6/10
'6': 7/10
'7': 8/10
'8': 9/10
'9': 10/10
- name: date
dtype: string
- name: usefulCount
dtype: int64
- config_name: polarity
features:
- name: id
dtype: int64
- name: drugName
dtype: string
- name: condition
dtype: string
- name: review
dtype: string
- name: rating
dtype:
class_label:
names:
'0': negative
'1': positive
- name: date
dtype: string
- name: usefulCount
dtype: int64
configs:
- config_name: default
data_files:
- split: complete
path: "complete.csv"
- split: train
path: "train.csv"
- split: test
path: "test.csv"
default: true
- config_name: polarity
data_files:
- split: complete
path: "positive_negative.csv"
- split: balanced
path: "positive_negative_balanced.csv"
---
提供机构:
contemmcm
原始信息汇总
数据集概述
数据集配置
-
默认配置 (config_name: default)
- 特征:
- id: 数据类型为 int64
- drugName: 数据类型为 string
- condition: 数据类型为 string
- review: 数据类型为 string
- rating: 数据类型为 class_label,评分等级从1/10到10/10
- date: 数据类型为 string
- usefulCount: 数据类型为 int64
- 数据文件:
- 完整数据集: "complete.csv"
- 训练集: "train.csv"
- 测试集: "test.csv"
- 特征:
-
极性配置 (config_name: polarity)
- 特征:
- id: 数据类型为 int64
- drugName: 数据类型为 string
- condition: 数据类型为 string
- review: 数据类型为 string
- rating: 数据类型为 class_label,评分等级为 negative 或 positive
- date: 数据类型为 string
- usefulCount: 数据类型为 int64
- 数据文件:
- 完整数据集: "positive_negative.csv"
- 平衡数据集: "positive_negative_balanced.csv"
- 特征:



