wentingzhao/test-info-medical
收藏Hugging Face2024-05-23 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/wentingzhao/test-info-medical
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: content
dtype: string
- name: country
dtype: string
- name: hashed_ip
dtype: string
- name: header
struct:
- name: accept-language
dtype: string
- name: user-agent
dtype: string
- name: language
dtype: string
- name: redacted
dtype: bool
- name: role
dtype: string
- name: state
dtype: string
- name: timestamp
dtype: 'null'
- name: toxic
dtype: bool
- name: turn_identifier
dtype: int64
- name: info
dtype: bool
- name: medical
dtype: bool
splits:
- name: train
num_bytes: 686569
num_examples: 1698
download_size: 296105
dataset_size: 686569
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
The dataset includes multiple features such as content, country, hashed IP, header information (including accept-language and user-agent), language, whether it is redacted, role, state, timestamp, whether it is toxic, turn identifier, information, whether it is medical, etc. The dataset is divided into a training set, containing 1698 samples, with a total size of 686569 bytes. The configuration name of the dataset is default, and the data file path is data/train-*.
提供机构:
wentingzhao
原始信息汇总
数据集概述
数据集特征
- content: 数据类型 - string
- country: 数据类型 - string
- hashed_ip: 数据类型 - string
- header: 结构化数据类型
- accept-language: 数据类型 - string
- user-agent: 数据类型 - string
- language: 数据类型 - string
- redacted: 数据类型 - bool
- role: 数据类型 - string
- state: 数据类型 - string
- timestamp: 数据类型 - null
- toxic: 数据类型 - bool
- turn_identifier: 数据类型 - int64
- info: 数据类型 - bool
- medical: 数据类型 - bool
数据集分割
- train:
- 数据大小: 686569 bytes
- 示例数量: 1698
数据集大小
- 下载大小: 296105 bytes
- 数据集大小: 686569 bytes
配置
- config_name: default
- data_files:
- split: train
- path: data/train-*



