ricardosantoss/top12_com_relatorios_de_alta
Source: Hugging Face. Updated: 2023-11-27. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/ricardosantoss/top12_com_relatorios_de_alta
Resource description:
---
dataset_info:
  features:
  - name: Nota Clinica
    dtype: string
  - name: Sequencia_CID10_Lista
    sequence: string
  - name: __index_level_0__
    dtype: int64
  splits:
  - name: train
    num_bytes: 1906716
    num_examples: 1899
  - name: test
    num_bytes: 240160
    num_examples: 238
  - name: validation
    num_bytes: 237032
    num_examples: 237
  download_size: 954473
  dataset_size: 2383908
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
  - split: validation
    path: data/validation-*
---
This dataset is intended primarily for clinical note analysis. It has three fields: Nota Clinica (the clinical note text, a string), Sequencia_CID10_Lista (a list of CID-10 codes, CID-10 being the Portuguese name for ICD-10), and __index_level_0__ (an int64 index column, most likely carried over from a pandas DataFrame index). The data is split into training, test, and validation sets of 1899, 238, and 237 examples respectively.
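The description above can be turned into a short loading sketch. It assumes the third-party `datasets` package is installed and that the Hub (or a mirror) is reachable; the helper names `load_and_check` and `first_example` are hypothetical and exist only for this illustration. The row counts are the ones stated on the card.

```python
# Sketch only: assumes `pip install datasets` and network access to the Hub.
DATASET_ID = "ricardosantoss/top12_com_relatorios_de_alta"

# Split sizes as stated in the dataset card.
EXPECTED_ROWS = {"train": 1899, "test": 238, "validation": 237}


def load_and_check(dataset_id: str = DATASET_ID):
    """Load all splits and compare their row counts with the card."""
    from datasets import load_dataset  # imported lazily; third-party package

    splits = load_dataset(dataset_id)  # DatasetDict keyed by split name
    for name, expected in EXPECTED_ROWS.items():
        actual = splits[name].num_rows
        if actual != expected:
            raise ValueError(f"{name}: expected {expected} rows, found {actual}")
    return splits


def first_example(splits):
    """Return the clinical note and its code list from the first training row."""
    row = splits["train"][0]
    # The note column name contains a space: "Nota Clinica".
    return row["Nota Clinica"], row["Sequencia_CID10_Lista"]
```

Calling `first_example(load_and_check())` would return one note together with its list of codes. The count check is there so that a later change to the hosted data is noticed rather than silently accepted.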
Provided by:
ricardosantoss
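Summary of original information
Dataset overview
Data features
- Nota Clinica: string.
- Sequencia_CID10_Lista: sequence of strings.
- __index_level_0__: 64-bit integer (int64).
Data splits
- Training set (train):
  - Bytes: 1906716
  - Examples: 1899
- Test set (test):
  - Bytes: 240160
  - Examples: 238
- Validation set (validation):
  - Bytes: 237032
  - Examples: 237
Dataset size
- Download size: 954473 bytes
- Dataset size: 2383908 bytes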
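As a quick sanity check on the figures listed above, the following sketch recomputes the totals and the share of examples in each split. All numbers are copied from the card; the variable names are illustrative only.

```python
# Figures copied from the dataset card.
SPLITS = {
    "train": {"num_bytes": 1906716, "num_examples": 1899},
    "test": {"num_bytes": 240160, "num_examples": 238},
    "validation": {"num_bytes": 237032, "num_examples": 237},
}
DATASET_SIZE = 2383908
DOWNLOAD_SIZE = 954473

total_examples = sum(s["num_examples"] for s in SPLITS.values())
total_bytes = sum(s["num_bytes"] for s in SPLITS.values())

# Percentage of all examples held by each split, rounded to one decimal.
shares = {
    name: round(100 * s["num_examples"] / total_examples, 1)
    for name, s in SPLITS.items()
}

# The per-split byte counts should add up to the reported dataset size.
sizes_consistent = total_bytes == DATASET_SIZE

# Ratio of the download size to the dataset size.
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(total_examples, shares, sizes_consistent, round(download_ratio, 2))
```

The per-split byte counts sum exactly to the reported dataset size, the split is roughly 80/10/10, and the download is about 40% of the dataset size.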
Configuration
- Config name: default
- Data file paths:
  - Training set: data/train-*
  - Test set: data/test-*
  - Validation set: data/validation-*
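The path patterns above are globs that map files in the repository to splits. The sketch below approximates that mapping with the standard-library `fnmatch`; this is an assumption for illustration, not the Hub's actual resolution logic, and the shard file names in the usage lines are hypothetical, since the card does not list them.

```python
from fnmatch import fnmatch

# Split-to-pattern mapping, copied from the "default" config on the card.
DATA_FILES = {
    "train": "data/train-*",
    "test": "data/test-*",
    "validation": "data/validation-*",
}


def split_for(path: str):
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatch(path, pattern):
            return split
    return None


# Hypothetical file names, used only to exercise the patterns.
for candidate in [
    "data/train-00000-of-00001.parquet",
    "data/validation-00000-of-00001.parquet",
    "README.md",
]:
    print(candidate, "->", split_for(candidate))
```

Files that match none of the three patterns, such as a README, are not assigned to any split.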



