five

justpyschitry/autotrain-data-Psychiatry_Article_Identifier

收藏
Hugging Face2022-06-15 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/justpyschitry/autotrain-data-Psychiatry_Article_Identifier
下载链接
链接失效反馈
官方服务:
资源简介:
--- task_categories: - text-classification --- # AutoTrain Dataset for project: Psychiatry_Article_Identifier ## Dataset Descritpion This dataset has been automatically processed by AutoTrain for project Psychiatry_Article_Identifier. ### Languages The BCP-47 code for the dataset's language is unk. ## Dataset Structure ### Data Instances A sample from this dataset looks as follows: ```json [ { "text": "diffuse actinic keratinocyte dysplasia", "target": 15 }, { "text": "cholesterol atheroembolism", "target": 8 } ] ``` ### Dataset Fields The dataset has the following fields (also called "features"): ```json { "text": "Value(dtype='string', id=None)", "target": "ClassLabel(num_classes=20, names=['Certain infectious or parasitic diseases', 'Developmental anaomalies', 'Diseases of the blood or blood forming organs', 'Diseases of the genitourinary system', 'Mental behavioural or neurodevelopmental disorders', 'Neoplasms', 'certain conditions originating in the perinatal period', 'conditions related to sexual health', 'diseases of the circulatroy system', 'diseases of the digestive system', 'diseases of the ear or mastoid process', 'diseases of the immune system', 'diseases of the musculoskeletal system or connective tissue', 'diseases of the nervous system', 'diseases of the respiratory system', 'diseases of the skin', 'diseases of the visual system', 'endocrine nutritional or metabolic diseases', 'pregnanacy childbirth or the puerperium', 'sleep-wake disorders'], id=None)" } ``` ### Dataset Splits This dataset is split into a train and validation split. The split sizes are as follow: | Split name | Num samples | | ------------ | ------------------- | | train | 9828 | | valid | 2468 |
提供机构:
justpyschitry
原始信息汇总

AutoTrain Dataset for project: Psychiatry_Article_Identifier

数据集描述

本数据集是为项目Psychiatry_Article_Identifier自动处理的数据集。

语言

数据集的语言BCP-47代码为unk。

数据集结构

数据实例

数据集的样本示例如下:

json [ { "text": "diffuse actinic keratinocyte dysplasia", "target": 15 }, { "text": "cholesterol atheroembolism", "target": 8 } ]

数据集字段

数据集包含以下字段:

json { "text": "Value(dtype=string, id=None)", "target": "ClassLabel(num_classes=20, names=[Certain infectious or parasitic diseases, Developmental anaomalies, Diseases of the blood or blood forming organs, Diseases of the genitourinary system, Mental behavioural or neurodevelopmental disorders, Neoplasms, certain conditions originating in the perinatal period, conditions related to sexual health, diseases of the circulatroy system, diseases of the digestive system, diseases of the ear or mastoid process, diseases of the immune system, diseases of the musculoskeletal system or connective tissue, diseases of the nervous system, diseases of the respiratory system, diseases of the skin, diseases of the visual system, endocrine nutritional or metabolic diseases, pregnanacy childbirth or the puerperium, sleep-wake disorders], id=None)" }

数据集分割

数据集分为训练集和验证集,分割大小如下:

分割名称 样本数量
训练集 9828
验证集 2468
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作