five

cburger/MD_NoPunctuation

收藏
Hugging Face2023-05-28 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/cburger/MD_NoPunctuation
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: text dtype: string - name: label dtype: class_label: names: '0': ' Allergy / Immunology' '1': ' Autopsy' '2': ' Bariatrics' '3': ' Cardiovascular / Pulmonary' '4': ' Chiropractic' '5': ' Consult - History and Phy.' '6': ' Cosmetic / Plastic Surgery' '7': ' Dentistry' '8': ' Dermatology' '9': ' Diets and Nutritions' '10': ' Discharge Summary' '11': ' ENT - Otolaryngology' '12': ' Emergency Room Reports' '13': ' Endocrinology' '14': ' Gastroenterology' '15': ' General Medicine' '16': ' Hematology - Oncology' '17': ' Hospice - Palliative Care' '18': ' IME-QME-Work Comp etc.' '19': ' Lab Medicine - Pathology' '20': ' Letters' '21': ' Nephrology' '22': ' Neurology' '23': ' Neurosurgery' '24': ' Obstetrics / Gynecology' '25': ' Office Notes' '26': ' Ophthalmology' '27': ' Orthopedic' '28': ' Pain Management' '29': ' Pediatrics - Neonatal' '30': ' Physical Medicine - Rehab' '31': ' Podiatry' '32': ' Psychiatry / Psychology' '33': ' Radiology' '34': ' Rheumatology' '35': ' SOAP / Chart / Progress Notes' '36': ' Sleep Medicine' '37': ' Speech - Language' '38': ' Surgery' '39': ' Urology' splits: - name: train num_bytes: 15217808 num_examples: 4966 download_size: 7116577 dataset_size: 15217808 --- # Dataset Card for "MD_NoPunctuation" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
cburger
原始信息汇总

数据集概述

特征信息

  • text: 数据类型为字符串。
  • label: 数据类型为类别标签,包含以下类别名称:
    • 0: Allergy / Immunology
    • 1: Autopsy
    • 2: Bariatrics
    • 3: Cardiovascular / Pulmonary
    • 4: Chiropractic
    • 5: Consult - History and Phy.
    • 6: Cosmetic / Plastic Surgery
    • 7: Dentistry
    • 8: Dermatology
    • 9: Diets and Nutritions
    • 10: Discharge Summary
    • 11: ENT - Otolaryngology
    • 12: Emergency Room Reports
    • 13: Endocrinology
    • 14: Gastroenterology
    • 15: General Medicine
    • 16: Hematology - Oncology
    • 17: Hospice - Palliative Care
    • 18: IME-QME-Work Comp etc.
    • 19: Lab Medicine - Pathology
    • 20: Letters
    • 21: Nephrology
    • 22: Neurology
    • 23: Neurosurgery
    • 24: Obstetrics / Gynecology
    • 25: Office Notes
    • 26: Ophthalmology
    • 27: Orthopedic
    • 28: Pain Management
    • 29: Pediatrics - Neonatal
    • 30: Physical Medicine - Rehab
    • 31: Podiatry
    • 32: Psychiatry / Psychology
    • 33: Radiology
    • 34: Rheumatology
    • 35: SOAP / Chart / Progress Notes
    • 36: Sleep Medicine
    • 37: Speech - Language
    • 38: Surgery
    • 39: Urology

数据分割

  • train: 包含4966个样本,占用15217808字节。

数据集大小

  • 下载大小: 7116577字节
  • 数据集大小: 15217808字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作