prithvidalal/my-distiset-b6926cd3
收藏Hugging Face2025-02-12 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/prithvidalal/my-distiset-b6926cd3
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个合成数据集,用于训练句子变换器模型,如MedBERT和ClinicalBERT,以分类24个医疗病史标题类别。数据集包含三个字段:prompt、completion和system_prompt,其中包含了生成样本的描述、能力和目标。数据集旨在创建多样化的样本,以覆盖广泛的临床场景、疾病类别和治疗方式,确保模型不会对特定模式过度拟合,并能够处理各种输入场景。
This dataset is a synthetic dataset for training sentence transformers like MedBERT and ClinicalBERT to classify 24 medical case history heading categories. It includes three fields: prompt, completion, and system_prompt, which contain descriptions, capabilities, and objectives for generating samples. The dataset aims to create diverse samples covering a wide range of clinical scenarios, disease categories, and treatment modalities to ensure that the models are not overfitted to specific patterns and can handle various input scenarios.
提供机构:
prithvidalal



