noone-m/shifaa-processed

Source: Hugging Face. Updated 2025-12-16; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/noone-m/shifaa-processed
Description:
Shifaa Arabic Medical Consultations - Processed for Classification is a processed dataset of Arabic medical consultations intended for single-label classification. Each record pairs a patient's medical question with its main medical specialty category. The data was cleaned by extracting the main category from hierarchical labels, removing questions that carried conflicting categories, deduplicating questions, and performing a stratified split (~81% train, 9% validation, 10% test). The dataset has two features, Question (the question text) and Main Category (the main specialty category), and is divided into train, validation, and test splits.
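
The cleaning steps described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the dataset author's actual code. The `/` separator for hierarchical labels, the function names, and the two-stage split (10% test first, then 10% of the remainder for validation, which works out to about 81/9/10) are all assumptions made for the example.

```python
import random
from collections import defaultdict


def main_category(label, sep="/"):
    """Return the top-level part of a hierarchical label (separator is assumed)."""
    return label.split(sep)[0].strip()


def preprocess(rows, sep="/"):
    """Clean an iterable of (question, hierarchical_label) pairs."""
    cats = defaultdict(set)
    for question, label in rows:
        cats[question.strip()].add(main_category(label, sep))
    # One entry per question deduplicates the data; questions that map to
    # more than one main category are dropped as conflicting.
    return [(q, next(iter(c))) for q, c in cats.items() if len(c) == 1]


def stratified_split(examples, test_frac=0.10, val_frac=0.10, seed=0):
    """Split per category: hold out test first, then validation from the rest."""
    rng = random.Random(seed)
    by_cat = defaultdict(list)
    for ex in examples:
        by_cat[ex[1]].append(ex)
    train, val, test = [], [], []
    for cat in sorted(by_cat):
        items = by_cat[cat]
        rng.shuffle(items)
        n_test = round(len(items) * test_frac)
        n_val = round((len(items) - n_test) * val_frac)
        test += items[:n_test]
        val += items[n_test:n_test + n_val]
        train += items[n_test + n_val:]
    return train, val, test
```

Splitting within each category keeps the class proportions roughly equal across train, validation, and test, which matters when some specialties are much rarer than others.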
Provider:
noone-m