IIC/distemist
收藏Hugging Face2026-02-06 更新2026-02-07 收录
下载链接:
https://hf-mirror.com/datasets/IIC/distemist
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于命名实体识别的西班牙语医学数据集,专注于疾病实体标注。数据集包含文本、分词序列和对应的命名实体标签,采用BIO标注方案(B-ENFERMEDAD表示疾病开始,I-ENFERMEDAD表示疾病内部,O表示非疾病实体)。数据集分为训练集(510个样本)、验证集(90个样本)和测试集(150个样本)。
This is a Spanish medical dataset for named entity recognition, focusing on disease entity annotation. The dataset contains text, token sequences, and corresponding named entity labels using the BIO tagging scheme (B-ENFERMEDAD for disease beginning, I-ENFERMEDAD for disease inside, and O for non-disease entities). The dataset is divided into train (510 examples), validation (90 examples), and test (150 examples) sets.
提供机构:
IIC



