IIC/livingner3
收藏Hugging Face2026-02-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/IIC/livingner3
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- es
tags:
- biomedical
- clinical
- spanish
multilinguality:
- monolingual
task_categories:
- text-classification
task_ids:
- multi-label-classification
license:
- cc-by-4.0
pretty_name: LivingNER3
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
dataset_info:
features:
- name: text
dtype: string
- name: label
list:
class_label:
names:
'0': 'No'
'1': 'Yes'
splits:
- name: train
num_bytes: 1560010
num_examples: 425
- name: validation
num_bytes: 320464
num_examples: 75
- name: test
num_bytes: 897775
num_examples: 250
download_size: 1591274
dataset_size: 2778249
train-eval-index:
- task: text-classification
task_id: multi_label_classification
splits:
train_split: train
eval_split: test
metrics:
- type: f1
name: f1
---
# LivingNER
This is a third party reupload of the [LivingNER](https://temu.bsc.es/livingner/) task 3 dataset.
It only contains the task 3 for the Spanish language. It does not include the multilingual data nor the background data.
This dataset is part of a benchmark in the paper [A comparative analysis of Spanish Clinical encoder-based models on NER and classification tasks](https://doi.org/10.1093/jamia/ocae054).
### Citation Information
```bibtext
@article{10.1093/jamia/ocae054,
author = {García Subies, Guillem and Barbero Jiménez, Álvaro and Martínez Fernández, Paloma},
title = {A comparative analysis of Spanish Clinical encoder-based models on NER and classification tasks},
journal = {Journal of the American Medical Informatics Association},
volume = {31},
number = {9},
pages = {2137-2146},
year = {2024},
month = {03},
issn = {1527-974X},
doi = {10.1093/jamia/ocae054},
url = {https://doi.org/10.1093/jamia/ocae054},
}
```
### Citation Information of the original dataset
```bibtex
@article{amiranda2022nlp,
title={Mention detection, normalization \& classification of species, pathogens, humans and food in clinical documents: Overview of LivingNER shared task and resources},
author={Miranda-Escalada, Antonio and Farr{'e}-Maduell, Eul{`a}lia and Lima-L{'o}pez, Salvador and Estrada, Darryl and Gasc{'o}, Luis and Krallinger, Martin},
journal = {Procesamiento del Lenguaje Natural},
year={2022}
}
```
提供机构:
IIC
原始信息汇总
数据集概述
基本信息
- 名称: LivingNER3
- 语言: 西班牙语 (es)
- 标签: 生物医学、临床、西班牙语
- 多语言性: 单语种
- 任务类别: 文本分类
- 任务ID: 多标签分类
- 许可证: CC-BY-4.0
训练与评估
- 任务: 文本分类
- 任务ID: 多标签分类
- 数据分割:
- 训练集: train
- 评估集: test
- 评估指标: F1分数
数据内容
- 仅包含西班牙语的任务3数据,不包括多语言数据和背景数据。



