five

LivingNER terminology: NCBI Taxonomy translated to Spanish

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/6390506
下载链接
链接失效反馈
官方服务:
资源简介:
Official NCBI Taxonomy FTP dump (https://ftp.ncbi.nlm.nih.gov/pub/taxonomy/) with the terms translated to Spanish by a Neural Machine Translator fine-tuned for the biomedical domain. We have added the following terms:  2560602    Mumps orthorubulavirus    Mumps orthorubulavirus    scientific name    Paperas ortorubulavirus 2560526    Human orthorubulavirus 4    Human orthorubulavirus 4    scientific name    orthorubulavirus humano 4 2847144    hepatitis C virus genotype 1a    hepatitis C virus genotype 1a    scientific name    virus de la hepatitis C genotipo 1a 2560525    Human orthorubulavirus 2    Human orthorubulavirus 2    scientific name    orthorubulavirus humano 2 _NOCODE_    out of NCBI Taxonomy scope    out of NCBI Taxonomy scope    NA    mención no codificable a NCBI Taxonomy The first 4 were added because they appear in the LivingNER corpus, and are present in the browser version of NCBI Taxonomy. The last one (_NOCODE_) is added to identify terms in LivingNER corpus not present in the NCBI Taxonomy.     Format: Tab-separated file with the following columns: tax_id: the id of node associated with this name name_txt: name itself unique name: the unique variant of this name if name not unique name class: (synonym, common name, scientific name, ...) Spanish name: name in Spanish   Please cite if you use this dataset: A. Miranda-Escalada, E. Farré-Maduell, S. Lima-López, D. Estrada, L. Gascó, M. Krallinger, Mention detection, normalization & classification of species, pathogens, humans and food in clinical documents: Overview of livingner shared task and resources, Procesamiento del Lenguaje Natural (2022) @article{amiranda2022nlp, title={Mention detection, normalization \& classification of species, pathogens, humans and food in clinical documents: Overview of LivingNER shared task and resources}, author={Miranda-Escalada, Antonio and Farr{\'e}-Maduell, Eul{`a}lia and Lima-L{\'o}pez, Salvador and Estrada, Darryl and Gasc{\'o}, Luis and Krallinger, Martin}, journal = {Procesamiento del Lenguaje Natural}, year={2022} }   Resources Web Training and validation set Annotation Guidelines Evaluation library   For more information visit https://temu.bsc.es/livingner/ or email us at encargo-pln-life@bsc.es Check out the translator demo: https://textmining.bsc.es/translator
创建时间:
2022-07-13
二维码
社区交流群
二维码
科研交流群
商业服务