NLM-Chem, a new resource for chemical entity recognition in PubMed full-text literature
Source: DataONE | Updated 2021-03-22 | Indexed 2025-05-10
Download link:
https://search.dataone.org/view/sha256:90886604ada2dfb70116161a9ebbef2e54ab3fb4b7d555ec74e413f708abc4ed
Description:
Automatically identifying chemical and drug names in scientific publications advances information access for this important class of entities in a variety of biomedical disciplines by enabling improved retrieval and linkage to related concepts. Current methods for tagging chemical entities were developed for article titles and abstracts, and their performance on the full article text is substantially lower. Yet the full text frequently contains more detailed chemical information, such as the properties of chemical compounds, their biological effects, and their interactions with diseases, genes, and other chemicals.
We therefore present the NLM-Chem corpus, a full-text resource to support the development and evaluation of automated chemical entity taggers. The NLM-Chem corpus consists of 150 full-text articles, doubly annotated by ten expert NLM indexers, with ~5000 unique chemical name annotations mapped to ~2000 MeSH identifiers. Using this corpus, we built a substantially im...
Created:
2025-05-05



