five

MORFITT : A multi-label corpus of French scientific articles in the biomedical domain

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7893840
下载链接
链接失效反馈
官方服务:
资源简介:
This article presents MORFITT, the first multi-label corpus in French annotated in specialties in the medical field. MORFITT is composed of 3~624 abstracts of scientific articles from PubMed, annotated in 12 specialties for a total of 5,116 annotations. We detail the corpus, the experiments and the preliminary results obtained using a classifier based on the pre-trained language model CamemBERT. These preliminary results demonstrate the difficulty of the task, with a weighted average F1-score of 61.78%.
创建时间:
2023-05-04
二维码
社区交流群
二维码
科研交流群
商业服务