Nanobiology Corpus
收藏arXiv2025-09-30 收录
下载链接:
https://gricad-gitlab.univ-grenoble-alpes.fr/nanobubbles/nano-ner-wiesp-2023.git
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含728篇专注于纳米生物学的科研文章,总计158,283个句子和3,762,791个词汇单元。这些文章均为英文,且在处理过程中排除了非核心部分。该数据集遵循CoNLL2003标准进行了注释,规模属于大型,其任务是对纳米生物学领域的科研文章进行实体识别的标注工作。
This dataset comprises 728 scientific articles focused on nanobiology, totaling 158,283 sentences and 3,762,791 lexical units. All articles are in English, with non-core sections removed during preprocessing. This large-scale dataset was annotated in accordance with the CoNLL-2003 standard, and its designated task is named entity recognition annotation for scientific articles in the field of nanobiology.



