Plant science corpus
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/10022685
Description:
The plant science corpus consists of the titles and abstracts of plant science articles in PubMed published before 2021, plus a small number of 2021 records that are present because the records were modified. The columns are:
Index: integer index serving as identifier
PMID: PubMed identifier
Date: Publication date
Journal: Journal in which the article was published
Title: Title of the article
Abstract: Abstract of the article
Corpus: Title and abstract combined
Text classification score: Score from the model that predicts whether a record is a plant science record
Preprocessed corpus: Corpus after lower-casing, stop word removal, removal of characters that are neither alphanumeric nor white space, and lemmatisation
Topic: index of topics after topic modeling
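
The relationship between the Title, Abstract, Corpus and Preprocessed corpus columns can be illustrated with a minimal Python sketch. It is not the pipeline used to build the dataset. The function names, the tiny stop word list, the single-space separator, the ASCII-only character filter and the order of the steps are all assumptions made for illustration, and lemmatisation is left as a pluggable function because the listing does not say which lemmatiser was used.

```python
import re

# Hypothetical, deliberately tiny stop word list (assumption: the list
# actually used for the dataset is not given in the listing).
STOP_WORDS = frozenset({"a", "an", "and", "are", "in", "is", "of", "the", "to", "with"})


def build_corpus(title, abstract):
    """Combine title and abstract, as in the Corpus column.

    Assumption: the two fields are joined with a single space.
    """
    return f"{title} {abstract}".strip()


def preprocess(text, lemmatize=lambda token: token, stop_words=STOP_WORDS):
    """Approximate the steps listed for the Preprocessed corpus column.

    Assumption: `lemmatize` defaults to the identity function; pass a real
    lemmatiser (one callable per token) to get actual lemmas.
    """
    # 1. Lower-case the text.
    lowered = text.lower()
    # 2. Drop every character that is neither an ASCII letter, a digit,
    #    nor white space (assumption: characters are deleted, not replaced).
    cleaned = re.sub(r"[^a-z0-9\s]", "", lowered)
    # 3. Split on white space and remove stop words.
    tokens = [token for token in cleaned.split() if token not in stop_words]
    # 4. Lemmatise each remaining token and rejoin.
    return " ".join(lemmatize(token) for token in tokens)


if __name__ == "__main__":
    corpus = build_corpus(
        "The Roots of Arabidopsis thaliana",
        "Responses to drought-stress are described.",
    )
    print(preprocess(corpus))
```

Filtering characters before removing stop words lets a token such as "the," still match the stop word list; if the dataset applied the steps in the order they are listed, results for such tokens could differ.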
Created:
2023-11-15



