Replication Data: Leveraging Researcher Domain Expertise to Annotate Concepts within Imbalanced Data
收藏NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://doi.org/10.7910/DVN/IEX083
下载链接
链接失效反馈官方服务:
资源简介:
In this manuscript, we describe a method to utilize researcher domain expertise to annotate concepts efficiently and accurately within an imbalanced dataset. This folder contains two scripts that run two variations of the simulation referred to in our paper. Additionally, we included two separate datasets that were utilized in the simulations. For each, we shared the list of document embeddings used for classification, together with a corresponding CSV which holds the categorical labels for each embedding. We recommend first reading the "README" text file, before running the scripts.
创建时间:
2023-02-15



