Dataset of Data Augmentation Algorithm Improvement Strategy for Small-Sample Named Entity Recognition
收藏DataCite Commons2025-02-02 更新2025-04-16 收录
下载链接:
https://www.scidb.cn/en/detail?dataSetId=726504f936dd4b1ea9f116ff0a3e2fec
下载链接
链接失效反馈官方服务:
资源简介:
This data set includes NER data set in military field for small sample named entity recognition, which is jointly marked and reviewed by graduate students of Grade 20 and Grade 21 in Institute of Cognitive Intelligence, School of Computer, Heilongjiang University of Science and Technology. Research on improvement strategy of data enhancement algorithm for small sample named entity recognition; Python code realized by EDA algorithm; Python code of BERT-wwm-BiLSTM-CRF architecture NER model; An entity dictionary in the military field built by integrating open source military data and military knowledge maps; People's Daily NER data set and Weibo NER data set. The NER data set enhanced by various EDA strategies can be obtained by combining the improved EDA algorithm with the NER original data set. Through the improved EDA algorithm, combined with the original NER data set, the enhanced NER data set based on the above EDA improvement strategy can be obtained. This experiment includes "Military NER Enhanced Data Set", "People's Daily NER Enhanced Data Set" and "Weibo NER Enhanced Data Set".See DOI: 10.11925/infotech.2096-3467.2096.0261 for details.
提供机构:
Science Data Bank
创建时间:
2022-09-26



