DroNER: Dataset for Drone Named Entity Recognition
收藏Mendeley Data2024-03-27 更新2024-06-26 收录
下载链接:
https://data.mendeley.com/datasets/fwcjyc754h
下载链接
链接失效反馈官方服务:
资源简介:
The dataset is constructed using several drone images acquired from VTO Labs Drone Forensic Dataset [1]. The dataset's main objective is to attempt performing NER on the human-readable messages contained in the drone flight log files. Six entity types, i.e., component, action, issue, parameter, state, and function, are identified as the region of interest in the domain problem, which is then used to label the entities mentioned in a log message. The entity type identification is performed in the context of drone forensics, as the original intention of constructing this dataset is to build an information extraction model to help the forensic investigator pinpoint an incident-related log record. The NER dataset is annotated using consistent and contextual tagging to compare the effect of contextual tagging on the NER model's performance. Contextual tagging considers surrounding words and uses the longest span as the context to determine which entity type of a particular word belongs to. Contrarily, consistent tagging uses the shortest span as the context of a word within a sentence. The train and test set are split based on the drone models resulting in a proportion of 76:24 since the number of messages extracted from each drone image is uncontrollable.
创建时间:
2024-01-23



