A deep learning approach to recognizing fine-grained expressway location reference from unstructured texts in Chinese
收藏Figshare2023-12-25 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/_b_A_b_b_deep_learning_b_b_approach_to_recognizing_fine-grained_b_b_expressway_location_reference_b_b_from_unstructured_texts_in_Chinese_b_/24902058
下载链接
链接失效反馈官方服务:
资源简介:
TitleA deep learning approach to recognizing fine-grained expressway location reference from unstructured texts in ChineseAbstractComplex nested and discontinuous location references are common in unstructured text. Extracting them is essential for accurate location information retrieval and spatial inference. However, traditional methods struggle with these references due to annotation system and model architecture limitations. In this study, we introduce a deep learning approach to uniformly recognize flat, nested, and discontinuous location references, motivated by recognizing fine-grained expressway location references. The approach uses a pre-trained language model to generate semantic sentence representations and a distance and direction-aware Transformer for contextual encoding. Then, it recognizes location references by modeling the adjacency and boundary relations between word pairs. We evaluated the approach on seven benchmark datasets and compared it with state-of-the-art methods. The results show that the approach achieves higher accuracy with faster inference, validating our modeling paradigm and architecture. The ablation study further confirms the effectiveness of submodules in architecture. These findings can provide valuable insights for developing advanced unified location reference recognition methods. Moreover, the detailed labeled dataset for location references can facilitate the evaluation and comparison of unified recognition methods and systems.Data and codes availabilityThe project shares a corpus of 5274 reports and 31,954 labeled entities, and a deep learning method that can simultaneously recognize flat, nested, and discontinuous location references. The labeled entities contain 12 types of entities such as expressway name, road section, direction, tunnel, and flyover. The dataset can be used for several tasks such as Geoparsing, construction of knowledge graphs, named entity recognition, and social sensing.
创建时间:
2023-12-25



