five

A deep learning approach to recognizing fine-grained expressway location reference from unstructured texts in Chinese

收藏
Figshare2023-12-25 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/_b_A_b_b_deep_learning_b_b_approach_to_recognizing_fine-grained_b_b_expressway_location_reference_b_b_from_unstructured_texts_in_Chinese_b_/24902058
下载链接
链接失效反馈
官方服务:
资源简介:
TitleA deep learning approach to recognizing fine-grained expressway location reference from unstructured texts in ChineseAbstractComplex nested and discontinuous location references are common in unstructured text. Extracting them is essential for accurate location information retrieval and spatial inference. However, traditional methods struggle with these references due to annotation system and model architecture limitations. In this study, we introduce a deep learning approach to uniformly recognize flat, nested, and discontinuous location references, motivated by recognizing fine-grained expressway location references. The approach uses a pre-trained language model to generate semantic sentence representations and a distance and direction-aware Transformer for contextual encoding. Then, it recognizes location references by modeling the adjacency and boundary relations between word pairs. We evaluated the approach on seven benchmark datasets and compared it with state-of-the-art methods. The results show that the approach achieves higher accuracy with faster inference, validating our modeling paradigm and architecture. The ablation study further confirms the effectiveness of submodules in architecture. These findings can provide valuable insights for developing advanced unified location reference recognition methods. Moreover, the detailed labeled dataset for location references can facilitate the evaluation and comparison of unified recognition methods and systems.Data and codes availabilityThe project shares a corpus of 5274 reports and 31,954 labeled entities, and a deep learning method that can simultaneously recognize flat, nested, and discontinuous location references. The labeled entities contain 12 types of entities such as expressway name, road section, direction, tunnel, and flyover. The dataset can be used for several tasks such as Geoparsing, construction of knowledge graphs, named entity recognition, and social sensing.
创建时间:
2023-12-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作