ner dataset and code of paper "Accelerating the Exploration of Information in Chinese Geological Texts Using Pretrained Model and Self Attention"
收藏DataCite Commons2025-05-01 更新2024-08-19 收录
下载链接:
https://figshare.com/articles/dataset/ner_dataset_and_code/25416583/3
下载链接
链接失效反馈官方服务:
资源简介:
The datasets and code used in this study are publicly available.Datasets:The datasets used in this study have been divided into training, testing, and validation sets.Code:This code repository includes Python scripts that replicate the experimental setup described in the paper. The code is organized into the following modules:Data Preprocessing: This module contains code for loading, cleaning, and transforming the datasets.Model Training: This module includes code for training various named entity recognition models using pre-trained language models. The code also includes implementations of the ablation experiments and data augmentation techniques described in the paper.Evaluation: This module contains code for evaluating the performance of the trained models on the held-out data sets.The data and code are provided to facilitate reproducibility and further research on named entity recognition in the Chinese language.<br><b>Clarification: None of the authors are affiliated with Tsinghua University.</b>
提供机构:
figshare
创建时间:
2024-04-02



