ner dataset and code of paper "Accelerating the Exploration of Information in Chinese Geological Texts Using Pretrained Model and Self Attention"

Name: ner dataset and code of paper "Accelerating the Exploration of Information in Chinese Geological Texts Using Pretrained Model and Self Attention"
Creator: figshare
Published: 2025-05-01 07:13:44
License: No description available

DataCite Commons: updated 2025-05-01, indexed 2024-08-19

Download link:

https://figshare.com/articles/dataset/ner_dataset_and_code/25416583/3

Description:

The datasets and code used in this study are publicly available.

Datasets: The datasets used in this study have been divided into training, testing, and validation sets.

Code: This code repository includes Python scripts that replicate the experimental setup described in the paper. The code is organized into the following modules:

Data Preprocessing: This module contains code for loading, cleaning, and transforming the datasets.

Model Training: This module includes code for training various named entity recognition models using pre-trained language models. It also includes implementations of the ablation experiments and data augmentation techniques described in the paper.

Evaluation: This module contains code for evaluating the performance of the trained models on the held-out datasets.

The data and code are provided to facilitate reproducibility and further research on named entity recognition in the Chinese language.

Clarification: None of the authors are affiliated with Tsinghua University.
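As a rough illustration of what an evaluation step for named entity recognition typically computes, the sketch below scores predictions by exact entity-span match. It is not taken from the repository: it assumes the labels follow a BIO tagging scheme, which the description does not confirm, and the function names (`bio_to_spans`, `entity_f1`) and the entity types `LOC` and `ROCK` are hypothetical.

```python
from typing import List, Set, Tuple

# (entity type, start index, end index exclusive)
Span = Tuple[str, int, int]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from one sentence of BIO tags (assumed scheme)."""
    spans: Set[Span] = set()
    start, label = None, None
    # A trailing "O" sentinel flushes a span that runs to the end.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.add((label, start, i))
        if tag.startswith("B-"):
            start, label = i, tag[2:]
        else:
            # "O", or an "I-" tag with no matching open span, is ignored.
            start, label = None, None
    return spans


def entity_f1(gold: List[List[str]], pred: List[List[str]]) -> Tuple[float, float, float]:
    """Micro-averaged precision, recall, and F1 over exact span matches."""
    tp = n_gold = n_pred = 0
    for g, p in zip(gold, pred):
        gs, ps = bio_to_spans(g), bio_to_spans(p)
        tp += len(gs & ps)
        n_gold += len(gs)
        n_pred += len(ps)
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_gold if n_gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical tags for a single four-token sentence.
gold = [["B-LOC", "I-LOC", "O", "B-ROCK"]]
pred = [["B-LOC", "I-LOC", "O", "O"]]
precision, recall, f1 = entity_f1(gold, pred)
```

In this toy case the prediction recovers one of the two gold entities, so precision is 1.0 and recall is 0.5. If the released files use a different tagging scheme (for example BIOES), the span extraction would need to be adapted.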

Provider:

figshare

Created:

2024-04-02
