NER dataset related to legal texts
Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://data.mendeley.com/datasets/scpttyz6t5
Description:
This dataset supports Named Entity Recognition (NER) on legal judgment documents related to the crime of assisting in information network crimes. It contains 4,236 samples in total, covering both training and validation data, annotated with 8 labels. The file train1.json holds the raw data in JSON format and is not divided into training and validation sets. The ner_data folder holds the processed data as .txt files, split into training and validation sets at a 5:1 ratio, and also includes a list of all label names. The model is trained on the processed dataset.
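As a rough illustration of what a 5:1 split means for 4,236 samples, the sketch below shuffles placeholder records and holds out one part in six for validation. The function name `split_five_to_one`, the fixed seed, and the use of shuffling are assumptions made for this example; the listing does not say how the published split was produced, and the placeholder integers stand in for the real records, whose JSON schema is not described here.

```python
import random


def split_five_to_one(samples, seed=0):
    """Shuffle a copy of `samples` and split it 5:1 into (train, validation).

    Hypothetical helper: the dataset's actual split procedure is not documented.
    """
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_val = len(shuffled) // 6  # one part in six goes to validation
    return shuffled[n_val:], shuffled[:n_val]


# Placeholder records standing in for the 4,236 annotated samples.
samples = list(range(4236))
train, val = split_five_to_one(samples)
print(len(train), len(val))  # 3530 706
```

Under these assumptions the split yields 3,530 training and 706 validation samples, which together account for all 4,236 records.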
Created:
2024-09-16



