Data for: Improving Named Entity Recognition in Noisy User-generated Text with Local Distance Neighbor Feature
Source: NIAID Data Ecosystem (indexed 2026-03-11)
Download link:
https://data.mendeley.com/datasets/nsfdt6m47j
Resource overview:
NUToT Dataset (Noisy User-generated Text on Tor)
Name: Noisy User-generated Text on Tor
Acronym: NUToT
Description: The data is annotated for the Named Entity Recognition (NER) task and covers six entity categories: Person, Location, Group, Creative work, Corporation, and Product. The text is drawn from two categories of the DUTA dataset, Drugs and Weapons (DUTA dataset: http://gvis.unileon.es/dataset/duta-darknet-usage-text-addresses/). The dataset contains 851 sentences with 1200 named entities.
The dataset is also available on our group website: http://gvis.unileon.es/dataset/nutot/
Created:
2020-03-31



