Tweet Dataset Annotated for Named Entity Recognition and Stance Detection

Source: arXiv. Updated: 2019-01-16. Indexed: 2024-06-21.
Download link:
https://github.com/dkucuk/Tweet-Dataset-NER-SD
Description:
The "Tweet Dataset Annotated for Named Entity Recognition and Stance Detection" was created by the Turkish Energy Institute. It contains 1065 Turkish-language tweets and is intended for research on named entity recognition (NER) and stance detection. The tweets concern two major Turkish sports clubs. Each tweet is annotated with a stance (in favor or against) and with named entities of three types: person, location and organization. The dataset went through several iterations and annotation rounds, and the final version carries both the named entity and the stance annotations. Its main use is in natural language processing, particularly the analysis of social media text.
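To make the two annotation layers concrete, here is a minimal Python sketch of how one annotated tweet could be represented in memory. The class names, field names, label spellings and the sample sentence are all hypothetical and chosen for illustration; they do not describe the actual file format used in the repository.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List, Tuple

# Label sets follow the description above; the exact spellings are assumptions.
STANCES = {"favor", "against"}
ENTITY_TYPES = {"PERSON", "LOCATION", "ORGANIZATION"}


@dataclass
class EntitySpan:
    """A named entity as a character span [start, end) in the tweet text."""
    start: int
    end: int
    label: str

    def __post_init__(self) -> None:
        if self.label not in ENTITY_TYPES:
            raise ValueError(f"unknown entity type: {self.label}")
        if not 0 <= self.start < self.end:
            raise ValueError("span must satisfy 0 <= start < end")


@dataclass
class AnnotatedTweet:
    """One tweet with a stance toward a target plus its entity spans."""
    tweet_id: str
    text: str
    target: str   # the sports club the stance refers to
    stance: str   # one of STANCES
    entities: List[EntitySpan] = field(default_factory=list)

    def __post_init__(self) -> None:
        if self.stance not in STANCES:
            raise ValueError(f"unknown stance: {self.stance}")

    def entity_texts(self) -> List[Tuple[str, str]]:
        """Return (surface form, label) pairs for every entity span."""
        return [(self.text[e.start:e.end], e.label) for e in self.entities]


def stance_counts(tweets: List[AnnotatedTweet]) -> Counter:
    """Count how many tweets carry each stance label."""
    return Counter(t.stance for t in tweets)


# Hypothetical example record (placeholder text, not taken from the dataset).
text = "ClubA played well in Istanbul"
city_start = text.index("Istanbul")
example = AnnotatedTweet(
    tweet_id="0",
    text=text,
    target="ClubA",
    stance="favor",
    entities=[
        EntitySpan(0, len("ClubA"), "ORGANIZATION"),
        EntitySpan(city_start, city_start + len("Istanbul"), "LOCATION"),
    ],
)
```

Storing entities as character offsets rather than token tags keeps the sketch independent of any tokenizer, which matters for noisy tweet text; a token-level BIO encoding would be an equally reasonable choice.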
Provider:
Turkish Energy Institute
Created:
2019-01-15