Broad Twitter Corpus

Name: Broad Twitter Corpus
Creator: OpenDataLab
Published: 2026-05-24 05:30:17
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/Broad_Twitter_Corpus

下载链接

链接失效反馈

官方服务：

资源简介：

广泛的 Twitter 语料库是一个命名实体注释的推文数据集，收集这些数据是为了捕捉时间、空间和社会的多样性。语料库的目标是提供社交媒体中命名实体的代表性示例。它的注解具有很高的一致性和质量，它有大约 12000 个实体注解，类型为 Person、Location 和 Organization。

The broad Twitter corpus is a named entity-annotated tweet dataset compiled to capture temporal, spatial, and social diversity. The core objective of this corpus is to provide representative examples of named entities within social media. Its annotations boast high consistency and quality, containing approximately 12,000 entity annotations categorized into three types: Person, Location, and Organization.

提供机构：

OpenDataLab

创建时间：

2022-06-23

搜集汇总

数据集介绍