five

Tweet geolocation 5m

收藏
DataCite Commons2020-09-04 更新2024-07-27 收录
下载链接:
https://figshare.com/articles/dataset/Tweet_geolocation_5m/3168529
下载链接
链接失效反馈
官方服务:
资源简介:
Tweet-geolocation-5m is a dataset with more than 5 million geolocated tweets with detailed geolocation information associated. Each geolocated tweet is associated with its fine-grained location information, collected from OpenStreetMap [1] using the reverse geocoding feature in Nominatim [2]. It was originally created for country-level classification of tweets, but finer-grained classification is also provided with the dataset. The country codes are provided using the ISO 3166-1 alpha-2 standard [3].<br>The dataset was collected in two different week long periods: TC2014, collected in October 2014, and TC2015, collected in October 2015.<br>Two files are provided here:* tweet-geolocation-5m.tar.bz2, which is the actual datasets, providing the tweet IDs and ground truth country IDs that enable conducting further experiments.* vectors-and-folds.tar.bz2, which is provided for the purposes of reproducibility. With the information provided in this file, you should be able to reproduce the results we presented in the paper.
提供机构:
figshare
创建时间:
2016-04-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作