five

Emoji Gestures in Russian Tweets: Moscow

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5800199
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset consists of 48 838 tweets each of them contains one of the 31 gesture emoji (different hand configurations) and its skin tone modifier options (e.g. 🙏🙏🏿🙏🏾🙏🏽🙏🏼🙏🏻), and posted within 50km from Moscow, Russia, in Russian, during May-August 2021. The dataset can be used to investigate the use of gesture emoji by Russian users of the Twitter platform. Python libraries used for collecting tweets and preprocessing: tweepy, re, preprocessor, emoji, regex, string, nltk.  The dataset contains 11 columns: preprocessed preprocessed text of the tweet (4 steps) all_emoji lists all emoji in a given tweet hashtags lists all hashtags in a given tweet user_encoded encoded Twitter user name: the first 3 characters of the user name and the first 3 characters of the user's location location_encoded location of the user: "moscow", "moscow_region", or "other" mention_present checks whether each tweet contains mentions url_present checks whether each tweet contains url preprocess_tweet preprocessing step 1: tokenizing mentions, urls, and hashtags lowercase_tweet preprocessing step 2: lowercasing remove_punct_tweet preprocessing step 3: removing punctuation tokenize_tweet preprocessing step 4: tokenizing The further information on the research project can be found here: https://github.com/mzhukovaucsb/emoji_gestures/
创建时间:
2022-05-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作