five

Replication Data for: VaccinEU: COVID-19 vaccine conversations on Twitter in French, German and Italian

收藏
DataCite Commons2025-05-11 更新2025-05-17 收录
下载链接:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/NZUMZG
下载链接
链接失效反馈
官方服务:
资源简介:
We curated a list of vaccine-related keywords as complete as possible with the help of native speakers, using a snowball sampling approach, and collected over 70 million tweets in three different languages, from November 1st 2020 to November 15th 2021, using a combination of streaming and historical search Twitter APIs. We provide public access to this data in agreement with Twitter terms of service by releasing ids of tweets which can be used to retrieve full objects via APIs. For each language, we further collected a list of hashtags which strongly state a stance in favor or against vaccination, and we manually annotated a random sample of 1,000 tweets with four labels (Pro, Anti, Neutral, Out-of-context). We provide full access to this metadata, which can be used to better understand the polarized debate around vaccinations and train machine learning classifiers to automatically detect anti-vaccination messages.
提供机构:
Harvard Dataverse
创建时间:
2022-01-11
二维码
社区交流群
二维码
科研交流群
商业服务