Newly Emerged Rumors in Twitter | Social Media Analysis Dataset | Rumor Propagation Dataset

Mendeley Data | Updated 2024-03-27 | Indexed 2024-06-27
Social media analysis
Rumor propagation
Download link:
https://zenodo.org/record/2563864
Description:
*** Newly Emerged Rumors in Twitter ***

These 12 datasets are the results of an empirical study on the spreading process of newly emerged rumors in Twitter. Newly emerged rumors are rumors whose rise and fall happen within a short period of time, in contrast to long-standing rumors. In particular, we have focused on newly emerged rumors that gave rise to a simultaneous anti-rumor spreading against them.

The story of each rumor is as follows:

1- Dataset_R1 : The National Football League team in Washington D.C. changed its name to Redhawks.
2- Dataset_R2 : A Muslim waitress refused to seat a church group at a restaurant, claiming "religious freedom" allowed her to do so.
3- Dataset_R3 : Facebook CEO Mark Zuckerberg bought a "super-yacht" for $150 million.
4- Dataset_R4 : Actor Denzel Washington said electing President Trump saved the U.S. from becoming an "Orwellian police state."
5- Dataset_R5 : Joy Behar of "The View" sent a crass tweet about a fatal fire in Trump Tower.
6- Dataset_R6 : Harley-Davidson's chief executive officer Matthew Levatich called President Trump "a moron."
7- Dataset_R7 : The animated children's program 'VeggieTales' introduced a cannabis character in August 2018.
8- Dataset_R8 : Michael Jordan resigned from the board at Nike and took his Air Jordan line of apparel with him.
9- Dataset_R9 : In September 2018, the University of Alabama football program ended its uniform contract with Nike, in response to Nike's endorsement deal with Colin Kaepernick.
10- Dataset_R10 : During confirmation hearings for Supreme Court nominee Brett Kavanaugh, congressional Democrats demanded that the nominee undergo DNA testing to prove he is not Adolf Hitler.
11- Dataset_R11 : Singer Michael Bublé's upcoming album will be his last, as he is retiring from making music.
12- Dataset_R12 : A screenshot from MyLife.com confirms that mail bomb suspect Cesar Sayoc was registered as a Democrat.

The structure of the Excel file for each dataset is as follows:

- Each row belongs to one captured tweet/retweet related to the rumor, and each column presents a specific piece of information about the tweet/retweet.

From left to right, the columns present the following information about the tweet/retweet:

- User ID (the user who posted the current tweet/retweet)
- The description sentence in the profile of the user who published the tweet/retweet
- The number of tweets/retweets published by the user at the time of posting the current tweet/retweet
- Date and time of creation of the account from which the current tweet/retweet was posted
- Language of the tweet/retweet
- Number of followers
- Number of followings (friends)
- Date and time of posting the current tweet/retweet
- Number of likes (favorites) the current tweet had received before it was crawled
- Number of times the current tweet had been retweeted before it was crawled
- Whether another tweet is embedded in the current tweet/retweet (for example, this happens when the current tweet is a quote, reply, or retweet)
- The source (OS) of the device from which the current tweet/retweet was posted
- Tweet/Retweet ID
- Retweet ID (if the post is a retweet, the ID of the tweet that is retweeted by the current post)
- Quote ID (if the post is a quote, the ID of the tweet that is quoted by the current post)
- Reply ID (if the post is a reply, the ID of the tweet that the current post replies to)
- Frequency of tweet occurrences, meaning the number of times the current tweet is repeated in the dataset (for example, the number of times the tweet appears in the dataset as a retweet posted by others)
- State of the tweet, which can take one of the following values (reached by agreement between the annotators):

r : The tweet/retweet is a rumor post
a : The tweet/retweet is an anti-rumor post
q : The tweet/retweet is a question about the rumor that neither confirms nor denies it
n : The tweet/retweet is not related to the rumor (it matches the queries related to the rumor but does not refer to the rumor)
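The column layout described above can be sketched in code as a simple positional mapping. This is a minimal illustration, not the authors' tooling: the column names in `COLUMNS`, the helper names `to_record` and `count_states`, and the assumption that each row arrives as a plain sequence of 18 cell values (for instance after reading a sheet with a spreadsheet library) are all hypothetical. The description gives only the order and meaning of the columns, and does not say whether the files contain a header row.

```python
from collections import Counter
from typing import Any, Dict, Iterable, Sequence

# Hypothetical names for the 18 columns, in the left-to-right order
# given in the dataset description. The files may use different headers.
COLUMNS = [
    "user_id", "user_description", "user_statuses_count", "user_created_at",
    "lang", "followers_count", "friends_count", "created_at",
    "favorite_count", "retweet_count", "has_embedded_tweet", "source",
    "tweet_id", "retweet_id", "quote_id", "reply_id",
    "frequency", "state",
]

# Annotation codes from the description.
STATE_LABELS = {"r": "rumor", "a": "anti-rumor", "q": "question", "n": "unrelated"}


def to_record(row: Sequence[Any]) -> Dict[str, Any]:
    """Map one positional row onto the assumed column names."""
    if len(row) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} cells, got {len(row)}")
    return dict(zip(COLUMNS, row))


def count_states(rows: Iterable[Sequence[Any]]) -> Counter:
    """Tally rows by annotation state; unrecognized codes count as 'unknown'."""
    counts: Counter = Counter()
    for row in rows:
        code = str(to_record(row)["state"]).strip().lower()
        counts[STATE_LABELS.get(code, "unknown")] += 1
    return counts
```

Counting states first gives a quick view of how rumor and anti-rumor posts are balanced in a given dataset before any further analysis of the spreading process.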
Created:
2023-06-28