Tweets and Gab Postings for source classfication
收藏DataCite Commons2025-04-01 更新2024-07-27 收录
下载链接:
https://figshare.com/articles/Tweets_and_Gab_Postings_for_source_classfication/9773717/1
下载链接
链接失效反馈官方服务:
资源简介:
Data can be open using pickle with "rb" model.<br>LanguageExample object could be found here:https://gist.github.com/billweasley/b8e05969b5b2267473d13f263253812e<br><br><br>equally_split_examples.pickle:<br><br>- Hashtag Equally Devide Dataset [Ensure Hashtag (topic) distribution]- It is a map containing two keys: "train" and "dev"<br>- The value in the map is a list of <i>LanguageExample </i>object- 1168 Tweets, 1168 Gab Postings- Label: "0" indicates the sample from Gab, "1" indicates the sample from Twitter<br>random_split_examples.pickle:<br><br>- Random Devide Dataset [Ensure only the gab and tweet has a same number of samples, but do not ensure it has same distribution of hashtags for the different social network platforms]- It is a map containing two keys: "train" and "dev"<br>- The value in the map is a list of <i>LanguageExample </i>object- 2181 Tweets, 2181 Gab Postings- Label: "0" indicates the sample from Gab, "1" indicates the sample from Twitter<br>gabs_tags_information_maps.pickle- Original Gabs collected and pre-processed (URL removels, HTTP tag removel, HTTP decoding)- A map where the key is hashtags and the value is a list of gab language associated to the correspoding hashtag.<br><br>tweets_tags_information_maps.pickle- Original Tweets collected and pre-processed (URL removels, HTTP tag removel, HTTP decoding)- A map where the key is hashtags and the value is a list of tweets language associated to the correspoding hashtag.
提供机构:
figshare
创建时间:
2019-09-05



