Tweets and Gab Postings for source classification
Source: Figshare. Updated: 2019-09-05. Indexed: 2026-04-29.
Download link:
https://figshare.com/articles/dataset/Tweets_and_Gab_Postings_for_source_classfication/9773717
Description:
The data can be opened using pickle in "rb" mode. The LanguageExample class can be found here: https://gist.github.com/billweasley/b8e05969b5b2267473d13f263253812e

equally_split_examples.pickle:
- Hashtag-equally-divided dataset [ensures an equal hashtag (topic) distribution]
- It is a dictionary containing two keys: "train" and "dev"
- Each value in the dictionary is a list of LanguageExample objects
- 1168 Tweets, 1168 Gab postings
- Label: "0" indicates a sample from Gab, "1" indicates a sample from Twitter

random_split_examples.pickle:
- Randomly divided dataset [ensures only that Gab and Twitter have the same number of samples; it does not ensure the same distribution of hashtags across the two social network platforms]
- It is a dictionary containing two keys: "train" and "dev"
- Each value in the dictionary is a list of LanguageExample objects
- 2181 Tweets, 2181 Gab postings
- Label: "0" indicates a sample from Gab, "1" indicates a sample from Twitter

gabs_tags_information_maps.pickle:
- Original Gab postings, collected and pre-processed (URL removal, HTTP tag removal, HTTP decoding)
- A map where each key is a hashtag and the value is a list of Gab postings associated with that hashtag

tweets_tags_information_maps.pickle:
- Original Tweets, collected and pre-processed (URL removal, HTTP tag removal, HTTP decoding)
- A map where each key is a hashtag and the value is a list of Tweets associated with that hashtag
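
The file layouts described above can be read along the following lines. This is a minimal sketch rather than the dataset author's own loader. The helper names (load_pickle, label_counts, posts_per_hashtag) are hypothetical, and the attribute name "label" on LanguageExample is an assumption that should be checked against the gist. The demo uses small synthetic stand-ins so that it runs without the real files. To unpickle the real split files, the LanguageExample class from the gist must be importable under the module path it was pickled with.

```python
import pickle
import tempfile
from collections import Counter
from pathlib import Path
from types import SimpleNamespace


def load_pickle(path):
    """Open a dataset file in binary read mode ("rb") and unpickle it."""
    with open(path, "rb") as f:
        return pickle.load(f)


def label_counts(examples, label_attr="label"):
    """Count examples per label; `label_attr` is an ASSUMED attribute name."""
    return Counter(str(getattr(ex, label_attr)) for ex in examples)


def posts_per_hashtag(tag_map):
    """Map each hashtag to the number of posts stored under it."""
    return {tag: len(posts) for tag, posts in tag_map.items()}


# Synthetic stand-ins (NOT real data). SimpleNamespace substitutes for
# LanguageExample; the `text` and `label` fields are assumptions.
demo_split = {
    "train": [
        SimpleNamespace(text="a gab posting", label="0"),
        SimpleNamespace(text="a tweet", label="1"),
    ],
    "dev": [SimpleNamespace(text="another tweet", label="1")],
}
demo_tags = {"#news": ["post one", "post two"], "#tech": ["post three"]}

with tempfile.TemporaryDirectory() as tmp:
    split_path = Path(tmp) / "demo_split.pickle"
    tags_path = Path(tmp) / "demo_tags.pickle"
    split_path.write_bytes(pickle.dumps(demo_split))
    tags_path.write_bytes(pickle.dumps(demo_tags))

    # Same call shape as for equally_split_examples.pickle or
    # gabs_tags_information_maps.pickle.
    splits = load_pickle(split_path)
    tag_map = load_pickle(tags_path)

train_counts = label_counts(splits["train"])
dev_counts = label_counts(splits["dev"])
tag_sizes = posts_per_hashtag(tag_map)
print(train_counts, dev_counts, tag_sizes)
```

With the real files, replacing the demo paths with the downloaded file names should be enough, provided the LanguageExample class is importable and its label attribute matches the assumed name.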
Created:
2019-09-05



