five

Memecry:Tracing the Repetition-with-Variation of Formulas on 4chan/pol/

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7100863
下载链接
链接失效反馈
官方服务:
资源简介:
Datasets underlying the analysis of the paper "Memecry: Tracing the Repetition-with-Variation of Formulas on 4chan/pol/ This upload includes the following: seedwords.csv: A .csv file with terms we used as a seed list to filter for 4chan/pol/-post containing vernacular. seedword-network_x.gdf/gephi: .gdf and .gephi network files for NPMI-weighted co-word networks of /pol/-posts. We only included posts that contained one of the aforementioned seed list words. twoflow-data_x.xlsx: .xlsx files with data on triplets common to 4chan/pol/. We identified these three-word sequences through the above network files. For example: "gr8 b8 m8", "orange man bad", "lurk moar newfag". The Excel data on these triplet includes: The absolute amount of /pol/-posts per year mentioning the triplets (within a window of five words). The average NPMI scores between the three triplet words per year. The top co-words per year having an average NPMI higher than 0.18 with two of the three triplet words. triplets.csv: A .csv file with the extracted triplets, including their common appearance as memetic phrases and a short explanation. This data was used for "two-flow graphs" available at oilab.eu/formulas/. See the paper for full explanations on the data.
创建时间:
2022-09-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作