five

Keyword frequencies in popular tech media (01.2016-12.2019)

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/3942059
下载链接
链接失效反馈
官方服务:
资源简介:
Sources with weights Euractiv 5% The Conversation 5% Politico Europe 5 % IEEE Spectrum 5 % Techforge 5% Fastcompany 5% The Guardian (Tech) 12% Arstechnica 5% Reuters 5% Gizmodo 9% ZDNet 9% The Register 12% The Verge 9% TechCrunch 9% Methodology Frequency of appearances for all unigrams and bigrams in the texts Frequency: number of appearances of every term divided by the number of all terms (for every month and source)  Several media sources: a representative index is calculated with weighted average (weights as above) Average monthly change in the analised term's frequency is calculated by OLS regressions The dependent variable of the estimation is the frequency index, while the number of months since the beginning of the analysed period (January 2016) is the independent variable The regression coefficient (referred to as coef) shows by how much on average the analysed expression’s frequency changed with every observed month (marginal change of the frequency), revealing which keywords had the biggest monthly growth Columns freq_months (e.g. freq_2019-04): the average frequency of the term coef: the regression coefficient coef_norm: the regression coefficient divided by the mean frequency of the keyword
创建时间:
2021-12-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作