Data and code for project "What makes users click: the effect of news values negativity and surprise in news headlines on the CTR"
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5896853
下载链接
链接失效反馈官方服务:
资源简介:
This repository includes the following files and corresponding Python codes saved in Jupiter Notebooks:
upworthy-archive-confirmatory-packages-03.12.2020.csv: Original dataset downloaded from The Upworthy Research Archive (https://osf.io/jd64p/)
cleaned_data.csv: upworthy-archive-confirmatory-packages-03.12.2020.csv cleaned with notebook_clean_data.ipynb
Data sorted by clickability_test_id
Headlines with 0 clicks deleted
Variables ctr, mean_ctr, and lift created
Only kept first occurrence of headlines that occur multiple times in the dataset per test
Deleted tests with only one headline
sample_manual_annotations.csv: sample of 300 headlines of cleaned_data.csv retrieved with notebook_sample_manual_annotations.ipynb
inter-annotator_sentiment.csv: input for notebook_inter-annotator_scores_confusion_matrix.ipynb to calculate inter-annotator agreement scores with regard to annotating sentiment
inter-annotator_emotion.csv: input for notebook_inter-annotator_scores_confusion_matrix.ipynb to calculate inter-annotator agreement scores with regard to annotating emotion
input_sentiment.tsv: input for notebook_sentiment_emotion_test_set.ipynb for sentiment analysis on sample of 300 headlines in sample_manual_annotations.csv
input_emotion.tsv: input for notebook_sentiment_emotion_test_set.ipynb for emotion analysis on sample of 300 headlines in sample_manual_annotations.csv
input_all_headlines.tsv: input for notebook_sentiment_emotion_final_annotation.ipynb to annotate all headlines in cleaned_data.csv
创建时间:
2022-02-20



