
Data and code for project "What makes users click: the effect of news values negativity and surprise in news headlines on the CTR"

Indexed by NIAID Data Ecosystem on 2026-03-13
Download link:
https://zenodo.org/record/5896853
Description:
This repository contains the following data files and the corresponding Python code, saved as Jupyter notebooks:

- upworthy-archive-confirmatory-packages-03.12.2020.csv: original dataset downloaded from The Upworthy Research Archive (https://osf.io/jd64p/).
- cleaned_data.csv: upworthy-archive-confirmatory-packages-03.12.2020.csv cleaned with notebook_clean_data.ipynb. Cleaning steps:
    - Data sorted by clickability_test_id.
    - Headlines with 0 clicks deleted.
    - Variables ctr, mean_ctr, and lift created.
    - Where a headline occurs multiple times within a test, only its first occurrence kept.
    - Tests with only one headline deleted.
- sample_manual_annotations.csv: sample of 300 headlines from cleaned_data.csv, retrieved with notebook_sample_manual_annotations.ipynb.
- inter-annotator_sentiment.csv: input for notebook_inter-annotator_scores_confusion_matrix.ipynb, used to calculate inter-annotator agreement scores for the sentiment annotations.
- inter-annotator_emotion.csv: input for notebook_inter-annotator_scores_confusion_matrix.ipynb, used to calculate inter-annotator agreement scores for the emotion annotations.
- input_sentiment.tsv: input for notebook_sentiment_emotion_test_set.ipynb, used for sentiment analysis on the sample of 300 headlines in sample_manual_annotations.csv.
- input_emotion.tsv: input for notebook_sentiment_emotion_test_set.ipynb, used for emotion analysis on the sample of 300 headlines in sample_manual_annotations.csv.
- input_all_headlines.tsv: input for notebook_sentiment_emotion_final_annotation.ipynb, used to annotate all headlines in cleaned_data.csv.
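The cleaning steps listed for cleaned_data.csv can be sketched roughly as follows. This is an illustration only, not the code in notebook_clean_data.ipynb. It rests on several assumptions: that each row carries the fields clickability_test_id, headline, impressions, and clicks; that ctr is clicks divided by impressions; that mean_ctr is the average ctr of the headlines in the same test; and that lift is ctr divided by mean_ctr. The description does not define these variables, and the order of operations simply follows the order in which the steps are listed. The function name clean_packages is hypothetical. The sketch uses only the standard library so that it runs without extra dependencies.

```python
from collections import defaultdict


def clean_packages(rows):
    """Apply the listed cleaning steps to a list of row dicts.

    Assumed keys per row (not confirmed by the description):
    clickability_test_id, headline, impressions, clicks.
    """
    # Step 1: sort by test id (sorted() is stable, so the original
    # order of headlines within a test is preserved).
    rows = sorted(rows, key=lambda r: r["clickability_test_id"])

    # Step 2: delete headlines with 0 clicks (copy rows to avoid
    # mutating the caller's data).
    rows = [dict(r) for r in rows if r["clicks"] > 0]

    # Step 3: create ctr, mean_ctr, and lift.
    # ASSUMPTION: ctr = clicks / impressions,
    #             mean_ctr = mean ctr within the test,
    #             lift = ctr / mean_ctr.
    by_test = defaultdict(list)
    for r in rows:
        r["ctr"] = r["clicks"] / r["impressions"]
        by_test[r["clickability_test_id"]].append(r)
    for group in by_test.values():
        mean_ctr = sum(r["ctr"] for r in group) / len(group)
        for r in group:
            r["mean_ctr"] = mean_ctr
            r["lift"] = r["ctr"] / mean_ctr

    # Step 4: keep only the first occurrence of a headline per test.
    seen = set()
    deduped = []
    for r in rows:
        key = (r["clickability_test_id"], r["headline"])
        if key not in seen:
            seen.add(key)
            deduped.append(r)

    # Step 5: delete tests that are left with only one headline.
    counts = defaultdict(int)
    for r in deduped:
        counts[r["clickability_test_id"]] += 1
    return [r for r in deduped if counts[r["clickability_test_id"]] > 1]


if __name__ == "__main__":
    # Made-up rows for illustration; not taken from the dataset.
    example = [
        {"clickability_test_id": "b", "headline": "X", "impressions": 100, "clicks": 5},
        {"clickability_test_id": "a", "headline": "H1", "impressions": 100, "clicks": 10},
        {"clickability_test_id": "a", "headline": "H2", "impressions": 100, "clicks": 30},
    ]
    for row in clean_packages(example):
        print(row["clickability_test_id"], row["headline"], row["ctr"], row["lift"])
```

If lift is in fact defined differently in the notebook (for example as a relative difference from mean_ctr), only the single line that assigns r["lift"] would need to change.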
Created:
2022-02-20