Replication Data for: Computer-Assisted Qualitative Visual Analysis
收藏DataCite Commons2024-07-11 更新2024-07-13 收录
下载链接:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/NS9HV4
下载链接
链接失效反馈官方服务:
资源简介:
This dataset encompasses the top 1,000 advertisements collected from the "adPorn" subreddit over the period from April 2, 2011, to August 1, 2022. After a manual cleaning process, the dataset was refined to 866 images. These images were analysed using Google Cloud Vision and GPT-4 Turbo, providing a rich set of data on each advertisement. The dataset includes links to the original images hosted on Reddit, alongside the analytical data produced by Google Cloud Vision and GPT-4 Turbo. Additionally, the images were thematically clustered and this clustering information is also included in the dataset.
本数据集涵盖2011年4月2日至2022年8月1日期间,从Reddit平台adPorn子版块采集的前1000条广告。经手动数据清洗流程后,数据集被精简至866张图片。研究团队使用谷歌云视觉(Google Cloud Vision)与GPT-4 Turbo对上述图片开展分析,为每条广告生成了丰富的多维度数据。本数据集包含托管于Reddit的原始图片链接,以及由谷歌云视觉和GPT-4 Turbo生成的分析数据。此外,研究团队对图片进行了主题聚类,相关聚类信息也已收录至本数据集。
提供机构:
Harvard Dataverse
创建时间:
2024-07-11



