Replication Data for: Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models

Name: Replication Data for: Revisiting Weimar Film Reviewers’ Sentiments: Integrating Lexicon-Based Sentiment Analysis with Large Language Models
Creator: Harvard Dataverse
Published: 2025-05-11 23:56:07
License: 暂无描述

DataCite Commons2025-05-11 更新2025-05-17 收录

下载链接：

https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/8NINQK

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset contains one excel table (corpus_reviews.xlsx) related to 80 historical film reviews of three Weimar films: "Das Cabinet des Dr. Caligari" (1920), "Nosferatu" (1922) and "Metropolis" (1927). The table also includes metadata related to the origin of the reviews and their full text in their original languages and in English translation. Furthermore, it contains the results of a range of methods for sentiment analysis of the reviews, including manual judgments and different approaches to automated sentiment analysis. The python code used to implement these is included. We first undertake a verbose sentiment analysis of the reviews by running the same prompt over each review through the OpeanAI API (GPT_API_all_reviews.ipynb). A less successful attempt, showing the results of the same prompt using the open source HuggingChat is also included (huggingChat_API_reviews.ipynb). We then apply a lexicon-based sentiment analysis (with Python’s NLTK library and its VADER lexicon) to the result of ChatGPT’s analysis and to the reviews directly (sentiment_analysis.ipynb). We then compare the results (results_analysis.ipynb).

提供机构：

Harvard Dataverse

创建时间：

2024-06-27

5,000+

优质数据集

54 个

任务类型

进入经典数据集