Replication Data for: A natural language measure of ideology in the Brazilian Senate

DataONE. Updated 2022-02-07. Indexed 2024-06-08.

Download link:

https://search.dataone.org/view/sha256:b1ae733e798537fb1d47d3a0d02018a84e745842a12153e29add6464ec4569a1

Description:

We first obtained the documents from Dados Abertos using data scraping. We then built the data set by legislature, so that each legislature has its own corpus. Each legislature has a specific set of politicians, each of whom may have one or more speeches. We treat each speech as a single document; that is, we do not concatenate all speeches by a given politician into one document. At the end of the pre-processing phase we therefore have five sets of speeches (i.e., five corpora), each comprising a specific set of politicians with one or more speeches. Another important step is labelling the documents. As described in The Model (main text), we use the ideological estimates of the main Brazilian parties from Power and Zucco Jr. (2009), who apply a multidimensional scaling technique to survey data and roll-call vote data. Their classification assigns each party to an ideological position: left, center, or right. Given a politician's party, we can therefore assign each of that politician's speeches to the corresponding ideological class.
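The corpus construction and labelling steps described above can be illustrated with a minimal Python sketch. It is not the authors' replication code. The `Speech` record, the `build_corpora` function, and the `PARTY_IDEOLOGY` lookup are hypothetical names introduced here, and the party names are placeholders rather than the actual Power and Zucco Jr. (2009) classification.

```python
from collections import defaultdict
from typing import NamedTuple


class Speech(NamedTuple):
    """One speech; field names are assumptions, not the dataset's schema."""
    legislature: int
    politician: str
    party: str
    text: str


# Placeholder party-to-class lookup. The real mapping would come from
# Power and Zucco Jr. (2009); these party names are invented.
PARTY_IDEOLOGY = {
    "PARTY_A": "left",
    "PARTY_B": "center",
    "PARTY_C": "right",
}


def build_corpora(speeches):
    """Group speeches into one corpus per legislature.

    Each speech stays a separate document (no concatenation per
    politician) and is labelled with its party's ideological class.
    Speeches from parties missing in the lookup are skipped, which is
    an assumption of this sketch.
    """
    corpora = defaultdict(list)
    for speech in speeches:
        label = PARTY_IDEOLOGY.get(speech.party)
        if label is None:
            continue
        corpora[speech.legislature].append((speech.text, label))
    return dict(corpora)
```

Calling `build_corpora` on a list of `Speech` records returns a dictionary keyed by legislature, where each value is a list of `(text, label)` pairs, one per speech.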

Created:

2023-11-12