Brazilian First Names and Gender Ratios
收藏DataCite Commons2025-05-12 更新2025-05-17 收录
下载链接:
https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/ORH029
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains a list of ~74,000 Brazilian first names and how often each each name is used for a male or female candidates in the 2000, 2004, 2008, and 2012 municipal elections in Brazil. Thus it effectively works as a tool to classify the gender of any Brazilian (and perhaps Portuguese) name.The original data can be found on the Brazilian Electoral Tribunal's website <a href = "http://www.tse.jus.br/hotSites/pesquisas-eleitorais/candidatos.html">here.</a>
In total, there are 1,653,604 candidates in municipal elections from 2000-2012 and 1,652,685 had gender reported in the officially provided data. I simply take the first characters in the reported names that preceded a space and count the reported genders associated with that name. This yielded 74,650 unique first names which make up this dataset. There are errors in this data set, but if you are matching from a well-formed list of names than this should work fairly well as a prediction of the gender of the names in a different list with minimal manipulation of the original dataset.
提供机构:
Harvard Dataverse
创建时间:
2015-06-15



