five

Brazilian Politician dataset for Record Linkage

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7957491
下载链接
链接失效反馈
官方服务:
资源简介:
The Brazilian political dataset from the Tribunal Superior Eleitoral (TSE) is a comprehensive and valuable resource. It contains information about Brazilian politicians, including their names, political parties, electoral districts, and personal data such as race, gender, address, place, and date of birth. The TSE has published a version of this dataset every two years from 1992 until now, updating the information of the politicians. The structure of the TSE dataset can be utilized to create rich and meaningful datasets for record linkage projects. This means the data can link information from different versions (presented over the year) and create a more comprehensive understanding of politician data evolution. Observing the data's evolution makes it possible to identify patterns and connections that might not be apparent otherwise. This can also aid in identifying irregularities or illegal activities related to political campaigns. Overall, the TSE dataset's structure lends itself well to record linkage and populational projects, making it a valuable resource for researchers and analysts. We use the politician TSE dataset to build a dataset that can be used in the record linkage and privacy-preserving record linkage context. Moreover, we leverage the modification/updates in the politician's personal information (reflected in the dataset) over time to build our dataset. For example, a politician can marry and change his/her name, or the record could be inserted with a typo in the political affiliations. We created a dataset for the record linkage application by utilizing the variations in the original TSE dataset, which resulted in real-world linkage errors. This dataset can be useful for training classifiers and measuring the accuracy of both record linkage and privacy-preserving record linkage.
创建时间:
2023-06-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作