Database of Occupational Benefits Costs in Brazil (Period from March 2022 to March 2024)
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/88r8dcffzg
下载链接
链接失效反馈官方服务:
资源简介:
We used the Knowledge Discovery in Databases (KDD) method to create the database.
In the first KDD stage, we selected 25 files containing benefit data issued by INSS (National Social Security Institute). These files cover March 2022 to March 2024, representing the only period available for download on the Brazilian Open Data portal at the time of our research. We exported all these data files to the SQL Server 2020 database using the Microsoft SQL Server Management Studio's import and export wizard. Each file generated a distinct table.
A query to the tables revealed they contained all social security benefits issued by INSS. For this research, we only focused on occupational benefits (B91, B92, B93, and B94). Thus, in the first step of the second KDD stage, we excluded benefits not related to work-related diseases and accidents. Still, within data preprocessing, we equalized the number of columns in each table. To do this, we removed columns not present in all tables or those irrelevant for analysis, such as "Meio pagamento" and "Banco."
In the data transformation stage, we created columns for the country's region and the benefit code. We populated the region column based on the benefit's state of issuance. Subsequently, we filled the benefit code column according to the benefit's name.
Finally, we exported all tables into a single table, "tbbeneficios," within the DBCustos database. This resulted in a table with over 18 million rows and a backup file of approximately 5 GB. The Table 1 presents the structure of the "tbbeneficios" table.
创建时间:
2025-12-15



