PresidioMCP: Restricted FastMCP Server for Detecting and Pseudonymizing PII in Tabular Data
收藏International Potato Center2025-01-01 更新2026-05-11 收录
下载链接:
https://data.cipotato.org/citation?persistentId=doi:10.21223/P3/DX5MWO
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains restricted research software implementing PresidioMCP, a FastMCP server that supports privacy-preserving processing of tabular datasets by detecting potential personally identifiable information (PII) and pseudonymizing selected columns. The server provides tools to (1) list columns from local CSV/Excel files, (2) detect PII presence per column by sampling rows and running Presidio-based structured analysis, and (3) pseudonymize selected columns using consistent per-column pseudonyms written to a new output file. The implementation includes a helper Pseudonymizer that ensures stable replacements within each entity/column type. The deposit also includes pytest-based automated tests that validate the end-to-end workflow (column listing, PII detection, and anonymization) and error handling. The deposited files are restricted because they currently contain non-public software.
创建时间:
2025-01-01



