five

PresidioMCP: Restricted FastMCP Server for Detecting and Pseudonymizing PII in Tabular Data

收藏
DataCite Commons2025-12-16 更新2026-05-03 收录
下载链接:
https://data.cipotato.org/citation?persistentId=doi:10.21223/P3/DX5MWO
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains restricted research software implementing PresidioMCP, a FastMCP server that supports privacy-preserving processing of tabular datasets by detecting potential personally identifiable information (PII) and pseudonymizing selected columns. The server provides tools to (1) list columns from local CSV/Excel files, (2) detect PII presence per column by sampling rows and running Presidio-based structured analysis, and (3) pseudonymize selected columns using consistent per-column pseudonyms written to a new output file. The implementation includes a helper Pseudonymizer that ensures stable replacements within each entity/column type. The deposit also includes pytest-based automated tests that validate the end-to-end workflow (column listing, PII detection, and anonymization) and error handling. The deposited files are restricted because they currently contain non-public software.
提供机构:
International Potato Center
创建时间:
2025-12-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作