ai4privacy/open-pii-masking-500k-ai4privacy
收藏Hugging Face2025-03-19 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/ai4privacy/open-pii-masking-500k-ai4privacy
下载链接
链接失效反馈官方服务:
资源简介:
Open PII Masking 500k Ai4Privacy数据集是一个包含500k个人识别信息(PII)遮罩任务的公开数据集,用于训练和评估模型自动从文本中识别和遮罩PII信息,支持多语言,包括英语、法语、德语、意大利语、西班牙语、荷兰语、印地语和泰卢固语。数据集适用于多种机器学习任务,如文本分类、标记分类等,并可用于聊天机器人、客户支持系统、电子邮件过滤等多种场景。
The Open PII Masking 500k Ai4Privacy Dataset is a public dataset containing 500k PII masking tasks, designed for training and evaluating models to automatically identify and mask PII information in text. It supports multiple languages including English, French, German, Italian, Spanish, Dutch, Hindi, and Telugu. The dataset is applicable to various machine learning tasks such as text classification, token classification, and can be used in scenarios like chatbots, customer support systems, email filtering, and more.
提供机构:
ai4privacy



