UlrikKoren/GreatShoes_PII_Dataset
收藏GreatShoes PII Dataset
概述
GreatShoes PII Dataset 是一个用于研究目的的合成数据集,旨在评估个人身份信息(PII)修订模型的性能。该数据集模拟了虚构鞋店“Great Shoes”的客户支持交互,包含多种类型的PII,如姓名、订单号、电话号码、地址和电子邮件。
数据集描述
数据集包含客户与支持代理的交互,嵌入了PII。每个条目包括一系列客户消息和相应的支持代理响应。数据集分为两个主要文件:
- GreatShoes_PII_Dataset.csv: 包含嵌入PII的客户-支持交互。
- pii_used_in_finetune.csv: 列出每个条目中存在的PII元素,基于索引。
示例行
以下是 GreatShoes_PII_Dataset.csv 的前几行示例:
plaintext customer_message_1;response_1;customer_message_2;response_2;customer_message_3;response_3;name "Can I return a pair of shoes I bought over a month ago? They are still unworn and with tags."; "Im sorry, Ram Watanabe, but our return policy allows for returns within 30 days of purchase if the product is unworn, unwashed, and still has its tags attached. Unfortunately, if its been over a month, we wont be able to accept a return."; "What if the shoes were a gift and I just received them?"; "If the shoes were a gift, please provide the order number or the purchasers information. We understand these situations and will do our best to assist you, Ram Watanabe.
Best regards, Great Shoes.";;;Ram Watanabe "How do I get a return label for my order: ON-805533?"; "To obtain a return label for your order, Justin Weber, please visit our website at www.greatshoes.com/return-label. Enter your order number and follow the instructions to print your free return label."; "I printed the label. Do I need to pay for shipping?"; "No, Justin Weber, you dont need to pay for shipping. Just attach the return label to your package and drop it off at the nearest post office. Shipping is on us!
Best wishes, Great Shoes.";;;Justin Weber "I received the wrong size for order: ON-712976. How can I exchange them?"; "I apologize for the inconvenience, Arthur Luo. To exchange the shoes for the correct size, please initiate a return on our website and specify the size you need. We offer free returns and shipping for situations like this."; "Thanks! And how long will the exchange process take?"; "Once we receive your returned item, Arthur Luo, the exchange process typically takes 5-7 business days. Well expedite your exchange to ensure you get the correct size as soon as possible.
Thank you for your understanding, Great Shoes.";;;Arthur Luo
PII元素
pii_used_in_finetune.csv 文件包含与每个条目关联的PII元素。以下是其内容的示例:
plaintext name,order number,phone,address,email Ram Watanabe,,,, Justin Weber,ON-805533,,, Arthur Luo,ON-712976,,, Sushila Martinez,ON-689959,,,
使用场景
该数据集可用于评估和微调设计用于客户服务上下文中PII修订的模型。它提供了一个现实场景,其中客户支持交互包含需要识别和修订的敏感信息。
许可证
GreatShoes PII Dataset 根据 Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) 许可证分发。这意味着您可以自由使用、分发和修改数据集,用于非商业目的,前提是您给予适当的信用并指示是否进行了更改。



