PLUE
收藏arXiv2023-05-12 更新2024-06-21 收录
下载链接:
https://github.com/JFChi/PLUE
下载链接
链接失效反馈官方服务:
资源简介:
PLUE是一个专注于隐私政策语言理解的评估基准,由加州大学洛杉矶分校创建。该数据集包含六个任务,涵盖文本分类、问答、语义解析和命名实体识别等多个领域,旨在帮助研究人员和实践者更好地理解和分析隐私政策。数据集内容丰富,包括从网站和移动应用隐私政策中收集的大量文本,用于支持特定领域的语言模型预训练。数据集的创建过程涉及从多个来源收集隐私政策,并进行预处理以适应模型训练。PLUE的应用领域主要集中在隐私政策的自动化分析,旨在解决隐私政策理解中的复杂性和长度问题,提高隐私保护的透明度和效率。
PLUE is an evaluation benchmark focused on privacy policy language understanding, developed by the University of California, Los Angeles. This dataset comprises six tasks spanning multiple domains including text classification, question answering, semantic parsing, and named entity recognition, with the goal of assisting researchers and practitioners in better understanding and analyzing privacy policies. It features rich content, including a large corpus of texts collected from privacy policies of websites and mobile applications, to support pre-training of domain-specific language models. The dataset's creation process involves collecting privacy policies from diverse sources and performing preprocessing to meet the requirements of model training. The primary application scenarios of PLUE center on automated analysis of privacy policies, aiming to address the complexity and length-related challenges in privacy policy understanding, and enhance the transparency and efficiency of privacy protection.
提供机构:
加州大学洛杉矶分校
创建时间:
2022-12-20



