SaPGAN

Name: SaPGAN
Creator: IEEE DataPort
Published: 2025-02-24 08:54:46
License: 暂无描述

DataCite Commons2025-02-24 更新2025-04-16 收录

下载链接：

https://ieee-dataport.org/documents/sapgan-0

下载链接

链接失效反馈

官方服务：

资源简介：

With the rapid advancement of large language models (LLMs), Model-as-a-Service (MaaS) has emerged as a powerful paradigm, enabling providers to deliver pre-trained models, computational resources, and database management within a unified platform.However, the MaaS pattern raises critical data security concerns, especially regarding the risk of data leakage during transmission. Existing privacy-preserving fine-tuning approaches apply differential privacy (DP) by perturbing text embeddings before transmission. Nevertheless, these approaches rely on single noise-additions, referred to as "rigid perturbation". These mechanisms often compromise semantic integrity, resulting in suboptimal fine-tuning performance.To address the limitation, we propose SaPGAN, the first framework that leverages a sequence-to-sequence Generator with a transformer-based Discriminator for adaptive perturbation in LLM privacy preservation. Through adversarial training, the Generator produces perturbed texts that retain high semantic coherence with the original contents. A Sampler further optimizes privacy by selecting tokens to replace, allowing the framework to effectively balance privacy protection and semantic integrity. By applying such adaptive semantically-aware perturbation, SaPGAN strikes an optimal balance between fine-tuning performance and privacy preservation. Experiments demonstrate substantial improvements in text classification and generation tasks with empirical privacy increased by up to 129.31\% at the highest utility accuracies and reduced perturbation time by up to 26.83\%.

提供机构：

IEEE DataPort

创建时间：

2025-02-24

5,000+

优质数据集

54 个

任务类型

进入经典数据集