DeepFake Paraphrased
Source: NIAID Data Ecosystem. Indexed: 2026-05-02
Download link:
https://zenodo.org/record/14886916
Resource description:
Dataset Overview:
------------------
This dataset contains original and paraphrased text derived from the DeepFake Dataset / MAGE.
It includes both human-written and machine-generated text; the machine-generated text was produced using LLAMA, NeoX20B, OPT30B, GPT.
Additionally, a portion of the AI-generated text has been paraphrased using a Pegasus-based paraphraser.
Paraphrasing Information:
--------------------------
50% of the AI-generated text in the dataset is paraphrased, while the other half remains unchanged as originally generated.
The selection of AI text for paraphrasing is random.
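A random 50% selection like the one described above could be sketched as follows. This is only an illustration under stated assumptions: the description does not say how the split was drawn, so the function name `select_for_paraphrasing`, the fixed seed, and the choice to pick exactly half of the AI rows (rather than flipping a coin per row) are all hypothetical.

```python
import random

def select_for_paraphrasing(labels, fraction=0.5, seed=0):
    """Return one 0/1 flag per row; only AI rows (label 0) are eligible.

    Hypothetical helper: the dataset authors' actual procedure is not documented.
    """
    rng = random.Random(seed)  # seeded so the split is reproducible
    ai_indices = [i for i, label in enumerate(labels) if label == 0]
    n_selected = int(len(ai_indices) * fraction)
    chosen = set(rng.sample(ai_indices, n_selected))
    return [1 if i in chosen else 0 for i in range(len(labels))]

# Made-up labels: 1 = human-generated, 0 = AI-generated
labels = [1, 0, 0, 1, 0, 0]
flags = select_for_paraphrasing(labels)
print(flags)
```

With four AI rows in the made-up input, two are flagged for paraphrasing and the human rows are never flagged.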
Dataset Structure:
------------------
Each file in the dataset contains the following required columns:
original_text: The non-paraphrased text.
label:
  - 1: Human-generated text
  - 0: AI-generated text
is_selected:
  - 1: The AI text was selected for paraphrasing
  - 0: The text was not selected for paraphrasing
text: The paraphrased version of the AI-generated text if it was selected for paraphrasing; otherwise the original, non-paraphrased text.
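The column layout above can be checked and used to split a file into its three groups, as in the minimal sketch below. It assumes the files are CSV with a header row, which the description does not confirm; the helper names `load_rows` and `split_rows` and the sample rows are made up for illustration.

```python
import csv
import io

REQUIRED_COLUMNS = {"original_text", "label", "is_selected", "text"}

def load_rows(handle):
    """Read rows from a CSV handle, failing early if a required column is absent."""
    reader = csv.DictReader(handle)
    missing = REQUIRED_COLUMNS - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing required columns: {sorted(missing)}")
    return list(reader)

def split_rows(rows):
    """Group rows into human, unparaphrased AI, and paraphrased AI text."""
    groups = {"human": [], "ai_original": [], "ai_paraphrased": []}
    for row in rows:
        if int(row["label"]) == 1:          # 1 = human-generated
            groups["human"].append(row)
        elif int(row["is_selected"]) == 1:  # AI text chosen for paraphrasing
            groups["ai_paraphrased"].append(row)
        else:                               # AI text left as generated
            groups["ai_original"].append(row)
    return groups

# Made-up sample rows standing in for a real file from the dataset
SAMPLE = """original_text,label,is_selected,text
A human wrote this.,1,0,A human wrote this.
The model produced this.,0,0,The model produced this.
The model produced that.,0,1,That was produced by the model.
"""

rows = load_rows(io.StringIO(SAMPLE))
groups = split_rows(rows)

# Rows not selected for paraphrasing should carry the same text in both columns
unchanged_ok = all(
    r["text"] == r["original_text"] for r in rows if int(r["is_selected"]) == 0
)
print({name: len(members) for name, members in groups.items()}, unchanged_ok)
```

To try it on a real file, replace `io.StringIO(SAMPLE)` with an opened file handle, for example `open(path, newline="", encoding="utf-8")`, where `path` is wherever the download was saved.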
Created:
2025-02-19



