Korean Voice Phishing Detection Dataset with Multilingual Back-Translation and SMOTE Augmentations
收藏IEEE2026-04-17 收录
下载链接:
https://ieee-dataport.org/documents/korean-voice-phishing-detection-dataset-multilingual-back-translation-and-smote
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains original and augmented versions of the Korean Call Content Vishing (KorCCVi v2) dataset used in the study titled, Enhancing Voice Phishing Detection Using Multilingual Back-Translation and SMOTE: An Empirical Study. The dataset addresses challenges of data imbalance and asymmetry in Korean voice phishing detection, leveraging data augmentation techniques such as multilingual back-translation (BT) with English, Chinese, and Japanese as intermediate languages, and Synthetic Minority Oversampling Technique (SMOTE). The augmented dataset provides a valuable resource for machine learning (ML) and deep learning (DL) applications in natural language processing (NLP) and cybersecurity research.
提供机构:
Park, Dong-Joo; Moussavou Boussougou, Milandu Keith; Hamandawana, Prince



