five

"Amazon"

收藏
DataCite Commons2025-05-14 更新2025-05-17 收录
下载链接:
https://ieee-dataport.org/documents/amazon
下载链接
链接失效反馈
官方服务:
资源简介:
"Graph Neural Networks (GNNs) have become the predominant approach for graph fraud detection due to their intrinsic capability to handle graph-structured data and effectively capture complex relational patterns in fraudulent behaviors. However, existing GNN-based graph fraud detection models face limitations: homophily-based models struggle with handling heterogeneous relationships in fraud graphs, while heterophily-based models typically model only a single attribute- or structural-space, leading to constrained detection performance. To address these issues, this paper introduces DualH-FDNet, a semi-supervised graph fraud detection model based on dual-space heterogeneous relation analysis. This model represents user relationships as multi-relational heterogeneous directed graphs and employs a multi-layer graph convolutional architecture. Each convolutional layer consists of three modules: (1) Heterogeneity Learning Module: Utilize the label information of labeled nodes in relational subgraphs to learn heterogeneity separately in the attribute-space and structural-space, and achieve feature interaction of dual-space heterogeneity through a weighted fusion strategy. (2) Cross-Space Graph Aggregation Module: It computes attention weights based on fused heterophily representations and updates node representations via multi-relational graph aggregation. (3) Prototype-Guided Classification Module: It constructs category prototypes using labeled node representations and labels, guiding the classification of unlabeled nodes through prototype learning. Additionally, to tackle the challenges of scarce labeled data and label imbalance, the model utilizes balanced sampling strategies for semi-supervised training. Experimental results show that on the YelpChi and Amazon datasets, DualH-FDNet improves Recall by 0.9626% and 0.6444%, respectively, and AUC by 0.8594% and 0.1479% compared to the best-performing baseline models among nine comparative models. This study offers a novel solution for fraud detection in complex heterogeneous graph environments. The code and data are available at https:\/\/github.com\/AyomF\/DualH-FDNet."
提供机构:
IEEE DataPort
创建时间:
2025-05-14
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
Amazon数据集是一个用于图神经网络欺诈检测研究的标准数据集,包含用户对乐器的评论数据,用户被标记为欺诈或良性,并定义了三种基于产品、评分和文本相似性的关系类型。该数据集共有11,944个节点,欺诈节点占比为6.87%,适用于评估半监督图欺诈检测模型的性能。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作