Researcher: Zolani Xulu, University of the Witwatersrand, Johannesburg
Supervisor: Dr. Martins Arasomwan, University of the Witwatersrand, Johannesburg

This study introduces a hybrid machine learning framework that combines a Graph Attention Network (GAT), k-means clustering, and XGBoost enhanced with the Synthetic Minority Oversampling Technique (SMOTE) to uncover hidden tax fraud patterns. The GAT is employed to learn node embeddings that capture both the individual features of taxpayers and their interconnections within a tax
network. These embeddings are then clustered using k-means to reveal unusual behavioral patterns, while XGBoost performs final classification between fraudulent and legitimate entities. By integrating graph-based learning with clustering and ensemble methods, this approach enhances fraud detection
accuracy and interpretability. The results demonstrate that hybrid graph-driven models outperform traditional systems in identifying complex and previously unseen fraud behaviors, offering tax authorities a powerful data-driven tool for improving compliance and reducing revenue losses.