TY - GEN
T1 - GAT-RWOS
T2 - 15th IEEE International Conference on Knowledge Graphs, ICKG 2024, Co-located with 24th IEEE International Conference on Data Mining, ICDM 2024
AU - Rustamov, Zahiriddin
AU - Lakas, Abderrahmane
AU - Zaki, Nazar
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Class imbalance poses a significant challenge in machine learning (ML), often leading to biased models favouring the majority class. In this paper, we propose GAT-RWOS, a novel graph-based oversampling method that combines the strengths of Graph Attention Networks (GATs) and random walk-based oversampling. GAT-RWOS leverages the attention mechanism of GATs to guide the random walk process, focusing on the most informative neighbourhoods for each minority node. By performing attention-guided random walks and interpolating features along the traversed paths, GAT-RWOS generates synthetic minority samples that expand class boundaries while preserving the original data distribution. Extensive experiments on a diverse set of imbalanced datasets demonstrate the effectiveness of GAT-RWOS in improving classification performance, outperforming state-of-the-art oversampling techniques. The proposed method has the potential to significantly improve the performance of ML models on imbalanced datasets and contribute to the development of more reliable classification systems. Code is available at https://github.com/zahiriddin-rustamov/gat-rwos.
AB - Class imbalance poses a significant challenge in machine learning (ML), often leading to biased models favouring the majority class. In this paper, we propose GAT-RWOS, a novel graph-based oversampling method that combines the strengths of Graph Attention Networks (GATs) and random walk-based oversampling. GAT-RWOS leverages the attention mechanism of GATs to guide the random walk process, focusing on the most informative neighbourhoods for each minority node. By performing attention-guided random walks and interpolating features along the traversed paths, GAT-RWOS generates synthetic minority samples that expand class boundaries while preserving the original data distribution. Extensive experiments on a diverse set of imbalanced datasets demonstrate the effectiveness of GAT-RWOS in improving classification performance, outperforming state-of-the-art oversampling techniques. The proposed method has the potential to significantly improve the performance of ML models on imbalanced datasets and contribute to the development of more reliable classification systems. Code is available at https://github.com/zahiriddin-rustamov/gat-rwos.
KW - graph attention networks
KW - imbalanced data
KW - machine learning
KW - oversampling
KW - random walks
UR - https://www.scopus.com/pages/publications/86000220278
UR - https://www.scopus.com/pages/publications/86000220278#tab=citedBy
U2 - 10.1109/ICKG63256.2024.00047
DO - 10.1109/ICKG63256.2024.00047
M3 - Conference contribution
AN - SCOPUS:86000220278
T3 - Proceedings - 2024 IEEE International Conference on Knowledge Graph, ICKG 2024
SP - 317
EP - 324
BT - Proceedings - 2024 IEEE International Conference on Knowledge Graph, ICKG 2024
A2 - Chen, Huajun
A2 - Fensel, Anna
A2 - Zhu, Xingquan
A2 - Wattenhofer, Roger
A2 - Wu, Xindong
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 11 December 2024 through 12 December 2024
ER -