TY - GEN
T1 - Cardiovascular Disease Prediction using Ensemble Learning Techniques
T2 - 19th IEEE International Colloquium on Signal Processing and Its Applications, CSPA 2023
AU - Rustamov, Zahiriddin
AU - Rustamov, Jaloliddin
AU - Sultana, Most Sarmin
AU - Ywei, Jeanne
AU - Balakrishnan, Vimala
AU - Zaki, Nazar
N1 - Funding Information:
The authors would like to thank the United Arab Emirates University for funding this work under UAEU-ZU Joint Research Grant G00003715 (Fund No.: 12T034) through the Emirates Center for Mobility Research.
Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Cardiovascular diseases (CVDs) are a group of disorders affecting the heart or involving constricted blood vessels. Early detection increases the likelihood of survival. Machine learning methods have therefore emerged that can process and analyse vast quantities of complex medical data and predict diseases, including CVDs, more accurately. However, owing to factors such as overfitting and bias, a single classifier cannot ensure optimum prediction. This study therefore proposes a stacking ensemble classifier, which combines several single classifiers to produce an optimal predictive model. The Framingham Heart Study dataset was used to train the machine learning algorithms. Exploratory data analysis indicates that CVD was more common in males and in diabetic individuals, and that individuals above the age of 65 were more susceptible to CVDs. Feature selection, missing value imputation, and data sampling were performed as part of data preprocessing. The proposed stacked ensemble classifier achieved 88.33% accuracy, 89.95% precision, 86.27% recall and an 88.07% F1-score. Significance tests indicate that the proposed model performs significantly better than most of the other models evaluated in this research. Finally, a comparative analysis showed that the proposed ensemble classifier outperforms the models reported in most previous studies using the same dataset. The high F1-score indicates that the proposed model can accurately predict cases both with and without CVD.
AB - Cardiovascular diseases (CVDs) are a group of disorders affecting the heart or involving constricted blood vessels. Early detection increases the likelihood of survival. Machine learning methods have therefore emerged that can process and analyse vast quantities of complex medical data and predict diseases, including CVDs, more accurately. However, owing to factors such as overfitting and bias, a single classifier cannot ensure optimum prediction. This study therefore proposes a stacking ensemble classifier, which combines several single classifiers to produce an optimal predictive model. The Framingham Heart Study dataset was used to train the machine learning algorithms. Exploratory data analysis indicates that CVD was more common in males and in diabetic individuals, and that individuals above the age of 65 were more susceptible to CVDs. Feature selection, missing value imputation, and data sampling were performed as part of data preprocessing. The proposed stacked ensemble classifier achieved 88.33% accuracy, 89.95% precision, 86.27% recall and an 88.07% F1-score. Significance tests indicate that the proposed model performs significantly better than most of the other models evaluated in this research. Finally, a comparative analysis showed that the proposed ensemble classifier outperforms the models reported in most previous studies using the same dataset. The high F1-score indicates that the proposed model can accurately predict cases both with and without CVD.
KW - cardiovascular diseases
KW - ensemble learning
KW - machine learning
KW - prediction
KW - stacking ensemble
UR - http://www.scopus.com/inward/record.url?scp=85153706268&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85153706268&partnerID=8YFLogxK
U2 - 10.1109/CSPA57446.2023.10087730
DO - 10.1109/CSPA57446.2023.10087730
M3 - Conference contribution
AN - SCOPUS:85153706268
T3 - 2023 19th IEEE International Colloquium on Signal Processing and Its Applications, CSPA 2023 - Conference Proceedings
SP - 93
EP - 98
BT - 2023 19th IEEE International Colloquium on Signal Processing and Its Applications, CSPA 2023 - Conference Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 3 March 2023 through 4 March 2023
ER -