Healthcare Cost Prediction Based on Hybrid Machine Learning Algorithms

Shujie Zou, Chiawei Chu, Ning Shen, Jia Ren

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)


Healthcare cost is an issue of concern right now. While many complex machine learning algorithms have been proposed to analyze healthcare cost and address the shortcomings of linear regression and reliance on expert analyses, these algorithms do not take into account whether each characteristic variable contained in the healthcare data has a positive effect on predicting healthcare cost. This paper uses hybrid machine learning algorithms to predict healthcare cost. First, network structure learning algorithms (a score-based algorithm, constraint-based algorithm, and hybrid algorithm) for a Conditional Gaussian Bayesian Network (CGBN) are used to learn the isolated characteristic variables in healthcare data without changing the data properties (i.e., discrete or continuous). Then, the isolated characteristic variables are removed from the original data and the remaining data used to train regression algorithms. Two public healthcare datasets are used to test the performance of the proposed hybrid machine learning algorithm model. Experiments show that when compared to popular single machine learning algorithms (Long Short Term Memory, Random Forest, etc.) the proposed scheme can obtain similar or higher prediction accuracy with a reduced amount of data.

Original languageEnglish
Article number4778
Issue number23
Publication statusPublished - Dec 2023


  • CGBN
  • healthcare costs
  • hybrid algorithm
  • regression algorithm

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • General Mathematics
  • Engineering (miscellaneous)


Dive into the research topics of 'Healthcare Cost Prediction Based on Hybrid Machine Learning Algorithms'. Together they form a unique fingerprint.

Cite this