Journal of Cardiobiology
Research Article
Evaluating the Effectiveness of Machine Learning for Heart Disease Prediction in Healthcare Sector
Krishna Madhav Jha*, Vasu Velaga, KishanKumar, Routhu, Gangadhar Sadaram, Suneel Babu Boppana and Niharika Katnapally
1Topbuild Corp, Sr Business Analyst
2Cintas Corporation, SAP Functional Analyst
3ADP, Senior Solution Architect
4Bank of America, VP DevOps/ OpenShift Admin Engineer
5iSite Technologies, Project Manager
6Pyramid Consulting, Tableau Developer
2Cintas Corporation, SAP Functional Analyst
3ADP, Senior Solution Architect
4Bank of America, VP DevOps/ OpenShift Admin Engineer
5iSite Technologies, Project Manager
6Pyramid Consulting, Tableau Developer
*Address for Correspondence: Krishna Madhav Jha, Topbuild Corp, Sr Business Analyst.
E-mail Id: krishna22883@gmail.com
Submission: 02 January, 2025
Accepted: 29 January, 2025
Published: 31 January, 2025
Copyright: © 2025 Jha KM, et al. This is an open access article
distributed under the Creative Commons Attribution License, which permits
unrestricted use, distribution, and reproduction in any medium, provided the
original work is properly cited.
Keywords: Heart Disease Prediction; Healthcare Sector; Predictive
Models; Cardiovascular Disease; Machine Learning; Genetic
algorithm-support vector machine (GA-SVM); K-Nearest Neighbors
(KNN); Root Mean Square Error (RMSE); Artificial Neural Network
(ANN); Random Forest (RF); Decision Tree (DT); Receiver-operating
characteristics curve (ROC)
Abstract
Heart disease is still one of the world’s top causes of mortality.
Thus, prevention and effective treatment depend on early detection.
This study uses the Cleveland Heart Disease Dataset to examine how
ML techniques can be used in the prediction of heart disease. By
removing outliers, encoding categorical data, handling missing values,
and scaling features, the dataset was prepared for further processing.
There was an 80:20 split between the data sets used for training and
testing. Data was collected and used to train and assess a number
of classification models. These models included DT, SVM, RF, and
ANN. In comparison to the other models, the ANN performed quite
well, achieving 86% accuracy, 86% precision,84% recall, and 83%F1-
score. In contrast, DT, SVM, and RF showed lower performance across
all metrics, with ANN proving to be the most reliable for heart disease
prediction. The study concludes that ANN offers the highest predictive
capability, making it a promising tool for early heart disease detection.
Future research could explore the incorporation of additional features,
such as lifestyle factors or genetic data, to enhance model accuracy.