Journal of Cardiobiology

Research Article

Evaluating the Effectiveness of Machine Learning for Heart Disease Prediction in Healthcare Sector

Krishna Madhav Jha*, Vasu Velaga, KishanKumar, Routhu, Gangadhar Sadaram, Suneel Babu Boppana and Niharika Katnapally

1Topbuild Corp, Sr Business Analyst
2Cintas Corporation, SAP Functional Analyst
3ADP, Senior Solution Architect
4Bank of America, VP DevOps/ OpenShift Admin Engineer
5iSite Technologies, Project Manager
6Pyramid Consulting, Tableau Developer
*Address for Correspondence: Krishna Madhav Jha, Topbuild Corp, Sr Business Analyst. E-mail Id: krishna22883@gmail.com
Submission: 02 January, 2025 Accepted: 29 January, 2025 Published: 31 January, 2025
Copyright: © 2025 Jha KM, et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: Heart Disease Prediction; Healthcare Sector; Predictive Models; Cardiovascular Disease; Machine Learning; Genetic algorithm-support vector machine (GA-SVM); K-Nearest Neighbors (KNN); Root Mean Square Error (RMSE); Artificial Neural Network (ANN); Random Forest (RF); Decision Tree (DT); Receiver-operating characteristics curve (ROC)

Abstract

Heart disease is still one of the world’s top causes of mortality. Thus, prevention and effective treatment depend on early detection. This study uses the Cleveland Heart Disease Dataset to examine how ML techniques can be used in the prediction of heart disease. By removing outliers, encoding categorical data, handling missing values, and scaling features, the dataset was prepared for further processing. There was an 80:20 split between the data sets used for training and testing. Data was collected and used to train and assess a number of classification models. These models included DT, SVM, RF, and ANN. In comparison to the other models, the ANN performed quite well, achieving 86% accuracy, 86% precision,84% recall, and 83%F1- score. In contrast, DT, SVM, and RF showed lower performance across all metrics, with ANN proving to be the most reliable for heart disease prediction. The study concludes that ANN offers the highest predictive capability, making it a promising tool for early heart disease detection. Future research could explore the incorporation of additional features, such as lifestyle factors or genetic data, to enhance model accuracy.