Use of Machine Learning Techniques in the Prediction of Heart Disease

Ieee account.

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • J Healthc Eng
  • v.2022; 2022

Logo of jhe

Machine Learning Technology-Based Heart Disease Detection Models

Umarani nagavelli.

1 Dayananda Sagar Research Foundation, University of Mysore (UoM), Mysore, Karnataka, India

Debabrata Samanta

2 Department of Computer Science, CHRIST (Deemed to be) University, Bangalore, Karnataka, India

Partha Chakraborty

3 Department of Computer Science and Engineering, Comilla University, Cumilla-3506, Bangladesh

Associated Data

The heart disease data that support the findings of this study are available on request from the corresponding author.

At present, a multifaceted clinical disease known as heart failure disease can affect a greater number of people in the world. In the early stages, to evaluate and diagnose the disease of heart failure, cardiac centers and hospitals are heavily based on ECG. The ECG can be considered as a regular tool. Heart disease early detection is a critical concern in healthcare services (HCS). This paper presents the different machine learning technologies based on heart disease detection brief analysis. Firstly, Naïve Bayes with a weighted approach is used for predicting heart disease. The second one, according to the features of frequency domain, time domain, and information theory, is automatic and analyze ischemic heart disease localization/detection. Two classifiers such as support vector machine (SVM) with XGBoost with the best performance are selected for the classification in this method. The third one is the heart failure automatic identification method by using an improved SVM based on the duality optimization scheme also analyzed. Finally, for a clinical decision support system (CDSS), an effective heart disease prediction model (HDPM) is used, which includes density-based spatial clustering of applications with noise (DBSCAN) for outlier detection and elimination, a hybrid synthetic minority over-sampling technique-edited nearest neighbor (SMOTE-ENN) for balancing the training data distribution, and XGBoost for heart disease prediction. Machine learning can be applied in the medical industry for disease diagnosis, detection, and prediction. The major purpose of this paper is to give clinicians a tool to help them diagnose heart problems early on. As a result, it will be easier to treat patients effectively and avoid serious repercussions. This study uses XGBoost to test alternative decision tree classification algorithms in the hopes of improving the accuracy of heart disease diagnosis. In terms of precision, accuracy, f1-measure, and recall as performance parameters above mentioned, four types of machine learning (ML) models are compared.

1. Introduction

Cardiovascular disease (CVD) is a type of heart disease that continues to be a major cause of death worldwide, accounting for over 30% of all deaths. If nothing is done, the total number of fatalities in the world is anticipated to rise to 22 million by 2030. Plaques on arterial walls can obstruct blood flow, resulting in a heart attack or stroke. Heart disease is caused due to various risk factors such as physical inactivity, unhealthy diet, and the effective use of alcohol and tobacco [ 1 , 2 ]. The abovementioned factors are reduced by adopting a good daily lifestyle, namely, reducing salt in the diet, consumption of vegetables and fruits, practicing physical activity regularly, and discontinuing alcohol and tobacco use, which helps to minimize the risk of heart disease [ 3 ]. The solution to overcome these problems is to use the collection of patient records from different health care centers and hospitals. For getting the results and seeking another opinion from an experienced doctor the decision support system is used. The unnecessary test conductions are avoided by this technique for diagnosis, thereby saving money and time [ 4 , 5 ]. Recently, a system of hospital management was utilized for managing the health care or patient data which means more data are produced by these systems. For predicting heart disease, the DSS utilizing the NB (Naïve Bayes) algorithm was developed. A web application is constructed to obtain the application and user input, and it retrieves key features relating to heart disease from a historical database (the Cleveland data set) [ 6 , 7 ].

More number of neurohormonal regulatory mechanisms are triggered in the initial stages of heart failure disease (HFD). In a short duration, these compensatory mechanisms can cause the HFD consequences, leading to accentuated ventricular dysfunction, dyspnea on exertion, peripheral edema, pulmonary, and heart remodeling which can cause afterload and preload permanent changes. More options of treatment with HFD are given to the patient including lifestyle changes and implantable or medication devices such as a defibrillator or pacemaker. The main concern is ensuring the follow-up in this population given that hospitalization due to acute HFD decompensation is the leading cause of healthcare expenditure. Statistics and studies show that heart diseases are the most significant issue faced by people particularly HFD [ 8 , 9 ]. For various diseases, early diagnosis and detection of cardiac disease is the first step in care and treatment.

The HFD is now an emerging disorder for diseases such as hypertension, insomnia, and heart disease among others. The HFD detection on ECG is completed through variations detection in duration of heart beats from the time interval from 1 wave of PQRST to the next wave of PQRST. For IHD early detection, an emerging and promising noninvasive diagnostic tool is MCG (Magenetocardiography). While MCG is less influenced by contact interference of electrode-skin compared to ECG, it is highly sensitive to vortex current and tangential causes through the tissue of ischemic cardiac. Despite its high signal quality, MCG interpretation is time-consuming, highly dependent on interpreting experience, and has limited appeal in clinics. As a result, clinicians would benefit from an autonomous system that can detect and localize ischemia at an early stage [ 10 ].

Early identification of heart disease of improved diagnosis and high-risk individuals using a prediction model can be recommended generally for fatality rate reduction, and decision-making is improved for further treatment and prevention. In CDSS, a prediction model is implemented and utilized to support the clinicians in assessing the heart disease risk, and appropriate treatments are provided for managing the further risk. Additionally, numerous studies have also reported that CDSS implementation can improve decision quality, clinical decision making, and preventive care, respectively [ 9 , 11 ]. Coronary artery disease (CAD), also known as ischemia heart disease (IHD), is the leading cause of death in adults over the age of 35 in different countries. During the same time span, it became China's biggest cause of death. When blood flow to the heart is reduced due to coronary artery stenosis, IHD occurs. Myocardial damage can have serious consequences including ventricular arrhythmia or even sudden cardiac death due to myocardial infarction.

1.1. Major Contribution of Research

Machine learning may be used to diagnose, detect, and forecast many disorders in the medical industry. The primary purpose of this study is to give clinicians a tool to detect cardiac problems at an early stage. As a result, it will be easier to deliver appropriate treatment to patients while avoiding serious effects. This study uses XGBoost to test several decision tree classification algorithms in the hopes of improving performance in heart disease diagnosis.

The remaining paper is organized as follows: In Section 2 , literature analysis is presented. ML techniques for heart disease prediction is presented in Section 3 . Feature extraction is projected in Section 4 . Heart disease prediction using the XGBoost algorithm is elaborated in Section 5 . Results are shown in Section 6 . The concluding remarks are given in Section 7 .

2. Literature Analysis

In the system of the human heart, the heart's electrical activity is recorded by ECG with various wave forms through skin electrodes. For heart disease identification, it is a noninvasive technique which reflects heartbeat, heart rate, and cardiac health. The number of cells in the human body has no direct contact with the outer location. Moreover, they depend on the cardiovascular system for serving as a provision of transport for them. In system of cardiovascular, the fluids are two kinds are flow through it. Blood is the first fluid type. Here, the circulatory system forms the blood vessels and heart. Lymph is the second type of fluid. The structure of the lymphatic system is formed by lymph nodes and lymphatic vessels. The cardiovascular system can be formed by the combination of the lymphatic system and the vascular system [ 12 ]. In heartbeat, a heart cycle is an action series. A heart cycle typically contains both atria, with each ventricle contraction synchronized a fraction of a second later. The heart is produced and interconnected using heart muscle cells, so when one of them contracts, it causes nearby cells to excite. The muscles rest between beats in the cardiac cycle, which contributes to aerobic breathing. Two parts are there, which are further discussed in this study.

2.1. Part 1 Is ‘Systole'

It is the expression for contraction. It happens whenever the ventricles are in the stage of contracting and cause the blood to pump into the vessels of the heart with A–V valves closure and semilunar valves opening.

2.2. Part 2 Is ‘Diastole'

It is an expression for relaxation. It happens whenever the ventricles are in a stage of relaxing. This causes back-blood pressure to the valves of the closing semilunar and opening valves of A–V, respectively.

Previous studies have shown promising results for CVD automatic detection. But still, there are a few concerning problems. Firstly, studies utilizing private datasets suffer from variations of database and small-sized databases, especially for MCG studies in which larger public datasets are unavailable. However, based on public datasets, ECG studies might not accomplish equal performance if they are transferred from benchmark to clinical domain. Representatives composes the public datasets of ECG. The ECG datasets easily identify abnormal cases and would be biased to perform early diagnosis [ 13 ]. On model performance, feature selection has shown a significant impact. Past studies presented features from three categories: (a) information theory features, (b) time domain features, and (c) frequency domain features.

Every category is represented as effective and has the ability for revealing some complicated aspects of cardiac electrophysiological signals. Certain studies are carried out to evaluate the significance of different features. For solving this issue a large feature group is relatively designed that contains abovementioned two categories. Through the feature importance weight analysis, the following is determined:

  • Repolarization synchronicity of T wave is described by features as a core feature for identifying the subjects of IHD
  • Features describe that the characteristics of the magnetic pole are associated with coronary stenosis locations

Various studies are reporting that the heart disease diagnosis development based on models of ML can provide the objective of HDPM with improved performance. The 2 publicly available datasets of heart disease such as Cleveland and Statlog are mostly used for comparing the prediction models' performance between researchers. In the area of healthcare, ML-based clinical decision making has been applied recently. In machine learning, recent advances are representing discriminative classifiers' advantages for cardiac disease automatic detection. Studies have previously shown that machine learning algorithms, namely, SVM, RF (random forest), LR (logistic regression), BPNN (back propagation neural network), and MLP (multilayer perceptron) are utilized successfully for decision-making tools to predict heart disease based on individual information. Various studies revealed the hybrid model merits that achieved good performance in heart disease prediction, namely, RF with a linear model, MLP, Bayes Net (BN), majority voting of NB, and RF and two stacked SVMs, respectively [ 14 ]. Kalia Orphanou et al., in the NB classification model, the TARs (Temporal Association Rules) feature is used for diagnosing heart disease. To preprocess the data, a temporal abstraction (TA) is used and a temporal pattern mining algorithm is utilized for finding TARs by frequent temporal relationships identification between TAs. In Naïve Bayes classifier, periodic TARs are considered as features finally. With the help of possible recurrence of each TAR pattern with relevant patient history, an 82% accuracy is obtained. Theresa Princy et al. have conducted a survey on several machine learning techniques which are utilized to predict the heart diseases risk of a person depending on various attributes such as gender, age, cholesterol, and pulse rate. When increasing the accuracy of risk and attributes, the author can use the analysis of the K-nearest neighbour algorithm, Naïve Bayes, and neural network. The accuracy is increased with a low number of attributes, which is possible by using various methods [ 15 ]. Prerana et al. predicted the risk level of probabilistic analysis and classification (PAC) and heart disease completed by machine learning technique. For handling patients' records and various machine learning algorithms analysis, the map reduce programming is used and given with the graphical representation. In the cloud, this approach is available and globally accessible. Furthermore, it can be extended for determining different diseases such as cancer, diabetes, brain tumor, and so on. Shadab et al. use the technique of NB data mining for supporting the users to know the answers for predefined questions in the application of web based. For diagnosing heart disease, doctors use these intelligent decisions when NB algorithm accuracy can be improved by utilizing various techniques [ 16 ]. Figure 1 shows ML techniques for heart disease prediction.

An external file that holds a picture, illustration, etc.
Object name is JHE2022-7351061.001.jpg

System architecture.

3. ML Techniques for Heart Disease Prediction

3.1. naïve bayes weighted approach (nbwa).

The web application is the proposed system. It can be classified into two modules: admin side and user side, respectively. The admin submitted the dataset into the database in the form of an excel sheet, which was then translated into weights and stored. Clusters were formed from the records. Depending on average weight or each of the clusters is determined, attribute (vessel) classification and in the database this feature can be stored. Using provided fields, the user inputs and patient data are converted into weights, and the algorithm of NB is applied to the data. The result can be displayed to the user based on system probability. Various events' conditional and marginal probabilities are compared by the Naïve Bayes algorithm. For the given samples, this algorithm is useful for calculating the possible nearest value [ 17 ]. The Bayes theorem is useful for calculating the diagnostic probability when the patient's health is monitored based on a few symptoms. The Bayes rule is used in a variety of data mining approaches. This technique is more beneficial for developing predictive capability models and provides a variety of ways to examine and explore the data. The Naïve Bayes classifier is an appropriate scheme when a given input attribute is more. Compared to many other classification techniques this algorithm is simple, but it has better performance. The heart disease patient's characteristics are easily identified by the Naïve Bayes classifier technique. This algorithm will find the input attributes' probability during a predictable disease state [ 18 ]. Figure 2 shows ML techniques for heart disease prediction.

An external file that holds a picture, illustration, etc.
Object name is JHE2022-7351061.002.jpg

ML techniques for heart disease prediction.

3.2. Magnetocardiography-Based Ischemic Heart Disease Detection

The framework of ischemic heart disease detection using ML classifiers is shown in Figure 3 . The feature groups show that Pearson's coefficient was used to create a heatmap to investigate the association between characteristics and the target variable. Similar data points were grouped to evaluate clustering strength in order to investigate the links exhibited in the heatmap. With a mapping of the target variable, age and systolic blood pressure were grouped. This makes it easier to see the target variable's distribution. Preprocess the raw MCG signals for T-wave segmentation, filtering, and averaging [ 19 , 20 ]. Then, 3 feature categories are extracted. Few of these features would be redundant, but still they are included in the classification procedure because past studies suggest that ML algorithms like SVM are not sensitive to redundant features' presence. On the other hand, omitting significant features is more hazardous than nonimportant features.

An external file that holds a picture, illustration, etc.
Object name is JHE2022-7351061.003.jpg

Framework of ischemic heart disease detection.

4. Feature Extraction

4.1. time domain.

Time domain 18 features are summarized for describing the following:

  • Maximum cardiac current characteristics
  • Pattern of magnetic field map
  • Distribution of negative/positive magnetic poles in the TT interval is independently distributed with a normal distribution that has a mean of 0 and constant variance

In addition, 6 related features of magnetic poles are presented to describe the following: change of negative/positive pole area in T wave and displacement of negative/positive in T wave. In the following study, it was determined that they were associated with the location of stenosis. Interesting characteristics of features from 2 aspects: dynamic changes and values at the peak of the T-wave during the interval TT. For achieving this, the interval TT is partitioned into 24 shorter subwindows, from which 18 time features are extracted.

4.2. Discrete Wavelet Transform Domain

A db-4 (Daubechies 4) DWT is applied with 4 levels on each of the 36 channels. The signal is decomposed into time-frequency components through DWT. The 4 th level DWT low frequency part (A4 component, corresponding from 0 to 7.8 Hz approximately) is reconstructed utilizing IDWT (inverse DWT).

4.3. Information Theory

Based on the theory of electrophysiology, ischemic subjects' signal perturbation during the segment ST is higher than that of healthy persons. The 3 measures are selected from the following: (i) for each channel Shannon entropy, (ii) multidimensional Gini index, and (iii) entropy of SVD (singular value decomposition).

5. Heart Disease Prediction Using the XGBoost Algorithm

Develop HDPM for providing high prediction performance, absence/presence of heart disease, and to provide the present situation of a patient. The development process of HDPM is represented through flow chart which is shown in Figure 4 . Firstly, collect the datasets of heart disease. Secondly, data preprocessing is carried out for the transformation of data. Third, apply the outlier detection based DBSCAN technique for determining the outlier data provided by optimal parameters [ 21 ]. Fourth, from the training dataset, remove the detected outlier data. Fifth, for balancing the training dataset, utilize SMOTE-ENN based data balancing technique. Sixth, to learn from the training dataset and for generating HDPM using MLA based on XGBoost. The main parameter with adjustment with max depth  = 5, min childweight  = 1, gamma = 0, subsample, colsample bytree  = 0.8, and scale posweight  = 1. Finally, for evaluating the presented model performance, performance metrics are presented. Within the CDSS, HDPM is generated [ 22 , 23 ]. In this study, a 10-fold cross-validation technique is utilized for avoiding overfitting. The models are allowed by cross-validation to learn from various training datasets through repeated sampling. Therefore, data maximizing is utilized for validation, which helps for overfitting prevention. Past studies have demonstrated that the 10-cross field validation technique will be utilized for maintaining the bias variance trade-off that eventually provides a generalized model and protects against overfitting [ 24 ]. The proposed heart disease prediction model using XGBoost is shown in Figure 4 .

An external file that holds a picture, illustration, etc.
Object name is JHE2022-7351061.004.jpg

Proposed heart disease prediction model using XGBoost.

Different machine learning methods use different datasets with independent specification. Total 50 test cases are used in the prediction of heart diseases in the paper. Among these 50 test cases, 6 are false negatives, 1 is false positive, 18 are TPs, and 25 are TNs [ 25 ]. The collection of data is part of cardiovascular disease retrospective studies utilizing the recordings of multichannel MCG. There are 227 people with coronary stenosis and 347 people who are healthy in the database. There are 16 NSTEMI (non-ST-elevation myocardial infarction) instances in the sample. For the ischemic group, coronary angiography is performed. There were 227 IHD patients (left anterior descending) [ 26 ].

Several datasets are used to demonstrate the presented approach's capacity to detect HFD using ECG signals. The proposed method is tested on 38 real data recordings of ECG signals with HFD from the PhysioNet database. Consider the 5-section (P, Q , R, S, and T) segmentation issues of the ECG signal for each round [ 27 ]. It is also regarded as the three distinct zones (QRS, T , and P) that provide varying probabilities of peak-peak and all-waves time interval borderlines in the patient's ECG [ 21 , 28 , 29 ]. It has been discovered that three components of the processed ECG signal are segregated and distributed in the same way. For every patient with HFD, the ISVM-DO will identify all P, Q , R, S, and T wave peak values in the ECG waveform. As a result, the system extracts essential morphological aspects from processed ECG. Many random tests and cross-validation tests are used to train the proposed method predictor on 125 samples from the training dataset [ 30 , 31 ].

For exploring how heart illness is recognized using an ML approach, two datasets of heart diseases (Cleveland and Statlog; termed datasets I and II) are used. Although the original dataset comprises 79 raw attributes and 303 subjects, only 13 of them are utilized, and just one attribute is used as an output class. The remaining 297 subjects are used in the preprocessing step after 6 subjects were excluded from the dataset due to missing values. The XGBoost V0.81 Python library is used to implement XGBoost. Using the DBSCAN technique, remove outlier data from heart disease training datasets. The software XGBoost is used to generate HDPM [ 2 , 32 ]. To evaluate the performance of ML techniques, 4 different parameters are utilized: recall, F1-measure, accuracy, and precision. For measuring the potential of these 4 parameters a confusion matrix is utilized from the model: F n (false negative), T n (true negative), F p (false negative), and T p (true positive). Number of subjects classified correctly as “positive” (heart disease presence) is known as T p , number of subjects classified correctly as “negative” (absence/healthy heart disease) is represented as T n . Similarly, number of subjects classified incorrectly as “negative” (when they have heart disease) is represented as F n , and number of subjects classified incorrectly as “positive” when they not having heart disease is represented as F p .

6.1. Accuracy

The ratio of accurately predicted predictions by the model to all types of completed predictions in the problem classification is known as accuracy.

6.2. Precision

Precision or positive predictive is defined as the ratio of accurate positive scores ( T p ) to the total number of positive scores ( T p  +  F p ) predicted by the classification algorithm.

6.3. Recall

The recall can be defined as the ratio of accurate T p to the total T p “+” F n .

6.4. F1-Measure

The F1-measure is a function of precision and recall. F1 must be 1 for the classification algorithm's good performance and 0 for bad performance.

Different ML techniques-based heart disease detection performance is evaluated with the different parameters, and these values are shown in Table 1 .

Comparative analysis of different machine learning methods.

The graphical representation of different ML techniques to predict heart disease in terms of accuracy and precision parameters is represented in Figure 5 .

An external file that holds a picture, illustration, etc.
Object name is JHE2022-7351061.005.jpg

Different ML methods in terms of accuracy and precision.

The graphical representation of the different ML techniques to predict heart disease in terms of recall and F1-measure parameters is represented in Figure 6 .

An external file that holds a picture, illustration, etc.
Object name is JHE2022-7351061.006.jpg

Different ML methods in terms of recall and F1-measure.

From the abovementioned table and graphs, it is clear that the accuracy parameter is high in the XGBoost algorithm-based heart disease detection and low in the Naïve Bayes with weighted approach. Table 2 shows confusion metrics analysis for applying classifier.

Confusion metrics analysis for applying classifier.

7. Conclusion with Future Work

The survey on machine learning technology-based heart disease detection models is provided in this paper. Four approaches of ML models for heart disease detection are analyzed in this survey; these are the Naïve Bayes with weighted approach based prediction, 2 SVM's with XGBoost based prediction, an improved SVM (ISVM) based on duality optimization (DO) technique based prediction, and an XGBoost based prediction. According to the results analysis, the accuracy, precision, recall, and F1-measure parameters are high in the XGBoost algorithm-based heart disease detection and only accuracy is low for the Naïve Bayes with weighted approach than others, and the remaining precision, recall, and F1-measure values are low in SVM with duality optimization (DO) model. The present survey paper gives the best idea regarding different machine learning-based heart disease detection methods.This research can be updated in the future by adding more attributes to the heart disease dataset and making it more interactive for the users. It can also be carried out as a mobile application with reduced computing time and complexity. We will make changes to the system by linking it to the hospital's database.

Data Availability

Conflicts of interest.

The authors of this manuscript declare that they have no conflicts of interest.

Captcha Page

We apologize for the inconvenience...

To ensure we keep this website safe, please can you confirm you are a human by ticking the box below.

If you are unable to complete the above request please contact us using the below link, providing a screenshot of your experience.

https://ioppublishing.org/contacts/

Machine Learning Techniques for Heart Disease Prediction: A Comparative Study and Analysis

  • Original Paper
  • Published: 19 November 2020
  • Volume 11 , pages 87–97, ( 2021 )

Cite this article

heart disease prediction using machine learning research paper ieee 2021

  • Rahul Katarya   ORCID: orcid.org/0000-0001-7763-291X 1 &
  • Sunit Kumar Meena 1  

3661 Accesses

77 Citations

Explore all metrics

Nowadays, people are getting caught in their day-to-day lives doing their work and other things and ignoring their health. Due to this hectic life and ignorance towards their health, the number of people getting sick increases every day. Moreover, most of the people are suffering from a disease like heart disease. Global deaths of almost 31% population are due to heart-related disease as data contributed by the World Health Organization (WHO). So, the prediction of happening heart disease or not becomes important for the medical field. However, data received by the medical sector or hospitals is so huge that sometimes it becomes difficult to analyze. Using machine learning techniques for this prediction and handling of data can become very efficient for medical people. Hence in this study, we have discussed the heart disease and its risk factors and explained machine learning techniques. Using that machine learning techniques, we have predicted heart disease and provided a comparative analysis of the algorithms for machine learning used for the experiment of the prediction. The goal or objective of this research is completely related to the prediction of heart disease via a machine learning technique and analysis of them.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price includes VAT (Russian Federation)

Instant access to the full article PDF.

Rent this article via DeepDyve

Institutional subscriptions

heart disease prediction using machine learning research paper ieee 2021

Similar content being viewed by others

heart disease prediction using machine learning research paper ieee 2021

Artificial intelligence in disease diagnosis: a systematic literature review, synthesizing framework and future research agenda

heart disease prediction using machine learning research paper ieee 2021

Heart Disease Prediction using Machine Learning Techniques

heart disease prediction using machine learning research paper ieee 2021

Diabetes detection based on machine learning and deep learning approaches

V Ramalingam V, Dandapath A, & Karthik Raja M. Heart disease prediction using machine learning techniques: a survey. International Journal of Engineering & Technology. 2018;7(2.8):684–687. https://doi.org/10.14419/ijet.v7i2.8.10557

Global Atlas on Cardiovascular Disease Prevention and Control. Geneva. Switzerland: World Health Organization. 2011.

Gandhi M, Singh SN. Predictions in heart disease using techniques of data mining. 2015.

Palaniappan S, Awang R. "Intelligent heart disease prediction system using data mining techniques," 2008 IEEE/ACS International Conference on Computer Systems and Applications, Doha, 2008, pp. 108–115.  https://doi.org/10.1109/AICCSA.2008.4493524 .

Thomas J, Princy RT. Human heart disease prediction system using data mining techniques. 2016 International Conference on Circuit, Power, and Computing Technologies (ICCPCT). 2016.  https://doi.org/10.1109/iccpct.2016.7530265

Indhumathi S, Vijaybaskar G. “WEB-BASED HEALTH CARE DETECTION USING NAIVE BAYES ALGORITHM.” 2015.

Pahwa K, Kumar R. "Prediction of heart disease using hybrid technique for selecting features," 2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON), Mathura. 2017. p. 500–504.  https://doi.org/10.1109/UPCON.2017.8251100 .

Xu S, Zhang Z, Wang D, Hu J, Duan X, Zhu T. "Cardiovascular risk prediction method based on CFS subset evaluation and random forest classification framework," 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), Beijing. 2017. p. 228–232.  https://doi.org/10.1109/ICBDA.2017.8078813 .

Rajathi S, Radhamani G. "Prediction and analysis of Rheumatic heart disease using kNN classification with ACO," 2016 International Conference on Data Mining and Advanced Computing (SAPIENCE), Ernakulam. 2016. p. 68–73. https://doi.org/10.1109/SAPIENCE.2016.7684132 .

Saini R, Bindal N, Bansal P. "Classification of heart diseases from ECG signals using wavelet transform and kNN classifier," International Conference on Computing, Communication & Automation, Noida. 2015. p. 1208–1215. https://doi.org/10.1109/CCAA.2015.7148561 .

Jabbar MA, Deekshatulu BL, Chandra P. "Alternating decision trees for early diagnosis of heart disease," International Conference on Circuits, Communication, Control and Computing, Bangalore. 2014. p. 322–328. https://doi.org/10.1109/CIMCA.2014.7057816 .

Andreeva, Plamena. “Data Modelling and Specific Rule Generation via Data Mining Techniques.” (2006).

Jabbar MA, Deekshatulu BL, Chandra P. Classification of Heart Disease using Artificial Neural Network and Feature Subset Selection. Global journal of computer science and technology. 2013.

Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.

MATH   Google Scholar  

Pradhan A. Support vector machine-A survey. IJETAE. 2. 2012.

Karthiga A, Mary S, Yogasini M. Early Prediction of Heart Disease Using Decision Tree Algorithm. International Journal of Advanced Research in Basic Engineering Sciences and Technology. (IJARBEST) 2017.

Tan P-N, Steinbach M, Kumar V. Introduction to Data Mining. London, U.K.: Pearson; 2006.

Google Scholar  

Priya S. Comparative Study of Data Mining Classification Algorithms in Heart Disease Prediction. 2015.

Dangare C, Apte S. A Data Mining Approach for Prediction of Heart Disease Using Neural Networks. 2012.

Dwivedi AK. “Performance evaluation of different machine learning techniques for prediction of heart disease”, Springer, Computer Applications and Mathematics. 2016.

Shalet KS, Sabarinathan V, Sugumaran V, Sarath Kumar VJ. Diagnosis of Heart Disease Using Decision Tree and SVM Classifier. International Journal of Applied Engineering Research. 2015;10(68):598–602.

Beyene C, Kamat P. Survey on Prediction and Analysis the Occurrence of Heart Disease Using Data Mining Techniques. International Journal of Pure and Applied Mathematics. 2018;118(8):165–74.

Powar A, Shilvant S, Pawar V, Parab V, Shetgaonkar P, Aswale S. "Data Mining & Artificial Intelligence Techniques for Prediction of Heart Disorders: A Survey". Vision Towards Emerging Trends in Communication and Networking (ViTECoN). International Conference on. 2019. p. 1–7.

Santhana Krishnan J., Geetha S., "Prediction of Heart Disease Using Machine Learning Algorithms.", Innovations in Information and Communication Technology (ICIICT) 2019 1st International Conference on. 2019. p. 1–5.

Rahman M, Zahin MM, Islam L, "Effective Prediction On Heart Disease: Anticipating Heart Disease Using Data Mining Techniques", Smart Systems and Inventive Technology (ICSSIT) 2019 International Conference on. 2019. p. 536–541.

Habib S, Moin MB, Aziz S, Banik K, Arif H. "Heart Failure Risk Prediction and Medicine Recommendation using Exploratory Data Analysis", Advances in Science Engineering and Robotics Technology (ICASERT) 2019 1st International Conference on. 2019. p. 1–6.

Obasi T, Omair Shafiq M, "Towards comparing and using Machine Learning techniques for detecting and predicting Heart Attack and Diseases", Big Data (Big Data) 2019 IEEE International Conference on, 2019. p. 2393–2402.

Karayılan T, Kılıç O, "Prediction of heart disease using neural network", Computer Science and Engineering (UBMK) 2017 International Conference on, 2017. p. 719–723.

Mohan S, Thirumalai C, Srivastava G. Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques. Access IEEE. 2019;7:81542–54.

Article   Google Scholar  

Goel S, Deep A, Srivastava S, Tripathi A, "Comparative Analysis of various Techniques for Heart Disease Prediction", Information Systems and Computer Networks (ISCON) 2019 4th International Conference on. 2019. p. 88–94.

Download references

Author information

Authors and affiliations.

Department of Computer Science & Engineering, Delhi Technological University, Shahbad Daulatpur, Delhi, -110042, India

Rahul Katarya & Sunit Kumar Meena

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Rahul Katarya .

Ethics declarations

Conflicts of interest.

There is “NO any” direct or indirectly conflict of interest for this manuscript and with authors. Also there is no conflict of interest that are directly or indirectly related to the research may include but are not limited to the following:

Research involving Human Participants and/or Animals.

No animals and no participants were involved during this research work/manuscript.

Informed consent.

Although no participants or humans or animals or privacy data were involved in this research, and both the authors give this consent for this manuscript.

Rights and permissions

Reprints and permissions

About this article

Katarya, R., Meena, S.K. Machine Learning Techniques for Heart Disease Prediction: A Comparative Study and Analysis. Health Technol. 11 , 87–97 (2021). https://doi.org/10.1007/s12553-020-00505-7

Download citation

Received : 22 July 2020

Accepted : 29 October 2020

Published : 19 November 2020

Issue Date : January 2021

DOI : https://doi.org/10.1007/s12553-020-00505-7

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Heart disease
  • Machine learning
  • Cardiovascular disease (CVD)
  • Find a journal
  • Publish with us
  • Track your research

IMAGES

  1. (PDF) Heart Disease Prediction using Machine Learning

    heart disease prediction using machine learning research paper ieee 2021

  2. (PDF) Comparative Analysis of Machine Learning Algorithms for Heart

    heart disease prediction using machine learning research paper ieee 2021

  3. (PDF) Heart Disease Prediction using Machine Learning Algorithms

    heart disease prediction using machine learning research paper ieee 2021

  4. Heart Disease Prediction using Machine Learning

    heart disease prediction using machine learning research paper ieee 2021

  5. (PDF) PREDICTION OF HEART DISEASE BY USING MACHINE LEARNING

    heart disease prediction using machine learning research paper ieee 2021

  6. New trial shows the value of AI heart disease detection in routine practice

    heart disease prediction using machine learning research paper ieee 2021

VIDEO

  1. Heart Disease Prediction Project using Machine Learning

  2. Heart Disease Prediction

  3. heart disease prediction system

  4. Heart Disease Prediction 🫀using logistic Regression

  5. Heart disease prediction 🫀using logistic regression👨‍💻

  6. Heart Disease Prediction 🫀 using Logistics regression

COMMENTS

  1. Heart Disease Prediction using Machine Learning Techniques

    One of the main contributors to death cases globally is heart diseases. Heart illnesses have an impact on many people in the middle or elderly age which, in most instances, lead to serious health adverse effects such as strokes and heart attacks. Therefore, it is necessary to diagnose and predict heart diseases to prevent any serious health issues before they occur. In this paper, a ...

  2. Heart Disease Prediction using Machine Learning Techniques

    As per the recent study by WHO, heart related diseases are increasing. 17.9 million people die every-year due to this. With growing population, it gets further difficult to diagnose and start treatment at early stage. But due to the recent advancement in technology, Machine Learning techniques have accelerated the health sector by multiple researches. Thus, the objective of this paper is to ...

  3. Heart Disease Prediction Using Machine Learning Techniques

    Consequent to the modern world life style and the increase in heart diseases every year, people's lives are at risk. Heart diseases have become one of the most common reasons for fatalities these days, including in the young. Hence it has become very necessary to search and find the simplest and best solutions to predict the risk of getting these diseases in advance so that necessary steps ...

  4. Heart Disease Prediction using Machine Learning Techniques

    One of the main contributors to death cases globally is heart diseases. Heart illnesses have an impact on many people in the middle or elderly age which, in most instances, lead to serious health adverse effects such as strokes and heart attacks. Therefore, it is necessary to diagnose and predict heart diseases to prevent any serious health issues before they occur. In this paper, a ...

  5. Heart Disease Prediction using Machine Learning Techniques

    Heart Disease Prediction using Machine Learning Techniques. October 2021. DOI: 10.1109/ICDABI53623.2021.9655783. Conference: 2021 International Conference on Data Analytics for Business and ...

  6. An integrated Machine Learning Techniques for Accurate Heart Disease

    currently heart disease is considered among top major causes of deaths in the globe, prediction of heart disease is a serious complexity in medical data processing. Machine learning (ML) has proven the beneficial in assisting with decision-making and prediction from the massive amounts of data provided by health care industry. We found machine learning approaches being employed in recent ...

  7. Heart Disease Prediction using Machine Learning Techniques

    This research aims to foresee the odds of having heart disease as probable cause of computerized prediction of heart disease that is helpful in the medical field for clinicians and patients [].To accomplish the aim, we have discussed the use of various machine learning algorithms on the data set and dataset analysis is mentioned in this research paper.

  8. Heart Disease Prediction Using Machine Learning

    One of the main reasons for death worldwide is heart disease, and early detection of the condition can help lower the risk of having a cardiac arrest. This research paper aims to suggest a machine learning-based method for estimating the risk of developing cardiac disease. First recent advancements in the field have been reviewed and then an ML model has been implemented to work on the ...

  9. Heart Disease Prediction Using Data Analysis and Effective Machine

    Cardiovascular disease stands as a prominent global health concern, claiming a significant number of lives. The accurate prediction of such diseases poses a substantial challenge in the realm of data analysis. The heart, being the second most vital organ in the human body, necessitates dedicated efforts in medical research. Timely identification and diagnosis of heart diseases are pivotal in ...

  10. Heart Disease Prediction using Machine Learning

    We used the UCI heart prediction benchmark database for this research project, which covers 14 different heart-related parameters. In our study we also tried to find correlations between the various features found in the database with the help of standard Mechanical Learning. Methods and use them effectively in predicting the risk of heart disease.

  11. Effective Heart Disease Prediction Using Machine Learning Techniques

    The diagnosis and prognosis of cardiovascular disease are crucial medical tasks to ensure correct classification, which helps cardiologists provide proper treatment to the patient. Machine learning applications in the medical niche have increased as they can recognize patterns from data. Using machine learning to classify cardiovascular disease occurrence can help diagnosticians reduce ...

  12. Heart disease prediction using machine learning techniques

    Heart disease prediction using machine learning techniques. Apurv Garg 1, Bhartendu Sharma 1 and Rijwan Khan 2. Published under licence by IOP Publishing Ltd IOP Conference Series: Materials Science and Engineering, Volume 1022, 1st International Conference on Computational Research and Data Analytics (ICCRDA 2020) 24th October 2020, Rajpura, India Citation Apurv Garg et al 2021 IOP Conf. Ser ...

  13. Heart disease risk prediction using deep learning techniques with

    Cardiovascular diseases state as one of the greatest risks of death for the general population. Late detection in heart diseases highly conditions the chances of survival for patients. Age, sex, cholesterol level, sugar level, heart rate, among other factors, are known to have an influence on life-threatening heart problems, but, due to the high amount of variables, it is often difficult for ...

  14. Use of Machine Learning Techniques in the Prediction of Heart Disease

    Machine learning is a fiction that belongs to the computer science realm; in reality, it is an interdisciplinary subject with applications in every field. Machine learning techniques are being used in signal processing, picture and speech recognition, electronic design automation, and self-driving cars, to name a few. The goal of this paper is to determine the method that provides the highest ...

  15. Ensemble Meta-Learning using SVM for Improving Cardiovascular Disease

    A novel ensemble meta-learning framework designed to enhance cardiovascular disease (CVD) risk prediction that strategically combines the predictive power of diverse machine learning algorithms: logistic regression, K nearest neighbors, decision trees, gradient boosting, gaussian Naive Bayes and XGBoost. Cardiovascular diseases (CVDs) remain a leading cause of mortality worldwide, posing a ...

  16. Machine Learning Technology-Based Heart Disease Detection Models

    This paper presents the different machine learning technologies based on heart disease detection brief analysis. Firstly, Naïve Bayes with a weighted approach is used for predicting heart disease. The second one, according to the features of frequency domain, time domain, and information theory, is automatic and analyze ischemic heart disease ...

  17. Advanced Cloud-Based Prediction Models for Cardiovascular Disease

    Cardiovascular disease is a major cause of global mortality, and early detection and prevention are crucial for lessening the strain on the healthcare system. Over the past few years, cloud computing (CC) and Machine Learning methods have demonstrated significant promise in enhancing heart disease prediction precision and effectiveness. However, many commonly used methods are susceptible to ...

  18. Heart disease prediction using machine learning algorithms

    The research paper mainly focuses on which patient is more likely to have a heart disease based on various medical attributes. We prepared a heart disease prediction system to predict whether the patient is likely to be diagnosed with a heart disease or not using the medical history of the patient.

  19. Machine Learning Techniques for Heart Disease Prediction: A Comparative

    This can lead to heart failure. So, in this paper, we have tried to study all the risks and factors that influence on the heart and can cause cardiac disease. ... "Prediction of Heart Disease Using Machine Learning Algorithms.", Innovations in Information and Communication Technology (ICIICT) 2019 1st International Conference on. 2019. p. 1-5 ...

  20. (PDF) Using Machine Learning for Heart Disease Prediction

    This prediction is an area that is widely researched. Our paper is part of the research on the detection and prediction of heart disease. It is based on the application of Machine Learning ...

  21. (PDF) Prediction of Cardiovascular Disease using Machine Learning

    Devadkar (PhD) "Prediction of Heart Disease Using Machine Learning", ICECA 2018, IEEE Xplore ISBN:978-1-5386-0965- 1. [21] Sonakshi Harjai & Sunil Kumar Khatri, "An Intelligent Clinical

  22. A Novel Study on Machine Learning Algorithm‐Based Cardiovascular

    The performance of the proposed ML algorithm was estimated using numerous conditions to recognize the best suitable machine learning algorithm in the class of models. The RF algorithm had the highest accuracy of prediction, sensitivity, and recursive operative characteristic curve of 85.01%, 92.11%, and 87.73%, respectively, for CVD.