Prediction of Stroke Disease with Demographic and Behavioural Data Using Random Forest Algorithm

Olamilekan Shobayo, Oluwafemi Zachariah, Modupe Olufunke Odusami, Bayode Ogunleye

Research output: Contribution to journalArticlepeer-review

Abstract

Stroke is a major cause of death worldwide, resulting from a blockage in the flow of blood to different parts of the brain. Many studies have proposed a stroke disease prediction model using medical features applied to deep learning (DL) algorithms to reduce its occurrence. However, these studies pay less attention to the predictors (both demographic and behavioural). Our study considers interpretability, robustness, and generalisation as key themes for deploying algorithms in the medical domain. Based on this background, we propose the use of random forest for stroke incidence prediction. Results from our experiment showed that random forest (RF) outperformed decision tree (DT) and logistic regression (LR) with a macro F1 score of 94%. Our findings indicated age and body mass index (BMI) as the most significant predictors of stroke disease incidence.
Original languageEnglish
Pages (from-to)604-617
Number of pages14
JournalPrediction of Stroke Disease with Demographic and Behavioural Data Using Random Forest Algorithm
Volume2
Issue number3
DOIs
Publication statusPublished - 2 Aug 2023

Keywords

  • stroke
  • data mining
  • machine learning
  • random forest
  • logistic regression
  • decision tree
  • classification algorithm

Fingerprint

Dive into the research topics of 'Prediction of Stroke Disease with Demographic and Behavioural Data Using Random Forest Algorithm'. Together they form a unique fingerprint.

Cite this