Abstract
Stroke is a major cause of death worldwide, resulting from a blockage in the flow of blood to different parts of the brain. Many studies have proposed a stroke disease prediction model using medical features applied to deep learning (DL) algorithms to reduce its occurrence. However, these studies pay less attention to the predictors (both demographic and behavioural). Our study considers interpretability, robustness, and generalisation as key themes for deploying algorithms in the medical domain. Based on this background, we propose the use of random forest for stroke incidence prediction. Results from our experiment showed that random forest (RF) outperformed decision tree (DT) and logistic regression (LR) with a macro F1 score of 94%. Our findings indicated age and body mass index (BMI) as the most significant predictors of stroke disease incidence.
Original language | English |
---|---|
Pages (from-to) | 604-617 |
Number of pages | 14 |
Journal | Prediction of Stroke Disease with Demographic and Behavioural Data Using Random Forest Algorithm |
Volume | 2 |
Issue number | 3 |
DOIs | |
Publication status | Published - 2 Aug 2023 |
Keywords
- stroke
- data mining
- machine learning
- random forest
- logistic regression
- decision tree
- classification algorithm