Educational data mining has been widely used to predict student performance and establish intervention strategies to improve that performance. Most studies have implemented machine learning algorithms for interventions but the use of data mining in appraising student performance in learning software is obscure. Furthermore, some of the studies that have explored the use of machine learning in predicting student performance in software learning have only used Random Forest, and as such, this study used the same dataset to implement 7 other algorithms and establish the most efficient. The study used two different sets of data and established that Neural Network was the most efficient with regards to the first dataset although Random Forest was the most efficient with regards to the second dataset. Both the NN graphics and RF tree diagram are presented, and the predictions from the two models also compared.
|Title of host publication||Soft Computing in Data Science - 4th International Conference, SCDS 2018, Proceedings|
|Editors||Bee Wah Yap, Azlinah Hj Mohamed, Michael W. Berry|
|Number of pages||13|
|Publication status||Published - 11 Dec 2018|
|Event||4th International Conference on Soft Computing in Data Science, SCDS 2018 - Bangkok, Thailand|
Duration: 15 Aug 2018 → 16 Aug 2018
|Name||Communications in Computer and Information Science|
|Conference||4th International Conference on Soft Computing in Data Science, SCDS 2018|
|Period||15/08/18 → 16/08/18|
Bibliographical noteFunding Information:
We are grateful to the entire SETAP project team and we appreciate Professor D. Petkovic of San Francisco State University, Prof. Rainer Todtenhoefer of Fulda University, and Professor Shihong Huang of Florida Atlantic University for their role in the project and for sharing the data with UCI Machine Learning Repository.
© Springer Nature Singapore Pte Ltd. 2019.
- Data mining
- Machine learning
- Performance prediction
- Random Forest
- Software engineering