Application of the Classification Algorithms on the Prediction of Student’s Academic Performance

Authors

  • Sourav Kumar Ghosh Department of Industrial and Production Engineering, Bangladesh University of Textiles, Dhaka 1208, Bangladesh
  • Farhatul Janan Department of Industrial and Production Engineering, Bangladesh University of Textiles, Dhaka 1208, Bangladesh
  • Ishtiyaque Ahmad University of California, Santa Barbara, California 93106, United States

DOI:

https://doi.org/10.48048/tis.2022.5070

Keywords:

SVM Classifier, Random forest classifier, Fuzzy logic, Performance prediction, Classification criteria

Abstract

Measuring student performance based on both qualitative and quantitative factors is essential because many undergraduate students could not be able to complete their degree in the recent past.  The first-year result of a student is very important because in the majority of cases this drives the students to be either motivated or demotivated. So, the first-year student performance of a renowned university in Bangladesh is investigated in this paper. This research is mainly based on finding the factors for students’ different types of results and then predicting students’ performance based on those 11 significant factors. For this purpose, 2 popular supervised machine learning algorithms have been used for classifying students’ different levels of results and predicting students’ performances, those are support vector machines (SVM) classifier and random forest classifier (RFC) which are tremendously used in classification and regression analysis. The input dataset for both training and testing were taken by merging the values obtained from 2 surveys done on students and experts using an adaptive neuro-fuzzy interference system (ANFIS). RF has outperformed SVM in predicting students’ performances. According to factor analysis, students’ effort (Factor-11) is the significant factor. This proposed model can also be applied to predict course-wise students’ performances and its precision can also be greatly improved by adding new factors.

HIGHLIGHTS

  • Identify the significant factors responsible for students’ different levels of performances
  • Apply two machine learning algorithms to classify students’ results based on the factors
  • Analyze the results obtained from the methods
  • Compare the accuracy, and find the top five factors responsible for students’ academic results

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

References

T Devasia, TP Vinushree and V Hegde. Prediction of students performance using educational data mining. In: Proceedings of the 2016 International Conference on Data Mining and Advanced Computing, Ernakulam, India. 2016, p. 91-5.

YJ Lee. Predicting students’ problem solving performance using support vector machine. J. Data Sci. 2016; 14, 231-44.

A Acharya and D Sinha. Early prediction of students performance using machine learning techniques. Int. J. Comput. Appl. 2014; 107, 37-43.

SM Hasheminejad and M Sarvmili. S3PSO: Students’ performance prediction based on particle swarm optimization. J. AI Data Min. 2019; 7, 77-96.

PA Patil and RV Mane. Prediction of students performance using frequent pattern tree. In: Proceedings of the 6th International Conference on Computational Intelligence and Communication Networks, Bhopal, India. 2014, p. 1078-82.

IE Livieris, V Tampakas, N Kiriakidou, T Mikropoulos and P Pintelas. Forecasting students’ performance using an ensemble SSL algorithm. In: MA Tsitouridou, JA Diniz and T Mikropoulos (Eds.). Technology and innovation in learning, teaching and education. Vol 993. Springer, Cham, Switzerland, 2018, p. 566-81.

F Okubo, A Shimada, T Yamashita and H Ogata. A neural network approach for students’ performance prediction. In: Proceedings of the 7th International Learning Analytics & Knowledge Conference, Vancouver BC, Canada. 2017, p. 598-9.

AB Raut and AA Nichat. Students performance prediction using decision tree technique. Int. J. Comput. Intell. Res. 2017; 13, 1735-41.

F Okubo, T Yamashita, A Shimada and S Konomi. Students’ performance prediction using data of multiple courses by recurrent neural network. In: Proceedings of the 25th International Conference on Computers in Education, Christchurch, New Zealand. 2017, p. 439-44.

ET Lau, L Sun and Q Yang. Modelling, prediction and classification of student academic performance using artificial neural networks. SN Appl. Sci. 2019; 1, 982.

V Tampakas, IE Livieris, E Pintelas, N Karacapilidis and P Pintelas. Prediction of students’ graduation time using a two-level classification algorithm. In: M Tsitouridou, JA Diniz and T Mikropoulos (Eds.). Technology and innovation in learning, teaching and education. Vol 993. Springer, Cham, Switzerland, p. 553-65.

M Chauhan and V Gupta. Comparative study of techniques used in prediction of student performance. World Sci. News. 2018; 113, 185-93.

A Daud, MD Lytras, NR Aljohani, F Abbas, RA Abbasi and JS Alowibdi. Predicting student performance using advanced learning analytics. In: Proceedings of the 26th International Conference on World Wide Web Companion, Perth, Australia. 2017, p. 415-21.

Y Abubakar and NBH Ahmad. Prediction of students’ performance in e-learning environment using random forest. Int. J. Innovat. Comput. 2017; 7, 1-5.

C Beaulac and JS Rosenthal. Predicting university students’ academic success and major using random forests. Res. High. Educ. 2019; 60, 1048-64.

MB Sanzana, SS Garrido and CM Poblete. Profiles of Chilean students according to academic performance in mathematics: An exploratory study using classification trees and random forests. Stud. Educ. Eval. 2015; 44, 50-9.

Y Ao, H Li, L Zhu, S Ali and Z Yang. The linear random forest algorithm and its advantages in machine learning assisted logging regression modeling. J. Petrol. Sci. Eng. 2018; 174, 776-89.

SK Ghosh, N Zoha and F Sarwar. A generic MCDM model for supplier selection for multiple decision makers using fuzzy TOPSIS. In: Proceedings of the 5th International Conference on Engineering Research, Innovation and Education, Sylhet, Bangladesh. 2019, p. 833-40.

D Wu, C Jennings, J Terpenny, RX Gao and S Kumara. A comparative study on machine learning algorithms for smart manufacturing: Tool wear prediction using random forests. J. Manuf. Sci. Eng. 2017; 139, 071018.

SK Ghosh and F Janan. Prediction of student’s performance using random forest classifier. In: Proceedings of the 11th Annual International Conference on Industrial Engineering and Operations Management, Singapore. 2021, p. 7088-100.

F Janan and SK Ghosh. Prediction of student’s performance using support vector machine classifier. In: Proceedings of the 11th Annual International Conference on Industrial Engineering and Operations Management, Singapore. 2021, p. 7078-88.

EM Jordaan and GF Smits. Robust outlier detection using SVM regression. In: Proceedings of the IEEE International Joint Conference on Neural Networks, Budapest, Hungary. 2004.

H Al-Shehri, A Al-Qarni, L Al-Saati, A Batoaq, H Badukhen, S Alrashed, J Alhiyafi and SO Olatunji. Student performance prediction using support vector machine and k-nearest neighbor. In: Proceedings of the 2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering, Windsor ON, Canada. 2017.

T Mahboob, S Irfan and A Karamat. A machine learning approach for student assessment in e-learning using quinlan’s C4.5, naive bayes and random forest algorithms. In: Proceedings of the 19th International Multi-Topic Conference, Islamabad, Pakistan. 2016, p. 1 - 8.

E Mooi, M Sarstedt and I Mooi-Reci. Hypothesis testing & ANOVA. In: E Mooi, M Sarstedt and I Mooi-Reci (Eds.). Market research. Springer, Singapore, 2018, p. 153-214.

MT Sow. Using ANOVA to examine the relationship between safety & security and human development. J. Int. Bus. Econ. 2014; 2, 101-6.

LA Zadeh. Fuzzy logic. Computer 1998; 21, 83-93.

C Cortes and V Vapnik. Support-vector networks. Mach. Learn. 1995; 20, 273-97.

SL Salzberg. C4.5: Programs for machine learning by J. Ross Quinlan. Morgan Kaufmann Publishers, Inc., 1993. Mach. Learn. 1994; 16, 235-40.

S Cheong, S Oh and S Lee. Support vector machines with binary tree architecture for multi-class classification. Neural Inform. Process. Lett. Rev. 2004; 2, 47-51.

L Breiman. Random forests. Mach. Learn. 2001; 45, 5-32.

Downloads

Published

2022-07-03

How to Cite

Ghosh, S. K. ., Janan, F. ., & Ahmad, I. . (2022). Application of the Classification Algorithms on the Prediction of Student’s Academic Performance. Trends in Sciences, 19(14), 5070. https://doi.org/10.48048/tis.2022.5070