Machine Learning Techniques for Assessing Students' Environments' Impact Factors on Their Academic Performance

Mohammed Hussein Jabardi

Abstract

Performance factor analysis has recently gained popularity as a method for assessing how students' environments affect their academic performance. However, most progress to date has focused on analysing student behaviour during the learning process. Machine learning provides many powerful methods that could improve student performance prediction. Our aim is to examine the features of students' environmental life using the machine learning paradigm and to assess how students' environment affects their grades. These features are divided into three categories (personality, family, and education), and the impact factor of each category is calculated. To improve predictive accuracy, different models (Random Forest, AdaBoost, Decision Tree, Naive Bayes, and Multi-Layer Perceptron) are used to score the features in each group according to their contribution to the prediction. Results show that personality features have only a minor effect on students' academic performance, with an average impact of 53%. For the educational factors, the average impact is 60%. For the family factors, results indicate that students' family life significantly affects academic achievement, with an average impact of 64%.
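A minimal sketch of the kind of group-wise scoring described above, using the five classifiers named in the abstract. This is not the paper's exact pipeline: the CSV file name, the column names, and the grouping of columns into the three categories are hypothetical placeholders, and cross-validated accuracy is used here as a simple proxy for a feature group's impact factor.

    # Score each feature group (personality, family, education) with five classifiers
    # and use mean cross-validated accuracy as a proxy for the group's impact.
    import pandas as pd
    from sklearn.model_selection import cross_val_score
    from sklearn.ensemble import RandomForestClassifier, AdaBoostClassifier
    from sklearn.tree import DecisionTreeClassifier
    from sklearn.naive_bayes import GaussianNB
    from sklearn.neural_network import MLPClassifier

    # Hypothetical grouping of environmental features into the three categories.
    FEATURE_GROUPS = {
        "personality": ["age", "sex", "weekly_study_hours", "reading_frequency"],
        "family": ["mother_education", "father_education", "parental_status", "family_income"],
        "education": ["attendance", "class_participation", "exam_preparation", "taking_notes"],
    }

    MODELS = {
        "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
        "AdaBoost": AdaBoostClassifier(n_estimators=100, random_state=0),
        "Decision Tree": DecisionTreeClassifier(random_state=0),
        "Naive Bayes": GaussianNB(),
        "MLP": MLPClassifier(hidden_layer_sizes=(32,), max_iter=1000, random_state=0),
    }

    def score_groups(csv_path: str, target: str = "grade") -> pd.DataFrame:
        """Return mean 5-fold accuracy for every (feature group, model) pair."""
        data = pd.read_csv(csv_path)           # hypothetical dataset file
        y = data[target]
        rows = []
        for group, columns in FEATURE_GROUPS.items():
            X = pd.get_dummies(data[columns])   # one-hot encode categorical features
            for name, model in MODELS.items():
                acc = cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()
                rows.append({"group": group, "model": name, "accuracy": acc})
        return pd.DataFrame(rows)

    if __name__ == "__main__":
        scores = score_groups("students_performance.csv")
        # Average over the five models to obtain one impact score per group.
        print(scores.groupby("group")["accuracy"].mean().round(2))

Averaging the per-model scores within each group mirrors the abstract's reporting of a single impact percentage per category, while keeping the individual model scores available for comparison.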
