TY - GEN
T1 - A Regression Analysis Approach for Feature Selection in Student Achievement Data
AU - Elmassry, Ahmed M.
AU - Alshamsi, Abdulla
AU - Abdulhameed, Ahmed F.
AU - Sharaf, Mohamed A.
AU - Zaki, Nazar
AU - Khosravi, Hassan
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - TIMSS is a global standardized assessment of mathematics and science knowledge, skills, and achievement of students in grades four and eight. The data generated from the 64 countries and 8 benchmarking entities participated in TIMSS 2019 is enormous and worth exploring and analyzing. This work aims to identify the significant factors that affect students' scores in mathematics in Grade 8 in the United Arab Emirates (UAE), using the data exported from the TIMSS 2019 dataset. Towards this, we leverage Linear Regression modeling, together with backward elimination techniques with the objective of: 1) maximizing the accuracy of predicting student performance, and 2) minimizing the number of identified significant features that contribute to student performance. Out of the hundreds of features (i.e., variables) collected in the TIMSS dataset, our experimental evaluation identified 17 features as the most significant in affecting student performance. Those identified features span multiple categories, including variables related to the categories of: student (9 features), home (3 features), teacher (2 features), classroom (1 feature), and use of technology (2 features).
AB - TIMSS is a global standardized assessment of mathematics and science knowledge, skills, and achievement of students in grades four and eight. The data generated from the 64 countries and 8 benchmarking entities participated in TIMSS 2019 is enormous and worth exploring and analyzing. This work aims to identify the significant factors that affect students' scores in mathematics in Grade 8 in the United Arab Emirates (UAE), using the data exported from the TIMSS 2019 dataset. Towards this, we leverage Linear Regression modeling, together with backward elimination techniques with the objective of: 1) maximizing the accuracy of predicting student performance, and 2) minimizing the number of identified significant features that contribute to student performance. Out of the hundreds of features (i.e., variables) collected in the TIMSS dataset, our experimental evaluation identified 17 features as the most significant in affecting student performance. Those identified features span multiple categories, including variables related to the categories of: student (9 features), home (3 features), teacher (2 features), classroom (1 feature), and use of technology (2 features).
KW - Feature Selection
KW - Regression
KW - TIMSS
UR - http://www.scopus.com/inward/record.url?scp=85182931803&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85182931803&partnerID=8YFLogxK
U2 - 10.1109/IIT59782.2023.10366412
DO - 10.1109/IIT59782.2023.10366412
M3 - Conference contribution
AN - SCOPUS:85182931803
T3 - 2023 15th International Conference on Innovations in Information Technology, IIT 2023
SP - 32
EP - 37
BT - 2023 15th International Conference on Innovations in Information Technology, IIT 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Conference on Innovations in Information Technology, IIT 2023
Y2 - 14 November 2023 through 15 November 2023
ER -