five

Original and Imputed data for Decoding contextual factors differentiating adolescents’ high, average and low digital reading performance through machine learning methods

收藏
DataCite Commons2022-10-07 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/original-and-imputed-data-decoding-contextual-factors-differentiating-adolescents’-high
下载链接
链接失效反馈
官方服务:
资源简介:
This paper presents the data that is used in the article entitled “Decoding contextual factors differentiating adolescents’ high, average and low digital reading performance through machine learning methods”, which investigated the key contextual factors that synergistically differentiate high and low performers, high and average performers, and low and average performers in digital reading, through the utilization of machine learning methods, namely, support vector machine (SVM) and SVM recursive feature elimination (SVM-RFE). Additionally, the Shapley additive explanations (SHAP) method was used to interpret the model and detect the features having a positive or negative impact on the final prediction. The latest-released Programme for International Student Assessment (PISA) 2018 data were analyzed, and the samples included 276,269 15-year-old students from 38 Organization for Economic Co-operation and Development (OECD) countries. The classification is based on the OECD’s definition of high, low and average performers, with reading scores at Levels 5 and 6 (i.e., at or above 625.61 score points), at Levels 1a and 1b (less than 407.47 score points) and in between, respectively. PV1READ (the first plausible value 1 in Reading) was randomly selected to represent each student’s reading score (Hu et al., 2022; Gorostiaga & Rojo-Álvarez, 2016). The 150 independent factors/variables were collected from the student questionnaire, school questionnaire and ICT familiarity questionnaire. Additionally, the country-level factor (i.e., GDP per capita) collected from the World Bank dataset (URL: https://data.worldbank.org/) was also taken into account.
提供机构:
IEEE DataPort
创建时间:
2022-10-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作