five

Survey data on student expectations for faculty, teaching assistant, and peer support in engineering education

收藏
DataCite Commons2026-03-20 更新2026-04-25 收录
下载链接:
https://datadryad.org/dataset/doi:10.5061/dryad.x3ffbg81n
下载链接
链接失效反馈
官方服务:
资源简介:
This study compares five short text topic modeling (STTM) techniques for analyzing qualitative student feedback on instructional support in engineering education. Student feedback was collected using short answer questions that resulted in 1,667, 1,592, and 1,376 expectations for faculty support, teaching assistant (TA) support, and peer support respectively as part of a larger survey conducted via convenience sampling in over 40 engineering courses offered at single large university between 2016 and 2023.   After cleaning and preprocessing the data, short text responses were analyzed using five unsupervised topic models implemented in Python: traditional models, Latent Dirichlet Allocation (LDA), Latent Semantic Analysis (LSA), Non-Negative Matrix Factorization (NMF), and k-means and one deep learning model (BERTopic). Model performance was evaluated using topic coherence and external performance metrics. Two approaches to establishing ground truth were evaluated: (a) keywords from each topic model guided manual (human) coding of the data (a machine-led approach); and (b) themes in the data were extracted and coded independently by a domain expert (a human-led approach).  NMF achieved the highest average performance in two of the three datasets, reaching 75.6% accuracy, 75.7% F1-Score, and 0.63 interrater reliability for the peer support dataset and 72.6% accuracy, 72.0% F1-Score, and 0.57 interrater reliability for the TA support dataset. The human-led approach yielded higher accuracy and F1-scores for faculty and peer support but failed for TA support when the topics extracted by topic models did not align with themes identified by a domain expert.  These findings highlight the need for humans to be involved in the analysis of short text data in contexts like education research where high performance is necessary to achieve appropriate rigor. Domain expert intervention also enables strategic use of topic models to optimize their use in qualitative data analysis.
提供机构:
Dryad
创建时间:
2026-03-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作