AI-Driven Formative Assessment in EFL Writing: A Comparative Study of ChatGPT-4 and Human Raters
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14854638
下载链接
链接失效反馈官方服务:
资源简介:
This study utilized Item Response Theory (IRT), qualitative feedback analysis, and the Assessment for Learning (AfL) framework to evaluate ChatGPT 4’s potential as a tool for improving English-as-a-foreign-language (EFL) writing assessment in South Korean higher education. The research aimed to assess the reliability of holistic essay scores assigned by ChatGPT 4 compared to those given by experienced university English instructors and to examine the utility of qualitative feedback provided by AI and human raters. A total of 76 essays written by non-English major students for the International English Language Testing System (IELTS) Academic Writing Test for Task 1 and Task 2 were analyzed. ChatGPT 4 and three university English instructors, rated the essays using the IELTS scoring rubric and provided comments on language use, content quality, and organizational structure. Results indicate that ChatGPT 4 and human raters were similarly accurate in scoring quantitative scores. Qualitative feedback analysis found that ChatGPT consistently delivered more balanced and comprehensive feedback, with a strong emphasis on content and organizational structure. By contrast, teacher feedback was often more focused on linguistic accuracy, sometimes overlooking broader aspects of writing.
创建时间:
2025-02-18



