AI-Driven Formative Assessment in EFL Writing: A Comparative Study of ChatGPT-4 and Human Raters

NIAID Data Ecosystem2026-05-02 收录

下载链接：

https://zenodo.org/record/14854638

下载链接

链接失效反馈

官方服务：

资源简介：

This study utilized Item Response Theory (IRT), qualitative feedback analysis, and the Assessment for Learning (AfL) framework to evaluate ChatGPT 4’s potential as a tool for improving English-as-a-foreign-language (EFL) writing assessment in South Korean higher education. The research aimed to assess the reliability of holistic essay scores assigned by ChatGPT 4 compared to those given by experienced university English instructors and to examine the utility of qualitative feedback provided by AI and human raters. A total of 76 essays written by non-English major students for the International English Language Testing System (IELTS) Academic Writing Test for Task 1 and Task 2 were analyzed. ChatGPT 4 and three university English instructors, rated the essays using the IELTS scoring rubric and provided comments on language use, content quality, and organizational structure. Results indicate that ChatGPT 4 and human raters were similarly accurate in scoring quantitative scores. Qualitative feedback analysis found that ChatGPT consistently delivered more balanced and comprehensive feedback, with a strong emphasis on content and organizational structure. By contrast, teacher feedback was often more focused on linguistic accuracy, sometimes overlooking broader aspects of writing.

创建时间：

2025-02-18

5,000+

优质数据集

54 个

任务类型

进入经典数据集