Essay-BR
收藏arXiv2021-05-19 更新2024-06-21 收录
下载链接:
https://github.com/rafaelanchieta/essay
下载链接
链接失效反馈官方服务:
资源简介:
Essay-BR是一个由巴西皮奥伊联邦大学创建的大型语料库,包含4570篇由巴西高中生撰写的议论文,这些文章通过在线平台收集,并由专家根据ENEM考试的五个能力维度进行评分。数据集涵盖了从2015年到2020年的319个主题,包括人权、政治问题等。创建过程中,使用了网络爬虫从两个公共网站提取文章,并进行了预处理和评分标准化。该数据集旨在支持葡萄牙语自动作文评分研究,帮助开发更有效的评分模型,以解决教育评估中的自动化问题。
Essay-BR is a large-scale corpus developed by the Federal University of Piauí, Brazil. It contains 4,570 argumentative essays written by Brazilian high school students, which were collected via online platforms and scored by experts based on the five competency dimensions of the ENEM (Exame Nacional do Ensino Médio) examination. The corpus covers 319 essay topics spanning from 2015 to 2020, including human rights, political issues, and other related topics. During the corpus construction process, essays were extracted from two public websites using web crawlers, followed by preprocessing and score standardization. This dataset aims to support research on automated essay scoring in Portuguese, helping develop more effective scoring models to address automation-related challenges in educational assessment.
提供机构:
巴西皮奥伊联邦大学
创建时间:
2021-05-19



