five

maritaca-ai/enem

收藏
Hugging Face2024-12-03 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/maritaca-ai/enem
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 configs: - config_name: '2024' data_files: 2024.jsonl default: true - config_name: '2023' data_files: 2023.jsonl - config_name: '2022' data_files: 2022.jsonl dataset_info: features: - name: id dtype: string - name: exam dtype: string - name: IU dtype: bool - name: ledor dtype: bool - name: question dtype: string - name: alternatives sequence: string - name: figures sequence: string - name: description sequence: string - name: label dtype: string task_categories: - visual-question-answering - multiple-choice language: - pt pretty_name: ENEM size_categories: - n<1K --- The ENEM 2022, 2023 and 2024 datasets encompass all multiple-choice questions from the last two editions of the [Exame Nacional do Ensino Médio (ENEM)](https://www.gov.br/inep/pt-br/areas-de-atuacao/avaliacao-e-exames-educacionais/enem), the main standardized entrance examination adopted by Brazilian universities. The datasets have been created to allow the evaluation of both textual-only and textual-visual language models. To evaluate textual-only models, we incorporated into the datasets the textual descriptions of the images that appear in the questions' statements from the orange ENEM exam booklet, a particular booklet that offers accessibility to people with visual impairments. A repository containing the essential code for utilizing this dataset is accessible [here](https://github.com/piresramon/gpt-4-enem). If you use this dataset in your research, please acknowledge the papers below by citing them: ```bibtex @misc{pires2023evaluating, title={Evaluating GPT-4's Vision Capabilities on Brazilian University Admission Exams}, author={Ramon Pires and Thales Sales Almeida and Hugo Abonizio and Rodrigo Nogueira}, year={2023}, eprint={2311.14169}, archivePrefix={arXiv}, primaryClass={cs.CL} } ``` ```bibtex @misc{nunes2023evaluating, title={Evaluating GPT-3.5 and GPT-4 Models on Brazilian University Admission Exams}, author={Desnes Nunes and Ricardo Primi and Ramon Pires and Roberto Lotufo and Rodrigo Nogueira}, year={2023}, eprint={2303.17003}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```
提供机构:
maritaca-ai
原始信息汇总

数据集概述

许可证

  • Apache 2.0

配置

  • 2022
    • 数据文件: 2022.jsonl
  • 2023
    • 数据文件: 2023.jsonl
    • 默认配置: 是

数据集信息

  • 特征
    • id: 字符串
    • exam: 字符串
    • IU: 布尔值
    • ledor: 布尔值
    • question: 字符串
    • alternatives: 字符串序列
    • figures: 字符串序列
    • description: 字符串序列
    • label: 字符串

任务类别

  • 视觉问答
  • 多选题

语言

  • 葡萄牙语

数据集名称

  • ENEM

数据集规模

  • n<1K
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作