five

Code-Vision

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/wanghanbinpanda/CodeVision
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集旨在评估多模态大型语言模型(MLLMs)的逻辑理解和代码生成能力,通过根据给定的流程图生成程序来进行基准测试。该数据集不仅覆盖了基本的编程领域,还挑战了MLLMs在算法和数学问题解决方面的能力。数据集分为三个子集:HumanEval-V、Algorithm和MATH,主要任务是基于流程图表示进行代码生成。

This dataset is designed to evaluate the logical comprehension and code generation capabilities of multimodal large language models (MLLMs), with benchmarking carried out by generating programs based on given flowcharts. It covers not only fundamental programming domains, but also tests MLLMs' abilities in solving algorithmic and mathematical problems. The dataset is split into three subsets: HumanEval-V, Algorithm, and MATH, with the primary task being code generation based on flowchart representations.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作