five

TencentARC/Plot2Code

收藏
Hugging Face2024-08-17 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/TencentARC/Plot2Code
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - text-generation - text2text-generation - text-to-image - image-to-text - image-to-image language: - en tags: - code dataset_info: - config_name: python_plotly splits: - name: test - config_name: python_matplotlib splits: - name: test - config_name: r_plotly splits: - name: test --- # Plot2Code Benchmark Plot2Code benchmark is now open-sourced at [huggingface (ARC Lab)](https://huggingface.co/TencentARC) and [GitHub](https://github.com/TencentARC/Plot2Code). More information can be found in our [paper](https://arxiv.org/abs/2405.07990). ## Why we need [Plot2Code](https://huggingface.co/datasets/TencentARC/Plot2Code)? * 🧐 While MLLMs have demonstrated potential in visual contexts, their capabilities in visual coding tasks have not been thoroughly evaluated. Plot2Code offers a platform for comprehensive assessment of these models. * 🤗 To enable individuals to ascertain the proficiency of AI assistants in generating code that renders into plots given reference plots, we initiated the Plot2Code project. This ensures evaluations are pertinent to real-world applications. * 💻 Plot2Code accommodates all modalities (text and images) for both input and output, facilitating an exploration of the influence of each modality. # Dataset card for Plot2Code ## How to Download You can use following codes to download the dataset: ```shell git lfs install git clone https://huggingface.co/datasets/TencentARC/Plot2Code ``` ## Dataset Summary Plot2Code was created to serve as a visaul coding benchmark for Multi-Modal Large Language Models (MLLMs). We carefully collect 132 manually selected high-quality matplotlib plots across six plot types from publicly available matplotlib galleries. Each plot is paired with the code used to render it and an instruction derived by GPT-4 to describe the plot. ## Supported Tasks Plot2Code is primarily designed as a benchmark for code generation from scientific plots. Specifically, it supports the following settings: * Text2Image: We provide instructions to the assistant, requesting it to generate pyplot code and subsequently render the plots. * Image2Image: Referred to as the Direct Asking setting in our paper, we input the reference plot directly and ask the assistant to generate pyplot code to render similar plots. * I+T 2 Image: Combining both instructions and reference plots as input, this is called the Conditional Asking setting in our paper. By employing these settings, we can investigate the impact of each input modality on the quality of the final rendered plots. # News * [2024/08] 🔥We futther update the Python and R's plotly plot-code pairs with instruction for evaluation!🔥 * [2024/05] We open source the [Plot2Code benchmark](https://huggingface.co/datasets/TencentARC/Plot2Code). Stay tuned for this project! 😆 # License In this study, we crawled every website link listed in the Matplotlib gallery and Plotly documentation to collect data for our analysis. Both Matplotlib and Plotly libraries are distributed under permissive open-source licenses. We have taken the following steps to ensure compliance with the respective license terms: 1. Acknowledgment of Licenses: We acknowledge that the Matplotlib library and its gallery are distributed under the BSD 3-Clause License, and the Plotly library and its documentation are distributed under the MIT License. 2. Retention of Copyright Notices: We have retained all copyright notices and license information from the original Matplotlib gallery content and Plotly documentation, as required by their respective licenses. 3. Usage and Distribution: Our use of the Matplotlib gallery and Plotly documentation content is solely for academic and research purposes. We have not modified the original content from the Matplotlib gallery or Plotly documentation, and any distribution of our work will include proper attribution to the Matplotlib and Plotly projects. By adhering to these guidelines, we ensure that our use of the Matplotlib and Plotly content is fully compliant with their respective licenses. This project is open-sourced under the [Apache-2.0](https://www.apache.org/licenses/LICENSE-2.0). These evaluation code and datasets are fully open for academic research and can be used for commercial purposes with official written permission. # Paper arxiv.org/abs/2405.07990 # Citation The code and model in this repository is mostly developed for or derived from the paper below. Please cite it if you find the repository helpful. ``` @misc{wu2024plot2code, title={Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots}, author={Chengyue Wu and Yixiao Ge and Qiushan Guo and Jiahao Wang and Zhixuan Liang and Zeyu Lu and Ying Shan and Ping Luo}, year={2024}, eprint={2405.07990}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```
提供机构:
TencentARC
原始信息汇总

数据集概述

数据集名称

  • Plot2Code Benchmark

数据集目的

  • 作为多模态大型语言模型(MLLMs)的视觉编码基准。
  • 评估AI助手在生成代码以渲染图表方面的能力。

数据集内容

  • 包含132个手工挑选的高质量matplotlib图表,涵盖六种图表类型。
  • 每个图表配有生成该图表的代码和由GPT-4生成的描述性指令。

支持的任务

  • Text2Image: 提供指令,请求生成pyplot代码并渲染图表。
  • Image2Image: 直接输入参考图表,请求生成相似图表的pyplot代码。
  • I+T 2 Image: 结合指令和参考图表作为输入,生成图表。

数据集下载

  • 使用以下命令下载数据集: shell git lfs install git clone https://huggingface.co/datasets/TencentARC/Plot2Code

许可证

  • 本项目遵循Apache-2.0许可证。

引用信息

  • 若使用此数据集,请引用以下论文:

    @misc{wu2024plot2code, title={Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots}, author={Chengyue Wu and Yixiao Ge and Qiushan Guo and Jiahao Wang and Zhixuan Liang and Zeyu Lu and Ying Shan and Ping Luo}, year={2024}, eprint={2405.07990}, archivePrefix={arXiv}, primaryClass={cs.CL} }

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作