albertvillanova/test-dataset-card

Hugging Face, updated 2024-01-25, indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/albertvillanova/test-dataset-card
Resource description:
---
task_categories:
- text-classification
task_ids:
- multi-label-classification
- toxic-comment-classification
---

<h1 align="center"> DATASET-NAME: Code Reasoning, Understanding, and Execution Evaluation </h1>

<p align="center">
<a href="https://crux-eval.github.io/">🏠 Home Page</a> •
<a href="https://github.com/facebookresearch/cruxeval">💻 GitHub Repository </a> •
<a href="https://crux-eval.github.io/leaderboard.html">🏆 Leaderboard</a> •
<a href="https://crux-eval.github.io/demo.html">🔎 Sample Explorer</a>
</p>

![image](https://github.com/facebookresearch/cruxeval/assets/7492257/4951c067-e6d0-489a-a445-37ff1c4ad1e4)

DATASET-NAME (**C**ode **R**easoning, **U**nderstanding, and e**X**ecution **Eval**uation) is a benchmark of 800 Python functions and input-output pairs. The benchmark consists of two tasks, CRUXEval-I (input prediction) and CRUXEval-O (output prediction).

The benchmark was constructed as follows

## Dataset Description

- **Homepage:** [More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
- **Repository:** https://github.com/
- **Paper:** https://arxiv.org/
- **Point of Contact:** [NAME](mailto:EMAIL)
Provider:
albertvillanova
Summary of original information

DATASET-NAME: Code Reasoning, Understanding, and Execution Evaluation

Dataset overview

DATASET-NAME (Code Reasoning, Understanding, and eXecution Evaluation) is a benchmark of 800 Python functions and their input-output pairs. The benchmark consists of two tasks: CRUXEval-I (input prediction) and CRUXEval-O (output prediction).
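
The two tasks can be illustrated with a minimal sketch. The function `f`, the sample pair, and both checker helpers below are hypothetical and not taken from the dataset; the sketch also assumes that an input prediction counts as correct whenever the function maps it to the given output, which the card itself does not state.

```python
# Hypothetical sample in the style of the benchmark:
# one Python function plus one input-output pair (not from the dataset).
def f(text):
    return text.upper()[::-1]


sample_input = "abc"
sample_output = "CBA"


def check_output_prediction(func, given_input, predicted_output):
    """Output prediction: given func and an input, predict func(input)."""
    return func(given_input) == predicted_output


def check_input_prediction(func, predicted_input, given_output):
    """Input prediction (assumed scoring): any input that func maps
    to the given output is accepted."""
    return func(predicted_input) == given_output


print(check_output_prediction(f, sample_input, "CBA"))  # True
print(check_input_prediction(f, "Abc", sample_output))  # True
print(check_output_prediction(f, sample_input, "cba"))  # False
```

Under this assumption, input prediction is checked by running the function rather than by comparing against a single reference input, since several inputs can produce the same output.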

Dataset description

  • Task categories: text classification
  • Task IDs: multi-label classification, toxic comment classification