albertvillanova/test-dataset-card

Name: albertvillanova/test-dataset-card
Creator: albertvillanova
Published: 2024-01-25 08:15:40
License: 暂无描述

Hugging Face2024-01-25 更新2024-03-04 收录

下载链接：

https://hf-mirror.com/datasets/albertvillanova/test-dataset-card

下载链接

链接失效反馈

官方服务：

资源简介：

--- task_categories: - text-classification task_ids: - multi-label-classification - toxic-comment-classification --- <h1 align="center"> DATASET-NAME: Code Reasoning, Understanding, and Execution Evaluation </h1> <p align="center"> <a href="https://crux-eval.github.io/">🏠 Home Page</a> • <a href="https://github.com/facebookresearch/cruxeval">💻 GitHub Repository </a> • <a href="https://crux-eval.github.io/leaderboard.html">🏆 Leaderboard</a> • <a href="https://crux-eval.github.io/demo.html">🔎 Sample Explorer</a> </p> ![image](https://github.com/facebookresearch/cruxeval/assets/7492257/4951c067-e6d0-489a-a445-37ff1c4ad1e4) DATASET-NAME (**C**ode **R**easoning, **U**nderstanding, and e**X**ecution **Eval**uation) is a benchmark of 800 Python functions and input-output pairs. The benchmark consists of two tasks, CRUXEval-I (input prediction) and CRUXEval-O (output prediction). The benchmark was constructed as follows ## Dataset Description - **Homepage:** [More Information Needed](https://github.com/huggingface/datasets/blob/master/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards) - **Repository:** https://github.com/ - **Paper:** https://arxiv.org/ - **Point of Contact:** [NAME](mailto:EMAIL)

提供机构：

albertvillanova

原始信息汇总

DATASET-NAME: Code Reasoning, Understanding, and Execution Evaluation

数据集概述

DATASET-NAME（Code Reasoning, Understanding, and eXecution Evaluation）是一个包含800个Python函数及其输入输出对的基准测试集。该基准测试集包含两个任务：CRUXEval-I（输入预测）和CRUXEval-O（输出预测）。

数据集描述

任务类别: 文本分类
任务ID: 多标签分类, 有毒评论分类

5,000+

优质数据集

54 个

任务类型

进入经典数据集