APIBench

Name: APIBench
Creator: University of California, Berkeley
License: Not specified

Source: arXiv (indexed 2025-09-30)

Download link:

https://github.com/johnnypeng18/apibench

Description:

APIBench is a comprehensive dataset of HuggingFace, TorchHub, and TensorHub APIs, used to benchmark the Gorilla model against other state-of-the-art language models. It was built by splitting the instruction-API pairs of a self-instruct dataset into training and test sets, and it includes a held-out test set on which model performance is reported. The benchmark task is to evaluate how accurately large language models generate API calls.
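The split described above can be illustrated with a minimal sketch. It is not the authors' procedure: the record fields (`instruction`, `api_call`), the helper `split_pairs`, the 10% test fraction, and the fixed seed are all assumptions made for illustration, and the actual APIBench schema and split ratio may differ.

```python
import random


def split_pairs(pairs, test_fraction=0.1, seed=0):
    """Shuffle instruction-API pairs and split them into (train, test) lists."""
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be strictly between 0 and 1")
    shuffled = list(pairs)                      # copy so the input is left untouched
    random.Random(seed).shuffle(shuffled)       # seeded shuffle for a reproducible split
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical records; the field names are assumptions, not the APIBench schema.
pairs = [
    {"instruction": f"Example instruction {i}", "api_call": f"example_hub.load('model_{i}')"}
    for i in range(20)
]

train, test = split_pairs(pairs, test_fraction=0.1)
print(len(train), len(test))
```

Keeping the seed fixed makes the held-out test set the same on every run, so scores reported on it stay comparable across models.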

Provided by:

University of California, Berkeley
