APIBench
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/johnnypeng18/apibench
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为APIBench,是一个包含HuggingFace、TorchHub和TensorHub API的全面数据集,用于将Gorilla模型与其他最先进的语言模型进行比较基准测试。该数据集通过将自我指导数据集中的指令和API对划分为训练集和测试集进行创建,并包含一个留出的测试集,用于报告模型的性能。该数据集的任务是评估大型语言模型生成准确API调用能力。
The dataset named APIBench is a comprehensive dataset covering APIs from HuggingFace, TorchHub and TensorHub, which is used for comparative benchmarking between the Gorilla model and other state-of-the-art language models. It is created by splitting the instruction-API pairs from self-instruct datasets into training and test sets, and includes a held-out test set for reporting model performance. The task of this dataset is to evaluate the ability of large language models to generate accurate API calls.
提供机构:
Berkeley University



