llm-bg/Tucan-BG-Eval-v1.0
Hugging Face | Updated 2025-07-01 | Indexed 2025-11-01
Download link:
https://hf-mirror.com/datasets/llm-bg/Tucan-BG-Eval-v1.0
Description:
Tucan-BG-Eval-v1.0 is an official evaluation dataset for Bulgarian language models. It contains 120 samples designed to assess a model's function-calling and tool-use capabilities. Each sample includes a user query in Bulgarian, the available functions with their parameters, the expected behavior, the evaluation criteria, and the scenario type.
Provider:
llm-bg



