TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-countdown_5arg-eval_sft
收藏Hugging Face2025-11-09 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-countdown_5arg-eval_sft
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含问题、答案以及与任务相关的配置和来源等信息。数据集中的每个样本都包含了提示内容以及模型的响应。此外,数据集还提供了对模型响应的正确性评估和相关元数据。数据集分为测试集,共有1000个示例,数据集总大小为20,543,480字节。
The dataset includes questions, answers, and related configurations and sources for tasks. Each sample in the dataset contains prompt content and the models responses. Additionally, the dataset provides correctness evaluations of the model responses and related metadata. The dataset is split into a test set with a total of 1000 examples, and the overall dataset size is 20,543,480 bytes.
提供机构:
TAUR-dev



