TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_AT_STaR-RL-countdown_6arg__v1
收藏Hugging Face2025-11-13 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_AT_STaR-RL-countdown_6arg__v1
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于评估任务countdown_6arg的实验数据集,包含了问题、答案、任务配置、评估结果、日志和元数据等信息。数据集分为三个主要部分:evals_eval_rl、logs__evaluation_eval_rl和metadata。evals_eval_rl部分主要用于评估,包含问题和答案等;logs__evaluation_eval_rl部分包含实验的日志信息;metadata部分包含实验的基本信息。
This is an experimental dataset for evaluating the countdown_6arg task, containing information such as questions, answers, task configurations, evaluation results, logs, and metadata. The dataset is divided into three main sections: evals_eval_rl, logs__evaluation_eval_rl, and metadata. The evals_eval_rl section is mainly used for evaluation, including questions and answers; the logs__evaluation_eval_rl section contains log information of the experiment; the metadata section contains basic information about the experiment.
提供机构:
TAUR-dev



