TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_NoReflects-RL-longmult_5dig__v1
Source: Hugging Face. Updated: 2025-11-10. Indexed: 2025-11-15.
Download link:
https://hf-mirror.com/datasets/TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_NoReflects-RL-longmult_5dig__v1
Resource description:
This is an experiment-tracking dataset for an evaluation task. It contains the questions, answers, model responses, logs, and metadata from the evaluation experiment. The dataset has three main configurations: evals_eval_rl stores evaluation results, logs__evaluation_eval_rl stores log information, and metadata stores the experiment's metadata.
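The three configurations named above can be loaded one at a time. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable on the Hub. The helper `load_config` and the `CONFIGS` mapping are hypothetical names introduced here for illustration. The listing does not state split names or column names, so the sketch does not assume any.

```python
# Repository ID taken from the listing title.
REPO_ID = (
    "TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_3args_"
    "NoReflects-RL-longmult_5dig__v1"
)

# Configuration names and their stated purpose, as given in the description.
CONFIGS = {
    "evals_eval_rl": "evaluation results",
    "logs__evaluation_eval_rl": "log information",
    "metadata": "experiment metadata",
}


def load_config(config_name):
    """Load one configuration of the dataset (hypothetical helper).

    Returns whatever `datasets.load_dataset` returns when no split is
    requested; the available splits are not documented in the listing.
    """
    if config_name not in CONFIGS:
        raise ValueError(
            f"Unknown configuration {config_name!r}; expected one of {sorted(CONFIGS)}"
        )
    # Imported lazily so the constants above can be used without the library.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name=config_name)
```

Calling `load_config("metadata")` would download that configuration and return it; printing the result is a quick way to inspect which splits and columns are actually present before relying on any of them.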
Provider:
TAUR-dev



