TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_InstOnly-RL-gsm8k-eval_rl
Source: Hugging Face | Updated: 2025-11-09 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_InstOnly-RL-gsm8k-eval_rl
Description:
The dataset contains questions and answers together with the associated task configurations and prompts, plus model-generated responses and their evaluation results. It is intended for testing and includes statistics for the test split. The dataset configuration is named latest.
Provider:
TAUR-dev
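
The description names a configuration (latest) and a test split, so loading the data could look roughly like the sketch below. It assumes the Hugging Face `datasets` library is installed and that the split is literally named `test`; the helper names `load_args` and `load_eval_split` are hypothetical, and the column names are not listed on this page, so the sketch only prints whatever columns it finds.

```python
# Minimal sketch, not an official loader for this dataset.
REPO_ID = (
    "TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_"
    "InstOnly-RL-gsm8k-eval_rl"
)
CONFIG_NAME = "latest"  # configuration named in the description
SPLIT = "test"          # assumption: the test split is named "test"


def load_args():
    """Return the keyword arguments passed to datasets.load_dataset."""
    return {"path": REPO_ID, "name": CONFIG_NAME, "split": SPLIT}


def load_eval_split():
    """Download and return the evaluation split (needs network access)."""
    # Imported lazily so the constants above can be used without the library.
    from datasets import load_dataset

    return load_dataset(**load_args())


if __name__ == "__main__":
    ds = load_eval_split()
    print(ds)               # row count and feature summary
    print(ds.column_names)  # inspect the actual columns before relying on any
```

If the repository is unreachable from the default endpoint, the `datasets` library can be pointed at a mirror such as the one in the download link through the `HF_ENDPOINT` environment variable.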



