TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_NoReflects-RL-longmult_3dig-eval_rl
收藏Hugging Face2025-11-10 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_NoReflects-RL-longmult_3dig-eval_rl
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含问题和答案对的数据集,用于某种任务配置和评估。数据集包括提示信息、模型响应及其评估结果,以及相关的元数据。测试集包含1000个示例。
This is a dataset containing pairs of questions and answers for a certain task configuration and evaluation. The dataset includes prompt information, model responses and their evaluation results, as well as related metadata. The test set contains 1000 examples.
提供机构:
TAUR-dev



