TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_AT_STaR-RL-gsm8k-eval_rl
收藏Hugging Face2025-11-13 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_AT_STaR-RL-gsm8k-eval_rl
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了问题、答案、任务配置、任务来源、提示文本、模型响应等字段,并提供了原始分割、难度、领域、评估类型、期望答案格式等元数据。此外,还包括了评估日期和分割信息,以及配置信息。数据集针对测试集进行了分割,包含了测试集的字节数和示例数量。
The dataset includes fields such as questions, answers, task configurations, task sources, prompt texts, and model responses, and provides metadata such as original splits, difficulty levels, domains, evaluation types, and expected answer formats. In addition, it includes evaluation dates, split information, and configuration details. The dataset is split for the test set, containing the number of bytes and examples in the test set.
提供机构:
TAUR-dev



