TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-countdown_4arg-eval_sft
Source: Hugging Face | Updated: 2025-11-09 | Indexed: 2025-11-15
Download link:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_STaR-SFT-countdown_4arg-eval_sft
Description:
The dataset contains questions and answers together with the associated task configurations, prompts, and evaluations of model responses. It appears to be designed for testing and evaluating the quality of model responses on specific tasks. It also includes metadata and original split information, which may support further data analysis and research.
Provider:
TAUR-dev



