TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_2dig-eval_sft
收藏Hugging Face2025-11-09 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/TAUR-dev/D-EVAL__standard_eval_v3__FinEval_16k_fulleval_3args_BoLT-SFT-longmult_2dig-eval_sft
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了问题、答案以及与任务相关的配置信息,适用于模型训练和评估。数据集中的字段涵盖了模型响应及其评估的多个维度,包括正确性、响应长度、答案提取和评估的元数据等。测试集包含了1000个示例,整个数据集的大小为7083593字节。
The dataset includes questions, answers, and task-related configuration information, suitable for model training and evaluation. The fields in the dataset cover multiple dimensions of model responses and their evaluations, including correctness, response length, answer extraction, and evaluation metadata. The test set contains 1000 examples, and the entire dataset is 7083593 bytes in size.
提供机构:
TAUR-dev



