multi-domain-reasoning/gsm8k_eval
收藏Hugging Face2024-12-12 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/multi-domain-reasoning/gsm8k_eval
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如问题、答案、不同模型(如LLAMA和PHI)的基线输出和推理输出,以及基线输出与混合推理输出的评估结果。数据集分为一个测试集,包含1319个示例,总大小为25226001字节。
The dataset includes multiple features such as questions, answers, baseline outputs and reasoning outputs from different models (e.g., LLAMA and PHI), and evaluations of baseline outputs versus mixed reasoning outputs. The dataset is divided into a test set containing 1319 examples, with a total size of 25226001 bytes.
提供机构:
multi-domain-reasoning



