Leaner-Eval
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/EmpathYang/TinyHelen.git
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是为了测试语言模型遵循指令的能力而设计的,它经过精心提炼,以提高学习效率。此外,该数据集是一套评估语言模型在简化环境下表现的一系列数据集的一部分。该数据集包含了1,000个实例,其任务是测试语言模型的指令遵循能力。
This dataset is designed to test the instruction-following capabilities of language models, and it has been meticulously refined to enhance learning efficiency. Furthermore, it is part of a suite of datasets that evaluate the performance of language models in simplified environments. It contains 1,000 instances, all aimed at testing language models' instruction-following capabilities.
提供机构:
EmpathYang



