TutorBench
收藏魔搭社区2025-12-05 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/ScaleAI/TutorBench
下载链接
链接失效反馈官方服务:
资源简介:
TutorBench is a challenging benchmark to assess tutoring capabilities of LLMs. TutorBench consists of examples drawn from three common tutoring tasks: (i) generating adaptive explanations tailored to a student’s confusion, (ii) providing actionable feedback on a student’s work, and (iii) promoting active learning through effective hint generation.
Paper: [TutorBench: A Benchmark To Assess Tutoring Capabilities Of Large Language Models](https://www.arxiv.org/abs/2510.02663)
TutorBench是一款用于评估大语言模型(Large Language Model)辅导能力的极具挑战性的基准测试集。该基准测试集包含三类常见辅导任务的测试样本:(i) 针对学生的困惑点生成定制化的适应性解释;(ii) 针对学生作业提供可操作的反馈;(iii) 通过高效的提示生成促进主动学习。
相关论文:《评估大语言模型辅导能力的基准测试集》(TutorBench: A Benchmark To Assess Tutoring Capabilities Of Large Language Models),链接:https://www.arxiv.org/abs/2510.02663
提供机构:
maas
创建时间:
2025-10-09



