Mindgames
Source: arXiv; indexed 2025-09-30
Download link:
https://huggingface.co/datasets/sileod/mindgames
Summary:
This dataset contains 400 questions designed to evaluate Theory of Mind (ToM) in language models. The true/false labels are balanced, which makes the dataset well suited to zero-shot and few-shot experiments that compare the capabilities of different language models.
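As a rough illustration of the zero-shot and few-shot setup described above, the sketch below formats a true/false prompt and scores a predictor against gold labels. The `(question, label)` item layout, the prompt wording, and the names `build_prompt` and `accuracy` are assumptions made for this example; they do not reflect the dataset's actual schema or any official evaluation script.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical item layout: (question_text, gold_label).
# The dataset's real column names are not stated here; adapt as needed.
Item = Tuple[str, bool]


def build_prompt(question: str, shots: Sequence[Item] = ()) -> str:
    """Format a zero-shot prompt (no shots) or a few-shot prompt."""
    lines = []
    for text, label in shots:
        answer = "true" if label else "false"
        lines.append(f"Question: {text}\nAnswer: {answer}\n")
    lines.append(f"Question: {question}\nAnswer:")
    return "\n".join(lines)


def accuracy(
    items: Sequence[Item],
    predict: Callable[[str], bool],
    shots: Sequence[Item] = (),
) -> float:
    """Fraction of items where predict(prompt) matches the gold label."""
    if not items:
        raise ValueError("no items to evaluate")
    correct = sum(predict(build_prompt(q, shots)) == gold for q, gold in items)
    return correct / len(items)
```

Because the labels are balanced, a predictor that always answers "true" scores 50% on the full set, so that is the baseline a model needs to beat.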
Provider:
sileod



