JAQKET
Source: arXiv (indexed 2025-09-30)
Download link:
https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable
Description:
This is a question answering dataset designed to evaluate the performance of Japanese large language models.
Provider:
JP Language Model Evaluation Harness



