nbalepur/planorama_irt
收藏Hugging Face2025-04-07 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/nbalepur/planorama_irt
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如问题ID、类别、问题、答案、生成模型、计划A、计划B、模型比较、模型准确性、模型响应时间等。数据集分为数学和琐事两个部分,每个部分包含150个示例。此外,还包含了人类比较结果和人类评估模型的有用性。
The dataset includes multiple fields such as question ID, category, question, answer, generation model, plan A, plan B, model comparison, model accuracies, model response times, etc. The dataset is split into two parts, math and trivia, each containing 150 examples. It also includes human comparison results and human evaluations of model helpfulness.
提供机构:
nbalepur



