RoleEval
收藏科学数据银行2025-09-26 更新2026-04-23 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=c16b1553db0341d8ba0de71fe3e55a8c
下载链接
链接失效反馈官方服务:
资源简介:
RoleEval is a bilingual benchmark designed to assess the memorization, utilization, and reasoning capabilities of role knowledge for large language models. RoleEval comprises RoleEval-Global (including internationally recognized characters) and RoleEval-Chinese (including characters popular in China), with 6,000 Chinese-English parallel multiple-choice questions focusing on 300 influential people and fictional characters drawn from a variety of domains including celebrities, anime, comics, movies, TV series, games, and fiction. These questions cover basic knowledge and multi-hop reasoning abilities, aiming to systematically probe various aspects such as personal information, relationships, abilities, and experiences of the characters. To maintain high standards, we perform a hybrid quality check process combining automatic and human verification, ensuring that the questions are diverse, challenging, and discriminative.
提供机构:
Tianhao Shen
创建时间:
2025-09-26



