five

RoleEval

收藏
DataCite Commons2025-10-09 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=c16b1553db0341d8ba0de71fe3e55a8c
下载链接
链接失效反馈
官方服务:
资源简介:
RoleEval is a bilingual benchmark designed to assess the memorization, utilization, and reasoning capabilities of role knowledge for large language models. RoleEval comprises RoleEval-Global (including internationally recognized characters) and RoleEval-Chinese (including characters popular in China), with 6,000 Chinese-English parallel multiple-choice questions focusing on 300 influential people and fictional characters drawn from a variety of domains including celebrities, anime, comics, movies, TV series, games, and fiction. These questions cover basic knowledge and multi-hop reasoning abilities, aiming to systematically probe various aspects such as personal information, relationships, abilities, and experiences of the characters. To maintain high standards, we perform a hybrid quality check process combining automatic and human verification, ensuring that the questions are diverse, challenging, and discriminative.
提供机构:
Science Data Bank
创建时间:
2025-10-09
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作