five

openai/healthbench-professional

收藏
Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/openai/healthbench-professional
下载链接
链接失效反馈
官方服务:
资源简介:
包含用于HealthBench Professional评估的数据。每个示例包含:对话列表(用户/助手消息,以用户消息结尾)、评分项列表(每个包含criterion_text和points)、用例类型(consult、writing或research)、类型(good_faith或red_teaming)、难度等级(由医生评定的difficult或typical)、专业领域(医学专业或子专业)以及医生写的回复。数据集要求不公开示例内容以防止数据污染或作弊。

Contains the data for the HealthBench Professional eval. Each example contains: conversation (list of user / assistant messages, ending in a user message), rubric_items (list of rubric items, each containing criterion_text and points), use_case (one of consult, writing, or research), type (one of good_faith or red_teaming), difficulty (physician-assigned difficulty rating), specialty (medical specialty or sub-specialty), and physician_response (response written by a physician). The dataset requests not to reveal examples to prevent contamination or cheating.
提供机构:
openai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作