keyuuw/gdpeval
收藏Hugging Face2025-12-13 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/keyuuw/gdpeval
下载链接
链接失效反馈官方服务:
资源简介:
GDPval数据集旨在评估AI模型在现实世界中有经济价值的任务上的性能。它包含220个现实世界的知识任务,涵盖44个职业。每个任务由一个文本提示和一组支持性参考文件组成。数据集可能包含敏感内容,如NSFW内容、政治内容等,但这些内容是为了反映不同职业中实际处理的真实主题。数据集还包含对第三方品牌和商标的引用,仅用于研究和评估目的。
The GDPval dataset is designed to evaluate AI model performance on real-world economically valuable tasks. It includes 220 real-world knowledge tasks across 44 occupations. Each task consists of a text prompt and a set of supporting reference files. The dataset may contain sensitive content such as NSFW themes, political content, etc., but these are kept to reflect real themes addressed in various occupations. The dataset also includes limited references to third-party brands and trademarks solely for research and evaluation purposes.
提供机构:
keyuuw



