6cf/liveideabench-v2
Source: Hugging Face. Updated: 2025-04-10. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/6cf/liveideabench-v2
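
As a minimal sketch of how the link above might be used, the snippet below builds the dataset page URL from the repository id and, optionally, loads the data through the mirror. It assumes the Hugging Face `datasets` library is installed and that the mirror honours the `HF_ENDPOINT` environment variable. The helper names `dataset_url` and `load_from_mirror` are hypothetical, and the dataset may require a configuration or split name that this listing does not state.

```python
import os

REPO_ID = "6cf/liveideabench-v2"   # repository id from this listing
MIRROR = "https://hf-mirror.com"   # mirror host from the download link


def dataset_url(repo_id: str, endpoint: str = MIRROR) -> str:
    """Build the dataset page URL for a repository id (hypothetical helper)."""
    return f"{endpoint.rstrip('/')}/datasets/{repo_id}"


def load_from_mirror(repo_id: str = REPO_ID, endpoint: str = MIRROR):
    """Load the dataset via the mirror (hypothetical helper, needs network access).

    Assumption: HF_ENDPOINT is set before any Hugging Face library is imported,
    which is why the import happens inside the function.
    """
    os.environ["HF_ENDPOINT"] = endpoint
    from datasets import load_dataset  # third-party: pip install datasets
    return load_dataset(repo_id)


if __name__ == "__main__":
    print(dataset_url(REPO_ID))
```

Calling `load_from_mirror()` would return whatever splits the repository exposes; inspect the returned object before relying on any particular column names.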
Description:
LiveIdeaBench is a comprehensive benchmark for evaluating the scientific creativity and idea-generation capabilities of large language models (LLMs) under minimal context. It uses single-keyword prompts and assesses the generated ideas along four dimensions: originality, feasibility, fluency, and flexibility.
Provider:
6cf



