NCSOFT/K-SEED
Source: Hugging Face. Updated: 2025-07-25. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/NCSOFT/K-SEED
Description:
K-SEED is a Korean adaptation of SEED-Bench, designed for evaluating vision-language models. The first 20 percent of the SEED-Bench test subset was translated into Korean, and the translations were reviewed by humans for naturalness. K-SEED contains questions across 12 evaluation dimensions, such as scene understanding, instance identity, and instance attribute, so model performance in Korean can be assessed along each dimension.
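Because the benchmark groups its questions into evaluation dimensions, results are naturally reported per dimension. The following is a minimal sketch of such a breakdown, assuming multiple-choice questions as in SEED-Bench. The function name `accuracy_by_dimension` and the record keys `dimension`, `answer`, and `prediction` are hypothetical and are not taken from the K-SEED files; the toy records are invented for illustration.

```python
from collections import defaultdict


def accuracy_by_dimension(records):
    """Return {dimension: accuracy} for a list of answered questions.

    Each record is a dict with hypothetical keys (not the real K-SEED schema):
      'dimension'  - name of the evaluation dimension
      'answer'     - gold choice label
      'prediction' - choice label produced by the model
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for record in records:
        total[record["dimension"]] += 1
        if record["prediction"] == record["answer"]:
            correct[record["dimension"]] += 1
    # Every dimension in `total` has at least one record, so no division by zero.
    return {dim: correct[dim] / total[dim] for dim in total}


# Toy records invented for illustration; these are not real K-SEED items.
toy_records = [
    {"dimension": "scene understanding", "answer": "A", "prediction": "A"},
    {"dimension": "scene understanding", "answer": "B", "prediction": "C"},
    {"dimension": "instance identity", "answer": "D", "prediction": "D"},
]

scores = accuracy_by_dimension(toy_records)
for dimension, accuracy in scores.items():
    print(f"{dimension}: {accuracy:.2f}")
```

Reporting one score per dimension, rather than a single overall number, keeps weak areas visible when the dimensions contain different numbers of questions.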
Provided by:
NCSOFT



