CS-Bench/CS-Bench
Source: Hugging Face. Updated 2024-06-13. Indexed 2024-06-15.
Download link:
https://hf-mirror.com/datasets/CS-Bench/CS-Bench
Resource description:
---
license: cc-by-nc-4.0
language:
- en
- zh
tags:
- croissant
---
CS-Bench is the first benchmark dedicated to evaluating the performance of LLMs in computer science. It supports bilingual assessment (English and Chinese) and covers 26 subfields across 4 domains, for a total of 4838 samples. The samples span several task formats: multiple-choice, assertion, fill-in-the-blank, and open-ended questions. CS-Bench assesses both knowledge-type and higher-order reasoning-type questions, and each reasoning question is accompanied by an explanation. To validate the effectiveness of models, we randomly sample 10% of the data for validation and use the remaining 90% for testing. Please visit our [Website](https://csbench.github.io/) and [GitHub](https://github.com/csbench/csbench) for more details.
Provider:
CS-Bench
Summary of original information
CS-Bench dataset overview
Basic information
- License: cc-by-nc-4.0
- Languages:
  - English (en)
  - Chinese (zh)
- Tags: croissant
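Dataset details
- Field: computer science
- Number of subfields: 26
- Number of domains: 4
- Total samples: 4838
- Task formats:
  - Multiple-choice
  - Assertion (true/false)
  - Fill-in-the-blank
  - Open-ended
- Question types assessed:
  - Knowledge-type questions
  - Higher-order reasoning-type questions
- Reasoning questions include: an explanation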
Data split
- Validation set: a random 10% sample of the data
- Test set: the remaining 90% of the data
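
The random 10% / 90% split described above can be sketched as follows. This is a minimal illustration, not the authors' actual procedure: the function name `split_validation_test`, the fixed seed, the rounding rule, and the use of integer sample IDs are all assumptions made for the example, so the resulting subset sizes may not match the official files.

```python
import random


def split_validation_test(sample_ids, validation_fraction=0.10, seed=0):
    """Shuffle the IDs and carve off a validation subset; the rest is the test set.

    Hypothetical helper: the seed and rounding rule are assumptions,
    not details taken from the CS-Bench release.
    """
    ids = list(sample_ids)
    rng = random.Random(seed)  # local RNG so the global random state is untouched
    rng.shuffle(ids)
    n_validation = round(len(ids) * validation_fraction)
    return ids[:n_validation], ids[n_validation:]


# 4838 is the total sample count stated in the dataset card;
# plain integers stand in for the real samples.
validation_ids, test_ids = split_validation_test(range(4838))
print(len(validation_ids), len(test_ids))
```

Shuffling once and slicing keeps the two subsets disjoint and guarantees that every sample lands in exactly one of them.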



