five

Nalandadata/NalandaJEENEETBench

收藏
Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Nalandadata/NalandaJEENEETBench
下载链接
链接失效反馈
官方服务:
资源简介:
NalandaJEENEETBench是首个用于评估大型语言模型在印度竞争性考试(JEE Mains、JEE Advanced、NEET UG)问题上的开放基准数据集。该数据集是从Nalanda Data专有的116,000多个专家策划的考试问题中精选出来的样本,包含已验证的正确答案和逐步解答。数据集分为两个部分:基准部分(800个多选题,用于评估模型)和训练样本部分(500个多选题,包含解答,用于预览训练数据质量)。数据集平衡覆盖了物理、化学、数学和生物四个学科。该数据集填补了印度竞争性考试评估套件的空白,并展示了在7B模型上通过微调实现的显著性能提升。

NalandaJEENEETBench is the first open benchmark for evaluating large language models (LLMs) on Indian competitive exam questions (JEE Mains, JEE Advanced, NEET UG). It is a curated sample from Nalanda Datas proprietary dataset of over 116,000 expert-curated examination questions with verified correct answers and step-by-step solutions. The dataset includes two splits: a benchmark split (800 MCQs for evaluation) and a train sample split (500 MCQs with solutions for previewing training data quality). It is balanced across four subjects: Physics, Chemistry, Mathematics, and Biology. This benchmark addresses the lack of standard evaluation suites for Indian competitive exams and demonstrates significant performance improvements when fine-tuning models on this data.
提供机构:
Nalandadata
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作