smalleval/mmlu-nano

Name: smalleval/mmlu-nano
Creator: smalleval
Published: 2025-01-20 07:13:36
License: 暂无描述

Hugging Face2025-01-20 更新2025-11-29 收录

下载链接：

https://hf-mirror.com/datasets/smalleval/mmlu-nano

下载链接

链接失效反馈

官方服务：

资源简介：

# SmallEval: Browser-Friendly LLM Evaluation Datasets 🚀 [![Created by Cloud Code AI](https://img.shields.io/badge/Created%20by-Cloud%20Code%20AI-blue)](https://cloudcode.ai) SmallEval is a curated collection of lightweight evaluation datasets specifically designed for testing Large Language Models (LLMs) in browser environments. Each dataset is carefully subsampled to maintain a small footprint while preserving the evaluation quality. ## 🎯 Purpose The primary goal of SmallEval is to enable efficient evaluation of LLMs directly in web browsers. Traditional evaluation datasets are often too large for browser-based applications, making it challenging to assess model performance in client-side environments. SmallEval addresses this by providing: - Compact dataset sizes (250 samples per subset) - Carefully selected samples from established benchmarks - Browser-friendly JSONL format - Consistent evaluation metrics across different domains ## 📊 Available Datasets Each dataset is a subset of the original LightEval collection, containing 250 randomly sampled examples: ### MMLU (Massive Multitask Language Understanding) - `mmlu_high_school_mathematics.jsonl` - `mmlu_high_school_physics.jsonl` - `mmlu_high_school_biology.jsonl` - `mmlu_high_school_chemistry.jsonl` - `mmlu_high_school_computer_science.jsonl` - `mmlu_high_school_psychology.jsonl` - `mmlu_high_school_us_history.jsonl` - `mmlu_high_school_world_history.jsonl` ## 📥 Usage Checkout our Github Repo: https://github.com/Cloud-Code-AI/smalleval ## 🤝 Contributing We welcome contributions! If you'd like to add new subsets or improve existing ones, please: 1. Fork the repository 2. Create your feature branch 3. Submit a pull request ## 📜 License These datasets are derived from the original [LightEval](https://huggingface.co/lighteval) collection and maintain their original licenses. ## 🔗 Links - [Cloud Code AI](https://cloudcode.ai) - [Original LightEval Datasets](https://huggingface.co/lighteval)

提供机构：

smalleval

5,000+

优质数据集

54 个

任务类型

进入经典数据集