
smalleval/mmlu-nano

Source: Hugging Face · Updated: 2025-01-20 · Indexed: 2025-11-29
Download link:
https://hf-mirror.com/datasets/smalleval/mmlu-nano
Resource description:
# SmallEval: Browser-Friendly LLM Evaluation Datasets 🚀

[![Created by Cloud Code AI](https://img.shields.io/badge/Created%20by-Cloud%20Code%20AI-blue)](https://cloudcode.ai)

SmallEval is a curated collection of lightweight evaluation datasets designed for testing Large Language Models (LLMs) in browser environments. Each dataset is subsampled to keep a small footprint while preserving evaluation quality.

## 🎯 Purpose

The primary goal of SmallEval is to enable efficient evaluation of LLMs directly in web browsers. Traditional evaluation datasets are often too large for browser-based applications, which makes it hard to assess model performance in client-side environments. SmallEval addresses this by providing:

- Compact dataset sizes (250 samples per subset)
- Carefully selected samples from established benchmarks
- Browser-friendly JSONL format
- Consistent evaluation metrics across different domains

## 📊 Available Datasets

Each dataset is a subset of the original LightEval collection, containing 250 randomly sampled examples:

### MMLU (Massive Multitask Language Understanding)

- `mmlu_high_school_mathematics.jsonl`
- `mmlu_high_school_physics.jsonl`
- `mmlu_high_school_biology.jsonl`
- `mmlu_high_school_chemistry.jsonl`
- `mmlu_high_school_computer_science.jsonl`
- `mmlu_high_school_psychology.jsonl`
- `mmlu_high_school_us_history.jsonl`
- `mmlu_high_school_world_history.jsonl`

## 📥 Usage

Check out our GitHub repo: https://github.com/Cloud-Code-AI/smalleval

## 🤝 Contributing

We welcome contributions! If you'd like to add new subsets or improve existing ones, please:

1. Fork the repository
2. Create your feature branch
3. Submit a pull request

## 📜 License

These datasets are derived from the original [LightEval](https://huggingface.co/lighteval) collection and maintain their original licenses.

## 🔗 Links

- [Cloud Code AI](https://cloudcode.ai)
- [Original LightEval Datasets](https://huggingface.co/lighteval)
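
The subsets listed above are distributed as JSONL, meaning one JSON object per line. As a rough illustration of how such a file could be read and scored, here is a minimal Python sketch. The field names (`question`, `choices`, `answer`), the helper names `parse_jsonl` and `accuracy`, and the inline sample records are all assumptions made for this example; they are not taken from the SmallEval repository, so check the actual files for their schema before relying on it.

```python
import json
from typing import Callable, Iterable


def parse_jsonl(text: str) -> list[dict]:
    """Parse JSONL text: one JSON object per non-empty line."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def accuracy(
    records: Iterable[dict],
    predict: Callable[[dict], int],
    answer_key: str = "answer",  # assumed field name, not confirmed by the source
) -> float:
    """Return the fraction of records where predict(record) matches the gold answer."""
    records = list(records)
    if not records:
        return 0.0
    correct = sum(1 for record in records if predict(record) == record[answer_key])
    return correct / len(records)


# Hypothetical sample rows standing in for a real file such as
# mmlu_high_school_mathematics.jsonl; the schema here is assumed.
sample = "\n".join(
    [
        json.dumps({"question": "2 + 2 = ?", "choices": ["3", "4", "5", "6"], "answer": 1}),
        json.dumps({"question": "3 * 3 = ?", "choices": ["6", "9", "12", "27"], "answer": 1}),
    ]
)

records = parse_jsonl(sample)

# A trivial baseline that always picks the second choice (index 1).
always_second = lambda record: 1
print(f"accuracy: {accuracy(records, always_second):.2f}")
```

To score a downloaded subset instead of the inline sample, read the file's text (for example with `open(path, encoding="utf-8").read()`) and pass it to `parse_jsonl`, then replace `always_second` with a function that queries the model under test.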
Provider:
smalleval