five

Apex-X/PRODIGY-LAB_SARA

收藏
Hugging Face2025-10-21 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Apex-X/PRODIGY-LAB_SARA
下载链接
链接失效反馈
官方服务:
资源简介:
PRODIGY-LAB_SARA 是一个专门为指令微调大型语言模型(LLMs)而设计和优化的 refined 和 enhanced 数据集。它结合了多个高质量的数据源,包括清洗和规范化的指令,旨在提高指令遵循、上下文理解和推理性能。该数据集是 Aadhithya 的自定义创作,受到 Alpaca-Cleaned 和 Self-Instruct 数据集结构的启发,但采用了独特的多领域改进和混合 LLM 聚合方法。

PRODIGY-LAB_SARA is a refined and enhanced dataset designed for instruction-based fine-tuning of large language models (LLMs). It combines multiple high-quality sources, including cleaned and normalized instructions, to improve instruction following, context understanding, and reasoning performance. This dataset is a custom creation by Aadhithya, inspired by the structure of datasets like Alpaca-Cleaned and Self-Instruct, but built with unique multi-domain improvements and hybrid LLM curation methods.
提供机构:
Apex-X
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作