five

arcee-ai/LLama-405B-Logits

收藏
Hugging Face2024-11-29 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/arcee-ai/LLama-405B-Logits
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - "en" # ISO 639-1 code for English pretty_name: "Llama-405B-Logits Dataset" tags: - distillation - machine-learning - language-model license: "apache-2.0" # Valid license identifier task_categories: - text-generation - text2text-generation --- # Llama-405B-Logits Dataset The **Llama-405B-Logits Dataset** is a curated subset of logits extracted from the Llama-405B model, created to distill high-performance language models such as Arcee AI's **SuperNova** using [DistillKit](https://github.com/arcee-ai/Distillkit). This dataset was also instrumental in the training of the groundbreaking **INTELLECT-1** model, demonstrating the effectiveness of leveraging distilled knowledge for enhancing model performance. ## About the Dataset This dataset contains a carefully selected subset of Llama-405B logits, optimized for efficient use in distillation pipelines. It is specifically designed for: - **Model Distillation**: Enabling smaller models to learn from the behavior of larger models, improving performance while maintaining efficiency. - **Instruction-Tuning Applications**: Supporting the fine-tuning of models for instruction-following tasks. ## Applications 1. **SuperNova Models**: The dataset was pivotal in training Arcee AI's SuperNova series, helping achieve state-of-the-art results in alignment and general-purpose capabilities. 2. **INTELLECT-1**: Utilized during the decentralized training process to enhance the model's instruction-following capabilities. ## Tools and Usage The dataset is fully compatible with [DistillKit](https://github.com/arcee-ai/Distillkit), Arcee AI's proprietary framework for efficient distillation. DistillKit simplifies the distillation process by providing streamlined tools for managing datasets, extracting logits, and optimizing model training. ## Future Updates Arcee AI is undergoing rapid development for upcoming releases. The **DistillKit** repository will soon be updated with proper training scripts and additional resources to make it easier to work with the Llama-405B-Logits Dataset and other distillation workflows. Stay tuned for updates, and follow the progress on [DistillKit's GitHub](https://github.com/arcee-ai/Distillkit). ## Open-Source Contribution The **Llama-405B-Logits Dataset** is released under the Apache-2.0 license, in the spirit of open collaboration and transparency. We invite researchers and developers to explore its potential for advancing model performance and efficiency.
提供机构:
arcee-ai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作