andreuka18/DeepSeek-R1-Distill-Llama-8B-lmsys-openthoughts-tokenized
收藏Hugging Face2025-03-31 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/andreuka18/DeepSeek-R1-Distill-Llama-8B-lmsys-openthoughts-tokenized
下载链接
链接失效反馈官方服务:
资源简介:
该数据集用于训练稀疏自动编码器(SAEs),以识别大型语言模型中的推理特征。数据集由标记化的文本数据组成,用于训练SAEs,并遵循MIT许可。
This dataset is used for training Sparse Autoencoders (SAEs) to identify reasoning features in Large Language Models (LLMs). The dataset consists of tokenized text data for training the SAEs and is licensed under MIT.
提供机构:
andreuka18



