five

nebius/Qwen3-235B-Instruct-Infinity-Instruct-0625

收藏
Hugging Face2026-03-02 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/nebius/Qwen3-235B-Instruct-Infinity-Instruct-0625
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - text-generation language: - en configs: - config_name: default data_files: - split: train path: data/train-* dataset_info: features: - name: conversation list: - name: content dtype: string - name: role dtype: string - name: generated_message struct: - name: content dtype: string - name: reasoning_content dtype: 'null' - name: role dtype: string - name: tool_calls sequence: 'null' - name: finish_reason dtype: string splits: - name: train num_bytes: 3666857766 num_examples: 659808 download_size: 2067010910 dataset_size: 3666857766 --- # Qwen3-235B-Instruct-Infinity-Instruct-0625 ## Dataset Description This dataset is part of the LK-Speculators collection for speculative decoding research. It contains 660K prompt-response pairs designed for training draft models that are used alongside Qwen3-235B-A22B-Instruct-2507 as the target model. The dataset was created by generating responses to the prompts from [Infinity-Instruct-0625](https://huggingface.co/datasets/BAAI/Infinity-Instruct) with [Qwen/Qwen3-235B-A22B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507) at temperature=1. For more details on the training methodology and results, see our paper: [LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding](https://arxiv.org/abs/2602.23881). ## Dataset Structure - **Format**: parquet - **Rows**: 659,808 ## Usage ```python from datasets import load_dataset dataset = load_dataset("nebius/Qwen3-235B-Instruct-Infinity-Instruct-0625") ``` ## License The dataset is released under [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) ## Citation ``` @misc{samarin2026lklosses, title = {LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding}, author = {Alexander Samarin and Sergei Krutikov and Anton Shevtsov and Sergei Skvortsov and Filipp Fisin and Alexander Golubev}, year = {2026}, eprint = {2602.23881}, archivePrefix = {arXiv}, primaryClass = {cs.LG}, url = {https://arxiv.org/abs/2602.23881} } ```

### 数据集元信息 - 许可证:CC BY 4.0 - 任务类别:文本生成 - 语言:英语 - 配置项: - 配置名称:default 数据文件: - 拆分方式:训练集 路径:data/train-* - 数据集信息: - 特征: 1. 字段名:conversation,列表类型,包含子字段: - 字段名:content,数据类型:字符串 - 字段名:role,数据类型:字符串 2. 字段名:generated_message,结构体类型,包含子字段: - 字段名:content,数据类型:字符串 - 字段名:reasoning_content,数据类型:空值 - 字段名:role,数据类型:字符串 - 字段名:tool_calls,序列类型,元素类型:空值 3. 字段名:finish_reason,数据类型:字符串 - 拆分信息: - 拆分方式:训练集,字节大小:3666857766,样本数量:659808 - 下载大小:2067010910 - 数据集总大小:3666857766 # Qwen3-235B-Instruct-Infinity-Instruct-0625 ## 数据集描述 本数据集隶属于用于推测解码(speculative decoding)研究的LK-Speculators集合,共包含66万条提示词-回复对,用于训练与Qwen3-235B-A22B-Instruct-2507搭配使用的草稿模型(作为目标模型)。本数据集通过将温度参数设为1,使用[Qwen/Qwen3-235B-A22B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507)对[Infinity-Instruct-0625](https://huggingface.co/datasets/BAAI/Infinity-Instruct)中的提示词生成回复构建而成。如需了解训练方法与实验结果的更多细节,请参阅我们的论文:[LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding](https://arxiv.org/abs/2602.23881)。 ## 数据集结构 - **格式**:Parquet - **样本条数**:659,808 ## 用法 python from datasets import load_dataset dataset = load_dataset("nebius/Qwen3-235B-Instruct-Infinity-Instruct-0625") ## 许可证 本数据集采用[CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)协议发布。 ## 引用 @misc{samarin2026lklosses, title = {LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding}, author = {Alexander Samarin and Sergei Krutikov and Anton Shevtsov and Sergei Skvortsov and Filipp Fisin and Alexander Golubev}, year = {2026}, eprint = {2602.23881}, archivePrefix = {arXiv}, primaryClass = {cs.LG}, url = {https://arxiv.org/abs/2602.23881} }
提供机构:
nebius
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作