five

wrmedford/Qwen3.5-9B-SSD

收藏
Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/wrmedford/Qwen3.5-9B-SSD
下载链接
链接失效反馈
官方服务:
资源简介:
# SSD Dataset Replication (Qwen3.5-9B) This dataset is a replication of the **"Embarrassingly Simple Self-Distillation Improves Code Generation" (SSD)** paper ([arXiv:2604.01193](https://arxiv.org/pdf/2604.01193)). ## Overview The dataset contains coding problems and their corresponding solutions generated by **Qwen3.5-9B** using high-temperature sampling (T=1.1) to explore the model's latent capabilities. This approach, known as SSD, focuses on "self-distillation" where a model's own correct but non-greedy outputs are used for fine-tuning to improve its standard (greedy) performance. ## Dataset Construction - **Seed Prompts:** Samples from `deepmind/code_contests` - **Generation:** High-temperature sampling (T=1.1, Top-K=20, Top-P=0.95) with **Qwen3.5-9B**. - **Reasoning:** Reasoning chains (thinking) were enabled during generation to capture the model's logical process. ## Contents - `ssd_dataset.jsonl`: The generated samples in OpenAI-style message format. - `generate_dataset.py`: The replication script used to synthesize the data (vLLM-based, data-parallel). ## Usage To replicate the generation process: ```bash python generate_dataset.py --model Qwen/Qwen3.5-9B --dp <NUM_GPUS> ``` ## Citation ```bibtex @misc{zhang2026embarrassinglysimpleselfdistillationimproves, title={Embarrassingly Simple Self-Distillation Improves Code Generation}, author={Ruixiang Zhang and Richard He Bai and Huangjie Zheng and Navdeep Jaitly and Ronan Collobert and Yizhe Zhang}, year={2026}, eprint={2604.01193}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2604.01193}, } ```
提供机构:
wrmedford
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作