five

davanstrien/test-transformers-cb-smoltalk2

收藏
Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/davanstrien/test-transformers-cb-smoltalk2
下载链接
链接失效反馈
官方服务:
资源简介:
--- tags: - generated - transformers - continuous-batching - uv-script dataset_info: features: - name: messages list: - name: content dtype: string - name: role dtype: string - name: chat_template_kwargs struct: - name: custom_instructions dtype: string - name: enable_thinking dtype: bool - name: python_tools list: string - name: xml_tools list: string - name: source dtype: string - name: response dtype: string splits: - name: OpenHermes_2.5_no_think num_bytes: 9646 num_examples: 10 download_size: 13980 dataset_size: 9646 configs: - config_name: default data_files: - split: OpenHermes_2.5_no_think path: data/OpenHermes_2.5_no_think-* --- # Generated Responses Dataset This dataset contains generated responses for prompts from [HuggingFaceTB/smoltalk2](https://huggingface.co/datasets/HuggingFaceTB/smoltalk2). ## Generation Details - **Source Dataset**: [HuggingFaceTB/smoltalk2](https://huggingface.co/datasets/HuggingFaceTB/smoltalk2) - **Input Column**: `messages` (chat messages) - **Model**: [Qwen/Qwen3-4B-Instruct-2507](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) - **Backend**: transformers continuous batching - **Number of Examples**: 10 - **Generation Date**: 2026-03-25T12:32:41.761909 ### Generation Parameters - **Temperature**: 0.7 - **Top P**: 0.8 - **Top K**: 20 - **Max New Tokens**: 128 - **Max Batch Tokens**: 512 - **Repetition Penalty**: 1.0 ### Hardware Configuration - **GPUs**: 1 - **Attention Implementation**: paged|sdpa ## Dataset Structure The dataset contains all columns from the source dataset plus: - `response`: The generated response from the model ## Generation Script Generated using the transformers continuous batching script from [uv-scripts/transformers-inference](https://huggingface.co/datasets/uv-scripts/transformers-inference). To reproduce this generation: ```bash uv run https://huggingface.co/datasets/uv-scripts/transformers-inference/raw/main/generate-responses.py \ HuggingFaceTB/smoltalk2 \ <output-dataset> \ --model-id Qwen/Qwen3-4B-Instruct-2507 \ --messages-column messages \ --temperature 0.7 \ --top-p 0.8 \ --top-k 20 \ --max-tokens 128 ```
提供机构:
davanstrien
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作