
davanstrien/test-transformers-cb-8b

Source: Hugging Face. Updated 2026-03-24. Indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/davanstrien/test-transformers-cb-8b
Description:
---
tags:
- generated
- transformers
- continuous-batching
- uv-script
---

# Generated Responses Dataset

This dataset contains generated responses for prompts from [davanstrien/haiku_dpo](https://huggingface.co/datasets/davanstrien/haiku_dpo).

## Generation Details

- **Source Dataset**: [davanstrien/haiku_dpo](https://huggingface.co/datasets/davanstrien/haiku_dpo)
- **Input Column**: `question` (plain text prompts)
- **Model**: [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B)
- **Backend**: transformers continuous batching
- **Number of Examples**: 10
- **Generation Date**: 2026-03-24T18:44:26.007661

### Generation Parameters

- **Temperature**: 0.7
- **Top P**: 0.8
- **Top K**: 20
- **Max New Tokens**: 256
- **Max Batch Tokens**: 1024
- **Repetition Penalty**: 1.0

### Hardware Configuration

- **GPUs**: 1
- **Attention Implementation**: paged|sdpa

## Dataset Structure

The dataset contains all columns from the source dataset plus:

- `response`: The generated response from the model

## Generation Script

Generated using the transformers continuous batching script from [uv-scripts/transformers](https://huggingface.co/datasets/uv-scripts/transformers).

To reproduce this generation:

```bash
uv run https://huggingface.co/datasets/uv-scripts/transformers/raw/main/generate-responses.py \
    davanstrien/haiku_dpo \
    <output-dataset> \
    --model-id Qwen/Qwen3-8B \
    --prompt-column question \
    --temperature 0.7 \
    --top-p 0.8 \
    --top-k 20 \
    --max-tokens 256
```
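To make the sampling settings under Generation Parameters more concrete, here is a minimal pure-Python sketch of how temperature, top-k, and top-p filtering are commonly combined when picking the next token. It is an illustration only, not the code used by the generation script or by transformers: the function names `filter_distribution` and `sample_token`, the toy logits, and the filtering order (top-k first, then top-p on the renormalized remainder) are all assumptions made for this example.

```python
import math
import random


def filter_distribution(logits, temperature=0.7, top_k=20, top_p=0.8):
    """Return {token_index: probability} after temperature, top-k, and top-p.

    Hypothetical helper for illustration; defaults mirror the card's settings.
    """
    # Temperature scaling: values below 1.0 sharpen the distribution.
    scaled = [x / temperature for x in logits]

    # Softmax, subtracting the maximum for numerical stability.
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]
    total = sum(weights)
    probs = [w / total for w in weights]

    # Top-k: keep the k most probable tokens, then renormalize.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    k_mass = sum(probs[i] for i in ranked)
    sub = {i: probs[i] / k_mass for i in ranked}

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += sub[i]
        if cumulative >= top_p:
            break

    # Renormalize the survivors so their probabilities sum to 1.
    mass = sum(sub[i] for i in kept)
    return {i: sub[i] / mass for i in kept}


def sample_token(logits, rng, **params):
    """Draw one token index from the filtered distribution."""
    dist = filter_distribution(logits, **params)
    tokens = list(dist)
    return rng.choices(tokens, weights=[dist[t] for t in tokens], k=1)[0]


# Toy logits for a five-token vocabulary (made-up values).
toy_logits = [2.0, 1.0, 0.5, 0.0, -1.0]
dist = filter_distribution(toy_logits)
token = sample_token(toy_logits, random.Random(0))
```

With these toy logits, top-k 20 keeps the whole five-token vocabulary, and top-p 0.8 then trims it to the two most probable tokens. A repetition penalty of 1.0, as listed on the card, leaves the logits unchanged, so the sketch omits it.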

Provider: davanstrien