japhba/loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes
Source: Hugging Face. Updated 2026-04-16; indexed 2026-04-26.
Download link:
https://hf-mirror.com/datasets/japhba/loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes
Resource description:
---
pretty_name: Loracle FineWeb OpenRouter Gemini 3 Flash 1K Finetunes
tags:
- synthetic
- fineweb
- interpretability
- openrouter
- gemini
- question-answering
configs:
- config_name: question_rows
- config_name: question_groups
- config_name: synthetic_models
- config_name: sampled_docs
---
# loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes
Synthetic Loracle supervision data generated from FineWeb with OpenRouter.
## Run summary
- source dataset: `HuggingFaceFW/fineweb` / `sample-10BT` / `train`
- sampled docs: `6500`
- synthetic finetunes: `1284`
- generated finetunes in this shard: `1000`
- generator backend: `openrouter`
- generator model: `google/gemini-3-flash-preview`
- max docs per finetune: `40`
- max token budget per finetune: `10000`
- questions per finetune: `10`
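The two per-finetune caps above (`40` docs, `10000` tokens) can be read as a joint limit on how many source documents go into one synthetic finetune. The following is a minimal sketch of that reading, not the pipeline's actual sampling code: the greedy strategy, the `pack_docs` and `count_tokens` names, and the whitespace token count are all assumptions made for illustration.

```python
from typing import Iterable, List

MAX_DOCS = 40        # "max docs per finetune" from the run summary
MAX_TOKENS = 10_000  # "max token budget per finetune" from the run summary


def count_tokens(text: str) -> int:
    """Assumption: whitespace splitting stands in for the real tokenizer."""
    return len(text.split())


def pack_docs(
    docs: Iterable[str],
    max_docs: int = MAX_DOCS,
    max_tokens: int = MAX_TOKENS,
) -> List[str]:
    """Greedily pick documents until either cap would be exceeded.

    Hypothetical helper: documents that would overflow the token budget
    are skipped, and selection stops once the doc cap is reached.
    """
    chosen: List[str] = []
    used = 0
    for doc in docs:
        if len(chosen) >= max_docs:
            break
        n = count_tokens(doc)
        if used + n > max_tokens:
            continue  # this doc alone would push the total over budget
        chosen.append(doc)
        used += n
    return chosen
```

With many short documents the doc cap binds first; with a few long ones the token budget does.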
## Configs
- `question_rows`: flattened question-answer pairs, one row per question
- `question_groups`: one row per synthetic finetune with nested question lists
- `synthetic_models`: sampled synthetic finetune definitions before generation
- `sampled_docs`: sampled FineWeb source documents used for this run
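As a rough illustration of how these configs relate, the sketch below loads one config by name and flattens grouped rows into per-question rows. The config names come from this card; the column names `questions` and `finetune_id` are hypothetical placeholders, since the card does not list the schema, so check the real columns before relying on them.

```python
from typing import Any, Dict, Iterable, Iterator

REPO_ID = "japhba/loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes"
CONFIGS = ("question_rows", "question_groups", "synthetic_models", "sampled_docs")


def load_config(name: str):
    """Load one config from the Hub (needs the `datasets` package and network)."""
    if name not in CONFIGS:
        raise ValueError(f"unknown config {name!r}; expected one of {CONFIGS}")
    from datasets import load_dataset  # imported lazily so the helpers below work offline

    return load_dataset(REPO_ID, name)


def flatten_groups(
    groups: Iterable[Dict[str, Any]],
    list_key: str = "questions",   # hypothetical column name
    id_key: str = "finetune_id",   # hypothetical column name
) -> Iterator[Dict[str, Any]]:
    """Turn one-row-per-finetune records into one-row-per-question records."""
    for group in groups:
        for qa in group[list_key]:
            # Carry the finetune identifier onto every flattened row.
            yield {id_key: group[id_key], **qa}
```

For example, `list(flatten_groups(rows))` over rows shaped like `question_groups` would yield records shaped roughly like `question_rows`, assuming the nested entries are dictionaries.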
## Local artifacts
- run directory: `/ceph/scratch/jbauer/loracle/fineweb_openrouter_gemini3flash_1k`
- sample manifest: `sample_manifest.json`
- generation manifest: `generation_manifest.shard-000-of-001.json`
Provider:
japhba



