five

japhba/loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes

收藏
Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/japhba/loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: Loracle FineWeb OpenRouter Gemini 3 Flash 1K Finetunes tags: - synthetic - fineweb - interpretability - openrouter - gemini - question-answering configs: - config_name: question_rows - config_name: question_groups - config_name: synthetic_models - config_name: sampled_docs --- # loracle-fineweb-openrouter-gemini-3-flash-1k-finetunes Synthetic Loracle supervision data generated from FineWeb with OpenRouter. ## Run summary - source dataset: `HuggingFaceFW/fineweb` / `sample-10BT` / `train` - sampled docs: `6500` - synthetic finetunes: `1284` - generated finetunes in this shard: `1000` - generator backend: `openrouter` - generator model: `google/gemini-3-flash-preview` - max docs per finetune: `40` - max token budget per finetune: `10000` - questions per finetune: `10` ## Configs - `question_rows`: flattened question-answer pairs, one row per question - `question_groups`: one row per synthetic finetune with nested question lists - `synthetic_models`: sampled synthetic finetune definitions before generation - `sampled_docs`: sampled FineWeb source documents used for this run ## Local artifacts - run directory: `/ceph/scratch/jbauer/loracle/fineweb_openrouter_gemini3flash_1k` - sample manifest: `sample_manifest.json` - generation manifest: `generation_manifest.shard-000-of-001.json`
提供机构:
japhba
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作