five

Abhiram1009/synthetic-data-factory-5000-20260317

收藏
Hugging Face2026-03-17 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/Abhiram1009/synthetic-data-factory-5000-20260317
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-generation - question-answering language: - en pretty_name: Synthetic Data Factory 5k size_categories: - 1K<n<10K --- # Synthetic Data Factory 5k This dataset contains 5,000 synthetic math and logic examples generated without LLM-based generation. ## File - `generated_5000.jsonl`: JSONL rows with problem text, explanation text, final answer, tags, metadata, and quality report. ## Families - arithmetic expression evaluation - linear equation solving - comparison logic / transitive reasoning ## Generation approach Examples are created through a world-model-first pipeline: - latent structured problem specification - exact symbolic or rule-based teacher - controlled renderer for natural-language problems and explanations - validation, deduplication, and quality gating
提供机构:
Abhiram1009
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作