
sklmindforge/llm_subtraction_training

Source: Hugging Face. Updated 2026-03-20; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/sklmindforge/llm_subtraction_training
Description:
---
license: mit
language:
- en
---

# Systematic Subtraction Curriculum (State-Persistent)

This dataset is a tiered subtraction curriculum designed for small LLMs (0.5B - 1.5B). It solves the "Hidden State" problem by explicitly tracking variable mutations during borrowing.

## Curriculum Structure

1. **Tier 1: Mental Math (20%)** - Direct subtraction for results < 100 to build instant recall.
2. **Tier 2: Debt & Negatives (10%)** - Teaches the concept of negative results when subtracting larger from smaller.
3. **Tier 3: Explicit Scratchpad (70%)** - Complex multi-digit subtraction with verbose state updates (e.g., "100s becomes 4").

## Key Features

- **Explicit Borrowing:** Every borrow operation is announced, updating the "Value" of the next column in text.
- **Ordered Learning:** Designed for streaming training where the model masters basics before complex algorithms.
- **Zero-Ghosting:** No implicit math; every intermediate digit change is tokenized.
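
The card does not show the actual record format, so the following is only a minimal sketch of what a Tier 3 "Explicit Scratchpad" trace could look like. The function name `scratchpad_subtract`, the column labels produced by `place`, and the exact wording of each trace line are assumptions made for illustration; only the phrase "100s becomes 4" comes from the card. The sketch performs column-by-column subtraction and writes every borrow and every digit change as text, and it handles the Tier 2 case (a smaller number minus a larger one) by swapping the operands and negating the result.

```python
def place(i: int) -> str:
    """Hypothetical column label: 0 -> '1s', 1 -> '10s', 2 -> '100s', ..."""
    return f"{10 ** i}s"


def scratchpad_subtract(a: int, b: int) -> tuple[int, list[str]]:
    """Return (a - b, trace), where trace records every digit change as text.

    Assumed format, not the dataset's confirmed schema.
    """
    if a < 0 or b < 0:
        raise ValueError("this sketch handles non-negative operands only")

    trace: list[str] = []
    negative = a < b
    if negative:
        # Tier 2 style case: result is negative, so subtract the other way round.
        trace.append(f"{a} < {b}, so compute {b} - {a} and negate")
        a, b = b, a

    # Digits stored least-significant first so index i is the 10**i column.
    top = [int(d) for d in str(a)][::-1]
    bottom = [int(d) for d in str(b)][::-1]
    bottom += [0] * (len(top) - len(bottom))

    result_digits = []
    for i in range(len(top)):
        if top[i] < bottom[i]:
            # Find the nearest higher column that has something to lend.
            j = i + 1
            while top[j] == 0:
                j += 1
            top[j] -= 1
            trace.append(f"borrow from {place(j)}: {place(j)} becomes {top[j]}")
            # Zero columns between the lender and column i each become 9.
            for k in range(j - 1, i, -1):
                top[k] = 9
                trace.append(f"{place(k)} becomes 9")
            top[i] += 10
            trace.append(f"{place(i)} becomes {top[i]}")
        digit = top[i] - bottom[i]
        result_digits.append(digit)
        trace.append(f"{place(i)}: {top[i]} - {bottom[i]} = {digit}")

    value = int("".join(str(d) for d in reversed(result_digits)))
    if negative:
        value = -value
    trace.append(f"answer: {value}")
    return value, trace


if __name__ == "__main__":
    value, trace = scratchpad_subtract(503, 78)
    print("\n".join(trace))
```

For `503 - 78`, the trace begins with `borrow from 100s: 100s becomes 4`, followed by `10s becomes 9` and `1s becomes 13`, so no column changes value without a corresponding line of text. A real training record would presumably pair such a trace with a prompt, but that pairing is not described in the card.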
Provider:
sklmindforge