reasoning-degeneration-dev/algorithmic-sft-training-configs-v1
收藏Hugging Face2026-03-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/reasoning-degeneration-dev/algorithmic-sft-training-configs-v1
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
tags:
- algorithmic-sft
- training-config
- llamafactory
- algorithmic-sft-vs-distillation
---
# algorithmic-sft-training-configs-v1
LlamaFactory training configs. All cutoff_len=32768. Countdown configs use new equation-answer format.
## Dataset Info
- **Rows**: 17
- **Columns**: 7
## Columns
| Column | Type | Description |
|--------|------|-------------|
| config_name | Value('string') | YAML filename |
| domain | Value('string') | *No description provided* |
| is_distillation | Value('bool') | *No description provided* |
| yaml_content | Value('string') | Full YAML config |
| model_name | Value('string') | *No description provided* |
| cutoff_len | Value('int64') | *No description provided* |
| dataset | Value('string') | *No description provided* |
## Generation Parameters
```json
{
"script_name": "upload after countdown redesign",
"description": "LlamaFactory training configs. All cutoff_len=32768. Countdown configs use new equation-answer format.",
"model": "Qwen/Qwen2.5-1.5B-Instruct",
"hyperparameters": {},
"input_datasets": []
}
```
## Experiment Documentation
For complete experiment details, see [https://github.com/Zayne-sprague/SC-Research-Notes/tree/main/experiments/algorithmic_sft_vs_distillation](https://github.com/Zayne-sprague/SC-Research-Notes/tree/main/experiments/algorithmic_sft_vs_distillation)
## Usage
```python
from datasets import load_dataset
dataset = load_dataset("reasoning-degeneration-dev/algorithmic-sft-training-configs-v1", split="train")
print(f"Loaded {len(dataset)} rows")
```
---
*This dataset is tracked in [reasoning-degeneration-dev/PROJECT-MANIFEST](https://huggingface.co/datasets/reasoning-degeneration-dev/PROJECT-MANIFEST)*
提供机构:
reasoning-degeneration-dev



