chiuratto-AIgourakis/sounio-code-examples
收藏Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/chiuratto-AIgourakis/sounio-code-examples
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
language:
- en
pretty_name: Sounio Code Examples
size_categories:
- n<1K
task_categories:
- text-generation
tags:
- code
- compiler
- programming-language
- scientific-computing
- formal-verification
- uncertainty-propagation
- algebraic-effects
configs:
- config_name: default
data_files:
- split: train
path: train.jsonl
- split: validation
path: validation.jsonl
---
# sounio-code-examples
Instruction/completion dataset for **Sounio**, a self-hosted systems + scientific programming language for epistemic computing.
## Contents
- `train.jsonl`: 333 examples
- `validation.jsonl`: 37 examples
- Total: 370 examples extracted from `tests/run-pass` and `tests/compile-fail`
Each record contains:
- `instruction`: natural-language prompt derived from test annotations, descriptions, and file names
- `completion`: the full `.sio` source file
- `suite`: `run-pass` or `compile-fail`
- `source_path`: original repository path
- `annotations`: extracted `//@ ...` metadata
- `ignore`: whether the upstream suite currently ignores the example
## Why this dataset exists
This dataset is designed to make Sounio legible to code models quickly:
- run-pass examples teach valid syntax and idioms
- compile-fail examples teach effect discipline, refinement failures, and epistemic boundary checks
- source paths preserve provenance back to the repository test corpus
## Rebuild locally
```bash
python3 scripts/export_hf_dataset.py
```
## Upload
```bash
python3 scripts/export_hf_dataset.py --upload
```
Set `HF_TOKEN` or `HUGGINGFACE_HUB_TOKEN` before uploading.
提供机构:
chiuratto-AIgourakis



