five

chiuratto-AIgourakis/sounio-code-examples

收藏
Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/chiuratto-AIgourakis/sounio-code-examples
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 language: - en pretty_name: Sounio Code Examples size_categories: - n<1K task_categories: - text-generation tags: - code - compiler - programming-language - scientific-computing - formal-verification - uncertainty-propagation - algebraic-effects configs: - config_name: default data_files: - split: train path: train.jsonl - split: validation path: validation.jsonl --- # sounio-code-examples Instruction/completion dataset for **Sounio**, a self-hosted systems + scientific programming language for epistemic computing. ## Contents - `train.jsonl`: 333 examples - `validation.jsonl`: 37 examples - Total: 370 examples extracted from `tests/run-pass` and `tests/compile-fail` Each record contains: - `instruction`: natural-language prompt derived from test annotations, descriptions, and file names - `completion`: the full `.sio` source file - `suite`: `run-pass` or `compile-fail` - `source_path`: original repository path - `annotations`: extracted `//@ ...` metadata - `ignore`: whether the upstream suite currently ignores the example ## Why this dataset exists This dataset is designed to make Sounio legible to code models quickly: - run-pass examples teach valid syntax and idioms - compile-fail examples teach effect discipline, refinement failures, and epistemic boundary checks - source paths preserve provenance back to the repository test corpus ## Rebuild locally ```bash python3 scripts/export_hf_dataset.py ``` ## Upload ```bash python3 scripts/export_hf_dataset.py --upload ``` Set `HF_TOKEN` or `HUGGINGFACE_HUB_TOKEN` before uploading.
提供机构:
chiuratto-AIgourakis
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作