five

dongbobo/AlgoNotes-Explain

收藏
Hugging Face2026-03-27 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/dongbobo/AlgoNotes-Explain
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - text-generation language: - en tags: - algorithms - code - explanations - synthetic pretty_name: AlgoNotes-Explain size_categories: - 1K<n<10K --- # AlgoNotes-Explain Synthetic explanation dataset built on top of **AlgoNotes-Raw**, with natural-language explanations generated by `Qwen3-Coder` (model card license: **Apache 2.0**). ## Dataset Description AlgoNotes-Explain pairs each algorithm snippet from AlgoNotes-Raw with a generated explanation. The explanations were produced by the `Qwen3-Coder` model. ## Source & Construction - **Raw data source**: `dongbobo/AlgoNotes-Raw` (upstream: `MathWeave/algobook-snippets`, license: `cc-by-4.0`) - **Explanation generator**: `Qwen3-Coder` (model card license: `apache-2.0`) - **Experimental-only checkpoints (not used to produce this release)**: Mistral, Llama ## License Reasoning **Candidate set (raw data source + generator model only):** | Source | Role | Licence | |--------|------|---------| | `MathWeave/algobook-snippets` → `AlgoNotes-Raw` | raw algorithmic snippets included verbatim | `cc-by-4.0` | | `Qwen3-Coder` | model that generated the **shipped** explanations | `apache-2.0` | The Mistral and Llama checkpoints were used only in internal experiments and produced **no content** included in this release; they are excluded from the candidate set. **Permissiveness ranking:** `apache-2.0` is the more permissive of the two candidates (explicit patent grant, no share-alike requirement, allows sub-licensing). However, `AlgoNotes-Explain` physically contains the raw `cc-by-4.0` snippets alongside each generated explanation; the combined dataset must therefore satisfy the CC BY 4.0 attribution obligation inherited from the upstream data. `cc-by-4.0` is the binding constraint and governs this artifact. ## License `AlgoNotes-Explain` is released under **Creative Commons Attribution 4.0 International (CC BY 4.0)**. The raw algorithmic snippets originate from `MathWeave/algobook-snippets` (CC BY 4.0, via `AlgoNotes-Raw`); the natural-language explanations were generated by [`Qwen3-Coder`](https://huggingface.co/Qwen/Qwen3-Coder), whose model card is licensed under Apache 2.0. The Mistral and Llama checkpoints listed in the repository were used only for internal experiments and did **not** produce any content in this release; they are excluded from the licence derivation. As the dataset physically contains the CC BY 4.0 snippets alongside the generated explanations, `cc-by-4.0` is the binding constraint governing this combined artifact.
提供机构:
dongbobo
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作