anonnorth/wikibooks-cookbook
Source: Hugging Face. Updated 2026-03-31. Indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/anonnorth/wikibooks-cookbook
Resource description:
---
license: cc-by-sa-4.0
configs:
- config_name: default
  data_files:
  - split: main
    path: "recipes_parsed.mini.json"
---
# Wikibooks Recipe Dataset
Looking for a Creative Commons-licensed dataset of delicious recipes? Whether your aim is to cook a healthy dinner for your traditional family, polycule or commune, or whether you want to save the world from disastrous AI-generated recipes (see [this article](https://www.theguardian.com/food/article/2024/jul/31/one-of-the-most-disgusting-meals-ive-ever-eaten-ai-recipes-tested)), look no further!
This dataset contains a dump (scraped 2024-07-31) of the HTML files for all individual recipe pages in the [Wikibooks Cookbook](https://en.wikibooks.org/wiki/Cookbook:Table_of_Contents), along with a JSON file containing all recipe text (and infoboxes) in a semi-structured format.
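The card does not document the JSON schema, so a reasonable first step is to inspect the file before relying on any field names. The sketch below assumes `recipes_parsed.mini.json` has already been downloaded into the working directory; the helper names `load_recipes` and `summarize` are hypothetical and not part of the dataset.

```python
import json
from pathlib import Path


def load_recipes(path):
    """Load the parsed recipe JSON file and return the decoded object."""
    with Path(path).open(encoding="utf-8") as fh:
        return json.load(fh)


def summarize(data):
    """Describe the top-level shape without assuming any particular schema."""
    if isinstance(data, list):
        first = data[0] if data else None
        # Only report keys when the first entry is a JSON object.
        keys = sorted(first) if isinstance(first, dict) else []
        return {"type": "list", "count": len(data), "first_keys": keys}
    if isinstance(data, dict):
        # Cap the key listing so large mappings stay readable.
        return {"type": "dict", "count": len(data), "first_keys": sorted(data)[:10]}
    return {"type": type(data).__name__, "count": 1, "first_keys": []}


if __name__ == "__main__":
    # Assumption: the file sits next to this script after a manual download.
    path = Path("recipes_parsed.mini.json")
    if path.exists():
        print(summarize(load_recipes(path)))
    else:
        print(f"{path} not found; download it from the dataset repository first.")
```

Alternatively, since the front matter declares a `main` split, calling `load_dataset("anonnorth/wikibooks-cookbook", split="main")` from the Hugging Face `datasets` library may work. This has not been verified against this repository.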
Provider:
anonnorth



