
anonnorth/wikibooks-cookbook

Source: Hugging Face. Updated 2026-03-31. Indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/anonnorth/wikibooks-cookbook
Description:
---
license: cc-by-sa-4.0
configs:
- config_name: default
  data_files:
  - split: main
    path: "recipes_parsed.mini.json"
---

# Wikibooks Recipe Dataset

Looking for a Creative Commons-licensed dataset of delicious recipes? Whether your aim is to cook a healthy dinner for your traditional family, polycule or commune, or whether you want to save the world from disastrous AI-generated recipes (see [this article](https://www.theguardian.com/food/article/2024/jul/31/one-of-the-most-disgusting-meals-ive-ever-eaten-ai-recipes-tested)), look no further!

This dataset contains a dump (scraped 2024-07-31) of the HTML files of all individual recipe pages in the [Wikibooks Cookbook](https://en.wikibooks.org/wiki/Cookbook:Table_of_Contents), plus a JSON file containing all recipe text (and infoboxes) in a semi-structured format.
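
As a rough illustration of working with the JSON file named in the config, the sketch below reads a local copy and reports only its top-level shape. It assumes the file has already been downloaded to the working directory as `recipes_parsed.mini.json`; the helper name `summarize_recipes` is hypothetical, and because the card does not document the schema, no field names are assumed.

```python
import json
from pathlib import Path


def summarize_recipes(path):
    """Return (top-level type name, number of entries) for a JSON file.

    Hypothetical helper: the card only says the file is "semi-structured",
    so nothing below the top level is inspected.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if isinstance(data, (list, dict)):
        return type(data).__name__, len(data)
    # A bare scalar at the top level counts as a single entry.
    return type(data).__name__, 1


if __name__ == "__main__":
    # Assumed local path; download the file from the dataset page first.
    local_file = Path("recipes_parsed.mini.json")
    if local_file.exists():
        kind, count = summarize_recipes(local_file)
        print(f"top-level {kind} with {count} entries")
    else:
        print(f"{local_file} not found")
```

Checking the top-level shape first is a cautious way to approach a file whose structure is undocumented, before writing code that depends on particular keys.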

Provider:
anonnorth