
6000+ Indian Food Recipes Dataset

Source: doi.org, indexed 2025-03-24
Download link:
http://doi.org/10.17632/xsphgmmh7b.1
Description:
When I looked for a food recipes dataset, especially one for Indian food, I could not find one online that I could use, so I decided to create one. The dataset has the following self-explanatory fields: ['RecipeName', 'TranslatedRecipeName', 'Ingredients', 'TranslatedIngredients', 'Prep', 'Cook', 'Total', 'Servings', 'Cuisine', 'Course', 'Diet', 'Instructions', 'TranslatedInstructions']. The dataset contains a CSV file and an XLS file; the Hindi content is sometimes not visible in the CSV format. The columns with the prefix 'Translated' exist because many entries in the dataset were in Hindi. To translate those entries into English for consistency, I used 'googletrans', a Python library that implements the Google Translate API underneath. The code for the crawler, cleaning, and transformation is in my GitHub repository (@kanishk307). The dataset was created from the Archana's Kitchen website, a great site that hosts a lot of useful content and is worth a visit if you are interested. The dataset can be used to answer many questions about food recipes: you can explore serving sizes, the time required to prepare a dish, the most common ingredients, and the different cuisines, diets, and courses. I hope this dataset helps the analytics community.
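As a rough illustration of the kind of exploration described above, here is a minimal sketch that counts recipes per cuisine and averages the 'Total' time. Only the column names come from the dataset description. The sample rows are made up, and the helper names `load_rows` and `summarize`, the UTF-8 encoding, and the assumption that 'Total' holds a plain number are all hypothetical, since the description does not state the file name, the encoding, or the units.

```python
import csv
import io
from collections import Counter

# Made-up sample rows using a subset of the documented column names.
# The real file name, encoding, and value formats are assumptions.
SAMPLE_CSV = """TranslatedRecipeName,Cuisine,Course,Diet,Total,Servings
Masala Dosa,South Indian,Breakfast,Vegetarian,45,4
Paneer Tikka,North Indian,Appetizer,Vegetarian,60,4
Lemon Rice,South Indian,Lunch,Vegetarian,30,2
"""


def load_rows(path, encoding="utf-8"):
    """Read a CSV file into a list of dicts keyed by column name.

    Opening the file with an explicit encoding (UTF-8 is assumed here)
    is one way to try to keep the Hindi text readable.
    """
    with open(path, newline="", encoding=encoding) as handle:
        return list(csv.DictReader(handle))


def summarize(rows):
    """Return (recipe count per cuisine, mean of the 'Total' column)."""
    rows = list(rows)
    cuisines = Counter(row["Cuisine"] for row in rows)
    # Skip rows where 'Total' is empty; values are assumed to be numeric.
    totals = [float(row["Total"]) for row in rows if row["Total"].strip()]
    mean_total = sum(totals) / len(totals) if totals else None
    return cuisines, mean_total


# Run the summary on the in-memory sample instead of a real file.
cuisines, mean_total = summarize(csv.DictReader(io.StringIO(SAMPLE_CSV)))
print(cuisines.most_common(1))
print(mean_total)
```

With the downloaded file, `summarize(load_rows(path))` would be the equivalent call, where `path` is wherever the CSV was saved.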

Provider:
doi.org