five

Sample Project: Exploratory Data Analysis with TMDB Data

收藏
Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/241c2e1c-3752-43b9-b38a-d4ea3c77f308/Databricks_Sample-Project:-Exploratory-Data-Analysis-with-TMDB-Data
下载链接
链接失效反馈
官方服务:
资源简介:
**Exploring the TMDB Movies Dataset on Databricks** Welcome to our comprehensive guide on Exploratory Data Analysis (EDA) on Databricks. Using a rich movies dataset, you'll journey through exploring and profiling the data, to cleaning and querying it, and ultimately discovering insights about the world of cinema, complemented by compelling visualizations. By the end of this notebook, you'll not only be more familiar with our platform's capabilities, but you'll also gain a hands-on understanding of the fundamental steps in EDA. In this sample project, you will: 1. Read in a sample dataset 2. Profile the data to get a general idea of our dataset's contents 3. Clean up the data and save it as a new table 4. Query the cleaned data to gain insights about movies (with visualizations!) 5. Take the exploration further on your own with a challenge The data set you will use contains information about 10,000 movies collected from The Movie Database (TMDB), including user ratings and revenue. Source: [https://www.kaggle.com/datasets/akshaypawar7/millions-of-movies](https://www.kaggle.com/datasets/akshaypawar7/millions-of-movies)
提供机构:
Databricks
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作