Sample Project: Exploratory Data Analysis with TMDB Data
收藏Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/241c2e1c-3752-43b9-b38a-d4ea3c77f308/Databricks_Sample-Project:-Exploratory-Data-Analysis-with-TMDB-Data
下载链接
链接失效反馈官方服务:
资源简介:
**Exploring the TMDB Movies Dataset on Databricks**
Welcome to our comprehensive guide on Exploratory Data Analysis (EDA) on Databricks. Using a rich movies dataset, you'll journey through exploring and profiling the data, to cleaning and querying it, and ultimately discovering insights about the world of cinema, complemented by compelling visualizations. By the end of this notebook, you'll not only be more familiar with our platform's capabilities, but you'll also gain a hands-on understanding of the fundamental steps in EDA.
In this sample project, you will:
1. Read in a sample dataset
2. Profile the data to get a general idea of our dataset's contents
3. Clean up the data and save it as a new table
4. Query the cleaned data to gain insights about movies (with visualizations!)
5. Take the exploration further on your own with a challenge
The data set you will use contains information about 10,000 movies collected from The Movie Database (TMDB), including user ratings and revenue.
Source: [https://www.kaggle.com/datasets/akshaypawar7/millions-of-movies](https://www.kaggle.com/datasets/akshaypawar7/millions-of-movies)
提供机构:
Databricks



