
Airbnb Open Data

Source: www.kaggle.com | Updated: 2022-08-01 | Indexed: 2025-03-23
Download link:
https://www.kaggle.com/arianazmoudeh/airbnbopendata
Description:
**New York City Airbnb Data Cleaning**

Airbnb, Inc. is an American company that operates an online marketplace for lodging (primarily homestays for vacation rentals) and tourism activities. Based in San Francisco, California, the platform is accessible via website and mobile app. Airbnb does not own any of the listed properties; instead, it profits by receiving a commission from each booking. The company was founded in 2008. Airbnb is a shortened version of its original name, AirBedandBreakfast.com.

**About Dataset**

**Context**

Since 2008, guests and hosts have used Airbnb to travel in a more unique, personalized way. As part of the Inside Airbnb initiative, this dataset describes the listing activity of homestays in New York City.

**Content**

The following Airbnb activity is included in this New York dataset:

- Listings, including full descriptions and average review score
- Reviews, including a unique id for each reviewer and detailed comments
- Calendar, including listing id and the price and availability for that day

**Data Dictionary**

Data dictionaries are used to provide detailed information about the contents of a dataset or database, such as the names of measured variables, their data types or formats, and text descriptions. A data dictionary provides a concise guide to understanding and using the data. https://docs.google.com/spreadsheets/d/1b_dvmyhb_kAJhUmv81rAxl4KcXn0Pymz

**Inspiration**

- Learn data cleaning
- Data cleaning challenge
- Data cleaning practice for beginners
- Handling missing values
- Handling outliers
- Handling inconsistent data
- Data visualization
- Data analysis
- What can we learn about different hosts and areas?
- What can we learn from predictions (e.g. locations, prices, reviews)?
- Which hosts are the busiest and why?

**Acknowledgment**

This dataset is derived from Inside Airbnb, but I added new columns and introduced many data inconsistencies to create a new dataset for practicing data cleaning. The original source can be found here: http://insideairbnb.com/explore/

Arian Azmoudeh @arianazmoudeh https://www.linkedin.com/in/arianazmoudeh/

I hope you enjoy it.
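The cleaning tasks named in the description (missing values, outliers, inconsistent data) could be practiced along the following lines. This is only a minimal sketch using the Python standard library: the column names (`id`, `borough`, `price`), the sample values, and the misspelling map are illustrative assumptions, not taken from the actual file or its data dictionary, and the median fill and 1.5×IQR rule are just one common choice among many.

```python
import statistics

# Hypothetical sample rows: the column names ("id", "borough", "price") and
# every value are illustrative assumptions, not taken from the real file.
ROWS = [
    {"id": "1", "borough": "Brooklyn", "price": "$100 "},
    {"id": "2", "borough": "brookln", "price": "$120 "},
    {"id": "3", "borough": "Manhattan", "price": ""},       # missing price
    {"id": "4", "borough": "manhatan", "price": "$160 "},
    {"id": "4", "borough": "manhatan", "price": "$160 "},   # duplicate row
    {"id": "5", "borough": "queens ", "price": "$180 "},
    {"id": "6", "borough": "Bronx", "price": "$200 "},
    {"id": "7", "borough": "Queens", "price": "$5,000 "},   # suspicious price
]

# Assumed misspellings mapped to a canonical spelling.
BOROUGH_FIXES = {"brookln": "Brooklyn", "manhatan": "Manhattan"}


def parse_price(text):
    """Convert a string such as '$5,000 ' to 5000.0; blank becomes None."""
    digits = text.replace("$", "").replace(",", "").strip()
    return float(digits) if digits else None


def clean(rows):
    """Drop duplicate ids, normalize boroughs, and fill missing prices."""
    seen, cleaned = set(), []
    for row in rows:
        if row["id"] in seen:
            continue  # keep only the first row for each id
        seen.add(row["id"])
        name = row["borough"].strip()
        cleaned.append({
            "id": row["id"],
            "borough": BOROUGH_FIXES.get(name.lower(), name.title()),
            "price": parse_price(row["price"]),
        })
    # Fill missing prices with the median of the known ones.
    known = [r["price"] for r in cleaned if r["price"] is not None]
    median = statistics.median(known)
    for r in cleaned:
        if r["price"] is None:
            r["price"] = median
    return cleaned


def iqr_outlier_ids(rows, k=1.5):
    """Return ids whose price lies outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles([r["price"] for r in rows], n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [r["id"] for r in rows if not low <= r["price"] <= high]


cleaned = clean(ROWS)
print(sorted({r["borough"] for r in cleaned}))
print(iqr_outlier_ids(cleaned))
```

Flagging outliers rather than deleting them keeps the decision with the analyst, since an unusually high price may be a genuine luxury listing rather than an error.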

Provider:
www.kaggle.com