Business Mafia Proposal
Contents
Project Motivation
A significant proportion of Airbnb hosts rent out portions of their own homes to generate additional side income. Instead of relying on a robust approach when setting prices, they tend to do so intuitively, relying on gut feeling. Our group hopes to offer these homeowners an alternative way to price their listings - through an amalgamation of factors such as their listing's geographical location and its relationship with Downtown Seattle.
However, the primary challenge here is simplifying and summarising the technical, complex analytics techniques into layman terms; it would require breaking down the technical jargon associated with it. In order to carry this out effectively, we created an RShiny Application which would guide owners systematically through the thought process. This would allow owners to not only derive the final proposed listing price, but also better understand our thought process and methodology behind the derivation of the price.
Project Objective
Through our project, we aim to:
- Derive individual walking distance between various key attractions and Airbnb listings in Downtown Seattle
- Analyse the spatial relationships between various key locations and Airbnb listings in Downtown Seattle to determine if the listing's location to key places affect its listing price
- Through the use of Local Geographical Weighted Regression (GWR) Model, we hope to help Airbnb owner(s) determine the better pricing for their listing(s).
Our Datasets
Data | Source | Data Description | Source URL | Data Type |
---|---|---|---|---|
Literature Review
Sources:
- https://towardsdatascience.com/airbnb-rental-listings-dataset-mining-f972ed08ddec
- https://www.airbnbcitizen.com/the-airbnb-community-in-seattle/
- https://stanleyadion.shinyapps.io/AmazeingCrop/
Literature Review 1: Airbnb Rental Listings Dataset Mining
Literature's outcome: An exploratory analysis of Airbnb's Data to understand the rental landscape in New York City
- NYC's data was also obtained from Inside Airbnb and it contains the same three tables as ours, except that it was for New York City
- Listings.csv - contains 96 detailed attributes for each listing. Some of the attributes are continuous (i.e. Price, Longitude, Latitude, ratings) and others are categorical (Neighbourhoods, Listing_type, is_superhost) which is used for the analysis
- Reviews.csv - Detailed reviews given by guests with six attributes. The key attributes include date (datetime), listing_id (discrete), reviewer_id (discrete), comments (textual)
- Calendar.csv - Provides details about booking for the next year for each listing. There are four attributes in total, they are: listing_id (discrete), date (datetime), available (categorical) and price (continuous).
Screenshot of a paragraph taken off the website. This was a paragraph that we found useful when we conducted our Exploratory Data Analysis for we ran into the problem of not knowing the number of days each listing opens it's doors for bookings. This made a good enough estimation for us to understand if the listing was highly sought after or if it was one of those listings where it opened it's doors only few times each year. Unfortunately, the number of days available for booking by each listing was not made publicly available by Airbnb.
Our Methodology
Project Storyboard
Application Overview
Our Findings
Reflecting on our project
Project Timeline