I find Airbnb itself to be an interesting concept. Renting out any left over space you might have in your home seems like a great way of earning some extra cash while at the same time meeting new people.
Initial datasets
Airbnb provides datasets containing all the airbnb listings a number of large cities through insideairbnb. I thought it would be cool to have a look at some of this data. It triggered my data exploration urge and multiple questions popped into my mind straight away. Let’s have a look how Airbnb is like in New York.
The dataset contains information like Price, Number of Reviews and Number of days available in a year. It also allows for geospatial analysis as we have Borough, Neighborhood and even Latitude and Longitude information.
I decided to go hunting for some additional data to find possible correlations. I soon bumped into the website nycopendata which has a bunch of free datasets related to New York. One that caught my eye was Subway Entrances data. Might there be a correlation between price/ratings and the proximity to a subway entrance?
I also added the location of the main tourist attractions in New York. It would be interesting to see how the proximity to these affect the price.
I also found datasets containing all the free WifI hotspots and even all public parking lots/garages. I didn’t use these dataset in my analysis but you can find them in the Git repo if you’re interested.
Data preparation
First I needed to calculate the closest subway entrance. Using R, I came up with the following code. I fully expect this to be unoptimized. It did take a while to go through the 27000+ listings. If you have any suggestion, please let me know!
A similar calculations was needed for the closest tourist attractions. Here I also added the name of the attractions itself to the resulting dataset.
Prices
Disclaimer: I have never been to New York so I started with zero prior knowledge of the geo-spatial layout of New York.
As expected Manhattan has the highest average price and also the most amount of listings. The majority of the famous attractions in New York (atleast as an unknowing tourist like myself) are located on Manhattan which should bump up the demand and therefore the price.
Staten Island is the second most expensive borough to stay in. It does however have the least amount of listings.
The most expensive neighborhood is Tribeca in Manhattan with a hefty average price tag of just over $400 for one night. Not surprising as this is supposed to be one of the most desired neighborhoods in NYC. The list of celebrities who has proporty here is extensive.
Subway Proximity
The majority of the listings are very close to a subway entrance ( < 400m ) indicated by the blue line. The price of the listings are lower the further away from a subway entrance they are. This can be explained quite easily though. The attractive (more pricey) areas are generally packed with subway entrances. In Manhattan for example you will never be far from the subway whereas outside of Manhattan distances of 400m+ are more common.
Free WiFi
Not related to Airbnb but still interesting. NYC seems to provide free WiFi in most parks.