Capstone Project

 

 

Capstone Project – Final Report

<Market Research for Restaurant in New York City>


Nayeong Lee

July 30, 2020

 

 


I.              Introduction

I.1. Background

The project has New York City as a target. New York City is the largest and planned city in the United States in the southern part of New York. It is the busiest and most famous city in the world and is known as the capital of the world as a center of politics, economy, culture, fashion, etc. Also, it is one of the most famous urban tourist destinations in the United States and is loved by many tourists.

 

I.2. Problem and Interest

If someone wants to run a restaurant in New York City, how should he plan his management? Which of New York City's residents or tourists has the better proportion of potential restaurant customers? Anyone who wants to start a new restaurant business in New York may be interested in this project.

 

I made my virtual client for this project. My client recently moved to New York City after getting married. He wants to run a new restaurant in the city to maintain his livelihood. He has just moved to the city and has spent most of his life in the eastern part of the United States, so he knows little about the tastes of people in the region.

He wants to find out the ‘type’ of restaurant that people there prefer and their taste buds. Especially, he wants to avoid being a same kind of restaurant with the other competing restaurants that are overwhelmingly well-received. He gave me his home address and asked me to do market research within a 3 km radius.

 


 

II.          Data acquisition and cleaning

 

II.1. Data sources

I only used Foursquare location data for my project. I can get information about what kind of spots is around a specific location, as well as distance, category, address, ratings and reviews of each spot.

 

II.2. Data cleaning

Scraped data included a variety of spots, including parks, bars, gyms, and merchandise stores as well as restaurants. I've only picked out restaurants from these places.

After data cleaning, there was a list of 36 restaurants left in the data.

 


 

III.      Methodology

 

III.1. Distribution of the categories

I sorted out the categories of each restaurant to see which kind of restaurants are the most distributed. Although sales at each restaurant are unknown, but I can give my client information about which foods are common and which are rare. Based on the information, he will be able to determine what kind of food he will sell.



 

 

III.2. Restaurants with good ratings

I selected only restaurants with a rating of 9/10 or higher and make a list. The list includes the categories of each restaurant. I can give this information to client to avoid competition with overwhelmingly well-rated restaurants.


 

III.3. Customer review analysis

I have set this section as a plan for my project, but since my account is free, there is a limit to the number of times I can inquire users' reviews. I wanted to pick out the 10 restaurants with the most customer reviews and make a list of only the reviews that include the 'taste', but it was impossible with my account.

So, I only loaded the number of reviews of users in this code. This will allow my client to know which restaurants are popular and popular.

 


 

IV.                Results

Foursqure location data were collected within a radius of 3km around the customer's location. As a result, I got information about a total of 36 restaurants.

  The restaurants had a total of 20 categories. The category that corresponds to the largest number of restaurants is ‘Italian restaurant’, with a total of five locations located. Next were four ‘pizza places,’ three ‘sea restaurants’ and three ‘sandwich places.’ In other categories, only one or two restaurants existed.

  I looked up the ratings of the restaurants, and they all scored over 9.0. I could see that every restaurant around had a good rating. The criteria and data that I originally designated do not correspond with each other. However, if you change the criteria to 9.5 and interpret the data, the results will change. All of their ratings had values between 9.1 and 9.4. An overwhelmingly high rating of 9.5 or higher did not present. This could be good news for my client.

 

 

V.              Discussion

There are five Italian restaurants around my client, accounting for the largest number of categories. Also, Italian restaurants are the places where the most post-visit reviews exist. My client said he wanted to avoid competition with the most popular restaurants, so he had better not plan an Italian restaurant.
 My client wants to know the taste of his local people. This may be inferred into the categories of restaurants with many post-visit reviews. The five restaurants with the highest number of reviews were Italian, French, pizza, Australian and seafood restaurants. Therefore, other Western foods are more popular in his region than American food.

 

VI.          Conclusion

My project was conducted to conduct a survey of the restaurant market in a New York City area. The limitation of my Foursquare account allowed me to get only information about 36 restaurants, but I seem to have obtained meaningful results through data analysis. I'd like to give a management recommendation based on my results to someone who wants to run a restaurant in New York like my hypothetical client.