From Geospatial Analytics for Urban Planning
		
		
		
		
		
		Jump to navigation
		Jump to search
		
Back to Project Home
Introduction
International Food Chain (IFC) is a leading brand in its sector, with over 18000 outlets worldwide and an ever-growing presence in the global market. In Taiwan alone, IFC has over 240 branches and are constantly expanding.
However, as the franchise grows bigger, so does its challenges. One of the challenges involves the lack of a geographical analysis to efficiently compare the performance of each chain to one another.
Leveraging on this fact, our group aims to digitalise the data and conduct in-depth analysis on each branch. We hope to track the performance of each chain in relation to Point-Of-Interests surrounding each chain, uncovering and comprehending phenomena, with the aid of spatial data.
Problem and Motivation
To provide an analysis that allows for:
- Digitizing of each chain’s trade and delivery area
- Business profiling of the company’s outlet to determine Points-Of-Interests (POIs) that can generate insights such as: Highest earning outlets, relative performance of outlets, outlet’s profile patterns and item sales information.
- Allow for informed business decisions, such as determining locations for new outlet openings with matching POIs of high sales outlets
- Scalable program to incorporate future data to generate current information (Using data from other cities besides Taiwan)
- Easy and intuitive tool to quickly view information with regards to all branches
Objectives
This project aims to provide insights into the following:
- Missing Areas in trade zone
- Number of POIs surrounding each store
- Store performance with regards to sales
- Delivery Information
- Population Density
- Buffer and proximity
- Nearest Competitors to store
- Variable importance based on regression analysis
Background Survey of Related Works (WIP)
| Visualizations | Explaination | 
| 
 | Data source: https://www.researchgate.net/publication/324949619_Visualization_of_Fast_Food_Restaurant_Location_using_Geographical_Information_System
 The visualization provides the buffer polygons, as well as representing population density of the area through colour. By comparing the two, we can conclude whether the center of activity is proportional to the population density in a region. It allows us to perform further exploration to see what spatial information significantly affects the level of activity in a city, such as the availability of points-of-interest. This visualization is great as it allows the viewer to clearly see multiple dimensions dealing with spatial data in an elegant way. 
 | 
| 
 | Data source: https://www.researchgate.net/publication/324949619_Visualization_of_Fast_Food_Restaurant_Location_using_Geographical_Information_SystemThe graph on the left shows the distribution of outlets on the geographical map. The right graph describes the outlets grid distribution, result from grid creation and spatial joint operation. From both figures, they can show the potential tendency of whether the outlets are clustered, and with the number of outlets in each grid. We could use them together to justify and adjust the outlet locations.
 | 
| 
 | Data source: https://www.researchgate.net/publication/324949619_Visualization_of_Fast_Food_Restaurant_Location_using_Geographical_Information_SystemThis visualization provides a novel way of linking a variable to its geographical location when hovering over the respective area. It would be great in our case, if we were to allow the user to view the corresponding branch through the tooltip, for example profit.
 | 
| 
 | Data source: https://www.researchgate.net/publication/324949619_Visualization_of_Fast_Food_Restaurant_Location_using_Geographical_Information_SystemThis shows kernel density surface, based on the number of fast food restaurants around Jakarta and distribute them smoothly, so it provides average surface estimation. Kernel density estimation allows us to observe both the centrality and agglomeration of existing outlets. This visualization allows us to view multiple dimensions at a time in an effective manner, through the choice of colour and size.
 | 
| 
 | Data source: 
https://public.tableau.com/profile/mirandali#!/vizhome/Salesforce-SalesPerformance/SalesPerformance
 This databoard shows the cumulative sales. We could learn from this and display by outlets to compare the performance by having multiple forms of visualization. We really like the fact that certain key summarizations and variables are displayed on the top, and will consider using this in our project.
 | 
  
Tools and Libraries
The following tools and libraries are used in the digitisation and analysis:
 
Datasets
Datasets Provided:
| Dataset | Rationale | 
| Traced Map | The client provided us with powerpoint files of manually drawn trade areas. These maps contain the various zones within the trade area, competitors, nearby stores as well as the drive time between each spots in the main road.The five competitors defined by the client are:
 DominosNapoleonMcdonaldsKentucky Fried ChickenMos Burger
 | 
| Geospatial Data | The client provided us SHP files that contains information about Counties found in Taiwan
 
| Column | Description | Example |  
| COUNTYID | Unique ID of each County | “U” |  
| COUNTYCODE | Unique numerical ID of each County | 10015 |  
| COUNTYNAME | Name of the County in Mandarin | “台北市” |  
| COUNTYENG | Name of the County in English | “Taipei City” |  | 
| Town Area | The client provided us SHP files that contains information about Towns found in Taiwan
 
| Column | Description | Example |  
| VILLCODE | Unique numerical ID of each Village | 65000050041 |  
| VILLNAME | Name of the Village in Mandarin | “甲仙區” |  
| VILLENG | Name of the Village in English | “Jiasian” |  
| TOWNID | Unique ID of each Town | “K12” |  
| TOWNCODE | Unique numerical ID of each town | 10005060 |  
| TOWNNAME | Name of the Town in Mandarin | “半線城” |  
| COUNTYID | Unique ID of each County | “U” |  
| COUNTYCODE | Unique numerical ID of each Countyy | 10015 |  
| COUNTYNAME | Name of the County in Mandarin | “台北市” |  
| NOTE | Miscellaneous notes | NIL |  | 
| Village Area | The client provided us SHP files that contains information about Villages found in Taiwan
 
| Column | Description | Example |  
| TOWNID | Unique ID of each Town | “K12” |  
| TOWNCODE | Unique numerical ID of each town | 10005060 |  
| TOWNNAME | Name of the Town in Mandarin | “半線城” |  
| TOWNENG | Name of the Town in English | “Bamboo Town” |  
| COUNTYID | Unique ID of each County | “U” |  
| COUNTYCODE | Unique numerical ID of each County | 10015 |  
| COUNTYNAME | Name of the County in Mandarin | “台北市” |  | 
| Taiwan Stores | The client provided us a GeoPackage that contains information about each Pizza Hut Store
 
| Column | Description | Example |  
| fid | Unique ID of each Town | “K12” |  
| Country | Unique numerical ID of each town | 10005060 |  
| Market | Name of the Town in Mandarin | “半線城” |  
| PH/PHD | Name of the Town in English | “Bamboo Town” |  
| Status | Unique ID of each County | “U” |  
| Milestone | Unique numerical ID of each County | 10015 |  
| Local Code | Name of the County in Mandarin | “台北市” |  
| CHAMPS Code | Name of the Town in English | “Bamboo Town” |  
| JDE Code | Unique ID of each County | “U” |  
| Store Name | Unique numerical ID of each County | 10015 |  
| Latest Asset Type | Name of the County in Mandarin | “台北市” |  
| Facility Type | Name of the Town in English | “Bamboo Town” |  
| City Location | Unique ID of each County | “U” |  
| Location Type | Unique numerical ID of each County | 10015 |  
| Open Date | Name of the County in Mandarin | “台北市” |  
| Close Date | Unique numerical ID of each County | 10015 |  
| Corresponding Relo-Open / Relo-Closure Store Name | Name of the County in Mandarin | “台北市” |  
| Corresponding Relo-Open / Relo-Closure Date | Unique numerical ID of each County | 10015 |  
| Corresponding Relo-Open / Relo-Closure Asset Type | Name of the County in Mandarin | “台北市” |  
| Store Address | Address of the store | B1 & 1F., No. 52-1, Hsin Sheng S. Rd., Sec. 1, Taipei, Taiwan (R.O.C) |  
| Latitude | Latitude of the store | 25.041601 |  
| Longitude | Longitude of the store | 121.532475 |  
| Month | Month of the opening date of the store | 10 |  
| Quarter | Quarter of the opening date of the store | Q4 |  
| Year | Year of the opening date of the store | FY1995 |  
| Grouping | Used to denote which group these stores were assigned to | G1 Group 10 |  
| Cluster ID | Used to denote which group these stores were assigned to, in numerical value | 6 |  | 
Foreseen Technical Challenges
We encountered the following technical challenges throughout the course of the project. We have indicated our proposed solutions, and the outcomes of the solutions.
| Key Technical Challenges | Proposed Solution | Outcome | 
| Data is already pre-aggregated to display monthly sales | The dataset is given directly to us from IFC, and we are unable to change it. Thus, We shall utilize and do our best with the available data.
 | NA
 | 
| Unfamiliarity in R and Python in creation of data processing scripts | Watching video tutorials about R and PythonIndependent learning on the design and syntaxPeer learning and sharingUsing Datacamp as our mentor
 | We managed to start using the languages quickly and suit our own project needs.
Each of us work on different parts such as setting up, designing, logic and deployment.
This speeds up our project progress.
 | 
| Data Cleaning & Transformation Proposed Solution | Having a systematic process while working together in order to maximise efficiency e.g. taking turns to clean, transform and perform checks on the data to ensure accuracy
 | The adopted process was having clear instructions issued to each member in the team, along with maintaining constant communication with each other. In the event that the dataset is deemed too dirty to be usable, it was dropped along with sourcing for new data that would be a suitable replacement.
 | 
| Lack of geospatial knowledge to understand the dataset initially | Attend SMT201 class to learn more, as well as reading up on resources given by Prof Kam to gain further contextual knowledge
 | NA
 | 
| Digitising of trade areas from powerpoint slide to QGIS | The process is manual and we had to put in a lot of effort to convert the drawn polygon to data points in QGIS.
 | The data points can better allow us to generate insights on the profile of each outlet via its trade area.
 | 
| Integrating Relevant Data from Multiple Sources Proposed Solution | Working together to decide on what data to extract or eliminate
 | NA
 | 
Roles & Milestones (WIP)
| Kelvin Chia Sen Wei | Linus Cheng Xin Wei | Eugene Choy Wen Jie | 
| Data Cleaner 
 Wiki Writer
 | Report WriterDesign Architect
 | Data Cleaner 2Poster man
 | 
References
Feel free to leave comments / suggestions!