Difference between revisions of "1718t1is428T2"

From Visual Analytics for Business Intelligence
 
(3 intermediate revisions by one other user not shown)
Line 16: Line 16:
 
<p></p><br/>
 
<p></p><br/>
 
<div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROBLEM & MOTIVATION</font></div>
 
<div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROBLEM & MOTIVATION</font></div>
In the year 1960, Singapore was facing a huge crisis. Many people were living in unhygienic slums and crowded squatters with only a meager 9% of Singaporeans lived in government flats, while everyone else yearned for a place to call home sweet home.To counter this crisis,, the Housing & Development Board (HDB) was incorporated on 1 February, 1960 and tasked with the critical mission of solving the crisis ar hand. In a mere span of 10 years, HDB had attained its goal and solved the housing crisis.
+
With housing prices being a hot topic that most undergraduates are talking about as we approach that final period called graduation, don't you wish that there was a way for us to get a better picture of the housing market so that we can make our big decision on where we are going to live from now on? Thus, we sought out to look for a dataset which would be able to provide us with the required information we needed to come up with useful visualizations. Unfortunately, most datasets are incomplete or not useful in providing us with any insight unless a good amount of data cleaning and wrangling is done. For example, transaction data which only gives you the block number and road name with no postal code. How is that useful? Therefore, one of our primary goals is to come up with a visualization which would provide users with useful information about the resale market in Singapore and help people gain access to a better data set than the ones currently out there.
  
However, in 1993, HDB stopped deciding the prices of new apartments based on construction costs, instead they decided based on market prices. Prices of resale flats and new flats entered in a vicious circle, rising 50% in just 6 months of 1993 and tripled to 1996. This move closed the price gap between small and large flat types and hub pricing have never been he same again.
+
In terms of what visualizations we would be going for, we felt that it would be interesting to look into the historical flat data so that we can see which flats in Singapore would be the most value for money so that we can actually get a home which is worth its investment. We also felt that it would be fun to explore trends in the resale flat prices and see what factors really affect the prices of HDBs and see how much of a premium people attach to amenities such as proximity to public transport, schools and etc...  
 
 
Thus, as graduates to be who will most likely enter the job market soon and start looking for a place to call home, we felt that it would be interesting to look into the historical flat data so that we can see which flats in Singapore would be the most value for money so that we can actually get a home which is worth its investment. We also felt that it would be fun to explore trends in the resale flat prices and see what factors really affect the prices of HDBs and see how much of a premium people attach to amenities such as proximity to public transport, schools and etc...  
 
  
 
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">OBJECTIVES</font></div>
 
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">OBJECTIVES</font></div>
Line 135: Line 133:
 
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;" | How Analyst Can Conduct Analysis
 
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;" | How Analyst Can Conduct Analysis
 
|-
 
|-
| [[File:HSH Home.pn|600px|center]]   
+
| [[File:HSH Home.png|600px|center]]   
 
||  
 
||  
 
# Introduce analysts to the topic of HDB Resale Price and the objectives of the visualization project
 
# Introduce analysts to the topic of HDB Resale Price and the objectives of the visualization project
 
# Upon clicking "Find your Dream Home", analysts will then begin their process of exploration
 
# Upon clicking "Find your Dream Home", analysts will then begin their process of exploration
 
|-
 
|-
| [[File:HSH Homesketch.jpeg|600px|center]]   
+
| [[File:HSH Insight1.png|600px|center]]   
 
||  
 
||  
 
# When a user enters our app, we will show them a brief history of HDB followed by the problem that most young people are facing with regards to understanding the HDB situation.
 
# When a user enters our app, we will show them a brief history of HDB followed by the problem that most young people are facing with regards to understanding the HDB situation.
# The 3 insights we are trying to show will be displayed as 3 clickable buttons so that it is easy for a user to know what exactly he wants to look for at a glance.
+
# The 2 screens of different insights we are trying to show will be displayed as 2 clickable buttons so that it is easy for a user to know what exactly he wants to look for at a glance.
 +
# The dual axis bar-line chart will indicate the number of units sold vs the HDB price index to show the performing over the years.
 +
# The scatter plot groups attacks based on 3 main categories - Planning Area, HDB resale price, Life left on lease. Firstly, by grouping based on planning area, it shows the the different HDB of how many years they left on lease. The size of the circle indicate the resale price of the HDB
 
|-
 
|-
| [[File:HSH insight1.jpg|600px|center]]  
+
| [[File:HSH Insight2.png|600px|center]]  
 
||  
 
||  
 
# In the next phase of the exploration, we
 
# In the next phase of the exploration, we
# The radar chart shows 6 different governance indicators that defines how well a country is governed. The closer the area is to the center of the chart, the less well governed the country is. Upon mouse-over of each area, one can also retrieve the exact values of each governance indicator.
+
# The radar chart shows 5 different governance indicators that defines how well a HDB is price with several other indicators. The closer the area is to the center of the chart, the less well the indicators is. Upon mouse-over of each area, one can also retrieve the exact values of each governance indicator.
# The bubble plot groups attacks based on 3 main categories - country, target victim and political terror scale. Firstly, by grouping based on country, one can better visualize the number of attacks that took place in each country and contrast it with the data presented in the radar chart. Secondly, by grouping based on target victims, one can also establish the most common targets of these terrorist attacks that took place. This will bring analysts further in the data exploration, especially if there are high numbers of attacks targeting at certain groups of people. Lastly, grouping by political terror scale allow the analysts to contrast information with the radar chart. In addition, by looking at the count of attacks in each scale, one can also identify interesting patterns.
+
# Using the map of Singapore as a filter condition, analysts can selected their choice of HDB to see if its near any bus stop, MRT, schools.  
# In the bubble plot, each bubble represents an attack. When the analyst mouse-over each bubble, they can see more information about each attack and this helps to bring context to the analyst in their data exploration phase. Due to technical limitations and to avoid excessive clutter on the page, only attacks that have resulted in more than 15 deaths will be shown. This is in conjunction with the assumption that analysts would be more interested to look at attacks that have caused great harm to the public.
+
# By looking at both charts, the analyst will then be able to compare and establish possible linkages between the different indicators for HDB over the years. As such, these 2 charts are placed side by side to assist the analyst in their data exploration.
# By looking at both charts, the analyst will then be able to compare and establish possible linkages between how well a country is governed and the number of terrorist attacks that took place in the country. As such, these 2 charts are placed side by side to assist the analyst in their data exploration.
 
# Similarly to the previous page, the bar chart at the bottom of the page will show the number of attacks that took place for all selected countries.
 
 
|}
 
|}
  
Line 185: Line 183:
 
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">TOOLS/TECHNOLOGIES</font></div>
 
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">TOOLS/TECHNOLOGIES</font></div>
 
<p>The following are some of the tools/technologies that we will be utilizing during the project:</p>
 
<p>The following are some of the tools/technologies that we will be utilizing during the project:</p>
 +
* Excel
 
* D3.js
 
* D3.js
 
* Proj4.js
 
* Proj4.js

Latest revision as of 18:54, 6 November 2017

1718T1G1 Logo.png


PROBLEM & MOTIVATION

With housing prices a hot topic among undergraduates as we approach graduation, don't you wish there was a way to get a better picture of the housing market before making the big decision on where to live? We therefore set out to find a dataset that could provide the information we needed to build useful visualizations. Unfortunately, most datasets are incomplete, or offer little insight unless a good amount of data cleaning and wrangling is done. For example, some transaction data gives only the block number and road name, with no postal code, which is of limited use on its own. One of our primary goals is therefore to build a visualization that gives users useful information about the resale market in Singapore and helps people gain access to a better dataset than the ones currently available.

In terms of the visualizations themselves, we felt it would be interesting to look into historical flat data to see which flats in Singapore offer the best value for money, so that we can get a home that is worth the investment. We also felt it would be fun to explore trends in resale flat prices, see which factors really affect HDB prices, and see how much of a premium people attach to amenities such as proximity to public transport and schools.


OBJECTIVES

In this project, we are interested in creating a visualisation that helps users do the following:

  1. View the trend in resale prices over time with respect to major events that happened in each year (Example: 1993 Change in Pricing Model, 1997 Recession)
  2. Identify which areas are more expensive and possible reasons for the high value (Proximity to Public Transport, Schools, Shopping Malls, Parks)
  3. Find out whether a specific HDB flat is a good investment based on the number of years left on its lease, and which locations may be more profitable based on the age of the flat.

With our visualisation, users will get a better idea of the pricing situation of resale HDB flats, so that they can make better decisions about the flat they choose to call home: for example, when is the best time to buy an HDB flat, where the most profitable or cheapest locations are, and whether a flat is expensive.
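As a rough illustration of objectives 1 and 3, the sketch below shows one way the resale transactions could be summarised per year and how the years left on a lease could be derived from the lease commence date. It is only a sketch under stated assumptions: the field names (`month`, `leaseCommenceYear`, `resalePrice`) and the sample rows are made up for illustration, the `month` value is assumed to look like `YYYY-MM`, and the lease length is assumed to be 99 years.

```javascript
// Assumption: flats are sold on a 99-year lease; change this if a flat differs.
const LEASE_YEARS = 99;

// Years of lease left on a flat in a given year, clamped to [0, LEASE_YEARS].
function remainingLeaseYears(leaseCommenceYear, asOfYear) {
  const left = LEASE_YEARS - (asOfYear - leaseCommenceYear);
  return Math.max(0, Math.min(LEASE_YEARS, left));
}

// Median of a non-empty array of numbers.
function median(values) {
  const sorted = [...values].sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1
    ? sorted[mid]
    : (sorted[mid - 1] + sorted[mid]) / 2;
}

// Group transactions by the year part of "month" (assumed "YYYY-MM") and
// return, for each year in ascending order, units sold and median price.
function summariseByYear(transactions) {
  const pricesByYear = new Map();
  for (const t of transactions) {
    const year = Number(t.month.slice(0, 4));
    if (!pricesByYear.has(year)) pricesByYear.set(year, []);
    pricesByYear.get(year).push(t.resalePrice);
  }
  return [...pricesByYear.entries()]
    .sort(([a], [b]) => a - b)
    .map(([year, prices]) => ({
      year,
      unitsSold: prices.length,
      medianPrice: median(prices),
    }));
}

// Made-up sample rows, not taken from the real dataset.
const sample = [
  { month: '1993-01', leaseCommenceYear: 1980, resalePrice: 100000 },
  { month: '1993-06', leaseCommenceYear: 1985, resalePrice: 150000 },
  { month: '1996-03', leaseCommenceYear: 1980, resalePrice: 300000 },
];

console.log(summariseByYear(sample));
console.log(remainingLeaseYears(1980, 2017));
```

The per-year summary could feed a trend chart annotated with events such as the 1993 pricing change, while the remaining-lease value could be plotted against resale price for the investment question.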


SELECTED DATASET

In our analysis, we will only be using data from 1990 to 2017. The rationale for the range of data selected is as follows:

The data for analysis will be retrieved from multiple sources, as elaborated below:

Dataset/Source:
Resale flat prices from Mar 2012 onwards
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=83b2fc37-ce8c-4df4-968b-370fd818138b)
Resale flat prices from 2002 - Feb 2012
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=8c00bf08-9124-479e-aeca-7cc411d884c4)
Resale flat prices from 1990 - 1999
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=adbbddd3-30e2-445f-a123-29bee150a6fe)
Data Attributes:
  • Month
  • Town
  • Flat Type
  • Block
  • Street Name
  • Storey Range
  • Floor Area (Sqm)
  • Flat Model
  • Lease Commence Date
  • Resale Price (S$)
Rationale Of Usage:
This dataset will be the main source of information in our analysis for understanding HDB resale transactions around Singapore, covering 1990 to 1999, 2002 to Feb 2012 and Mar 2012 onwards respectively.

Dataset/Source:
Bus Stop Names and Locations
(https://www.mytransport.sg/content/mytransport/home/dataMall.html#)
Data Attributes:
  • Bus Stop Number
  • Bus Stop Roof Number
  • Bus Stop Name
  • X
  • Y
  • Latitude
  • Longitude
Rationale Of Usage:
This dataset complements the main dataset by providing the latitude and longitude of the bus stops located around HDB flats. We use a JavaScript script to convert all the X and Y coordinates to EPSG:4326 latitude and longitude coordinates.

Dataset/Source:
MRT Station Names and Locations
(https://www.mytransport.sg/content/mytransport/home/dataMall.html#)
Data Attributes:
  • MRT Station Number
  • MRT Station Name
  • X
  • Y
  • Latitude
  • Longitude
Rationale Of Usage:
This dataset complements the main dataset by providing the latitude and longitude of the MRT stations located around HDB flats. We use a JavaScript script to convert all the X and Y coordinates to EPSG:4326 latitude and longitude coordinates.

Dataset/Source:
Schools (Primary, Secondary, Junior College) Names and Locations
(https://data.gov.sg/dataset/school-directory-and-information)
Data Attributes:
  • School Name
  • Address
  • Postal Code
  • Planning Area
Rationale Of Usage:
This dataset complements the main dataset by providing the addresses of schools located around HDB flats.
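The X and Y to latitude and longitude conversion mentioned for the bus stop and MRT datasets can be sketched as follows. This is not our actual script; it is a dependency-free illustration that assumes the X and Y values are SVY21 (EPSG:3414) eastings and northings in metres, which the source does not state explicitly. It uses the standard inverse Transverse Mercator series on the WGS84 ellipsoid. In practice a library such as Proj4.js (listed under our tools) would normally do this in a single call, and `svy21ToWgs84` is a hypothetical name.

```javascript
// WGS84 ellipsoid constants.
const A = 6378137.0;                 // semi-major axis (m)
const F = 1 / 298.257223563;         // flattening
const E2 = F * (2 - F);              // first eccentricity squared
const EP2 = E2 / (1 - E2);           // second eccentricity squared

// SVY21 / EPSG:3414 projection parameters (ASSUMED source CRS of X and Y).
const LAT0 = (1.366666666666667 * Math.PI) / 180;
const LON0 = (103.8333333333333 * Math.PI) / 180;
const K0 = 1.0;
const FALSE_EASTING = 28001.642;
const FALSE_NORTHING = 38744.572;

// Coefficient shared by the forward and inverse meridian arc series.
const ARC_COEFF = 1 - E2 / 4 - (3 * E2 ** 2) / 64 - (5 * E2 ** 3) / 256;

// Meridian arc length (m) from the equator to latitude phi (radians).
function meridianArc(phi) {
  const e4 = E2 ** 2;
  const e6 = E2 ** 3;
  return A * (ARC_COEFF * phi
    - ((3 * E2) / 8 + (3 * e4) / 32 + (45 * e6) / 1024) * Math.sin(2 * phi)
    + ((15 * e4) / 256 + (45 * e6) / 1024) * Math.sin(4 * phi)
    - ((35 * e6) / 3072) * Math.sin(6 * phi));
}

// Convert an SVY21 easting/northing pair (m) to EPSG:4326 degrees.
function svy21ToWgs84(x, y) {
  // Footpoint latitude from the meridian arc at this northing.
  const m = meridianArc(LAT0) + (y - FALSE_NORTHING) / K0;
  const mu = m / (A * ARC_COEFF);
  const root = Math.sqrt(1 - E2);
  const e1 = (1 - root) / (1 + root);
  const phi1 = mu
    + ((3 * e1) / 2 - (27 * e1 ** 3) / 32) * Math.sin(2 * mu)
    + ((21 * e1 ** 2) / 16 - (55 * e1 ** 4) / 32) * Math.sin(4 * mu)
    + ((151 * e1 ** 3) / 96) * Math.sin(6 * mu)
    + ((1097 * e1 ** 4) / 512) * Math.sin(8 * mu);

  const sin1 = Math.sin(phi1);
  const cos1 = Math.cos(phi1);
  const tan1 = Math.tan(phi1);
  const c1 = EP2 * cos1 * cos1;
  const t1 = tan1 * tan1;
  const n1 = A / Math.sqrt(1 - E2 * sin1 * sin1);
  const r1 = (A * (1 - E2)) / Math.pow(1 - E2 * sin1 * sin1, 1.5);
  const d = (x - FALSE_EASTING) / (n1 * K0);

  const lat = phi1 - ((n1 * tan1) / r1) * (
    (d * d) / 2
    - ((5 + 3 * t1 + 10 * c1 - 4 * c1 * c1 - 9 * EP2) * d ** 4) / 24
    + ((61 + 90 * t1 + 298 * c1 + 45 * t1 * t1 - 252 * EP2 - 3 * c1 * c1) * d ** 6) / 720);
  const lon = LON0 + (
    d
    - ((1 + 2 * t1 + c1) * d ** 3) / 6
    + ((5 - 2 * c1 + 28 * t1 - 3 * c1 * c1 + 8 * EP2 + 24 * t1 * t1) * d ** 5) / 120) / cos1;

  return { latitude: (lat * 180) / Math.PI, longitude: (lon * 180) / Math.PI };
}

// Made-up X/Y pair for illustration, not a real bus stop.
console.log(svy21ToWgs84(30000, 36000));
```

Converted values should be spot-checked against the Latitude and Longitude columns already present in the datasets before being relied on.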



BACKGROUND SURVEY OF RELATED WORKS

There are many charts and visualisations available which illustrate the various trends in house prices and price indices. We have selected a few of these to study and learn from before we begin developing our own visualizations.

Related Works What We Can Learn

An Analysis of the trend and correlation between resale prices and flat production

1718T1G1 BackgroundSurvey1.png

Source: http://www.teoalida.com/singapore/hdbprices/

  • The use of 2 different chart types with a secondary axis is effective in illustrating the correlation between resale prices and flat production.
  • The colours used are striking and contrast well with each other.
  • There are dips in both variables which are not explained in the infographic itself (E.g. 1997 Asian crisis, 2003 SARS outbreak). These events could be incorporated into the charts to make them more informative.

An interactive heatmap of Singapore’s house prices in various districts

1718T1G1 BackgroundSurvey2.png

Source: https://www.srx.com.sg/heat-map

  • This heatmap uses colours appropriately so that the house prices of each district can be identified intuitively (Red means expensive, blue means cheap, orange means mid-range)
  • The use of filters allows users to find out more about the price distribution of each house type easily.
  • When a user mouses over a district on the heatmap, the corresponding district on the legend is highlighted. This improves usability as users do not have to match district numbers manually.

An interactive visualization of house prices along MRT stations

1718T1G1 BackgroundSurvey3.png

Source: https://www.srx.com.sg/mrt-home-prices/property-listings-near-east-west-line

  • This visualization makes use of unique ways to illustrate the relation between nearby facilities and house prices.
  • Separating the various MRT lines using filters at the top prevents too much information from being shown on one page.


PROPOSED STORYBOARD
Proposed Layout How Analyst Can Conduct Analysis
HSH Home.png
  1. Introduce analysts to the topic of HDB Resale Price and the objectives of the visualization project
  2. Upon clicking "Find your Dream Home", analysts will then begin their process of exploration
HSH Insight1.png
  1. When a user enters our app, we will show them a brief history of HDB, followed by the problem that most young people face in understanding the HDB situation.
  2. The 2 insight screens will be offered as 2 clickable buttons so that users can tell at a glance what they want to look for.
  3. The dual axis bar-line chart will show the number of units sold against the HDB price index to show how the market has performed over the years.
  4. The scatter plot groups flats based on 3 main categories - Planning Area, HDB resale price, Life left on lease. Grouping by planning area shows how many years each HDB flat has left on its lease. The size of each circle indicates the resale price of the flat.
HSH Insight2.png
  1. In the next phase of the exploration, we
  2. The radar chart shows 5 different indicators that describe how an HDB flat is priced relative to several other factors. The closer the area is to the center of the chart, the lower the indicator values are. Upon mouse-over of each area, one can also retrieve the exact value of each indicator.
  3. Using the map of Singapore as a filter condition, analysts can select their choice of HDB flat to see whether it is near any bus stops, MRT stations or schools.
  4. By looking at both charts, the analyst will then be able to compare and establish possible linkages between the different indicators for HDB over the years. As such, these 2 charts are placed side by side to assist the analyst in their data exploration.
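The "is it near any bus stop, MRT or school" check on the map screen could be approximated as sketched below. This is a simplified stand-in and not our final method: it uses straight-line (haversine) distance on latitude and longitude instead of the travel distances a service such as the Google Maps Distance Matrix API would return. The function names, the object shape (`name`, `latitude`, `longitude`) and the sample points are all made up for illustration.

```javascript
// Mean Earth radius in metres, used for the haversine approximation.
const EARTH_RADIUS_M = 6371000;

function toRadians(degrees) {
  return (degrees * Math.PI) / 180;
}

// Straight-line (great-circle) distance in metres between two
// { latitude, longitude } points given in degrees.
function haversineMetres(a, b) {
  const dLat = toRadians(b.latitude - a.latitude);
  const dLon = toRadians(b.longitude - a.longitude);
  const h = Math.sin(dLat / 2) ** 2
    + Math.cos(toRadians(a.latitude)) * Math.cos(toRadians(b.latitude))
    * Math.sin(dLon / 2) ** 2;
  return 2 * EARTH_RADIUS_M * Math.asin(Math.sqrt(h));
}

// Amenities within radiusMetres of the flat, nearest first.
function amenitiesWithin(flat, amenities, radiusMetres) {
  return amenities
    .map((amenity) => ({ ...amenity, distance: haversineMetres(flat, amenity) }))
    .filter((amenity) => amenity.distance <= radiusMetres)
    .sort((p, q) => p.distance - q.distance);
}

// Made-up points for illustration only.
const flat = { name: 'Sample flat', latitude: 1.35, longitude: 103.85 };
const amenities = [
  { name: 'Stop A', latitude: 1.351, longitude: 103.85 },
  { name: 'Station B', latitude: 1.35, longitude: 103.86 },
  { name: 'School C', latitude: 1.38, longitude: 103.85 },
];

console.log(amenitiesWithin(flat, amenities, 500));
```

The radius is left as a parameter because a sensible walking distance may differ between bus stops, MRT stations and schools.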


ADDRESSING KEY TECHNICAL CHALLENGES

The following are some of the key technical challenges that we may face throughout the course of the project:

Key Technical Challenges How We Propose To Resolve
Unfamiliarity with Visualization Tools
  • Independent Learning on Visualization Tools
  • Peer Learning
Data Cleaning & Transformation
  • Work together to clean, transform and analyze the data
Unfamiliarity with Programming in JavaScript & D3 Libraries
  • Attend D3 Programming Workshop
  • Independent Learning on D3 Libraries & Technical Tools
  • Peer Learning
Unfamiliarity with Implementing Interactivity and Animation Tools/Techniques in Visualization App
  • Develop a Storyboard/Design Flow
  • Assign members to specialize on Interactivity/Animation Techniques


PROJECT TIMELINE

The following shows our project timeline for the completion of this project:

1718T1G1 Timeline.png


TOOLS/TECHNOLOGIES

The following are some of the tools/technologies that we will be utilizing during the project:

  • Excel
  • D3.js
  • Proj4.js
  • Google Maps Distance Matrix API
  • Google Search API
  • GitHub
  • Node.js
  • Angular.js


REFERENCES


OUR BRAINSTORMING SESSIONS

The following are some of the proposed storyboards that we designed during our brainstorming sessions:

HSH Homesketch.jpeg


HSH Insight 1 and 2.jpg


HSH insight1.jpg


Our idea was to provide charts which are able to visualise the trends of HDB over the years in Singapore. We decided to split into multiple charts to be able to showcase the information more clearly.


COMMENTS

Feel free to leave comments, suggestions and feedback to help us improve our project! (: