Difference between revisions of "1718t1is428T2"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
 
(47 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
+
[[File:1718T1G1 Logo.png|center|250px]]
  
 
{| style="background-color:white; color:white padding: 5px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" |
 
{| style="background-color:white; color:white padding: 5px 0 0 0;" width="100%" height=50px cellspacing="0" cellpadding="0" valign="top" border="0" |
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Home Sweet Home | <b>Home</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #50B4B2; border-top:1px solid #50B4B2; font-family: helvetica"> [[Project_Groups | <b>Home</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Home Sweet Home: Proposal | <b>Proposal</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: 100; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #50B4B2; border-top:1px solid #50B4B2; font-family: helvetica"> [[1718t1is428T2 | <b>Proposal</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Home Sweet Home: Poster | <b>Poster</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: 100; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #50B4B2; border-top:1px solid #50B4B2; font-family: helvetica"> [[1718t1is428T2: Poster | <b>Poster</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Home Sweet Home: Application | <b>Application</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: 100; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #50B4B2; border-top:1px solid #50B4B2; font-family: helvetica"> [[1718t1is428T2: Application | <b>Application</b>]]
  
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: bold; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #3D9DD7; border-top:1px solid #3D9DD7; font-family:helvetica"> [[Home Sweet Home: Research Paper | <b>Research Paper</b>]]
+
| style="vertical-align:top;width:16%;" | <div style="padding: 3px; font-weight: 100; text-align:center; line-height: wrap_content; font-size:16px; border-bottom:1px solid #50B4B2; border-top:1px solid #50B4B2; font-family: helvetica"> [[1718t1is428T2: Research Paper | <b>Research Paper</b>]]
 
|}
 
|}
  
 
<p></p><br/>
 
<p></p><br/>
<div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROBLEM & MOTIVATION</font></div>
+
<div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROBLEM & MOTIVATION</font></div>
The threat of terrorism is growing everyday and many countries, including Singapore, have taken steps to mitigate the risks of terrorism. In the National Day Rally 2016, PM Mr. Lee Hsien Loong mentioned that diplomats and security forces have been doing their job well but despite their efforts, it does not mean that terrorist attacks will not happen in Singapore. The recent attack that attempted to fire a rocket to hit Marina Bay Sands Area from Batam was successfully intervened but this signals to the country that the terrorism threat should not be taken lightly. In response to the growing terrorist threat, the SGSecure Movement was launched to prepare the public in the event of an attack. In recent years, there is a rise in research on terrorist organizations and the activities they have performed, regardless of scale, over the years. However, more still needs to be done to analyze past terrorist activities and gain insights from it easily so that all countries could better prepare for a worst case scenario.
+
With housing prices being a hot topic that most undergraduates are talking about as we approach that final period called graduation, don't you wish that there was a way for us to get a better picture of the housing market so that we can make our big decision on where we are going to live from now on? Thus, we sought out to look for a dataset which would be able to provide us with the required information we needed to come up with useful visualizations. Unfortunately, most datasets are incomplete or not useful in providing us with any insight unless a good amount of data cleaning and wrangling is done. For example, transaction data which only gives you the block number and road name with no postal code. How is that useful? Therefore, one of our primary goals is to come up with a visualization which would provide users with useful information about the resale market in Singapore and help people gain access to a better data set than the ones currently out there.
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">OBJECTIVES</font></div>
+
In terms of what visualizations we would be going for, we felt that it would be interesting to look into the historical flat data so that we can see which flats in Singapore would be the most value for money so that we can actually get a home which is worth its investment. We also felt that it would be fun to explore trends in the resale flat prices and see what factors really affect the prices of HDBs and see how much of a premium people attach to amenities such as proximity to public transport, schools and etc...  
<p>In this project, we are interested to create a visualization that helps analysts perform the following:</p>
 
# Identify terrorist organizations active in each country and the spread/types of activities they conducted to threaten the safety of the country, over different time periods
 
# Identify possible linkages between the number of terrorist activities occurring in a country and its development status
 
# Get a clearer understanding of each terrorist organization and the type of attacks they have conducted in a country/globally, over different time periods
 
# Compare different terrorist organizations and identify similarities and differences in their attack patterns, over different time periods
 
<p>By conducting the analysis, it allows respective policy makers, government or intelligence agencies to better understand terrorist organizations and their spread internationally so that they could devise appropriate policies/measures to prevent potential attacks within their own country, regionally or globally in future.</p>
 
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">SELECTED DATASET</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">OBJECTIVES</font></div>
In our analysis, we will only be using data within the year of 2000 - 2015. The rationale for the range of data selected is as follows:
+
<p>In this project, we are interested to create a visualisation that helps users perform the following:</p>
* It does not provide strong relevance/insights for analysts to look at all the data in the past 45 years and attempt to predict activities of these terrorist organizations now/in the future. Due to the rapid changes in the globalized world, a range of 15 years will be adequate to help analysts spot trends/patterns of terrorist activities.
+
# View the trend in the resale prices over time with respect to major events that happened in the year (Example: 1993 Change in Pricing Model,1997 Recession
* Due to limitations of the data collected about each country's development status, the dataset only provides information from year 2000 - 2015.
+
# Identify which areas are more expensive and possible reasons for the high value (Proximity to public transport, Schools, Shopping Malls, Park)
* Due to technical limitations, loading past 45 years of data (156,773 records) into the application may cause it to become non-responsive and users may not be satisfied with the response rate. A range of 15 years (87,010 records) will yield just enough data for an insightful analysis and yet, does not sacrifice on the application's response rate. <br/>
+
# To find out if getting a specific HDB is a good investment based on the number of year left on the lease and which locations may potentially be more profitable based on the age of the HDB.
The dataset for analysis will be retrieved from multiple databases, as elaborated below:<br/>
+
 
 +
<p>By using our visualisation, we will be able to give users a better idea of the pricing situation of the resale HDBs so that people can make better decisions in the HDB which they want to choose to call their home. Such as when is the best time to buy as HDB; where are the most profitable / cheapest locations; whether a HDB is expensive  </p>
 +
 
 +
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">SELECTED DATASET</font></div>
 +
<p>In our analysis, we will only be using data within the year of 1990 - 2017. The rationale for the range of data selected is as follows:</p>
 +
 
 +
<p>The dataset for analysis will be retrieved from multiple databases, as elaborated below:</p>
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
|-
 
|-
! style="font-weight: bold;background: #536a87;color:#fbfcfd;width: 30%;" | Dataset/Source
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;width: 30%;" | Dataset/Source
! style="font-weight: bold;background: #536a87;color:#fbfcfd;width: 30%" | Data Attributes
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;width: 30%" | Data Attributes
! style="font-weight: bold;background: #536a87;color:#fbfcfd;" | Rationale Of Usage
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;" | Rationale Of Usage
|-
 
| <center>Global Terrorism Database (GTD)<br/>
 
(https://www.start.umd.edu/gtd/using-gtd/) </center>
 
||
 
* Geographical spread of terrorist attacks
 
* Type of terrorist attacks
 
* Target of terrorist attacks
 
* Perpetrators of terrorist attacks
 
* Extent of damage in terrorist attacks
 
||
 
<center>This dataset will be used as a main source of information in our analysis to understand the spread of terrorist activities in each country/globally.</center>
 
 
|-
 
|-
| <center>Big Allied and Dangerous Database (BAAD)<br/>
+
| <center>Resales flat prices from Mar 2012 onwards<br/>
(https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl%3A1902.1/16062)</center>
+
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=83b2fc37-ce8c-4df4-968b-370fd818138b ) </center>
 +
<center>Resales flat prices from 2002 - Feb 2012 <br/>
 +
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=8c00bf08-9124-479e-aeca-7cc411d884c4 ) </center>
 +
<center>Resales flat prices from 1990 - 1999<br/>
 +
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=adbbddd3-30e2-445f-a123-29bee150a6fe ) </center>
 
||  
 
||  
* Information about each terrorist organization (e.g. ideology, location, state sponsored, number of allies and rivals)  
+
* Month
 +
* Town
 +
* Flat Type
 +
* Block
 +
* Street Name
 +
* Storey Range
 +
* Floor Area (Sqm)
 +
* Flat Model
 +
* Lease Commence Date
 +
* Resale Price (S$)  
 
||  
 
||  
<center>This dataset aims to complement the main dataset by providing detailed information about each terrorist organization, in addition to the attacks that it carried out globally.</center>
+
<center>This dataset will be used as a main source of information in our analysis to understand the number of HDB around Singapore from 1990 to 1999, 2002 to Feb 2012 and Mar 2012 onwards respectively.</center>
 
|-
 
|-
| <center>World Development Indicators<br/>
+
| <center>Bus Stop Names and Locations<br/>
(Retrieved from World Bank)</center>
+
(https://www.mytransport.sg/content/mytransport/home/dataMall.html#)</center>
 
||  
 
||  
* Annual GDP Growth (%)
+
* Bus Stop Number
* Poverty Ratio (%)
+
* Bus Stop Roof Number
* Unemployment Rate (%)
+
* Bus Stop Name
* Adult Literacy Rate (%)
+
* X
 +
* Y
 +
* Latitude
 +
* Longitude
 
||  
 
||  
<center>This dataset aims to help analysts identify possible linkages <br/>between the number of terrorist activities occurring in the country and the development state of the selected country.</center>
+
<center>This dataset aims to complement the main dataset by providing detailed information about the latitude and longitude of the bus stops located around HDB. We use a javascript script to convert all the X and Y coordinates to EPSG:4326 latitude and longitude coordinates.</center>
 
|-
 
|-
| <center>UIS Data Center<br/>
+
| <center>Mrt Stations Names and Locations <br/>
(Retrieved from UN Data - UNESCO Institute for Statistics)</center>
+
(https://www.mytransport.sg/content/mytransport/home/dataMall.html#)</center>
 
||  
 
||  
* Youth Literacy Rate (%)
+
* MRT Station Number
 +
* MRT Station Name
 +
* X
 +
* Y
 +
* Latitude
 +
* Longitude
 
||  
 
||  
<center>This dataset aims to help analysts identify possible linkages <br/>between the number of terrorist activities occurring in the country and the development state of the selected country.</center>
+
<center>This dataset aims to complement the main dataset by providing detailed information about the latitude and longitude of the MRT stations located around HDB. We use a javascript script to convert all the X and Y coordinates to EPSG:4326 latitude and longitude coordinates</center>
 
|-
 
|-
| <center>World Telecommunications/ICT Indicators Database<br/>
+
| <center>Schools (Primary, Secondary, Junior College) Names and Locations <br/>
(Retrieved from UN Data - International Telecommunications Union)</center>
+
(https://data.gov.sg/dataset/school-directory-and-information)</center>
 
||  
 
||  
* Individuals Using Internet (%)
+
* School Name
 +
* Address
 +
* Postal Code
 +
* Planning Area
 
||  
 
||  
<center>This dataset aims to help analysts identify possible linkages <br/>between the number of terrorist activities occurring in the country and the development state of the selected country.</center>
+
<center>This dataset aims to complement the main dataset by providing detailed information about the address of schools located around HDB.</center>
|-
 
 
|}
 
|}
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">BACKGROUND SURVEY OF RELATED WORKS</font></div>
+
 
Many visual and data analysts have made use of data collected from the Global Terrorism Database to visualize and understand the extent of terrorist attacks around the world. Some of their works include the following:
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">BACKGROUND SURVEY OF RELATED WORKS</font></div>
 +
There are many charts and visualisations available which illustrates the various trends of house prices and index. We have selected a few of these to study and learn before we begin developing our own visualizations.
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
|-
 
|-
! style="font-weight: bold;background: #536a87;color:#fbfcfd;width: 50%;" | Related Works
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;width: 50%;" | Related Works
! style="font-weight: bold;background: #536a87;color:#fbfcfd;" | What We Can Learn
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;" | What We Can Learn
 
|-
 
|-
 
|  
 
|  
<p><center>'''An Analysis of Death Tolls & Terrorist Incidents''' </center></p>
+
<p><center>'''An Analysis of the trend and correlation between resale prices and flat production''' </center></p>
[[File:Related Works - Infographic Group7.jpg|400px|center]]
+
[[File:1718T1G1 BackgroundSurvey1.png|400px|center]]
<p><center>'''Source''': http://www.dailymail.co.uk/news/article-3322308/Number-people-killed-terrorists-worldwide-soars-80-just-year.html</center></p>
+
<p><center>'''Source''': http://www.teoalida.com/singapore/hdbprices/</center></p>
 
  ||  
 
  ||  
* The infographic provides annotations to help the users understand major terrorist attacks that have happened overtime.
+
* The use of 2 different chart types with a secondary axis is effective in illustrating the correlation between resale prices and flat production.  
* Colour scheme used by the infographic is clean and neat.  
+
* The colours used are striking and contrast well with each other.
* Use of colors on the same color scale ensures that it will not confuse the users. Also, the need to reference to a legend repeatedly will also be reduced.
+
* There are dips  in both variables which are not explained in the infographic itself (E.g. 1997 Asian crisis, 2003 SARs outbreak). This events could be incorporated into the charts to make it more informative.
 
|-
 
|-
| <p><center> '''An Animated Time-Lapse Visualization of Terror Attacks On The World Map''' </center></p>
+
| <p><center> '''An interactive heatmap of Singapore’s house prices in various districts''' </center></p>
[[File:Related Works - Time Lapse Group7.gif|400px|center]]
+
[[File:1718T1G1 BackgroundSurvey2.png|400px|center]]
<p><center> '''Source''': https://www.youtube.com/watch?v=cHbYk2l9w-E </center> </p>
+
<p><center> '''Source''': https://www.srx.com.sg/heat-map </center> </p>
 
||
 
||
* The time-lapse animation provides a clear overview to users as it shows the spread of terrorist activities over the years.
+
* This heatmap uses colours appropriately so that the house prices of each district can be identified intuitively (Red means expensive, blue means cheap, orange means mid-range)
 +
* The use of filters allows user to find out more about the price distribution of each house type easily.
 +
* When user mouseover a district on the heatmap, the corresponding district on the legend is highlighted. This improves usability as users do not have to match district numbers manually.  
 
|-
 
|-
| <p><center> '''An Interactive Visualization to Show Trends And Events Shaping History of Terrorism''' </center></p>
+
| <p><center> '''An interactive visualization of house prices along MRT stations''' </center></p>
[[File:Related Works - Interactive Visualization Group7.png|400px|center]]
+
[[File:1718T1G1 BackgroundSurvey3.png|400px|center]]
<p><center> '''Source''': http://parano.github.io/Global-Terrorism-Visualization/ </center></p>
+
<p><center> '''Source''': https://www.srx.com.sg/mrt-home-prices/property-listings-near-east-west-line </center></p>
 
||  
 
||  
* The time-series chart allow users to make use of a scrollbar to look at a time range (of 12 months). Use of a scrollbar act as a filter to look at the selected time range and this prevents users from getting overwhelmed by the data.
+
* This visualization makes use of unique ways to illustrate the relation between nearby facilities and house prices.
* The visualization consists of 2 charts linked together and this provides a clear representation of the spread of terrorist activities overtime. Firstly, the bar chart shows the number of fatalities in each month across the years. Secondly, the world map shows the spread of terrorist activities as user selects a time series. When the user drags across the scrollbar on the bar chart, the activities in the world map changes based on the selected time series. Such linkage between charts are useful and provides a good interactive tool to help users analyze spread of terrorist activities overtime.
+
* The separating of the various MRT lines using filters at the top prevent too much information from being shown in one page
* Use of tooltips allow users to know more information about the number of fatalities as they interact with the map.
 
 
|}
 
|}
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROPOSED STORYBOARD</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROPOSED STORYBOARD</font></div>
Our group has proposed the following storyboard to assist analysts in the use of our visual application:
 
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
|-
 
|-
! style="font-weight: bold;background: #536a87;color:#fbfcfd;width: 50%;" | Proposed Layout
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;width: 50%;" | Proposed Layout
! style="font-weight: bold;background: #536a87;color:#fbfcfd;" | How Analyst Can Conduct Analysis
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;" | How Analyst Can Conduct Analysis
 
|-
 
|-
|  
+
| [[File:HSH Home.png|600px|center]]
<p><center>'''Introduction''' </center></p><br/>
+
||  
[[File:Storyboard Intro (Page 1) Group7.png|700px|center]]
+
# Introduce analysts to the topic of HDB Resale Price and the objectives of the visualization project
||  
+
# Upon clicking "Find your Dream Home", analysts will then begin their process of exploration
# Introduce analysts to the topic of terrorism and the objectives of the visualization project
 
# Select an option of whether they wish to analyze terrorist activities based on each country or each terrorist organization
 
 
|-
 
|-
|  
+
| [[File:HSH Insight1.png|600px|center]]
<p><center>'''Analyze Terrorist Activities By Country''' </center></p><br/>
+
||  
[[File:Storyboard Country Specific (Page 2) Group7.png|700px|center]]
+
# When a user enters our app, we will show them a brief history of HDB followed by the problem that most young people are facing with regards to understanding the HDB situation.
||  
+
# The 2 screens of different insights we are trying to show will be displayed as 2 clickable buttons so that it is easy for a user to know what exactly he wants to look for at a glance.
# Inspired by one of the related works mentioned previously, the filter in the page will be based on a bar chart showing the count of terrorist attacks over the years.
+
# The dual axis bar-line chart will indicate the number of units sold vs the HDB price index to show the performing over the years.
# A scrollbar will be implemented on the bar chart to allow users to choose the time series they are interested to look at. At any point in time, users can analyse one year of data. As the time period changes, the data in all 3 charts will change dynamically.
+
# The scatter plot groups attacks based on 3 main categories - Planning Area, HDB resale price, Life left on lease. Firstly, by grouping based on planning area, it shows the the different HDB of how many years they left on lease. The size of the circle indicate the resale price of the HDB
# A choropleth map of the country will be displayed and the count of terrorist attacks conducted in each state will be colored accordingly based on the selected time period.
 
# A star chart (glyph) will also be displayed to show the development state of the country for the selected time period.
 
# A zoomable sunburst diagram will also be displayed to show the terrorist organizations active in the country, the type of attacks conducted by each terrorist organization and their target victims.
 
# By selecting to view more information about the terrorist organization, the user will be directed to the next page about the terrorist organizations.
 
|-
 
|
 
<p><center>'''Analyze Terrorist Activities By Terrorist Organizations''' </center></p><br/>
 
[[File:Storyboard Terrorist Org Specific (Page 3) Group7.png|700px|center]]
 
||
 
# If user enters the page from the country specific page, the top 3 active terrorist organization in the country will be shown. Otherwise, the top 3 active terrorist organization globally will be shown.
 
# Similar to the country specific page, a bar chart showing the count of attacks occurring globally will be shown. At any point in time, users can analyse one year of data. As the time period changes, the data in all charts will change dynamically.
 
# Other than the time period, users can also choose to add/remove the terrorist organizations they wish to compare against. If the country is selected as a filter, the top 3 terrorist organization in the country will be displayed.
 
# The choropleth world map will be colored based on the number of attacks conducted by the selected terrorist organizations.
 
# The data points on the choropleth world map will be colored by the different types of the terrorist organization who conducted the attack. The size of the data point will be determined by the number of deaths in the particular attack.
 
# More information about the terrorist organization will also be displayed. These information will come from the BAAD dataset, Google Search and the GTD dataset on the number of attacks conducted based on attack types.
 
 
|-
 
|-
 +
| [[File:HSH Insight2.png|600px|center]]
 +
||
 +
# In the next phase of the exploration, we
 +
# The radar chart shows 5 different governance indicators that defines how well a HDB is price with several other indicators. The closer the area is to the center of the chart, the less well the indicators is. Upon mouse-over of each area, one can also retrieve the exact values of each governance indicator.
 +
# Using the map of Singapore as a filter condition, analysts can selected their choice of HDB to see if its near any bus stop, MRT, schools.
 +
# By looking at both charts, the analyst will then be able to compare and establish possible linkages between the different indicators for HDB over the years. As such, these 2 charts are placed side by side to assist the analyst in their data exploration.
 
|}
 
|}
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">ADDRESSING KEY TECHNICAL CHALLENGES</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">ADDRESSING KEY TECHNICAL CHALLENGES</font></div>
 
The following are some of the key technical challenges that we may face throughout the course of the project:
 
The following are some of the key technical challenges that we may face throughout the course of the project:
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
{| class="wikitable" style="background-color:#FFFFFF;" width="100%"
 
|-
 
|-
! style="font-weight: bold;background: #536a87;color:#fbfcfd;width: 50%;" | Key Technical Challenges
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;width: 50%;" | Key Technical Challenges
! style="font-weight: bold;background: #536a87;color:#fbfcfd;" | How We Propose To Resolve
+
! style="font-weight: bold;background: #56C0BE;color:#fbfcfd;" | How We Propose To Resolve
 
|-
 
|-
 
| <center> Unfamiliarity of Visualization Tool Usage </center> ||  
 
| <center> Unfamiliarity of Visualization Tool Usage </center> ||  
Line 175: Line 177:
 
|}
 
|}
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROJECT TIMELINE</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">PROJECT TIMELINE</font></div>
 
<p>The following shows our project timeline for the completion of this project:</p>
 
<p>The following shows our project timeline for the completion of this project:</p>
[[File:Project Schedule Group7.png|1000px|center]]
+
<div style="width:100%">[[File:1718T1G1 Timeline.png|1000px|center]]</div>
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">TOOLS/TECHNOLOGIES</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">TOOLS/TECHNOLOGIES</font></div>
 
<p>The following are some of the tools/technologies that we will be utilizing during the project:</p>
 
<p>The following are some of the tools/technologies that we will be utilizing during the project:</p>
 +
* Excel
 
* D3.js
 
* D3.js
* Chart.js
+
* Proj4.js
* Google Charts
+
* Google Maps Distance Matrix API
 
* Google Search API
 
* Google Search API
 
* Github
 
* Github
* Netbeans
+
* Node.js
 +
* Angular.js
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">REFERENCES</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">REFERENCES</font></div>
* Marina Bay Attack Plot from Batam ‘Not to be Taken Lightly’ (http://www.straitstimes.com/singapore/rocket-attack-plot-not-to-be-taken-lightly)
+
* DrWealth’s infographic (https://www.drwealth.com/singapore-property-prices-along-mrt-lines/)
* National Consortium for the Study of Terrorism and Responses to Terrorism (START). (2016). Global Terrorism Database [Data file]. Retrieved from http://www.start.umd.edu/gtd
+
* Data Gov Database (https://data.gov.sg)
* UN Datasets (http://data.un.org/)
 
* Big Allied and Dangerous Database (https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl%3A1902.1/16062)
 
* World Bank Database (http://databank.worldbank.org/data/home.aspx)
 
 
* D3.js (https://d3js.org/)
 
* D3.js (https://d3js.org/)
 
* Examples By Mike Bostock (https://bost.ocks.org/mike/example/)
 
* Examples By Mike Bostock (https://bost.ocks.org/mike/example/)
 +
* Housing Development Board (http://www.hdb.gov.sg/cs/infoweb/residential/buying-a-flat/resale/resale-statistics)
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">OUR BRAINSTORMING SESSIONS</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">OUR BRAINSTORMING SESSIONS</font></div>
 
<p>The following are some of the proposed storyboard that we designed during our brainstorming sessions:</p>
 
<p>The following are some of the proposed storyboard that we designed during our brainstorming sessions:</p>
[[File:Proposed Storyboard v1 Group7.jpg|800px|center]]<br/>
+
[[File:HSH Homesketch.jpeg|800px|center]]<br/>
[[File:Brainstorm Proposed Storyboard v2.1 Group7.jpg|800px|center]]<br/>
+
[[File:HSH Insight 1 and 2.jpg|800px|center]]<br/>
[[File:Brainstorm Proposed Storyboard v2.2.jpg|800px|center]]
+
[[File:HSH insight1.jpg|800px|center]]<br/>
 +
 
 +
Our idea was to provide charts which are able to visualise the trends of HDB over the years in Singapore. We decided to split into multiple charts to be able to showcase the information more clearly.
  
<br/><div style="background: #364558; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">COMMENTS</font></div>
+
<br/><div style="background: #347473; padding: 15px; font-weight: bold; line-height: 0.3em; text-indent: 15px;letter-spacing:-0.08em;font-size:20px"><font color=#fbfcfd face="Century Gothic">COMMENTS</font></div>
<p>Feel free to comment to help us improve our project! (:</p>
+
<p>Feel free to comments, suggestions and feedbacks to help us improve our project! (:</p>

Latest revision as of 18:54, 6 November 2017

1718T1G1 Logo.png


PROBLEM & MOTIVATION

With housing prices being a hot topic that most undergraduates are talking about as we approach that final period called graduation, don't you wish that there was a way for us to get a better picture of the housing market so that we can make our big decision on where we are going to live from now on? Thus, we sought out to look for a dataset which would be able to provide us with the required information we needed to come up with useful visualizations. Unfortunately, most datasets are incomplete or not useful in providing us with any insight unless a good amount of data cleaning and wrangling is done. For example, transaction data which only gives you the block number and road name with no postal code. How is that useful? Therefore, one of our primary goals is to come up with a visualization which would provide users with useful information about the resale market in Singapore and help people gain access to a better data set than the ones currently out there.

In terms of what visualizations we would be going for, we felt that it would be interesting to look into the historical flat data so that we can see which flats in Singapore would be the most value for money so that we can actually get a home which is worth its investment. We also felt that it would be fun to explore trends in the resale flat prices and see what factors really affect the prices of HDBs and see how much of a premium people attach to amenities such as proximity to public transport, schools and etc...


OBJECTIVES

In this project, we are interested to create a visualisation that helps users perform the following:

  1. View the trend in the resale prices over time with respect to major events that happened in the year (Example: 1993 Change in Pricing Model,1997 Recession
  2. Identify which areas are more expensive and possible reasons for the high value (Proximity to public transport, Schools, Shopping Malls, Park)
  3. To find out if getting a specific HDB is a good investment based on the number of year left on the lease and which locations may potentially be more profitable based on the age of the HDB.

By using our visualisation, we will be able to give users a better idea of the pricing situation of the resale HDBs so that people can make better decisions in the HDB which they want to choose to call their home. Such as when is the best time to buy as HDB; where are the most profitable / cheapest locations; whether a HDB is expensive


SELECTED DATASET

In our analysis, we will only be using data within the year of 1990 - 2017. The rationale for the range of data selected is as follows:

The dataset for analysis will be retrieved from multiple databases, as elaborated below:

Dataset/Source Data Attributes Rationale Of Usage
Resales flat prices from Mar 2012 onwards
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=83b2fc37-ce8c-4df4-968b-370fd818138b )
Resales flat prices from 2002 - Feb 2012
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=8c00bf08-9124-479e-aeca-7cc411d884c4 )
Resales flat prices from 1990 - 1999
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=adbbddd3-30e2-445f-a123-29bee150a6fe )
  • Month
  • Town
  • Flat Type
  • Block
  • Street Name
  • Storey Range
  • Floor Area (Sqm)
  • Flat Model
  • Lease Commence Date
  • Resale Price (S$)
This dataset will be used as a main source of information in our analysis to understand the number of HDB around Singapore from 1990 to 1999, 2002 to Feb 2012 and Mar 2012 onwards respectively.
Bus Stop Names and Locations
(https://www.mytransport.sg/content/mytransport/home/dataMall.html#)
  • Bus Stop Number
  • Bus Stop Roof Number
  • Bus Stop Name
  • X
  • Y
  • Latitude
  • Longitude
This dataset aims to complement the main dataset by providing detailed information about the latitude and longitude of the bus stops located around HDB. We use a javascript script to convert all the X and Y coordinates to EPSG:4326 latitude and longitude coordinates.
Mrt Stations Names and Locations
(https://www.mytransport.sg/content/mytransport/home/dataMall.html#)
  • MRT Station Number
  • MRT Station Name
  • X
  • Y
  • Latitude
  • Longitude
This dataset aims to complement the main dataset by providing detailed information about the latitude and longitude of the MRT stations located around HDB. We use a javascript script to convert all the X and Y coordinates to EPSG:4326 latitude and longitude coordinates
Schools (Primary, Secondary, Junior College) Names and Locations
(https://data.gov.sg/dataset/school-directory-and-information)
  • School Name
  • Address
  • Postal Code
  • Planning Area
This dataset aims to complement the main dataset by providing detailed information about the address of schools located around HDB.



BACKGROUND SURVEY OF RELATED WORKS

There are many charts and visualisations available which illustrates the various trends of house prices and index. We have selected a few of these to study and learn before we begin developing our own visualizations.

Related Works What We Can Learn

An Analysis of the trend and correlation between resale prices and flat production

1718T1G1 BackgroundSurvey1.png

Source: http://www.teoalida.com/singapore/hdbprices/

  • The use of 2 different chart types with a secondary axis is effective in illustrating the correlation between resale prices and flat production.
  • The colours used are striking and contrast well with each other.
  • There are dips in both variables which are not explained in the infographic itself (E.g. 1997 Asian crisis, 2003 SARs outbreak). This events could be incorporated into the charts to make it more informative.

An interactive heatmap of Singapore’s house prices in various districts

1718T1G1 BackgroundSurvey2.png

Source: https://www.srx.com.sg/heat-map

  • This heatmap uses colours appropriately so that the house prices of each district can be identified intuitively (Red means expensive, blue means cheap, orange means mid-range)
  • The use of filters allows user to find out more about the price distribution of each house type easily.
  • When user mouseover a district on the heatmap, the corresponding district on the legend is highlighted. This improves usability as users do not have to match district numbers manually.

An interactive visualization of house prices along MRT stations

1718T1G1 BackgroundSurvey3.png

Source: https://www.srx.com.sg/mrt-home-prices/property-listings-near-east-west-line

  • This visualization makes use of unique ways to illustrate the relation between nearby facilities and house prices.
  • The separating of the various MRT lines using filters at the top prevent too much information from being shown in one page


PROPOSED STORYBOARD
Proposed Layout How Analyst Can Conduct Analysis
HSH Home.png
  1. Introduce analysts to the topic of HDB Resale Price and the objectives of the visualization project
  2. Upon clicking "Find your Dream Home", analysts will then begin their process of exploration
HSH Insight1.png
  1. When a user enters our app, we will show them a brief history of HDB followed by the problem that most young people are facing with regards to understanding the HDB situation.
  2. The 2 screens of different insights we are trying to show will be displayed as 2 clickable buttons so that it is easy for a user to know what exactly he wants to look for at a glance.
  3. The dual axis bar-line chart will indicate the number of units sold vs the HDB price index to show the performing over the years.
  4. The scatter plot groups attacks based on 3 main categories - Planning Area, HDB resale price, Life left on lease. Firstly, by grouping based on planning area, it shows the the different HDB of how many years they left on lease. The size of the circle indicate the resale price of the HDB
HSH Insight2.png
  1. In the next phase of the exploration, we
  2. The radar chart shows 5 different governance indicators that defines how well a HDB is price with several other indicators. The closer the area is to the center of the chart, the less well the indicators is. Upon mouse-over of each area, one can also retrieve the exact values of each governance indicator.
  3. Using the map of Singapore as a filter condition, analysts can selected their choice of HDB to see if its near any bus stop, MRT, schools.
  4. By looking at both charts, the analyst will then be able to compare and establish possible linkages between the different indicators for HDB over the years. As such, these 2 charts are placed side by side to assist the analyst in their data exploration.


ADDRESSING KEY TECHNICAL CHALLENGES

The following are some of the key technical challenges that we may face throughout the course of the project:

Key Technical Challenges How We Propose To Resolve
Unfamiliarity of Visualization Tool Usage
  • Independent Learning on Visualization Tools
  • Peer Learning
Data Cleaning & Transformation
  • Work together to clean, transform and analyze the data
Unfamiliarity in Programming using Javascript & D3 Libraries
  • Attend D3 Programming Workshop
  • Independent Learning on D3 Libraries & Technical Tools
  • Peer Learning
Unfamiliarity in Implementing Interactivity and Animation Tools/Techniques in Visualization App
  • Develop a Storyboard/Design Flow
  • Assign members to specialize on Interactivity/Animation Techniques


PROJECT TIMELINE

The following shows our project timeline for the completion of this project:

1718T1G1 Timeline.png


TOOLS/TECHNOLOGIES

The following are some of the tools/technologies that we will be utilizing during the project:

  • Excel
  • D3.js
  • Proj4.js
  • Google Maps Distance Matrix API
  • Google Search API
  • Github
  • Node.js
  • Angular.js


REFERENCES


OUR BRAINSTORMING SESSIONS

The following are some of the proposed storyboard that we designed during our brainstorming sessions:

HSH Homesketch.jpeg


HSH Insight 1 and 2.jpg


HSH insight1.jpg


Our idea was to provide charts which are able to visualise the trends of HDB over the years in Singapore. We decided to split into multiple charts to be able to showcase the information more clearly.


COMMENTS

Feel free to comments, suggestions and feedbacks to help us improve our project! (: