Difference between revisions of "HomeIntel"

From Visual Analytics for Business Intelligence
 
(38 intermediate revisions by 2 users not shown)
Line 29: Line 29:
 
<br /><br />
 
<br /><br />
  
== Problem and Motivation ==  
+
== Problem and Motivation ==
  
== Objectives ==  
+
Did you know that 82% of Singaporeans live in HDB flats? Everyone wants to own their dream house! Our group members are all at the age where friends talk about housing prices in the hope of securing a dream house soon. For young adults looking to start their own family, buying a new house is always a huge financial commitment. This makes resale flats a worthwhile option to consider.
 +
 
 +
Many people worry about not getting the best price for their dream house, and there is simply too much information scattered across the internet, which takes hours to read. To make things easier for resale flat buyers, our team aims to create visualizations that best represent the current resale flat market, allowing people to understand it better. We hope that, through these visualizations, potential HDB resale flat buyers can better analyse and compare the costs of owning a flat across different towns and prioritize what is important to them before deciding on their dream home.
 +
 
 +
== Objectives ==
 +
The objectives we hope to address through this project are as follows:
 +
 
 +
#Allow users to gain overall insights on resale price trends over the last 5 years (2015-2019)
 +
#Understand the key factors that affect resale prices (resale price, floor level, floor area, flat type)
 +
#Gain insight into which town has the highest investment potential for flats
  
 
== Selected Dataset ==
 
== Selected Dataset ==
Line 37: Line 46:
 
The dataset we will be using for our analysis and application is listed below:
 
The dataset we will be using for our analysis and application is listed below:
  
<center>
+
{| class="wikitable" style="margin-left: auto; margin-right: auto; width: 90%;"
{| class="wikitable" style="background-color:#FFFFFF;" width="90%"
 
 
|-
 
|-
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 30%;" | Dataset/Source
+
! style="background:#7dcfe8;"|Dataset/Source !! style="background:#7dcfe8;"|Data Attributes !! style="background:#7dcfe8;"|Rationale of Usage
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 20%" | Data Attributes
 
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 50%;" | Rationale Of Usage
 
 
|-
 
|-
| <center>Visitor Arrivals Statistics
+
| <center><br/>
(2007 January - 2018 September)<br/><br/><br/>
+
Resale Flat Price (Jan 2015 - Aug 2019)
(https://kto.visitkorea.or.kr/eng/tourismStatics/keyFacts/KoreaMonthlyStatistics.kto ) </center>
+
<br/>
 +
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=42ff9cfe-abe5-4b54-beda-c88f9bb438ee)
 +
 
 +
</center>
 +
||
 +
* Month
 +
* Town
 +
* Flat type
 +
* Block
 +
* Street Name
 +
* Storey range
 +
* Floor Area sqm
 +
* Flat Model
 +
* Lease Commence Date
 +
* Remaining Lease
 +
* Resale Price
  
 
||  
 
||  
 +
This dataset will be used to understand transacted resale prices and to see how the price trend changes over the years.
  
 +
|}
  
* Month
+
== Background Survey ==
  
* Year
+
Our team did background research on the topic before creating the storyboards. This exposed us to the different types of charts and graphs that we could use for the project and helped us consider how to improve on them. Some of the visualisations that we drew inspiration from are as follows:
  
* Country
+
{| class="wikitable" style="margin-left: auto; margin-right: auto; width: 90%;"
 +
|-
 +
! style="background:#7dcfe8;"|Reference of Other Interactive Visualization !! style="background:#7dcfe8;"|Learning Points
 +
|-
 +
|
 +
<center><br/> '''Title: Historical Average of HDB Resale Prices in Ang Mo Kio
 +
'''
 +
[[File:Relatedwork1.jpg|500px|center]]
 +
Source: https://getaflat.herokuapp.com/charts
 +
</center>
  
* Number of Visitor Arrivals
+
||
 +
* This is a time-series chart which allows users to see the trend in resale prices by room type over time
 +
* The axis titles are missing
 +
* We will use this chart as a reference and improve on it by adding filters such as the age of the flat and the storey range
  
  
 +
|-
 +
| <center><br/>  '''Title: Distribution of 4-Room HDB Resale Prices By Town '''
 +
[[File:Relatedwork2.jpg|500px|center]]
 +
Source: https://public.tableau.com/profile/priyadarsan.shankar#!/vizhome/Singapore4-ROOMHDBresalepricesvisualization/HDBdashboard
 +
</center>
 
||  
 
||  
<center>This dataset will be used to understand the number of visitors to South Korea from 2007 January to 2018 June. This will allow us to understand the inflow of visitors and see the trend (seasonal trend) of when the visitors come in.</center>
+
* The box plots can be used to identify towns with high median resale prices
 +
* Outliers can be identified easily and we can see if the data is skewed
 +
 
 
|-
 
|-
| <center>International Visitor Survey
+
| <center><br/> '''Title: Singapore Property Prices Heatmap '''
(2007 - 2018 )<br/><br/><br/>
+
[[File:3.jpg|500px|center]]
(https://kto.visitkorea.or.kr/kor/notice/data/statis/tstatus.kto)</center>
+
Source: https://www.greyloft.com/singapore/property-prices-heatmap
 +
</center>
 
||  
 
||  
 +
* The heatmap focuses on the price per square foot by district.
 +
* Users can select the date range
 +
* The tooltips provide more information on the trend, the number of transactions and towns within the district
 +
* The chart is easy to interpret and the legends are useful.
 +
* The Central area stands out well. The colour choice represents the high price per square foot effectively.
 +
* We will do a heatmap on the average resale price by town and provide more information in the tooltip
  
* City
+
|-
 +
|<center><br/>  '''Title: Heat Map of Median Resale Price'''
 +
[[File:Captureff.jpg|500px|center]]
 +
Source: https://public.tableau.com/profile/darrick8462#!/vizhome/HDBResaleAnalysisDDO/HDBRESALEDASHBOARD
 +
</center>
 +
||
  
* Tourist Attraction
+
* The heat map shows the relationship between flat type and the storey range
 +
* We can improve this further by creating a heatmap based on an additional field called Estate Type (mature and non-mature estates)
  
* Local/Foreigner
+
|}
  
* Date
+
== Brainstorming Sessions ==
 +
To come up with the storyboard, our group met several times to sketch possible visuals. The five charts shown above are visuals that we considered making for our project. First, the bar and line charts are used to view the relationships between different factors and price. Second, we feel that the boxplot is a good visual for achieving our goal of seeing the seasonal trend of resale prices over the past five years. Lastly, with the heatmap, our goal is to see the fluctuation of price per square metre in order to examine the profitability of investing in or purchasing a flat in a specific location.
  
* No. of Visitors
+
[[Image:Brainstorming Session.jpg|center|800px]]
 +
<br>
 +
After several brainstorming sessions, we came up with our final storyboard designs, which are listed below.
  
* Latitude
+
== Proposed Dashboard ==
 +
Our group proposes the following storyboards for our visual application:
  
* Longitude
+
{| class="wikitable" style="margin-left: auto; margin-right: auto; width: 90%;"
 +
|-
 +
! style="background:#7dcfe8;"|Dashboards !! style="background:#7dcfe8; width:50%;"|Rationale
 +
|-
 +
| <center><br/> '''Storyboard 1: Overview of Resale Price Trends''' <br/>
 +
[[File:SB1.jpg|300px|frameless|center]]
  
 +
</center>
 
||  
 
||  
<center>This dataset will be used to understand the most popular tourist attraction in the different major cities in Korea. We will also be able to gain insights on the Number of visitors to each attraction and also if they are local and foreigner visitors which will allow us to see the difference between locals and foreigners.</center>
+
As this is the first storyboard, we would like to give the reader the big picture of resale prices before narrowing down the scope. The aim of this dashboard is to show resale price trends and to allow the user to identify areas that are more expensive.
|-
 
| <center>Entry by nationality by age
 
(2007 - 2018)<br/><br/><br/>
 
([http://know.tour.go.kr/stat/tourStatSearchDis.do;jsessionid=18780F0E8CFAFBBCEB58B0A96098EDD3 Click to View Data])</center>
 
||
 
  
* City
+
Users can filter by year/month, region, town, flat type and age. When the user hovers over the map, the average floor area in sqm, the average price per sqm and the number of transactions will be displayed.  
 
 
* Age Range
 
 
 
* Date
 
 
 
||
 
<center>This data set will be used to understand the general demographic of international visitors coming to Korea from 2007 - 2018. We will be able to gain descriptive insights on the visitor demographics by Age Range.</center>
 
 
|-
 
|-
| <center>Entry by nationality by Sex
+
|
(2007 - 2018)<br/><br/><br/>
+
<center><br/> '''Storyboard 2: Key Factors Affecting Resale Prices'''
([http://know.tour.go.kr/stat/tourStatSearchDis.do;jsessionid=18780F0E8CFAFBBCEB58B0A96098EDD3 Click to View Data])</center>
+
[[File:SB2.jpg|300px|center]]
||
 
  
* City
+
</center>
 
+
||  
* Sex
+
The aim of this dashboard is to find out if variables such as flat type, storey range, and region can influence the resale price.  
 
+
The same filters used in the previous storyboard can be used for this storyboard.  
* Date
 
 
 
||
 
<center>This data set will be used to understand the general demographic of international visitors coming to Korea from 2007 - 2018. We will be able to gain descriptive insights on the visitor demographics by Gender.</center>
 
 
|-
 
|-
| <center>Entry by nationality by Purpose
+
|  
(2007 - 2018)<br/><br/><br/>
+
<center><br/>'''Storyboard 3: Comparison Between Mature and Non-Mature Estates'''
([http://know.tour.go.kr/stat/tourStatSearchDis.do;jsessionid=18780F0E8CFAFBBCEB58B0A96098EDD3 Click to View Data])</center>
+
[[File:SB3.jpg|300px|center]]
||
+
</center>
 
+
||  
* City
 
  
* Purpose of Visit
+
There are pros and cons to buying a new home in either a mature or a non-mature estate. Towns that have been around for more than 20 years are deemed mature.
  
* Date
+
The aim of this dashboard is to compare the two estate types and see whether there are any distinct differences in pricing and floor area. The user can also see the seasonal trend of people selling their homes.
  
||
 
<center>This data set will be used to understand the general demographic of international visitors coming to Korea from 2007 - 2018. We will be able to gain descriptive insights on the visitor demographics by Purpose of Visit.</center>
 
 
|}
 
|}
</center>
 
  
<br/>
+
== Tools and Technologies Used ==
  
==<div style="background:#143c67; padding:15px; font-weight: bold; line-height: 0.3em;letter-spacing:0.5em;font-size:20px"><font color=#fbfcfd face="Century Gothic"><center>BACKGROUND SURVEY</center></font></div>==
+
[[File:Tools.png|center]]
<br/>
 
  
Before we embarked on this project, we did some basic background research on this topic to see if there were any visualizations or dashboards we could drive inspirations from or make it better. Below are a few visuals we found:
+
== Challenges ==
  
<center>
+
{| class="wikitable" style="margin-left: auto; margin-right: auto; width: 90%;"
{| class="wikitable" style="background-color:#FFFFFF;" width="90%"
 
 
|-
 
|-
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 45%;" | Reference of Other Interactive Visualization
+
! style="background:#7dcfe8;"|Challenges !! style="background:#7dcfe8;"|Description !! style="background:#7dcfe8;"|Mitigation Plan
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 55%" | Learning Point
 
 
|-
 
|-
| <center>
+
| <center><br/>  
'''Title''': Monthly Number of Individual Travelling Visitors (2016)
+
Unfamiliarity with visualization technologies such as Tableau, R, RShiny, etc.
[[File:Example1.png|300px|frameless|center]]
 
'''Source''':https://www.data.go.kr/visual/content/577
 
</center>
 
  
== Background Survey ==
 
 
<center>
 
{| class="wikitable" style="background-color:#FFFFFF;" width="90%"
 
|-
 
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 45%;" | Visual Considerations
 
! style="font-weight: bold;background: #141414;color:#fbfcfd;width: 55%" | Insights / Comments
 
|-
 
| <center>
 
'''Title''': Sunburst Diagram
 
[[File:Sunburst.png|250px|frameless|center]]
 
'''Source''':https://bl.ocks.org/mbostock/4348373
 
 
</center>
 
</center>
 
+
||
 +
We are new to R and RShiny and are not sure of their capabilities.
 
||  
 
||  
 +
* Independent Hands-on Practice via online resources such as Datacamp
 +
* Ask teammates for help
  
*'''Pros:'''
+
|-
** Aims to show various sub-components of a particular category
+
| <center><br/>
** Can drill down to multiple divisions to observe the distribution by percentages
+
Data visualization
** May be useful to analyze tourism receipts by components and country
 
  
*'''Cons:'''
 
** Difficult to break down the huge number of markets
 
** Does not provide a comprehensive time-series comparison
 
 
|-
 
| <center>
 
'''Title''': Treemap
 
[[File:Treemap.png|250px|frameless|center]]
 
'''Source''':https://www.theinformationlab.co.uk/2015/02/10/show-treemaps/
 
 
</center>
 
</center>
||  
+
||
  
*'''Pros''':
+
It is challenging to determine what type of chart to use, and we also have to consider whether the user can gain useful insights from the chart and interpret it easily.
** Effective visualisation to organise multivariate data by hierarchy
 
** We can effectively see the purpose of visit for the top 10 visiting countries to Korea.
 
 
  
*'''Cons''':
 
** It would be hard to compare between years and months for different countries.
 
** The hierarchy will only be 2 levels so the interaction would not be as much.
 
 
|-
 
| <center>
 
'''Title''': Chord Diagram
 
[[File:Chord.png|250px|frameless|center]]
 
'''Source''':https://beta.observablehq.com/@mbostock/d3-chord-diagram
 
</center>
 
 
||  
 
||  
 +
* Evaluate potential designs with teammates and provide constructive feedback
 +
* Look at more charts online for inspiration and see how we can make it better
 +
* Watch the courses on Datacamp to familiarise ourselves with R and the different packages like Plotly.
  
*'''Pros''':
 
** Effective visualisation to see the influx of Visitors from and to Korea.
 
** We will be able to easily spot the country with the most travelers to Korea.
 
 
  
*'''Cons''':
 
** This chart will make it harder to spot trends in the visiting pattern.
 
** We will not be able to see every single country as the size of the chord diagram is limited.
 
  
|-
 
 
|}
 
|}
</center>
 
  
== Brainstorming Sessions ==  
+
== Timeline==
 
+
[[Image:Gantt Chart.png|center| 800px]]
== Proposed Storyboard ==
 
 
 
== Tools and Technologies Used ==
 
 
 
== Challenges ==
 
 
 
== Timeline==
 
  
 
== Comments ==
 
== Comments ==
 +
{| class="wikitable" style="margin-left: auto; margin-right: auto; width: 90%;"
 +
|-
 +
! style="background:#7dcfe8;"|Name !! style="background:#7dcfe8;"|Comments
 +
|-
 +
|
 +
Your Name
 +
||
 +
* Comment
 +
|-
 +
|
 +
Your Name
 +
||
 +
* Comment
 +
|}

Latest revision as of 13:32, 13 October 2019


Problem and Motivation

Did you know that 82% of Singaporeans live in HDB flats? Everyone wants to own their dream house! Our group members are all at the age where friends talk about housing prices in the hope of securing a dream house soon. For young adults looking to start their own family, buying a new house is always a huge financial commitment. This makes resale flats a worthwhile option to consider.

Many people worry about not getting the best price for their dream house, and there is simply too much information scattered across the internet, which takes hours to read. To make things easier for resale flat buyers, our team aims to create visualizations that best represent the current resale flat market, allowing people to understand it better. We hope that, through these visualizations, potential HDB resale flat buyers can better analyse and compare the costs of owning a flat across different towns and prioritize what is important to them before deciding on their dream home.

Objectives

The objectives we hope to address through this project are as follows:

  1. Allow users to gain overall insights on resale price trends over the last 5 years (2015-2019)
  2. Understand the key factors that affect resale prices (resale price, floor level, floor area, flat type)
  3. Gain insight into which town has the highest investment potential for flats

Selected Dataset

The dataset we will be using for our analysis and application is listed below:

Dataset/Source Data Attributes Rationale of Usage

Resale Flat Price (Jan 2015 - Aug 2019)
(https://data.gov.sg/dataset/resale-flat-prices?resource_id=42ff9cfe-abe5-4b54-beda-c88f9bb438ee)

  • Month
  • Town
  • Flat type
  • Block
  • Street Name
  • Storey range
  • Floor Area sqm
  • Flat Model
  • Lease Commence Date
  • Remaining Lease
  • Resale Price

This dataset will be used to understand transacted resale prices and to see how the price trend changes over the years.

Background Survey

Our team did background research on the topic before creating the storyboards. This exposed us to the different types of charts and graphs that we could use for the project and helped us consider how to improve on them. Some of the visualisations that we drew inspiration from are as follows:

Reference of Other Interactive Visualization Learning Points

Title: Historical Average of HDB Resale Prices in Ang Mo Kio

Relatedwork1.jpg

Source: https://getaflat.herokuapp.com/charts

  • This is a time-series chart which allows users to see the trend in resale prices by room type over time
  • The axis titles are missing
  • We will use this chart as a reference and improve on it by adding filters such as the age of the flat and the storey range



Title: Distribution of 4-Room HDB Resale Prices By Town
Relatedwork2.jpg

Source: https://public.tableau.com/profile/priyadarsan.shankar#!/vizhome/Singapore4-ROOMHDBresalepricesvisualization/HDBdashboard

  • The box plots can be used to identify towns with high median resale prices
  • Outliers can be identified easily and we can see if the data is skewed

Title: Singapore Property Prices Heatmap
3.jpg

Source: https://www.greyloft.com/singapore/property-prices-heatmap

  • The heatmap focuses on the price per square foot by district.
  • Users can select the date range
  • The tooltips provide more information on the trend, the number of transactions and towns within the district
  • The chart is easy to interpret and the legends are useful.
  • The Central area stands out well. The colour choice represents the high price per square foot effectively.
  • We will do a heatmap on the average resale price by town and provide more information in the tooltip

Title: Heat Map of Median Resale Price
Captureff.jpg

Source: https://public.tableau.com/profile/darrick8462#!/vizhome/HDBResaleAnalysisDDO/HDBRESALEDASHBOARD

  • The heat map shows the relationship between flat type and the storey range
  • We can improve this further by creating a heatmap based on an additional field called Estate Type (mature and non-mature estates)

Brainstorming Sessions

To come up with the storyboard, our group met several times to sketch possible visuals. The five charts shown above are visuals that we considered making for our project. First, the bar and line charts are used to view the relationships between different factors and price. Second, we feel that the boxplot is a good visual for achieving our goal of seeing the seasonal trend of resale prices over the past five years. Lastly, with the heatmap, our goal is to see the fluctuation of price per square metre in order to examine the profitability of investing in or purchasing a flat in a specific location.

Brainstorming Session.jpg


After several brainstorming sessions, we came up with our final storyboard designs, which are listed below.

Proposed Dashboard

Our group proposes the following storyboards for our visual application:

Dashboards Rationale

Storyboard 1: Overview of Resale Price Trends
SB1.jpg

As this is the first storyboard, we would like to give the reader the big picture of resale prices before narrowing down the scope. The aim of this dashboard is to show resale price trends and to allow the user to identify areas that are more expensive.

Users can filter by year/month, region, town, flat type and age. When the user hovers over the map, the average floor area in sqm, the average price per sqm and the number of transactions will be displayed.
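The tooltip values described above can be sketched as a simple per-town aggregation. This is only an illustration in Python using the standard library, not the team's implementation (the proposal names R, RShiny and Tableau as its tools). The function name `tooltip_summary`, the field names and the sample rows are hypothetical, and "average price per sqm" is assumed here to mean the mean of each transaction's price divided by its floor area.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows using a subset of the dataset's attributes.
# The values are made up purely for illustration.
SAMPLE_ROWS = [
    {"town": "ANG MO KIO", "floor_area_sqm": 60.0, "resale_price": 300000.0},
    {"town": "ANG MO KIO", "floor_area_sqm": 100.0, "resale_price": 450000.0},
    {"town": "BEDOK", "floor_area_sqm": 80.0, "resale_price": 320000.0},
]


def tooltip_summary(rows):
    """Return, per town, the three figures the map tooltip would display."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["town"]].append(row)

    summary = {}
    for town, items in grouped.items():
        summary[town] = {
            "avg_floor_area_sqm": mean(r["floor_area_sqm"] for r in items),
            # Assumption: average the per-transaction price per sqm,
            # rather than dividing total price by total area.
            "avg_price_per_sqm": mean(
                r["resale_price"] / r["floor_area_sqm"] for r in items
            ),
            "transactions": len(items),
        }
    return summary


if __name__ == "__main__":
    for town, stats in tooltip_summary(SAMPLE_ROWS).items():
        print(town, stats)
```

Filters such as year/month or flat type would be applied to the rows before this aggregation, so the tooltip always reflects the current selection.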


Storyboard 2: Key Factors Affecting Resale Prices
SB2.jpg

The aim of this dashboard is to find out if variables such as flat type, storey range, and region can influence the resale price. The same filters used in the previous storyboard can be used for this storyboard.


Storyboard 3: Comparison Between Mature and Non-Mature Estates
SB3.jpg

There are pros and cons to buying a new home in either a mature or a non-mature estate. Towns that have been around for more than 20 years are deemed mature.

The aim of this dashboard is to compare the two estate types and see whether there are any distinct differences in pricing and floor area. The user can also see the seasonal trend of people selling their homes.
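A minimal sketch of this comparison, again in Python for illustration only, is given here. It assumes a hypothetical lookup table `ESTATE_TYPE` that labels each town as mature or non-mature (the town names are placeholders, not real classifications) and assumes the Month attribute is a "YYYY-MM" string. Sales are counted per calendar month as one simple way to look for a seasonal pattern.

```python
from collections import Counter, defaultdict
from statistics import mean

# Hypothetical mapping from town to estate type; placeholder town names.
ESTATE_TYPE = {"TOWN A": "mature", "TOWN B": "non-mature"}

# Made-up rows; "month" is assumed to be a "YYYY-MM" string.
SAMPLE_ROWS = [
    {"month": "2018-03", "town": "TOWN A", "floor_area_sqm": 90.0, "resale_price": 500000.0},
    {"month": "2019-03", "town": "TOWN A", "floor_area_sqm": 70.0, "resale_price": 400000.0},
    {"month": "2019-07", "town": "TOWN B", "floor_area_sqm": 100.0, "resale_price": 380000.0},
]


def compare_estates(rows, estate_type):
    """Summarise price, floor area and monthly sales for each estate type."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[estate_type[row["town"]]].append(row)

    result = {}
    for kind, items in grouped.items():
        result[kind] = {
            "avg_resale_price": mean(r["resale_price"] for r in items),
            "avg_floor_area_sqm": mean(r["floor_area_sqm"] for r in items),
            # Count sales per calendar month ("01" to "12") across all years.
            "sales_by_month": Counter(r["month"][5:7] for r in items),
        }
    return result


if __name__ == "__main__":
    for kind, stats in compare_estates(SAMPLE_ROWS, ESTATE_TYPE).items():
        print(kind, stats)
```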

Tools and Technologies Used

Tools.png

Challenges

Challenges Description Mitigation Plan

Unfamiliarity with visualization technologies such as Tableau, R, RShiny, etc.

We are new to R and RShiny and are not sure of their capabilities.

  • Independent Hands-on Practice via online resources such as Datacamp
  • Ask teammates for help

Data visualization

It is challenging to determine what type of chart to use, and we also have to consider whether the user can gain useful insights from the chart and interpret it easily.

  • Evaluate potential designs with teammates and provide constructive feedback
  • Look at more charts online for inspiration and see how we can make it better
  • Watch the courses on Datacamp to familiarise ourselves with R and the different packages like Plotly.


Timeline

Gantt Chart.png

Comments

Name Comments

Your Name

  • Comment

Your Name

  • Comment