Difference between revisions of "IS428 2016-17 Term1 Assign1 Teo Hui Min"

From Visual Analytics for Business Intelligence

Revision as of 21:36, 28 August 2016

Abstract

This assignment focuses on understanding the private residential property market of Singapore in 2015 and the purchasing patterns of Singapore residents. I will identify possible reasons behind the trends seen in the visualisations, such as which regions are the ‘hottest’ among Singapore residents.

Problem & Motivation

In the years to come, will people still be able to afford housing? Through this assignment, I would like to find out how the changes in property prices throughout the year have affected the purchasing power of Singapore residents. I would also like to find out some possible factors that entice people to make a purchase. To understand the purchasers, the main variables that I will be looking at are the average unit price of a property and the number of units sold.

Approaches

Data set

Project: The ‘Project’ dataset was used to find out the number of units that were sold in every quarter of the year. It was also used to find out, for each property project, the total number of units and the cumulative numbers of sold, unsold, unlaunched, launched, completed and uncompleted units. With this data, it is possible to find out the vacancy and occupancy rates of a project, which are shown in one of the visualisations.

Transaction: The ‘Transaction’ dataset records properties for which a caveat was lodged after the option-to-purchase was exercised or the purchase agreement was signed. The dataset was used to find insights on property prices, the type of sale, the type of property, and the planning area and region in which the property was built.

Data Exploration

Initially, when looking at the datasets, I thought that ‘Transaction’ simply recorded the number of units sold. However, when I compared it with the ‘Project’ dataset, the figures did not tally. An example is the 26 Newton project. <image 9><10> I suspected that the data was not clean and went online to check the development project. However, the information provided online was the same as in the downloaded datasets. <image 11><12> I then realised that the two datasets record slightly different things:

Transaction dataset: Transactions for which caveats were lodged with the Singapore Land Authority (SLA)
Project dataset: Units launched and sold by developers
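
The comparison described above can also be done programmatically. The following is a minimal sketch, not the method actually used in the assignment: the column names `project_name` and `units_sold` are hypothetical, and the rows are made-up placeholders rather than real figures. It counts the caveat records per project and lists the projects whose count differs from the developer-reported units sold.

```python
from collections import Counter


def compare_counts(transactions, projects):
    """Return {project: (caveat_count, units_sold)} where the two figures differ.

    `transactions` holds one row per caveat lodged; `projects` holds one row
    per project with a developer-reported `units_sold` (assumed column name).
    """
    caveats = Counter(row["project_name"] for row in transactions)
    mismatches = {}
    for row in projects:
        name = row["project_name"]
        sold = int(row["units_sold"])
        if caveats.get(name, 0) != sold:
            mismatches[name] = (caveats.get(name, 0), sold)
    return mismatches


# Placeholder rows for illustration only; these are not real figures.
transactions = [
    {"project_name": "Project A"},
    {"project_name": "Project A"},
    {"project_name": "Project B"},
]
projects = [
    {"project_name": "Project A", "units_sold": "5"},
    {"project_name": "Project B", "units_sold": "1"},
]

print(compare_counts(transactions, projects))
```

A mismatch reported this way does not by itself mean the data is dirty, since the two datasets measure different things.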

Data Preparation

As the ‘Project’ datasets detail the number of units sold in each quarter, the first step was to combine all of them into one file. This file was then imported into Tableau for data exploration. Simple drag-and-drop operations in Tableau gave the total number of units sold and the number of units sold for a particular project. The penetration rate was then calculated, and the data was exported from Tableau to a .csv file to be visualised in QGIS. <image 1> In QGIS, a ‘join’ was performed between the .csv file and the planning area SHP file so that the result could be visualised as a choropleth map. <image 2>
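
The combining and penetration-rate steps were done by hand and in Tableau, but they can be sketched in code. The sketch below rests on assumptions that are not confirmed by the assignment: the column names `planning_area` and `units_sold` are hypothetical, the figures are placeholders, and the penetration rate is assumed to mean each planning area's share of all units sold.

```python
from collections import defaultdict


def combine_quarters(quarterly_tables):
    """Concatenate per-quarter row lists, tagging each row with its quarter."""
    combined = []
    for quarter, rows in quarterly_tables.items():
        for row in rows:
            combined.append({**row, "quarter": quarter})
    return combined


def penetration_rate(rows):
    """Share of all units sold that falls in each planning area.

    This definition is an assumption made for illustration.
    """
    sold_by_area = defaultdict(int)
    for row in rows:
        sold_by_area[row["planning_area"]] += int(row["units_sold"])
    total = sum(sold_by_area.values())
    if total == 0:
        return {area: 0.0 for area in sold_by_area}
    return {area: sold / total for area, sold in sold_by_area.items()}


# Placeholder rows for illustration only; these are not real figures.
quarterly = {
    "Q1": [{"planning_area": "Area X", "units_sold": "30"}],
    "Q2": [
        {"planning_area": "Area X", "units_sold": "10"},
        {"planning_area": "Area Y", "units_sold": "60"},
    ],
}

combined = combine_quarters(quarterly)
rates = penetration_rate(combined)
print(rates)
```

Keying the result by planning area name is what allows it to be written to a .csv file and joined to the planning area SHP file in QGIS.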

To find out the proportions of units that were sold, unsold and not launched, the data first had to be explored. Through this exploration, I mapped out how those measures can be calculated from the columns in the ‘Project’ dataset. <image 3> The measures could have been calculated in Tableau; however, I did the calculation (by summing the respective columns) in the dataset before importing it into Tableau. <image 4>
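
As an illustration of the column-summing step, here is a minimal sketch. The actual column mapping is shown only in the image above, so the column names `sold`, `unsold` and `not_launched` are hypothetical and the figures are placeholders. Each column is summed across all projects and then expressed as a share of the combined total.

```python
def column_proportions(rows, columns=("sold", "unsold", "not_launched")):
    """Sum each column over all rows and return its share of the grand total.

    The column names are assumptions; substitute the real dataset headers.
    """
    totals = {col: sum(int(row[col]) for row in rows) for col in columns}
    grand_total = sum(totals.values())
    if grand_total == 0:
        raise ValueError("no units to apportion")
    return {col: totals[col] / grand_total for col in columns}


# Placeholder rows for illustration only; these are not real figures.
rows = [
    {"sold": "120", "unsold": "30", "not_launched": "50"},
    {"sold": "80", "unsold": "70", "not_launched": "50"},
]

shares = column_proportions(rows)
print(shares)
```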


Other than the above, the rest of the visualisation was performed in Tableau with the datasets.

Tools Utilized

Tableau: Used for data exploration, to understand the data and trends, and to visualise the other distribution graphs and charts, such as the box plot and bar chart.
QGIS: To prepare a choropleth map to visualise the penetration rate in terms of the number of units sold across Singapore.

Results

Penetration Rate

Distribution of property price

By Planning Area

By Type of Sale

Policy Recommendations

Infographics

Improvement