Difference between revisions of "Policy And Planning Proposal 2"
(11 intermediate revisions by 2 users not shown) | |||
Line 11: | Line 11: | ||
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
− | | style="padding:0.2em; font-size:100%; background-color:# | + | | style="padding:0.2em; font-size:100%; background-color:#D4AC0D; border-bottom:0px solid #3D9DD7; text-align:center; color:#100c08" width="10%" | |
− | [[Policy_And_Planning_Proposal_1|<font color="# | + | [[Policy_And_Planning_Proposal_1|<font color="#fff" size=3 face="Century Gothic"><strong>Proposal (Iter 1)</strong></font>]] |
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
− | | style="padding:0.2em; font-size:100%; background-color:# | + | | style="padding:0.2em; font-size:100%; background-color:#F5D65D; border-bottom:0px solid #3D9DD7; text-align:center; color:#fff" width="10%" | |
− | [[Policy_And_Planning_Proposal_2|<font color="#fff" size=3 face="Century Gothic"><strong>Proposal (Iter | + | [[Policy_And_Planning_Proposal_2|<font color="#100c08" size=3 face="Century Gothic"><strong>Proposal (Iter 2)</strong></font>]] |
+ | |||
+ | | style="background:none;" width="1%" | | ||
+ | | style="padding:0.2em; font-size:100%; background-color:#D4AC0D; border-bottom:0px solid #3D9DD7; text-align:center; color:#100c08" width="10%" | | ||
+ | [[Policy_And_Planning_Proposal_3|<font color="#fff" size=3 face="Century Gothic"><strong>Proposal (Iter 3)</strong></font>]] | ||
| style="background:none;" width="1%" | | | style="background:none;" width="1%" | | ||
Line 90: | Line 94: | ||
| <center><strong>Resident by planning Area, subgroup, age Group, sex and dwelling</strong> | | <center><strong>Resident by planning Area, subgroup, age Group, sex and dwelling</strong> | ||
− | ( | + | (2011 - 2019, June)<br/><br/> |
https://www.singstat.gov.sg/find-data/search-by-theme/population/geographic-distribution/latest-data </center> | https://www.singstat.gov.sg/find-data/search-by-theme/population/geographic-distribution/latest-data </center> | ||
|| | || | ||
Line 122: | Line 126: | ||
Also, as this data is very rich, we hope that it can further serve as a bridge to more abstract but complementary data. | Also, as this data is very rich, we hope that it can further serve as a bridge to more abstract but complementary data. | ||
− | | | + | |} |
− | + | </center> | |
<br> | <br> | ||
<br/> | <br/> | ||
Line 181: | Line 185: | ||
| <center> | | <center> | ||
− | '''Title''': | + | '''Title''': Ternary point chart |
− | [[File: | + | [[File:Policy_And_Planning_Background_6.png|300px|frameless|center]] |
<br/> | <br/> | ||
− | '''Source''': | + | '''Source''': http://helpdotnetvision.nevron.com/UsersGuide_ChartTypes_Ternary.html |
</center> | </center> | ||
|| | || | ||
'''Learning Points:''' | '''Learning Points:''' | ||
− | * | + | * Help to find the relationship of redistribution between 3 variables |
− | * | + | * Used colors to distinguish the different data point groups which came from different owners. |
+ | * Used grid lines to supported reader to match the values and axis | ||
<br> | <br> | ||
'''Possible Usage: ''' | '''Possible Usage: ''' | ||
− | * | + | * With animation support, it might be useful for the team to figure out the changes of the 3 variables among the planning area or entire singapore |
<br> | <br> | ||
'''Area for Improvement: ''' | '''Area for Improvement: ''' | ||
− | * | + | * Messy data labels are overlapping each other. The readers might be confused by the lower layer data values which covered by up layer |
+ | * No units metrics have been clearly assigned to the example. The readers would not able to understand the true meaning behind the every value | ||
+ | * It can be solved with animation or drop down list for switching the values to be displayed on the chart | ||
<br> | <br> | ||
|- | |- | ||
Line 218: | Line 225: | ||
'''Area for Improvement: ''' | '''Area for Improvement: ''' | ||
# Difficult to compare between bricks that are of the same size. As users would need to specifically count each of the units to identify the differences. | # Difficult to compare between bricks that are of the same size. As users would need to specifically count each of the units to identify the differences. | ||
+ | <br> | ||
+ | |- | ||
+ | |||
+ | | <center> | ||
+ | '''Title''': Bar charts with Facets | ||
+ | [[File:Policy_And_Planning_Background_7.png|300px|frameless|center]] | ||
+ | <br/> | ||
+ | '''Source''': https://www.datacamp.com/community/tutorials/facets-ggplot-r | ||
+ | </center> | ||
+ | |||
+ | || | ||
+ | '''Learning Points:''' | ||
+ | * Grouped multiple small graph into same viewing area | ||
+ | * Helpful for comparing same metrics for multiple group of region among different dimensions | ||
+ | |||
+ | <br> | ||
+ | '''Possible Usage: ''' | ||
+ | # Compare the income distribution for same planning area among the different surveys | ||
+ | <br> | ||
+ | '''Area for Improvement: ''' | ||
+ | # The charts have mixed value with and without Scientific notation together with no units. | ||
<br> | <br> | ||
|- | |- | ||
Line 363: | Line 391: | ||
<br> | <br> | ||
− | <strong>4. | + | <strong>4. Bricks Map With Simplified Social Demographics</strong> |
<br> | <br> | ||
− | [[File: | + | [[File:Policy_And_Planning_Brainstorming_5.png|500px|frameless]] |
<br> | <br> | ||
− | This | + | This will serve as the core geo-plot for our users to keep in touch with the geographic nature of analysing Singapore. <br> |
− | + | With the help of different legends and simplified demographic classes, we will be able to give the user the relevant <br> | |
− | + | time series data for them to better appreciate the context and usefulness of the other charts.<br> | |
− | |||
<br> | <br> | ||
− | <strong>5. | + | <strong>5. Ternary Chart</strong> |
<br> | <br> | ||
− | [[File: | + | [[File:Policy_And_Planning_Brainstorming_6.png|500px|frameless]] |
<br> | <br> | ||
− | + | Using a ternary chart, we could create visualisation based on 3 dependent variables thus, enabling users to have a quick overview of the distribution in all 3 dimensions.<br>The diagram shows 2 ternary charts with subzones being the main analysis and the 3 variables of the respective chart are the age group (0-19, 20-39, 40 above) and <br>types of dwellings (HDB, Condo, Landed). Hence, users are able to hover in the subzones to view the distributions of age group and type of dwellings. <br>This brings a new dynamic way of exploration to identify the similarities or distinct differences between subzones. | |
− | + | <br><b>The ternary chart will be lines of subzones points as we draw it with geographic subzones to better illustrate our brainstorming | |
− | |||
− | |||
<br> | <br> | ||
Line 395: | Line 420: | ||
|- | |- | ||
| <center> | | <center> | ||
− | '''Title''': DASHBOARD 1 - | + | '''Title''': DASHBOARD 1 - Time Series Ternary Analysis view |
[[File:Policy_And_Planning_Proposed_Storyboard_1.png|400px|frameless|center]] | [[File:Policy_And_Planning_Proposed_Storyboard_1.png|400px|frameless|center]] | ||
</center> | </center> | ||
|| | || | ||
− | * This chart serves as the landing page where users will explore the data | + | * This chart serves as the landing page where users will explore the time series data. It will offer a time based playable view showing how residential distribution changes with respect to key dimensions. |
* The time playable view will allow the user to gain deeper insights as the data can be expressed in higher dimentionality rather than condensing it to fit in a trend based line chart. | * The time playable view will allow the user to gain deeper insights as the data can be expressed in higher dimentionality rather than condensing it to fit in a trend based line chart. | ||
− | * We will | + | * We will build these charts to have interactive displays to show distributions with respect to subzones and planning areas accordingly for deeper analysis. |
|- | |- | ||
| <center> | | <center> | ||
− | '''Title''': DASHBOARD 2 - | + | '''Title''': DASHBOARD 2 - Time Series Network Diagram and Bar Chart View |
[[File:Policy_And_Planning_Proposed_Storyboard_2.png|400px|frameless|center]] | [[File:Policy_And_Planning_Proposed_Storyboard_2.png|400px|frameless|center]] | ||
</center> | </center> | ||
|| | || | ||
− | * This view | + | * This second view for the time series data will serve to allow users to do deeper analysis of more features at a go using a Network chart and bar charts according to relevant filters. |
− | * We | + | * We will build these charts to have interactive displays to show distributions with respect to subzones and planning areas accordingly for deeper analysis. |
− | |||
|- | |- | ||
| <center> | | <center> | ||
− | '''Title''': DASHBOARD | + | '''Title''': DASHBOARD 3 - Population Census Global View |
[[File:Policy_And_Planning_Proposed_Storyboard_3.png|400px|frameless|center]] | [[File:Policy_And_Planning_Proposed_Storyboard_3.png|400px|frameless|center]] | ||
</center> | </center> | ||
|| | || | ||
− | * | + | * Some of the demographics that we envision to be relevant to planning are: disparity in income/education, distribution of population by age groups etc. |
− | |||
* As such, we hope to implement charts that help to visualise some of these relationships and make the exploration more fruitful in giving insights for targeted planning and policy making. | * As such, we hope to implement charts that help to visualise some of these relationships and make the exploration more fruitful in giving insights for targeted planning and policy making. | ||
− | * | + | * The charts used in this view will be more specific to show global distributions by year. |
|- | |- | ||
| <center> | | <center> | ||
− | '''Title''': DASHBOARD | + | '''Title''': DASHBOARD 4 - Population Census 5 yearly trend analysis view |
[[File:Policy_And_Planning_Proposed_Storyboard_4.png|400px|frameless|center]] | [[File:Policy_And_Planning_Proposed_Storyboard_4.png|400px|frameless|center]] | ||
</center> | </center> | ||
|| | || | ||
− | * This | + | * This dashboard is targeted to give greater focus for demographic trend analysis. |
− | * | + | * It will use a base ranking view chart to show these trends and complimentary charts to give more specific insights to particular years. |
− | |||
|} | |} | ||
</center> | </center> |
Latest revision as of 08:55, 27 March 2020
PROBLEM & MOTIVATION
Problem Background:
With Singapore’s growing population and limited resources, she faces many pressing challenges for progressive development and economic growth. These challenges span housing affordability, rising healthcare costs, an aging population, education/income inequality, and low birth rates. For Singapore to continue progressing, it is imperative that the government continues to take proactive measures to plan and utilise its resources effectively. To this end, we strive to use visual analytics to help uncover some of the cracks and opportunities in Singapore’s social demographics, to assist the government in sharpening its current policies and looking into future plans. This is well in line with the government’s effort of making socially relevant data public to encourage innovation and discovery.
Motivation:
With the government's strong support and push for open source innovation, we felt that this is a key area where we could utilise our skills to bring value to society by informing the public and assisting decision makers with planning. Furthermore, with the government's push towards a smart nation, there are increasingly many data sets available with reasonably high dimensionality that can give us insights if visualised properly. Moreover, from our initial exploration of available datasets, we noticed that there is currently good quality data for time series residential analysis, and for population demographics using Population Census data.
PROJECT OBJECTIVES
The key objectives we strive to achieve in this project consist of providing insights in two main dimensions - Time series analysis of Resident Distribution & Deeper social demographic analysis using Population Census data.
For each of these areas we have targeted to achieve the following:
Time series analysis of Resident Distribution:
- Provide an animated map view of resident distribution (2011-2019)
- Provide complementary charts that give more specific views to observe trends over time series
- Provide interactive charts to allow users to set up simple filters and click into specific residential areas for further analysis.
- Provide chart views with multiple dimensions to allow for richer analysis (e.g. Ternary charts, Network charts)
Deeper social demographic analysis using Population Census data:
- Provide high level trend charts that map different social demographics and sentiments (2000, 2005, 2010, 2015)
- Provide interactive charts with relevant filters and customizable views to allow users to uncover deeper insights
- Provide chart views with multiple dimensions to allow for richer analysis (e.g. Ternary charts, Network charts)
We hope that by providing such charts, we will assist users to discover deeper insights in order to spur on more creative planning and policy making.
SELECTED DATABASE
Upon reviewing the first proposal, we recognised that our data sets were collected across different studies - namely an annual time series of resident data by subzone, a Population Census study done every 5 years, and other supplementary surveys for healthcare and fertility rates.
As these data sets followed different standards and were built using different samples, it would not be accurate to join them by similar features to perform cross analysis. As such, we reviewed the problem statement and decided to narrow down our data sets to only the "Residential Time series data (2011-2019)" and the "Population Census data (2000, 2005, 2010, 2015)".
These data sets will support the two main sets of dashboards: one providing analysis of granular time series resident data, and one providing higher dimensional population demographic analysis using the Population Census data set.
Dataset/Source | Data Attributes | Rationale Of Usage |
---|---|---|
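Resident by planning Area, subgroup, age Group, sex and dwelling (2011 - 2019, June)
https://www.singstat.gov.sg/find-data/search-by-theme/population/geographic-distribution/latest-data |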
|
This dataset covers a good time series from 2011-2019, and the breakdown by subzone/planning area allows it to serve as the base platform for integrating with other population data sets that are also grouped by subzone/planning area. From here, we can also get a good view of Singapore’s residential distribution by gender and age group, which might give us a few initial findings to investigate further with the help of complementary data sets. |
(2000, 2005, 2010, 2015) |
Contains the following attributes by Planning Area:
|
This data adds a very rich level of dimensionality on top of the residential data mentioned above. However, it only covers a limited number of points in time, so we intend to use this data separately for deeper, point-in-time analysis. Also, as this data is very rich, we hope that it can further serve as a bridge to more abstract but complementary data. |
BACKGROUND SURVEY
To begin, we explored current charts that were used to visualise the key areas that we defined to explore (e.g. inequality, urban planning, geo-plots).
This is a summary of the more interesting visualisations we found:
Reference of Other Interactive Visualization | Learning Point |
---|---|
Title: Income distribution by country over the years
|
Learning Points:
|
Title: Changing Ranks of States by Congressional Representation |
Learning Points:
|
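Title: Ternary point chart
Source: http://helpdotnetvision.nevron.com/UsersGuide_ChartTypes_Ternary.html
|
Learning Points:
- Helps to find the distribution relationship between 3 variables
- Uses colors to distinguish the different data point groups that come from different owners
- Uses grid lines to help the reader match the values to the axes
Possible Usage:
- With animation support, it might be useful for the team to figure out the changes in the 3 variables across the planning areas or the whole of Singapore
Area for Improvement:
- Messy data labels overlap each other. Readers might be confused by the lower layer data values that are covered by the upper layer
- No units have been clearly assigned in the example, so readers would not be able to understand the true meaning behind each value
- This could be solved with animation or a drop-down list for switching the values displayed on the chart
|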
Title: Bricks Map |
Learning Points:
|
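Title: Bar charts with Facets
Source: https://www.datacamp.com/community/tutorials/facets-ggplot-r
|
Learning Points:
- Groups multiple small graphs into the same viewing area
- Helpful for comparing the same metrics for multiple groups of regions across different dimensions
Possible Usage:
- Compare the income distribution for the same planning area across the different surveys
Area for Improvement:
- The charts mix values with and without scientific notation, and show no units.
|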
Title: Circular Network Diagram
|
Learning Points:
|
BRAINSTORMING SESSIONS
From our initial survey above, we have internalised some of the models and these are some of the ideas we came up with in our Brainstorming sessions:
1. Income Distribution By Subzone Planning Areas
The above listed chart or its modified variation will be used to describe the income distribution by the planning area or subzone.
These are the respective features of the chart:
- X-Axis: % of the population group
- Y-Axis: income group
- Users can use a drop-down list to check the income distribution for different years
- The data will be presented as a line chart / histogram
- Singapore average income and world poverty / average lines will serve as references to help the user understand where the selected planning area's income sits relative to the world or the whole of Singapore
- The small map will show the region of the selected planning area.
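The X-axis values described above could be derived roughly as follows. This is only a minimal sketch: the record layout (`area`, `income_group`, `residents`), the function name and the sample figures are all assumptions made up for illustration, not the actual census schema or data.

```python
from collections import Counter


def income_share_by_group(records, area):
    """Return {income_group: percent of residents} for one planning area.

    `records` is a hypothetical list of dicts with the keys
    "area", "income_group" and "residents".
    """
    counts = Counter()
    for rec in records:
        if rec["area"] == area:
            counts[rec["income_group"]] += rec["residents"]
    total = sum(counts.values())
    if total == 0:
        return {}  # unknown area or no residents recorded
    return {group: 100.0 * n / total for group, n in counts.items()}


# Made-up illustrative figures, not real census values.
sample = [
    {"area": "Area A", "income_group": "Below $2,000", "residents": 300},
    {"area": "Area A", "income_group": "$2,000 - $5,999", "residents": 500},
    {"area": "Area A", "income_group": "$6,000 and above", "residents": 200},
    {"area": "Area B", "income_group": "Below $2,000", "residents": 100},
]

shares = income_share_by_group(sample, "Area A")
```

Each value in `shares` would become one bar (or point on the line) for the selected planning area, and the drop-down year filter would simply select which set of records is passed in.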
2. Horizontal Network Diagram to view relationships between different dimensions
The above listed chart or its modified variation will be used to describe the redistribution for 2 or more metrics (e.g. income/housing type).
Users can use a drop-down selector to change the parameters for the chart. This would help them to find patterns across the different parameters.
E.g. Have enough HDB flats been prepared for low-income residents?
3. Rank Change Chart
This chart helps to show rank changes and progression, especially within a smaller set of data, e.g. census data for 2000, 2010, 2015.
It would help users get a quick idea of how different planning areas or subzones fared in different social demographic categories.
This would then allow users to pinpoint specific cases that are surprising/desirable/undesirable and then proceed to do the necessary precise investigation from there.
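The data preparation behind such a rank change chart might look like the sketch below. The function names, the input layout and the numbers are assumptions for illustration only; rank 1 is taken to mean the highest value of the chosen metric.

```python
def rank_by_year(values):
    """Convert {year: {area: metric}} into {year: {area: rank}}.

    Rank 1 is the area with the highest metric value in that year.
    """
    ranks = {}
    for year, by_area in values.items():
        ordered = sorted(by_area, key=lambda area: by_area[area], reverse=True)
        ranks[year] = {area: pos for pos, area in enumerate(ordered, start=1)}
    return ranks


def rank_changes(ranks, start_year, end_year):
    """Return {area: places gained}; a positive number means the area moved up."""
    start, end = ranks[start_year], ranks[end_year]
    return {area: start[area] - end[area] for area in start if area in end}


# Made-up illustrative metric values, not real census figures.
metric = {
    2000: {"Area A": 50, "Area B": 40, "Area C": 30},
    2015: {"Area A": 35, "Area B": 45, "Area C": 40},
}

ranks = rank_by_year(metric)
changes = rank_changes(ranks, 2000, 2015)
```

Areas with a large positive or negative entry in `changes` are the surprising cases a user would likely want to click into for closer investigation.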
4. Bricks Map With Simplified Social Demographics
This will serve as the core geo-plot for our users to keep in touch with the geographic nature of analysing Singapore.
With the help of different legends and simplified demographic classes, we will be able to give the user the relevant
time series data for them to better appreciate the context and usefulness of the other charts.
5. Ternary Chart
Using a ternary chart, we could create a visualisation based on 3 dependent variables (three shares that sum to 100%), thus enabling users to have a quick overview of the distribution across all 3 dimensions.
The diagram shows 2 ternary charts with subzones as the main unit of analysis; the 3 variables of the respective charts are the age groups (0-19, 20-39, 40 and above) and
types of dwellings (HDB, Condo, Landed). Hence, users are able to hover over the subzones to view the distributions of age group and type of dwelling.
This brings a new, dynamic way of exploring the data to identify similarities or distinct differences between subzones.
Note: the ternary chart will consist of subzone points; we drew it with geographic subzones to better illustrate our brainstorming.
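To make the geometry concrete, the sketch below shows one common way of placing a three-part composition inside a ternary triangle. It is an illustration under assumptions: the function name and the choice of corner for each variable are ours, and the final tool may handle this projection internally.

```python
import math


def ternary_to_xy(a, b, c):
    """Map three non-negative counts to a point in a unit ternary triangle.

    Corners (an assumed layout): 100% a -> (0, 0), 100% b -> (1, 0),
    100% c -> (0.5, sqrt(3) / 2).
    """
    total = a + b + c
    if total <= 0:
        raise ValueError("at least one component must be positive")
    b_frac = b / total
    c_frac = c / total
    x = b_frac + 0.5 * c_frac
    y = (math.sqrt(3) / 2) * c_frac
    return x, y


# Example: a hypothetical subzone with equal shares of the three age groups
# lands at the centre of the triangle.
centre = ternary_to_xy(100, 100, 100)
```

Because only the shares matter, two subzones with very different population sizes but the same age mix would land on the same point, which is what makes the chart useful for comparing subzone profiles.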
PROPOSED STORYBOARD (PAPER PROTOTYPE)
Below is the proposed storyboard for our project:
Storyboard | Insights / Comments |
---|---|
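Title: DASHBOARD 1 - Time Series Ternary Analysis view |
- This chart serves as the landing page where users will explore the time series data. It will offer a time based playable view showing how residential distribution changes with respect to key dimensions.
- The time playable view will allow the user to gain deeper insights, as the data can be expressed in higher dimensionality rather than being condensed to fit a trend based line chart.
- We will build these charts to have interactive displays to show distributions with respect to subzones and planning areas accordingly for deeper analysis.
|
Title: DASHBOARD 2 - Time Series Network Diagram and Bar Chart View |
- This second view for the time series data will allow users to do deeper analysis of more features at one go, using a Network chart and bar charts according to relevant filters.
- We will build these charts to have interactive displays to show distributions with respect to subzones and planning areas accordingly for deeper analysis.
|
Title: DASHBOARD 3 - Population Census Global View |
- Some of the demographics that we envision to be relevant to planning are: disparity in income/education, distribution of population by age groups etc.
- As such, we hope to implement charts that help to visualise some of these relationships and make the exploration more fruitful in giving insights for targeted planning and policy making.
- The charts used in this view will be more specific to show global distributions by year.
|
Title: DASHBOARD 4 - Population Census 5 yearly trend analysis view |
- This dashboard is targeted to give greater focus to demographic trend analysis.
- It will use a base ranking view chart to show these trends, and complementary charts to give more specific insights into particular years.
|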
TECHNOLOGY USED
These are the current technologies we have shortlisted that might be useful for the respective steps of the project.
We will be using the most feasible of these options, or adding others if necessary:
CHALLENGES, RISK ASSESSMENT AND MITIGATION
Challenges | Mitigation Plan |
---|---|
|
|
|
|
|
|
PROPOSED TIMELINE
This timeline shows the breakdown of tasks leading up to the project milestones, with progress as of 1 Mar 2020.
COMMENTS AND FEEDBACK
Feel free to leave us some comments so that we can improve!
No. | Name | Date | Comments |
---|---|---|---|
1. | Insert your name here | Insert date here | Insert comment here |
2. | Insert your name here | Insert date here | Insert comment here |
3. | Insert your name here | Insert date here | Insert comment here |