Difference between revisions of "Group02 proposal v2"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 251: Line 251:
 
| <center><br/>  ''' Dashboard 2: Weather Distribution with Violin Plot '''  
 
| <center><br/>  ''' Dashboard 2: Weather Distribution with Violin Plot '''  
 
</center>
 
</center>
[[File:VA2.jpg|400px|frameless|center]]
+
[[File:Proj4.jpg|400px|frameless|center]]
 
||  
 
||  
  
Line 277: Line 277:
  
  
This chart will be shown together with the chart in Dashboard 3 to help the buyer make the best-informed decisions.
+
This chart will be shown together with the chart on Dashboard 3 to help the buyer make the best-informed decisions.
  
 
|-
 
|-
 
| <center><br/>  ''' Dashboard 3: Distribution of Rain Precipitation Amount/ Temperature/ Wind Speed in Postal Areas{Based on Users selected areas in Dashboard 1} '''  
 
| <center><br/>  ''' Dashboard 3: Distribution of Rain Precipitation Amount/ Temperature/ Wind Speed in Postal Areas{Based on Users selected areas in Dashboard 1} '''  
 
</center>
 
</center>
[[File:VA3.jpg|400px|frameless|center]]
+
[[File:Proj3.jpg|400px|frameless|center]]
 
||
 
||
 
   
 
   
Line 308: Line 308:
 
| <center><br/>  ''' Dashboard 4: Comparing Rainfall {selected weather} and the median pricing of All Properties{selected property type} '''  
 
| <center><br/>  ''' Dashboard 4: Comparing Rainfall {selected weather} and the median pricing of All Properties{selected property type} '''  
 
</center>
 
</center>
[[File:VA4.jpg|400px|frameless|center]]
+
[[File:Proj5.jpg|400px|frameless|center]]
 
||  
 
||  
  
Line 329: Line 329:
  
 
With the visualization, users will be able to compare all the areas in Singapore to make an informed decision. By doing so, the chart is able to help the user in shortlisting the area that they wish to zoom into in another view of light as compared to the chart in Dashboard 1 where detailed comparison of the whole Singapore is limited.
 
With the visualization, users will be able to compare all the areas in Singapore to make an informed decision. By doing so, the chart is able to help the user in shortlisting the area that they wish to zoom into in another view of light as compared to the chart in Dashboard 1 where detailed comparison of the whole Singapore is limited.
 +
[[File:Proj.jpg|400px|frameless|center]]
 +
[[File:Proj2.jpg|400px|frameless|center]]
  
 
|}
 
|}

Revision as of 10:54, 17 March 2020

Rain & Shine(new).png

Team

 

Proposal

 

Poster

 

Application

 

Research Paper

Version1|Version2



PROBLEM & MOTIVATION

Problem
The current way of weather reporting is very simple, and does not provide much meaning to the viewer. Our team wants to present the data in a more user friendly and meaningful interpretation ways. Through the visualization, our group hopes to be able to better allow for users to be able to visualize the patterns inherent within the weather data available.

Motivation
Through this, our group hopes to be able to answer questions regarding the changes in Singapore's weather from available historical data. This will allow us to see if there are any patterns or trends, and hopefully even allow us to identify changing patterns and trends.

OBJECTIVES

Provide meaningful graphs for viewers to identify weather patterns in Singapore.

  • Changes in Singapore’s climate patterns over the past few years
    1. Rain
    2. Temperature
    3. Rainfall distribution across Singapore

DATASET

The Data Sets we will be using for our analysis and for our application is listed below:

Data/Source Variables/Description Rationale Of Usage

Temperature and Rainfall Data
(Jan 2012 - Dec 2019)

(http://www.weather.gov.sg/climate-historical-daily/)

  • Stations
  • Date
  • Daily Rainfall
  • Highest 30-min/60-min/120-min Rainfall (mm)
  • Mean/Minimum/Maximum Temperature (°C)
  • Mean/Max Wind Speed (km/h)

This dataset covers a good time series of Singapore's weather from 2012 to 2019 across different weather categories. Our team wish to spot the trend or pattern of Singapore's climate in every town if possible.

Amenities Location Data

(https://api.data.gov.sg/v1/environment/rainfall) (https://api.data.gov.sg/v1/environment/air-temperature)

  • Station ID
  • Station Name
  • Latitude
  • Longitude

The data set will be used to anchor the amenities available for the selected property in a specified range.

Note: We will be looking into the API and use the JSON format to extract the geocoordinate for our amenities. Use both links to ensure we do not miss out any possible location.

BACKGROUND SURVEY OF RELATED WORK

Below are a few visualizations and charts we considered making for our projects.

Visual Considerations Insights / Comments

Title: Monthly mean temperature compared to long term average
Monthly mean temperature.png

Source: http://www.weather.gov.sg/wp-content/uploads/2019/03/Annual-Climate-Assessment-Report-2018.pdf

This is a graph taken from the report by NEA. From this graph, we are able to see that the temperature in 2018 has exceeded the mean of temperatures of the past 30 years. However the limitation in this graph is that although we can see that the temperatures has increased, we are unable to see if this is a systemic increase, or whether it is an anomalous year for temperature.


Title: Isopleth map
Isopleth rainfall.png

Source: http://www.weather.gov.sg/wp-content/uploads/2019/03/Annual-Climate-Assessment-Report-2018.pdf

This is the isopleth map taken from the report by NEA. From the graph, we are able to see that January, June, October, and November are the months where there is more rainfall. Our group can implement such a design to let users get a feel for how the rainfall distribution across Singapore for a specific month will look like.



Title: Whisker plot of temperature
Temperature whisker.png

Source: https://www.ck12.org/statistics/box-and-whisker-plots/rwa/The-Ways-of-Weather/

We are able to see the temperature for the selected area over the course of a year. The whisker plots are able to show the upper and lower boundaries of temperature, and we can observe that the temperature gradually rises to a peak from Jan to Aug, before decreasing until December.

We hope to apply this chart to display the rainfall for a selected area over the course of a year. This allows viewers to be able to better understand the rainfall pattern. This can also be applied to temperature to get a better understanding of temperature patterns in the year.



Title: Heatmap of rainfall
Heatmap of rainfall.png

Source: https://www.shanelynn.ie/analysis-of-weather-data-using-pandas-python-and-seaborn/

This is a heatmap of daily rainfall. Darker colours of red represent heavier rainfall.

This is another way to have a visualization to understand the patterns of rainfall. Through this, we are able to quickly see how many days in a year where there is rain for a selected area. This can also be applied to temperature to get a quick visualization on how hot singapore has been across the year.


Title: Violin plot of temperature with rainfall overlaid
Violinplot rainfall.png

Source: https://www.r-bloggers.com/part-3a-plotting-with-ggplot2/

One of the plots that we chanced upon was a violin plot that overlaid the rainfall points on top. So for May, we are able to see the distribution of average temperature in that month along with how there are few days where there is 20mm of rainfall, and many days where there is no rainfall.

One reason why we can consider using this plot for our visualization is that it will allow us to merge the temperature and rainfall data together into one visualization.


Title: Ridgeline plot of temperature
Ridgeline temperature.png

Source: https://cran.r-project.org/web/packages/ggridges/vignettes/gallery.html/

This graph is a ridgeline plot about temperature over the course of a year. From this graph, we can see that the days in May - July are hotter than the days in Jan - Dec.

Our group hopes to apply this to our temperature and rainfall so that we can see if there is any change to the distribution over the course of the years that we have collected the data for.




KEY TECHNICAL CHALLENGES & MITIGATION

No. Challenge Description Mitigation Plan
1.
Software Challenge Unfamiliarity of visualisation tools such as R, R Shiny, Tableau.
  • Github Learning
  • Stackoverflow research
  • Self-directed and peer learning
  • Watch video tutorials from YouTube
  • Hands-on practice using the different training platforms such as Data Camps
2.
Programming Challenge Inexperince with data cleaning and transformation using R
  • Trial and error
  • Read online articles and forums for guidance
  • Watch video tutorials on how to fully utilise packages such as lapply, tidyr and dplyr
3.
Workload Constraint Time and Workload Constrains
  • Design reasonable project timeline based on everyone's ability and capacity.
  • Set milestones and adjust the timeline accordingly based on the team's progress.
4.
Dataset Complexity

Our have different data from multiple sources in multiple different formats, hence we foresee a huge challenge in standardizing the data

  • Note: Our current dataset is looking at 49 areas over the spread of 8 years of data, for every year there are 12 months of data. This gives a total of 4,704 CSV files to consolidate and clean for weather data alone.
  • Make use of data preparation tools such as tableau prep
  • Make use of our database management skills to normalize all data tables into third normal form

STORYBOARD

Dashboards Description

Dashboard 1: Isopleth Map for Weather
Spatial Interpolation

Our group plans to do an Isopleth Map which reflects the weather distribution based on the year, month and locations. This chart will show the data at a high level for users to identify which area is more rainy than average and which are less rainy throughout the filtered Month/Year Period.

Similarly, our team plans to do another Isopleth Map to show the distribution of the temperature throughout the whole of Singapore. This chart will show the data at a high level for users to identify which area is hotter than average and which are colder throughout the filtered Month/Year Period.

The purpose of this chart is to understand and identify the rain and temperature patterns of Singapore throughout the past 20 years so as to find out if there is a climate change and if global warming is affecting the weather in Singapore.

Filters used includes:

  • Sliders
  1. Year
  2. Months
  • Single Dropdown List
  1. Map level of detail

From this chart, users will be able to select the location of their interests to gather data from more specific charts.


Dashboard 2: Weather Distribution with Violin Plot
Proj4.jpg

The purpose of this chart is to show a detailed breakdown of the properties that meet the requirements of the Users based on his/her filters and User's shortlisted area(s) in the chart from Dashboard 1.


This chart shows all the properties and their prices for all the properties that meet the requirements of the Users based on the filter’s range and the shortlisted area(s).


Filters used includes:

  • Sliders
  1. Year
  2. Transaction Price
  3. Amenities
  • Single Dropdown List
  1. Property Type
  2. Sorted By


The chart will be able to help users better understand the property prices based on his/her shortlisted area(s) and make a decision on which area’s property to purchase.


X-Axis: Transacted Properties’ Name
Y-Axis: Transacted Pricing


This chart will be shown together with the chart on Dashboard 3 to help the buyer make the best-informed decisions.


Dashboard 3: Distribution of Rain Precipitation Amount/ Temperature/ Wind Speed in Postal Areas{Based on Users selected areas in Dashboard 1}
Proj3.jpg

The purpose of this chart is to show a detailed breakdown of the weather (which includes Rain Precipitation Amount/ Temperature/ Wind Speed) for User’s shortlisted area(s) from the chart in Dashboard 1.


Filters used includes:

  • Sliders
  1. Year
  2. Month
  • Single Dropdown List
  1. Level of Detail


The user can adjust the filters to identify any patterns or trends of the weather based on the short-listed areas in Singapore. This chart can help the user better identify which area best suits the user based on his preferences and needs.


X-Axis: Level of Detail(Sub-zone/ Postal Area/ Zone)
Y-Axis: Rain Precipitation Amount/ Temperature/ Wind Speed


This chart will be shown together with the chart on Dashboard 2 to help the buyer make the best-informed decisions.


Dashboard 4: Comparing Rainfall {selected weather} and the median pricing of All Properties{selected property type}
Proj5.jpg

The chart in this storyboard reflects a combination of two thematic maps, a bar chart map as well as a spatial interpolation map. Similar to the chart in Dashboard 1, this chart will show data at a high level for users to identify which area meets their needs in a glance. However, this chart is more-straightforward as the results shown in this chart is more clear cut and less informative.


In this chart, the user will be able to compare:

  • The weather of the user’s choice in each zone/postal area.
  • Median pricing of their selected property type in each zone/postal area.


Filters used includes:

  • Sliders
  1. Year
  • Single Dropdown List
  1. Level of Detail
  2. Weather
  3. Property Type


With the visualization, users will be able to compare all the areas in Singapore to make an informed decision. By doing so, the chart is able to help the user in shortlisting the area that they wish to zoom into in another view of light as compared to the chart in Dashboard 1 where detailed comparison of the whole Singapore is limited.

Proj.jpg
Proj2.jpg

MILESTONES

Photo 2020-03-01 16-22-33.jpg

COMMENTS

No. Name Date Comments
1. (Name) (Date) (Comment)
2. (Name) (Date) (Comment)
3. (Name) (Date) (Comment)