IS428 AY2019-20T1 Assign Goh Jia Xian

From Visual Analytics for Business Intelligence
Revision as of 22:31, 13 October 2019 by Jiaxian.goh.2016 (talk | contribs)
Jump to navigation Jump to search

Vast Challenge 2019, Mini-Challenge 1: Crowdsourcing for Situational Awareness

Overview

St. Himark has been hit by an earthquake, leaving officials scrambling to determine the extent of the damage and dispatch limited resources to the areas in most need. They quickly receive seismic readings and use those for an initial deployment but realize they need more information to make sure they have a realistic understanding of the true conditions throughout the city.

In a prescient move of community engagement, the city had released new damage reporting mobile application shortly before the earthquake. This app allows citizens to provide more timely information to the city to help them understand damage and prioritize their response.


Objective

With emergency services stretched thin, officials are relying on citizens to provide them with much-needed information about the effects of the quake to help focus recovery efforts. By combining seismic readings of the quake, responses from the app, and background knowledge of the city, help the city triage their efforts for rescue and recovery.

  1. Emergency responders will base their initial response on the earthquake shake map. Use visual analytics to determine how their response should change based on damage reports from citizens on the ground. How would you prioritize neighbourhoods for response? Which parts of the city are hardest hit? Limit your response to 1000 words and 10 images.
  2. Use visual analytics to show uncertainty in the data. Compare the reliability of neighbourhood reports. Which neighbourhoods are providing reliable reports? Provide a rationale for your response. Limit your response to 1000 words and 10 images.
  3. How do conditions change over time? How does uncertainty in change over time? Describe the key changes you see. Limit your response to 500 words and 8 images.
  4. The data for this challenge can be analyzed either as a static collection or as a dynamic stream of data, as it would occur in a real emergency. Describe how you analyzed the data - as a static collection or a stream. How do you think this choice affected your analysis? Limit your response to 200 words and 3 images.


Useful information from “St. Himark - About Our City”

From the description about St. Himark, we can come to gather some useful information about the city’s infrastructure. These are the on-going projects in the following neighbourhoods to improve its infrastructure:

Jarrett-On-going Projects.png

The above-mentioned projects may affect the scores on damages reported by the citizens on the mobile application.


Data Preparation

Upon looking through mc1-reports-data.csv, the file simply contains the scale of the damage (Building, Medicine, Sewage and Water, Power and Roads and Bridges) and Shake Intensity reported by a Rumble user from a specific neighbourhood specified as location (integer). The data provided is ‘user-friendly’ for me to work with but I will need to create new columns in the data file to provide more information to the charts.

Problem #1 No Unique Identifier
Issue Each row in mc1-reports-data.csv is a unique report received from Rumble mobile application’s user. However, each row does not contain a Unique Identifier (ID) to distinguish each report. Without an ID, it may result in data inaccuracy as I proceed to data manipulation or chart creation.
Solution
Jarrett-Report ID.png
Problem #2 Poor Location Description
Issue The data file provided identifies each location via Location (number) which is not human-readable in understanding where the location, it lacks a name description about the neighbourhood.
Solution
Jarrett-Neighbourhood.png
Problem #3 St. Himark Shapefile Provided Does Not Fill Background Map
Issue Having tried to use the shapefile that has been provided in mini-challenge 2 together with a background image, the polygon does not fully fill the image and it looks like points on the map as shown below.
Jarrett-Problem Issue 1 (Scattered Geometry).png
StHimarkNeighborhoodMapNoLabels.png from MC2 has been redesigned to achieve the above background
Solution Using this HTML tool downloaded from https://github.com/bryantbhowell/tableau-map-pack/blob/master/draw_tableau_polygons_on_background_image.html, I have managed to manually plot the polygons and generated a 'Neighbourhood Polygon.csv'. Now, the polygons have successfully fill in the background map.
Jarrett-Polygon tool.png
Screenshot of the downloaded tool
Jarrett-polygon-csv.png
Snippet of the csv generated


Importing Data into Tableau

Data Files to be Imported:
I will be working with the following data files:

  1. mc1-reports-data.csv - Data of damage/shake intensity reported by Rumble users with updated fields
  2. Neighbourhood_Polygon.csv - Data containing the polygon of each neighbourhood


Connecting the data files
Before I may begin using the data to plot to generate my charts and dashboards, I need to import the above-mentioned data files and create a relationship between these data sources in Tableau (as shown below).

Jarrett-data relation.png


Additional Data Transformation in Tableau

Measure Created Purpose
Total Damage (Measure) To distinguish the neighbourhood that has been hit the hardest across
Jarrett-Total-Damage.png

Interactive Visualisation

My interactive visualization dashboard can be accessed here: https://public.tableau.com/profile/goh.jia.xian#!/vizhome/GohJiaXian-IS428AssignmentMC1Dashboard/Homepage

In my visualisation, you will find three interactive dashboards - Homepage, Earthquake Damage Report and Reliability Report - where you can use to perform your visual analysis on. Allow me to explain my dashboards in greater details below:

Homepage (Landing Page)
Description of this dashboard
The homepage acts as the landing page when you first access my dashboards created on Tableau. It first introduces you to the background of the problem that the dashboard is helping to resolve and explain how this dashboard has been designed to tackle the problems. The data exploratory tools are displayed on the bottom of the page where you can click on them to navigate to the dashboard that you desire to begin your data exploration.
Jarrett-dashboard-homepage.png

Interactive Technique Used
  1. Select (Pointer) - To select the button on navigate to
Earthquake Damage Report
Description of this dashboard
This dashboard has been designed to help you to visualise the shake intensity and damages that have been experienced by St. Himark. The first section provides you with the tools you need to visualise the damage and shake that has occurred through the animation.
Jarrett-dashboard-earthquake1.png

In the bottom section, it provides you with a time series information of damages on a granular level, you may use the heatmap to identify the time period with the highest level of damage/shake intensity reported in each neighbourhood. The barchart and scatterplot helps you rank the neighbourhoods that may be needed to prioritise in the rescue efforts.

Jarrett-dashboard-earthquake2.png

Interactive Technique Used
  1. Filter - The dashboard allows users to filter the neighbourhood (multi-select), the Date/Time, the measures that the user wishes to perform his analysis on
  2. Select (Pointer) - To select the specific neighbourhood or time period the user wishes to filter across the dashboard
  3. Select (Hover) - Tooltips provides more information on the point of interest
  4. Select (Hover - Linked chart) - On the heatmap, a linked boxplot will appear to provide more information on the data's realiability as shown in the image below.
    Jarrett-Q1 Medical Damages-Scale 9.png

Reliability Report
Description of this dashboard
The reliability report's dashboard primarily shows you the frequency of report and standard deviations of the data reported by the users. It allows user to assess the neighbourhoods that are more credible in their reporting in every category.
Jarrett-dashboard-reliable.png


Interactive Technique Used
  1. Filter - The dashboard allows users to filter the neighbourhood (multi-select), the Date/Time, the measures that the user wishes to perform his analysis on
  2. Select (Pointer) - To select the specific neighbourhood or time period the user wishes to filter across the dashboard
  3. Select (Hover) - Tooltips provides more information on the point of interest