IS428 AY2019-20T1 Assign Teng Jing Wen

From Visual Analytics for Business Intelligence
Revision as of 22:04, 13 October 2019 by Jwteng.2017 (talk | contribs) (→‎Q3)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Problem and Motivation

St. Himark has been hit by an earthquake, leaving officials scrambling to determine the extent of the damage and dispatch limited resources to the areas in most need. They quickly receive seismic readings and use those for an initial deployment but realize they need more information to make sure they have a realistic understanding of the true conditions throughout the city.

In a prescient move of community engagement, the city had released a new damage reporting mobile application shortly before the earthquake. This app allows citizens to provide more timely information to the city to help them understand damage and prioritize their response. In this mini-challenge, use app responses in conjunction with shake maps of the earthquake strength to identify areas of concern and advise emergency planners. Note: the shake maps are from April 6 and April 8 respectively.

With emergency services stretched thin, officials are relying on citizens to provide them with much needed information about the effects of the quake to help focus recovery efforts.

By combining seismic readings of the quake, responses from the app, and background knowledge of the city, help the city triage their efforts for rescue and recovery.

Tasks and Questions:

  1. Emergency responders will base their initial response on the earthquake shake map. Use visual analytics to determine how their response should change based on damage reports from citizens on the ground. How would you prioritize neighborhoods for response? Which parts of the city are hardest hit? Limit your response to 1000 words and 10 images.
  2. Use visual analytics to show uncertainty in the data. Compare the reliability of neighborhood reports. Which neighborhoods are providing reliable reports? Provide a rationale for your response. Limit your response to 1000 words and 10 images.
  3. How do conditions change over time? How does uncertainty in change over time? Describe the key changes you see. Limit your response to 500 words and 8 images.

Transforming and analysing the dataset

Before starting our analysis, we will first analyse the data attributes provided. There is a single file provided in the MC1 assignment with the following fields:

  • time: timestamp of incoming report/record, in the format YYYY-MM-DD hh:mm:ss
  • location: id of neighborhood where person reporting is feeling the shaking and/or seeing the damage
  • {shake_intensity, sewer_and_water, power, roads_and_bridges, medical, buildings}: reported categorical value of how violent the shaking was/how bad the damage was (0 - lowest, 10 - highest; missing data allowed)

We will first transform and clean the data for easier analysis on Tableau in the later steps.

Data Cleaning

Problem 1 Transposing the Data
Issue The damage reports given contain multiple columns of damage area reported and this will make it difficult for interpretation on Tableau, especially when attempting to plot charts.
Solution
Transpose.png
Problem 2 Displaying the polygon data on a background image
Issue The shapefile (polygon) provided in the MC2 dataset will be helpful in analysing the questions in MC1. While it is easy to import the shapefile into Tableau, it does not show out well once a background image is added to the back of the polygon map.
Solution

To resolve this issue, we will need to grow our own filled map. We will need to import the shapefile (of polygon data) in Tableau and transform them to a geographic coordinate reference - the latitude and longtitude coordinate points, and there after generate CSV files which will be needed for custom geocoding roles. The end result from this process will give us two CSV files: (1) St.Himark Points and (2) St.Himark Features.



Dataset Import Structure & Process

After performing the data transformation, the data from 3 data sources will be imported into Tableau.

Importdata.png

As there are 3 different datasets to import, we will need to perform inner joins on the tables together with a common attribute. The relationship between each data source is as follow:

  1. Between reports data and StHimark Features
    Innerjoin-1n2.png
  2. Between StHimark Features and StHimark Points
    Innerjoin-2n3.png

After joining the data by common attribute, additional processing to the data sources still has to be done.

Additional Processing: Emergency Reports Data

No. Brief Implementation
1.

Under the data source tab, we will rename the "Time" field into Date and create an additional Time field. The following formula will be used: “MAKETIME(DATEPART('hour',[Date]), DATEPART('minute',[Date]),0)”

Timeformula.png
2.
  • Under the data source tab, we will create a new calculated field called "Event Date". This will bin the reporting dates into 3 categories - pre-earthquake (date before 8th), actual earthquake (date = 8th) and post-earthquake (date after 8th).
  • The formula used:
Eventdate-formula.png
3. Under a new Worksheet, perform the following actions:
  • Convert Damage Scale and Shake Intensity to dimensions
  • Customise the Time field to date format "hh:mm"

Additional Processing: St.Himark Points and St.Himark Features

No. Brief Implementation
1. After defining the relationship between each data sources, we will create the polygon map.
Polygon2.png

*Hide the X and Y headers

2. As shown in the above image, we will add a background image onto Tableau.
  • Click on Map > Background Images > Add Image:
Bgmap-config.png
  • Under the same window, switch to the Options tab, ensure that both checkboxes are checked (Lock Aspect Ratio & Show Entire Image).
  • Click OK, and the background image can be applied to any worksheets.

Interactive Visualisation and Design Implementation

The interactive dashboard has been published on Tableau Public and can be accessed at: https://public.tableau.com/profile/jwteng.2017#!/vizhome/mc1-final/Home

Home Dashboard: Interactive Visualisation

Interactive Technique Rationale Brief Implementation Steps
Clickable icons
Homepg3.png
Allows user to navigate to other dashboards based on selected options.
  • Create a new sheet for each icon and add an action to them on the homepage dashboard.
  • Change the target sheet for each icon accordingly.

Home Dashboard: Design Implementation

Design Rationale & Brief Implementation
Design-hmpg.png
The home page dashboard gives the analyst an overview of what insights the dashboards will provide and directs the user to either dashboard based on their objectives. It also serves as a navigation tool, for the user to switch between Live Emergency Reports, and an overall Emergency Report Analysis.

Live Emergency Reports Dashboard: Interactive Visualisation

Interactive Technique Rationale Brief Implementation Steps
Date and Time Filter
Datefilter2.png
Timefilter.png
  • Allows user to input date/time range on the slider.
  • Date and time are separated filters so that user could easily analyse the live reports on an hourly basis within the same day.
Drop date and time to filters and select relative range to obtain the slider feature.
Location Filter
Locationfilter.png
  • Allows user to select the locations (neighbourhoods) of interest to analyse/work out a response plan.
  • The location filter is a multi-select dropdown list which allows them to select more than one location to view at one time.
Configure filter format to multi-select dropdown list.
Time Series Animation
Timeseries.png
  • Allows user to look at the changes in the emergency reports overtime on an hourly basis.
  • This helps them to gather insights on the overall trend of the earthquake based on the neighbourhood reports.
  • The time interval is set to hourly instead of minute as a minute interval animation would be too long given more data points.
  • Add date filter (based on hour) to the pages on either worksheet.
  • Add the filter to the dashboard apply filter to all worksheets within the dashboard.

Live Emergency Reports Dashboard: Design Implementation

Design Rationale & Brief Implementation
Db2.png
  • The dashboard begins with filters and a dashboard guide so that the user could follow through the steps needed to obtain meaningful insights.
  • A geographical map containing the neighbourhood ID and name is provided for easy and convenient reference during the process of working out a response plan.


Db2-2.png
  • A brief description of each facility/damage area is provided to the user for better contextual knowledge. This might aid them in their analysis to determine if the damage reports made could possibly be from the ongoing projects instead of the actual earthquake.
  • The damage scale colour intensity legend is provided for easier reference to identify the more 'critical' neighbourhoods.


Emergency Report Analysis Dashboard: Interactive Visualisation

Interactive Technique Rationale Brief Implementation Steps
Day of Date Filter
Dayofdate.png
  • This is a multi-select dropdown list that allows user to select multiple dates from the 6th to 11th of April.
Configure filter to multi-select dropdown list.
Event Date Filter
Eventfilter.png
  • Multi-select list that allows user to conveniently compare reports that were made before/during/after the earthquake.
Configure filter to multi-select dropdown list.
Time Filter
Timefilter.png
  • Allows user to input date/time range on the slider.
  • Date and time are separated filters so that user could easily analyse the live reports on an hourly basis within the same day.
Drop date and time to filters and select relative range to obtain the slider feature.
Location Filter
Locationfilter2.png
  • Multi-select list allows user to make comparisons amongst selected locations/neighbourhoods conveniently.
Configure filter to multi-select dropdown list.
Damage area (facility) Filter
Damagearea2.png
  • Multi-select list allows user to compare/select the facilities of interest and obtain insights from the damage reports.
Configure filter to multi-select dropdown list.

Emergency Report Analysis Dashboard: Design Implementation

Design Rationale & Brief Implementation
Db3-3.png
  • The dashboard begins with a map of St.Himark as well as the pre-earthquake and major earthquake shake maps for easy comparison to determine if the reports made have been reliable.
  • The dashboard has a guide that tells the analyst to select the filters based on the points of interest.
  • Most of the filters are multi-select dropdown lists except for time which is filtered with a slider. This allows the analyst to provide a range for the time.
  • The chart on the damage reports made by the neighbourhoods is placed beside the chart on shake intensity reports as this allows for quick reference/comparison between both reports. A comparison between both charts will show any association between the shake intensity experienced and damage observed. For instance, if a particular neighbourhood reported high damage done at a particular time, yet the shake intensity was reported low, the discrepancy shows that the damage done was likely due to other factors instead of the earthquake. Hence, further analysis will have to be done to determine if the reports were based on the earthquake impacts.
Db3-4.png
  • The two charts at the bottom of the dashboard aim to provide in-depth insights on the reports made overtime.
  • The heatmap on the damage reports are filtered by the type of facility and clearly shows the report from each neighbourhood, which could be useful to determine if the the damage reported could be due to existing works within the town. A quick look at the chart will show the highest number of reports within each category, given the intensity of the colour.
  • Likewise for the heatmap on the shake intensity reports, the reports are classified into 3 categories - pre/actual/post earthquake. This gives a quick comparison for the analyst as it distinctly shows the change in shake intensity reports overtime.


Insights from dashboards

Q1

Emergency responders will base their initial response on the earthquake shake map. Use visual analytics to determine how their response should change based on damage reports from citizens on the ground. How would you prioritize neighborhoods for response? Which parts of the city are hardest hit? Limit your response to 1000 words and 10 images.

No. Observations
1 Based on the geographical map with various colour intensities, we can tell that the average shake intensity reported by citizens on 8/4/2020, is the highest in Neighbourhood 3. This insight is highly reliable based on the earthquake shake map provided as the earthquake comes from the north-east direction. Hence it is likely that Neighbourhoods 3, 4, 12 and 7 have experienced larger hits.
Shakeintensitymap.png
2 Heatmap shows the increase in number of reports from 8am to 12pm. It is hence possible to deduce that the actual earthquake started around 8am until approximately 11am. We will hence filter our reportings based on the following timeframe (Date = 8/4/2020, Time= 8:00:00AM to 11:00:00AM).
Heatmap.png
3 It will be insufficient to work out an emergency response plan based on the findings of the shake intensity reports alone. Assuming that the emergency response team has separate departments for each facility within St.Himark, we will analyse the damage reports categorized by facilities (e.g. Sewer and Water, Medical facilities etc).
Facilities-damage.png

Insights show that the towns along Neighbourhoods 7 to 9 have reported relatively higher damage done on facilities such as roads and bridges, sewer and water, and buildings. According to these data, it is likely that emergency responders have to be deployed to these neighbourhoods within a timely period as well, considering the high damage ratings reported.

Conclusions:

  • Based on our findings from the damage and shake intensity reports, generally, Neighbourhood 3 appears to have experienced the largest hit from the earthquake on 8th April, between 08:00 AM to 11:00 AM.
  • Concerning the prioritising of the emergency response - the emergency responders under the medical facilities department should note that there are neighbourhoods with medical facilities that provide trauma and critical care services. This could imply that sufficient attention should also be brought to assess the criticality of the situation on these areas. Likewise, for the power department, there is a nuclear power plant at Neighbourhood 4 which is responsible for supplying 72% of St.Himark’s power. It will be critical to assess the condition and deploy aid to that area in order to help regain power for the majority of the residents.

Q2

Use visual analytics to show uncertainty in the data. Compare the reliability of neighborhood reports. Which neighborhoods are providing reliable reports? Provide a rationale for your response. Limit your response to 1000 words and 10 images.

No. Observations
1 We can interpret that reports on the current shake intensity is still constantly sent even a day after the earthquake, on the 8th. This implies that there is a level of inaccuracy in the reports made. Analysing the heatmap that has been filtered to show reports made post-earthquake, it is seen that Neighbourhood 3 has the greatest number of reports (indicating high shake intensity) made followed by Neighbourhoods 4 and 12.
2-shakeintensity.png
3-shakeintensity.png
2 Next, the shake intensity reports also showed that Neighbourhood 10 had generally rated higher intensity on 9th April than 8th April.
Damage1.png
3 It can be inferred that more reports on damage and shake intensity were made on the earlier morning of 9th April, while 8th April had minimal reports made. Additionally, the damage done rated on power was seemingly high at a rank point of 9. This could signal a power failure that resulted in the delay of reports made during the actual earthquake.
Damage10-1.png
4 To further confirm our hypothesis, the filter of event date is set to display the charts on the day of the actual earthquake. Based on chart below, it can hence be deduced that there could indeed be a power outage during the earthquake at Neighbourhood 10.
Damage10-2.png
Hence, the reason for its higher rating post-earthquake was not due to inaccuracy in the citizens’ reports but rather, a power failure.
5 The heatmap below shows the damage reports made before the day of the earthquake, and generally, Neighbourhoods 3 and 9 have also reported on damages observed. Since these damages happen before the accident, it likely arose from ongoing maintenance works by the facilities.
Preearthquake-report.png

Conclusions:

  • Neighbourhoods 3 and 9 reported damage observed that could be from maintenance works instead of the earthquake. This shows that the emergency reports were made not pertaining to the earthquake, which could have diverted the emergency response efforts to aid the neighbourhoods that require urgent assistance.
  • The uncertainty of reports, which also refers to the inaccuracy of reports could also be due to power issues that caused a delay in the time the reports were sent to the responders.

Q3

How do conditions change over time? How does uncertainty in change over time? Describe the key changes you see. Limit your response to 500 words and 8 images.

No. Observations
1 To analyse the conditions and uncertainty changes overtime, we will need to draw insights from both dashboards.

Based on the shake intensity map, we can note that the reports on shake intensity were generally low until 8th April 8:00AM where the intensity was reported higher. These reports lasted until the early hours of 9th April (perhaps due to power/network delay in sending the reports). However, looking at the change in condition (based on reports made), there could likely be another minor earthquake on 9th April at around 3PM. The reports on shaking slowed down after 9PM on 9th April. While this is not reflected on the response shake map, with more than a couple of neighbourhoods making shake reports, this should likely not be a case of unreliable reporting.

Conditionchange.png
2 Next, running through the time series animation of the damage reports on each facility, at particular timeframes, there are certain neighbourhoods with no reports made on all facilities (i.e. null values in the reports). This could also be due to intermittent power outages around the towns (e.g. Neighbourhoods 3 and 5) due to ongoing project works.

On 8 April 2020, 12PM:

Nb3n10.png

On 8 April 2020, 2PM:

Missingval.png

Overtime, various parts of the city had no reports made. This suggests a likelihood that the earthquake could have made an impact on the power across the neighbourhoods.

Conclusions:

  • Apart from an earthquake that was reported on 8th April 2020, there seems to be another minor earthquake that happened on 9th April 2020, based on the slight increase in reports made during the time frame. This however, was not shown on the shake maps provided.
  • In general, the damage reports were sent in way before the earthquake (8th) but there became an increase in reports on the 8th. This trend also applies to the reports made on shake intensity. The majority of reports were mostly sent between 8th - 10th April 2020. While most citizens rated a moderate shake intensity (below 8), there were many citizens rating the damage observed above 6.
  • Regarding the change in uncertainty overtime, it is still fairly questionable if the reports sent in are 'real-time'. Because during the actual earthquake there might be power failures that lead to a delay in the sending of emergency reports. However, after the earthquake, it is more likely that power have been restored and the reports sent in are real-time.

References

The following references have been useful in the completion of the analysis process:

Comments