IS428 AY2019-20T1 Assign Teng Jing Wen
Contents
- 1 Problem and Motivation
- 2 Transforming and analysing the dataset
- 3 Dataset Import Structure & Process
- 4 Interactive Visualisation and Design Implementation
- 4.1 Home Dashboard: Interactive Visualisation
- 4.2 Home Dashboard: Design Implementation
- 4.3 Live Emergency Reports Dashboard: Interactive Visualisation
- 4.4 Live Emergency Reports Dashboard: Design Implementation
- 4.5 Emergency Report Analysis Dashboard: Interactive Visualisation
- 4.6 Emergency Report Analysis Dashboard: Design Implementation
- 5 Insights from dashboards
- 6 References
- 7 Comments
Problem and Motivation
St. Himark has been hit by an earthquake, leaving officials scrambling to determine the extent of the damage and dispatch limited resources to the areas in most need. They quickly receive seismic readings and use those for an initial deployment but realize they need more information to make sure they have a realistic understanding of the true conditions throughout the city.
In a prescient move of community engagement, the city had released a new damage reporting mobile application shortly before the earthquake. This app allows citizens to provide more timely information to the city to help them understand damage and prioritize their response. In this mini-challenge, use app responses in conjunction with shake maps of the earthquake strength to identify areas of concern and advise emergency planners. Note: the shake maps are from April 6 and April 8 respectively.
With emergency services stretched thin, officials are relying on citizens to provide them with much needed information about the effects of the quake to help focus recovery efforts.
By combining seismic readings of the quake, responses from the app, and background knowledge of the city, help the city triage their efforts for rescue and recovery.
Tasks and Questions:
- Emergency responders will base their initial response on the earthquake shake map. Use visual analytics to determine how their response should change based on damage reports from citizens on the ground. How would you prioritize neighborhoods for response? Which parts of the city are hardest hit? Limit your response to 1000 words and 10 images.
- Use visual analytics to show uncertainty in the data. Compare the reliability of neighborhood reports. Which neighborhoods are providing reliable reports? Provide a rationale for your response. Limit your response to 1000 words and 10 images.
- How do conditions change over time? How does uncertainty in change over time? Describe the key changes you see. Limit your response to 500 words and 8 images.
Transforming and analysing the dataset
Before starting our analysis, we will first analyse the data attributes provided. There is a single file provided in the MC1 assignment with the following fields:
- time: timestamp of incoming report/record, in the format YYYY-MM-DD hh:mm:ss
- location: id of neighborhood where person reporting is feeling the shaking and/or seeing the damage
- {shake_intensity, sewer_and_water, power, roads_and_bridges, medical, buildings}: reported categorical value of how violent the shaking was/how bad the damage was (0 - lowest, 10 - highest; missing data allowed)
We will first transform and clean the data for easier analysis on Tableau in the later steps.
Data Cleaning
Problem 1 | Transposing the Data |
---|---|
Issue | The damage reports given contain multiple columns of damage area reported and this will make it difficult for interpretation on Tableau, especially when attempting to plot charts. |
Solution |
Problem 2 | Displaying the polygon data on a background image |
---|---|
Issue | The shapefile (polygon) provided in the MC2 dataset will be helpful in analysing the questions in MC1. While it is easy to import the shapefile into Tableau, it does not show out well once a background image is added to the back of the polygon map. |
Solution |
To resolve this issue, we will need to grow our own filled map. We will need to import the shapefile (of polygon data) in Tableau and transform them to a geographic coordinate reference - the latitude and longtitude coordinate points, and there after generate CSV files which will be needed for custom geocoding roles. The end result from this process will give us two CSV files: (1) St.Himark Points and (2) St.Himark Features. |
Dataset Import Structure & Process
After performing the data transformation, the data from 3 data sources will be imported into Tableau.
As there are 3 different datasets to import, we will need to perform inner joins on the tables together with a common attribute. The relationship between each data source is as follow:
- Between reports data and StHimark Features
- Between StHimark Features and StHimark Points
After joining the data by common attribute, additional processing to the data sources still has to be done.
Additional Processing: Emergency Reports Data
No. | Brief Implementation |
---|---|
1. |
Under the data source tab, we will rename the "Time" field into Date and create an additional Time field. The following formula will be used: “MAKETIME(DATEPART('hour',[Date]), DATEPART('minute',[Date]),0)” |
2. |
|
3. | Under a new Worksheet, perform the following actions:
|
Additional Processing: St.Himark Points and St.Himark Features
No. | Brief Implementation |
---|---|
1. | After defining the relationship between each data sources, we will create the polygon map.
*Hide the X and Y headers |
2. | As shown in the above image, we will add a background image onto Tableau.
|
Interactive Visualisation and Design Implementation
The interactive dashboard has been published on Tableau Public and can be accessed at: https://public.tableau.com/profile/jwteng.2017#!/vizhome/mc1-final/Home
Home Dashboard: Interactive Visualisation
Interactive Technique | Rationale | Brief Implementation Steps |
---|---|---|
Allows user to navigate to other dashboards based on selected options. |
|
Home Dashboard: Design Implementation
Design | Rationale & Brief Implementation |
---|---|
The home page dashboard gives the analyst an overview of what insights the dashboards will provide and directs the user to either dashboard based on their objectives. It also serves as a navigation tool, for the user to switch between Live Emergency Reports, and an overall Emergency Report Analysis. |
Live Emergency Reports Dashboard: Interactive Visualisation
Interactive Technique | Rationale | Brief Implementation Steps |
---|---|---|
|
Drop date and time to filters and select relative range to obtain the slider feature. | |
|
Configure filter format to multi-select dropdown list. | |
|
|
Live Emergency Reports Dashboard: Design Implementation
Design | Rationale & Brief Implementation |
---|---|
| |
|
Emergency Report Analysis Dashboard: Interactive Visualisation
Interactive Technique | Rationale | Brief Implementation Steps |
---|---|---|
|
Configure filter to multi-select dropdown list. | |
|
Configure filter to multi-select dropdown list. | |
|
Drop date and time to filters and select relative range to obtain the slider feature. | |
|
Configure filter to multi-select dropdown list. | |
|
Configure filter to multi-select dropdown list. |
Emergency Report Analysis Dashboard: Design Implementation
Design | Rationale & Brief Implementation |
---|---|
| |
|
Insights from dashboards
Q1
Emergency responders will base their initial response on the earthquake shake map. Use visual analytics to determine how their response should change based on damage reports from citizens on the ground. How would you prioritize neighborhoods for response? Which parts of the city are hardest hit? Limit your response to 1000 words and 10 images.
No. | Observations |
---|---|
1 | Based on the geographical map with various colour intensities, we can tell that the average shake intensity reported by citizens on 8/4/2020, is the highest in Neighbourhood 3. This insight is highly reliable based on the earthquake shake map provided as the earthquake comes from the north-east direction. Hence it is likely that Neighbourhoods 3, 4, 12 and 7 have experienced larger hits. |
2 | Heatmap shows the increase in number of reports from 8am to 12pm. It is hence possible to deduce that the actual earthquake started around 8am until approximately 11am. We will hence filter our reportings based on the following timeframe (Date = 8/4/2020, Time= 8:00:00AM to 11:00:00AM). |
3 | It will be insufficient to work out an emergency response plan based on the findings of the shake intensity reports alone. Assuming that the emergency response team has separate departments for each facility within St.Himark, we will analyse the damage reports categorized by facilities (e.g. Sewer and Water, Medical facilities etc).
Insights show that the towns along Neighbourhoods 7 to 9 have reported relatively higher damage done on facilities such as roads and bridges, sewer and water, and buildings. According to these data, it is likely that emergency responders have to be deployed to these neighbourhoods within a timely period as well, considering the high damage ratings reported. |
Conclusions:
- Based on our findings from the damage and shake intensity reports, generally, Neighbourhood 3 appears to have experienced the largest hit from the earthquake on 8th April, between 08:00 AM to 11:00 AM.
- Concerning the prioritising of the emergency response - the emergency responders under the medical facilities department should note that there are neighbourhoods with medical facilities that provide trauma and critical care services. This could imply that sufficient attention should also be brought to assess the criticality of the situation on these areas. Likewise, for the power department, there is a nuclear power plant at Neighbourhood 4 which is responsible for supplying 72% of St.Himark’s power. It will be critical to assess the condition and deploy aid to that area in order to help regain power for the majority of the residents.
Q2
Use visual analytics to show uncertainty in the data. Compare the reliability of neighborhood reports. Which neighborhoods are providing reliable reports? Provide a rationale for your response. Limit your response to 1000 words and 10 images.
No. | Observations |
---|---|
1 | We can interpret that reports on the current shake intensity is still constantly sent even a day after the earthquake, on the 8th. This implies that there is a level of inaccuracy in the reports made. Analysing the heatmap that has been filtered to show reports made post-earthquake, it is seen that Neighbourhood 3 has the greatest number of reports (indicating high shake intensity) made followed by Neighbourhoods 4 and 12. |
2 | Next, the shake intensity reports also showed that Neighbourhood 10 had generally rated higher intensity on 9th April than 8th April. |
3 | It can be inferred that more reports on damage and shake intensity were made on the earlier morning of 9th April, while 8th April had minimal reports made. Additionally, the damage done rated on power was seemingly high at a rank point of 9. This could signal a power failure that resulted in the delay of reports made during the actual earthquake. |
4 | To further confirm our hypothesis, the filter of event date is set to display the charts on the day of the actual earthquake. Based on chart below, it can hence be deduced that there could indeed be a power outage during the earthquake at Neighbourhood 10. Hence, the reason for its higher rating post-earthquake was not due to inaccuracy in the citizens’ reports but rather, a power failure. |
5 | The heatmap below shows the damage reports made before the day of the earthquake, and generally, Neighbourhoods 3 and 9 have also reported on damages observed. Since these damages happen before the accident, it likely arose from ongoing maintenance works by the facilities. |
Conclusions:
- Neighbourhoods 3 and 9 reported damage observed that could be from maintenance works instead of the earthquake. This shows that the emergency reports were made not pertaining to the earthquake, which could have diverted the emergency response efforts to aid the neighbourhoods that require urgent assistance.
- The uncertainty of reports, which also refers to the inaccuracy of reports could also be due to power issues that caused a delay in the time the reports were sent to the responders.
Q3
How do conditions change over time? How does uncertainty in change over time? Describe the key changes you see. Limit your response to 500 words and 8 images.
No. | Observations |
---|---|
1 | To analyse the conditions and uncertainty changes overtime, we will need to draw insights from both dashboards.
Based on the shake intensity map, we can note that the reports on shake intensity were generally low until 8th April 8:00AM where the intensity was reported higher. These reports lasted until the early hours of 9th April (perhaps due to power/network delay in sending the reports). However, looking at the change in condition (based on reports made), there could likely be another minor earthquake on 9th April at around 3PM. The reports on shaking slowed down after 9PM on 9th April. While this is not reflected on the response shake map, with more than a couple of neighbourhoods making shake reports, this should likely not be a case of unreliable reporting. |
2 | Next, running through the time series animation of the damage reports on each facility, at particular timeframes, there are certain neighbourhoods with no reports made on all facilities (i.e. null values in the reports). This could also be due to intermittent power outages around the towns (e.g. Neighbourhoods 3 and 5) due to ongoing project works.
On 8 April 2020, 12PM: On 8 April 2020, 2PM: Overtime, various parts of the city had no reports made. This suggests a likelihood that the earthquake could have made an impact on the power across the neighbourhoods. |
Conclusions:
- Apart from an earthquake that was reported on 8th April 2020, there seems to be another minor earthquake that happened on 9th April 2020, based on the slight increase in reports made during the time frame. This however, was not shown on the shake maps provided.
- In general, the damage reports were sent in way before the earthquake (8th) but there became an increase in reports on the 8th. This trend also applies to the reports made on shake intensity. The majority of reports were mostly sent between 8th - 10th April 2020. While most citizens rated a moderate shake intensity (below 8), there were many citizens rating the damage observed above 6.
- Regarding the change in uncertainty overtime, it is still fairly questionable if the reports sent in are 'real-time'. Because during the actual earthquake there might be power failures that lead to a delay in the sending of emergency reports. However, after the earthquake, it is more likely that power have been restored and the reports sent in are real-time.
References
The following references have been useful in the completion of the analysis process:
- Past project reference #1: https://wiki.smu.edu.sg/1617t1IS428g1/IS428_2016-17_Term1_Assign3_Gwendoline_Tan_Wan_Xin
- Past project reference #2: https://wiki.smu.edu.sg/1617t1IS428g1/IS428_2016-17_Term1_Assign3_Tan_Kee_Hock
- Dataset source: https://vast-challenge.github.io/2019/MC1.html
- Creating a time range slider: https://community.tableau.com/thread/213002
- Binning the dates into pre/actual/post-earthquake: https://community.tableau.com/thread/292646
- Pertaining to the data cleaning process (problem 2: Displaying the polygon data on a background image), a classmate has been very helpful to share on the learning discussion board the solution to this issue and has also provided with the 2 CSV files that were derived from the process of growing filled maps. Additionally, he has shared the guide to this process: https://community.tableau.com/thread/116369