IS428 AY2019-20T1 Assign Chew Hui Ling Data Preparation

From Visual Analytics for Business Intelligence
Revision as of 22:41, 14 October 2019 by Hlchew.2017 (talk | contribs)

MC1-2019.jpg   VAST Challenge 2019 MC 1: Crowdsourcing for Situational Awareness



Data Analysis

Before starting the analysis, I examined the provided dataset to identify its format and attributes. Only 1 csv file was provided for the assignment, which limits the information available for analysis. Hence, I will also use the shp file from Mini-Challenge 2 and a csv file that I created, called connection, in the transformation process for each dataset.


Mini-challenge 1 Reports

C1-2019.jpg

The report contains the time, the damage reported for each component, the shake intensity and the location. On its own, this provides minimal information for data visualization. Hence, to improve the data, I will make the following changes:

  1. Each component (for example power, medical and buildings) is stored in its own column, so it is hard to see all the values together: you must either place 5 graphs side by side or create a calculated field that adds all 5 values together. Instead, pivoting generates 1 column for the component name and 1 column for the component value. With a filter, the user can then view all the components together or each one separately.
  2. The values within each component have no descriptive name attached. The user only knows that 10 is the highest and 0 is the lowest. For a better user experience, each value should be attached to a description, such as 0 = Weak.
  3. The report identifies each location only by number, with no name. To solve this, I will use the shp file to link each location number to its name.


Data Preparation
Based on the 3 points mentioned above, I created this flow in Tableau Prep.

C2-2019.jpg

Idea 1
The first idea is to pivot the table. In Tableau Prep, click on the “Plus” icon next to the mc1-reports document and click on “Pivot”. The following screen will be shown. Drag the fields that need to be pivoted into the “Pivoted Fields” column. Tableau Prep then pivots those fields automatically. After pivoting, rename the columns within the “Pivoted Fields” column as shown below.

C3-2019.jpg
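
For readers who want to see what the pivot does to the data, the step above can be sketched in plain Python. This is only an illustration under assumptions: the column names, the sample rows and the `pivot_rows` helper are made up for the example and are not taken from the actual mc1-reports file or from Tableau Prep.

```python
import csv
import io

# Hypothetical sample in the wide layout (one column per component).
# Column names and values are assumptions for illustration only.
WIDE_CSV = """time,location,shake_intensity,power,medical,buildings
2020-04-06 00:00:00,1,3,2,,5
2020-04-06 00:05:00,2,,7,4,
"""

# Columns to pivot from wide to long (assumed names).
COMPONENTS = ["power", "medical", "buildings"]


def pivot_rows(rows, value_columns):
    """Turn each wide row into one long row per pivoted column."""
    long_rows = []
    for row in rows:
        # Columns that are not pivoted are carried over unchanged.
        fixed = {k: v for k, v in row.items() if k not in value_columns}
        for name in value_columns:
            long_rows.append(
                {**fixed, "component": name, "damage_value": row[name]}
            )
    return long_rows


wide_rows = list(csv.DictReader(io.StringIO(WIDE_CSV)))
long_rows = pivot_rows(wide_rows, COMPONENTS)

for row in long_rows:
    print(row["time"], row["location"], row["component"], row["damage_value"])
```

Each input row becomes one output row per component, so a filter on the `component` column can show all components or a single one. Empty readings are kept as empty strings in this sketch.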

Idea 2
The second idea is to attach a description to each damage and shake intensity value. To do this, I join each value to the connection csv that I created earlier, as shown below. I use a left join because both columns contain null values, which cannot be matched to a description. A left join keeps these rows, whereas an inner join would drop them. The nulls make sense because I interpret a null value as meaning that there is no damage or shake.

CH4-2019.jpg
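
The effect of the left join can be sketched in plain Python as follows. The lookup labels (apart from 0 = Weak, mentioned earlier), the field names and the `left_join` helper are assumptions for illustration and do not reproduce the actual connection csv.

```python
# Hypothetical stand-in for the connection csv: value -> description.
# Only "0 = Weak" comes from the write-up; the other labels are invented.
DAMAGE_LOOKUP = {"0": "Weak", "5": "Moderate", "10": "Severe"}


def left_join(rows, lookup, key, new_field):
    """Keep every row; attach the lookup description or None if unmatched."""
    joined = []
    for row in rows:
        # dict.get returns None when there is no match, which mirrors
        # the null that a left join leaves on the right-hand side.
        joined.append({**row, new_field: lookup.get(row[key])})
    return joined


reports = [
    {"location": "1", "damage_value": "0"},
    {"location": "2", "damage_value": ""},    # missing reading
    {"location": "3", "damage_value": "10"},
]

joined = left_join(reports, DAMAGE_LOOKUP, "damage_value", "damage_description")

for row in joined:
    print(row)
```

All three rows survive the join, and the row with the missing reading ends up with a null description.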

To handle these nulls, I created a calculated field that replaces the damage description produced by the left join. It first checks whether the value is null. If it is, the description is changed from null to “None”; otherwise, the existing damage description is kept. This is done for both shake intensity and damage.

CH5-2019.jpg
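
The null check described above can be sketched in plain Python. The `fill_description` helper, the field names and the sample rows are hypothetical, and the sketch is not the Tableau Prep calculated field itself.

```python
def fill_description(value, description, default="None"):
    """Return `default` when the source value is null, else keep the description."""
    if value is None or value == "":
        return default
    return description


# Hypothetical rows as they might look after the left join.
rows = [
    {"damage_value": "0", "damage_description": "Weak"},
    {"damage_value": None, "damage_description": None},
]

for row in rows:
    row["damage_description"] = fill_description(
        row["damage_value"], row["damage_description"]
    )

print(rows)
```

The same helper can be applied to the shake intensity description by passing the shake intensity value instead.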

After idea 2, this is the output csv generated by the flow.

CH6-2019.jpg

Idea 3
For the third idea, instead of Tableau Prep, I use Tableau itself to join the shp file to the output created in the previous 2 ideas. I inner join the location number in the output to the id in the shp file to get the location name and geometry, which are used to create the maps for visualisation.
CH8-2019.jpg
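
The logic of this inner join can be sketched in plain Python. Reading a real shp file needs a geospatial library, so the sketch uses a small dictionary as a stand-in for the shapefile attributes. The location names, geometries, field names and the `inner_join` helper are all hypothetical.

```python
# Hypothetical stand-in for the shp attributes: id -> name and geometry.
# Names and WKT geometries are invented for illustration.
LOCATIONS = {
    "1": {"name": "Area A", "geometry": "POLYGON ((0 0, 1 0, 1 1, 0 0))"},
    "2": {"name": "Area B", "geometry": "POLYGON ((1 0, 2 0, 2 1, 1 0))"},
}


def inner_join(rows, lookup, key):
    """Keep only rows whose key exists in the lookup, adding its attributes."""
    joined = []
    for row in rows:
        match = lookup.get(row[key])
        if match is not None:
            joined.append({**row, **match})
    return joined


reports = [
    {"location": "1", "component": "power", "damage_description": "Weak"},
    {"location": "2", "component": "power", "damage_description": "None"},
    {"location": "99", "component": "power", "damage_description": "Weak"},
]

mapped = inner_join(reports, LOCATIONS, "location")

for row in mapped:
    print(row["location"], row["name"], row["damage_description"])
```

Unlike the left join used for the descriptions, an inner join drops any report whose location number has no matching id, so every remaining row has a geometry that can be drawn on the map.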