IS428 AY2019-20T1 Assign Chew Hui Ling Data Preparation
Data Analysis
Before starting the analysis, I examined the given dataset to identify its format and attributes. Only 1 csv file was provided in the assignment, which limits the amount of information that can be analysed. Hence, I will also be using the shp file from mini challenge 2 and a csv file that I created, called connection, in the transformation process.
The report records the time, the damage taken by the different components, the shake intensity and the location. In its current form, this gives little to work with for data visualization. Hence, to improve the data, I will be making the following changes:
- The components, such as power, medical and buildings, are stored in separate columns, so it is hard to see all the values together. Either you must put 5 graphs together or you have to create a calculated field to add all 5 values together. Instead, pivoting generates 1 column for the component name and 1 column for the component value. With a filter, the user can then view all the components or each one separately.
- The values within each component have no descriptive name attached. The user only knows that 10 is the highest and 0 is the lowest. For a better user experience, each value should be attached to a description, such as 0 = Weak, etc.
- Within the report, locations are identified only by number, with no name. To solve this, I will be using the shp file to link each location number to its name.
Data Preparation
Based on the 3 points mentioned above, I have created this flow within Tableau Prep.
Idea 1
The first idea is to pivot the table. In Tableau Prep, click on the “Plus” icon next to the mc1-reports document and click on “Pivot”. The following screen will be shown. Drag the fields that need to be pivoted into the “Pivoted Fields” column and they will be pivoted automatically. After pivoting, rename the columns within the “Pivoted Fields” column as shown below.
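For readers without Tableau Prep, the same wide-to-long reshaping can be sketched in plain Python. This is only an illustration of what the Pivot step does, not the flow itself. The column names (`power`, `medical`, `buildings`, `component`, `damage_value`) and the sample rows are assumptions made up for the example, not values from the real file.

```python
# Wide-to-long pivot, mirroring the idea behind the Tableau Prep "Pivot" step.
# Column names and sample values are hypothetical.

PIVOT_FIELDS = ["power", "medical", "buildings"]

wide_rows = [
    {"time": "t1", "location": 1, "shake_intensity": 3,
     "power": 7, "medical": None, "buildings": 5},
    {"time": "t2", "location": 4, "shake_intensity": None,
     "power": 2, "medical": 9, "buildings": None},
]


def pivot_longer(rows, pivot_fields, name_col="component", value_col="damage_value"):
    """Turn each pivoted column into its own row with a name and a value."""
    long_rows = []
    for row in rows:
        # Columns that are not pivoted are repeated on every output row.
        kept = {key: value for key, value in row.items() if key not in pivot_fields}
        for field in pivot_fields:
            long_rows.append({**kept, name_col: field, value_col: row[field]})
    return long_rows


long_rows = pivot_longer(wide_rows, PIVOT_FIELDS)

# A filter on the new name column replaces the need for one graph per component.
power_only = [row for row in long_rows if row["component"] == "power"]
```

Each input row becomes one output row per pivoted field, so the 2 sample rows with 3 pivoted fields give 6 rows.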
Idea 2
The second idea is to attach a description to each damage and shake intensity value. To do this, I join each value to the connection csv that I created earlier, as shown below. I use a left join because both columns contain null values, which have no matching description, and an inner join would drop those rows. The nulls make sense because a null value means that there is no damage or shake.
After the left join, these rows have a null description. To handle this, I created a calculated field to replace the damage description produced by the left join. The field first checks whether the description is null: if it is, the description is changed from null to none; otherwise, the joined damage description is kept. This is done for both shake intensity and damage.
After idea 2, the generated csv output looks like this.
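The left join and the null replacement can be sketched in plain Python as follows. This is an illustration only: the lookup table stands in for the connection csv, and apart from 0 = Weak, which is mentioned earlier, its entries and all column names are assumptions.

```python
# Hypothetical stand-in for the connection csv: value -> description.
DESCRIPTIONS = {0: "Weak", 5: "Moderate", 10: "Strong"}


def left_join_description(rows, value_col, lookup, out_col):
    """Left join: every input row is kept; unmatched values get None."""
    return [{**row, out_col: lookup.get(row[value_col])} for row in rows]


def fill_missing(rows, col, default="None"):
    """Stand-in for the calculated field: a null description becomes 'None'."""
    return [{**row, col: default if row[col] is None else row[col]} for row in rows]


rows = [
    {"component": "power", "damage_value": 10},
    {"component": "medical", "damage_value": None},  # null: nothing to match
    {"component": "buildings", "damage_value": 0},
]

joined = left_join_description(rows, "damage_value", DESCRIPTIONS, "damage_description")
labelled = fill_missing(joined, "damage_description")
```

All 3 rows survive the join, which is the reason for choosing a left join over an inner join. The second step then gives the null row a readable label.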
Idea 3
For the third idea, I will be using Tableau itself instead of Tableau Prep to join the shp file to the output created in the previous 2 ideas. I inner join the location number in the output with the id in the shp file to get the location name and geometry, which will be used to create the maps for visualisation.
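The logic of this inner join can also be sketched in plain Python. The field names (`id`, `name`, `geometry`, `location`), the area names and the placeholder geometry strings are all assumptions for illustration. A real shp file would be read with a geospatial library, and its ids are assumed here to be unique.

```python
# Hypothetical attribute rows from the shp file; geometry is a placeholder string.
shapes = [
    {"id": 1, "name": "Area A", "geometry": "<polygon for Area A>"},
    {"id": 2, "name": "Area B", "geometry": "<polygon for Area B>"},
]

# Hypothetical rows from the prepared output of the previous 2 ideas.
reports = [
    {"location": 1, "component": "power", "damage_description": "Strong"},
    {"location": 2, "component": "medical", "damage_description": "None"},
    {"location": 99, "component": "buildings", "damage_description": "Weak"},
]


def inner_join(left, right, left_key, right_key):
    """Inner join: keep only left rows whose key has a match on the right."""
    index = {row[right_key]: row for row in right}  # assumes unique right keys
    return [
        {**left_row, **index[left_row[left_key]]}
        for left_row in left
        if left_row[left_key] in index
    ]


mapped = inner_join(reports, shapes, "location", "id")
```

The report for location 99 has no matching shape, so it is dropped. This is acceptable here because a row without geometry cannot be drawn on the map.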