Difference between revisions of "IS428 AY2019-20T1 Assign Nurul Khairina Binte Abdul Kadir Data"

From Visual Analytics for Business Intelligence
Jump to navigation Jump to search
Line 74: Line 74:
 
|-
 
|-
 
| Steps Taken ||
 
| Steps Taken ||
1. Connect to the data source
+
1. Connect to the data source <br>
2. Add a Clean step to review data
+
2. Add a Clean step to review data <br>
3. Add the Pivot step and drag the columns as mentioned above into Pivot values.
+
3. Add the Pivot step and drag the columns as mentioned above into Pivot values. <br>
 
4. Rename the Pivot names and values. The new pivoted column name/category will be called 'Damage Type' and the pivoted values can be called 'Impact Score'.  
 
4. Rename the Pivot names and values. The new pivoted column name/category will be called 'Damage Type' and the pivoted values can be called 'Impact Score'.  
  

Revision as of 02:18, 9 October 2019


VAST 2019 MC1: Crowdsourcing for Situational Awareness

Introduction

Data Analysis and Transformation

Interactive Visualization

Task Findings

References

 


Data Description

The first step in the transformation process begins with understanding and interpreting the data to determine which data type we currently have and what we need to transform it into.

The data zip file consists of 1 CSV file (mc1-reports-data.csv) spanning the entire length of the event from 6 April 2020, 12 AM to 11 April 2020, 12 AM and 2 shakemap images. The CSV file contains the individual reports (categorical) of shaking/damage by neighborhood over time. Reports are made by citizens at any time through the Rumble mobile app. However, they are only recorded in 5-minute batches/increments due to the server configuration. Furthermore, delays in the receipt of reports may occur during power outages.

Data Attributes of CSV File

Data Attributes Description
Time Timestamp of incoming report/record, in the format YYYY-MM-DD hh:mm:ss
Location ID of neighborhood where person reporting is feeling the shaking and/or seeing the damage
Shake Intensity, Sewer and Water, Power, Roads and Bridges, Medical, Buildings Reported categorical value of how violent the shaking was/how bad the damage was (0 - lowest, 10 - highest; missing data allowed)

Shakemap Images, Shape File and Map Images

Also included are two shakemap (PNG) files which indicate where the corresponding earthquakes' epicenters originate as well as how much shaking can be felt across the city. The StHimarkNeighbourhoodShapefile and map images which are obtained from the dataset of Mini Challenge 2 is used to create the map of the city.

Dataset Analysis & Transformation Process

The following section illustrates the issues faced in the data analysis phase leading to a need to transform the data into a specified format. Tableau Prep will be used to clean and prepare the data for analysis. The CSV file is converted to Excel in order to import it into Tableau Prep.

Data Transformation for MC1-Report-Data

There are 3 main issues with the given dataset and data transformation is needed to reshape the data for easier analysis.

Pivot the Damage Types

Issue Inability to filter by damage type
Solution Pivot the following columns - Shake Intensity, Sewer and Water, Power, Roads and Bridges, Medical and Buildings. Pivoting the data will create more rows for each time and location. This will provide users with the ability to filter the data based on the damage type which is needed for further analysis.
Steps Taken

1. Connect to the data source
2. Add a Clean step to review data
3. Add the Pivot step and drag the columns as mentioned above into Pivot values.
4. Rename the Pivot names and values. The new pivoted column name/category will be called 'Damage Type' and the pivoted values can be called 'Impact Score'.


Example Example


Screenshot:

Dataset Import Structure & Process